Memorization in Generative Networks and Mixtures of GANs

Luzi, Lorenzo

Memorization in Generative Networks and Mixtures of GANs

dc.contributor.advisor	Baraniuk, Richard	en_US
dc.creator	Luzi, Lorenzo	en_US
dc.date.accessioned	2020-04-23T16:22:41Z	en_US
dc.date.available	2021-05-01T05:01:11Z	en_US
dc.date.created	2020-05	en_US
dc.date.issued	2020-04-22	en_US
dc.date.submitted	May 2020	en_US
dc.date.updated	2020-04-23T16:22:41Z	en_US
dc.description.abstract	We demonstrate that memorization (perfectly fitting the training data) is necessary to avoid mode collapse in generative networks. Using a straightforward measure of the distance between the training data points and the closest point in the range of the generator, we study how well current generative models memorize in terms of the training dataset size, data distribution, and generator architecture. An important hallmark of our GOoF measure is that it does not require a second, trained model as with Frechet Inception Distance or Inception Score. The GOoF measure quantifies that the successful, popular generative models DCGAN, WGAN, and BigGAN fall far short of memorization. Our analysis inspires a new method to circumvent mode collapse by subsampling the training data (either randomly or with $k$-means clustering); we discuss the links to overparameterization. Mixtures of generative adversarial networks (GANs) are closely related to subsampling methods. We study these mixtures in the context of memorization and density estimation to show that mixtures of GANs are superior to training a single GAN under certain assumptions. Furthermore, we construct a theoretic framework that explains how single GANs, mixtures of GANs, conditional GANs, and Gaussian mixture GANs are all related to each other by modifying the typical GAN optimization problem. Finally, we show empirically that our modified optimization problem has a memorization sweet spot which can be found with hyperparameter tuning.	en_US
dc.embargo.terms	2021-05-01	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Luzi, Lorenzo. "Memorization in Generative Networks and Mixtures of GANs." (2020) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/108344">https://hdl.handle.net/1911/108344</a>.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/108344	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	GANs	en_US
dc.subject	memorization	en_US
dc.subject	mixtures	en_US
dc.title	Memorization in Generative Networks and Mixtures of GANs	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Electrical and Computer Engineering	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: LUZI-DOCUMENT-2020.pdf
Size:: 4.14 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 5.84 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 2.61 KB
Format:: Plain Text
Description:

Download

Collections

Rice University Theses and Dissertations