Overparameterization and double descent in PCA, GANs, and Diffusion models

dc.contributor.advisorBaraniuk, Richard Gen_US
dc.creatorLuzi, Lorenzoen_US
dc.date.accessioned2024-05-22T17:34:20Zen_US
dc.date.available2024-05-22T17:34:20Zen_US
dc.date.created2024-05en_US
dc.date.issued2024-04-19en_US
dc.date.submittedMay 2024en_US
dc.date.updated2024-05-22T17:34:20Zen_US
dc.description.abstractThis PhD thesis constitutes a synthesis of my doctoral work, which addresses various aspects of study related to generative modeling with a particular focus on overparameterization. Using a novel method we call pseudo-supervision, we investigate approaches toward characterization of overparameterization behaviors, including double descent, of GANs as well as PCA-like problems. Extending pseudo-supervision to diffusion models, we see that it can be used to create an inductive bias; we demonstrate that this allows us to train our model with lower generalization error and faster convergence time compared to the baseline. I additionally introduce a novel method called Boomerang to extend our study of diffusion models, showing that they can be used for local sampling in image manifolds. Finally, in an approach we titled WaM, I extend FID to include non-Gaussian distributions by using a Gaussian mixture model and a bound on the 2-Wasserstein metric for Gaussian mixture models to define a metric on non-Gaussian features.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationLuzi, Lorenzo. Overparameterization and double descent in PCA, GANs, and Diffusion models. (2024). PhD diss., Rice University. https://hdl.handle.net/1911/116219en_US
dc.identifier.urihttps://hdl.handle.net/1911/116219en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectoverparameterizationen_US
dc.subjectgenerative modelsen_US
dc.subjectgansen_US
dc.subjectdiffusion modelsen_US
dc.subjectdouble descenten_US
dc.titleOverparameterization and double descent in PCA, GANs, and Diffusion modelsen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentElectrical and Computer Engineeringen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LUZI-DOCUMENT-2024.pdf
Size:
7.69 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.98 KB
Format:
Plain Text
Description: