Ridge Regularization by Randomization in Linear Ensembles

dc.contributor.advisorBaraniuk, Richard Gen_US
dc.creatorLeJeune, Danielen_US
dc.date.accessioned2023-01-03T21:20:27Zen_US
dc.date.available2023-01-03T21:20:27Zen_US
dc.date.created2022-12en_US
dc.date.issued2022-11-21en_US
dc.date.submittedDecember 2022en_US
dc.date.updated2023-01-03T21:20:27Zen_US
dc.description.abstractEnsemble methods that average over a collection of independent predictors that are each limited to random sampling of both the examples and features of the training data command a significant presence in machine learning, such as the ever-popular random forest. Combining many such randomized predictors into an ensemble produces a highly robust predictor with excellent generalization properties; however, understanding the specific nature of the effect of randomization on ensemble method behavior has received little theoretical attention. We study the case of an ensembles of linear predictors, where each individual predictor is a linear predictor fit on a randomized sample of the data matrix. We first show a straightforward argument that an ensemble of ordinary least squares predictors fit on a simple subsampling can achieve the optimal ridge regression risk in a standard Gaussian data setting. We then significantly generalize this result to eliminate essentially all assumptions on the data by considering ensembles of linear random projections or sketches of the data, and in doing so reveal an asymptotic first-order equivalence between linear regression on sketched data and ridge regression. By extending this analysis to a second-order characterization, we show how large ensembles converge to ridge regression under quadratic metrics.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationLeJeune, Daniel. "Ridge Regularization by Randomization in Linear Ensembles." (2022) Diss., Rice University. <a href="https://hdl.handle.net/1911/114188">https://hdl.handle.net/1911/114188</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/114188en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectensemblesen_US
dc.subjectridge regressionen_US
dc.subjectsketchingen_US
dc.subjectrandom projectionsen_US
dc.subjectproportional asymptoticsen_US
dc.subjectrandom matrix theoryen_US
dc.titleRidge Regularization by Randomization in Linear Ensemblesen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentElectrical and Computer Engineeringen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LEJEUNE-DOCUMENT-2022.pdf
Size:
3.13 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: