Privacy-Preserving Machine Learning: The Role of Overparameterization and Solutions in Computational Imaging

dc.contributor.advisorBaraniuk, Richard G.en_US
dc.creatorTan, Jasperen_US
dc.date.accessioned2023-01-03T22:40:44Zen_US
dc.date.available2023-01-03T22:40:44Zen_US
dc.date.created2022-12en_US
dc.date.issued2022-12-02en_US
dc.date.submittedDecember 2022en_US
dc.date.updated2023-01-03T22:40:44Zen_US
dc.description.abstractWhile the accelerating deployment of machine learning (ML) models brings benefits to various aspects of human life, it also opens the door to serious privacy risks. In particular, it is sometimes possible to reverse engineer a given model to extract information about the data on which it was trained. Such leakage is especially dangerous if the model's training data contains sensitive information, such as medical records, personal media, or consumer behavior. This thesis is concerned with two big questions around this privacy issue: (1) "what makes ML models vulnerable to privacy attacks?" and (2) "how do we preserve privacy in ML applications?". For question (1), I present detailed analysis on the effect increased overparameterization has on a model's vulnerability to the membership inference (MI) privacy attack, the task of identifying whether a given point is included in the model's training dataset or not. I theoretically and empirically show multiple settings wherein increased overparameterization leads to increased vulnerability to MI even while improving generalization performance. However, I then show that incorporating proper regularization while increasing overparameterization can eliminate this effect and can actually increase privacy while preserving generalization performance, yielding a ``blessing of dimensionality'' for privacy through regularization. For question (2), I present results on the privacy-preserving techniques of synthetic training data simulation and privacy-preserving sensing, both in the domain of computational imaging. I first present a training data simulator for accurate ML-based depth of field (DoF) extension for time-of-flight (ToF) imagers, resulting in a 3.6x increase in a conventional ToF camera's DoF when used with a deblurring neural network. This simulator allows ML to be used without the need for potentially private real training data. Second, I propose a design for a sensor whose measurements obfuscate person identities while still allowing person detection to be performed. Ultimately, it is my hope that these findings and results take the community one step closer towards the responsible deployment of ML models without putting sensitive user data at risk.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationTan, Jasper. "Privacy-Preserving Machine Learning: The Role of Overparameterization and Solutions in Computational Imaging." (2022) Diss., Rice University. <a href="https://hdl.handle.net/1911/114196">https://hdl.handle.net/1911/114196</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/114196en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectprivacyen_US
dc.subjectmembership inferenceen_US
dc.subjectcomputational imagingen_US
dc.subjectoverparameterizationen_US
dc.subjectmachine learningen_US
dc.subjectdeep learningen_US
dc.subjecttime-of-flighten_US
dc.titlePrivacy-Preserving Machine Learning: The Role of Overparameterization and Solutions in Computational Imagingen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentElectrical and Computer Engineeringen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TAN-DOCUMENT-2022.pdf
Size:
113 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: