The Deep Rendering Model: Bridging Theory and Practice in Deep Learning

Nguyen, Minh Tan

The Deep Rendering Model: Bridging Theory and Practice in Deep Learning

dc.contributor.advisor	Baraniuk, Richard G	en_US
dc.creator	Nguyen, Minh Tan	en_US
dc.date.accessioned	2019-05-17T15:41:25Z	en_US
dc.date.available	2019-05-17T15:41:25Z	en_US
dc.date.created	2018-08	en_US
dc.date.issued	2018-10-10	en_US
dc.date.submitted	August 2018	en_US
dc.date.updated	2019-05-17T15:41:25Z	en_US
dc.description.abstract	A grand challenge in machine learning is the development of computational algorithms that match or outperform humans in perceptual inference tasks such as visual object and speech recognition. The key factor complicating such tasks is the presence of numerous nuisance variables, for instance, the unknown object position, orientation, and scale in object recognition or the unknown voice pronunciation, pitch, and speed in speech recognition. Recently, a new breed of deep learning algorithms has emerged for high-nuisance inference tasks; they are constructed from many layers of alternating linear and nonlinear processing units and are trained using large-scale algorithms and massive amounts of training data. The recent success of deep learning systems is impressive— they now routinely yield pattern recognition systems with near or super-human capabilities — but a fundamental question remains: Why do they work? Intuitions abound, but a coherent framework for understanding, analyzing, and synthesizing deep learning architectures has remained elusive. We answer this question by developing a new probabilistic framework for deep Learning, namely the Deep Rendering Model (DRM), based on a Bayesian generative probabilistic model that explicitly captures variation due to nuisance variables. The graphical structure of the model enables it to be learned from data using classical expectation-maximization techniques. Furthermore, by relaxing the generative model to a discriminative one, we can recover deep convolutional neural networks (DCNs) as well as its variants including the deep residual networks (ResNet) and the densely connected convolutional networks (DenseNet), providing insights into their successes and shortcomings as well as a principled route to their improvement. The DRMM is also applicable to semi-supervised and unsupervised learning tasks, achieving results that are state-of-the-art in several categories on the MNIST benchmark and comparable to state of the art on the CIFAR10 benchmark.	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Nguyen, Minh Tan. "The Deep Rendering Model: Bridging Theory and Practice in Deep Learning." (2018) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/105801">https://hdl.handle.net/1911/105801</a>.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/105801	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	deep learning	en_US
dc.subject	deep convolutional network	en_US
dc.subject	generative model	en_US
dc.subject	semi-supervised learning	en_US
dc.title	The Deep Rendering Model: Bridging Theory and Practice in Deep Learning	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Electrical and Computer Engineering	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: NGUYEN-DOCUMENT-2018.pdf
Size:: 8.68 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 5.84 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 2.6 KB
Format:: Plain Text
Description:

Download

Collections

Rice University Theses and Dissertations