LoFT: Finding Lottery Tickets through Filter-wise Training

Wang, Qihan

dc.contributor.advisor	Kyrillidis, Anastasios	en_US
dc.creator	Wang, Qihan	en_US
dc.date.accessioned	2022-09-23T21:46:58Z	en_US
dc.date.available	2022-11-01T05:01:14Z	en_US
dc.date.created	2022-05	en_US
dc.date.issued	2022-05-04	en_US
dc.date.submitted	May 2022	en_US
dc.date.updated	2022-09-23T21:46:58Z	en_US
dc.description.abstract	Recent work on pruning techniques and the Lottery Ticket Hypothesis (LTH) shows that there exist “winning tickets” in large neural networks. These tickets are subnetworks of the full model that can be trained separately to achieve accuracy comparable to that of the full model. In practice, however, finding these tickets can be burdensome, especially as the original neural network grows larger: one often has to pretrain the large model for a number of epochs. In this paper, we explore how to empirically identify when such winning tickets emerge, and we use this heuristic to design efficient pretraining algorithms. Our focus in this work is on convolutional neural networks (CNNs). To identify good filters within winning tickets, we propose a novel filter distance metric that represents model convergence well, without needing to know the true winning ticket or to train the model in full. Our filter analysis behaves consistently with recent findings on neural network learning dynamics. Motivated by this metric, we present the LOttery ticket through Filter-wise Training algorithm, dubbed LoFT. LoFT is a model-parallel pretraining algorithm that partitions the convolutional layers of a CNN by filters and trains the partitions independently on different distributed workers, reducing memory and communication costs during pretraining. Experiments show that LoFT achieves non-trivial savings in communication while maintaining comparable or even better accuracy than other model-parallel training methods.	en_US
dc.embargo.terms	2022-11-01	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Wang, Qihan. "LoFT: Finding Lottery Tickets through Filter-wise Training." (2022) Master’s Thesis, Rice University. https://hdl.handle.net/1911/113345.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/113345	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	Machine Learning	en_US
dc.subject	Lottery Ticket Hypothesis	en_US
dc.title	LoFT: Finding Lottery Tickets through Filter-wise Training	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Computer Science	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Name: WANG-DOCUMENT-2022.pdf
Size: 17.34 MB
Format: Adobe Portable Document Format

License bundle

Name: PROQUEST_LICENSE.txt
Size: 5.84 KB
Format: Plain Text

Name: LICENSE.txt
Size: 2.6 KB
Format: Plain Text

Collections

Rice University Theses and Dissertations