LoFT: Finding Lottery Tickets through Filter-wise Training

dc.contributor.advisorKyrillidis, Anastasiosen_US
dc.creatorWang, Qihanen_US
dc.date.accessioned2022-09-23T21:46:58Zen_US
dc.date.available2022-11-01T05:01:14Zen_US
dc.date.created2022-05en_US
dc.date.issued2022-05-04en_US
dc.date.submittedMay 2022en_US
dc.date.updated2022-09-23T21:46:58Zen_US
dc.description.abstractRecent work on pruning techniques and the Lottery Ticket Hypothesis (LTH) shows that there exist “winning tickets” in large neural networks. These tickets represent versions of the full model that can be trained separately to achieve comparable accuracy with respect to the full models. However, in practice the process of finding these tickets can be a burdensome task, especially when the original neural network gets larger: Often one has to pretrain the large model for at least a number of epochs. In this paper, we explore how we can empirically identify when such winning tickets emerge, and use this heuristic to design efficient pretraining algorithms. Our focus in this work is on convolutional neural networks (CNNs): To identify good filters within winning tickets, we propose a novel filter distance metric that well-represents the model convergence, without the need to know the true winning ticket or training the model in full. Our filter analysis behaves consistently with recent findings of neural network learning dynamics. Motivated by this metric, we present the LOttery ticket through Filter-wise Training algorithm, dubbed as LoFT. LoFT is a model-parallel pretraining algorithm that partitions convolutional layers in CNNs by filters to train them independently on different distributed workers, leading to reduced memory and communication costs during pretraining. Experiments show that LoFT achieves non-trivial savings in communication, while maintaining comparable or even better accuracy compared to other model-parallel training methods.en_US
dc.embargo.terms2022-11-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationWang, Qihan. "LoFT: Finding Lottery Tickets through Filter-wise Training." (2022) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/113345">https://hdl.handle.net/1911/113345</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/113345en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectMachine Learningen_US
dc.subjectLottery Ticket Hypothesisen_US
dc.titleLoFT: Finding Lottery Tickets through Filter-wise Trainingen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
WANG-DOCUMENT-2022.pdf
Size:
17.34 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: