TACoS: Transformer and Accelerator Co-Search Towards Ubiquitious Vision Transformer Acceleration

Puckett, Daniel

TACoS: Transformer and Accelerator Co-Search Towards Ubiquitious Vision Transformer Acceleration

dc.contributor.advisor	Lin, Yingyan	en_US
dc.creator	Puckett, Daniel	en_US
dc.date.accessioned	2024-01-22T22:13:30Z	en_US
dc.date.available	2024-01-22T22:13:30Z	en_US
dc.date.created	2023-08	en_US
dc.date.issued	2023-12-06	en_US
dc.date.submitted	August 2023	en_US
dc.date.updated	2024-01-22T22:13:30Z	en_US
dc.description.abstract	Recent works have combined pruned Vision Transformer (ViT) models and specialized accelerators to achieve strong accuracy/latency tradeoffs in many computer vision tasks. However, it takes a significant amount of expert labor to adapt these systems to real-world scenarios with specific accuracy, latency, power, and/or area constraints. Automating the design and exploration of these systems is a promising solution but is hampered by two unsolved problems: 1) Existing methods of pruning the attention maps of a ViT model involve fully training the model, pruning its attention maps, then fine-tuning the model. This is infeasible when exploring a design space containing millions of model architectures. 2) The design space is complicated and the system’s area efficiency, scalability, and data movement are hurt because we lack a unified accelerator template that efficiently computes each operation in sparse ViT models. To solve these problems, I propose TACoS: Transformer and Accelerator Co-Search, the first automated method to co-design pruned ViT model and accelerator pairs. TACoS answers the above challenges using 1) a novel ViT search algorithm that simultaneously prunes and fine-tunes many models at many different sparsity ratios, and 2) the first unified ViT accelerator template, which efficiently accelerates each operation in sparse ViT models using adaptable PEs and reconfigurable PE lanes. With these innovations, the TACoS framework quickly and automatically designs state-of-the-art systems for real-world applications and achieves accuracy/latency tradeoffs superior to hand-crafted ViT models and accelerators.	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Puckett, Daniel. "TACoS: Transformer and Accelerator Co-Search Towards Ubiquitious Vision Transformer Acceleration." (2023) Master's thesis, Rice University. https://hdl.handle.net/1911/115349	en_US
dc.identifier.uri	https://hdl.handle.net/1911/115349	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	Vision Transformer	en_US
dc.subject	hardware accelerator	en_US
dc.subject	machine learning	en_US
dc.subject	pruning	en_US
dc.subject	sparsity	en_US
dc.title	TACoS: Transformer and Accelerator Co-Search Towards Ubiquitious Vision Transformer Acceleration	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Electrical and Computer Engineering	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: PUCKETT-DOCUMENT-2023.pdf
Size:: 1.62 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 5.84 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 2.98 KB
Format:: Plain Text
Description:

Download

Collections

Rice University Theses and Dissertations