Jermaine, Christopher2020-11-032020-11-032020-052020-10-30May 2020Luo, Shangyu. "Automatic Matrix Format Exploration for Large Scale Linear Algebra." (2020) Diss., Rice University. <a href="https://hdl.handle.net/1911/109492">https://hdl.handle.net/1911/109492</a>.https://hdl.handle.net/1911/109492The input of a linear algebra (LA) operation, such as matrices and vectors, could be stored in multiple ways: rows/columns, strips, blocks, etc. Usually, it is very difficult for a programmer to figure out the proper format to use to make a LA computation run fast. Predicting and optimizing the runtime behavior of a LA computation is not an easy task, even when one has expert knowledge of the underlying execution engine. The situation is particularly difficult if the computation consists of thousands of operations, and those operations must be run in a distributed manner. In this paper, we argue that we can render a parallel relational database to automatically explore the formats of LA computations. More specifically, our system would take in the existing code and analyze the operations in the code, explore different formats for those operations and select the most efficient formats, and finally automatically generate the new code to run those operations in their selected formats. We show that our implementation is able to find the formats that have a better performance than the formats that are manually picked up by an expert user of the system.application/pdfengCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.Distributed Database SystemsLarge-scale Linear AlgebraMachine LearningAutomatic Matrix Format Exploration for Large Scale Linear AlgebraThesis2020-11-03