A Resource-Aware Streaming-based Framework for Big Data Analysis

dc.contributor.advisorKoushanfar, Farinazen_US
dc.contributor.committeeMemberAazhang, Behnaamen_US
dc.contributor.committeeMemberBaraniuk, Richarden_US
dc.creatorDarvish Rouhani, Bitaen_US
dc.date.accessioned2016-01-07T17:30:40Zen_US
dc.date.available2016-06-01T05:01:05Zen_US
dc.date.created2015-12en_US
dc.date.issued2015-12-02en_US
dc.date.submittedDecember 2015en_US
dc.date.updated2016-01-07T17:30:40Zen_US
dc.description.abstractThe ever growing body of digital data is challenging conventional analytical techniques in machine learning, computer vision, and signal processing. Traditional analytical methods have been mainly developed based on the assumption that designers can work with data within the confines of their own computing environment. The growth of big data, however, is changing that paradigm especially in scenarios where severe memory and computational resource constraints exist. This thesis aims at addressing major challenges in big data learning problem by devising a new customizable computing framework that holistically takes into account the data structure and underlying platform constraints. It targets a widely used class of analytical algorithms that model the data dependencies by iteratively updating a set of matrix parameters, including but not limited to most regression methods, expectation maximization, and stochastic optimizations, as well as the emerging deep learning techniques. The key to our approach is a customizable, streaming-based data projection methodology that adaptively transforms data into a new lower-dimensional embedding by simultaneously considering both data and hardware characteristics. It enables scalable data analysis and rapid prototyping of an arbitrary matrix-based learning task using a sparse-approximation of the collection that is constantly updated inline with the data arrival. Our work is supported by a set of user-friendly Application Programming Interfaces (APIs) that ensure automated adaptation of the proposed framework to various datasets and System on Chip (SoC) platforms including CPUs, GPUs, and FPGAs. Proof of concept evaluations using a variety of large contemporary datasets corroborate the practicability and scalability of our approach in resource-limited settings. For instance, our results demonstrate 50-fold improvement over the best known prior-art in terms of memory, energy, power, and runtime for training and execution of deep learning models in deployment of different sensing applications including indoor localization and speech recognition on constrained embedded platforms used in today's IoT enabled devices such as autonomous vehicles, robots, and smartphone.en_US
dc.embargo.terms2016-06-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationDarvish Rouhani, Bita. "A Resource-Aware Streaming-based Framework for Big Data Analysis." (2015) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/87764">https://hdl.handle.net/1911/87764</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/87764en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectStreaming modelen_US
dc.subjectBig dataen_US
dc.subjectDense matrixen_US
dc.subjectLow-rank approximationen_US
dc.subjectHW/SW co-designen_US
dc.subjectDeep Learningen_US
dc.subjectScalable machine learningen_US
dc.titleA Resource-Aware Streaming-based Framework for Big Data Analysisen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentElectrical and Computer Engineeringen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DARVISHROUHANI-DOCUMENT-2015.pdf
Size:
14.28 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.85 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: