Leaky Buffer: A Novel Abstraction for Relieving Memory Pressure from Cluster Data Processing Frameworks

dc.contributor.advisorNg, T. S. Eugeneen_US
dc.contributor.committeeMemberCox, Alan Len_US
dc.contributor.committeeMemberJermaine, Christopher Men_US
dc.creatorLiu, Zhaoleien_US
dc.date.accessioned2016-01-25T16:47:38Zen_US
dc.date.available2016-01-25T16:47:38Zen_US
dc.date.created2015-12en_US
dc.date.issued2015-07-13en_US
dc.date.submittedDecember 2015en_US
dc.date.updated2016-01-25T16:47:38Zen_US
dc.description.abstractThe shift to the in-memory data processing paradigm has had a major influence on the development of cluster data processing frameworks. Numerous frameworks from the industry, open source community and academia are adopting the in-memory paradigm to achieve functionalities and performance breakthroughs. However, despite the advantages of these in-memory frameworks, in practice they are susceptible to memory-pressure related performance collapse and failures. The contributions of this thesis are two-fold. Firstly, we conduct a detailed diagnosis of the memory pressure problem and identify three preconditions for the performance collapse. These preconditions not only explain the problem but also shed light on the possible solution strategies. Secondly, we propose a novel programming abstraction called the leaky buffer that eliminates one of the preconditions, thereby addressing the underlying problem. We have implemented the leaky buffer abstraction in Spark for two distinct use cases. Experiments on a range of memory intensive aggregation operations show that the leaky buffer abstraction can drastically reduce the occurrence of memory-related failures, improve performance by up to 507% and reduce memory usage by up to 87.5%.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationLiu, Zhaolei. "Leaky Buffer: A Novel Abstraction for Relieving Memory Pressure from Cluster Data Processing Frameworks." (2015) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/88106">https://hdl.handle.net/1911/88106</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/88106en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectbig dataen_US
dc.subjectJVMen_US
dc.subjectmemoryen_US
dc.subjectSparken_US
dc.titleLeaky Buffer: A Novel Abstraction for Relieving Memory Pressure from Cluster Data Processing Frameworksen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LIU-DOCUMENT-2015.pdf
Size:
1.47 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: