Implementing Asynchronous Checkpoint/Restart for the Concurrent Collections Model

dc.contributor.advisorSarkar, Vivek
dc.contributor.committeeMemberMellor-Crummey, John
dc.contributor.committeeMemberChaudhuri, Swarat
dc.creatorVrvilo, Nick
dc.date.accessioned2016-01-27T22:06:14Z
dc.date.available2016-01-27T22:06:14Z
dc.date.created2014-05
dc.date.issued2014-08-12
dc.date.submittedMay 2014
dc.date.updated2016-01-27T22:06:14Z
dc.description.abstractIt has been claimed that what simplifies parallelism can also simplify resilience. Based on that assertion, we present the Concurrent Collections programming model (CnC) as an ideal target for a simple yet powerful resilience system for parallel computations. Specifically, we claim that the same attributes that simplify reasoning about parallel applications written in CnC will similarly simplify the implementation of a checkpoint/restart system within the CnC runtime. We define these properties of CnC in the context of a model built in K. To demonstrate how these simplifying properties of CnC help to simplify resilience, we have implemented a simple checkpoint/restart system within Rice’s Habanero C implementation of the CnC runtime. We show how the CnC runtime can fully encapsulate the checkpointing and restarting processes, allowing application programmers to gain all the benefits of resilience without any added effort beyond implementing the application in CnC, while avoiding the synchronization overheads present in traditional techniques.
dc.format.mimetypeapplication/pdf
dc.identifier.citationVrvilo, Nick. "Implementing Asynchronous Checkpoint/Restart for the Concurrent Collections Model." (2014) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/88191">https://hdl.handle.net/1911/88191</a>.
dc.identifier.urihttps://hdl.handle.net/1911/88191
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectConcurrent Collections
dc.subjectResilience
dc.subjectCheckpoint/Restart
dc.titleImplementing Asynchronous Checkpoint/Restart for the Concurrent Collections Model
dc.typeThesis
dc.type.materialText
thesis.degree.departmentComputer Science
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
VRVILO-THESIS-2014.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
939 B
Format:
Plain Text
Description: