Runtime Systems for Extreme Scale Platforms

Chatterjee, Sanjay

Runtime Systems for Extreme Scale Platforms

dc.contributor.advisor	Sarkar, Vivek	en_US
dc.contributor.committeeMember	Mellor-Crummey, John	en_US
dc.contributor.committeeMember	Zhong, Lin	en_US
dc.contributor.committeeMember	Budimlic, Zoran	en_US
dc.creator	Chatterjee, Sanjay	en_US
dc.date.accessioned	2014-07-11T19:13:54Z	en_US
dc.date.available	2014-07-11T19:13:54Z	en_US
dc.date.created	2013-12	en_US
dc.date.issued	2013-12-06	en_US
dc.date.submitted	December 2013	en_US
dc.date.updated	2014-07-11T19:13:56Z	en_US
dc.description.abstract	Future extreme-scale systems are expected to contain homogeneous and heterogeneous many-core processors, with O(10^3) cores per node and O(10^6) nodes overall. Effective combination of inter node and intra-node parallelism is recognized to be a major software challenge for such systems. Further, applications will have to deal with constrained energy budgets as well as frequent faults and failures. To aid programmers manage these complexities and enhance programmability, much of recent research has focused on designing state-of-art software runtime systems. Such runtime systems are expected to be a critical component of the software ecosystem for the management of parallelism, locality, load balancing, energy and resilience on extreme-scale systems. In this dissertation, we address three key challenges faced by a runtime system using a dynamic task parallel framework for extreme-scale computing. First, we address the challenge of integrating an intra-node task parallel runtime with a communication system for scalable performance. We present a runtime communication system, called HC-COMM, designed to use dedicated communication cores on a system. We introduce the HCMPI programming model which integrates the Habanero-C asynchronous dynamic task parallel language with the MPI message passing communication model on the HC-COMM runtime. We also introduce the HAPGNS model that enables dataflow programming for extreme-scale systems in which the user does not require knowledge of MPI. Second, we address the challenge of separating locality optimizations from a programmer with domain specific knowledge. We present a tuning framework, through which performance experts can optimize existing applications by specifying runtime operations aimed at co-scheduling of affinitized tasks. Finally, we address the challenge of scalable synchronization for long running tasks on a dynamic task parallel runtime. We use the phaser construct to present a generalized tree-based synchronization algorithm and support unified collective operations at both inter-node and intra-node levels. Overcoming these runtime challenges are a first step towards effective programming on extreme-scale systems.	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.citation	Chatterjee, Sanjay. "Runtime Systems for Extreme Scale Platforms." (2013) Diss., Rice University. <a href="https://hdl.handle.net/1911/76173">https://hdl.handle.net/1911/76173</a>.	en_US
dc.identifier.uri	https://hdl.handle.net/1911/76173	en_US
dc.language.iso	eng	en_US
dc.rights	Copyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.	en_US
dc.subject	Task parallelism	en_US
dc.subject	Data-flow	en_US
dc.subject	Runtime	en_US
dc.subject	Extreme-scale	en_US
dc.subject	Exascale	en_US
dc.subject	Communications	en_US
dc.subject	Locality	en_US
dc.subject	Tuning	en_US
dc.subject	Phaser	en_US
dc.subject	Synchronization	en_US
dc.title	Runtime Systems for Extreme Scale Platforms	en_US
dc.type	Thesis	en_US
dc.type.material	Text	en_US
thesis.degree.department	Computer Science	en_US
thesis.degree.discipline	Engineering	en_US
thesis.degree.grantor	Rice University	en_US
thesis.degree.level	Doctoral	en_US
thesis.degree.name	Doctor of Philosophy	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: PhDThesis_SanjayChatterjee_Signed.pdf
Size:: 3.21 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 951 B
Format:: Plain Text
Description:

Download

Collections

Rice University Theses and Dissertations