Oblivious yet High Performance Task Scheduling for Large Shared Clusters

dc.contributor.advisorNg, T. S. Eugene
dc.creatorWu, Dingming
dc.date.accessioned2019-05-17T13:21:24Z
dc.date.available2019-05-17T13:21:24Z
dc.date.created2018-05
dc.date.issued2018-06-11
dc.date.submittedMay 2018
dc.date.updated2019-05-17T13:21:24Z
dc.description.abstractData analytics in large scale clusters are gradually shifting from monolithic and centralized scheduling frameworks to distributed or hybrid scheduling frameworks. In these distributed or hybrid frameworks, task queues on workers have widely been adopted to reconcile the conflict of task placements by different cluster schedulers. While a lot of task scheduling policies are available for each worker, the impact of each policy on the task performance and the ultimate job performance is not well understood. Consequently, the choice of scheduling policy for task is usually quite \textit{ad hoc}, especially when the task runtime information is not available beforehand. This thesis explores the task queuing effect by examining and comparing different scheduling policies for workers. We present the design and implementation of a worker-level task scheduler, Runway, that is oblivious to the individual task runtime information while still provides high performance and fairness. We demonstrate Runway's effectiveness in reducing average task completion time while guaranteeing starvation-freedom through extensive evaluations. Results show that Runway can provide 5$\times$ task performance improvement, and 42\% job performance improvement under high load compared to the state-of-art solution.
dc.format.mimetypeapplication/pdf
dc.identifier.citationWu, Dingming. "Oblivious yet High Performance Task Scheduling for Large Shared Clusters." (2018) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/105585">https://hdl.handle.net/1911/105585</a>.
dc.identifier.urihttps://hdl.handle.net/1911/105585
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectCluster Scheduling
dc.subjectTask Scheduling
dc.subjectBig data frameworks
dc.subjectNon-clairvoyant Scheduling
dc.titleOblivious yet High Performance Task Scheduling for Large Shared Clusters
dc.typeThesis
dc.type.materialText
thesis.degree.departmentComputer Science
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
WU-DOCUMENT-2018.pdf
Size:
1.83 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: