Synchronization, coherence, and consistency for high performance shared memory multiprocessing

dc.contributor.advisorJump, J. Robert
dc.contributor.advisorSinclair, James B.
dc.creatorDwarkadas, Sandhya
dc.date.accessioned2009-06-03T23:55:38Z
dc.date.available2009-06-03T23:55:38Z
dc.date.issued1993
dc.description.abstractAlthough improved device technology has increased the performance of computer systems, fundamental hardware limitations and the need to build faster systems using existing technology have led many computer system designers to consider parallel designs with multiple computing elements. Unfortunately, the design of efficient and scalable multiprocessors has proven to be an elusive goal. This dissertation describes a hierarchical bus-based multiprocessor architecture, an adaptive cache coherence protocol, and efficient and simple synchronization support that together meet this challenge. We have also developed an execution-driven tool for the simulation of shared-memory multiprocessors, which we use to evaluate the proposed architectural enhancements. Our simulator offers substantial advantages in terms of reduced time and space overheads when compared to instruction-driven or trace-driven simulation techniques, without significant loss of accuracy. The simulator generates correctly interleaved parallel traces at run time, allowing the accurate simulation of a variety of architectural alternatives for a number of programs. Our results provide a quantitative analysis of the viability of large-scale bus-based memory hierarchies. We evaluate the effect on performance of several architectural enhancements, and discuss the tradeoffs between reducing contention and increasing latency as the number of levels in the memory hierarchy are increased. Toward this end, we have developed a cache coherence protocol for a hierarchical bus-based architecture that minimizes total communication overhead by utilizing all available (bus-provided) information. Based on our evaluation, we propose an integrated set of architectural design decisions. These include synchronization using a conditional test&set operation that eliminates excess bus traffic and contention, conditional access scheduling, where bus traffic is reduced by keeping track of pending bus accesses for every cache line, adaptive caching, where each cache line is assigned a coherence protocol based upon the expected or observed access behavior for that line, and the use of relaxed memory consistency models, where writes are aggressively buffered. We also present a new classification of memory consistency models that, in addition to unifying all existing models into a common framework, provides insight into the implications of these models with respect to access ordering.
dc.format.extent149 p.en_US
dc.format.mimetypeapplication/pdf
dc.identifier.callnoThesis E.E. 1993 Dwarkadas
dc.identifier.citationDwarkadas, Sandhya. "Synchronization, coherence, and consistency for high performance shared memory multiprocessing." (1993) Diss., Rice University. <a href="https://hdl.handle.net/1911/16618">https://hdl.handle.net/1911/16618</a>.
dc.identifier.urihttps://hdl.handle.net/1911/16618
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectComputer science
dc.subjectElectronics
dc.subjectElectrical engineering
dc.titleSynchronization, coherence, and consistency for high performance shared memory multiprocessing
dc.typeThesis
dc.type.materialText
thesis.degree.departmentElectrical Engineering
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
9408615.PDF
Size:
5.04 MB
Format:
Adobe Portable Document Format