Code Transformations to Improve Memory Parallelism

dc.citation.bibtexNamearticleen_US
dc.citation.journalTitleJournal of Instruction-Level Parallelismen_US
dc.citation.volumeNumber2en_US
dc.contributor.authorPai, Vijay S.en_US
dc.contributor.authorAdve, Sarita V.en_US
dc.contributor.orgCITI (http://citi.rice.edu/)en_US
dc.date.accessioned2007-10-31T00:57:27Z
dc.date.available2007-10-31T00:57:27Z
dc.date.issued2000-05-20
dc.date.modified2002-03-20en_US
dc.date.submitted2002-03-20en_US
dc.descriptionJournal Paperen_US
dc.description.abstractCurrent microprocessors incorporate techniques to exploit instruction-level parallelism (ILP). However, previous work has shown that these ILP techniques are less effective in removing memory stall time than CPU time, making the memory system a greater bottleneck in ILP-based systems than in previous-generation systems. These deficiencies arise largely because applications present limited opportunities for an out-of-order issue processor to overlap multiple read misses, the dominant source of memory stalls. This work proposes code transformations to increase parallelism in the memory system by overlapping multiple read misses within the same instruction window, while preserving cache locality. We present an analysis and transformation framework suitable for compiler implementation. Our simulation experiments show execution time reductions averaging 20% in a multiprocessor and 30% in a uniprocessor. A substantial part of these reductions comes from increases in memory parallelism. We see similar benefits on a Convex Exemplar.en_US
dc.identifier.citationV. S. Pai and S. V. Adve, "Code Transformations to Improve Memory Parallelism," <i>Journal of Instruction-Level Parallelism,</i> vol. 2, 2000.
dc.identifier.urihttps://hdl.handle.net/1911/20186
dc.language.isoeng
dc.subjectcompiler transformations*
dc.subjectout-of-order issue*
dc.subjectmemory parallelism*
dc.subjectlatency tolerance*
dc.subjectunroll-and-jam*
dc.subject.keywordcompiler transformationsen_US
dc.subject.keywordout-of-order issueen_US
dc.subject.keywordmemory parallelismen_US
dc.subject.keywordlatency toleranceen_US
dc.subject.keywordunroll-and-jamen_US
dc.titleCode Transformations to Improve Memory Parallelismen_US
dc.typeJournal article
dc.type.dcmiText
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Pai2000May1CodeTransf.PDF
Size:
163.6 KB
Format:
Adobe Portable Document Format
Collections