Wave Equation Based Stencil Optimizations on a Multi-core CPU

dc.contributor.advisorSymes, William W.en_US
dc.contributor.committeeMemberRiviere, Beatriceen_US
dc.contributor.committeeMemberWarburton, Timothyen_US
dc.creatorZhou, Muhongen_US
dc.date.accessioned2016-02-05T21:30:14Zen_US
dc.date.available2016-02-05T21:30:14Zen_US
dc.date.created2014-12en_US
dc.date.issued2014-11-04en_US
dc.date.submittedDecember 2014en_US
dc.date.updated2016-02-05T21:30:14Zen_US
dc.description.abstractWave propagation stencil kernels are engines of seismic imaging algo- rithms. These kernels are both compute- and memory-intensive. This work targets improving the performance of wave equation based stencil code parallelized by OpenMP on a multi-core CPU. To achieve this goal, we explored two techniques: improving vectorization by using hardware SIMD technology, and reducing memory traffic to mitigate the bottle- neck caused by limited memory bandwidth. We show that with loop interchange, memory alignment, and compiler hints, both icc and gcc compilers can provide fully-vectorized stencil code of any order with per- formance comparable to that of SIMD intrinsic code. To reduce cache misses, we present three methods in the context of OpenMP paralleliza- tion: rearranging loop structure, blocking thread accesses, and temporal loop blocking. Our results demonstrate that fully-vectorized high-order stencil code will be about 2X faster if implemented with either of the first two methods, and fully-vectorized low-order stencil code will be about 1.2X faster if implemented with the combination of the last two methods. Our final best-performing code achieves 20%∼30% of peak GFLOPs/sec, depending on stencil order and compiler.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationZhou, Muhong. "Wave Equation Based Stencil Optimizations on a Multi-core CPU." (2014) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/88412">https://hdl.handle.net/1911/88412</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/88412en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectseismic modelingen_US
dc.subjectacoustic wave propagationen_US
dc.subjecthigh performance computingen_US
dc.subjectSIMDen_US
dc.subjectcache optimizationsen_US
dc.subjectOpenMP parallelizationen_US
dc.subjectstencil optimizationen_US
dc.titleWave Equation Based Stencil Optimizations on a Multi-core CPUen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputational and Applied Mathematicsen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Artsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ZHOU-DOCUMENT-2014.pdf
Size:
3.68 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: