Wave Equation Based Stencil Optimizations on a Multi-core CPU

dc.contributor.advisorSymes, William W.
dc.contributor.committeeMemberRiviere, Beatrice
dc.contributor.committeeMemberWarburton, Timothy
dc.creatorZhou, Muhong
dc.date.accessioned2016-02-05T21:30:14Z
dc.date.available2016-02-05T21:30:14Z
dc.date.created2014-12
dc.date.issued2014-11-04
dc.date.submittedDecember 2014
dc.date.updated2016-02-05T21:30:14Z
dc.description.abstractWave propagation stencil kernels are engines of seismic imaging algo- rithms. These kernels are both compute- and memory-intensive. This work targets improving the performance of wave equation based stencil code parallelized by OpenMP on a multi-core CPU. To achieve this goal, we explored two techniques: improving vectorization by using hardware SIMD technology, and reducing memory traffic to mitigate the bottle- neck caused by limited memory bandwidth. We show that with loop interchange, memory alignment, and compiler hints, both icc and gcc compilers can provide fully-vectorized stencil code of any order with per- formance comparable to that of SIMD intrinsic code. To reduce cache misses, we present three methods in the context of OpenMP paralleliza- tion: rearranging loop structure, blocking thread accesses, and temporal loop blocking. Our results demonstrate that fully-vectorized high-order stencil code will be about 2X faster if implemented with either of the first two methods, and fully-vectorized low-order stencil code will be about 1.2X faster if implemented with the combination of the last two methods. Our final best-performing code achieves 20%∼30% of peak GFLOPs/sec, depending on stencil order and compiler.
dc.format.mimetypeapplication/pdf
dc.identifier.citationZhou, Muhong. "Wave Equation Based Stencil Optimizations on a Multi-core CPU." (2014) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/88412">https://hdl.handle.net/1911/88412</a>.
dc.identifier.urihttps://hdl.handle.net/1911/88412
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectseismic modeling
dc.subjectacoustic wave propagation
dc.subjecthigh performance computing
dc.subjectSIMD
dc.subjectcache optimizations
dc.subjectOpenMP parallelization
dc.subjectstencil optimization
dc.titleWave Equation Based Stencil Optimizations on a Multi-core CPU
dc.typeThesis
dc.type.materialText
thesis.degree.departmentComputational and Applied Mathematics
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Arts
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ZHOU-DOCUMENT-2014.pdf
Size:
3.68 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: