Cache-Efficient Graph Algorithms for Near Neighbor Search

dc.contributor.advisorShrivastava, Anshumalien_US
dc.creatorColeman, Benen_US
dc.date.accessioned2021-11-04T13:59:41Zen_US
dc.date.available2022-06-01T05:01:12Zen_US
dc.date.created2021-12en_US
dc.date.issued2021-11-01en_US
dc.date.submittedDecember 2021en_US
dc.date.updated2021-11-04T13:59:42Zen_US
dc.description.abstractGraph search has recently become one of the most successful algorithmic trends for near neighbor search. Several of the most popular and empirically successful algorithms are, at their core, a simple walk along a pruned near neighbor graph. Such methods consistently outperform other approaches and are a central component of industrial- scale information retrieval and recommendation systems. However, graph algorithms often suffer from issues related to the memory access pattern of graph traversal. Our measurements show that near neighbor search is no exception to this rule: popular graph indices have poor cache performance and rely on complex heuristics with a large memory cost. To address this problem, we apply graph reordering methods to near neighbor graphs. Graph reordering is a memory layout optimization that groups commonly-accessed nodes together in memory. We present exhaustive experiments that apply several reordering algorithms to the hierarchical navigable small-world (HNSW) graph, and we analyze the algorithms under the ideal cache model. We find that reordering improves the query time by up to 40%. We also demonstrate that popular heuristics can be replaced by simpler alternatives with no performance loss, and we show that the time needed to reorder the graph is negligible compared to the time required to construct the index.en_US
dc.embargo.terms2022-06-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationColeman, Ben. "Cache-Efficient Graph Algorithms for Near Neighbor Search." (2021) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/111627">https://hdl.handle.net/1911/111627</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/111627en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectnear neighbor searchen_US
dc.subjectgraphsen_US
dc.subjectcachingen_US
dc.titleCache-Efficient Graph Algorithms for Near Neighbor Searchen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentElectrical and Computer Engineeringen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
COLEMAN-DOCUMENT-2021.pdf
Size:
2.23 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: