Cache-Efficient Graph Algorithms for Near Neighbor Search

dc.contributor.advisorShrivastava, Anshumali
dc.creatorColeman, Ben
dc.date.accessioned2021-11-04T13:59:41Z
dc.date.available2022-06-01T05:01:12Z
dc.date.created2021-12
dc.date.issued2021-11-01
dc.date.submittedDecember 2021
dc.date.updated2021-11-04T13:59:42Z
dc.description.abstractGraph search has recently become one of the most successful algorithmic trends for near neighbor search. Several of the most popular and empirically successful algorithms are, at their core, a simple walk along a pruned near neighbor graph. Such methods consistently outperform other approaches and are a central component of industrial- scale information retrieval and recommendation systems. However, graph algorithms often suffer from issues related to the memory access pattern of graph traversal. Our measurements show that near neighbor search is no exception to this rule: popular graph indices have poor cache performance and rely on complex heuristics with a large memory cost. To address this problem, we apply graph reordering methods to near neighbor graphs. Graph reordering is a memory layout optimization that groups commonly-accessed nodes together in memory. We present exhaustive experiments that apply several reordering algorithms to the hierarchical navigable small-world (HNSW) graph, and we analyze the algorithms under the ideal cache model. We find that reordering improves the query time by up to 40%. We also demonstrate that popular heuristics can be replaced by simpler alternatives with no performance loss, and we show that the time needed to reorder the graph is negligible compared to the time required to construct the index.
dc.embargo.terms2022-06-01
dc.format.mimetypeapplication/pdf
dc.identifier.citationColeman, Ben. "Cache-Efficient Graph Algorithms for Near Neighbor Search." (2021) Master’s Thesis, Rice University. <a href="https://hdl.handle.net/1911/111627">https://hdl.handle.net/1911/111627</a>.
dc.identifier.urihttps://hdl.handle.net/1911/111627
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectnear neighbor search
dc.subjectgraphs
dc.subjectcaching
dc.titleCache-Efficient Graph Algorithms for Near Neighbor Search
dc.typeThesis
dc.type.materialText
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
COLEMAN-DOCUMENT-2021.pdf
Size:
2.23 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: