Scalable Nearest Neighbor Algorithms for High Dimensional Data

University of British Columbia

PubMed
Indexed incrossrefpubmed

Abstract

For many computer vision and machine learning problems, large training sets are key for good performance. However, the most computationally expensive part of many computer vision and machine learning algorithms consists of finding nearest neighbor matches to high dimensional vectors that represent the training data. We propose new algorithms for approximate nearest neighbor matching and evaluate and compare them with previous algorithms. For matching high dimensional features, we find two algorithms to be the most efficient: the randomized k-d forest and a new algorithm proposed in this paper, the priority search k-means tree. We also propose a new algorithm for matching binary features by searching multiple…

Citation impact

1,406
total citations
FWCI
104.04
Percentile
100%
References
82
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Best bin first
  • k-nearest neighbors algorithm
  • Nearest-neighbor chain algorithm
  • Nearest neighbor search
  • Cover tree
  • Scalability
  • Matching (statistics)
No related works found for this paper.