Scalable Nearest Neighbor Algorithms for High Dimensional Data
University of British Columbia
Indexed incrossrefpubmed
Abstract
For many computer vision and machine learning problems, large training sets are key for good performance. However, the most computationally expensive part of many computer vision and machine learning algorithms consists of finding nearest neighbor matches to high dimensional vectors that represent the training data. We propose new algorithms for approximate nearest neighbor matching and evaluate and compare them with previous algorithms. For matching high dimensional features, we find two algorithms to be the most efficient: the randomized k-d forest and a new algorithm proposed in this paper, the priority search k-means tree. We also propose a new algorithm for matching binary features by searching multiple…
Citation impact
1,406
total citations
- FWCI
- 104.04
- Percentile
- 100%
- References
- 82
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Computer science
- Best bin first
- k-nearest neighbors algorithm
- Nearest-neighbor chain algorithm
- Nearest neighbor search
- Cover tree
- Scalability
- Matching (statistics)
No related works found for this paper.