Semi-Supervised Hashing for Large-Scale Search
IBM (United States) · Google (United States) · +1 more institution
Abstract
Hashing-based approximate nearest neighbor (ANN) search in huge databases has become popular due to its computational and memory efficiency. The popular hashing methods, e.g., Locality Sensitive Hashing and Spectral Hashing, construct hash functions based on random or principal projections. The resulting hashes are either not very accurate or are inefficient. Moreover, these methods are designed for a given metric similarity. On the contrary, semantic similarity is usually given in terms of pairwise labels of samples. There exist supervised hashing methods that can handle such semantic similarity, but they are prone to overfitting when labeled data are small or noisy. In this work, we propose a semi-supervised…
Citation impact
- FWCI
- 54.40
- Percentile
- 100%
- References
- 55
Authors
3Topics & keywords
- Locality-sensitive hashing
- Hash function
- Dynamic perfect hashing
- Computer science
- Universal hashing
- Feature hashing
- Overfitting
- Pattern recognition (psychology)