Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition
Hangzhou Dianzi University · Lenovo (China) · +1 more institution
Abstract
The click feature of an image, defined as the user click frequency vector of the image on a predefined word vocabulary, is known to effectively reduce the semantic gap for fine-grained image recognition. Unfortunately, user click frequency data are usually absent in practice. It remains challenging to predict the click feature from the visual feature, because the user click frequency vector of an image is always noisy and sparse. In this paper, we devise a Hierarchical Deep Word Embedding (HDWE) model by integrating sparse constraints and an improved RELU operator to address click feature prediction from visual features. HDWE is a coarse-to-fine click feature predictor that is learned with the help of an…
Citation impact
- FWCI
- 33.29
- Percentile
- 100%
- References
- 58
Authors
5Topics & keywords
- Computer science
- Feature (linguistics)
- Artificial intelligence
- Pattern recognition (psychology)
- Scalability
- Semantics (computer science)
- Embedding
- Feature extraction
- Quality Education