AugMMRev: An LLM-Augmented Multimodal Ranking Model for Personalized Image Material Retrieval
Hangzhou Dianzi University · Alibaba Group (China) · +2 more institutions
Abstract
Consumer electronics devices—including smartphones and smart cameras—generate massive volumes of image data. Image retrieval serves as a critical enabling technology for diverse image-centric applications in consumer electronic applications (like AI-powered photo retrieval in smartphone, efficient media asset and creative template retrieval in smart camera, streaming recommendation systems for smart TVs). Nevertheless, text-query-based image retrieval encounters unique challenges within consumer electronics environments. First, in consumer electronics applications, text-to-image retrieving queries are typically concise, frequently leading to ambiguity in intent. Second,numerous images lack textual descriptions…
Citation impact
- FWCI
- 108.94
- Percentile
- 100%
- References
- 0
Authors
9Topics & keywords
- Image retrieval
- Metadata
- Search engine indexing
- Ranking (information retrieval)
- Pipeline (software)
- Visual Word
- Mobile device
- Ambiguity