preprintOct 26, 2023GOLD OA
A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal Recommendation
Nanyang Technological University
Indexed inarxivcrossref
Abstract
Multimodal recommender systems utilizing multimodal features (e.g., images and textual descriptions) typically show better recommendation accuracy than general recommendation models based solely on user-item interactions. Generally, prior work fuses multimodal features into item ID embeddings to enrich item representations, thus failing to capture the latent semantic item-item structures. In this context, LATTICE proposes to learn the latent structure between items explicitly and achieves state-of-the-art performance for multimodal recommendations. However, we argue the latent graph structure learning of LATTICE is both inefficient and unnecessary. Experimentally, we demonstrate that freezing its item-item…
Citation impact
181
total citations
- FWCI
- 78.99
- Percentile
- 100%
- References
- 24
Citations per year
Authors
2Topics & keywords
Keywords
- Computer science
- Recommender system
- Graph
- Machine learning
- Theoretical computer science
- Artificial intelligence
- Information retrieval
UN Sustainable Development Goals
- Peace, Justice and strong institutions
No related works found for this paper.