preprintOct 26, 2023GOLD OA

A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal Recommendation

Nanyang Technological University

Indexed inarxivcrossref

Abstract

Multimodal recommender systems utilizing multimodal features (e.g., images and textual descriptions) typically show better recommendation accuracy than general recommendation models based solely on user-item interactions. Generally, prior work fuses multimodal features into item ID embeddings to enrich item representations, thus failing to capture the latent semantic item-item structures. In this context, LATTICE proposes to learn the latent structure between items explicitly and achieves state-of-the-art performance for multimodal recommendations. However, we argue the latent graph structure learning of LATTICE is both inefficient and unnecessary. Experimentally, we demonstrate that freezing its item-item…

Citation impact

181
total citations
FWCI
78.99
Percentile
100%
References
24
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Recommender system
  • Graph
  • Machine learning
  • Theoretical computer science
  • Artificial intelligence
  • Information retrieval
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding