Abstract

Cross-modal retrieval aims to enable flexible retrieval experience across different modalities (e.g., texts vs. images). The core of cross-modal retrieval research is to learn a common subspace where the items of different modalities can be directly compared to each other. In this paper, we present a novel Adversarial Cross-Modal Retrieval (ACMR) method, which seeks an effective common subspace based on adversarial learning. Adversarial learning is implemented as an interplay between two processes. The first process, a feature projector, tries to generate a modality-invariant representation in the common subspace and to confuse the other process, modality classifier, which tries to discriminate between…

Citation impact

770
total citations
FWCI
28.28
Percentile
100%
References
43
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Subspace topology
  • Modal
  • Modalities
  • Artificial intelligence
  • Modality (human–computer interaction)
  • Classifier (UML)
  • Representation (politics)
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.

Funding