Cross-modal Retrieval with Correspondence Autoencoder

Feng, Fangxiang; Wang, Xiaojie; Li, Ruifan

doi:10.1145/2647868.2654902

articleOct 31, 2014Closed access

Cross-modal Retrieval with Correspondence Autoencoder

FFFangxiang Feng XWXiaojie Wang RLRuifan Li

Beijing University of Posts and Telecommunications

Indexed incrossref

Abstract

The problem of cross-modal retrieval, e.g., using a text query to search for images and vice-versa, is considered in this paper. A novel model involving correspondence autoencoder (Corr-AE) is proposed here for solving this problem. The model is constructed by correlating hidden representations of two uni-modal autoencoders. A novel optimal objective, which minimizes a linear combination of representation learning errors for each modality and correlation learning error between hidden representations of two modalities, is used to train the model as a whole. Minimization of correlation learning error forces the model to learn hidden representations with only common information in different modalities, while…

Citation impact

598

total citations

FWCI: 28.17
Percentile: 100%
References: 27

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Autoencoder
Canonical correlation
Artificial intelligence
Modal
Computer science
Representation (politics)
Modality (human–computer interaction)
Pattern recognition (psychology)

No related works found for this paper.