CIDEr: Consensus-based image description evaluation

Vedantam, Ramakrishna; Zitnick, C. Lawrence; Parikh, Devi

doi:10.1109/cvpr.2015.7299087

Article · Jun 1, 2015 · Closed access

Ramakrishna Vedantam · C. Lawrence Zitnick · Devi Parikh

Virginia Tech · Microsoft (United States)

Indexed in Crossref

Abstract

Automatically describing an image with a sentence is a long-standing challenge in computer vision and natural language processing. Due to recent progress in object detection, attribute classification, action recognition, etc., there is renewed interest in this area. However, evaluating the quality of descriptions has proven to be challenging. We propose a novel paradigm for evaluating image descriptions that uses human consensus. This paradigm consists of three main parts: a new triplet-based method of collecting human annotations to measure consensus, a new automated metric that captures consensus, and two new datasets: PASCAL-50S and ABSTRACT-50S that contain 50 sentences describing each image. Our simple…

Citation impact

Total citations: 4,692

FWCI: 102.75
Percentile: 100%
References: 79

Authors: 3

Topics & keywords

Keywords

Computer science
Pascal (unit)
Benchmarking
Artificial intelligence
Metric (unit)
Benchmark (surveying)
Protocol (science)
Information retrieval

UN Sustainable Development Goals

Quality Education
