Preprint · arXiv (Cornell University) · Apr 1, 2015 · Green OA

Microsoft COCO Captions: Data Collection and Evaluation Server

Indexed in: arXiv, DataCite

Abstract

In this paper we describe the Microsoft COCO Caption dataset and evaluation server. When completed, the dataset will contain over one and a half million captions describing over 330,000 images. For the training and validation images, five independent human-generated captions will be provided. To ensure consistency in the evaluation of automatic caption generation algorithms, an evaluation server is used. The evaluation server receives candidate captions and scores them using several popular metrics, including BLEU, METEOR, ROUGE and CIDEr. Instructions for using the evaluation server are provided.
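
As a rough illustration of how a candidate caption can be scored against several human references, the sketch below computes clipped unigram precision, the basic building block of BLEU-1. It is not the evaluation server's code: the function name, the whitespace tokenization, and the example captions are assumptions made for illustration, and full BLEU additionally combines higher-order n-grams with a brevity penalty.

```python
from collections import Counter


def clipped_unigram_precision(candidate, references):
    """Clipped unigram precision of one candidate against several references.

    Hypothetical helper for illustration; tokenization is naive lowercase
    whitespace splitting, which is an assumption, not the server's tokenizer.
    """
    cand_tokens = candidate.lower().split()
    if not cand_tokens:
        return 0.0
    cand_counts = Counter(cand_tokens)

    # For each word, keep the largest count seen in any single reference.
    max_ref_counts = Counter()
    for ref in references:
        for word, count in Counter(ref.lower().split()).items():
            max_ref_counts[word] = max(max_ref_counts[word], count)

    # A candidate word is credited at most as often as it appears in a reference.
    clipped = sum(min(count, max_ref_counts[word])
                  for word, count in cand_counts.items())
    return clipped / len(cand_tokens)


# Invented example captions, not taken from the dataset.
references = ["a man rides a horse", "a person on a horse"]
print(clipped_unigram_precision("a man riding a horse", references))
```

Clipping keeps a degenerate caption such as "the the the" from scoring well simply by repeating a word that occurs once in a reference.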

Citation impact

Total citations: 1,631
References: 47

Authors

7

Topics & keywords

Keywords
  • COCO
  • Computer science
  • Database
  • Operating system
  • Data collection
  • World Wide Web
  • Artificial intelligence
  • Mathematics