articleJan 1, 2003GOLD OA

Automatic evaluation of summaries using N-gram co-occurrence statistics

University of Southern California · Marina Del Rey Hospital

Indexed incrossref

Abstract

Following the recent adoption by the machine translation community of automatic evaluation using the BLEU/NIST scoring process, we conduct an in-depth study of a similar idea for evaluating summaries. The results show that automatic evaluation using unigram co-occurrences between summary pairs correlates surprising well with human evaluations, based on various statistical metrics; while direct application of the BLEU evaluation procedure does not always give good results.

Citation impact

1,586
total citations
FWCI
70.74
Percentile
100%
References
14
Citations per year

Authors

2

Topics & keywords

Keywords
  • NIST
  • BLEU
  • Computer science
  • n-gram
  • Machine translation
  • Artificial intelligence
  • Natural language processing
  • Evaluation of machine translation
No related works found for this paper.