Automatic evaluation of summaries using N-gram co-occurrence statistics

Lin, Chin-Yew; Hovy, Eduard

doi:10.3115/1073445.1073465

Article · Jan 1, 2003 · Gold OA

University of Southern California · Marina Del Rey Hospital

Indexed in Crossref

Abstract

Following the recent adoption by the machine translation community of automatic evaluation using the BLEU/NIST scoring process, we conduct an in-depth study of a similar idea for evaluating summaries. The results show that automatic evaluation using unigram co-occurrences between summary pairs correlates surprisingly well with human evaluations, based on various statistical metrics, while direct application of the BLEU evaluation procedure does not always give good results.
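
As a rough illustration of what counting n-gram co-occurrences between a candidate summary and a reference summary can look like, here is a minimal sketch. The abstract does not give a formula, so everything here is an assumption: the recall-style normalization with clipped counts, the lowercase whitespace tokenization, and the hypothetical helper names `ngram_counts` and `ngram_cooccurrence_recall`. It should not be read as the exact procedure used in the paper.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_cooccurrence_recall(candidate, reference, n=1):
    """Fraction of reference n-grams that also occur in the candidate.

    Assumption: matches are clipped, so a reference n-gram is credited
    at most as many times as it appears in the candidate.
    Assumption: tokens are lowercase, whitespace-separated words.
    """
    cand = ngram_counts(candidate.lower().split(), n)
    ref = ngram_counts(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        # Reference too short to contain any n-gram of this size.
        return 0.0
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


if __name__ == "__main__":
    candidate = "the cat sat on the mat"
    reference = "the cat was on the mat"
    print(ngram_cooccurrence_recall(candidate, reference, n=1))
    print(ngram_cooccurrence_recall(candidate, reference, n=2))
```

With `n=1` the score reflects unigram overlap only, the setting the abstract reports as correlating well with human judgments; larger `n` rewards matching word order as well.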

Citation impact

Total citations: 1,586
FWCI: 70.74
Percentile: 100%
References: 14
Authors: 2

Topics & keywords

NIST
BLEU
Computer science
n-gram
Machine translation
Artificial intelligence
Natural language processing
Evaluation of machine translation
