articleJan 1, 2015GOLD OA
chrF: character n-gram F-score for automatic MT evaluation
Humboldt-Universität zu Berlin
Indexed incrossref
Abstract
We propose the use of character n-gram F-score for automatic evaluation of machine translation output. Character ngrams have already been used as a part of more complex metrics, but their individual potential has not been investigated yet. We report system-level correlations with human rankings for 6-gram F1-score (CHRF) on the WMT12, WMT13 and WMT14 data as well as segment-level correlation for 6gram F1 (CHRF) and F3-scores (CHRF3) on WMT14 data for all available target languages. The results are very promising, especially for the CHRF3 score -for translation from English, this variant showed the highest segment-level correlations outperforming even the best metrics on the WMT14 shared evaluation task.
Citation impact
960
total citations
- FWCI
- 36.51
- Percentile
- 100%
- References
- 10
Citations per year
Authors
1Topics & keywords
Topics
Keywords
- Gram
- Character (mathematics)
- n-gram
- Computer science
- Artificial intelligence
- Natural language processing
- Mathematics
- Geology
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.