Automatic evaluation of machine translation quality using n-gram co-occurrence statistics
National Institute of Standards · National Institute of Standards and Technology
Abstract
Evaluation is recognized as an extremely helpful forcing function in Human Language Technology R&D. Unfortunately, evaluation has not been a very powerful tool in machine translation (MT) research because it requires human judgments and is thus expensive and time-consuming and not easily factored into the MT research agenda. However, at the July 2001 TIDES PI meeting in Philadelphia, IBM described an automatic MT evaluation technique that can provide immediate feedback and guidance in MT research. Their idea, which they call an "evaluation understudy", compares MT output with expert reference translations in terms of the statistics of short sequences of words (word N-grams). The more of these N-grams that a…
Citation impact
- FWCI
- 55.86
- Percentile
- 100%
- References
- 0
Authors
1Topics & keywords
- NIST
- IBM
- Machine translation
- Evaluation of machine translation
- Computer science
- Quality (philosophy)
- Translation (biology)
- Artificial intelligence
- Quality Education