articleJun 7, 2012Closed access

SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity

University of the Basque Country · Stanford University · +1 more institution

Abstract

Semantic Textual Similarity (STS) measures the degree of semantic equivalence between two texts. This paper presents the results of the STS pilot task in Semeval. The training data contained 2000 sentence pairs from pre-viously existing paraphrase datasets and ma-chine translation evaluation resources. The test data also comprised 2000 sentences pairs for those datasets, plus two surprise datasets with 400 pairs from a different machine trans-lation evaluation corpus and 750 pairs from a lexical resource mapping exercise. The sim-ilarity of pairs of sentences was rated on a 0-5 scale (low to high similarity) by human judges using Amazon Mechanical Turk, with high Pearson correlation scores, around 90%. 35…

Citation impact

679
total citations
FWCI
80.92
Percentile
100%
References
10
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Natural language processing
  • Semantic similarity
  • Paraphrase
  • Artificial intelligence
  • SemEval
  • Pearson product-moment correlation coefficient
  • Machine translation
No related works found for this paper.