Article | Jan 6, 2007 | Closed access

Computing semantic relatedness using Wikipedia-based explicit semantic analysis

Technion – Israel Institute of Technology

Abstract

Computing semantic relatedness of natural language texts requires access to vast amounts of common-sense and domain-specific world knowledge. We propose Explicit Semantic Analysis (ESA), a novel method that represents the meaning of texts in a high-dimensional space of concepts derived from Wikipedia. We use machine learning techniques to explicitly represent the meaning of any text as a weighted vector of Wikipedia-based concepts. Assessing the relatedness of texts in this space amounts to comparing the corresponding vectors using conventional metrics (e.g., cosine). Compared with the previous state of the art, using ESA results in substantial improvements in correlation of computed relatedness scores with…
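
The representation described in the abstract (a text as a weighted vector of concepts, compared by cosine) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the three toy "concepts" stand in for Wikipedia articles, TF-IDF is used as the word-to-concept weighting because the abstract does not state the scheme, and the function names `build_index`, `concept_vector`, and `relatedness` are hypothetical. It is not the authors' implementation.

```python
import math
from collections import Counter

# Toy stand-ins for Wikipedia articles (hypothetical data, not from the paper).
CONCEPTS = {
    "Finance": "bank money loan interest bank credit",
    "River": "river water bank flow stream",
    "Computer": "computer software program code data",
}


def tokenize(text):
    """Lowercase and split on whitespace; a real system would do far more."""
    return text.lower().split()


def build_index(concepts):
    """Map each word to a {concept: weight} dict (assumed TF-IDF weighting)."""
    n_concepts = len(concepts)
    term_counts = {name: Counter(tokenize(body)) for name, body in concepts.items()}
    # Document frequency: in how many concepts does each word occur?
    doc_freq = Counter(word for counts in term_counts.values() for word in counts)
    index = {}
    for name, counts in term_counts.items():
        for word, tf in counts.items():
            idf = math.log(n_concepts / doc_freq[word])
            if idf > 0:  # words present in every concept carry no signal
                index.setdefault(word, {})[name] = tf * idf
    return index


def concept_vector(text, index):
    """Represent a text as a sparse weighted vector over concepts."""
    vector = {}
    for word, tf in Counter(tokenize(text)).items():
        for concept, weight in index.get(word, {}).items():
            vector[concept] = vector.get(concept, 0.0) + tf * weight
    return vector


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(concept, 0.0) for concept, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # a text with no known words is unrelated to everything
    return dot / (norm_u * norm_v)


def relatedness(text_a, text_b, index):
    """Relatedness of two texts as the cosine of their concept vectors."""
    return cosine(concept_vector(text_a, index), concept_vector(text_b, index))


if __name__ == "__main__":
    index = build_index(CONCEPTS)
    print(relatedness("bank loan", "money", index))
    print(relatedness("bank loan", "water", index))
```

In this toy setup the two texts never need to share a word: "bank loan" and "money" are related because both map mostly onto the same concept, which is the point of comparing texts in concept space rather than word space.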

Citation impact

  • Total citations: 1,989
  • FWCI: 157.22
  • Percentile: 100%
  • References: 30

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Semantic similarity
  • Meaning (existential)
  • Cosine similarity
  • Semantic space
  • Natural language processing
  • Vector space
  • Artificial intelligence
UN Sustainable Development Goals
  • Quality Education