Abstract
Measuring the similarity between words, sentences, paragraphs and documents is an important component in various tasks such as information retrieval, document clustering, word-sense disambiguation, automatic essay scoring, short answer grading, machine translation and text summarization. This survey discusses the existing works on text similarity through partitioning them into three approaches; String-based, Corpus-based and Knowledgebased similarities. Furthermore, samples of combination between these similarities are presented.
Citation impact
805
total citations
- FWCI
- 39.53
- Percentile
- 100%
- References
- 40
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Computer science
- Similarity (geometry)
- Information retrieval
- Natural language processing
- Data science
- Artificial intelligence
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.