Abstract
During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R. We give a survey on text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis methods, text clustering, text classification and string kernels.
Citation impact
1,140
total citations
- FWCI
- 29.76
- Percentile
- 100%
- References
- 78
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Computer science
- Cluster analysis
- String (physics)
- Data mining
- Text mining
- Data science
- Artificial intelligence
- Mathematics
UN Sustainable Development Goals
- Industry, innovation and infrastructure
No related works found for this paper.