articleJan 1, 2003GOLD OA
Improved automatic keyword extraction given more linguistic knowledge
Indexed incrossref
Abstract
In this paper, experiments on automatic extraction of keywords from abstracts using a supervised machine learning algorithm are discussed. The main point of this paper is that by adding linguistic knowledge to the representation (such as syntactic features), rather than relying only on statistics (such as term frequency and n-grams), a better result is obtained as measured by keywords previously assigned by professional indexers. In more detail, extracting NP-chunks gives a better precision than n-grams, and by adding the PoS tag(s) assigned to the term as a feature, a dramatic improvement of the results is obtained, independent of the term selection approach applied.
Citation impact
966
total citations
- FWCI
- 7.03
- Percentile
- 100%
- References
- 31
Citations per year
Authors
1Topics & keywords
Topics
Keywords
- Computer science
- Term (time)
- Artificial intelligence
- Point (geometry)
- Selection (genetic algorithm)
- Natural language processing
- Representation (politics)
- Feature extraction
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.