Character-Aware Neural Language Models

Harvard University Press · New York University

Indexed incrossref

Abstract

We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word-level. Our model employs a convolutional neural network (CNN) and a highway net work over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing state-of-the-art despite having 60% fewer parameters. On languages with rich morphology (Arabic, Czech, French, German, Spanish, Russian), the model outperforms word-level/morpheme-level LSTM baselines, again with fewer parameters. The results suggest that on many languages, character inputs are sufficient for…

Citation impact

1,039
total citations
FWCI
150.25
Percentile
100%
References
69
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Treebank
  • Morpheme
  • Character (mathematics)
  • Natural language processing
  • Artificial intelligence
  • Language model
  • Recurrent neural network
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.