Improving Distributional Similarity with Lessons Learned from Word Embeddings

Levy, Omer; Goldberg, Yoav; Dagan, Ido

doi:10.1162/tacl_a_00134

articleTransactions of the Association for Computational LinguisticsDec 1, 2015DIAMOND OA

Improving Distributional Similarity with Lessons Learned from Word Embeddings

OLOmer Levy YGYoav Goldberg IDIdo Dagan

Bar-Ilan University

Indexed incrossrefdoaj

Abstract

Recent trends suggest that neural-network-inspired word embedding models outperform traditional count-based distributional models on word similarity and analogy detection tasks. We reveal that much of the performance gains of word embeddings are due to certain system design choices and hyperparameter optimizations, rather than the embedding algorithms themselves. Furthermore, we show that these modifications can be transferred to traditional distributional models, yielding similar gains. In contrast to prior reports, we observe mostly local or insignificant performance differences between the methods, with no global advantage to any single approach over the others.

Citation impact

1,340

total citations

FWCI: 250.19
Percentile: 100%
References: 31

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Computer science
Word (group theory)
Similarity (geometry)
Hyperparameter
Analogy
Embedding
Word embedding
Artificial intelligence

No related works found for this paper.

Funding

DF
Deutsche Forschungsgemeinschaft
Award: DA 1600/1-1