articleProcedia Computer ScienceJan 1, 2017DIAMOND OA

AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP

Nile University

Indexed incrossref

Abstract

Advancements in neural networks have led to developments in fields like computer vision, speech recognition and natural language processing (NLP). One of the most influential recent developments in NLP is the use of word embeddings, where words are represented as vectors in a continuous space, capturing many syntactic and semantic relations among them. AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models. The first version of AraVec provides six different word embedding models built on top of three different Arabic content domains; Tweets, World Wide Web pages and…

Citation impact

512
total citations
FWCI
33.09
Percentile
100%
References
15
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Word embedding
  • Natural language processing
  • Artificial intelligence
  • Word (group theory)
  • Embedding
  • Preprocessor
  • Set (abstract data type)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.