Article | Jan 1, 2024 | Gold OA

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Indexed in Crossref

Abstract

Damai Dai, Chengqi Deng, Chenggang Zhao, R.X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y.K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024.

Citation impact

Total citations: 178
FWCI: 56.57
Percentile: 100%
References: 0

Authors

17

Topics & keywords

Keywords
  • Computer science
  • Natural language processing
  • Artificial intelligence

Funding