ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Team GLM; Zeng, Aohan; Xu, Bin; Wang, Bowen; Zhang, Chenhui; Yin, Da Gang; Zhang, Dan; Rojas, Diego; Feng, Guanyu; Zhao, Hanlin; Lai, Hanyu; Yu, Hao; Wang, Hongning; Sun, Jiadai; Zhang, Jiajie; Cheng, Jiale; Gui, Jiayi; Tang, Jie; Zhang, Jing; Sun, Jingyu; Li, Juanzi; Zhao, Lei; Wu, Lindong; Zhong, Lucen; Liu, M.; Huang, Minlie; Zhang, Peng; Zheng, Qinkai; Lu, Rui; Duan, Shuaiqi; Zhang, Shudan; Cao, Shulin; Yang, Shuxun; Tam, Weng Lam; Zhao, Wenyi; Liu, Xiao; Xiao, Xia; Zhang, Xiaohan; Gu, Xiaotao; Lv, Xin; Liu, Xinghan; Liu, Xinyi; Yang, Xinyue; Song, Xixuan; Zhang, Xunkai; An, Yifan; Xu, Yifan; Niu, Yilin; Yang, Yuantao; Li, Yueyan; Bai, Yushi; Dong, Yuxiao; Qi, Zehan; Wang, Zhaoyu; Yang, Zhen; Du, Zhengxiao; Hou, Zhenyu; Wang, Zihan

doi:10.48550/arxiv.2406.12793

Preprint | arXiv (Cornell University) | Jun 18, 2024 | Green OA

Indexed in: arXiv, DataCite

Abstract

We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models, trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillion tokens, mostly in Chinese and English, along with a small corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback.…

Citation impact

176 total citations

FWCI: —
Percentile: —
References: 0

Authors: 59

Topics & keywords

Topics

Topic Modeling (92%)

Keywords

Generalized linear model
Mathematics
Computer science
Econometrics
Statistics

Funding

Tsinghua University