preprintarXiv (Cornell University)Jun 18, 2024GREEN OA

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Indexed inarxivdatacite

Abstract

We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillions of tokens mostly in Chinese and English, along with a small set of corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback.…

Citation impact

176
total citations
FWCI
Percentile
References
0
Citations per year

Authors

59

Topics & keywords

Keywords
  • Generalized linear model
  • Mathematics
  • Computer science
  • Econometrics
  • Statistics
No related works found for this paper.

Funding