ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Indexed inarxivdatacite
Abstract
We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillions of tokens mostly in Chinese and English, along with a small set of corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback.…
Citation impact
176
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
59Topics & keywords
Topics
Keywords
- Generalized linear model
- Mathematics
- Computer science
- Econometrics
- Statistics
No related works found for this paper.