Qwen3 Technical Report

Yang, An; Li, Anfeng; Yang, Baosong; Zhang, Beichen; Hui, Binyuan; Zheng, Bo; Yu, B. X.; Gao, Chang; Huang, C.; Lv, Chenxu; Zheng, Chujie; Liu, Dayiheng; Zhou, Fan; Huang, Fei; Feng, H; Ge, Hao; Wei, Haoran; Huan, Lin; Tang, Jialong; Yang, Jian; Tu, Jianhong; Zhang, Jianwei; Yang, Jianxin; Yang, Jiaxi; Zhou, Jing; Zhou, Jingren; Lin, Junyang; Dang, Kai; Bao, Keqin; Yang, Kexin; Yu, Le; Deng, Lianghao; Li, Mei; Xue, Mingfeng; Li, Mingze; Zhang, Pei; Wang, Peng; Zhu, Qin; Men, Rui; Gao, Ruize; Liu, Shixuan; Luo, Shuang; Li, Tianhao; Tang, Tianyi; Yin, Wenbiao; Ren, Xingzhang; Wang, Xinyu; Zhang, Xinyu; Ren, Xuancheng; Yang, Fan; Yang, Su; Zhang, Yichang; Zhang, Yinger; Wan, Yu; Liu, Yuqiong; Wang, Zekun; Cui, Zeyu; Zhang, Zhenru; Zhou, Zhipeng; Qiu, Zihan

doi:10.48550/arxiv.2505.09388

Preprint, arXiv.org, May 14, 2025, Green OA

Indexed in: arXiv, DataCite

Abstract

In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes models of both dense and Mixture-of-Experts (MoE) architectures, with parameter scales ranging from 0.6 to 235 billion. A key innovation in Qwen3 is the integration of thinking mode (for complex, multi-step reasoning) and non-thinking mode (for rapid, context-driven responses) into a unified framework. This eliminates the need to switch between different models--such as chat-optimized models (e.g., GPT-4o) and dedicated reasoning models (e.g., QwQ-32B)--and enables dynamic mode…

Citation impact

Total citations: 60

FWCI: —
Percentile: —
References: 0

Authors: 60

Topics & keywords

Keywords

Key (lock)
Task (project management)
Mode (computer interface)
Latency (audio)
Language model
Code (set theory)
Technical report