Fine-tuning large language models for domain adaptation: exploration of training strategies, scaling, model merging and synergistic capabilities

Lu, Wei; Luu, Rachel K.; Buehler, Markus J.

doi:10.1038/s41524-025-01564-y

articlenpj Computational MaterialsMar 28, 2025GOLD OA

Fine-tuning large language models for domain adaptation: exploration of training strategies, scaling, model merging and synergistic capabilities

WLWei Lu RKRachel K. Luu MJMarkus J. Buehler

Massachusetts Institute of Technology

Indexed incrossrefdoaj

Abstract

The advancement of Large Language Models (LLMs) for domain applications in fields such as materials science and engineering depends on the development of fine-tuning strategies that adapt models for specialized, technical capabilities. In this work, we explore the effects of Continued Pretraining (CPT), Supervised Fine-Tuning (SFT), and various preference-based optimization approaches, including Direct Preference Optimization (DPO) and Odds Ratio Preference Optimization (ORPO), on fine-tuned LLM performance. Our analysis shows how these strategies influence model outcomes and reveals that the merging of multiple fine-tuned models can lead to the emergence of capabilities that surpass the individual…

Citation impact

76

total citations

FWCI: 150.85
Percentile: 100%
References: 38

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Adaptation (eye)
Fine-tuning
Computer science
Scaling
Domain adaptation
Language model
Domain (mathematical analysis)
Training (meteorology)

No related works found for this paper.