articlenpj Computational MaterialsMar 28, 2025GOLD OA

Fine-tuning large language models for domain adaptation: exploration of training strategies, scaling, model merging and synergistic capabilities

Massachusetts Institute of Technology

Indexed incrossrefdoaj

Abstract

The advancement of Large Language Models (LLMs) for domain applications in fields such as materials science and engineering depends on the development of fine-tuning strategies that adapt models for specialized, technical capabilities. In this work, we explore the effects of Continued Pretraining (CPT), Supervised Fine-Tuning (SFT), and various preference-based optimization approaches, including Direct Preference Optimization (DPO) and Odds Ratio Preference Optimization (ORPO), on fine-tuned LLM performance. Our analysis shows how these strategies influence model outcomes and reveals that the merging of multiple fine-tuned models can lead to the emergence of capabilities that surpass the individual…

Citation impact

76
total citations
FWCI
150.85
Percentile
100%
References
38
Citations per year

Authors

3

Topics & keywords

Keywords
  • Adaptation (eye)
  • Fine-tuning
  • Computer science
  • Scaling
  • Domain adaptation
  • Language model
  • Domain (mathematical analysis)
  • Training (meteorology)
No related works found for this paper.