Fine-tuning large language models for domain adaptation: exploration of training strategies, scaling, model merging and synergistic capabilities
Massachusetts Institute of Technology
Indexed incrossrefdoaj
Abstract
The advancement of Large Language Models (LLMs) for domain applications in fields such as materials science and engineering depends on the development of fine-tuning strategies that adapt models for specialized, technical capabilities. In this work, we explore the effects of Continued Pretraining (CPT), Supervised Fine-Tuning (SFT), and various preference-based optimization approaches, including Direct Preference Optimization (DPO) and Odds Ratio Preference Optimization (ORPO), on fine-tuned LLM performance. Our analysis shows how these strategies influence model outcomes and reveals that the merging of multiple fine-tuned models can lead to the emergence of capabilities that surpass the individual…
Citation impact
76
total citations
- FWCI
- 150.85
- Percentile
- 100%
- References
- 38
Citations per year
Authors
3Topics & keywords
Keywords
- Adaptation (eye)
- Fine-tuning
- Computer science
- Scaling
- Domain adaptation
- Language model
- Domain (mathematical analysis)
- Training (meteorology)
No related works found for this paper.