articleIEEE AccessJan 1, 2026GOLD OA

AutoML-Pipeline: A RAG-Enhanced Code Generation Framework With Pre-Validation for Cloud-Native Machine Learning Workflows

WZWenyu ZhaoTCTingjie ChenJCJ. C. YangLQLei Qiu

Microsoft Research Asia (China) · University of Utah · +1 more institution

Indexed incrossrefdoaj

Abstract

The proliferation of cloud-native machine learning platforms has significantly accelerated model development and deployment cycles. However, constructing and maintaining heterogeneous pipeline code spanning multiple languages (Python, YAML, Spark SQL) and cloud-specific configurations remains labor-intensive and error-prone. Existing LLM-based code generation tools lack awareness of runtime constraints and historical execution patterns, frequently producing code with resource misconfigurations or dependency conflicts that fail upon deployment. To address these challenges, we propose AutoML-Pipeline, a closed-loop code generation framework that integrates Retrieval-Augmented Generation (RAG) with reinforcement…

Citation impact

10
total citations
FWCI
245.97
Percentile
100%
References
0
Too recent for citation history.

Authors

4

Topics & keywords

Keywords
  • Workflow
  • Code generation
  • Code (set theory)
  • Computational learning theory
  • Source code
No related works found for this paper.