OOTDiffusion: Outfitting Fusion Based Latent Diffusion for Controllable Virtual Try-On

Xu, Yuhao; Gu, Tao; Chen, Weifeng; Chen, Arlene

doi:10.1609/aaai.v39i9.32973

articleProceedings of the AAAI Conference on Artificial IntelligenceApr 11, 2025DIAMOND OA

OOTDiffusion: Outfitting Fusion Based Latent Diffusion for Controllable Virtual Try-On

YXYuhao Xu TGTao Gu WCWeifeng ChenACArlene Chen

Indexed incrossref

Abstract

We present OOTDiffusion, a novel network architecture for realistic and controllable image-based virtual try-on (VTON). We leverage the power of pretrained latent diffusion models, designing an outfitting UNet to learn the detailed garment features. Without a redundant warping process, the garment features are precisely aligned with the target human body via the proposed outfitting fusion in the self-attention layers of the denoising UNet. In order to further enhance the controllability, we introduce outfitting dropout to the training process, which enables us to adjust the strength of the garment features through classifier-free guidance. Our comprehensive experiments on the VITON-HD and Dress Code datasets…

Citation impact

54

total citations

FWCI: 35.58
Percentile: 100%
References: 0

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Fusion
Computer science
Diffusion
Artificial intelligence
Physics
Philosophy
Linguistics

No related works found for this paper.