AnyDoor: Zero-shot Object-level Image Customization
University of Hong Kong · Alibaba Group (United States)
Abstract
This work presents AnyDoor, a diffusion-based image generator with the power to teleport target objects to new scenes at user-specified locations with desired shapes. Instead of tuning parameters for each object, our model is trained only once and effortlessly generalizes to diverse object-scene combinations at the inference stage. Such a challenging zero-shot setting requires an adequate characterization of a certain object. To this end, we complement the commonly used identity feature with detail features, which are carefully designed to maintain appearance details yet allow versatile local variations (e.g., lighting, orientation, posture, etc.), supporting the object in favorably blending with different…
Citation impact
- FWCI
- 32.77
- Percentile
- 100%
- References
- 0
Authors
6Topics & keywords
- Zero (linguistics)
- Shot (pellet)
- Personalization
- Object (grammar)
- Computer science
- Computer vision
- Image (mathematics)
- Artificial intelligence