SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
Carnegie Mellon University · Adobe Systems (United States) · +1 more institution
Abstract
Generic image inpainting aims to complete a corrupted image by borrowing surrounding information, so it rarely generates novel content. By contrast, multi-modal inpainting provides more flexible and useful controls over the inpainted content: for example, a text prompt can describe an object with richer attributes, and a mask can constrain the shape of the inpainted object rather than merely marking a missing area. We propose a new diffusion-based model named SmartBrush for completing a missing region with an object using both text and shape guidance. While previous work such as DALLE-2 and Stable Diffusion can do text-guided inpainting, they do not support shape guidance and tend to…
Citation impact
- FWCI
- 22.26
- Percentile
- 100%
- References
- 53
Authors
- 5
Topics & keywords
- Inpainting
- Computer science
- Artificial intelligence
- Leverage (statistics)
- Object (grammar)
- Computer vision
- Controllability
- Image (mathematics)