Review of large vision models and visual prompt engineering
Northwestern Polytechnical University · University of Georgia · +10 more institutions
Abstract
Visual prompt engineering is a fundamental methodology in the field of visual and image artificial general intelligence. As the development of large vision models progresses, the importance of prompt engineering becomes increasingly evident. Designing suitable prompts for specific visual tasks has emerged as a meaningful research direction. This review aims to summarize the methods employed in the computer vision domain for large vision models and visual prompt engineering, exploring the latest advancements in visual prompt engineering. We present influential large models in the visual domain and a range of prompt engineering methods employed on these models. It is our hope that this review provides a…
Citation impact
- FWCI
- 22.54
- Percentile
- 100%
- References
- 227
Authors
21Topics & keywords
- Computer science
- Domain (mathematical analysis)
- Field (mathematics)
- Artificial intelligence
- Data science
- Human–computer interaction
- Mathematics