Exploring CLIP for Assessing the Look and Feel of Images
Nanyang Technological University
Indexed incrossref
Abstract
Measuring the perception of visual content is a long-standing problem in computer vision. Many mathematical models have been developed to evaluate the look or quality of an image. Despite the effectiveness of such tools in quantifying degradations such as noise and blurriness levels, such quantification is loosely coupled with human language. When it comes to more abstract perception about the feel of visual content, existing methods can only rely on supervised models that are explicitly trained with labeled data collected via laborious user study. In this paper, we go beyond the conventional paradigms by exploring the rich visual language prior encapsulated in Contrastive Language-Image Pre-training (CLIP)…
Citation impact
534
total citations
- FWCI
- 31.44
- Percentile
- 100%
- References
- 61
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Computer science
- Perception
- Task (project management)
- Quality (philosophy)
- Prior probability
- Artificial intelligence
- Noise (video)
- Image (mathematics)
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.