Article · IEEE Geoscience and Remote Sensing Magazine · Apr 19, 2024 · Closed access

Vision-Language Models in Remote Sensing: Current progress and future trends

King Abdullah University of Science and Technology · New York University Abu Dhabi · +4 more institutions

Indexed in Crossref

Abstract

The remarkable achievements of ChatGPT and Generative Pre-trained Transformer 4 (GPT-4) have sparked a wave of interest and research in the field of large language models (LLMs) for artificial general intelligence (AGI). These models provide intelligent solutions that are closer to human thinking, enabling us to use general artificial intelligence (AI) to solve problems in various applications. However, in the field of remote sensing (RS), the scientific literature on the implementation of AGI remains relatively scant. Existing AI-related research in RS focuses primarily on visual-understanding tasks while neglecting the semantic understanding of the objects and their relationships. This is where vision-LMs…
