articleIEEE Transactions on Geoscience and Remote SensingJan 1, 2026Closed access

A Size-Aware Graph Embedding Approach to Remote Sensing Image Captioning With Object Relative Size Information

China University of Petroleum, East China

Indexed incrossref

Abstract

Remote sensing image captioning is the task of automatically generating descriptive texts for remotely sensed scenes and objects. A common shortcoming of existing methods is the inadequate consideration of object size, which often leads to captions that either omit size information or provide imprecise size descriptions. To overcome this deficiency, we develop a novel framework composed of three modules: (a) an object confirmation and relative size estimation module, (b) a graph construction and graph convolution module, and (c) a caption generation module. Our framework comprehensively characterizes object size to generate more quantitatively informative captions. Furthermore, we introduce a new evaluation…

Citation impact

5
total citations
FWCI
139.72
Percentile
100%
References
28
Too recent for citation history.

Authors

5

Topics & keywords

Keywords
  • Correctness
  • Object (grammar)
  • Graph
  • Benchmark (surveying)
  • Embedding
  • Construct (python library)
  • Object detection
  • Closed captioning
No related works found for this paper.

Funding