articleSep 1, 2009Closed access

Decomposing a scene into geometric and semantically consistent regions

Stanford University

Indexed incrossref

Abstract

High-level, or holistic, scene understanding involves reasoning about objects, regions, and the 3D relationships between them. This requires a representation above the level of pixels that can be endowed with high-level attributes such as class of object/region, its orientation, and (rough 3D) location within the scene. Towards this goal, we propose a region-based model which combines appearance and scene geometry to automatically decompose a scene into semantically meaningful regions. Our model is defined in terms of a unified energy function over scene appearance and structure. We show how this energy function can be learned from data and present an efficient inference technique that makes use of multiple…

Citation impact

701
total citations
FWCI
37.40
Percentile
100%
References
29
Citations per year

Authors

3

Topics & keywords

Keywords
  • Artificial intelligence
  • Computer science
  • Object (grammar)
  • Computer vision
  • Inference
  • Pixel
  • Representation (politics)
  • Segmentation
No related works found for this paper.