articleScientific DataMar 12, 2025GOLD OA

CMAB: A Multi-Attribute Building Dataset of China

Tsinghua University

PubMed
Indexed incrossrefdoajpubmed

Abstract

Rapidly acquiring three-dimensional (3D) building data, including geometric attributes like rooftop, height and orientations, as well as indicative attributes like function, quality, and age, is essential for accurate urban analysis, simulations, and policy updates. Current building datasets suffer from incomplete coverage of building multi-attributes. This paper presents the first national-scale Multi-Attribute Building dataset (CMAB) with artificial intelligence, covering 3,667 spatial cities, 31 million buildings, and 23.6 billion m² of rooftops with an F1-Score of 89.93% in OCRNet-based extraction, totaling 363 billion m³ of building stock. We trained bootstrap aggregated XGBoost models with city…

Citation impact

56
total citations
FWCI
44.75
Percentile
100%
References
71
Citations per year

Authors

3

Topics & keywords

Keywords
  • China
  • Computer science
  • Geography
  • Information retrieval
  • Statistics
  • Mathematics
  • Archaeology
UN Sustainable Development Goals
  • Sustainable cities and communities
No related works found for this paper.

Funding