articleJournal of Transportation Engineering Part B PavementsFeb 10, 2026Closed access

PaveCap: A Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation

Dakota State University

Indexed incrossref

Abstract

This research introduces a multimodal framework for automated pavement condition assessment that provides pavement condition index (PCI) predictions and qualitative descriptions using a single-shot PCI estimation network and a fine-grained dense captioning network. The PCI estimation network uses YOLOv8, the segment anything model, and a four-layer convolutional neural network for PCI prediction. The dense captioning network uses a YOLOv8 backbone, a transformer architecture, and a convolutional feed-forward module to generate textual descriptions. To train and evaluate these networks, we developed a pavement dataset containing bounding box annotations, textual descriptions, and PCI values. The PCI estimation…

Citation impact

4
total citations
FWCI
12.16
Percentile
98%
References
31
Citations per year

Authors

5

Topics & keywords

Keywords
  • Closed captioning
  • Conventional PCI
  • Convolutional neural network
  • Bounding overwatch
  • Estimation
  • Metric (unit)
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.