PaveCap: A Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation
Indexed incrossref
Abstract
This research introduces a multimodal framework for automated pavement condition assessment that provides pavement condition index (PCI) predictions and qualitative descriptions using a single-shot PCI estimation network and a fine-grained dense captioning network. The PCI estimation network uses YOLOv8, the segment anything model, and a four-layer convolutional neural network for PCI prediction. The dense captioning network uses a YOLOv8 backbone, a transformer architecture, and a convolutional feed-forward module to generate textual descriptions. To train and evaluate these networks, we developed a pavement dataset containing bounding box annotations, textual descriptions, and PCI values. The PCI estimation…
Citation impact
4
total citations
- FWCI
- 12.16
- Percentile
- 98%
- References
- 31
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Closed captioning
- Conventional PCI
- Convolutional neural network
- Bounding overwatch
- Estimation
- Metric (unit)
UN Sustainable Development Goals
- Industry, innovation and infrastructure
No related works found for this paper.