TY - JOUR
T1 - Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement
AU - Espindola, Aline Calheiros
AU - Rahman, Mujib
AU - Mathavan, Senthan
AU - Júnior, Ernesto Ferreira Nobre
N1 - Funding Information:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brazil (CAPES) under Grant [Finance Code 001].
PY - 2023/5
Y1 - 2023/5
N2 - Distress measurement is essential in pavement management. Image-based distress identification is increasingly becoming an integral part of traffic speed network-level road condition surveys. This allows an aggregated summary of road conditions over the whole network, so it does not require an exact distress location within the lane. In this context, multi-label classification (MLC), based on convolutional neural networks (CNN), is proposed as a potential solution for distress identification from a network-level right-of-way (ROW) video survey. MLC has the advantage of low computing resource consumption, as it is implemented from lightweight classification networks. In this work, the developed MLC models used three different CNN architectures (VGG16, ResNet-34, and ResNet-50) to detect potholes, cracks, patches, and bleeding. The best model obtained 97% average accuracy with an F1-score of 93% in distress identification despite the variability in imaging hardware. This makes it possible to generalize the classification algorithm, allowing versatile applications and incorporating it into network-level pavement management systems. This model has good potential for fast and accurate distress identification from a video survey, avoiding the need for various types of expensive sensors like laser scanners.
AB - Distress measurement is essential in pavement management. Image-based distress identification is increasingly becoming an integral part of traffic speed network-level road condition surveys. This allows an aggregated summary of road conditions over the whole network, so it does not require an exact distress location within the lane. In this context, multi-label classification (MLC), based on convolutional neural networks (CNN), is proposed as a potential solution for distress identification from a network-level right-of-way (ROW) video survey. MLC has the advantage of low computing resource consumption, as it is implemented from lightweight classification networks. In this work, the developed MLC models used three different CNN architectures (VGG16, ResNet-34, and ResNet-50) to detect potholes, cracks, patches, and bleeding. The best model obtained 97% average accuracy with an F1-score of 93% in distress identification despite the variability in imaging hardware. This makes it possible to generalize the classification algorithm, allowing versatile applications and incorporating it into network-level pavement management systems. This model has good potential for fast and accurate distress identification from a video survey, avoiding the need for various types of expensive sensors like laser scanners.
KW - Mechanical Engineering
KW - Civil and Structural Engineering
UR - https://journals.sagepub.com/doi/pdf/10.1177/03611981221127273
U2 - 10.1177/03611981221127273
DO - 10.1177/03611981221127273
M3 - Article
VL - 2677
SP - 24
EP - 39
JO - Transportation Research Record: Journal of the Transportation Research Board
JF - Transportation Research Record: Journal of the Transportation Research Board
IS - 5
ER -