Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement

Aline Calheiros Espindola; Mujib Rahman; Senthan Mathavan; Ernesto Ferreira Nobre Júnior

doi:10.1177/03611981221127273

Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement

Aline Calheiros Espindola, Mujib Rahman, Senthan Mathavan, Ernesto Ferreira Nobre Júnior

Research output: Contribution to journal › Article › peer-review

Abstract

Distress measurement is essential in pavement management. Image-based distress identification is increasingly becoming an integral part of traffic speed network-level road condition surveys. This allows an aggregated summary of road conditions over the whole network, so it does not require an exact distress location within the lane. In this context, multi-label classification (MLC), based on convolutional neural networks (CNN), is proposed as a potential solution for distress identification from a network-level right-of-way (ROW) video survey. MLC has the advantage of low computing resource consumption, as it is implemented from lightweight classification networks. In this work, the developed MLC models used three different CNN architectures (VGG16, ResNet-34, and ResNet-50) to detect potholes, cracks, patches, and bleeding. The best model obtained 97% average accuracy with an F1-score of 93% in distress identification despite the variability in imaging hardware. This makes it possible to generalize the classification algorithm, allowing versatile applications and incorporating it into network-level pavement management systems. This model has good potential for fast and accurate distress identification from a video survey, avoiding the need for various types of expensive sensors like laser scanners.

Original language	English
Pages (from-to)	24-39
Number of pages	16
Journal	Transportation Research Record: Journal of the Transportation Research Board
Volume	2677
Issue number	5
Early online date	28 Oct 2022
DOIs	https://doi.org/10.1177/03611981221127273
Publication status	Published - May 2023

Bibliographical note

Funding Information:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brazil (CAPES) under Grant [Finance Code 001].

Keywords

Mechanical Engineering
Civil and Structural Engineering

Access to Document

10.1177/03611981221127273

Cite this

@article{e4df77e4aabe4201a01c9678c7fb430f,

title = "Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement",

abstract = "Distress measurement is essential in pavement management. Image-based distress identification is increasingly becoming an integral part of traffic speed network-level road condition surveys. This allows an aggregated summary of road conditions over the whole network, so it does not require an exact distress location within the lane. In this context, multi-label classification (MLC), based on convolutional neural networks (CNN), is proposed as a potential solution for distress identification from a network-level right-of-way (ROW) video survey. MLC has the advantage of low computing resource consumption, as it is implemented from lightweight classification networks. In this work, the developed MLC models used three different CNN architectures (VGG16, ResNet-34, and ResNet-50) to detect potholes, cracks, patches, and bleeding. The best model obtained 97% average accuracy with an F1-score of 93% in distress identification despite the variability in imaging hardware. This makes it possible to generalize the classification algorithm, allowing versatile applications and incorporating it into network-level pavement management systems. This model has good potential for fast and accurate distress identification from a video survey, avoiding the need for various types of expensive sensors like laser scanners.",

keywords = "Mechanical Engineering, Civil and Structural Engineering",

author = "Espindola, {Aline Calheiros} and Mujib Rahman and Senthan Mathavan and J{\'u}nior, {Ernesto Ferreira Nobre}",

note = "Funding Information: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Coordena{\c c}{\~a}o de Aperfei{\c c}oamento de Pessoal de N{\'i}vel Superior-Brazil (CAPES) under Grant [Finance Code 001]. ",

year = "2023",

month = may,

doi = "10.1177/03611981221127273",

language = "English",

volume = "2677",

pages = "24--39",

number = "5",

}

Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement. / Espindola, Aline Calheiros; Rahman, Mujib; Mathavan, Senthan et al.
In: Transportation Research Record: Journal of the Transportation Research Board, Vol. 2677, No. 5, 05.2023, p. 24-39.

Research output: Contribution to journal › Article › peer-review

TY - JOUR

T1 - Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement

AU - Espindola, Aline Calheiros

AU - Rahman, Mujib

AU - Mathavan, Senthan

AU - Júnior, Ernesto Ferreira Nobre

N1 - Funding Information: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brazil (CAPES) under Grant [Finance Code 001].

PY - 2023/5

Y1 - 2023/5

N2 - Distress measurement is essential in pavement management. Image-based distress identification is increasingly becoming an integral part of traffic speed network-level road condition surveys. This allows an aggregated summary of road conditions over the whole network, so it does not require an exact distress location within the lane. In this context, multi-label classification (MLC), based on convolutional neural networks (CNN), is proposed as a potential solution for distress identification from a network-level right-of-way (ROW) video survey. MLC has the advantage of low computing resource consumption, as it is implemented from lightweight classification networks. In this work, the developed MLC models used three different CNN architectures (VGG16, ResNet-34, and ResNet-50) to detect potholes, cracks, patches, and bleeding. The best model obtained 97% average accuracy with an F1-score of 93% in distress identification despite the variability in imaging hardware. This makes it possible to generalize the classification algorithm, allowing versatile applications and incorporating it into network-level pavement management systems. This model has good potential for fast and accurate distress identification from a video survey, avoiding the need for various types of expensive sensors like laser scanners.

AB - Distress measurement is essential in pavement management. Image-based distress identification is increasingly becoming an integral part of traffic speed network-level road condition surveys. This allows an aggregated summary of road conditions over the whole network, so it does not require an exact distress location within the lane. In this context, multi-label classification (MLC), based on convolutional neural networks (CNN), is proposed as a potential solution for distress identification from a network-level right-of-way (ROW) video survey. MLC has the advantage of low computing resource consumption, as it is implemented from lightweight classification networks. In this work, the developed MLC models used three different CNN architectures (VGG16, ResNet-34, and ResNet-50) to detect potholes, cracks, patches, and bleeding. The best model obtained 97% average accuracy with an F1-score of 93% in distress identification despite the variability in imaging hardware. This makes it possible to generalize the classification algorithm, allowing versatile applications and incorporating it into network-level pavement management systems. This model has good potential for fast and accurate distress identification from a video survey, avoiding the need for various types of expensive sensors like laser scanners.

KW - Mechanical Engineering

KW - Civil and Structural Engineering

UR - https://journals.sagepub.com/doi/pdf/10.1177/03611981221127273

U2 - 10.1177/03611981221127273

DO - 10.1177/03611981221127273

M3 - Article

VL - 2677

SP - 24

EP - 39

JO - Transportation Research Record: Journal of the Transportation Research Board

JF - Transportation Research Record: Journal of the Transportation Research Board

IS - 5

ER -

Comparing Different Deep Learning Architectures as Vision-Based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Comparing Different Deep Learning Architectures as Vision-based Multi-Label Classifiers for Identification of Multiple Distresses on Asphalt Pavement

Cite this