To read this content please select one of the options below:

A transformer-based deep learning method for automatic pixel-level crack detection and feature quantification

Ankang Ji (The Hong Kong Polytechnic University, Hong Kong, China)
Xiaolong Xue (School of Management, Guangzhou University, Guangzhou, China)
Limao Zhang (National Center of Technology Innovation for Digital Construction, School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, China)
Xiaowei Luo (Department of Architecture and Civil Engineering, City University of Hong Kong, Hong Kong, China)
Qingpeng Man (Harbin Institute of Technology, Harbin, China)

Engineering, Construction and Architectural Management

ISSN: 0969-9988

Article publication date: 28 December 2023

139

Abstract

Purpose

Crack detection of pavement is a critical task in the periodic survey. Efficient, effective and consistent tracking of the road conditions by identifying and locating crack contributes to establishing an appropriate road maintenance and repair strategy from the promptly informed managers but still remaining a significant challenge. This research seeks to propose practical solutions for targeting the automatic crack detection from images with efficient productivity and cost-effectiveness, thereby improving the pavement performance.

Design/methodology/approach

This research applies a novel deep learning method named TransUnet for crack detection, which is structured based on Transformer, combined with convolutional neural networks as encoder by leveraging a global self-attention mechanism to better extract features for enhancing automatic identification. Afterward, the detected cracks are used to quantify morphological features from five indicators, such as length, mean width, maximum width, area and ratio. Those analyses can provide valuable information for engineers to assess the pavement condition with efficient productivity.

Findings

In the training process, the TransUnet is fed by a crack dataset generated by the data augmentation with a resolution of 224 × 224 pixels. Subsequently, a test set containing 80 new images is used for crack detection task based on the best selected TransUnet with a learning rate of 0.01 and a batch size of 1, achieving an accuracy of 0.8927, a precision of 0.8813, a recall of 0.8904, an F1-measure and dice of 0.8813, and a Mean Intersection over Union of 0.8082, respectively. Comparisons with several state-of-the-art methods indicate that the developed approach in this research outperforms with greater efficiency and higher reliability.

Originality/value

The developed approach combines TransUnet with an integrated quantification algorithm for crack detection and quantification, performing excellently in terms of comparisons and evaluation metrics, which can provide solutions with potentially serving as the basis for an automated, cost-effective pavement condition assessment scheme.

Keywords

Acknowledgements

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in the manuscript. This research is supported by the National Natural Science Foundation of China (NSFC) (No. 72301233) and the National Social Science Fund of China (No. 18ZDA043).

Since submission of this article, the following author(s) have updated their affiliation(s): Ankang Ji is at Shenzhen Research Institute, The Hong Kong Polytechnic University, Shenzhen, China.

Citation

Ji, A., Xue, X., Zhang, L., Luo, X. and Man, Q. (2023), "A transformer-based deep learning method for automatic pixel-level crack detection and feature quantification", Engineering, Construction and Architectural Management, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/ECAM-06-2023-0613

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited

Related articles