Improved basic elements detection algorithm for bridge engineering design drawings based on YOLOv5

Ning An; Linsheng Huang; Mengnan Hu; Junan Zhu; Chuanjian Wang

doi:10.1017/S089006042400026X

Published online by Cambridge University Press: 16 December 2024

Affiliations:
Ning An: School of Electronic Information Engineering, Anhui University, Hefei, Anhui, China
Linsheng Huang: School of Internet, Anhui University, Hefei, Anhui, China
Mengnan Hu: School of Internet, Anhui University, Hefei, Anhui, China
Junan Zhu*: School of Internet, Anhui University, Hefei, Anhui, China
Chuanjian Wang*: School of Internet, Anhui University, Hefei, Anhui, China
*Corresponding authors: Junan Zhu and Chuanjian Wang; Emails: [email protected]; [email protected]

Abstract

The basic elements of bridge engineering design drawings carry a large amount of important information, such as structural dimensions and material indexes, and detecting them is the basis for digitizing drawings. To address the low accuracy of existing methods for detecting these basic elements, an improved detection algorithm for bridge engineering design drawings based on YOLOv5 is proposed. First, coordinate attention is introduced into the feature extraction network to strengthen the algorithm's feature extraction capability and to alleviate the difficulty of recognizing texture features in grayscale images. Second, to handle targets at different scales, the standard 3 × 3 convolution in the feature pyramid network is replaced with switchable atrous convolution, which adaptively selects the atrous rate for the convolution computation and thereby enlarges the receptive field. Finally, experiments on a dataset for basic element detection in bridge engineering design drawings show that, at an Intersection over Union threshold of 0.5, the proposed algorithm achieves a mean average precision of 93.6%, which is 3.4% higher than the original YOLOv5 algorithm and satisfies the accuracy requirement for detecting basic elements in bridge engineering design drawings.
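
The reported mean average precision depends on the Intersection over Union (IoU) criterion that decides whether a predicted box counts as a correct detection. The following is a minimal sketch of that criterion, not the authors' evaluation code. The box layout `(x_min, y_min, x_max, y_max)`, the function names `iou` and `is_true_positive`, and the use of `>=` at the 0.5 threshold are assumptions made for illustration.

```python
from typing import Tuple

# Hypothetical box layout: (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Return the Intersection over Union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle, if the boxes overlap at all.
    ix_min = max(a[0], b[0])
    iy_min = max(a[1], b[1])
    ix_max = min(a[2], b[2])
    iy_max = min(a[3], b[3])

    # Clamp to zero so that disjoint boxes give an empty intersection.
    inter_w = max(0.0, ix_max - ix_min)
    inter_h = max(0.0, iy_max - iy_min)
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter

    # Guard against degenerate (zero-area) boxes.
    return inter / union if union > 0 else 0.0


def is_true_positive(pred: Box, truth: Box, threshold: float = 0.5) -> bool:
    """Assumed matching rule: a prediction is correct if IoU >= threshold."""
    return iou(pred, truth) >= threshold
```

For example, a predicted box `(0, 0, 4, 2)` against a ground-truth box `(0, 0, 4, 4)` overlaps on half of the union, so under the assumed `>=` rule it sits exactly on the 0.5 boundary and is counted as a match.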

Keywords

bridge engineering design drawings; basic elements detection; improved YOLOv5; attention mechanism; atrous convolution

Type: Research Article
Information: AI EDAM, Volume 38, 2024, e22

DOI: https://doi.org/10.1017/S089006042400026X
Copyright: © The Author(s), 2024. Published by Cambridge University Press
