Hostname: page-component-669899f699-swprf Total loading time: 0 Render date: 2025-04-27T19:47:06.107Z Has data issue: false hasContentIssue false

Improved basic elements detection algorithm for bridge engineering design drawings based on YOLOv5

Published online by Cambridge University Press:  16 December 2024

Ning An
Affiliation:
School of Electronic Information Engineering, Anhui University, Hefei, Anhui, China
Linsheng Huang
Affiliation:
School of Internet, Anhui University, Hefei, Anhui, China
Mengnan Hu
Affiliation:
School of Internet, Anhui University, Hefei, Anhui, China
Junan Zhu*
Affiliation:
School of Internet, Anhui University, Hefei, Anhui, China
Chuanjian Wang*
Affiliation:
School of Internet, Anhui University, Hefei, Anhui, China
*
Corresponding authors: Junan Zhu and Chuanjian Wang; Emails: [email protected]; [email protected]
Corresponding authors: Junan Zhu and Chuanjian Wang; Emails: [email protected]; [email protected]

Abstract

Bridge engineering design drawings basic elements contain a large amount of important information such as structural dimensions and material indexes. Basic element detection is seen as the basis for digitizing drawings. Aiming at the problem of low detection accuracy of existing drawing basic elements, an improved basic elements detection algorithm for bridge engineering design drawings based on YOLOv5 is proposed. Firstly, coordinate attention is introduced into the feature extraction network to enhance the feature extraction capability of the algorithm and alleviate the problem of difficult recognition of texture features inside grayscale images. Then, targeting objectives across different scales, the standard 3 × 3 convolution in the feature pyramid network is replaced with switchable atrous convolution, and the atrous rate is adaptively selected for convolution computation to expand the sensory field. Finally, experiments are conducted on the bridge engineering design drawings basic elements detection dataset, and the experimental results show that when the Intersection over Union is 0.5, the proposed algorithm achieves a mean average precision of 93.6%, which is 3.4% higher compared to the original YOLOv5 algorithm, and it can satisfy the accuracy requirement of bridge engineering design drawings basic elements detection.

Type
Research Article
Copyright
© The Author(s), 2024. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Brock, A, Lim, T, Ritchie, JM and Weston, N. 2017. Convnet-based optical recognition for engineering drawings. In 37th Computers and Information in Engineering Conference, DETC2017-68186. American Society of Mechanical Engineers.Google Scholar
Dong, Z, Ji, X, Wang, X, Gu, Y, Wang, J and Qi, D. 2023a. Icncs: Internal cascaded neuromorphic computing system for fast electric vehicle state of charge estimation. IEEE Transactions on Consumer Electronics.CrossRefGoogle Scholar
Dong, Z, Zhao, Y and Tian, FF (2023b) Character recognition and detection algorithm for power grid engineering drawings based on improved convolutional neural network. Electronic Design Engineering 31(13), 2731Google Scholar
Elyan, E, Jamieson, L and Ali-Gombe, A (2020) Deep learning for symbols detection and classification in engineering drawings. Neural Networks 129, 91102CrossRefGoogle ScholarPubMed
Esteva, A, Kuprel, B, Novoa, RA, Ko, J, Swetter, SM, Blau, HM and Thrun, S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature 542(7639), 115118CrossRefGoogle ScholarPubMed
Fan, F and Guan, JH (2012) Engineering drawing string and labeling information extraction. Computer Engineering and Application 48(7), 161164Google Scholar
He, K, Zhang, X, Ren, S and Sun, J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 37(9), 19041916CrossRefGoogle ScholarPubMed
Hou, Q, Zhou, D and Feng, J. 2021. Coordinate attention for efficient mobile network design. In Proceedings of the Ieee/Cvf Conference on Computer Vision and Pattern Recognition, 1371313722.Google Scholar
Hu, J, Shen, L and Sun, G. 2018. Squeeze-and-excitation networks. In Proceedings of the Ieee Conference on Computer Vision and Pattern Recognition, 71327141.Google Scholar
Jaderberg, M, Simonyan, K, Zisserman, A, et al. 2015. Spatial transformer networks. Advances in Neural Information Processing Systems 28.Google Scholar
Jamieson, L, Moreno-Garcia, CF and Elyan, E. 2020. Deep learning for text detection and recognition in complex engineering diagrams. In 2020 International Joint Conference on Neural Networks (Ijcnn), 17. IEEE.Google Scholar
Ji, X, Dong, Z, Han, Y, Lai, CS and Qi, D. 2023. A brain-inspired hierarchical interactive in-memory computing system and its application in video sentiment analysis. IEEE Transactions on Circuits and Systems for Video Technology.CrossRefGoogle Scholar
Ji, X, Dong, Z, Lai, CS and Qi, D (2022) A braininspired in-memory computing system for neuronal communication via memristive circuits. IEEE Communications Magazine 60(1), 100106CrossRefGoogle Scholar
Lin, T-Y, Dollár, P, Girshick, R, He, K, Hariharan, B and Belongie, S. 2017. Feature pyramid networks for object detection. In Proceedings of the Ieee Conference on Computer Vision and Pattern Recognition, 21172125.Google Scholar
Liu, L, Chen, Y and Liu, X. 2019. Engineering drawing recognition model with convolutional neural network. Proceedings of the 2019 International Conference on Robotics, Intelligent Control and Artificial Intelligence (RICAI 2019), 112116.Google Scholar
Liu, S, Qi, L, Qin, H, Shi, J and Jia, J. 2018. Path aggregation network for instance segmentation. In Proceedings of the Ieee Conference on Computer Vision and Pattern Recognition, 87598768.Google Scholar
Nabipour, M, Nayyeri, P, Jabani, H, Mosavi, A, Salwana, E and Shahab, S (2020) Deep learning for stock market prediction. Entropy 22(8), 840CrossRefGoogle ScholarPubMed
Qiao, S, Chen, L-C and Yuille, A. 2021. Detectors: Detecting objects with recursive feature pyramid and switchable atrous convolution. In Proceedings of the Ieee/Cvf Conference on Computer Vision and Pattern Recognition, 1021310224.Google Scholar
Selvaraju, RR, Cogswell, M, Das, A, Vedantam, R, Parikh, D and Batra, D. 2017. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the Ieee International Conference on Computer Vision, 618626.Google Scholar
Son, H, Lee, J, Cho, S and Lee, S. 2021. Single image defocus deblurring using kernel-sharing parallel atrous convolutions. In Proceedings of the Ieee/Cvf International Conference on Computer Vision, 26422650.Google Scholar
Song, X, Li, YC and Liu, JF (2011) Topology-based engineering drawing recognition method. Journal of Shenyang University of Architecture (Natural Science Edition) 27(4), 6Google Scholar
Wang, C-Y, Mark Liao, H-Y, Wu, Y-H, Chen, P-Y, Hsieh, J-W and Yeh, I-H. 2020. Cspnet: A new backbone that can enhance learning capability of cnn. In Proceedings of the Ieee/Cvf Conference on Computer Vision and Pattern Recognition Workshops, 390391.Google Scholar
Wang, J, Chen, Y, Dong, Z and Gao, M (2023) Improved yolov5 network for real-time multi-scale traffic sign detection. Neural Computing and Applications 35(10), 78537865CrossRefGoogle Scholar
Woo, S, Park, J, Lee, J-Y and Kweon, I-S. 2018. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (Eccv), 319.Google Scholar
Yang, M, Zhao, Y and Deng, X (2022) Two-dimensional drawing recognition of duct planes based on improved cascade rcnn. Journal of Civil Engineering and Management 4, 39Google Scholar
Zhang, S, Yao, L, Sun, A and Tay, Y (2019) Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1), 138CrossRefGoogle Scholar
Zhao, K, Han, Q, Zhang, C-B, Xu, J and Cheng, M-M (2022) Deep hough transform for semantic line detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 47934806Google ScholarPubMed
Zhao, Y, Deng, X and Lai, H (2021) Reconstructing bim from 2d structural drawings for existing buildings. Automation in Construction 128, 103750CrossRefGoogle Scholar
Zheng, Z, Wang, P, Liu, W, Li, Y, Ye, R and Ren, D. 2020. Distance-iou loss: Faster and better learning for bounding box regression. In Proceedings of the Aaai Conference on Artificial Intelligence, 34:1299313000.CrossRefGoogle Scholar