
EHDC: enhanced dilated convolution framework for underwater blurred target recognition

Published online by Cambridge University Press: 26 July 2022

Lei Cai*
Affiliation:
School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, China
Xiaochen Qin
Affiliation:
School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China
Tao Xu
Affiliation:
School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, China
*Corresponding author. E-mail: [email protected]

Abstract

Autonomous underwater vehicles (AUVs) suffer from feature loss when recognizing small underwater targets. Existing algorithms usually address this problem with multi-scale feature extraction, which increases computational cost. In addition, low light and turbid water leave the target features incomplete. This paper proposes an enhanced dilated convolution framework (EHDC) for underwater blurred target recognition. First, small-target features are extracted through hybrid dilated convolution networks, which enlarge the receptive field without increasing the computational cost. Second, the algorithm learns spatial semantic features through an adaptive correlation matrix and uses them to compensate for the missing target features. Finally, the spatial semantic features and the visual features are fused to recognize small, blurred underwater targets. Experiments show that the proposed method improves recognition accuracy by 1.04% over existing methods on small, blurred underwater targets.
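The receptive-field claim in the abstract can be checked with simple arithmetic. The sketch below is illustrative only and is not the authors' implementation: it assumes three stacked 3x3, stride-1 convolution layers and the dilation rates (1, 2, 5), which are a common choice in the hybrid dilated convolution literature rather than values stated in the abstract. The helper names are hypothetical.

```python
from typing import Sequence


def effective_kernel(kernel: int, dilation: int) -> int:
    """Span (in pixels, along one axis) covered by a single dilated kernel."""
    return kernel + (kernel - 1) * (dilation - 1)


def receptive_field(kernel: int, dilations: Sequence[int]) -> int:
    """Receptive field along one axis of stacked stride-1 convolutions."""
    rf = 1
    for d in dilations:
        # Each stride-1 layer widens the field by (effective kernel - 1).
        rf += effective_kernel(kernel, d) - 1
    return rf


def weights_per_channel_pair(kernel: int, num_layers: int) -> int:
    """Weights per input/output channel pair; dilation does not change this."""
    return num_layers * kernel * kernel


if __name__ == "__main__":
    plain_rates = [1, 1, 1]    # ordinary 3x3 convolutions
    hybrid_rates = [1, 2, 5]   # assumed hybrid dilation rates (not from the paper)

    print("plain receptive field: ", receptive_field(3, plain_rates))
    print("hybrid receptive field:", receptive_field(3, hybrid_rates))
    print("weights in either stack:", weights_per_channel_pair(3, 3))
```

Under these assumptions both stacks use the same number of weights, while the dilated stack covers a wider window per axis, which is the sense in which dilation enlarges the receptive field at no extra parameter cost.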

Type
Research Article
Copyright
© The Author(s), 2022. Published by Cambridge University Press

