YOLOv8-CDD: A Salient Target Detection Model for Underwater Cultural Heritage in Complex Environments

Tiancheng Liu

doi:10.6919/ICJE.202505_11(5).0040

Authors

Tiancheng Liu

DOI:

https://doi.org/10.6919/ICJE.202505_11(5).0040

Keywords:

Underwater Object Detection; Cultural Heritage; Salient Region Recognition.

Abstract

Existing object detection methods often experience a decline in accuracy when directly applied to underwater scenarios, making it difficult to meet the precision and reliability requirements of practical cultural heritage detection tasks. To address common challenges in underwater heritage detection-such as low image contrast, severe target occlusion, and insufficient detection accuracy-this paper proposes a novel detection model, YOLOv8-CDD. The model is designed to improve the recognition of salient target regions in complex underwater conditions, thereby enhancing the automation and efficiency of archaeological detection. YOLOv8-CDD is developed by deeply optimizing the backbone and detection head of the original YOLOv8 architecture. It integrates a C2f-DCNv4 structure, the CBAM attention mechanism, and the Dynamic4 detection head to significantly enhance feature extraction and the perception of small or occluded objects, while maintaining high computational efficiency. The model is systematically evaluated on both the public DUO dataset and a self-constructed underwater archaeological image dataset. Experimental results show that YOLOv8-CDD achieves an mAP@0.5 of 84.9% and an mAP@0.5:0.95 of 63.8% on the DUO dataset, representing improvements of 4.0% and 3.4%, respectively, over the baseline. On the self-constructed dataset, it attains an mAP@0.5 of 70.4%, with a gain of 4.1%.

Downloads

Download data is not yet available.

References

[1] Fu C, Fan X, Xiao J, et al. Learning Heavily-Degraded Prior for Underwater Object Detection[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023, 33(11): 6887–6896.

[2] Dinakaran R, Zhang L, Li C T, et al. Robust and Fair Undersea Target Detection with Automated Underwater Vehicles for Biodiversity Data Collection[J]. Remote Sensing, 2022, 14(15): 3680.

[3] Chen L, Zhou F, Wang S, et al. SWIPENET: Object detection in noisy underwater scenes[J]. Pattern Recognition, 2022, 132: 108926.

[4] Chen L, Li T, Zhou A, et al. Underwater object detection in noisy imbalanced datasets[J]. Pattern Recognition, 2024, 155: 110649.

[5] Qi J, Gong Z, Xue W, et al. An Unmixing-Based Network for Underwater Target Detection From Hyperspectral Imagery[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 5470–5487.

[6] Wang B, Wang Z, Guo W, et al. A dual-branch joint learning network for underwater object detection[J]. Knowledge-Based Systems, 2024, 293: 111672.

[7] Zhang J, Zhang J, Zhou K, et al. An Improved YOLOv5-Based Underwater Object-Detection Framework[J]. Sensors, 2023, 23(7): 3693.

[8] Lei F, Tang F, Li S. Underwater Target Detection Algorithm Based on Improved YOLOv5[J]. Journal of Marine Science and Engineering, 2022, 10(3): 310.

[9] Cai S, Li G, Shan Y. Underwater object detection using collaborative weakly supervision[J]. Computers and Electrical Engineering, 2022, 102: 108159.

[10] Zhang F, Cao W, Gao J, et al. Underwater Object Detection Algorithm Based on an Improved YOLOv8[J]. Journal of Marine Science and Engineering, 2024, 12(11): 1991.

[11] Alsuwaylimi A A. Enhanced YOLOv8-Seg Instance Segmentation for Real-Time Submerged Debris Detection[J]. IEEE Access, 2024, 12: 117833–117849.

[12] Cai S, Zhang X, Mo Y. A Lightweight underwater detector enhanced by Attention mechanism, GSConv and WIoU on YOLOv8[J]. Scientific Reports, 2024, 14(1): 25797.

[13] Chen J, Er M J. Dynamic YOLO for small underwater object detection[J]. Artificial Intelligence Review, 2024, 57(7): 165.

[14] Xiong Y, Li Z, Chen Y, et al. Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications[A/OL]. arXiv, 2024[2025-03-14]. https://arxiv.org/abs/2401.06197.

[15] Woo S, Park J, Lee J Y, et al. CBAM: Convolutional Block Attention Module[M]// In: Ferrari V, Hebert M, Sminchisescu C, et al. Computer Vision – ECCV 2018. Cham: Springer, 2018, 11211: 3–19.

[16] Dai X, Chen Y, Xiao B, et al. Dynamic Head: Unifying Object Detection Heads with Attentions[A/OL]. arXiv, 2021[2025-03-17]. https://arxiv.org/abs/2106.08322.