计算机技术与发展2025,Vol.35Issue(5):152-157,6.DOI:10.20165/j.cnki.ISSN1673-629X.2025.0126
基于自适应多尺度融合的RGB-D岩画图像分割模型
RGB-D Petroglyph Image Segmentation Model Based on Adaptive Multi-scale Feature Fusion
摘要
Abstract
Rock arts are significant cultural heritage for understanding ancient human societies,cultures,religions,and environments.To accurately segment complex and multi-scale patterns and symbols in rock arts,we employ depth images to compensate for the missing spatial geometric information for RGB images.We propose an adaptive multi-scale fusion RGB-D(RGB-Depth)segmentation model,termed the Adaptive Multi-scale Fusion Network(AMFNet).The design of an adaptive multi-scale feature fusion network integrates depth spatial information with 2D image texture information to fully exploit complementary information and enhance the model's segmentation performance.This model first employs large convolutional kernels to expand the receptive field,then decomposes the conv-olutional kernels.Subsequently,it utilizes a dynamic spatial selection mechanism to select feature maps corresponding to convolutional kernels of different scales,adaptively fusing multi-scale features to enhance the spatial feature expression ability of targets of different scales in rock arts.Experimental results demonstrate that the proposed model achieves higher mean intersection over union(mIoU)and pixel accuracy(PA)on the 3D-Pitoti rock art dataset compared to the other three methods,outperforming the state-of-the-art BEGL+UNet approach by 5.3 percentage points and 3.0 percentage points in mIoU and PA,respectively,thereby validating its effectiveness.At the same time,it has been verified that the spatial geometric information of depth image provides complementary information for the seg-mentation model in the field of rock art image segmentation which the foreground and background are extremely similar.关键词
岩画图像分割/多模态/神经网络/深度学习/数据融合/多尺度/注意力机制Key words
petroglyph image segmentation/multimodality/neural networks/deep learning/data fusion/multi-scale/attention mechanism分类
信息技术与安全科学引用本文复制引用
白川平,职昕,张芳琴,王治学,周明全..基于自适应多尺度融合的RGB-D岩画图像分割模型[J].计算机技术与发展,2025,35(5):152-157,6.基金项目
国家重点研发计划(2020YFC1523301,2020YFC1523303) (2020YFC1523301,2020YFC1523303)
国家自然科学基金(62271393,62262054) (62271393,62262054)