| 注册
首页|期刊导航|计算机技术与发展|基于自适应多尺度融合的RGB-D岩画图像分割模型

基于自适应多尺度融合的RGB-D岩画图像分割模型

白川平 职昕 张芳琴 王治学 周明全

计算机技术与发展2025,Vol.35Issue(5):152-157,6.
计算机技术与发展2025,Vol.35Issue(5):152-157,6.DOI:10.20165/j.cnki.ISSN1673-629X.2025.0126

基于自适应多尺度融合的RGB-D岩画图像分割模型

RGB-D Petroglyph Image Segmentation Model Based on Adaptive Multi-scale Feature Fusion

白川平 1职昕 2张芳琴 3王治学 3周明全2

作者信息

  • 1. 西北大学信息科学与技术学院,陕西西安 710127||宁夏师范大学数学与计算机科学学院,宁夏固原 756099||宁夏师范大学人工智能与智慧医疗工程技术研究中心,宁夏固原 756099
  • 2. 西北大学信息科学与技术学院,陕西西安 710127||西北大学文化遗产数字化国家地方联合工程研究中心,陕西 西安 710127
  • 3. 宁夏师范大学数学与计算机科学学院,宁夏固原 756099
  • 折叠

摘要

Abstract

Rock arts are significant cultural heritage for understanding ancient human societies,cultures,religions,and environments.To accurately segment complex and multi-scale patterns and symbols in rock arts,we employ depth images to compensate for the missing spatial geometric information for RGB images.We propose an adaptive multi-scale fusion RGB-D(RGB-Depth)segmentation model,termed the Adaptive Multi-scale Fusion Network(AMFNet).The design of an adaptive multi-scale feature fusion network integrates depth spatial information with 2D image texture information to fully exploit complementary information and enhance the model's segmentation performance.This model first employs large convolutional kernels to expand the receptive field,then decomposes the conv-olutional kernels.Subsequently,it utilizes a dynamic spatial selection mechanism to select feature maps corresponding to convolutional kernels of different scales,adaptively fusing multi-scale features to enhance the spatial feature expression ability of targets of different scales in rock arts.Experimental results demonstrate that the proposed model achieves higher mean intersection over union(mIoU)and pixel accuracy(PA)on the 3D-Pitoti rock art dataset compared to the other three methods,outperforming the state-of-the-art BEGL+UNet approach by 5.3 percentage points and 3.0 percentage points in mIoU and PA,respectively,thereby validating its effectiveness.At the same time,it has been verified that the spatial geometric information of depth image provides complementary information for the seg-mentation model in the field of rock art image segmentation which the foreground and background are extremely similar.

关键词

岩画图像分割/多模态/神经网络/深度学习/数据融合/多尺度/注意力机制

Key words

petroglyph image segmentation/multimodality/neural networks/deep learning/data fusion/multi-scale/attention mechanism

分类

信息技术与安全科学

引用本文复制引用

白川平,职昕,张芳琴,王治学,周明全..基于自适应多尺度融合的RGB-D岩画图像分割模型[J].计算机技术与发展,2025,35(5):152-157,6.

基金项目

国家重点研发计划(2020YFC1523301,2020YFC1523303) (2020YFC1523301,2020YFC1523303)

国家自然科学基金(62271393,62262054) (62271393,62262054)

计算机技术与发展

1673-629X

访问量8
|
下载量0
段落导航相关论文