首页|期刊导航|山西大学学报（自然科学版）|一种基于多尺度内容感知的图像篡改定位方法

一种基于多尺度内容感知的图像篡改定位方法

张雷王宝丽陆晓栋闫成梁常敏慧

山西大学学报（自然科学版）2026，Vol.49Issue(2)：209-219,11.

山西大学学报（自然科学版）2026，Vol.49Issue(2)：209-219,11.DOI:10.13451/j.sxu.ns.2025093

一种基于多尺度内容感知的图像篡改定位方法

A Multi-scale Content-aware Localization Method for Image Manipulation

张雷 ¹王宝丽 ¹陆晓栋 ¹闫成梁 ²常敏慧¹

作者信息

1. 运城学院数学与信息技术学院,山西运城 044000
2. 太原师范学院计算机科学与技术学院,山西晋中 030619
折叠

摘要

Abstract

With the rapid advancement of image editing technologies,image manipulation localization is facing increasingly complex challenges.Existing methods exhibit limitations in capturing multi-scale contextual information of manipulated regions,and they of-ten rely on fixed sampling kernels during the multi-scale feature fusion process,which makes it difficult to focus on local variations of features,thereby restricting the localization accuracy.To address these issues,we propose an image manipulation localization method based on multi-scale content-aware.Firstly,a feature fusion module based on multi-scale content-aware is designed,which can dynamically generate adaptive sampling kernels for each position in the feature map,enabling the model to locate the approxi-mate range of manipulated regions at a coarse-grained level and identify manipulated edge features at a fine-grained level.Secondly,a depthwise separable convolutional decoder is employed to replace the traditional multilayer perceptron for prediction,further en-hancing detection accuracy.Finally,a joint loss function combining binary cross-entropy loss and Dice loss is proposed,effectively improving the model's robustness and generalization capabilities.Cross-dataset experimental results on multiple public datasets dem-onstrate that,in terms of Pixel-level F1 score,the proposed method achieves 69.4％,25.4％,37.8％,and 83.2％on the CASIAv1,De-facto-12k,Coverage,and Columbia datasets,respectively.Compared to the mainstream MVSS-Net++and the latest IML-ViT,it achieves average improvements of 10.2％and 2.1％,respectively,significantly enhancing the accuracy of image tampering localiza-tion.

关键词

特征融合/深度可分离卷积/Vision Transformer/联合损失

Key words

feature fusion/depthwise separable convolution/Vision Transformer/joint loss

分类

信息技术与安全科学

引用本文复制引用

张雷,王宝丽,陆晓栋,闫成梁,常敏慧..一种基于多尺度内容感知的图像篡改定位方法[J].山西大学学报（自然科学版）,2026,49(2):209-219,11.

基金项目

国家自然科学基金(61703363) （61703363）

山西省基础研究计划项目(202403021221206) （202403021221206）

数据挖掘与工业智能应用科研创新团队资助项目(YCXYTD-202402) （YCXYTD-202402）

运城学院科研项目(YQ-2020021) （YQ-2020021）

山西大学学报（自然科学版）

ISSN：0253-2395

访问量0

下载量0

段落导航