| 注册
首页|期刊导航|山西大学学报(自然科学版)|一种基于多尺度内容感知的图像篡改定位方法

一种基于多尺度内容感知的图像篡改定位方法

张雷 王宝丽 陆晓栋 闫成梁 常敏慧

山西大学学报(自然科学版)2026,Vol.49Issue(2):209-219,11.
山西大学学报(自然科学版)2026,Vol.49Issue(2):209-219,11.DOI:10.13451/j.sxu.ns.2025093

一种基于多尺度内容感知的图像篡改定位方法

A Multi-scale Content-aware Localization Method for Image Manipulation

张雷 1王宝丽 1陆晓栋 1闫成梁 2常敏慧1

作者信息

  • 1. 运城学院 数学与信息技术学院,山西 运城 044000
  • 2. 太原师范学院 计算机科学与技术学院,山西 晋中 030619
  • 折叠

摘要

Abstract

With the rapid advancement of image editing technologies,image manipulation localization is facing increasingly complex challenges.Existing methods exhibit limitations in capturing multi-scale contextual information of manipulated regions,and they of-ten rely on fixed sampling kernels during the multi-scale feature fusion process,which makes it difficult to focus on local variations of features,thereby restricting the localization accuracy.To address these issues,we propose an image manipulation localization method based on multi-scale content-aware.Firstly,a feature fusion module based on multi-scale content-aware is designed,which can dynamically generate adaptive sampling kernels for each position in the feature map,enabling the model to locate the approxi-mate range of manipulated regions at a coarse-grained level and identify manipulated edge features at a fine-grained level.Secondly,a depthwise separable convolutional decoder is employed to replace the traditional multilayer perceptron for prediction,further en-hancing detection accuracy.Finally,a joint loss function combining binary cross-entropy loss and Dice loss is proposed,effectively improving the model's robustness and generalization capabilities.Cross-dataset experimental results on multiple public datasets dem-onstrate that,in terms of Pixel-level F1 score,the proposed method achieves 69.4%,25.4%,37.8%,and 83.2%on the CASIAv1,De-facto-12k,Coverage,and Columbia datasets,respectively.Compared to the mainstream MVSS-Net++and the latest IML-ViT,it achieves average improvements of 10.2%and 2.1%,respectively,significantly enhancing the accuracy of image tampering localiza-tion.

关键词

特征融合/深度可分离卷积/Vision Transformer/联合损失

Key words

feature fusion/depthwise separable convolution/Vision Transformer/joint loss

分类

信息技术与安全科学

引用本文复制引用

张雷,王宝丽,陆晓栋,闫成梁,常敏慧..一种基于多尺度内容感知的图像篡改定位方法[J].山西大学学报(自然科学版),2026,49(2):209-219,11.

基金项目

国家自然科学基金(61703363) (61703363)

山西省基础研究计划项目(202403021221206) (202403021221206)

数据挖掘与工业智能应用科研创新团队资助项目(YCXYTD-202402) (YCXYTD-202402)

运城学院科研项目(YQ-2020021) (YQ-2020021)

山西大学学报(自然科学版)

0253-2395

访问量0
|
下载量0
段落导航相关论文