首页|期刊导航|现代信息科技|基于样本损失值变化统一性的后门样本隔离

基于样本损失值变化统一性的后门样本隔离

张家辉

现代信息科技2024，Vol.8Issue(11)：44-48,5.

现代信息科技2024，Vol.8Issue(11)：44-48,5.DOI:10.19850/j.cnki.2096-4706.2024.11.009

基于样本损失值变化统一性的后门样本隔离

Backdoor Sample Isolation Based on the Uniformity of Samples'Loss Value Changes

张家辉¹

作者信息

1. 西安电子科技大学网络与信息安全学院,陕西西安 710126
折叠

摘要

Abstract

Backdoor attacks pose a potential threat to applying AI applications.Unlearning-based robust training methods achieve training models with no backdoor on untrusted datasets by isolating a subset of backdoor samples and unlearning it.However,incorrectly isolating and unlearning clean samples can lead to performance degradation of the model on clean data.In order to reduce false isolation of clean samples thus protecting model performance on clean data,a backdoor sample isolation scheme based on the uniformity of samples'loss value changes is proposed.During the training process,samples have large and uniform changes in loss value,and samples with low uniformity of loss value changes in the isolated and potential backdoor samples set are removed.Experimental results indicate that the application of the scheme benefits reducing the false isolation of clean samples,and protects the model's performance on clean data without compromising on defending against backdoor attacks.

关键词

人工智能安全/后门防御/鲁棒训练/后门样本隔离/神经网络模型

Key words

Artificial Intelligence security/backdoor defense/robust training/backdoor sample isolation/neural network model

分类

信息技术与安全科学

引用本文复制引用

张家辉..基于样本损失值变化统一性的后门样本隔离[J].现代信息科技,2024,8(11):44-48,5.

现代信息科技

ISSN：2096-4706

访问量0

下载量0

段落导航