Research on method and architecture for defense assessment of artificial intelligence backdoors

Original title: 人工智能后门防御评估方法及其架构研究 (OA; CSTPCD)

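Abstract (translated from the Chinese)

To address the risk of backdoor attacks that artificial intelligence systems may face, researchers have developed a range of backdoor defense strategies. Because existing defense methods are evaluated against diverse criteria, comparing them across methods is a major challenge, so a unified evaluation framework for artificial intelligence backdoor defense is proposed. The framework aims to provide a common evaluation standard for defense strategies at different levels, including the dataset level and the model level. At the dataset level, the effectiveness of backdoor detection is assessed mainly by accuracy; at the model level, the focus is mainly on metrics such as attack success rate. Under a single evaluation standard, the unified framework makes it possible to compare and analyze the performance of different backdoor defense methods…

Abstract (English)

In response to the potential risk of backdoor attacks faced by artificial intelligence systems, a range of backdoor defense strategies have been developed. The diversity of the evaluation criteria for existing defense methods makes cross-method comparisons a significant challenge. Hence, a unified evaluation framework for artificial intelligence backdoor defense was proposed. This framework aimed to provide a common standard for evaluating different levels of defense strat…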
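The abstract names two kinds of metric: accuracy for dataset-level backdoor detection and attack success rate for model-level defenses. The sketch below shows how these two quantities are commonly computed. It is a minimal illustration under assumed conventions, not the paper's own definitions: the function names, the boolean poisoned/flagged encoding, and the choice to exclude samples whose true label already equals the target label are all assumptions made here.

```python
from typing import Sequence


def detection_accuracy(is_poisoned: Sequence[bool], flagged: Sequence[bool]) -> float:
    """Fraction of samples whose poisoned/clean status the detector got right.

    Assumed convention: one boolean per sample, True meaning "poisoned".
    """
    if len(is_poisoned) != len(flagged):
        raise ValueError("ground truth and detector output differ in length")
    if len(is_poisoned) == 0:
        raise ValueError("no samples to evaluate")
    correct = sum(truth == guess for truth, guess in zip(is_poisoned, flagged))
    return correct / len(is_poisoned)


def attack_success_rate(
    predictions: Sequence[int],
    true_labels: Sequence[int],
    target_label: int,
) -> float:
    """Share of triggered inputs that the model assigns to the attacker's target.

    Assumed convention: inputs whose true label already equals the target
    are excluded, since predicting the target for them is not a success
    of the attack.
    """
    if len(predictions) != len(true_labels):
        raise ValueError("predictions and labels differ in length")
    eligible = [p for p, y in zip(predictions, true_labels) if y != target_label]
    if not eligible:
        raise ValueError("no triggered samples with a non-target true label")
    return sum(p == target_label for p in eligible) / len(eligible)


# Hypothetical toy data, for illustration only.
acc = detection_accuracy(
    is_poisoned=[True, False, True, False],
    flagged=[True, False, False, False],
)
asr = attack_success_rate(
    predictions=[0, 0, 1, 0, 2],
    true_labels=[1, 2, 1, 0, 2],
    target_label=0,
)
print(f"detection accuracy = {acc:.2f}, attack success rate = {asr:.2f}")
```

A lower attack success rate after a model-level defense is applied indicates a more effective defense, while a higher detection accuracy indicates a better dataset-level detector; reporting both on the same data is one way to put different defenses on a common footing.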

Authors: Xie Tian (谢天); Li Qiang (李强); Ju Zhuoya (鞠卓亚); Han Jiaqi (韩嘉祺); Yi Ping (易平)

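Affiliations (in author order): School of Cyberspace Security, Shanghai Jiao Tong University, Shanghai 200240; Science and Technology Innovation Center, Unit 32178, Beijing 100012; Science and Technology Innovation Center, Unit 32178, Beijing 100012; Science and Technology Innovation Center, Unit 32178, Beijing 100012; School of Cyberspace Security, Shanghai Jiao Tong University, Shanghai 200240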

Category: Computer science and automation

Keywords: artificial intelligence security; backdoor attack; backdoor defense; unified evaluation

Journal: Chinese Journal of Intelligent Science and Technology (《智能科学与技术学报》), 2024 (3)

Pages: 381-393 (13 pages)

Funding: The National Natural Science Foundation of China (No.62202290)

DOI: 10.11959/j.issn.2096-6652.202430
