Research on Correctness Testing Techniques for Artificial Intelligence Models

肖旭明, 高蕾, 曾彦淞, 门永平

空间电子技术 (Space Electronic Technology), 2025, Vol. 22, Issue (z1): 148-155, 8.

DOI: 10.3969/j.issn.1674-7135.2025.S1.013

Research on correctness testing techniques for artificial intelligence models

肖旭明¹, 高蕾¹, 曾彦淞¹, 门永平¹

Author information

1. Xi'an Branch, China Academy of Space Technology (中国空间技术研究院西安分院), Xi'an 710000, China
Abstract

In recent years, natural language processing models and image recognition models have been widely deployed in military equipment. However, the quality and performance of artificial intelligence (AI) models vary widely, which makes effective evaluation and testing difficult, and testing methods tailored to military AI models are urgently needed for correctness evaluation. AI models are trained on massive raw datasets and then output predictions from the trained model. Because these algorithms behave as "black boxes" and raw datasets may suffer from problems such as uneven data distribution, prediction errors occur frequently and can lead to safety incidents that endanger lives and property. Measuring the correctness of AI models has therefore become a critical research topic. This paper approaches the problem from two dimensions: model data and model algorithms. For model data, a data quality evaluation scheme based on clustering algorithms is proposed to identify the raw data that significantly influences model decisions; the quality of the dataset is then improved by modifying it and removing its redundant portions. For model algorithms, a correctness verification scheme based on fuzz testing theory is introduced: minor perturbations are applied to the original inputs to generate mutated inputs, and the correctness of the model algorithm is verified by detecting misclassifications of these mutated inputs. In addition, to assess testing adequacy, a neuron coverage-guided testing method is proposed. This method maximizes coverage of the neuron state space in order to locate misjudgment points in the model algorithm, thereby improving the correctness of AI models. The proposed techniques aim to provide systematic solutions for evaluating and enhancing the reliability of AI models in mission-critical military applications.
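The fuzz testing and neuron coverage ideas summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy two-layer network, its weights, the activation threshold, the perturbation budget, and all function names (`forward`, `mutate`, `fuzz`) are assumptions chosen only for illustration. The sketch perturbs seed inputs within a small budget, treats a changed predicted label as a suspected misclassification, and keeps mutants that activate previously inactive hidden neurons so that later mutations can build on them.

```python
import random

# Hypothetical toy model (an assumption, not from the paper):
# 2 inputs -> 3 ReLU hidden neurons -> 2 output scores.
W1 = [[1.0, -1.0], [-1.0, 1.0], [0.5, 0.5]]
B1 = [0.0, 0.0, -0.5]
W2 = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]
B2 = [0.0, 0.0]


def forward(x):
    """Return (hidden activations, predicted label) for input vector x."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, B1)]
    scores = [sum(w * h for w, h in zip(row, hidden)) + b
              for row, b in zip(W2, B2)]
    label = max(range(len(scores)), key=scores.__getitem__)
    return hidden, label


def active_neurons(hidden, threshold=0.0):
    """Indices of hidden neurons whose activation exceeds the threshold."""
    return {i for i, h in enumerate(hidden) if h > threshold}


def mutate(current, origin, budget, rng):
    """Perturb each component, clipped to stay within `budget` of the origin seed."""
    mutant = []
    for c, o in zip(current, origin):
        value = c + rng.uniform(-budget, budget)
        mutant.append(min(max(value, o - budget), o + budget))
    return mutant


def fuzz(seeds, budget=0.2, iterations=200, rng_seed=0):
    """Coverage-guided fuzzing loop; returns (suspected failures, neuron coverage)."""
    rng = random.Random(rng_seed)
    covered, corpus, failures = set(), [], []
    for seed in seeds:
        hidden, label = forward(seed)
        covered |= active_neurons(hidden)
        corpus.append((seed, seed, label))  # (origin seed, current input, origin label)
    for _ in range(iterations):
        origin, current, expected = rng.choice(corpus)
        mutant = mutate(current, origin, budget, rng)
        hidden, got = forward(mutant)
        newly_covered = active_neurons(hidden) - covered
        covered |= newly_covered
        if got != expected:
            # A small perturbation changed the label: record a suspected misjudgment.
            failures.append((origin, mutant, expected, got))
        elif newly_covered:
            # The mutant reached new neurons: keep it as a basis for further mutation.
            corpus.append((origin, mutant, expected))
    coverage = len(covered) / len(B1)  # fraction of hidden neurons activated at least once
    return failures, coverage


failures, coverage = fuzz([[0.55, 0.45], [0.3, 0.1]])
print(f"neuron coverage: {coverage:.2f}, label flips found: {len(failures)}")
```

The oracle used here (the label of a slightly perturbed input should match the label of its seed) is one common choice for fuzzing classifiers. Whether a given label flip is a genuine error depends on the perturbation budget and the task, so in practice the flagged inputs would still need review.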

Key words

artificial intelligence model testing (人工智能模型测试) / model correctness testing (模型正确性测试) / clustering algorithms (聚类算法) / fuzz testing (模糊测试)

Cite this article

肖旭明, 高蕾, 曾彦淞, 门永平. 人工智能模型正确性测试技术研究[J]. 空间电子技术, 2025, 22(z1): 148-155, 8.

空间电子技术 (Space Electronic Technology), ISSN: 1674-7135
