首页|期刊导航|四川大学学报（自然科学版）|基于异质图属性增强的恶意软件变种检测方法

基于异质图属性增强的恶意软件变种检测方法

孙锦涛李祺李晓龙

四川大学学报（自然科学版）2024，Vol.61Issue(3)：15-29,15.

四川大学学报（自然科学版）2024，Vol.61Issue(3)：15-29,15.DOI:10.19907/j.0490-6756.2024.030002

基于异质图属性增强的恶意软件变种检测方法

Malware variant detection method based on heterogeneous graph attribute enhancement

孙锦涛 ¹李祺 ¹李晓龙²

作者信息

1. 北京邮电大学网络空间安全学院,北京 100876
2. 国网宁夏电力有限公司电力科学研究院,银川 750011
折叠

摘要

Abstract

Nowadays,an increasing number of attackers have been circumventing malware detection by modi-fying the source code of malicious software.The complex relationships among malware variants in code re-use,coding style,attack behavior and other aspects pose significant challenges to malware analysis.In recent years,graph neural networks have been widely applied to the tasks of malware classification and detection due to their powerful capabilities in modeling graph-structured data and learning complex relationships be-tween entities.This approach has enabled the modeling of complex relationships between malware and its variants,overcoming the limitations of isolated analysis.However,existing methods,on the one hand,lack a comprehensive characterization of the multi-dimensional complex relationships among malware and its vari-ants,leading to the underutilization of these complex interrelations.On the other hand,they focus only on the topological structure of malware,ignoring the semantic information of entities,allowing attackers to eas-ily forge features through adversarial methods and thus evade detection.In addition,the deficiency of seman-tic information in entities such as Windows API and communication IPs further hinders the extraction and rep-resentation of semantic information.Therefore,achieving the integration of the comprehensive relationships and the feature semantic information is crucial for enhancing the robustness and accuracy of malware variant detection.Accordingly,the authors propose a malware variant detection method,which is enhanced by the attributes of the heterogeneous graph.Specifically,the authors construct a heterogeneous information net-work to capture the complex relationship between malware and its features.Utilizing this network,the mal-ware variant detection is transformed into a node classification problem in a heterogeneous graph.Then,the authors formulate semantic attributes for the entity nodes to enhance the representation of node information.For entity nodes where semantic information is sparse,the authors derive the semantic information of the enti-ties from external open-source data to address their semantic deficiency.Finally,guided by topological rela-tionships,the authors utilize an attention mechanism to aggregate information from nodes with attributes to compensate for those without attributes,achieving attribute completion.Following an iterative optimization approach,the authors alternately optimize the completion process and the heterogeneous graph node embed-ding process,formulating a unified method for malware variant detection that leverages attribute completion in heterogeneous graph.Experimental results show that our proposed method significantly enhances the per-formance of malware variant detection,outperforming other state-of-the-art models across multiple datasets.

关键词

恶意软件变种检测/异质图神经网络/特征增强/属性补全

Key words

Malware variant detection/Heterogeneous graph neural networks/Feature enhancement/Attri-bute completion

分类

信息技术与安全科学

引用本文复制引用

孙锦涛,李祺,李晓龙..基于异质图属性增强的恶意软件变种检测方法[J].四川大学学报（自然科学版）,2024,61(3):15-29,15.

基金项目

国家自然科学基金项目(62172055) （62172055）

宁夏自然科学基金课题(2021AAC03511) （2021AAC03511）

四川大学学报（自然科学版）

OA北大核心CSTPCD

ISSN：0490-6756

访问量0

下载量0

段落导航