计算机应用研究2012,Vol.29Issue(3):1019-1023,5.DOI:10.3969/j.issn.1001-3695.2012.03.060
基于最大信息熵模型的异常流量分类方法
Anomalous traffic classification based on maximum entropy model
摘要
Abstract
The machine learning model based on maximum entropy principles has been applied successfully in natural language processing, such as machine translation, text auto-classification and speech recognition. This model was first used in network anomalous traffic classification with our exploration. As the maximum entropy model used binary feature function, which was fit for processing nominal feature, it adopted the discrete method based on entropy to preprocessing the training data set. It generated the final feature set by extracting features from KDD99 dataset with CFS algorithm. Finally, employed the BLVM algorithm to evaluate the parameters and got an exponential model subjected to maximum entropy constrain. The model was compared with Naive Bayes, Bayes Net, SVM and C4. 5 by precision, callback and F-Measure. The results of experiment show that the maximum entropy model has better classification efficiency, especially under small size of training data set.关键词
最大信息熵模型/异常流量/离散化/特征选择/参数估计Key words
maximum entropy model/ anomalous traffic/ discretezation/ feature selection/ parameter evaluation分类
信息技术与安全科学引用本文复制引用
钱亚冠,关晓惠,王滨..基于最大信息熵模型的异常流量分类方法[J].计算机应用研究,2012,29(3):1019-1023,5.基金项目
国家"973"计划基金资助项目(2007CB307102) (2007CB307102)
国家科技支撑计划基金资助项目 (2008BAH37B02) (2008BAH37B02)