计算机工程与应用2013,Vol.49Issue(2):92-96,5.DOI:10.3778/j.issn.1002-8331.1208-0007
一种针对同音词伪装的反垃圾短信系统设计
System design against spam message disguised with homonym
胡德敏 1胡金龙1
作者信息
- 1. 上海理工大学光电信息与计算机工程学院,上海200093
- 折叠
摘要
Abstract
As the progress of the spam message filtering technology, characteristics of spam message are changing all the time. Of them, spam message disguised with homonym can easily escape from filtering system. Feature that homonym shares same pinyin makes it possible that by replacing key words with pinyin it can pick up common vector and disguised vector. Making such two vectors as input of the filter system based on Bayesian respectively, it can get two independent outputs, by analyzing the outputs, the system can tell the spam message from the normal. Experimental result confirms that this system can identify spam message disguised with homonym effectively.关键词
垃圾短信/贝叶斯分类/分词/概率/提取Key words
spam message/ Bayesian classification/ words spit/ possibility/ extract分类
信息技术与安全科学引用本文复制引用
胡德敏,胡金龙..一种针对同音词伪装的反垃圾短信系统设计[J].计算机工程与应用,2013,49(2):92-96,5.