郑州大学学报(理学版)2011,Vol.43Issue(1):65-69,74,6.
一种应用于博客的垃圾评论识别方法
A Research on Identifying Comments Spam for Blog Comments
摘要
Abstract
A new method to identify blog comments spam was proposed. The short comments were identified by the network common words first, and made K rounds to identify the comments which used the improved similarity formula. Following every identifies, the weights of keywords and extend keywords were adjusted. All the comments were identified to the category. The spam reviews were filter again by the network common words and the keywords, and more legitimate comments were identified. Experimental results showed that the method, to some extent, improved the recognition accuracy.关键词
博客垃圾评论/相似度/语义信息Key words
blog comments/ spam similarity/ semantic information分类
信息技术与安全科学引用本文复制引用
邓冰娜,王煜,刘宇..一种应用于博客的垃圾评论识别方法[J].郑州大学学报(理学版),2011,43(1):65-69,74,6.基金项目
河北省教育厅科学研究重点项目,编号ZH200804. ()