计算机应用与软件2011,Vol.28Issue(6):242-246,5.
基于内容分析的中文BBS话题检测系统的设计与实现
DESIGN AND IMPLEMENTATION OF CHINESE BBS TOPIC DETECTION SYSTEM BASED ON CONTENT ANALYSIS
摘要
Abstract
Through analyzing and studying BBS topic model, topic similarity, topic assessment standard and topic development trend, the paper puts forward a content analysis based Chinese BBS topic detection algorithm, including obtaining BBS information by web crawlers,processing BBS information with URL and Xpath based webpage templates, realizing BBS information participles by ICTLAS, clustering BBS topics by Carrot2, analyzing hot topics based on the power spectrum and predicting topics based on time sequences. Finally a Chinese BBS topic detection system is realized by applying J2EE SDK and Eclipse IDE as well as combining such technologies as Hibernate and GWT etc.A number of tests have been performed on multiple BBS; all have achieved fine results.关键词
BBS话题检测/网络爬虫/话题聚类/热点分析Key words
BBS Topic detection / Web crawler/ Topic clustering /Hot spot analysis引用本文复制引用
赵艳红,聂哲..基于内容分析的中文BBS话题检测系统的设计与实现[J].计算机应用与软件,2011,28(6):242-246,5.基金项目
深圳市科技计划项目资助课题(07KJce140). (07KJce140)