网络与信息安全学报2016,Vol.2Issue(5):30-38,9.DOI:10.11959/j.issn.2096-109x.2016.00049
基于主题模型的微博话题检测算法
Micro-blog topic detection algorithm based on topic model
摘要
Abstract
Micro-blog data has the characteristic of real-time, volume, short-text, and noise-rich. So it is a challenge for the traditional topic detection technology. A novel micro-blog topic detection algorithm based on topic model was proposed. Firstly, the micro-blog data was expressed as text word matrix and word relation matrix. The topic word was extracted from the two vectors. Secondly, the topic model was obtained with clustering. Finally, the topic detection of micro-blog was obtained by clustering text and topic model. Experimental results show that the algo-rithm proposed can effectively detection the text topic, and with the best parameter group of precision, recall rate,F, and the valueF is about 95%.关键词
话题检测/主题模型/文档词条矩阵/词语关联矩阵Key words
topic detection/topic model/text word matrix/word relation matrix分类
信息技术与安全科学引用本文复制引用
黄华军,谭骏珊,秦姣华..基于主题模型的微博话题检测算法[J].网络与信息安全学报,2016,2(5):30-38,9.基金项目
国家自然科学基金资助项目(No.61304208);湖南省自然科学基金资助项目(No.13JJ2031);中南林业科技大学青年科学研究基金资助项目(No.QJ2012009A) Foundation Items:The National Natural Science Foundation of China (No.61304208), The Natural Science Foundation of Hunan Province (No.13JJ2031),Youth Scientific Research Foundation of Central South University of Forestry &Technology (No.QJ2012009A) (No.61304208)