计算机工程2017,Vol.43Issue(2):57-62,6.DOI:10.3969/j.issn.1000-3428.2017.02.010
基于热度矩阵的微博热点话题发现
Microblog Hot Topics Detection Based on Heat Matrix
摘要
Abstract
Existing methods or models of microblog hot topics detection are sensitive to the quantity and the scale of microblog,and the detection process is slow.Hence,this paper proposes a topic model based on heat matrix.It uses the heat matrix to obtain heat and the topic-word probability distribution of every latent topic,and uses the common heat of words to extract the semantic relationship between words.Then the hot topics and hot words can be identified accurately.Experimental results on real microblog show that,compared with Latent Dirichlet Allocation (LDA) model,the proposed model has higher efficiency and accuracy rate.It can detect the hot topics which are consistent with real-time events,so that it has better effect in hot spot identification.关键词
热度矩阵/主题模型/微博/话题发现/文本挖掘Key words
heat matrix/topic model/microblog/topic detection/text mining分类
信息技术与安全科学引用本文复制引用
聂文汇,曾承,贾大文..基于热度矩阵的微博热点话题发现[J].计算机工程,2017,43(2):57-62,6.基金项目
国家自然科学基金重点项目(U1135005). (U1135005)