| 注册
首页|期刊导航|网络与信息安全学报|基于主题模型的微博话题检测算法

基于主题模型的微博话题检测算法

黄华军 谭骏珊 秦姣华

网络与信息安全学报2016,Vol.2Issue(5):30-38,9.
网络与信息安全学报2016,Vol.2Issue(5):30-38,9.DOI:10.11959/j.issn.2096-109x.2016.00049

基于主题模型的微博话题检测算法

Micro-blog topic detection algorithm based on topic model

黄华军 1谭骏珊 1秦姣华1

作者信息

  • 1. 中南林业科技大学计算机与信息工程学院,湖南长沙 410004
  • 折叠

摘要

Abstract

Micro-blog data has the characteristic of real-time, volume, short-text, and noise-rich. So it is a challenge for the traditional topic detection technology. A novel micro-blog topic detection algorithm based on topic model was proposed. Firstly, the micro-blog data was expressed as text word matrix and word relation matrix. The topic word was extracted from the two vectors. Secondly, the topic model was obtained with clustering. Finally, the topic detection of micro-blog was obtained by clustering text and topic model. Experimental results show that the algo-rithm proposed can effectively detection the text topic, and with the best parameter group of precision, recall rate,F, and the valueF is about 95%.

关键词

话题检测/主题模型/文档词条矩阵/词语关联矩阵

Key words

topic detection/topic model/text word matrix/word relation matrix

分类

信息技术与安全科学

引用本文复制引用

黄华军,谭骏珊,秦姣华..基于主题模型的微博话题检测算法[J].网络与信息安全学报,2016,2(5):30-38,9.

基金项目

国家自然科学基金资助项目(No.61304208);湖南省自然科学基金资助项目(No.13JJ2031);中南林业科技大学青年科学研究基金资助项目(No.QJ2012009A) Foundation Items:The National Natural Science Foundation of China (No.61304208), The Natural Science Foundation of Hunan Province (No.13JJ2031),Youth Scientific Research Foundation of Central South University of Forestry &Technology (No.QJ2012009A) (No.61304208)

网络与信息安全学报

2096-109X

访问量0
|
下载量0
段落导航相关论文