郑州轻工业学院学报(自然科学版)Issue(2):58-61,4.DOI:10.3969/j.issn.2095-476X.2014.02.015
微博话题检测S P & HC聚类算法分析
The SP & HC clustering algorithm analysis of micro-blog topic detection
摘要
Abstract
Aiming at the problem that micro-blog had large amount of information,the coalescing hierarchical clustering algorithm was not suitable and Single-Pass clustering algorithm results was not accurate,a new algorithm SP&HC integrating hierarchical clustering algorithm and Single-Pass clustering algorithm were put forward.It used Single-Pass clustering algorithm to make the large number of micro-blog text become into the simple clustering,in order to collect some small amplitude and high cohesive theme topic.This greatly streamlined the content and quantity of the topic,until making the theme topic hierarchical cluste-ring algorithm to achieve the requirements;then it used hierarchical clustering algorithm to carry out a sim-ilar topic clustering,until that conditions met defaults.The simulation experiment results showed that the performance on recall and accuracy of the algorithm was better than the first two algorithms.关键词
微博/热点话题/层次聚类算法/Single-Pass聚类算法/SP&HC聚类算法Key words
micro-blog/hot topic/hierarchical clustering algorithm/Single-Pass clustering algorithm/SP&HC clustering algorithm分类
信息技术与安全科学引用本文复制引用
甘勇,姜森,杨佳佳..微博话题检测S P & HC聚类算法分析[J].郑州轻工业学院学报(自然科学版),2014,(2):58-61,4.基金项目
国家自然科学基金项目 ()