Computer Technology and Development (计算机技术与发展), 2016, Vol. 26, Issue 12: 6-11, 6. DOI: 10.3969/j.issn.1673-629X.2016.12.002
Topic Evolution Analysis Based on LDA Model and AP Clustering (基于LDA模型和AP聚类的主题演化分析)
Abstract
With the rapid development of the Internet, online information is growing explosively, and topic evolution analysis can help people extract more valuable information from massive Internet data. Analyzing the evolutionary trajectory of a topic helps people understand the causes and consequences of an event and better predict how it will develop, which in turn supports monitoring and control. To address the problems of threshold setting and topic drift in methods that analyze the evolution of a single event, a new LDA-AP model is proposed. In this method, the LDA model is applied to the news texts in each time window to obtain the topics of that window. The AP clustering algorithm is then used to cluster the topics from the different time windows, with topic similarity measured by the JS divergence with an attenuation factor. Finally, an evolution analysis of multiple topics is conducted. Experimental comparison with the reference method shows that the proposed method effectively improves the performance of topic evolution analysis and better captures how multiple news events evolve over time.
Keywords
multi-topic evolution / time window / LDA model / AP clustering algorithm / JS divergence
Classification
Information Technology and Security Science
Cite this article
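NI Liping (倪丽萍), LIU Xiaojun (刘小军), MA Chiyu (马驰宇). Topic Evolution Analysis Based on LDA Model and AP Clustering [J]. Computer Technology and Development, 2016, 26(12): 6-11, 6.
Funding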
Young Scientists Fund of the National Natural Science Foundation of China (71301041, 71271071)
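
Illustrative sketch
The abstract states that topic similarity is measured by the JS divergence with an attenuation factor, but it does not give the formula. The following is a minimal sketch of one way such a measure might look, not the authors' definition. The exponential form of the attenuation, the `decay` parameter, and the function names are all assumptions made for illustration. Each topic is represented as a word-probability list over a shared vocabulary, as an LDA model would produce.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) in bits; terms with p_i == 0 contribute nothing."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence in bits; symmetric and bounded in [0, 1]."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def attenuated_similarity(p, q, window_p, window_q, decay=0.1):
    """Similarity of two topics drawn from time windows window_p and window_q.

    ASSUMPTION: the attenuation is exp(-decay * |window gap|); the source
    only says an attenuation factor is used, not what form it takes.
    """
    gap = abs(window_p - window_q)
    return (1.0 - js_divergence(p, q)) * math.exp(-decay * gap)


# Two similar topics, compared within one window and across three windows.
topic_a = [0.7, 0.2, 0.1]
topic_b = [0.6, 0.3, 0.1]
same_window = attenuated_similarity(topic_a, topic_b, 0, 0)
far_windows = attenuated_similarity(topic_a, topic_b, 0, 3)
print(same_window > far_windows)
```

A pairwise matrix of such similarities could then be passed to an affinity propagation implementation that accepts precomputed similarities, which avoids choosing the number of clusters or a similarity threshold by hand.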