| 注册
首页|期刊导航|计算机工程|基于联合非负矩阵分解的话题变迁检测方法

基于联合非负矩阵分解的话题变迁检测方法

陈梦伟 吕钊 崔修涛

计算机工程2018,Vol.44Issue(1):35-43,9.
计算机工程2018,Vol.44Issue(1):35-43,9.DOI:10.3969/j.issn.1000-3428.2018.01.006

基于联合非负矩阵分解的话题变迁检测方法

Topic Change Detection Method Based on Joint Nonnegative Matrix Factorization

陈梦伟 1吕钊 1崔修涛2

作者信息

  • 1. 华东师范大学计算机科学与技术系,上海200241
  • 2. 上海长江计算机有限公司,上海200241
  • 折叠

摘要

Abstract

In large-scale temporal documents similarities and differences do not have the ability to identily topics from temporal documents and to track and analyze topics over time.To this end,a method of topic change detection for temporal document corpus is proposed.Similar topics and similarities and foundations are found in the temporal document corpus.Using the improved joint Nonnegative Matrix Factorization (NMF) algorithm,similarities and differences were found in the the timeseries document.To avoid the introduction of noise topics,by calculating the topic of all topic entropy,in order to obtain high-quality topics.Use the word cloud and trend graph to analyze the trend of topic change.Experimental results of two real data sets,20Newsgroups and LTN2011 show that this method can effectively find similarities and differences from the tempord of documents,and the extraction topic is effect and the accuracy is high.

关键词

联合非负矩阵分解/话题模型/时序异同话题/优质话题/话题变迁检测

Key words

Joint Nonnegative Matrix Factorization (NMF)/topic model/temporal similarities and differences topic/high quality topic/topic change detection

分类

信息技术与安全科学

引用本文复制引用

陈梦伟,吕钊,崔修涛..基于联合非负矩阵分解的话题变迁检测方法[J].计算机工程,2018,44(1):35-43,9.

基金项目

上海市科学技术委员会科研计划项目(16511102702) (16511102702)

上海市经济和信息化委员会项目(150643). (150643)

计算机工程

OA北大核心CSCDCSTPCD

1000-3428

访问量0
|
下载量0
段落导航相关论文