Journal of Frontiers of Computer Science and Technology (计算机科学与探索), 2013, Vol. 7, Issue 8: 747-753 (7 pages). DOI: 10.3778/j.issn.1673-9418.1305004
混合模型的微博交叉话题发现
Extracting Overlapping Topics from Micro-Blog Based on Mixture Model
Abstract
Micro-blogging is a new platform for sharing and disseminating information quickly. It is characterized by a huge amount of scattered and diverse information. Most traditional topic extraction algorithms are partitioning methods, which assign each post to a single topic and do not consider the relationships between topics, so they have limitations. This paper focuses on the task of extracting news topics from large-scale collections of short micro-blog posts. Word segmentation is performed according to the characteristics of micro-blog text, using Chinese word segmentation software with high accuracy and ambiguity recognition developed by the Institute of Noetics and Wisdom, Southwest Jiaotong University. The paper then proposes an overlapping topic detection algorithm based on a mixture model. The experimental results demonstrate the feasibility and validity of the algorithm.
Keywords
micro-blog / overlapping topic detection / mixture model
Classification
Information Technology and Security Science
Cite this article
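ZHAN Yong, YANG Yan, WANG Hongjun (詹勇, 杨燕, 王红军). Extracting Overlapping Topics from Micro-Blog Based on Mixture Model [J]. Journal of Frontiers of Computer Science and Technology, 2013, 7(8): 747-753.
Funding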
The National Natural Science Foundation of China under Grant Nos. 61170111, 61003142, 61134002 (国家自然科学基金);
the Fundamental Research Funds for the Central Universities of China under Grant No. SWJTU11ZT08 (中央高校基本科研业务费专项资金).