计算机应用与软件Issue(2):46-50,5.DOI:10.3969/j.issn.1000-386x.2016.02.011
基于主题词的微博热点话题发现
HOT MICROBLOGGING TOPICS DISCOVERY BASED ON SUBJECT TERMS
摘要
Abstract
In recent years,microblogging websites have become the publishing platform of massive information.While providing conven-ience to users,the abundant microblogging information also brings in the risk of information overload.Hot topics discovery can reduce the risk of information overload and improve user experience.Aiming at this,in this paper we present a subject terms-based hot topics discovery meth-od for Chinese microblogging in combination with longest common substrings and Wikipedia knowledge.First,it acquires the high-frequency longest common substring of microblogging as candidate subject terms of description topics.Secondly,it utilises Wikipedia knowledge to screen candidate subject terms.Finally,it collects and clusters the subject terms to discover the topics,and calculates the energy of each top-ic and then selects the hot topics among them.Experiment conducted on real dataset demonstrate that our method can effectively discover hot microblogging topics.关键词
维基百科/最长公共子串/热点话题发现/微博Key words
Subject term/Wikipedia/Longest common substring/Hot topic discovery/Microblogging分类
信息技术与安全科学引用本文复制引用
叶成绪,杨萍,刘少鹏..基于主题词的微博热点话题发现[J].计算机应用与软件,2016,(2):46-50,5.基金项目
国家社会科学基金项目(13BXW037);教育部春晖计划项目 ()