An Improved Algorithm Based on Single-Pass for Online Topic Detection

MA Yongjun, LIU Yang, LI Yajun, WANG Rui

Journal of Tianjin University of Science & Technology, 2017, Vol. 32, Issue 6: 73-78 (6 pages). DOI: 10.13364/j.issn.1672-6510.20160432

An Improved Algorithm Based on Single-Pass for Online Topic Detection

MA Yongjun (1), LIU Yang (2), LI Yajun (1), WANG Rui (1)

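Author information

  • 1. College of Computer Science and Information Engineering, Tianjin University of Science & Technology, Tianjin 300457
  • 2. Research Center for Food Safety Management and Strategy, Tianjin University of Science & Technology, Tianjin 300222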

Abstract

At present, topic detection research mainly relies on Single-Pass and its improved variants for clustering analysis. However, these algorithms use a single similarity calculation method and do not consider the structural characteristics of the text, which reduces clustering accuracy. This study improves the similarity calculation of Single-Pass by proposing a strategy that combines multiple similarity computations, taking the title, abstract, time, place names and source into consideration, and uses the analytic hierarchy process to calculate and assign a weight to each. Because food safety is a topic of wide public concern, we analyzed food safety data from the last three years, collected with a web crawler. The results show that the improved Single-Pass clustering algorithm proposed in this paper achieves higher topic detection accuracy.
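
The combination strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the field weights below are placeholder values (the paper derives its weights with the analytic hierarchy process), the per-field cosine similarity on raw term counts, the max-over-members cluster similarity, and the threshold are all assumptions made for illustration, and the names `WEIGHTS`, `combined_similarity` and `single_pass` are hypothetical.

```python
from collections import Counter
from math import sqrt

# Placeholder weights (assumption); the paper computes its weights with AHP.
WEIGHTS = {"title": 0.4, "abstract": 0.3, "time": 0.1, "place": 0.1, "source": 0.1}


def cosine(a, b):
    """Cosine similarity between two token lists, using raw term counts."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm = sqrt(sum(v * v for v in ca.values())) * sqrt(sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def combined_similarity(doc, other, weights=WEIGHTS):
    """Weighted sum of per-field similarities; a missing field contributes 0."""
    return sum(w * cosine(doc.get(f, []), other.get(f, [])) for f, w in weights.items())


def single_pass(docs, threshold=0.5):
    """Assign each document, in arrival order, to the most similar existing
    cluster, or open a new cluster when no similarity reaches the threshold.
    Returns one cluster index per document."""
    clusters = []  # each cluster is a list of its member documents
    labels = []
    for doc in docs:
        best_idx, best_sim = -1, 0.0
        for idx, members in enumerate(clusters):
            # Assumption: similarity to a cluster is the maximum over its members.
            sim = max(combined_similarity(doc, m) for m in members)
            if sim > best_sim:
                best_idx, best_sim = idx, sim
        if best_idx >= 0 and best_sim >= threshold:
            clusters[best_idx].append(doc)
            labels.append(best_idx)
        else:
            clusters.append([doc])
            labels.append(len(clusters) - 1)
    return labels
```

Each document is a dictionary mapping a field name to a list of tokens, so a report can match an existing topic on its title and place name even when its source differs. Because Single-Pass reads each document once and never revisits earlier assignments, the result depends on arrival order and on the chosen threshold.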

Key words

internet public opinion/Single-Pass/similarity calculation/food safety

Classification

Information Technology and Security Science

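Cite this article

MA Yongjun, LIU Yang, LI Yajun, WANG Rui. An Improved Algorithm Based on Single-Pass for Online Topic Detection [J]. Journal of Tianjin University of Science & Technology, 2017, 32(6): 73-78.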

Funding

Major Project of the Tianjin Municipal Education Commission (2014ZD22)

Tianjin Research Program of Application Foundation and Advanced Technology (14JCQNJC00300)

Journal of Tianjin University of Science & Technology

OA, CSTPCD

ISSN 1672-6510
