| 注册
首页|期刊导航|计算机应用与软件|基于混合特征学习的微博转发预测方法

基于混合特征学习的微博转发预测方法

马晓峰 王磊 陈观淡

计算机应用与软件2016,Vol.33Issue(11):249-252,257,5.
计算机应用与软件2016,Vol.33Issue(11):249-252,257,5.DOI:10.3969/j.issn.1000-386x.2016.11.058

基于混合特征学习的微博转发预测方法

A MICROBLOGGING RETWEET PREDICTION METHOD BASED ON HYBRID FEATURES LEARNING

马晓峰 1王磊 2陈观淡2

作者信息

  • 1. 上海数据分析与处理技术研究所 上海 201112
  • 2. 中国科学院自动化研究所复杂系统管理与控制国家重点实验室 北京 100190
  • 折叠

摘要

Abstract

Microblogging retweet prediction is one of the key problems in information dissemination,which plays important roles in public opinion monitoring,advertising,and business decision making.The process of information dissemination is influenced by many factors such as user interest,microblogging author’s influence,and content of post,etc.The challenge of improving prediction performance is how to capture the important features for retweet prediction.In this paper,we propose a retweet prediction method based on hybrid features learning. Firstly,the method introduces and analyses the impacts of hybrid features including social influence locality,user features,and microblogging content features.Then,it builds the retweet prediction model based on classification algorithms.Finally,it compares the results of different types of microblog.Experimental results on Sina Weibo datasets show that local social influence features,user features and microblogging content features affect the retweet prediction,and the greatest impact is the micro-blog content features.Random forest method has the best performance,and the accuracy rate can reach 83.1%.Compared to Naive Bayes,logistic regression and SVM,the accuracy rate increased by an average of about 7.4%,the highest increase of about 10.8%.In addition,the method has an advantage on topics about natural disasters,environment,trial,rights,which shows that these kinds of events contain stronger retweet patterns.

关键词

微博/混合特征学习/转发预测

Key words

Microblogging/Hybrid features learning/Retweet prediction

分类

信息技术与安全科学

引用本文复制引用

马晓峰,王磊,陈观淡..基于混合特征学习的微博转发预测方法[J].计算机应用与软件,2016,33(11):249-252,257,5.

计算机应用与软件

OACSTPCD

1000-386X

访问量0
|
下载量0
段落导航相关论文