Research on Micro-blog User Similarity Based on Text Attributes (基于文本属性的微博用户相似度研究)

李梦洁 邵曦

计算机技术与发展 (Computer Technology and Development), 2018, Vol. 28, Issue 5: 17-22, 6. DOI: 10.3969/j.issn.1673-629X.2018.05.005

Research on Micro-blog User Similarity Based on Text Similarity

李梦洁 (Li Mengjie) ¹, 邵曦 (Shao Xi) ¹

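Author information

  • 1. College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003, China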

Abstract

Traditional similarity calculation methods ignore users' subjective information, which is an important element reflecting their points of interest. To describe a user fully, both the user's background information and their interactive content on the social platform should be considered. We therefore present a method for calculating Micro-blog user similarity that incorporates text similarity. User similarity is divided mainly into background similarity and interest similarity, the latter determined mainly by text similarity. Cosine similarity is calculated after word segmentation and TF-IDF weighting. User similarity is also described by the user's location, the device they use, the time at which they post on Weibo, the text they re-post, and the relationships between users. Finally, the method uses the analytic hierarchy process (AHP) to determine the weight of each attribute and builds an integrated similarity calculation model. In experiments comparing the user similarity method combined with text similarity against the method before the improvement, the former increases the F1 metric by 34.3%, which shows its superiority.
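The pipeline summarized in the abstract (TF-IDF weighting, cosine similarity, AHP-derived attribute weights, weighted combination) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the smoothed IDF formula, the geometric-mean approximation of the AHP weights, and the three example attributes are all assumptions made for illustration, and the posts are assumed to be already segmented into tokens.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn token lists into sparse TF-IDF vectors ({term: weight}).

    Assumption: smoothed IDF, log((1 + N) / (1 + df)) + 1; the paper's
    exact weighting scheme is not given in the abstract.
    """
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty posts
        vectors.append({
            t: (c / total) * (math.log((1 + n) / (1 + df[t])) + 1)
            for t, c in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def ahp_weights(pairwise):
    """Approximate AHP priority weights from a pairwise comparison matrix.

    Assumption: row geometric means, normalized to sum to 1. This matches
    the principal eigenvector exactly when the matrix is fully consistent.
    """
    gm = [math.prod(row) ** (1.0 / len(row)) for row in pairwise]
    total = sum(gm)
    return [g / total for g in gm]


def combined_similarity(scores, weights):
    """Weighted sum of per-attribute similarity scores."""
    return sum(w * s for w, s in zip(weights, scores))


# Hypothetical, pre-segmented posts from three users.
docs = [
    ["music", "concert", "music"],
    ["music", "concert"],
    ["football", "match"],
]
vecs = tfidf_vectors(docs)
text_sim = cosine(vecs[0], vecs[1])

# Hypothetical pairwise judgments for three attributes:
# text similarity, location, device.
weights = ahp_weights([
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
])

# Location and device scores (1.0 and 0.0) are made-up inputs.
overall = combined_similarity([text_sim, 1.0, 0.0], weights)
```

In a real setting the pairwise judgments would also be checked for consistency before the weights are trusted; that step is omitted here for brevity.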

Key words

Micro-blog/social network/user similarity/text similarity/cosine similarity/analytic hierarchy process

Classification

Information Technology and Security Science

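Cite this article

李梦洁 (Li Mengjie), 邵曦 (Shao Xi). Research on Micro-blog User Similarity Based on Text Similarity [J]. 计算机技术与发展 (Computer Technology and Development), 2018, 28(5): 17-22, 6.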

Funding

National Natural Science Foundation of China, Young Scientists Fund (61401227)

Journal: 计算机技术与发展 (Computer Technology and Development); OA; CSTPCD; ISSN 1673-629X
