Computer Engineering, Issue (5): 7-11, 5. DOI: 10.3969/j.issn.1000-3428.2014.05.002
Mining Algorithm and Structural Analysis of Microblog Interpersonal Relationship Network Based on Tag
Abstract
Given the widespread use of microblog services and their impact on data mining techniques, a mining algorithm for microblog interpersonal relationship networks is proposed based on fuzzy matching of tags, and the characteristics of the resulting network are analyzed. Working from users' tags, the algorithm computes the matching degree between words mainly from their morphemes, the order of those morphemes, and word length. To reduce the influence that the choice of starting user may have on the result, ordinary users and celebrities are used separately as starting points. The structural characteristics of the network are also studied, and the analysis shows that the network has small-world and scale-free properties. The experiments report the mining error rate for friends of celebrities and of common users who are interested in IT. When mining the friends of 10 celebrity users, the average error rate of the algorithm is 14.08%, and for common users it is 10.63%.
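The abstract names the three signals used for tag matching (morphemes, order, and word length) but does not give the formula. The sketch below is only one plausible way to combine them and is not the authors' method: it assumes each character of a tag can stand in for a morpheme, measures order agreement with a longest common subsequence, and uses illustrative weights and an illustrative threshold. The names match_degree, tags_match, and lcs_length are hypothetical.

```python
def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def match_degree(tag_a: str, tag_b: str,
                 w_morpheme: float = 0.5,
                 w_order: float = 0.3,
                 w_length: float = 0.2) -> float:
    """Hypothetical fuzzy match score in [0, 1].

    The weights are illustrative assumptions, not values from the paper.
    """
    if not tag_a or not tag_b:
        return 0.0
    longer = max(len(tag_a), len(tag_b))
    shorter = min(len(tag_a), len(tag_b))
    # Morpheme term: shared characters relative to all distinct characters
    # (assumes one character approximates one morpheme).
    set_a, set_b = set(tag_a), set(tag_b)
    morpheme = len(set_a & set_b) / len(set_a | set_b)
    # Order term: fraction of the longer tag matched in the same sequence.
    order = lcs_length(tag_a, tag_b) / longer
    # Length term: ratio of the shorter length to the longer length.
    length = shorter / longer
    return w_morpheme * morpheme + w_order * order + w_length * length


def tags_match(tag_a: str, tag_b: str, threshold: float = 0.6) -> bool:
    """Treat two tags as the same interest if the score reaches the threshold.

    The threshold is an assumed value chosen for illustration.
    """
    return match_degree(tag_a, tag_b) >= threshold


if __name__ == "__main__":
    for pair in [("互联网", "互联网"), ("数据挖掘", "挖掘数据"), ("互联网", "摄影")]:
        print(pair, round(match_degree(*pair), 3), tags_match(*pair))
```

Under these assumptions, two tags with the same characters in a different order score lower than identical tags, which is the behavior the order term is meant to capture. The weights and threshold would need tuning against labeled data.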
Key words
tag/microblog/interpersonal relationship network/fuzzy matching/data mining/structural characteristics
Classification
Information Technology and Security Science
Cite this article
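Wang Sha (王莎), Zhang Lianming (张连明). Mining Algorithm and Structural Analysis of Microblog Interpersonal Relationship Network Based on Tag [J]. Computer Engineering, 2014, (5): 7-11, 5.
Funding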
Supported by the National Natural Science Foundation of China (60973129) and the Natural Science Foundation of Guangdong Province (S2011010000812).