首页|期刊导航|工程科学与技术|基于增量学习的社交网络链路预测

基于增量学习的社交网络链路预测

舒坚陈芷晨

工程科学与技术2025，Vol.57Issue(2)：1-11,11.

工程科学与技术2025，Vol.57Issue(2)：1-11,11.DOI:10.12454/j.jsuese.202300866

基于增量学习的社交网络链路预测

Social Networks Link Prediction Based on Incremental Learning

舒坚 ¹陈芷晨²

作者信息

1. 南昌航空大学软件学院,江西南昌 330063
2. 南昌航空大学软件学院,江西南昌 330063||江西应用科技学院未来技术学院,江西南昌 330100
折叠

摘要

Abstract

Objective Various types of relationships in social networks exist between nodes,and the number of nodes changes over time.The heterogeneity and dynamic topology pose significant challenges for link prediction.Specifically,this study addresses three key issues:(1)an incremental up-date strategy for random walk sequences,(2)the extraction of the correlation of causal relationships,and(3)the construction of the mutual per-ceptron. Methods Therefore,an incremental learning social network link prediction(IL-SNLP)method is proposed.IL-SNLP consists of two compon-ents:the node embedding model and the prediction model.The node embedding model structures the network in layers based on relationship types.An incremental update strategy is designed for each network layer to generate updated random walk sequences,employing a temporal ran-dom walk approach.The incremental skip-gram model is then utilized to extract the embedding vectors of nodes from the random walk sequences in each layer.A probabilistic model is employed to extract the correlation of causal relationships between relationship types,enhancing the repres-entation of nodes across different layers.In the prediction model,a multilayer perceptron(MLP)is utilized to construct the mutual perceptron,which predicts links by processing the embedding vectors of node pairs.The specific process of the node embedding model consists of temporal random walks,an incremental update strategy for random walk sequences,an incremental skip-gram model,and extracting the correlation of caus-al relationships.In the process of generating random walk sequences based on temporal random walks,the next node is selected if its timestamp is greater than the current timestamp.This approach ensures that both local structural and temporal information are preserved within these se-quences.An incremental update strategy is introduced to update the random walk sequences in scenarios involving data increments.This updated strategy considers three situations:completely outdated sequences,partially outdated sequences,and the emergence of new nodes.The increment-al skip-gram model is employed to extract and update the embedding vectors of nodes from the revised random walk sequences.This model auto-matically updates the unigram table and the noise distribution of all nodes using the updated random walk sequences without requiring access to previous data.For each relationship type,a conditional probability model is employed to calculate the probability of other relationship types oc-curring following a given relationship type.This model extracts the correlation of causal relationships.The correlation of causal relationships is incorporated into the node embedding vectors to generate the final embedding vectors.The prediction model utilizes an MLP to construct the mu-tual perceptron.The contribution of a node to a link is determined using the inner product of vectors.The contribution of a node pair to a link is then obtained by summing the individual contributions of both nodes.The computed node pair contribution is passed through a sigmoid function to determine the probability of a link forming. Results and Discussions Experiments are conducted to evaluate the prediction performance of IL-SNLP on three real-world social networks with varying sizes and sparsity levels:Super User,Math Overflow,and Ask Ubuntu.These datasets contain three relationship types:answering ques-tions,commenting on answers,and commenting on questions.The evaluation metrics include AUC and F1-score.The experiments encompass several aspects,including comparative analysis,validation of effectiveness,assessment of the impact of the temporal random walk strategy,and evaluation of the effect of different incremental granularities on IL-SNLP.The comparison experiment results shows that the IL-SNLP enhances prediction performance over the baseline methods by 10.08%to 67.60%in AUC and 1.76%to 64.67%in F1-score.This experiment demon-strates the superior prediction performance of IL-SNLP.In addition,the baseline models do not perform effectively on sparse networks.However,the prediction performance of IL-SNLP on Ask Ubuntu and Super User decreases by only 4.54%and 5.25%,respectively,compared to its performance on Math Overflow.IL-SNLP exhibits a more stable prediction performance than the baseline methods.The validation of effect-iveness experiments includes the validation of the prediction performance effectiveness of the causal relationship and mutual perceptron,as well as the validation of the time performance effectiveness of IL-SNLP.These results showed 1)that IL-SNLP improves the AUC by 51.82%to 67.78%and the F1-score by 61.47%to 81.91%across the three datasets,validating the two modules'effectiveness and demonstrates that the cor-relation of the causal relationship and the mutual perceptron enhances the model's prediction performance;2)after only 16 to 20 iterations of training with inherited parameters,the model achieves the same performance as a model trained with non-inherited parameters after 200 iterations;3)IL-SNLP improves time efficiency by 30.78%to 257.58%.This experiment demonstrated that IL-SNLP significantly reduces the running time and validates training effectiveness with inherited parameters.In addtion,the effect of two temporal random walk strategies on IL-SNLP shows that the random walk with bias is more effective;the effect of different incremental granularity levels on IL-SNLP reveals that the prediction per-formance of IL-SNLP on larger-scale networks is more affected by incremental granularity. Conclusions Accordingly,IL-SNLP exhibits superior prediction performance and significantly enhances runtime efficiency.In practical applica-tion scenarios,IL-SNLP can support the discovery and prediction of anomalous relationships in social networks,ensuring their security and sta-bility.

关键词

社交网络/链路预测/增量学习/时序随机游走/概率模型

Key words

social network/link prediction/incremental learning/temporal random walk/probabilistic model

分类

信息技术与安全科学

引用本文复制引用

舒坚,陈芷晨..基于增量学习的社交网络链路预测[J].工程科学与技术,2025,57(2):1-11,11.

基金项目

国家自然科学基金项目(62362052 （）

62062050) （）

江西省教育厅科学技术研究项目(GJJ2403207) （GJJ2403207）

江西省研究生创新专项基金项目(YC2022-S764) （YC2022-S764）

工程科学与技术

OA北大核心

ISSN：2096-3246

访问量0

下载量0

段落导航