| 注册
首页|期刊导航|数据采集与处理|基于多重随机性与隐私保护的栈式随机森林算法

基于多重随机性与隐私保护的栈式随机森林算法

宋奕霖 王士同

数据采集与处理2025,Vol.40Issue(5):1222-1238,17.
数据采集与处理2025,Vol.40Issue(5):1222-1238,17.DOI:10.16337/j.1004-9037.2025.05.009

基于多重随机性与隐私保护的栈式随机森林算法

Trestle Random Forest Based on Multiple Randomness and Privacy Protection

宋奕霖 1王士同1

作者信息

  • 1. 江南大学人工智能与计算机学院,无锡 214122
  • 折叠

摘要

Abstract

As an effective ensemble learning algorithm for classification and regression tasks,the random forest(RF)also faces challenges in improving generalization ability and privacy protection.In response to this challenge,this paper proposes an improved Bernoulli-multinomial stacked random forest(BMS-RF)algorithm based on multiple randomness and privacy protection.The basic idea is to introduce Bernoulli distribution Dropout partial feature vectors to select candidate feature vectors in the stage of constructing decision tree splitting features and splitting point selection.By randomly selecting splitting features and splitting points through two polynomial distributions,each decision tree adopts a non numerical query index mechanism to add noise for maintaining its privacy protection mechanism.When integrating classifiers,a multi-layer stack structure is introduced to randomly project the output of the previous layer and concatenate the source training set as new inputs,so that each forest can share the spatial information of the source samples and improve the classification performance of the base learner layer by layer.Theoretical analysis of the consistency and privacy ability of this algorithm shows that BMS-RF can significantly improve classification performance through a stack structure.Experimental results on 14 small and medium-sized datasets verify that the algorithm not only reduces running time but also has better generalization performance.When the privacy protection is strong,it can achieve classification performance similar to RF variants on the basis of simplifying the structure and improving running speed.

关键词

随机森林/集成分类/栈式结构/隐私保护/决策树

Key words

random forest(RF)/integrated classification/stack structure/privacy protection/decision tree

分类

信息技术与安全科学

引用本文复制引用

宋奕霖,王士同..基于多重随机性与隐私保护的栈式随机森林算法[J].数据采集与处理,2025,40(5):1222-1238,17.

基金项目

国家重点研发专项计划(2022YFE0112400). (2022YFE0112400)

数据采集与处理

OA北大核心

1004-9037

访问量0
|
下载量0
段落导航相关论文