| 注册
首页|期刊导航|计算机科学与探索|绝对不平衡样本分类的集成迁移学习算法

绝对不平衡样本分类的集成迁移学习算法

么素素 王宝亮 侯永宏

计算机科学与探索2018,Vol.12Issue(7):1145-1153,9.
计算机科学与探索2018,Vol.12Issue(7):1145-1153,9.DOI:10.3778/j.issn.1673-9418.1705080

绝对不平衡样本分类的集成迁移学习算法

Ensemble Transfer Learning Algorithm for Absolute Imbalanced Data Classification

么素素 1王宝亮 2侯永宏1

作者信息

  • 1. 天津大学 电气自动化与信息工程学院,天津 300072
  • 2. 天津大学 信息与网络中心,天津 300072
  • 折叠

摘要

Abstract

According to the problem of mining with absolute imbalanced data, this paper proposes an ensemble transfer learning algorithm based on cascade structure. The algorithm consists of two parts: the transfer learning and the data selection. At the transfer learning stage, to solve the problem that the weight of auxiliary domain data is irre-versible in the TrAdaBoost algorithm, the weight recovery factor is introduced. At the data selection stage, the algo-rithm gradually deletes the noise samples and redundant samples of the auxiliary domain at each node of cascade structure. The algorithm makes full use of the auxiliary domain data while ensuring the leading role of the target do-main. The experimental results on the real data sets show that the algorithm has better effect on the F-measure value and G-mean value under the condition of absolute imbalanced data. Therefore, the proposed algorithm can solve the problem of absolute imbalance of training data to a certain extent.

关键词

集成迁移学习/级联模型/不平衡数据/TrAdaBoost

Key words

ensemble transfer learning/cascade module/imbalanced data/TrAdaBoost

分类

信息技术与安全科学

引用本文复制引用

么素素,王宝亮,侯永宏..绝对不平衡样本分类的集成迁移学习算法[J].计算机科学与探索,2018,12(7):1145-1153,9.

基金项目

The National Natural Science Foundation of China under Grant No. 61571325(国家自然科学基金). (国家自然科学基金)

计算机科学与探索

OA北大核心CSCDCSTPCD

1673-9418

访问量0
|
下载量0
段落导航相关论文