计算机工程与应用2018,Vol.54Issue(11):116-121,6.DOI:10.3778/j.issn.1002-8331.1701-0118
结合半监督与主动学习的时间序列PU问题分类
Time series classification based on PU problem with semi-supervised learning and active learning
摘要
Abstract
Semi-supervised learning is often applied in time series classification based on PU problem,but the boundary data classification is difficult to be accurately labeled in semi-supervised learning method.To resolve the problem,this paper applies the active learning method to build classification of PU problem with a method named OAL(Only Active Learning),which applies active learning to select part of unlabeled data sample,and then labeled with expert manually. To select the most informative data sample to label by expert,it builds a series of classifiers to calculate the difference of an unlabeled data sample,and takes the distribution of the sample into consideration and then applies the amount of infor-mation in the data sample as a data selection strategy for active learning.As OAL cannot get enough labeled data set with limit time and expert,it proposes a way based on OAL which combines semi-supervised learning and active learning and labeled sample with high consistency automatically to increase the amount of labeled data in the training data and ensure the quality of training data.Experiments show that the method proposed can construct more accurate classifiers compared to semi supervised learning for PU data set.关键词
时间序列/正例和无标记样本(PU)问题/分类/主动学习/半监督学习Key words
time series/Positive and Unlabled(PU)problem/classification/active learning/semi-supervised learning分类
信息技术与安全科学引用本文复制引用
陈娟,朱福喜..结合半监督与主动学习的时间序列PU问题分类[J].计算机工程与应用,2018,54(11):116-121,6.基金项目
国家自然科学基金(No.61272277). (No.61272277)