| 注册
首页|期刊导航|计算机工程与应用|FODU:不确定数据集中快速离群点检测方法

FODU:不确定数据集中快速离群点检测方法

钟毓灵 王习特 白梅 朱斌 李冠宇

计算机工程与应用2019,Vol.55Issue(19):105-114,140,11.
计算机工程与应用2019,Vol.55Issue(19):105-114,140,11.DOI:10.3778/j.issn.1002-8331.1807-0150

FODU:不确定数据集中快速离群点检测方法

FODU:Fast Outlier Detection Approach on Uncertain Data Sets

钟毓灵 1王习特 1白梅 1朱斌 1李冠宇1

作者信息

  • 1. 大连海事大学 信息科学技术学院,辽宁 大连 116000
  • 折叠

摘要

Abstract

Outlier detection is a hot topic in the field of data management, which has been widely applied to many fields such as medical diagnosis, financial fraud, environment monitoring and many others. At present, along with the application of sensors in data acquisition, people have realized the universality of uncertain data in many fields. Compared with certain data, it is much more difficult to detect outliers on uncertain data sets. To solve the problems, a Fast Outlier Detection approach on Uncertain data sets(FODU)is proposed. Firstly, an index construction strategy inspired by hierarchical ideas is given, which not only overcomes the limitation of the traditional index structure on multi-dimensional data management, but also can prune the searching space quickly. Furthermore, to detect uncertain outliers efficiently, a new filtering algorithm is proposed. Utilizing batch filtering and single point filtering, this approach can reduce redundant calculations and improve inspection efficiency. Then, to avoid the expansion of the possible world, an approach to compute the abnormal probability of data objects is given. At last, the efficiency and effectiveness of the proposed approaches are verified through a series of simulation experiments. The experimental results show that compared with the previous approaches, the proposed algorithm can significantly improve the computation efficiency of outlier detection on uncertain data.

关键词

离群点检测/不确定性数据/分层次划分/批量过滤

Key words

outlier detection/uncertain data/hierarchical partitioning/batch filtering

分类

信息技术与安全科学

引用本文复制引用

钟毓灵,王习特,白梅,朱斌,李冠宇..FODU:不确定数据集中快速离群点检测方法[J].计算机工程与应用,2019,55(19):105-114,140,11.

基金项目

国家自然科学基金青年基金(No.61602076,No.61702072) (No.61602076,No.61702072)

中国博士后科学基金面上项目(No.2017M611211,No.2017M621122) (No.2017M611211,No.2017M621122)

中央高校基本科研业务费专项资金(No.3132018191) (No.3132018191)

国家重点研发计划项目(No.2017YFC1404606). (No.2017YFC1404606)

计算机工程与应用

OA北大核心CSCDCSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文