| 注册
首页|期刊导航|高技术通讯(英文版)|Scalable classification by clustering: Hybrid can be better than Pure

Scalable classification by clustering: Hybrid can be better than Pure

Deng Shengchun He Zengyou Xu Xiaofei

高技术通讯(英文版)2007,Vol.13Issue(2):131-135,5.
高技术通讯(英文版)2007,Vol.13Issue(2):131-135,5.

Scalable classification by clustering: Hybrid can be better than Pure

Scalable classification by clustering: Hybrid can be better than Pure

Deng Shengchun 1He Zengyou 1Xu Xiaofei1

作者信息

  • 1. Department of Computer Science and Engineering,Harbin Institute of Technology,Harbin 150001,P.R.China
  • 折叠

摘要

Abstract

The problem of scalable classification by clustering in large databases was discussed. Clustering based classification method first generates clusters using clustering algorithms . To classify new coming data points , it finds the k nearest clusters of the data point as neighbors , and assign each data point to the dominant class of these neighbors . Existing algorithms incorporated class information in making clustering decisions and produced pure clusters (each cluster associated with only one class) . We presented hybrid cluster based algorithms , which produce clusters by unsupervised clustering and allow each cluster associated with multiple classes . Experimental results show that hybrid cluster based algorithms outperform pure ones in both classification accuracy and training speed.

关键词

classification/clustering/data mining

Key words

classification/clustering/data mining

分类

化学化工

引用本文复制引用

Deng Shengchun,He Zengyou ,Xu Xiaofei..Scalable classification by clustering: Hybrid can be better than Pure[J].高技术通讯(英文版),2007,13(2):131-135,5.

基金项目

Supported by the High Technology Research and Development Programme of China (No.2Q02AA413310) and the IBM SUR Research Fund. (No.2Q02AA413310)

高技术通讯(英文版)

OAEI

1006-6748

访问量0
|
下载量0
段落导航相关论文