| 注册
首页|期刊导航|中华医学杂志(英文版)|Evaluation of clustering algorithms for gene expression data using gene ontology annotations

Evaluation of clustering algorithms for gene expression data using gene ontology annotations

MA Ning ZHANG Zheng-guo

中华医学杂志(英文版)2012,Vol.125Issue(17):3048-3052,5.
中华医学杂志(英文版)2012,Vol.125Issue(17):3048-3052,5.DOI:10.3760/cma.j.issn.0366-6999.2012.17.015

Evaluation of clustering algorithms for gene expression data using gene ontology annotations

Evaluation of clustering algorithms for gene expression data using gene ontology annotations

MA Ning 1ZHANG Zheng-guo1

作者信息

  • 1. Department of Biomedical Engineering,Institute of Basic Medical Sciences,Chinese Academy of Medical Sciences,School of Basic Medicine,Peking Union Medical College,Beijing 100005,China
  • 折叠

摘要

Abstract

Background Clustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes.Biologists frequently face the problem of choosing an appropriate algorithm.We aimed to provide a standalone,easily accessible and biologically oriented criterion for expression data clustering evaluation.Methods An external criterion utilizing annotation based similarities between genes is proposed in this work.Gene ontology information is employed as the annotation source.Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed.Results The rank of these algorithms given by the criterion coincides with our common knowledge.Single-linkage has significantly poorer performance,even worse than the random algorithm.Ward's method archives the best performance in most cases.Conclusions The criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements.It is also demonstrated that analyzing main contributors of the criterion may offer some guidelines in finding local compact clusters.As an addition,we suggest using Ward's algorithm for gene expression data analysis.

关键词

microarray/ gene expression/ clustering/ gene ontology

Key words

microarray/ gene expression/ clustering/ gene ontology

引用本文复制引用

MA Ning,ZHANG Zheng-guo..Evaluation of clustering algorithms for gene expression data using gene ontology annotations[J].中华医学杂志(英文版),2012,125(17):3048-3052,5.

基金项目

This work was financially supported by the Chinese Medical Boards of New York,Inc.(No.CMB#03787). (No.CMB#03787)

中华医学杂志(英文版)

OACSCDCSTPCDMEDLINESCI

0366-6999

访问量2
|
下载量0
段落导航相关论文