首页|期刊导航|中华医学杂志(英文版)|Evaluation of clustering algorithms for gene expression data using gene ontology annotations
中华医学杂志(英文版)2012,Vol.125Issue(17):3048-3052,5.DOI:10.3760/cma.j.issn.0366-6999.2012.17.015
Evaluation of clustering algorithms for gene expression data using gene ontology annotations
Evaluation of clustering algorithms for gene expression data using gene ontology annotations
摘要
Abstract
Background Clustering is a useful exploratory technique for interpreting gene expression data to reveal groups of genes sharing common functional attributes.Biologists frequently face the problem of choosing an appropriate algorithm.We aimed to provide a standalone,easily accessible and biologically oriented criterion for expression data clustering evaluation.Methods An external criterion utilizing annotation based similarities between genes is proposed in this work.Gene ontology information is employed as the annotation source.Comparisons among six widely used clustering algorithms over various types of gene expression data sets were carried out based on the criterion proposed.Results The rank of these algorithms given by the criterion coincides with our common knowledge.Single-linkage has significantly poorer performance,even worse than the random algorithm.Ward's method archives the best performance in most cases.Conclusions The criterion proposed has a strong ability to distinguish among different clustering algorithms with different distance measurements.It is also demonstrated that analyzing main contributors of the criterion may offer some guidelines in finding local compact clusters.As an addition,we suggest using Ward's algorithm for gene expression data analysis.关键词
microarray/ gene expression/ clustering/ gene ontologyKey words
microarray/ gene expression/ clustering/ gene ontology引用本文复制引用
MA Ning,ZHANG Zheng-guo..Evaluation of clustering algorithms for gene expression data using gene ontology annotations[J].中华医学杂志(英文版),2012,125(17):3048-3052,5.基金项目
This work was financially supported by the Chinese Medical Boards of New York,Inc.(No.CMB#03787). (No.CMB#03787)