四川大学学报(自然科学版)2006,Vol.43Issue(2):334-340,7.
概率密度函数的特征相关法DNA序列分析
Analysis of DNA Sequences Using Feature Correlation of Probability Density Function
罗代升 1罗辑 1谢明 1吴晓红 1余艳梅1
作者信息
- 1. 四川大学电子信息学院,成都,610064
- 折叠
摘要
Abstract
Propose an unbiased method of feature extraction and classification for DNA sequence analysis. In the method, statistical and correlation features are extracted from raw DNA sequence data and the mean correlation features of a sample DNA sequence to all given classes are calculated. If the maximal mean correlation feature exceeds the mean correlation feature of an existing class, the sample is grouped into the corresponding class. Otherwise, it is group into a new class.Using a set of sample DNA sequences, we demonstrate that the method is suitable for analysis of any DNA sequence data without a priori knowledge of functional information. Such approach should be useful in discovering conserved sequence elements in the human genome.关键词
生物信息解译/DNA序列结构分析/特征提取/模式分类Key words
bioinformatics/DNA sequence analysis/feature extraction/pattern classification分类
信息技术与安全科学引用本文复制引用
罗代升,罗辑,谢明,吴晓红,余艳梅..概率密度函数的特征相关法DNA序列分析[J].四川大学学报(自然科学版),2006,43(2):334-340,7.