计算机应用与软件2017,Vol.34Issue(12):42-46,5.DOI:10.3969/j.issn.1000-386x.2017.12.008
基于位置信息的非比对序列聚类方法
ALIGNMENT-FREE MODEL FOR SEQUENCE CLUSTERING METHOD BASED ON LOCATION INFORMATION
摘要
Abstract
Alignment-free similarity model for sequence calculates the similarity between the sequences by using the statistical information of the sequences,which has the advantage of fast calculation and high precision.Alignment-free model for sequence clustering method based on position information was proposed.The features of sequences can be obtained by combining the LF entropy of the corresponding word which was calculated from the Local Frequency of every word with the K-mers model,and the frequency of every word.This new method can be applied to protein clustering.The experimental results showed this new method improved the accuracy of clustering effectively.关键词
K-词/LF熵/K-means聚类/位置信息Key words
K-mers/Local frequency entropy/Sequence clustering/Position information分类
信息技术与安全科学引用本文复制引用
魏静,徐彭娜,江育娥,林劼..基于位置信息的非比对序列聚类方法[J].计算机应用与软件,2017,34(12):42-46,5.基金项目
国家自然科学基金项目(61472082) (61472082)
福建省自然科学基金项目(2014J01220) (2014J01220)