计算机应用研究2011,Vol.28Issue(5):1688-1691,4.DOI:10.3969/j.issn.1001-3695.2011.05.027
蛋白质二级结构的协同训练预测方法
Protein secondary structure co-training prediction method
摘要
Abstract
Machine learning based protein secondary structure prediction methods suffered low prediction accuracy because they ignored the amino acid hydrophobic property and the interaction between far away amino acids.In order to solve this problem, comparative experiments had been done.A sequence of hydrophobic value could be build by replacing the amino acid by its hydrophobic value.Experiments show that the BP neural network using long amino hydrophobic value sequence works well in prediction of E structure which is controlled mainly by long amino acid-amino acid interaction.Because both the Profile space and the hydrophobic energy value space were sufficient and redundant views, this paper proposed a Co-training algorithm.In the proposed algorithm, there were two classifiers.One was SVM classifier trained in Profile space, and the other was BP neural network classifier trained in hydrophobic value space, and they predicted one amino acid secondary structure independently.ff these two classifiers had different prediction results with one amino acid, an arbitration rule proposed was employed to make the final decision which was based on an active selecting strategy.Suspected sample and creditable sample were defined according to the characteristics of the classifiers and spaces to arbitrate the controversial prediction results.The experimental results show that the proposed algorithm has higher prediction accuracy both in E structure which controlled mainly by long interaction and H structure which controlled mainly by short interaction than existing algorithms.关键词
协同训练/蛋白质/二级结构预测/支持向量机/神经网络Key words
co-training/ proteil/ secondary structure prediction/ SVM( support vector machine)/ neural network分类
信息技术与安全科学引用本文复制引用
刘君,熊忠阳,王银辉..蛋白质二级结构的协同训练预测方法[J].计算机应用研究,2011,28(5):1688-1691,4.基金项目
中国博士后科学基金资助项目(20070420711) (20070420711)
重庆市科委自然科学基金资助项目(2007BB2372) (2007BB2372)