| 注册
首页|期刊导航|色谱|蛋白质组质谱分析中基于串并联支持向量机的肽段色谱保留时间预测方法

蛋白质组质谱分析中基于串并联支持向量机的肽段色谱保留时间预测方法

张纪阳 张代兵 张伟 谢红卫

色谱2012,Vol.30Issue(9):857-863,7.
色谱2012,Vol.30Issue(9):857-863,7.DOI:10.3724/SP.J.1123.2012.06021

蛋白质组质谱分析中基于串并联支持向量机的肽段色谱保留时间预测方法

A new peptide retention time prediction method for mass spectrometry based proteomic analysis by a serial and parallel support vector machine model

张纪阳 1张代兵 1张伟 1谢红卫1

作者信息

  • 1. 国防科学技术大学机电工程与自动化学院,湖南长沙410073
  • 折叠

摘要

Abstract

The online reversed-phase liquid chromatography (RPLC) contributes a lot for the large scale mass spectrometry based protein identification in proteomics. Retention time (RT) as an important evidence can be used to distinguish the false positive/true positive peptide identifications. Because of the nonlinear concentration curve of organic phase in the whole range of run time and the interactions among peptides, the sequence based RT prediction of peptides has low accuracy and is difficult to generalize in practice, and thus is less effective in the validation of peptide identifications. A serial and parallel support vector machine (SP-SVM) method was proposed to characterize the nonlinear effect of organic phase concentration and the interactions among peptides. The SP-SVM contains a support vector regression (SVR) only for model training (named as p-SVR) and 4 SVM models (named as C-SVM, 1-SVR, s-SVR and n-SVR) for the RT prediction. After distinguishing the peptide chromatographic behavior by C-SVM, 1-SVR and s-SVR were used to predict the peptide RT specifically to improve the accuracy. Then the peptide RT was normalized by n-SVR to characterize the peptide interactions. The prediction accuracy was improved significantly by applying this method to the processing of the complex sample dataset. The coefficient of the determination between predictive and experimental RTs reaches 0.95, the prediction error range was less than 20% of the total LC run time for more than 95% cases, and less than 10% of the total LC run time for more than 70% cases. The performance of this model reaches the best of known so far. More important, the SP-SVMmethod provides a framework to take into account the interactions among peptides in chroma-tographic separation, and its performance can be improved further by introducing new data processing and experiment strategy.

关键词

液相色谱-质谱/串并联支持向量机/保留时间/预测精度/肽段鉴定/蛋白质组学

Key words

liquid chromatography-mass spectrometry (LC-MS)/ serial and parallel support vectormachine (SP-SVM)/ retention time/ prediction accuracy/ peptide identifications/ proteomics

分类

化学化工

引用本文复制引用

张纪阳,张代兵,张伟,谢红卫..蛋白质组质谱分析中基于串并联支持向量机的肽段色谱保留时间预测方法[J].色谱,2012,30(9):857-863,7.

基金项目

国家自然科学基金青年基金项目(31000587). (31000587)

色谱

OA北大核心CSCDCSTPCDMEDLINE

1000-8713

访问量0
|
下载量0
段落导航相关论文