| 注册
首页|期刊导航|分析化学|基于色谱峰形优劣的代谢组学峰检测参数优化算法比较

基于色谱峰形优劣的代谢组学峰检测参数优化算法比较

盛阳昊 王珏 蒋跃平

分析化学2024,Vol.52Issue(1):130-137,中插44-中插46,11.
分析化学2024,Vol.52Issue(1):130-137,中插44-中插46,11.DOI:10.19756/j.issn.0253-3820.221383

基于色谱峰形优劣的代谢组学峰检测参数优化算法比较

Comparison of Metabolomics Peak-Picking Parameter Optimization Algorithms Based on Chromatographic Peak Shape

盛阳昊 1王珏 2蒋跃平3

作者信息

  • 1. 中南大学湘雅医院药学部,长沙 410008||复杂基质样本生物分析湖南省重点实验室,长沙 410000||中南大学湘雅医院,国家老年疾病临床医学研究中心,长沙 410008
  • 2. 中南大学湘雅医院药学部,长沙 410008||复杂基质样本生物分析湖南省重点实验室,长沙 410000
  • 3. 中南大学湘雅医院药学部,长沙 410008||中南大学湘雅医院,国家老年疾病临床医学研究中心,长沙 410008||长沙医学院药学院,长沙 410200
  • 折叠

摘要

Abstract

Peak picking is one of the essential steps in non-targeted metabolomics data preprocessing based on liquid chromatography-mass spectrometry.Among various peak-picking algorithms,centWave algorithm based on continuous wavelet transform has been widely adopted in high-resolution mass spectrometry.In this study,the optimization effects of two centWave parameter optimization algorithms,IPO and centWave Sweep,were compared.Two datasets including metabolite standards and urine were used for comprehensive evaluation of these two algorithms with respect to three indicators:good peak shape ratio,reliable peak ratio,and repeatable peak ratio.To quickly and accurately distinguish good and bad peak shapes,three ensemble learning algorithms,random forest,adaboost and gradient boosting decision tree,were selected to establish a model for distinguishing chromatographic peak shape.Finally,according to the accuracy and F1 score,random forest was selected to establish a discrimination model(Accuracy 93.5%,F1 score 0.938).Compared with recommended parameters of XCMS Online,the proportion of reliable peaks and the proportion of repeatable peaks of two parameter optimization algorithms were improved in different datasets.However,when it came to the proportion of peaks with good shape,there was no significant difference between the optimized parameters and the parameters recommended by XCMS Online in different datasets.Furthermore,all three parameter settings resulted in relatively low proportions of peaks with good shape.The results indicated that the current parameter optimization algorithm was unable to improve the proportion of peaks with good shape.An excessive number of bad shape peaks could not only decrease the statistical power of analysis but also generate false positive results.Therefore,it was critical to perform additional confirmation of potential markers in the practical application of metabolomics researches.

关键词

代谢组学/峰检测/centWave/集成学习

Key words

Metabolomics/Peak-picking/centWave/Ensemble learning

引用本文复制引用

盛阳昊,王珏,蒋跃平..基于色谱峰形优劣的代谢组学峰检测参数优化算法比较[J].分析化学,2024,52(1):130-137,中插44-中插46,11.

基金项目

复杂基质样本生物分析湖南省重点实验室基金项目(No.2017TP1037)资助. Supported by the Hunan Key Laboratory for Bioanalysis of Complex Matrix Samples(No.2017TP1037). (No.2017TP1037)

分析化学

OA北大核心CSTPCD

0253-3820

访问量0
|
下载量0
段落导航相关论文