| 注册
首页|期刊导航|中国医学科学院学报|基于机器学习分析的浸润性乳腺癌蛋白质编码基因的标志物鉴定

基于机器学习分析的浸润性乳腺癌蛋白质编码基因的标志物鉴定

武乐 闵开元 柳江枫 梁万丰 杨晔宏 胡刚 杨俊涛

中国医学科学院学报2024,Vol.46Issue(2):147-153,7.
中国医学科学院学报2024,Vol.46Issue(2):147-153,7.DOI:10.3881/j.issn.1000-503X.15717

基于机器学习分析的浸润性乳腺癌蛋白质编码基因的标志物鉴定

Identification of Protein-Coding Gene Markers in Breast Invasive Carcinoma Based on Machine Learning

武乐 1闵开元 2柳江枫 2梁万丰 1杨晔宏 2胡刚 1杨俊涛2

作者信息

  • 1. 南开大学统计与数据科学学院, 天津 300071
  • 2. 中国医学科学院 北京协和医学院 基础医学研究所重大疾病共性机制研究全国重点实验室, 北京 100005
  • 折叠

摘要

Abstract

Objective To screen out the biomarkers linked to prognosis of breast invasive carcinoma based on the analysis of transcriptome data by random forest(RF),extreme gradient boosting(XGBoost),light gradient boosting machine(LightGBM),and categorical boosting(CatBoost).Methods We obtained the ex-pression data of breast invasive carcinoma from The Cancer Genome Atlas and employed DESeq2,t-test,and Cox univariate analysis to identify the differentially expressed protein-coding genes associated with survival prog-nosis in human breast invasive carcinoma samples.Furthermore,RF,XGBoost,LightGBM,and CatBoost mod-els were established to mine the protein-coding gene markers related to the prognosis of breast invasive cancer and the model performance was compared.The expression data of breast cancer from the Gene Expression Omnibus was used for validation.Results A total of 151 differentially expressed protein-coding genes related to survival prog-nosis were screened out.The machine learning model established with C3orf80,UGP2,and SPC25 demonstrated the best performance.Conclusions Three protein-coding genes(UGP2,C3orf80,and SPC25)were screened out to identify breast invasive carcinoma.This study provides a new direction for the treatment and diagnosis of breast invasive carcinoma.

关键词

浸润性乳腺癌/生物标志物/蛋白质编码基因/UGP2/C3orf80/SPC25

Key words

breast invasive carcinoma/biomarker/protein-coding genes/UGP2/C3orf80/SPC25

分类

医药卫生

引用本文复制引用

武乐,闵开元,柳江枫,梁万丰,杨晔宏,胡刚,杨俊涛..基于机器学习分析的浸润性乳腺癌蛋白质编码基因的标志物鉴定[J].中国医学科学院学报,2024,46(2):147-153,7.

基金项目

国家自然科学基金(31970649)和中国医学科学院医学与健康科技创新工程(CIFMS2021-I2M-1-057、CIFMS2021-12M-1-001) (31970649)

中国医学科学院学报

OA北大核心CSTPCDMEDLINE

1000-503X

访问量0
|
下载量0
段落导航相关论文