肿瘤防治研究 (Cancer Research on Prevention and Treatment), 2026, Vol. 53, Issue 4: 251-266, 16. DOI: 10.3971/j.issn.1000-8578.2026.25.0694
基于单细胞与机器学习的三阴性乳腺癌上皮细胞亚群解析与预后模型构建
Single-Cell and Machine Learning-Based Identification of Epithelial Subsets and Prognostic Modeling in Triple-Negative Breast Cancer
Abstract
Objective: To investigate the heterogeneity and key molecular features of epithelial cells in triple-negative breast cancer (TNBC), identify prognostic biomarkers, and develop a robust survival prediction model.
Methods: Using TNBC single-cell transcriptomic data, epithelial cells were extracted, normalized, and subclustered to characterize their molecular signatures and functional differences. High-dimensional weighted gene co-expression network analysis (hdWGCNA) was applied to establish co-expression modules in epithelial cells. Multiple machine learning algorithms were integrated to select key prognostic genes and develop a risk-score model, whose performance was evaluated using receiver operating characteristic (ROC) curves and Kaplan-Meier (K-M) survival analysis. In addition, the immune microenvironment features and potential drug-response differences between the high- and low-risk groups were systematically assessed. Finally, PCR was performed to validate the expression differences of the key genes between tumor and normal tissues.
Results: We characterized the composition and molecular features of TNBC epithelial subpopulations and identified a TNBC-associated epithelial subset. By integrating hdWGCNA with machine learning approaches, 10 key genes were selected to construct a prognostic model, which effectively stratified patients into distinct survival-risk groups and demonstrated favorable predictive performance in ROC and K-M analyses. Immune profiling revealed differences between the high- and low-risk groups in the infiltration levels of seven immune cell types and in immune function-related features. Drug-sensitivity analysis suggested potential differential responses to eight agents across the risk groups. PCR validation further confirmed the differential expression of the 10 signature genes between tumor and normal tissues.
Conclusion: This study reveals epithelial heterogeneity in TNBC at single-cell resolution and establishes a 10-gene prognostic model, which may facilitate risk stratification in TNBC and the evaluation of immune characteristics and potential therapeutic strategies.
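As an illustration of the risk-score stratification and K-M analysis mentioned in the Methods, the following is a minimal Python sketch. The abstract does not give the model formula, the gene names, or the cutoff, so the sketch assumes a linear risk score (sum of coefficient times expression) and a median cutoff, and it runs on toy data. `GENE_A`, `GENE_B`, the coefficients, and the follow-up values are hypothetical placeholders, not the study's 10-gene signature.

```python
from statistics import median


def risk_scores(expression, coefficients):
    """Linear risk score per sample: sum of coefficient * expression over the signature genes."""
    return [sum(coefficients[g] * sample[g] for g in coefficients) for sample in expression]


def stratify(scores):
    """Label a sample 'high' if its score exceeds the cohort median, otherwise 'low'."""
    cutoff = median(scores)
    return ["high" if s > cutoff else "low" for s in scores]


def kaplan_meier(times, events):
    """Kaplan-Meier estimate as [(time, survival probability)] at each distinct event time.

    times: follow-up durations; events: 1 = event observed, 0 = censored.
    """
    curve = []
    survival = 1.0
    for t in sorted({ti for ti, e in zip(times, events) if e == 1}):
        at_risk = sum(1 for ti in times if ti >= t)
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e == 1)
        survival *= 1 - deaths / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical toy cohort (assumption: two placeholder genes instead of the 10-gene signature).
coefficients = {"GENE_A": 0.5, "GENE_B": -0.25}
expression = [
    {"GENE_A": 4.0, "GENE_B": 2.0},
    {"GENE_A": 1.0, "GENE_B": 4.0},
    {"GENE_A": 3.0, "GENE_B": 0.0},
    {"GENE_A": 0.0, "GENE_B": 2.0},
]
times = [5, 12, 8, 20]   # follow-up in arbitrary units
events = [1, 0, 1, 1]    # 1 = event observed, 0 = censored

scores = risk_scores(expression, coefficients)
groups = stratify(scores)

# One K-M curve per risk group.
for label in ("high", "low"):
    idx = [i for i, g in enumerate(groups) if g == label]
    curve = kaplan_meier([times[i] for i in idx], [events[i] for i in idx])
    print(label, curve)
```

In practice the coefficients would come from the fitted model, and the two curves would be compared with a log-rank test; neither step is shown here.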
Key words
Triple-negative breast cancer/Epithelial cell/Machine learning/Prognostic model/scRNA-seq/hdWGCNA
Classification
Medicine and Health
Cite this article
吴金鹏, 郭雪, 刘恩谷, 林峰, 李鸿涛. 基于单细胞与机器学习的三阴性乳腺癌上皮细胞亚群解析与预后模型构建[J]. 肿瘤防治研究, 2026, 53(4): 251-266, 16.
Funding
"Tianshan Talents Cultivation Program" Technology and Innovation Leading Talents Project (No. 2023TSYCLJ0039)