基于生物信息学的克罗恩病中炎症相关关键基因的筛选及验证OA北大核心CSTPCD
Screening and validation of key inflammation-related genes in Crohn's disease based on bioinformatics
目的 应用生物信息学方法筛选并验证炎症相关基因在克罗恩病(CD)中的表达情况.方法 分别从基因表达综合(GEO)数据库和分子特征数据库(Msigdb)下载CD数据集(GSE193677、GSE66407和GSE179285)和炎症反应相关基因(IRGs).利用GSE193677数据集进行单样本基因集富集分析(ssGSEA),明确CD和健康对照人群中免疫细胞和炎症水平;然后利用DESeq2方法于CD数据集进行基因差异表达分析,以|log2FC|≥1且校正P<0.05标准筛选获得CD患者和健康样本之间的差异表达基因(DEGs).DEGs和IRGs取交集获得炎症相关差异表达基因(IRDEGs).对IRDEGs进行基因本体(GO)和京都基因和基因组数据库(KEGG)通路富集分析,利用STRING数据库进行蛋白质-蛋白质相互作用(PPI)网络分析.利用Cytoscape软件的cyto-Hubba插件筛选确定关键IRDEGs.利用GSE66407和GSE179285数据集对CD组与健康对照组、炎症组及非炎症组关键IRDEGs的表达进行验证.结果 GSE193677数据集中ssGSEA分析表明,CD患者炎症细胞表达明显高于健康对照(P<0.001),免疫细胞中树突状细胞(P<0.001)、中性粒细胞(P<0.001)和巨噬细胞(P<0.05)水平等也明显上调.通过差异表达分析获得450个DEGs,其中378个上调,72个下调.将DEGs和IRGs(200个)取交集,获得24个IRDEGs.GO和KEGG富集分析显示IRDEGs在T细胞增殖、CXCR受体通路、丝氨酸蛋白抑制剂、IL-17信号等通路中显著富集.PPI分析和Cytoscape筛选确定了7个关键IRDEGs,包括IL1A、SELE、CXCL11、CSF3、CXCL9、CXCL10和CCL7.7个关键IRDEGs均在不同程度上与临床病情活动度呈正相关(均P<0.05).验证结果显示,GSE66407数据集中,CD组和炎症组IL1A、SELE、CXCL11、CXCL9、CXCL10和CCL7表达均明显上调(均P<0.01);GSE179285数据集中,CD组SELE、CXCL11、CSF3、CXCL9、CXCL10和CCL7表达均高于健康对照组(均P<0.01),炎症组7个关键IRDEGs表达水平均高于非炎症组(均P<0.05).结论 本研究明确了炎症相关基因在CD中的表达情况;确认了IL1A、SELE、CXCL11、CSF3、CXCL9、CXCL10和CCL77个关键IRDEGs,为CD的临床诊治提供了有利支持.
Objective To screen and verify the expression of inflammation-related genes in Crohn's disease(CD)using bioinformatics methods.Methods Datasets of CD(GSE193677,GSE66407,and GSE179285)and inflammatory-related genes(IRGs)were down-loaded from the Gene Expression Omnibus(GEO)database and Molecular Signatures Database(MSigDB),respectively.First,single sample gene set enrichment analysis(ssGSEA)was performed on the GSE193677 dataset to determine the levels of immune cells and inflammation in patients with CD and healthy individuals.Next,DESeq2 was used to perform differential gene expression(DGE)analysis on the CD dataset.Differentially expressed genes(DEGs)between patients with CD patients and healthy individuals were identified using|log2FC|≥1 and adjusted for P<0.05.The intersections of DEGs and IRGs were used to obtain inflammation-related differentially expressed genes(IRDEGs).Gene Ontology(GO)and Kyoto Encyclopedia of Genes and Genomes(KEGG)enrichment analyses were performed for the IRDEGs.STRING and Cytoscape software were used to perform protein-protein interaction(PPI)network analyses.Fur-thermore,the cytoHubba plug-in of Cytoscape software was used to select and identify key IRDEGs.The variations in expression of key IRDEGs between patients with CD and healthy individuals and between inflammatory and non-inflammatory groups were verified through the GSE66407 and GSE179285 datasets.Results ssGSEA analyses of the GSE193677 dataset showed that the expression of inflamma-tory cells in patients with CD was significantly higher than that in healthy controls(P<0.001).Immune cell levels of dendritic cells(P<0.001),neutrophils(P<0.001),and macrophages(P<0.05)were also significantly up-regulated.Through DGE analysis,450 DEGs were identified,including 378 up-regulated and 72 down-regulated genes.The intersection of 200 IRGs identified from the DEG database obtained 24 IRDEGs.GO and KEGG enrichment analyses showed that the IRDEGs were significantly enriched in T cell prolifera-tion,the CXC chemokine receptor(CXCR)pathway,serine protein inhibition,and interleukin(IL)-17 signaling pathways.PPI analyses and Cytoscape software identified seven key IRDEGs,namely IL-1α,selectin E(SELE),C-X-C motif chemokine ligand(CXCL)11,colony stimulating factor(CSF)3,CXCL9,CXCL10,and chemokine CC motif ligand(CCL)7.The seven key IRDEGs showed positive correla-tion with clinical disease activity to varying degrees(P<0.05).Validation showed that,in the GSE66407 dataset,IL1A,SELE,CXCL11,CXCL9,CXCL10,and CCL7expression levels were significantly up-regulated in the CD and inflammation groups(all P<0.01).In the GSE179285 dataset,SELE,CXCL11,CSF3,CXCL9,CXCL10,and CCL7expression levels in the CD group was higher than those in the healthy control group(all P<0.01).The expression levels of the seven key IRDEGs in the inflammatory group was higher than those in the non-inflammatory group(all P<0.05).Conclusion The present study verified the expression of inflammation-related genes in CD and identified seven key IRDEGs,including IL1A,SELE,CXCL11,CSF3,CXCL9,CXCL10,and CCL7,which provides support for future research in the diagnosis and treatment of patients with CD.
于明鑫;杨爱明
中国医学科学院北京协和医学院北京协和医院消化内科,北京 100730
临床医学
克罗恩病炎症反应差异表达基因生物信息学
Crohn's diseaseinflammatory responsedifferentially expressed genesbioinformatics
《中国医科大学学报》 2024 (004)
324-331 / 8
国家临床重点专科项目(ZK108000);中央高水平医院临床研究专项专科提升项目(2022-PUMCH-B-024);中央高水平医院临床研究专项重点项目(2022-PUMCH-C-063)
评论