| 注册
首页|期刊导航|电子学报|基于个性化随机游走的基因-表型关联分析

基于个性化随机游走的基因-表型关联分析

谭好江 王峻 余国先 陈建 郭茂祖

电子学报2024,Vol.52Issue(5):1619-1632,14.
电子学报2024,Vol.52Issue(5):1619-1632,14.DOI:10.12263/DZXB.20220416

基于个性化随机游走的基因-表型关联分析

Individual Random Walks for Gene-Phenotype Association Analysis

谭好江 1王峻 2余国先 3陈建 4郭茂祖5

作者信息

  • 1. 山东大学软件学院,山东济南 250101||山东大学人工智能国际联合研究院,山东济南 250101
  • 2. 山东大学人工智能国际联合研究院,山东济南 250101
  • 3. 山东大学软件学院,山东济南 250101
  • 4. 中国农业大学农学院,北京 100083
  • 5. 北京建筑大学电气与信息工程学院,北京 100044
  • 折叠

摘要

Abstract

Association analysis between genes and phenotypes is crucial to reveal the inherent genetic association of organisms.Random walk-based algorithms can fuse multiple omics data,aggregate the label information of first-order or higher-order neighbors,complete the association information between different nodes in the network,improve the accuracy of association prediction and further discover the potential genetic associations between genes and phenotypes.However,existing random walk algorithms usually treat each node equally and ignore the varying importance of different nodes,as such non-important nodes can be excessively propagated and the model performance is compromised.To this end,an indi-vidual multiple random walks(iMRW)algorithm based on multi-omics data fusion is proposed.On the heterogeneous ge-netic network composed with genes,miRNAs and phenotype nodes,we design the individual multiple random walks strate-gy based on the network topology,assign nodes of different importance with different walking lengths.We then complete the genetic information of different nodes by fusing multi-source association matrix,Gaussian interaction profile kernel sim-ilarity and random walk,and accurately obtain the gene-phenotype association prediction matrix.Under different experi-mental settings,iMRW can achieve the best prediction performance compared with the state-of-the-art algorithms.The case study with respect to maize photosynthetic ability and starch content further confirm the usefulness and effectiveness of iMRW in identifying potential gene-phenotype associations.

关键词

基因-表型关联/随机游走/异质网络/多组学数据融合/网络拓扑结构

Key words

gene-phenotype associations/random walk/heterogeneous network/multi-omics data fusion/network topology

分类

信息技术与安全科学

引用本文复制引用

谭好江,王峻,余国先,陈建,郭茂祖..基于个性化随机游走的基因-表型关联分析[J].电子学报,2024,52(5):1619-1632,14.

基金项目

国家自然科学基金(No.62031003,No.62072380) (No.62031003,No.62072380)

山东大学中央高校基本业务费(No.2020GN061) National Natural Science Foundation of China(No.62031003,No.62072380) (No.2020GN061)

Fundamental Re-search Funds of Shandong University(No.2020GN061) (No.2020GN061)

电子学报

OA北大核心CSTPCD

0372-2112

访问量0
|
下载量0
段落导航相关论文