电子学报2024,Vol.52Issue(5):1619-1632,14.DOI:10.12263/DZXB.20220416
基于个性化随机游走的基因-表型关联分析
Individual Random Walks for Gene-Phenotype Association Analysis
摘要
Abstract
Association analysis between genes and phenotypes is crucial to reveal the inherent genetic association of organisms.Random walk-based algorithms can fuse multiple omics data,aggregate the label information of first-order or higher-order neighbors,complete the association information between different nodes in the network,improve the accuracy of association prediction and further discover the potential genetic associations between genes and phenotypes.However,existing random walk algorithms usually treat each node equally and ignore the varying importance of different nodes,as such non-important nodes can be excessively propagated and the model performance is compromised.To this end,an indi-vidual multiple random walks(iMRW)algorithm based on multi-omics data fusion is proposed.On the heterogeneous ge-netic network composed with genes,miRNAs and phenotype nodes,we design the individual multiple random walks strate-gy based on the network topology,assign nodes of different importance with different walking lengths.We then complete the genetic information of different nodes by fusing multi-source association matrix,Gaussian interaction profile kernel sim-ilarity and random walk,and accurately obtain the gene-phenotype association prediction matrix.Under different experi-mental settings,iMRW can achieve the best prediction performance compared with the state-of-the-art algorithms.The case study with respect to maize photosynthetic ability and starch content further confirm the usefulness and effectiveness of iMRW in identifying potential gene-phenotype associations.关键词
基因-表型关联/随机游走/异质网络/多组学数据融合/网络拓扑结构Key words
gene-phenotype associations/random walk/heterogeneous network/multi-omics data fusion/network topology分类
信息技术与安全科学引用本文复制引用
谭好江,王峻,余国先,陈建,郭茂祖..基于个性化随机游走的基因-表型关联分析[J].电子学报,2024,52(5):1619-1632,14.基金项目
国家自然科学基金(No.62031003,No.62072380) (No.62031003,No.62072380)
山东大学中央高校基本业务费(No.2020GN061) National Natural Science Foundation of China(No.62031003,No.62072380) (No.2020GN061)
Fundamental Re-search Funds of Shandong University(No.2020GN061) (No.2020GN061)