|国家科技期刊平台
首页|期刊导航|自动化学报(英文版)|Learning Sequential and Structural Dependencies Between Nucleotides for RNA N6-Methyla-denosine Site Identification

Learning Sequential and Structural Dependencies Between Nucleotides for RNA N6-Methyla-denosine Site IdentificationOACSTPCDEI

Learning Sequential and Structural Dependencies Between Nucleotides for RNA N6-Methyla-denosine Site Identification

英文摘要

N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level.Although a variety of identification algorithms have been pro-posed recently,most of them capture the features of m6A modifi-cation sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences,while ignor-ing the structural dependencies of nucleotides in their three-dimensional structures.To overcome this issue,we propose a cross-species end-to-end deep learning model,namely CR-NSSD,which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification.Specifically,CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporat-ing the position information into single-nucleotide states with chaos game representation theory.It then constructs a cross-domain reconstruction encoder to learn the sequential and struc-tural dependencies between nucleotides.By minimizing the reconstruction and binary cross-entropy losses,CR-NSSD is trained to complete the task of m6A site identification.Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms.Moreover,the results of cross-species prediction indicate that the integration of sequential and struc-tural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species,thus improv-ing the accuracy of cross-species identification.

Guodong Li;Bowei Zhao;Xiaorui Su;Dongxu Li;Yue Yang;Zhi Zeng;Lun Hu

Xinjiang Technical Institute of Physics and Chemistry,Chinese Academy of Science,Urumqi 830011,ChinaCollege of Computer,Xi'an Jiaotong University,Xi'an 710049,China

Cross-domain reconstructioncross-species predic-tionN6-methyladenosine(m6A)modification siteRNA sequencesequential and structural dependencies

《自动化学报(英文版)》 2024 (010)

2123-2134 / 12

This work was supported in part by the National Natural Science Foundation of China(62373348),the Natural Science Foundation of Xinjiang Uygur Autonomous Region(2021D01D05),the Tianshan Talent Training Program(2023TSYCLJ0021),and the Pioneer Hundred Talents Program of Chinese Academy of Sciences.

10.1109/JAS.2024.124233

评论