Position filtering method for tandem repeat sequence alignment

温华铭 徐云 杨金宝

Application Research of Computers (计算机应用研究), 2024, Vol. 41, Issue 7: 2160-2164, 5. DOI: 10.19734/j.issn.1001-3695.2023.12.0614

Position filtering method for tandem repeat sequence alignment

温华铭 1, 徐云 1, 杨金宝 2

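Author information

  • 1. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027; Anhui Province Key Laboratory of High Performance Computing, Hefei 230027
  • 2. College of Informatics, Huazhong Agricultural University, Wuhan 430070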

Abstract

Tandem repeat sequences are a difficult part of genome construction. Because repeat units are highly similar and copy numbers are ambiguous, such sequences often yield multiple candidate positions during sequence alignment. The challenge lies in rapidly and accurately filtering these candidates down to the correct alignment positions. Existing methods address this issue by using seeds (short sequences selected from sequencing fragments) to locate and extend candidate alignment positions, but they overlook the distinctive characteristics of tandem repeat sequences when selecting seeds. To tackle this problem, this paper proposes a position filtering method for tandem repeat sequence alignment, which filters alignment results by calculating the similarity of rare k-mer sequences. Additionally, it implements a strategy of merging rare k-mers to expedite computation, coupled with a fuzzy search based on edit distance to enhance filtering information density. Experimental results demonstrate that this approach improves both the recall and accuracy of alignment results on simulated datasets while achieving approximately a 2-fold increase in computational speed compared to existing methods, with notable parallel acceleration effects.
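
The idea of filtering candidate positions by rare k-mer similarity can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the rarity threshold `max_count`, the scoring rule (counting rare reference k-mers that have a fuzzy match in the read), and the toy sequences are all assumptions made for illustration, and the k-mer merging strategy mentioned in the abstract is not attempted here.

```python
from collections import Counter


def kmers(seq, k):
    """Return all overlapping k-mers of seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def rare_kmers(reference, k, max_count=1):
    """k-mers occurring at most max_count times in the reference (assumed rarity rule)."""
    counts = Counter(kmers(reference, k))
    return {km for km, c in counts.items() if c <= max_count}


def edit_distance(a, b):
    """Levenshtein distance computed with a rolling dynamic-programming row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # match or substitution
        prev = cur
    return prev[-1]


def score_position(read, reference, pos, rare, k, max_edits=1):
    """Count rare k-mers in the reference window at pos that fuzzily match a read k-mer."""
    window = reference[pos:pos + len(read)]
    read_kmers = set(kmers(read, k))
    return sum(
        1
        for km in kmers(window, k)
        if km in rare and any(edit_distance(km, rk) <= max_edits for rk in read_kmers)
    )


def filter_positions(read, reference, candidates, k=4, max_edits=1):
    """Keep only the candidate positions with the highest rare k-mer score."""
    if not candidates:
        return []
    rare = rare_kmers(reference, k)
    scores = {p: score_position(read, reference, p, rare, k, max_edits) for p in candidates}
    best = max(scores.values())
    return [p for p in candidates if scores[p] == best]


# Toy example (hypothetical data): unique flanks around five copies of "ACG".
reference = "GATTC" + "ACG" * 5 + "TTAGC"
# Read sampled from reference[14:25] with one simulated substitution error (G -> T).
read = "ACGACGTTATC"
print(filter_positions(read, reference, [5, 8, 11, 14]))  # -> [14]
```

In this toy case the repeat k-mers (`ACGA`, `CGAC`, `GACG`) occur at every candidate position and carry no positional signal, so only the rare k-mers from the right flank separate position 14 from the others. Allowing one edit lets the two rare k-mers affected by the simulated error still count as matches.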

Key words

tandem repeat/single molecule real-time sequencing/sequence alignment/seed-and-extend method

Classification

Information technology and security science

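Cite this article

温华铭, 徐云, 杨金宝. Position filtering method for tandem repeat sequence alignment [J]. Application Research of Computers (计算机应用研究), 2024, 41(7): 2160-2164, 5.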

Funding

National Natural Science Foundation of China, General Program (61672480)

111 Project of the State Administration of Foreign Experts Affairs (BP0719016)

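Journal: Application Research of Computers (计算机应用研究), ISSN 1001-3695; open access (OA); indexed in 北大核心 (Peking University Core Journals) and CSTPCD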
