计算机技术与发展Issue(12):1-4,10,5.DOI:10.3969/j.issn.1673-629X.2013.12.001
超长DNA序列的高效压缩算法研究
An Efficient Compression Algorithm for Huge DNA Sequence
摘要
Abstract
The DNA sequence is composed of only four base,but has lots of data. The effective compression for DNA data can save much time. There are several DNA sequence oriented compression methods like Biocompress,DNACompress and CTW+LZ. These algorithms can achieve good compression ratio,but has sacrificed too much time searching for similar areas. In order to solve the problem,a new al-gorithm Dzip was presented,by means of multiple layers compression techniques like improved RLE,delta encoding,variable integers. In comparison with current DNA sequence oriented compression methods,the standard DNA benchmark results indicate that the new algo-rithm can achieve at least 28 times faster in running time.关键词
双序列比对/DNA数据压缩/可编程门阵列/差分编码/可变长整形Key words
pair wise sequence alignment/DNA data compression/field programmable gate arrays ( FPGA)/delta encoding/variable inte-gers分类
信息技术与安全科学引用本文复制引用
欧阳继超,冯萍,康继昌..超长DNA序列的高效压缩算法研究[J].计算机技术与发展,2013,(12):1-4,10,5.基金项目
国家“863”高技术发展计划项目(2003AA001018) (2003AA001018)
航空科学基金(02F53031) (02F53031)