中国医疗设备2017,Vol.32Issue(5):1-5,27,6.DOI:10.3969/j.issn.1674-1633.2017.05.001
关于全基因组关联研究的自动化元分析初探
Exploring Automated Meta Analyses of Genome-Wide Association Studies
摘要
Abstract
With the rapid development of natural language processing and text mining technology, the step of extracting data from literature began changing from manual extraction to automation by computer. In the past cases, researchers searched entire articles sentence by sentence to looking for key words or key sentences. But the thorough searching without focus points wasted much time. In thispaper, we took genome-wide association study (GWAS) as the example to develop the strategies of data automatics extraction for meta-analysis through clearing the positions of data elements we cared about in the included studies in advance to help computers extract the complete data quickly and accurately by searching only parts of the literature. At the same time, we used a GWAS study about Alzheimer's disease as a case study to search and extract data from all the included studies according to the strategies that we developed. Results showed that our strategies not only shortened the time of extraction, but also kept the success rate and accuracy more than 90%. Our research provided effective strategies and a guide for the research of automatic extraction of GWAS data, which has a promoting effect on the development of meta-analysis to the big data era.关键词
基因关联研究/元分析/数据定位/数据提取/单核苷酸多态性Key words
genome-wide association study/meta-analysis/data location/data automatics extraction/single nucleotide polymorphism分类
生物科学引用本文复制引用
冀燃,李冬果,张大保..关于全基因组关联研究的自动化元分析初探[J].中国医疗设备,2017,32(5):1-5,27,6.基金项目
科技部"973"项目(2014CB744604) (2014CB744604)
北京市教委科技计划面上项目(KM201010025004 ()
KM201410025013) ()
北京市脑重大疾病研究院基金项目(BIBDPXM2014_014226_000016). (BIBDPXM2014_014226_000016)