计算机与现代化Issue(9):232-234,238,4.DOI:10.3969/j.issn.1006-2475.2012.09.060
多路数据源的蛋白质搜索信息整合方法
Integrated Method of Information Search for Protein from Different Resource of Database
陈雅琦 1朱斐1
作者信息
- 1. 苏州大学计算机科学与技术学院,江苏苏州215006
- 折叠
摘要
Abstract
With the rapid development of biological science and technology, people know more and more about the basic substance, protein. However, the more information, the more difficulties people meet in searching. To get it efficient and precise message of protein, this paper designs a method to extract the information through NCBI and Binding DB for example. The method is about obtaining efficient information with no redundancy. It extracts keywords to form bigram from the information entry which is searched, and then divides it into groups. In each group, the detailed keyword extraction and grouping information is done, and cycles the processes till N-gram is generated, so that it achieves the purpose of getting rid of redundancy and integrates the ordering of information.关键词
蛋白质搜索/关键字/二元组/信息整合/NCBI/Binding DBKey words
information search for protein/keywords/bigram/information integration/NCBI/Binding DB分类
信息技术与安全科学引用本文复制引用
陈雅琦,朱斐..多路数据源的蛋白质搜索信息整合方法[J].计算机与现代化,2012,(9):232-234,238,4.