情报杂志Issue(12):130-134,64,6.
一种基于领域本体的药品研发信息抽取方法
A Drugs R&D Information Extraction Method Based on Domain Ontology
摘要
Abstract
Taking annual report as a carrier, this paper proposes an information extraction method for biological companies' drugs R&D in-formation based on domain ontology. First, the domain ontology dictionaries according to the basic process of drugs R&D is constructed and the process of times words and negative words after preprocessing the sample PDF documents is introduced in detail. Then, extracting and normalizing drugs R&D information are done by using mapping principle as well as trigger, inheritance, and selection mechanisms. Fi-nally, computing precision ratio and recall ratio based on the results of information extraction proves the validity of the method.关键词
信息抽取/领域本体/映射原理/生物医药公司/药品研发/年度报告Key words
information extraction/domain ontology/mapping principle/biological companies/drugs R&D/annual report分类
社会科学引用本文复制引用
蒋艳辉,姚靠华,周双文,王薇..一种基于领域本体的药品研发信息抽取方法[J].情报杂志,2012,(12):130-134,64,6.基金项目
国家自然科学基金湖南大学青年教师基金项目“基于语义的上市公司年报文本信息质量测度方法及应用”(编号:71201052) (编号:71201052)