高技术通讯, 2012, Vol. 22, Issue 2: 165-170, 6. DOI: 10.3772/j.issn.1002-0470.2012.02.009
基于语义的文本地理范围提取方法
A semantics-based method for extracting geographic scopes of texts
Abstract
To process geographic information in Web pages, this paper presents a novel method for extracting the geographic scopes of documents. It assigns a multi-scale geographic scope to a document through a three-stage process that handles geographic semantics. First, the toponyms in a document are recognized with the support of a geographic knowledge base. Second, ambiguous toponyms are disambiguated using both geographic and non-geographic semantics, and the evidence for disambiguation is combined using evidence theory. Finally, a geo-referenced tree is constructed based on a cognitive theory, and the geographic foci are obtained according to semantic relationships, which determines the geographic location of the document. The method was implemented in GeoSearcher, a prototype system for geographic information retrieval. The evaluation results show that the proposed method achieves relatively high accuracy.
Keywords
geographic information retrieval (GIR) / geographic scope of texts / evidence theory
Cite this article
Zhang Yi, Wang Xingguang, Chen Min, Liu Yu. A semantics-based method for extracting geographic scopes of texts [J]. 高技术通讯, 2012, 22(2): 165-170, 6.
Funding
Supported by the 863 Program (2007AA120502) and the National Natural Science Foundation of China (41171296).