计算机应用研究2018,Vol.35Issue(4):1072-1075,4.DOI:10.3969/j.issn.1001-3695.2018.04.024
基于上下文信息的中文命名实体消歧方法研究
Chinese named entity disambiguation method research based on context information
摘要
Abstract
In the process of semantic annotation,in order to eliminate the ambiguity problem of the text in a given named entity and the mapping of the knowledge base entities,this paper put forward a context based semantic similarity value of the sorted named entity disambiguation method.Disambiguation method included three sections that entity preprocessing,constructing candidate list of entities and similarity value ranking algorithms.In view of the problem of the named entity reference multiplicity,it used the new entity to represent the preprocess method to extract the standard entity.Then it used the online encyclopedia in Chinese to construct the semantic knowledge base,and got the semantic list of standard entities.At the same time,this paper also put forward using the similarity value ranking method for solving standard substance and semantic list mapping referential ambiguity problem,for in the knowledge base not found semantic entity disambiguation processing by clustering algorithm.The results of the experiment show that the proposed method can effectively reflect the real data set of Chinese Web pages to the corresponding non-ambiguous entities in the knowledge base.关键词
命名实体/语义知识库/聚类/语义列表Key words
named entity/semantic knowledge base/clustering/semantic list分类
信息技术与安全科学引用本文复制引用
王旭阳,姜喜秋..基于上下文信息的中文命名实体消歧方法研究[J].计算机应用研究,2018,35(4):1072-1075,4.基金项目
国家自然科学基金资助项目(61563030) (61563030)