计算机应用与软件2017,Vol.34Issue(4):11-15,41,6.DOI:10.3969/j.issn.1000-386x.2017.04.003
基于词向量的中文微博实体链接方法
ENTITY LINKING METHOD OF CHINESE MICRO-BLOG BASED ON WORD VECTOR
摘要
Abstract
Entity linking refers to a given entity referring to an item and its text, linking it to a target entity in a given knowledge base.Due to the characteristics of micro-blog content sparse, non-standard terms, the use of traditional methods less effective.In order to accurately link to a given entity in microblogging, a method based on word vector for Chinese microblogging entity linking is proposed.First, the knowledge base is extended, and synonyms are extracted from the Chinese Wikipedia to construct the synonym list.Then, using the word vector to solve typos and foreign name transliteration problem.Finally, the entity link is calculated by computing the semantic similarity between the entity and the candidate entity.The experimental results show that the micro-averaged accuracy of the proposed method is 91.4% on the NLP&CC2013 evaluation data.关键词
实体链接/词向量/维基百科/同义词Key words
Entity linking/Word vector/Wikipedia/Synonyms分类
信息技术与安全科学引用本文复制引用
毛二松,王波,唐永旺,梁丹..基于词向量的中文微博实体链接方法[J].计算机应用与软件,2017,34(4):11-15,41,6.基金项目
国家社会科学基金项目(14BXW028). (14BXW028)