Journal of Data Acquisition and Processing (数据采集与处理), 2017, Vol. 32, Issue 1: 149-156 (8 pages). DOI: 10.16337/j.1004-9037.2017.01.018
结合规则与语义的中文人称代词指代消解
Coreference Resolution of Chinese Personal Pronouns With Combination of Semantics and Rules
Abstract
Coreference resolution determines whether a pronoun refers to an entity mentioned earlier in a text, and it plays a crucial role in the intelligent processing of massive text information on the Internet. This study addresses coreference resolution for frequently used Chinese personal pronouns and proposes an algorithm that combines semantics with rules. On top of basic filtering rules, the algorithm adopts an improved mechanism specific to apposition. To improve the accuracy of synonym-distance calculation, it identifies the words associated with each personal pronoun and with each candidate antecedent, analyzes the semantic relations between them, and selects the most relevant antecedent, relying on Tongyici Cilin and HowNet. Comparison experiments against other methods and experiments on a real corpus show that the proposed method is more effective and yields a clear improvement.
Keywords
coreference resolution/personal pronouns/rules/candidate antecedents/semantic features
Key words
coreference resolution/personal pronouns/rules/antecedent/semantic relations
Classification
Information Technology and Security Science
Cite this article
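Zhang Wenyan, Li Cunhua, Zhong Zhaoman, Wang Yi, Li Li. Coreference Resolution of Chinese Personal Pronouns With Combination of Semantics and Rules [J]. Journal of Data Acquisition and Processing, 2017, 32(1): 149-156.
Funding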
Supported by the Industrialization Promotion Project of the Jiangsu Provincial Department of Education (JHB2012-61).