华中师范大学学报(自然科学版)2018,Vol.52Issue(3):316-321,6.DOI:10.19603/j.cnki.1000-1190.2018.03.005
基于主题标签和CRF的中文微博命名实体识别
Named entity recognition of Chinese microblog based on theme tag and CRF
摘要
Abstract
In recent years,the rapid development of network mediamicro blog provides a new carrier for the research of named entity recognition.Considering Chinese micro-blog text is short,Chinese micro-blog expression is not clear,Chinese micro blog is seriously networked and so on,the paper proposed a named entity recognition method of Chinese microblog combined of rules and statistics.Firstly,the proposed method uses the theme tag of Chinese microblog to filter the processed data,then devised feature templates for recognition method based on conditional random fields.In order to meet the requirements of the experiment,this paper combines the traditional web crawler and API method to collect data.Experimental results show that the proposed method can effectively improve the effectiveness of named entity recognition of Chinese microblog.关键词
命名实体/中文微博/主题标签/条件随机场Key words
named entity recognition/Chinese microblog/conditional random fields (CRF)分类
信息技术与安全科学引用本文复制引用
朱颢东,杨立志,丁温雪,冯嘉美..基于主题标签和CRF的中文微博命名实体识别[J].华中师范大学学报(自然科学版),2018,52(3):316-321,6.基金项目
河南省科技计划项目(152102210149,152102210357) (152102210149,152102210357)
河南省高等学校青年骨干教师资助计划项目(2014GGJS-084) (2014GGJS-084)
河南省高等学校重点科研项目(16A520030) (16A520030)
郑州轻工业学院校级青年骨干教师培养对象资助计划项目(XGGJS02) (XGGJS02)
郑州轻工业学院博士科研基金资助项目(2010BSJJ038). (2010BSJJ038)