| 注册
首页|期刊导航|计算机应用与软件|双层 CRF与规则相结合的中文地名识别方法研究

双层 CRF与规则相结合的中文地名识别方法研究

孙虹 陈俊杰

计算机应用与软件Issue(11):175-177,182,4.
计算机应用与软件Issue(11):175-177,182,4.DOI:10.3969/j.issn.1000-386x.2014.11.043

双层 CRF与规则相结合的中文地名识别方法研究

RESEARCH ON CHINESE TOPONYM RECOGNITION METHOD WITH TWO-LAYER CRF AND RULES COMBINATION

孙虹 1陈俊杰1

作者信息

  • 1. 太原理工大学科学与技术学院 山西 太原 030024
  • 折叠

摘要

Abstract

We use a method which is based on the combination of two-layer CRF model and rules to improve the performance of Chinese toponym recognition.The first layer of CRF model uses the single character feature to recognise the placenames, and adds the recognition results to the dictionary.The second layer of CRF model recognises the placenames by using four features including the part of speech, the word referring the left word boundary, the word referring the right word boundary and the processed dictionary characteristics.Finally, rules are utilised to filtering, trimming and supplementing the recognition result.Through two-layer CRF model to acquire long-distance feature of the text, we solve the problem of inconsistent markup of the same word due to its different position, and the recall rate is increased by combining the rules made according to the features of the toponymic linguistics.Experiment shows that the method of combining the two-layer CRF with the rules achieves preferable good effect on Chinese toponym recognition, and the open test on MSRA corpus of the Bakeoff 2007 reaches the accuracy of 95.32%, recall rate of 90.34%and F number of 94.12%respectively.

关键词

自然语言处理/中文地名识别/双层CRF模型/规则

Key words

Natural language processing/Chinese toponym recognition/Two-layer/CRF model Rules

分类

信息技术与安全科学

引用本文复制引用

孙虹,陈俊杰..双层 CRF与规则相结合的中文地名识别方法研究[J].计算机应用与软件,2014,(11):175-177,182,4.

基金项目

国家重点开放实验室课题项目( SKLSE 2012-09-30)。 ()

计算机应用与软件

OACSCDCSTPCD

1000-386X

访问量0
|
下载量0
段落导航相关论文