Automatic identification of TCM terminology in Shanghan Lun based on conditional random fields

Journal of Beijing University of Traditional Chinese Medicine (北京中医药大学学报), Issue (9): 587-590, 4. DOI: 10.3969/j.issn.1006-2157.2015.09.006

Automatic identification of TCM terminology in Shanghan Lun based on conditional random field

Meng Hongyu (孟洪宇) 1, Xie Qingyu (谢晴宇) 2, Chang Hong (常虹) 3, Meng Qinggang (孟庆刚) 1

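Author information

  • 1. School of Basic Medicine, Beijing University of Chinese Medicine, Beijing 100029
  • 2. Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences
  • 3. Baotou Medical College, Inner Mongolia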

Abstract

Objective: To explore methods for the automatic identification of TCM terminology and to expand the forms of natural language processing applied to TCM documents. Methods: Using conditional random fields (CRF), terms for symptoms, diseases, pulse types, and prescriptions recorded in Shanghan Lun were annotated and automatically identified. The effects of different combinations of features (the Chinese character itself, part of speech, word boundary, and term category label) on terminology identification were analyzed, and the most effective combination was selected. Results: The TCM terminology identification model that combined the Chinese character itself, part of speech, word boundary, and term category label achieved a precision of 85.00%, a recall of 68.00%, and an F score of 75.56%. Conclusion: The multi-feature model combining the Chinese character itself, part of speech, word boundary, and term category label achieved the best identification result of all the combinations tested.
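As a rough illustration of the setup the abstract describes, the Python sketch below expands segmented words into one feature row per Chinese character and checks the reported F score from the reported precision and recall. It is not the authors' implementation: the function names, the POS tags, the B/M/E/S boundary scheme, and the example phrase are all assumptions made for illustration, and the abstract does not say how the term category label enters the model, so it is left out here.

```python
from typing import Dict, List, Tuple

# Hypothetical input layout: (segmented word, part-of-speech tag).
# The tag names are illustrative and are not taken from the paper.
Token = Tuple[str, str]


def char_features(tokens: List[Token]) -> List[Dict[str, str]]:
    """Expand segmented words into one feature dict per character."""
    rows: List[Dict[str, str]] = []
    for word, pos in tokens:
        for i, ch in enumerate(word):
            # Assumed boundary scheme: B(egin), M(iddle), E(nd), S(ingle).
            if len(word) == 1:
                boundary = "S"
            elif i == 0:
                boundary = "B"
            elif i == len(word) - 1:
                boundary = "E"
            else:
                boundary = "M"
            rows.append({"char": ch, "pos": pos, "boundary": boundary})
    return rows


def f_score(precision: float, recall: float) -> float:
    """Balanced F score: the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative phrase only; the POS tags are made up for this sketch.
features = char_features([("太阳病", "n"), ("脉", "n"), ("浮", "a")])

# Precision and recall as reported in the abstract.
reported_f = round(100 * f_score(0.85, 0.68), 2)
```

With precision 0.85 and recall 0.68, the harmonic mean works out to 75.56%, which matches the F score reported in the abstract. Rows like those in `features` are the kind of per-character input that a CRF toolkit would typically consume, with a term label attached to each character.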

Key words

TCM terminology/conditional random fields/Shanghan Lun/automatic identification

Classification

Medicine and Health

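Cite this article

Meng Hongyu, Xie Qingyu, Chang Hong, Meng Qinggang. Automatic identification of TCM terminology in Shanghan Lun based on conditional random field [J]. Journal of Beijing University of Traditional Chinese Medicine, 2015, (9): 587-590, 4.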

Funding

National Natural Science Foundation of China project ()

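Journal of Beijing University of Traditional Chinese Medicine (北京中医药大学学报)

Open access; indexed in the Peking University Core Journals list (北大核心), CSCD, and CSTPCD

ISSN 1006-2157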