基于多特征条件随机场的《金匮要略》症状药物信息抽取研究OA
Research on Symptom and Medicine Information Abstraction of TCM Book Jin Gui Yao Lue Based on Conditional Random Field
目的:结合自然语言处理方法,研究可以有效抽取中医古籍中所含症状和药物文本实体信息的方法。方法以《金匮要略》为例,采用条件随机场(CRF)算法,先将文本进行分词处理,然后以词性、基于键值对的中医诊断标记集作为辅助特征,通过症状-药物 BIO 标签为训练特征来训练出模型,然后利用该模型对测试集文本进行自动标签标注。结果基于多特征 CRF 自动标注的结果准确率达到84.5%,召回率达到70.9%,F测度值达到77.1%。结论运用CRF方法加入词性、中医…查看全部>>
Objective To find an efficient way to abstract symptoms and medicine information from TCM book Jin Gui Yao Lue through combination of natural language processing method. Methods Taking Jin Gui Yao Lue as an example and by using conditional random fields (CRF), texts were processed according to words, and then part of speech and key assignments based on TCM diagnosis marker group were set as auxiliary features. Symptom-medicine BIO labels were set as the trai…查看全部>>
叶辉;姬东鸿
广州中医药大学,广东 广州 510016武汉大学,湖北 武汉 430007
医药卫生
条件随机场《金匮要略》症状药物信息抽取中医古籍
conditional random fields (CRF)Jin Gui Yao Luesymptom and medicine information abstractionancient TCM books
《中国中医药图书情报杂志》 2016 (5)
14-17,4
2014广东省中医药局建设中医药强省科研课题(20141073);广东财政专项(2013170)
评论