计算技术与自动化Issue(4):132-136,5.
基于条件随机场的人物信息抽取
Character Information Extraction Based on Conditional Random Fields
郑轶1
作者信息
- 1. 东北石油大学 计算机与信息技术学院,黑龙江 大庆 163318
- 折叠
摘要
Abstract
This paper considered the character information extraction from the Baike HTML as a sequence labeling ques-tion,and used CRFs to label the raw data.This paper also detailed the methods of data analysis and feature selection,and the way to extract information from the raw data directly,which do not contain the data preprocessing part and the sentence parser part.By this way,it developed the efficiency of information extraction effectively.And two comparable tests show that the method proposed can extract the character information from the row HTML accurately.关键词
CRFs/人物/人物信息/信息抽取Key words
CRFs/CRF/character/information extraction分类
信息技术与安全科学引用本文复制引用
郑轶..基于条件随机场的人物信息抽取[J].计算技术与自动化,2015,(4):132-136,5.