计算机工程与应用2016,Vol.52Issue(15):97-100,4.DOI:10.3778/j.issn.1002-8331.1512-0299
基于条件随机场的中文领域分词研究
Chinese word segmentation research based on Conditional Random Field
摘要
Abstract
According to the Conditional Random Field for Chinese word segmentation, the field is hard to adaptive. A combination of CRF and domain dictionary is proposed to improve the field adaptability, and for eliminating ambiguity, this paper uses fixed word collocation, verb dictionary and word probability by the rule of word formation. The experiental results show that this approach improves the accuracy and adaptability of the word segmentation. F value of the segmenta-tion results in computer and medical fields is increased by 7.6%and 8.7%.关键词
中文分词/条件随机场/领域自适应/歧义消解/领域分词/逆向最大匹配算法Key words
Chinese word segmentation/Conditional Random Field(CRF)/domain adaption/ambiguity resolution/domain segmentation/reverse directional maximum match method分类
计算机与自动化引用本文复制引用
朱艳辉,刘璟,徐叶强,田海龙,马进..基于条件随机场的中文领域分词研究[J].计算机工程与应用,2016,52(15):97-100,4.基金项目
国家自然科学基金(No.61170102);国家社科基金资助项目(No.12BYY045);湖南省教育厅重点项目(No.15A049)。 ()