计算机应用与软件2016,Vol.33Issue(10):94-97,4.DOI:10.3969/j.issn.1000-386x.2016.10.021
基于词语搭配关系的一种中文分词歧义性消除方法
A CHINESE SEGMENTATION DISAMBIGUATION METHOD BASED ON WORD COLLOCATION RELATIONSHIP
摘要
Abstract
In Chinese there are the fix collocation relationships between words.This paper presents a disambiguation method for Chinese segmentation based on word collocation.It firstly pre-segments the sentences by using the forward maximum matching method and backward maximum matching method,and carries out the word ambiguity detection and tags the part of speech,and then it matches the ambiguous words with word collocation dictionary or makes distinguishment on verb-object collocations,thus achieves the more accurate results of document words disambiguation.The proposed method reaches good results as shown in contrast experiments of word ambiguity detection and word collo-cation detection.关键词
词语搭配/最大匹配/中文分词/歧义性/动宾搭配Key words
Word collocation/Maximum match/Chinese segmentation/Word ambiguity/Verb-object collocations分类
信息技术与安全科学引用本文复制引用
郭丙华,俞亚,李中华..基于词语搭配关系的一种中文分词歧义性消除方法[J].计算机应用与软件,2016,33(10):94-97,4.基金项目
国家自然科学基金项目(61201087);广东省特色创新项目(2014KTSCX191)。 ()