计算机技术与发展2012,Vol.22Issue(5):134-136,140,4.
基于SVM和词间特征的新词识别研究
Research on New Word Identification Based on SVM and Word Characteristics
摘要
Abstract
Chinese word segmentation is difficult to deal with ambiguity and unknown words recognition. Propose the new word mode features as well as various word internal patterns from the training corpus of positive and negative samples to quantify extraction, and then through the training of support vector machine get new support vector classification. On the test corpus with absolute discounting method new candidate is extracted and selected, and with the training corpus to extract word patterns to quantify according to the new classification support vector on the SVM test,through a portion of the rule filter to get the final word recognition results.关键词
自然语言处理/支持向量机/新词识别/词间特征Key words
natural language processing/support vector machine/new word recognition/word feature分类
信息技术与安全科学引用本文复制引用
徐远方,李成城..基于SVM和词间特征的新词识别研究[J].计算机技术与发展,2012,22(5):134-136,140,4.基金项目
国家自然科学基金项目(2002AA117010-07) (2002AA117010-07)
内蒙古师范大学校基金(GCRC09001,ZRYB08018) (GCRC09001,ZRYB08018)