计算机工程与应用2011,Vol.47Issue(35):147-149,3.DOI:10.3778/j.issn.1002-8331.2011.35.041
改进的自适应汉维句子对齐
Improved adaptive algorithm for Chinese-Uyghur sentence alignment
摘要
Abstract
This paper proposes an improved adaptive algorithm for Chinese-Uyghur sentence alignment.Traditional alignment methods can not well adapt to change in types of corpus,the algorithm makes ues of current Chinese-Uyghur text length ratio of bytes and historical matching model, modifies the alignment model parameters dynamically to meet the changes in types of corpus and improves sentence alignment algorithm performance.Compared with alignment algorithm based on length, alignment improves alignment accuarcy 3.5 percentage and recall 2.7 percentage, compared with mixed-aligned model .alignment improves 1.9 percentage and 1.8 percentage.Experimental results show that the algorithm can adapt to change in types of corpus well.关键词
双语语料/句子对齐/自适应Key words
bilingual corpora/sentence alignment/adaptive分类
信息技术与安全科学引用本文复制引用
田生伟,禹龙,杨飞宇..改进的自适应汉维句子对齐[J].计算机工程与应用,2011,47(35):147-149,3.基金项目
新疆自治区高校科研计划重点项目(No.XJEDU2009105). (No.XJEDU2009105)