首页|期刊导航|计算机应用与软件|基于统计机器翻译的汉维词对齐研究

基于统计机器翻译的汉维词对齐研究

刘建明吐尔根·依不拉音艾山·吾买尔

计算机应用与软件2011，Vol.28Issue(4)：57-59,90,4.

基于统计机器翻译的汉维词对齐研究

RESEARCH ON STATISTICAL MACHINE TRANSLATION-BASED CHINESE-UYGHUR WORD ALIGNMENT

刘建明 ¹吐尔根·依不拉音 ¹艾山·吾买尔¹

作者信息

1. 新疆大学信息科学与工程学院,新疆乌鲁木齐830046
折叠

摘要

Abstract

This paper describes a Chinese-Uyghur word alignment system which is based on statistical machine translation. There are two models in processing procedure of the system:pre-process and word alignment. The pre-process includes Chinese text pre-process and Uyghur text pre-process, in it the pre-process procedure for Uyghur text is as follows:First the Uyghur is transferred to Latin-Uyghur,then the exceptional characters in Latin-Uyghur will be replaced by the unambiguous characters. The implementation process of word alignment is:First, IBM model 1-3 is used,then in combination with the heuristic theory by Och the optinisation is conducted,and a Chinese-Uyghur word alignment system based on statistical machine translation is constructed. Experimental results show that this system is feasible.

关键词

词对齐/IBM Modell-3/启发式优化

Key words

Word alignment/ IBM Model 1-3/ Heuristic optimisation

引用本文复制引用

刘建明,吐尔根·依不拉音,艾山·吾买尔..基于统计机器翻译的汉维词对齐研究[J].计算机应用与软件,2011,28(4):57-59,90,4.

基金项目

国家自然科学基金(60663006) （60663006）

国家语委科研项目(MZ115-75).刘建明,硕士生,主研领域:计算机应用及信息处理. （MZ115-75）

计算机应用与软件

OA北大核心CSCDCSTPCD

ISSN：1000-386X

访问量0

下载量0

段落导航