计算机应用与软件Issue(8):64-67,109,5.DOI:10.3969/j.issn.1000-386x.2015.08.015
维吾尔语数词类命名实体的识别与翻译
RECOGNITION AND TRANSLATION OF UYGHUR NAMED ENTITIES IN NUMERALS CLASS
摘要
Abstract
Aiming at the problem that Uyghur named entities ( time, date, money, percentage ) in numerals class are inaccurately translated in Uyghur-Chinese machine translation, we designed a Uyghur-Chinese parallel corpus-based recognition and translation system for Uyghur named entities in numerals class by analysing the formation laws and boundary information of these named entities.Uyghur basic numerals are recognised and translated through finite automata in combination with triggering words, and the translation templates will be automatically extracted from Uyghur-Chinese parallel corpus, the templates will then be matched to implement the translation.The F value of recognition achieves 91%in Uyghur named entities in numerals class, the system effectively improves the quality of Uyghur-Chinese machine translation.关键词
平行语料/数词类/命名实体/维汉机器翻译/有限自动机Key words
Parallel corpus/Numerals class/Named entities/Uyghur-Chinese machine translation/Finite automata分类
信息技术与安全科学引用本文复制引用
张磊,杨雅婷,米成刚,李晓..维吾尔语数词类命名实体的识别与翻译[J].计算机应用与软件,2015,(8):64-67,109,5.基金项目
中国科学院战略性先导科技专项项目(XDA06030400);中国科学院“西部之光”人才培养计划“西部博士”项目( XBBS201216);新疆维吾尔自治区青年科技创新人才培养工程项目(2013731021);中国科学院西部行动计划项目(KGZD-EW-501)。 ()