| 注册
首页|期刊导航|郑州大学学报(工学版)|基于伪孪生网络的无监督学习多语言神经机器翻译方法

基于伪孪生网络的无监督学习多语言神经机器翻译方法

都力铭 屈丹 张传财 席阳丽

郑州大学学报(工学版)2025,Vol.46Issue(6):8-14,7.
郑州大学学报(工学版)2025,Vol.46Issue(6):8-14,7.DOI:10.13705/j.issn.1671-6833.2025.03.008

基于伪孪生网络的无监督学习多语言神经机器翻译方法

Unsupervised Learning Multilingual Neural Machine Translation Based on Pseudo-siamese Network

都力铭 1屈丹 2张传财 3席阳丽1

作者信息

  • 1. 郑州大学 网络空间安全学院,河南 郑州 450002
  • 2. 郑州大学 网络空间安全学院,河南 郑州 450002||网络空间部队信息工程大学 先进计算与智能工程(国家级)实验室,河南 郑州 450001
  • 3. 网络空间部队信息工程大学 先进计算与智能工程(国家级)实验室,河南 郑州 450001
  • 折叠

摘要

Abstract

When unsupervised neural machine translation was trained with monolingual data,it inevitably brought a lot of noise information.The errors of the machine translation model accumulated continuously during the training iteration process,affecting the translation effect.To solve this problem,in this study an unsupervised neural ma-chine translation method was proposed based on pseudo-siamese network on the basis of cross-lingual pre-training model(XLM).The model encoder was divided into two modules,in which the pseudo-Siamese network part intro-duced a noise filtering gate mechanism to filter the noise features in the encoding process,so that the model could better learn the mapping relationship between the source language and the target language.The experimental results showed that in the interactive translation task between English,German,French,and Romanian,the proposed method had an average improvement of 3.5 percentage points compared with the baseline system,which proved its superiority in translation effect.Ablation experiments were used to verify the effectiveness of each component of the model.At the same time,the performance test of the method with different noise conditions was simulated in the German-English translation task,and it also showed good noise resistance.

关键词

无监督机器翻译/伪孪生网络/单语数据/噪声过滤门机制/跨语言预训练模型

Key words

unsupervised machine translation/pseudo-siamese network/monolingual data/noise filtering gate mechanism/cross-language pretraining model

分类

信息技术与安全科学

引用本文复制引用

都力铭,屈丹,张传财,席阳丽..基于伪孪生网络的无监督学习多语言神经机器翻译方法[J].郑州大学学报(工学版),2025,46(6):8-14,7.

基金项目

国家自然科学基金资助项目(62171470) (62171470)

河南省中原科技创新领军人才项目(234200510019) (234200510019)

河南省自然科学基金项目(232300421240) (232300421240)

郑州大学学报(工学版)

OA北大核心

1671-6833

访问量0
|
下载量0
段落导航相关论文