
Art Image Generation Model Based on Multilingual Text Symbols


Radio Communications Technology (无线电通信技术), 2025, Vol. 51, Issue 3: 486-492, 7. DOI: 10.3969/j.issn.1003-3114.2025.03.007


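Tang Hong (唐宏)¹, Zhuo Shiyu (卓诗语)²

Author information

1. Office of Academic Affairs, Sichuan Technology and Business College (四川工商职业技术学院), Chengdu, Sichuan 611830
2. School of Mechanical Engineering, Sichuan University, Chengdu, Sichuan 610207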


Abstract

Generating images from text symbols alone is known as the Text-to-Image (TTI) task, which has important application prospects in art design. Because annotated image data are scarce in languages other than English, TTI research focuses mainly on English, and existing TTI models cannot use existing data in other languages to generate images. Motivated by this, the potential of Multilingual TTI (MTTI) and of current neural machine translation-guided MTTI systems is investigated. By leveraging multilingual text symbols, an Art Image Generation Model Based on Multilingual Text Symbols (AIG-MTS) is proposed. By learning weights and integrating knowledge from multilingual texts, differences between languages can be reduced and model performance improved. Experiments are conducted on the standard datasets COCO-CN, Multi30K Task2 and LAION-5B, and the AIG-MTS model performs best on all datasets when compared with mainstream algorithms.


Keywords

design field / multilingual / Text-to-Image (TTI) / multimodal encoder / neural machine translation

Classification

Information Technology and Security Science

Cite this article

Tang Hong, Zhuo Shiyu. Art Image Generation Model Based on Multilingual Text Symbols[J]. Radio Communications Technology, 2025, 51(3): 486-492, 7.

Funding

Sichuan Science and Technology Program (2022YFG0186)

Radio Communications Technology (无线电通信技术)

Open access; Peking University Core Journal (北大核心)

ISSN: 1003-3114
