首页|期刊导航|软件导刊|一种全局信息增强的语义图像合成方法

一种全局信息增强的语义图像合成方法

刘勇李俊岐陈永强

软件导刊2024，Vol.23Issue(10)：214-220,7.

软件导刊2024，Vol.23Issue(10)：214-220,7.DOI:10.11907/rjdk.231647

一种全局信息增强的语义图像合成方法

A Method for Semantic Image Synthesis with Global Information Enhancement

刘勇 ¹李俊岐 ¹陈永强¹

作者信息

1. 武汉纺织大学计算机与人工智能学院||湖北省服装信息化工程技术研究中心,湖北武汉 430200
折叠

摘要

Abstract

Semantic image synthesis is an important application and research direction in the field of image translation.Its aim is to generate real images that are consistent with image descriptions using input semantic images,such as semantic segmentation maps,maps and sketches.In response to the problems of blurry image features and lack of correlation in texture details due to the lack of global information in semantic image synthesis tasks based on generative adversarial networks(GANs),this paper proposes a global information-enhanced semantic image synthesis method based on the pix2pix network model,combined with an external attention mechanism.Firstly,an external attention mecha-nism is introduced in the upsampling stage of the generator with a U-net structure to enhance the spatial correlation between generated image pixels.Secondly,deep residual modules are used in the upsampling layers of the generator to improve the quality of generated images while en-hancing the diversity of the generated images.Finally,the discriminator incorporates global information to enhance its discrimination ability.Experimental evaluations on the Cityscape,Landscape,and Edges2shoes datasets demonstrate the effectiveness of the proposed model.Com-pared to the baseline model,the improved method achieves improvements of 57.37,26.74,and 1.78 in terms of the FID(Fréchet Inception Distance)metric for the Cityscape,Landscape,and Edges2shoes datasets,respectively.The results show that the proposed model can effec-tively utilize global information to enhance the correlation of texture details in generated images and improve the quality of generated images.

关键词

图像翻译/语义图像合成/生成对抗网络/深度学习/计算机视觉

Key words

image-to-image translation/semantic image synthesis/generative adversarial network/deep learning/computer vision

分类

信息技术与安全科学

引用本文复制引用

刘勇,李俊岐,陈永强..一种全局信息增强的语义图像合成方法[J].软件导刊,2024,23(10):214-220,7.

软件导刊

ISSN：1672-7800

访问量0

下载量0

段落导航