| 注册
首页|期刊导航|机电工程技术|结合Transformer的扩散模型用于人脸美丽预测

结合Transformer的扩散模型用于人脸美丽预测

甘俊英 黎慧聪 陈汉添 庄圳鑫 陈真

机电工程技术2026,Vol.55Issue(3):74-79,6.
机电工程技术2026,Vol.55Issue(3):74-79,6.DOI:10.3969/j.issn.1009-9492.2025.00055

结合Transformer的扩散模型用于人脸美丽预测

Diffusion Model with Transformer for Facial Beauty Prediction

甘俊英 1黎慧聪 1陈汉添 1庄圳鑫 1陈真1

作者信息

  • 1. 五邑大学电子与信息工程学院,广东 江门 529020
  • 折叠

摘要

Abstract

Overfitting to noisy labels in the database leads to weak generalization capability and reduced prediction accuracy in facial beauty prediction tasks.To address the issue,a Transformer-integrated diffusion model for label denoising and reconstruction during training is proposed.The model learns conditional probability distributions to control the generation process through"classifier guidance,"comprising a conditional information encoder and a denoising network.First,pre-trained weights of Swin Transformer are transferred and fine-tuned to obtain preliminary predictions as output priors.Second,these priors are utilized as the mean of the endpoint in the reverse process of the diffusion model,regulating denoising transitions at each timestep.Finally,facial beauty features are extracted and fed into the diffusion model for inference to generate prediction results.Experimental validation on three facial beauty databases,SCUT-FBP5500,LSAFBD,and CelebA,demonstrates that the proposed model outperforms baseline diffusion model and existing facial beauty prediction methods.In terms of accuracy,the model achieves 76.50%,72.65%,and 81.78%on the three databases respectively,surpassing the baseline diffusion model by 0.73%,1.76%,and 1.12%,and outperforming existing facial beauty prediction methods by 1.00%,4.42%,and 0.37%.The approach effectively addresses noisy label issues,enhances prediction performance,and can be widely applied to other image classification tasks or related fields.

关键词

人脸美丽预测/扩散模型/Transformer/条件信息编码器

Key words

facial beauty prediction/diffusion model/Transformer/conditional information encoder

分类

信息技术与安全科学

引用本文复制引用

甘俊英,黎慧聪,陈汉添,庄圳鑫,陈真..结合Transformer的扩散模型用于人脸美丽预测[J].机电工程技术,2026,55(3):74-79,6.

基金项目

国家自然科学基金(6177010044) (6177010044)

机电工程技术

1009-9492

访问量0
|
下载量0
段落导航相关论文