| 注册
首页|期刊导航|现代电子技术|视觉-语言多模态下的多任务人脸年龄估计

视觉-语言多模态下的多任务人脸年龄估计

何江 池静 池佳稷 高松

现代电子技术2024,Vol.47Issue(14):171-176,6.
现代电子技术2024,Vol.47Issue(14):171-176,6.DOI:10.16652/j.issn.1004-373x.2024.14.026

视觉-语言多模态下的多任务人脸年龄估计

Multi-task face age estimation in vision-language multimodality

何江 1池静 1池佳稷 2高松3

作者信息

  • 1. 河北工程大学 信息与电气工程学院,河北 邯郸 056038
  • 2. 拉彭兰塔理工大学 电气工程学院,南卡累利亚 拉彭兰塔 53850
  • 3. 邯郸市第三建筑工程有限公司,河北 邯郸 056001
  • 折叠

摘要

Abstract

Existing age estimation methods are based only on face images and cannot fully utilize the linguistic contextual information behind the images.In addition,these methods usually focus on the optimization of a single age estimation task,ignoring the information brought by similar tasks to improve the model performance.To address the above problems,a multi-task face age estimation method based on vision-language multimodality is proposed,which utilizes prompt text information to provide richer and more accurate image understanding and a priori knowledge for age estimation.Meanwhile,a multi-task learning method is introduced to combine the age classification task with the ordinal regression task by utilizing the complementarity between tasks to obtain better performance.In order to obtain reliable prediction results,two multi-task result fusion methods are investigated:weighted averaging and task regression,and ablation experiments are conducted on the weighting factor of the weighted averaging method to find a suitable set of weighting factors.In comparison with the state-of-the-art methods,the mean absolute error(MAE)of the proposed method is reduced by 7.32%on the UTK-FACE dataset,its MAE is reduced by 1.20%,and its cumulative score(CS)is improved by 0.11%on the Morph Ⅱ dataset.

关键词

年龄估计/视觉-语言多模态/多任务学习/加权平均法/提示文本/任务回归器

Key words

age estimation/visual-language multimodality/multitask learning/weighted average method/prompt text/task regressor

分类

信息技术与安全科学

引用本文复制引用

何江,池静,池佳稷,高松..视觉-语言多模态下的多任务人脸年龄估计[J].现代电子技术,2024,47(14):171-176,6.

基金项目

邯郸市科学技术研究与发展计划项目(21422031252) (21422031252)

现代电子技术

OA北大核心CSTPCD

1004-373X

访问量0
|
下载量0
段落导航相关论文