Modern Electronics Technique (现代电子技术), 2024, Vol. 47, Issue 15: 146-150, 5. DOI: 10.16652/j.issn.1004-373x.2024.15.024
Research on adolescent gaze estimation algorithm based on CB-ViT
Abstract
Gaze estimation technology is widely applied in fields such as human-computer interaction (HCI), virtual reality, and medical diagnostic assistance. However, existing public datasets consist primarily of adult subjects, so gaze estimation algorithms trained on them perform suboptimally when applied to adolescents. To address this issue, an adolescent-specific gaze dataset named "Young-Gaze", comprising gaze data from 107 adolescents, is collected. In addition, a novel 2D gaze estimation algorithm is proposed. The algorithm is based on the Vision Transformer (ViT) and incorporates a context broadcasting (CB) module, which significantly enhances the feature representation capability of the network by integrating the features of both eyes at different levels. In experiments on the Young-Gaze dataset, the algorithm keeps the error within 5.42 cm, surpassing other existing 2D gaze estimation methods. It also achieves good results when trained and tested on the public 2D gaze datasets GazeCapture and MPIIFaceGaze. These results indicate that the proposed algorithm is suitable not only for adolescents but also for adults.
Keywords
gaze estimation/head pose/CNN/feature fusion/ViT/context broadcasting (CB)
Classification
Information technology and security science
Cite this article
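Yan Qingsong (严青松), Mao Jianhua (毛建华), Liu Zhi (刘志), Lu Xiaofeng (陆小锋). Research on adolescent gaze estimation algorithm based on CB-ViT[J]. Modern Electronics Technique, 2024, 47(15): 146-150, 5.
Funding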
Wenzhou Major Science and Technology Innovation Project (ZY2023003)