| 注册
首页|期刊导航|高技术通讯|语音驱动的三维高斯人脸运动生成方法

语音驱动的三维高斯人脸运动生成方法

李小娟 陈姝宇

高技术通讯2026,Vol.36Issue(3):221-229,9.
高技术通讯2026,Vol.36Issue(3):221-229,9.DOI:10.3772/j.issn.1002-0470.2026.03.001

语音驱动的三维高斯人脸运动生成方法

Audio-driven 3D facial motion generation via Gaussian splatting

李小娟 1陈姝宇2

作者信息

  • 1. 中国科学院计算技术研究所 北京 100190||中国科学院大学 北京 100049
  • 2. 中国科学院计算技术研究所 北京 100190
  • 折叠

摘要

Abstract

With the development of digital human technology,synthesizing realistic facial motions that align with audio has become a significant research focus.Existing image-based facial motion synthesis methods are often limited to specific camera angles and struggle with accurately expressing facial details.The core issue lies in the lack of effec-tive 3D representation.To address this problem,this paper proposes an audio-driven 3D facial motion generation method via Gaussian splatting.The method first combines 3D Gaussian splatting with a parametric facial model to perform 3D modeling of dynamic facial data,establishing a relationship between the Gaussian representation and the mesh model.For motion generation,audio-driven movements are mapped to the vertex displacements on the facial model,and dynamic facial Gaussian deformation is achieved through mesh deformation.Compared to existing meth-ods,the proposed Gaussian-based facial motion generation method demonstrates superior 3D consistency and image quality,along with significantly improved generation and rendering efficiency.

关键词

高斯泼溅/语音驱动/参数化人脸模型

Key words

Gaussian splatting/audio-driven/parametric facial model

引用本文复制引用

李小娟,陈姝宇..语音驱动的三维高斯人脸运动生成方法[J].高技术通讯,2026,36(3):221-229,9.

基金项目

国家自然科学基金青年基金(62102403)资助项目. (62102403)

高技术通讯

1002-0470

访问量0
|
下载量0
段落导航相关论文