AED-NeRF: Audio-Driven and Emotion-Editing Dynamic Neural Radiance Fields for Expressive Talking Face Avatar

Abstract: While neural radiance field (NeRF) methods have shown promising results in generating talking faces, existing studies primarily focus on the correlation between avatars and driving sources. These studies often overlook emotion modeling, which results in emotionless or unnatural facial animations. In response, this paper introduces an audio-driven and emotion-editing dynamic NeRF (AED-NeRF) approach, designed for the real-time generation of expressive talking face avatars driven by audio inputs. Specifically, we integrate audio features into a grid-based NeRF to compensate for the lack of a deformation channel, successfully capturing lip dynamics and enabling end-to-end generation from audio-driven sources to talking face avatars. Emotion labels, comprising emotion categories and intensity levels, guide the proposed NeRF framework to implicitly model visual emotions, allowing for explicit control and editing of facial expressions. Extensive qualitative and quantitative experiments validate the effectiveness and advantages of the proposed method, demonstrating its ability to achieve real-time, photo-realistic talking face avatar generation across different audio and emotion scenarios.
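The abstract states that audio features and emotion labels (a category plus an intensity level) condition the NeRF, but gives no implementation detail. The sketch below is therefore only an illustration of one common way such a conditioning vector could be assembled, not the authors' method. The emotion vocabulary `EMOTIONS`, the one-hot-times-intensity encoding, the helper names `encode_emotion` and `build_condition`, and all feature sizes are assumptions made for this example.

```python
from typing import List, Sequence

# Hypothetical emotion vocabulary; the abstract does not list the categories.
EMOTIONS = ("neutral", "happy", "sad", "angry")


def encode_emotion(category: str, intensity: float) -> List[float]:
    """Encode an emotion label as a one-hot category scaled by intensity.

    Assumption: intensity is a scalar in [0, 1]; the paper's actual
    encoding may differ.
    """
    if category not in EMOTIONS:
        raise ValueError(f"unknown emotion category: {category!r}")
    if not 0.0 <= intensity <= 1.0:
        raise ValueError("intensity must lie in [0, 1]")
    return [intensity if name == category else 0.0 for name in EMOTIONS]


def build_condition(
    grid_feature: Sequence[float],
    audio_feature: Sequence[float],
    category: str,
    intensity: float,
) -> List[float]:
    """Concatenate spatial, audio, and emotion features into one vector.

    In a grid-based NeRF, a vector like this would be fed to a small MLP
    that predicts density and color for a sampled point (assumed design).
    """
    return (
        list(grid_feature)
        + list(audio_feature)
        + encode_emotion(category, intensity)
    )


# Toy values standing in for a grid lookup and a per-frame audio embedding.
cond = build_condition([0.1, 0.2, 0.3], [0.5, -0.5], "happy", 0.8)
print(len(cond))
```

Under this assumed encoding, editing the expression at inference time amounts to changing `category` or `intensity` while keeping the audio feature fixed.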

Authors: Lu Ping, Song Li, Shi Wenzhe, Lin Zonghao, Ling Jun

Affiliation: State Key Laboratory of Mobile Network and Mobile Multimedia Technology

Source: ZTE Communications (中兴通讯技术(英文版)), 2026, No. 1, pp. 72-80 (9 pages)

Funding: Supported by ZTE Industry-University-Institute Cooperation Funds under Grant No. IA20230921015.

Keywords: talking face avatar; neural radiance fields; AED-NeRF

Classification code: TP391.41 [Automation and Computer Technology]
