Real-Time Speech-Driven 3D Face Animation
نویسندگان
چکیده
In this paper, we present an approach for real-time speech-driven 3D face animation using neural networks. We first analyze a 3D facial movement sequence of a talking subject and learn a quantitative representation of the facial deformations, called the 3D Motion Units (MUs). A 3D facial deformation can be approximated by a linear combination of the MUs weighted by the MU parameters (MUPs) – the visual features of the facial deformation. The facial movement sequence synchronizes with a audio track. The audio track is digitized and the audio features of each frame are calculated. A real-time audio-to-MUP mapping is constructed by training a set of neural networks using the calculated audio-visual features. The audio-visual features are divided into several groups based on the audio features. One neural network is trained per group to map the audio features to the corresponding MUPs. Given a new audio feature vector, we first classify it into one of the groups and select the corresponding neural network to map the audio feature vector to MUPs, which are used for face animation. The quantitative evaluation shows the effectiveness of the proposed approach.
منابع مشابه
Visual speech synthesis from 3D video
Data-driven approaches to 2D facial animation from video have achieved highly realistic results. In this paper we introduce a process for visual speech synthesis from 3D video capture to reproduce the dynamics of 3D face shape and appearance. Animation from real speech is performed by path optimisation over a graph representation of phonetically segmented captured 3D video. A novel similarity m...
متن کاملA Low Bit-rate Web-enabled Synthetic Head with Speech-driven Facial Animation
In this paper, an approach that animates facial expressions through speech analysis is presented. An individualized 3D head model is first generated by modifying a generic head model, where a set of MPEG-4 Facial Definition Parameters (FDPs) has been pre-defined. To animate realistic facial expressions of the 3D head model, key frames of facial expressions are calculated from motion-captured da...
متن کاملReal-time speech-driven face animation with expressions using neural networks
A real-time speech-driven synthetic talking face provides an effective multimodal communication interface in distributed collaboration environments. Nonverbal gestures such as facial expressions are important to human communication and should be considered by speech-driven face animation systems. In this paper, we present a framework that systematically addresses facial deformation modeling, au...
متن کاملReal-Time Speech-Driven Face Animation
This chapter presents our research on real-time speech-driven face animation. First, a visual representation, called Motion Unit (MU), for facial deformation is learned from a set of labeled face deformation data. A facial deformation can be approximated by a linear combination of MUs weighted by the corresponding MU parameters (MUPs), which are used as the visual features of facial deformation...
متن کاملGenerating Visemes for Realistic Animation
Efficient, realistic face animation is still a challenge. A system is proposed that yields realistic visemes for speech animation. This paper discusses the extraction of these visemes. It starts from real 3D face dynamics, observed at frame rate for thousands of points on the faces of speaking actors. A generic 3D mesh is fitted to the data throughout 3D time sequences. This is based on a combi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002