Creating a Speech Enabled Avatar from a Single Photograph - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

Creating a Speech Enabled Avatar from a Single Photograph

Description:

Avatars from video sequences. Bregler et al 1997, Ezzat et al 2002, etc ... 3D Avatars. Limitations and Future Work. Automatic facial feature detection ... – PowerPoint PPT presentation

Number of Views:83
Avg rating:3.0/5.0
Slides: 31
Provided by: suj2
Category:

less

Transcript and Presenter's Notes

Title: Creating a Speech Enabled Avatar from a Single Photograph


1
Creating a Speech Enabled Avatar from a Single
Photograph
Dmitri Bitouk Shree K. Nayar
Columbia University
2
Speech Enabled Avatar
Input photograph
3
Speech Enabled Avatar
Input photograph
Avatar
4
Speech Enabled Avatar
Input photograph
Avatar
  • Applications
  • mobile messaging and video conferencing
  • news reporting and information kiosks
  • novel user interfaces

5
Facial Motion Synthesis Challenges
  • Mapping phonemes to static mouth shapes produces
    unrealistic, jerky animations
  • Co-articulation facial articulations can be
    dominated the preceding as well upcoming phonemes
  • Asynchrony facial motion may precede the
    corresponding sound

6
Related Work
  • Avatars from video sequences
  • Bregler et al 1997, Ezzat et al 2002, etc
  • 2D Avatars from photographs
  • Blanz et al 2003, CrazyTalkTM , MotionPortraitTM

7
Generic Facial Motion Model
Prototype Surface
Deformed Surface
- Facial motion parameters
Bitouk 2006
8
Generic Facial Motion Model
9
Facial Motion Transfer
Prototype Face
Novel Faces
Bitouk 2006
10
Facial Motion Transfer
Prototype Face
Novel Faces
Bitouk 2006
11
Hidden Markov Models
  • Phonemes /B/, /K/, /AA/, /IY/, etc
  • With lexical /B/, /K/, /AA0/, /AA1/, /IY0/,
    /IY1/, etc
  • stress
  • Triphones

Facial motion parameters
12
Training Hidden Markov Models
  • Training set consists of motion capture data
  • Baum-Welch embedded re-estimation
  • Cluster triphone states to predict triphones not
    seen in the training set

13
Facial Motion Synthesis from Text
Time-labeled phonemes
14
Fitting the Prototype Model to an Image
2D Prototype Face
Photograph
15
Fitting the Prototype Model to an Image
2D Prototype Face
Photograph
16
Facial Motion Synthesis
17
Eye Motion Synthesis
18
Eyeball Texture Synthesis
Eye Image
Synthesized Eyeball Texture
19
Eye Motion Synthesis
Eye Motion Geometry
20
Eye Motion and Blinking
21
Visual Text-to-Speech Synthesis
22
Visual Text-to-Speech Synthesis
23
Facial Motion Synthesis from Speech
Time-labeled phonemes
24
Facial Motion Synthesis from Speech
25
3D Avatars
Captured Stereo Image
Mirror View
Direct View
Gluckman Nayar, 2001
26
3D Avatars
Rectified Images
3D Model
Mirror View
Direct View
27
3D Avatars
Point cloud engraved inside a glass cube
Digital projector
Nayar Anand, 2007
28
3D Avatars
29
Limitations and Future Work
  • Automatic facial feature detection
  • Synthesis of rigid head motion
  • Expressive speech
  • Web demo of our system will be available in
  • early April
  • www.cs.columbia.edu/CAVE/

30
The End
Write a Comment
User Comments (0)
About PowerShow.com