Emotion Recognition

About This Presentation

Title:

Emotion Recognition

Description:

Formants & their Bandwidths. MFCC (Mel Frequency Cepstral Coefficients) Classifiers ... Max Formant. Happy. Min MFCC. Neutral. Excited. Decision tree( cont' ... – PowerPoint PPT presentation

Number of Views:6024

Avg rating:5.0/5.0

Slides: 19

Provided by: AIE

Category:

more less

Transcript and Presenter's Notes

Title: Emotion Recognition

1
Emotion Recognition

Iris Bass
Thao Nguyen

Supervised by Dr. Ishwar K. Sethi
2
(No Transcript)
3
Objective

Human computer interaction
Voice Stream
Discriminate between emotions

4
Why do we need emotion recognition?

Interactive Voice Response (IVR)
Robotics
Computer/video games

5
Steps
6
Features

Pitch
Energy
Speaking Rate
Formants their Bandwidths
MFCC (Mel Frequency Cepstral Coefficients)

7
Classifiers

SVM
Decision Tree
Other Methods
- HMM
- KNN

8
Support Vector Machine

1) Problem statement
Given l examples ( x1, y1),, ( xl, yl )
where xi is a feature vector of length d
and yi 1 , -1
Find a classifier with the decision function f(x
) such that y f( x )
where y is the class label for x

9
Support Vector Machine( cont)

2) Separable case
Separating hyperplane
w.x b 0
Constraints
w.xi b 1
w.xi b -1
Margin
2/ w

10
Support Vector Machine( cont)

3) Nonlinear-separable case
Map the training data from RN to some higher
dimensional space H
Use Kernel function
K( x, y ) ( x.y 1 )d
K( x, y ) exp( -((x y)2 )/ ( 2?2) )
K( x, y ) tanh( x.y - ? )

11
Decision Tree( ID3 )
12
Decision tree( cont )

Use entropy and information gain
Entropy( S ) ?-pilog2pi
Gain( S, A) Entropy(S)
?(Sv/S).Entropy(Sv)

13
Wav Samples
Found some samples in the following websites
http//www.computing.dundee.ac.uk/staff/i
rmurray/hamlet.asp Phrase 1 - "This is not what I
expected Unemotion ( file hamlet01 ) Anger
( file hamlet02 ) Happiness
( file hamlet03 ) Sadness
( file hamlet04 ) Fear
( file hamlet05 ) Disgust
( file hamlet06 ) Phrase 2 - "You have asked me
that question so many times Unemotion
( file hamlet07 ) Anger (
file hamlet08 ) Happiness ( file
hamlet09 ) Sadness ( file
hamlet10 ) Fear ( file
hamlet11 ) Disgust ( file
hamlet12 )
14
Spider-Man( 2002)
DEMO
15
Future Work

Accuracy
Classification
Database
Gender

16
References

R. Kent, and C. Read, Acoustic Analysis of
Speech Second Edition, Thomas Learning, 2002
Rabiner, L. and Schafer, R. Digital Processing of
Speech Signals, Prentice-Hall, 1978
J. Deller, J. Proakis, and J.Hansen,
Discrete-Time Processing of Speech Signals,
Macmillian, 1993
T. Mitchell, Machine Learning, McGraw-Hill 1997
M. Schröder (2003). Emotional speech synthesis
for emotionally-rich virtual worlds. Proc. of
Workshop on emotionally rich virtual worlds with
emotion synthesis at the 8th International
Conference on 3D Web Technology (Web3D), 10.
March 2003, St.Malo, France.
O.-W. Kwon, K. Chan, J. Hao, and T.-W. Lee,
Emotion Recognition by speech Signals," Proc.
EUROSPEECH 2003, Geneva, Switzerland, Sept. 2003.
Mingkun Li, Support Vector Machine, Intelligent
Information Engineering Lab, Oakland University,
2003
Dr. Ishwar K. Sethi, Language Identification for
a Voice Stream Portizoration System, Intelligent
Information Engineering Lab, Oakland University,
2003
http//www.fbckenner.org/.../ images/future.gif
future

17
References Continued

A. Nogueriras, A. Moren, A. Bonafonte, and J.
Marino Speech Emotion Recognition Using Hidden
Markov Models Proc. EUROSPEECH, Scandinavia,
2001
Feng Yu,Eric Chang,Ying-Qing Xu, Heung-Yeung Shum
. Emotion Detection from Speech to Enrich
Multimedia Content. The Second IEEE Pacific-Rim
Conference on Multimedia, Beijing,October 24-26,
2001
S. Yacoub, S. Simske, X. Lin, J. Burns
HPL-2003-136 Recognition of Emotions in
Interactive Voice Response Systems -2003
http//www.asel.udel.edu/speech/tutorials/acoustic
s/freq_domain.html Sound in the Frequency
Domain 1996
http//www.asel.udel.edu/speech/tutorials/acoustic
s/time_domain.html Sound in the Time Domain
1996
http//mi.eng.cam.ac.ukkkc21/thesis_main.html
The Formulation of Support Vector Machine 1998
www.robotoys.com/.../ st_prod.html?p_prodid151
I Cybie

18
(No Transcript)

Write a Comment

User Comments (0)

About PowerShow.com

Emotion Recognition - PowerPoint PPT Presentation

Emotion Recognition

Formants & their Bandwidths. MFCC (Mel Frequency Cepstral Coefficients) Classifiers ... Max Formant. Happy. Min MFCC. Neutral. Excited. Decision tree( cont' ... – PowerPoint PPT presentation