Title: Emotion Recognition
1Emotion Recognition
Supervised by Dr. Ishwar K. Sethi
2(No Transcript)
3Objective
- Human computer interaction
- Voice Stream
- Discriminate between emotions
4Why do we need emotion recognition?
- Interactive Voice Response (IVR)
- Robotics
- Computer/video games
5Steps
6Features
- Pitch
- Energy
- Speaking Rate
- Formants their Bandwidths
- MFCC (Mel Frequency Cepstral Coefficients)
7Classifiers
- SVM
- Decision Tree
- Other Methods
- - HMM
- - KNN
8Support Vector Machine
- 1) Problem statement
- Given l examples ( x1, y1),, ( xl, yl )
- where xi is a feature vector of length d
- and yi 1 , -1
- Find a classifier with the decision function f(x
) such that y f( x ) - where y is the class label for x
9Support Vector Machine( cont)
- 2) Separable case
- Separating hyperplane
- w.x b 0
- Constraints
- w.xi b 1
- w.xi b -1
- Margin
- 2/ w
10Support Vector Machine( cont)
- 3) Nonlinear-separable case
- Map the training data from RN to some higher
dimensional space H - Use Kernel function
- K( x, y ) ( x.y 1 )d
- K( x, y ) exp( -((x y)2 )/ ( 2?2) )
- K( x, y ) tanh( x.y - ? )
-
-
11Decision Tree( ID3 )
12Decision tree( cont )
-
- Use entropy and information gain
- Entropy( S ) ?-pilog2pi
- Gain( S, A) Entropy(S)
- ?(Sv/S).Entropy(Sv)
-
13Wav Samples
Found some samples in the following websites
http//www.computing.dundee.ac.uk/staff/i
rmurray/hamlet.asp Phrase 1 - "This is not what I
expected Unemotion ( file hamlet01 ) Anger
( file hamlet02 ) Happiness
( file hamlet03 ) Sadness
( file hamlet04 ) Fear
( file hamlet05 ) Disgust
( file hamlet06 ) Phrase 2 - "You have asked me
that question so many times Unemotion
( file hamlet07 ) Anger (
file hamlet08 ) Happiness ( file
hamlet09 ) Sadness ( file
hamlet10 ) Fear ( file
hamlet11 ) Disgust ( file
hamlet12 )
14Spider-Man( 2002)
DEMO
15Future Work
- Accuracy
- Classification
- Database
- Gender
16References
- R. Kent, and C. Read, Acoustic Analysis of
Speech Second Edition, Thomas Learning, 2002 - Rabiner, L. and Schafer, R. Digital Processing of
Speech Signals, Prentice-Hall, 1978 - J. Deller, J. Proakis, and J.Hansen,
Discrete-Time Processing of Speech Signals,
Macmillian, 1993 - T. Mitchell, Machine Learning, McGraw-Hill 1997
- M. Schröder (2003). Emotional speech synthesis
for emotionally-rich virtual worlds. Proc. of
Workshop on emotionally rich virtual worlds with
emotion synthesis at the 8th International
Conference on 3D Web Technology (Web3D), 10.
March 2003, St.Malo, France. - O.-W. Kwon, K. Chan, J. Hao, and T.-W. Lee,
Emotion Recognition by speech Signals," Proc.
EUROSPEECH 2003, Geneva, Switzerland, Sept. 2003.
- Mingkun Li, Support Vector Machine, Intelligent
Information Engineering Lab, Oakland University,
2003 - Dr. Ishwar K. Sethi, Language Identification for
a Voice Stream Portizoration System, Intelligent
Information Engineering Lab, Oakland University,
2003 - http//www.fbckenner.org/.../ images/future.gif
future
17References Continued
- A. Nogueriras, A. Moren, A. Bonafonte, and J.
Marino Speech Emotion Recognition Using Hidden
Markov Models Proc. EUROSPEECH, Scandinavia,
2001 - Feng Yu,Eric Chang,Ying-Qing Xu, Heung-Yeung Shum
. Emotion Detection from Speech to Enrich
Multimedia Content. The Second IEEE Pacific-Rim
Conference on Multimedia, Beijing,October 24-26,
2001 - S. Yacoub, S. Simske, X. Lin, J. Burns
HPL-2003-136 Recognition of Emotions in
Interactive Voice Response Systems -2003 - http//www.asel.udel.edu/speech/tutorials/acoustic
s/freq_domain.html Sound in the Frequency
Domain 1996 - http//www.asel.udel.edu/speech/tutorials/acoustic
s/time_domain.html Sound in the Time Domain
1996 - http//mi.eng.cam.ac.ukkkc21/thesis_main.html
The Formulation of Support Vector Machine 1998 - www.robotoys.com/.../ st_prod.html?p_prodid151
I Cybie
18(No Transcript)