Advanced Digital Signal Processing - PowerPoint PPT Presentation

1 / 38
About This Presentation
Title:

Advanced Digital Signal Processing

Description:

Speech production can be thought as an acoustic filtering operation: ... Spectrogram. Spectrogram with LPC roots. rk are poles. Cepstral Analysis ... – PowerPoint PPT presentation

Number of Views:2671
Avg rating:3.0/5.0
Slides: 39
Provided by: Jus6
Category:

less

Transcript and Presenter's Notes

Title: Advanced Digital Signal Processing


1
DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF
JOENSUU JOENSUU, FINLAND
  • Advanced Digital Signal Processing
  • Lecture 11
  • LPC. Cepstral Analysis.
  • Alexander Kolesnikov

2
Speech Production
3
Speech production mechanism
Vocal tract pharengeal and oral cavities Nasal
tract nasal cavities Articulators vocal folds
(cords), soft palate (or velum), tongue, teeth
and lips.
4
Speech production as filtering
Speech production can be thought as an acoustic
filtering operation 1) Excitation (input) is
provided by the organs below the vocal
tract (e.g., larynx, lungs) 2) Filter vocal
and nasal tracts. Articulators change the shape
of the filter.
5
Simplified model of speech production
x(n)
h(n)
u(n)
6
Linear Predictive Coding
7
LPC model
Given speech sample at time n, x(n), can be
approximated as a linear combination of the past
p speech samples, such that
This approach is based on model of speech
production
u(n) is a normalized excitation, G is the gain
of the excitation.
8
Transfer function
Recursive equation
Z-transform
Transfer function
H(z) is all-pole IIR filter.
9
Prediction error
The linear combination of p past speech samples
is the estimate or prediction
Prediction error
10
Model learning
The basic problem of linear prediction is to
determine the set of predictor coefficients
a1,,ap, directly from the speech signal
Find such a set of predictor coefficients
a1,,ap that minimize the mean-squared
prediction error
11
Minimum of the prediction error
12
Correlation function
where K(i,j) is correlation function
13
Yule-Walker equations
Mean squared prediction error
Solution is given by the system of linear
equations
Yule-Walker equations
14
Theory
Yule-Walker equations
Prediction error
15
Practice
In the absence of of knowledge about probability
distribution... Estimate correlation function
over a short segment of speech waveform.
1) Autocorrelation method 2) Covariance method
16
1) Autocorrelation method
Signal is windowed in order to minimize
discontinuities at beginning and end of the
interval
w(n) is a finite length window of length N
samples.
n0
nN-1
n0
17
1) Autocorrelation method
This is Toeplitz matrix. The system has fast
algorithm for solution (see Levinson-Durbin
recursive algorithm, O(N2) operations)
18
2) Covariance method
Fix the interval over which the mean-squared
error is computed
19
2) Covariance method
The matrix is still symmetric, but is no longer
Toeplitz. The matrix inversion O(N3) operations.
20
Three methods
1) The autocorrelation method 2) The covariance
method 3) The lattice method.
21
Example
Covariation and autocorrelation methods
22
Normalized prediction error
where
Prediction error as a function of predictor order
p.
23
Spectral analysis via LPC
24
Spectrogram
25
Spectrogram with LPC roots
rk are poles.
26
Cepstral Analysis
27
A homorphic transformation
A homorphic transformation y(n)D(x(n)) is a
transformation that converts a convolution
into a sum
In simple words separate (high frequency) source
from the (low frequency) filter
28
Fourier Transform
Given a discrete-time signal x(n).
29
Cepstral transform
Speech
Cepstrum
DFT
Window
log
IDFT
Quefrency
30
Example 1 Signal
t 00.011.27 x0 sin(2pi45t) x x0
0.5zeros(1,20) x0(1108) f145 Hz f0100
Hz ?t0.2 sec
Time, sec
0.2 sec
Signal contains an echo, with 0.5 amplitude, 0.2
seconds after the beginning of the signal.
31
Example 1 DFT
Time, sec
S fft(x,128) f f0(063)/128 plot(f,abs(S(1
64)))
Frequency, Hz
45 Hz
32
Example 1 Cepstrum
Frequency, sec
C cceps(x) plot(t,C)
Quefrency, sec
Cepstrum extracts periodicity in spectrum
33
Example 1 How?
DFT
Time, sec
Frequency, Hz
IDFT
semilogy(f,abs(S(164)))
echo?oscilations in S(f)?peaks in C(q)
Quefrency, sec
34
Example 2 Signal and DFT
Time, sec
yhamming(512) . x
Sabs(fft(y))
Frequency, Hz
35
Example 2 DFT and Cepstrum
Quefrency, sec
Frequency, Hz
Cifft(log(S))
Sfft(y)
36
Example 2 Cepstrum
The first 20 cepstral coefficients
The lower numbered coefficients provide the
envelope information. The remainder of the
detail is mostly contained in the peaks which
are separated by the pitch period ( 70 sample)
and provide the fine detail pitch information.
37
Example Cepstrum of phones
Quefrency, sec
Quefrency, sec
38
LPC ? Cepstrum
Transfer function of LPC filter
Take the logarithm
where coefficients h(n) can be obtained by the
recursion
Write a Comment
User Comments (0)
About PowerShow.com