Efficient Time-Scale Modification of Speech and Clear Voice Systems

Provided by: bobla46

Transcript and Presenter's Notes


1
Efficient Time-Scale Modification of Speech
and Clear Voice Systems
2
Time-Scale Modification
3
Frequency-Scale Modification
4
APPLICATIONS
Speech related applications include
SA1 Speech synthesis - based on acoustical unit concatenation
SA2 Foreign language learning
SA3 Audio-typing, and the training thereof
SA4 Accelerated aural reading for the blind
SA5 Voice mail speed up / slow down
SA6 Voice transformation - e.g. making a female voice sound male
SA7 Speech recognition
SA8 Film/speech synchronization
SA9 Speech compression
SA10 Noise reduction
5
APPLICATIONS (cont'd)
Music related applications include
MA1 Music transposition
MA2 Music study and editing
MA3 Film/soundtrack synchronization
MA4 Audio compression
MA5 Noise reduction
6
Existing Approaches
1. Time-domain techniques. Most of the early algorithms fall into this category. They are based on overlap-add (OLA) methods.
2. Frequency-domain techniques. Most of the algorithms which have been grouped in this category are based on short-time Fourier transform (STFT) or phase vocoder methods and as such might strictly be considered joint time-frequency techniques.
3. Parametric techniques. These algorithms are based on modeling the audio signal production mechanism and then modifying the resulting model parameters to realise the required TSM/FSM of the signal by resynthesis from the modified parameters. The majority of the algorithms in this group are based on the linear predictive (LP) model of speech production and as such have been mainly directed at speech-related applications.
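The overlap-add idea behind category 1 can be sketched in a few lines. This is a minimal illustration, not the method of any specific paper cited in the slides: frames are read from the input at one hop size and written to the output at another, with a window and a normalisation. The function name `ola_time_scale` and the parameters `alpha`, `frame_len` and `synth_hop` are assumptions chosen for this example.

```python
import numpy as np


def ola_time_scale(x, alpha, frame_len=512, synth_hop=128):
    """Plain overlap-add time-scale modification (illustrative sketch).

    alpha > 1 lengthens the signal, alpha < 1 shortens it.
    All parameter names are hypothetical, chosen for this example.
    """
    x = np.asarray(x, dtype=float)
    if alpha <= 0 or len(x) < frame_len:
        raise ValueError("alpha must be positive and x at least one frame long")
    window = np.hanning(frame_len)
    analysis_hop = synth_hop / alpha           # read step in the input
    n_frames = int((len(x) - frame_len) / analysis_hop) + 1
    out_len = (n_frames - 1) * synth_hop + frame_len
    y = np.zeros(out_len)
    norm = np.zeros(out_len)
    for m in range(n_frames):
        a = int(round(m * analysis_hop))       # read position in the input
        s = m * synth_hop                      # write position in the output
        y[s:s + frame_len] += window * x[a:a + frame_len]
        norm[s:s + frame_len] += window
    norm[norm < 1e-8] = 1.0                    # window is zero at the very edges
    return y / norm
```

Because the frames are not aligned to the waveform before they are added, plain OLA tends to produce audible phase discontinuities on pitched signals; the synchronized variant (SOLA) covered later in the deck addresses this.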
7
Existing Approaches (cont'd)
1. Time-domain Overlap-Add methods, also called sampling methods, splice methods, circular-buffer methods. This category is the same as category (1) above.
2. STFT/VOCODER methods. This is the same as category (2) above.
3. LPC (linear predictive coding)-based methods. This is very similar to category (3) above except that now it excludes non-LPC-based parametric models.
4. Methods based on modelling the signal as a sum of sinusoids with time-varying parameters.
5. Methods based on decomposing the signal into a sinusoidal part and a stochastic part.
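Categories 4 and 5 can be written compactly as signal models. The notation below is an assumption (the transcript carries no formulas): A_k(t) and \theta_k(t) denote the time-varying amplitude and phase of the k-th sinusoid, K(t) the number of sinusoids, and e(t) the stochastic part.

```latex
% Category 4: sum of sinusoids with time-varying parameters
s(t) \approx \sum_{k=1}^{K(t)} A_k(t)\,\cos\bigl(\theta_k(t)\bigr)

% Category 5: sinusoidal part plus stochastic part
s(t) = \sum_{k=1}^{K(t)} A_k(t)\,\cos\bigl(\theta_k(t)\bigr) + e(t)
```

In both cases TSM amounts to resynthesising from parameter tracks whose time axis has been stretched or compressed.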
8
Time-Domain Overlap-Add Methods
Dudley's pitch-synchronous gating, 1938
9
Gabor's modified sound-film projector, Gabor 46
10
Fairbanks' modified tape recorder, Fairbanks 54
We hasten the boy off my garage path to show
which edge young owls could view
11
Shift register implementation of time-scale
compression by the sampling method, Lee 72
12
RAM based implementation of TSM by the sampling
method, Lee 72
13
Synchronized Overlap-and-Add (SOLA) Roucos 85
14
STFT/VOCODER Methods
Dudley's vocoder, Dudley 39
15
Flanagan's phase vocoder analysis, Flanagan 66
16
Flanagan's phase vocoder synthesis, Flanagan 66
17
Linear Prediction Methods
Dudley's speech production model
18
Simulating a female voice from parameters
derived from a male voice, Atal 71
19
Sinusoidal Modelling Methods
Sinusoidal analysis/synthesis system, Quatieri 85
20
Sinusoidal Plus Stochastic Modelling Methods
Analysis part of the SMS system, Serra 90
21
Review Conclusion
TSM/FSM Approach Comparison
22
Synchronized Overlap-Add (SOLA)
SOLA Time-Scale Expansion and Compression
23
Normalised cross-correlation measure
Simplified normalised cross-correlation measure
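The slide formulas are not preserved in this transcript, so the following is only a sketch of how SOLA can use a normalised cross-correlation to align each incoming frame with the output already written. It uses the textbook normalised form, not the simplified measure the slide refers to. The names `ncc` and `sola` and the parameters `alpha`, `frame_len`, `synth_hop` and `max_lag` are assumptions made for this example.

```python
import numpy as np


def ncc(u, v):
    """Normalised cross-correlation of two equal-length segments (-1..1)."""
    denom = np.sqrt(np.dot(u, u) * np.dot(v, v))
    return float(np.dot(u, v) / denom) if denom > 0 else 0.0


def sola(x, alpha, frame_len=400, synth_hop=200, max_lag=50):
    """Time-scale x by roughly alpha (> 1 expands, < 1 compresses).

    Illustrative sketch; all parameter names are hypothetical.
    """
    x = np.asarray(x, dtype=float)
    if (alpha <= 0 or synth_hop <= 0 or max_lag < 0
            or 2 * max_lag > synth_hop
            or synth_hop + 2 * max_lag >= frame_len):
        raise ValueError("parameters must keep every frame overlapping the output")
    if len(x) < frame_len:
        raise ValueError("input shorter than one frame")
    analysis_hop = synth_hop / alpha
    n_frames = int((len(x) - frame_len) / analysis_hop) + 1
    y = x[:frame_len].copy()                   # first frame is copied as-is
    for m in range(1, n_frames):
        a = int(round(m * analysis_hop))       # read position in the input
        frame = x[a:a + frame_len]
        best_start, best_score = None, -np.inf
        # Search around the nominal write position for the best alignment.
        for k in range(-max_lag, max_lag + 1):
            start = m * synth_hop + k
            overlap = len(y) - start
            score = ncc(y[start:], frame[:overlap])
            if score > best_score:
                best_start, best_score = start, score
        overlap = len(y) - best_start
        fade = np.linspace(0.0, 1.0, overlap)  # linear cross-fade over the overlap
        mixed = (1.0 - fade) * y[best_start:] + fade * frame[:overlap]
        y = np.concatenate([y[:best_start], mixed, frame[overlap:]])
    return y
```

The inner search evaluates the correlation at every candidate lag for every frame, which is where most of the computation goes.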
24
A Novel Time-Scale Modification Algorithm
Adaptive Overlap-Add (AOLA)
25
AOLA vs SOLA computational load comparison
26
TSM results for time-scale factors 0.5 and 2.0. Utterance "water" from the TIMIT Speech Corpus (DARPA), signal name \TIMIT\TEST\DR1\FELC0\SA1.WAV