Voice source characterisation - PowerPoint PPT Presentation

1 / 41
About This Presentation
Title:

Voice source characterisation

Description:

Title: Het tijd-stem continuum Author: PCM Last modified by: User Created Date: 1/23/2001 11:37:39 AM Document presentation format: On-screen Show Company – PowerPoint PPT presentation

Number of Views:81
Avg rating:3.0/5.0
Slides: 42
Provided by: pcm79
Category:

less

Transcript and Presenter's Notes

Title: Voice source characterisation


1
Voice source characterisation
  • Gerrit Bloothooft
  • UiL-OTS Utrecht University

2
Voice research
  • To describe and model the properties of the
    vocal sound source from view points of
  • Physiology
  • Acoustics
  • Perception

3
Importance of the voice
  • Speech synthesis
  • Towards natural sounding synthesis
  • Speech recognition
  • Using source properties in recognition
  • Speaker recognition/identification
  • Voice source characteristics are essential
  • Diagnosis
  • Pathologies, voice classifications

4
Voice possibilities
  • Limited use of voice in speech
  • Range of the fundamental frequency
  • Vocal intensity range
  • Spectral variation

5
Focus in this presentation
  • How do acoustic voice source characteristics
    vary as a functionof F0 and vocal intensity

6
Voice profile measurement
  • Thirties Intensity range as function of various
    pitches
  • manual measurement
  • Eighties Automatic computation ofF0 and
    Intensity
  • computer measurement
  • visual feedback
  • additional parameters

7
Measurement unit
  • One decibel
  • One semi-tone

8
Measurement procedure
  • Subject in front of computer screen
  • Microphone on head set (30 cm)
  • Just phonate, sing, and see the result
    immediately
  • Best results with recording protocol
  • Feed back stimulates extreme phonations

9
Voice profile / density
Vocal Intensity (dB SPL)
Sample density
Fundamental frequency (Hz)
10
Voice profile / speech area
Vocal Intensity (dB SPL)
Sample density
Fundamental frequency (Hz)
11
Acoustic voice quality parameters
  • Jitter
  • Stability of periodicity
  • Asymmetry in vocal folds
  • Crest factor
  • Max amplitude divided by average energy
  • Relates to spectral slope
  • Many more

12
Crest factor
Vocal Intensity (dB SPL)
Crest factor
Fundamental frequency (Hz)
13
(No Transcript)
14
Real time presentation
  • Screen presentation
  • One data point per F0-I cell
  • Advanced data storage new
  • Full audio signal
  • Full distribution of data per F0-I cell
  • Data for screen presentation

15
Advantages
  • Reusability of recordings
  • Statistical analysis per F0-I cell
  • Study of time-varying behavior

16
Crest factor
Vocal Intensity (dB SPL)
Crest factor
Fundamental frequency (Hz)
17
Crest factor median smoothed
Median smoothing of crest factor
Vocal Intensity (dB SPL)
Crest factor
Fundamental frequency (Hz)
18
Vocal Registers
  • Different movement patterns of the vocal folds
  • Pulse register (creaky voice)
  • Modal register
  • Falsetto register

19
Pulse register
  • Less than 50 Hz
  • Irregular
  • Long closed period

20
Pulse register
Vocal Intensity (dB SPL)
Fundamental Frequency (Hz)
21
Modal register
  • Normal use of voice
  • Active role of M. Vocalis
  • Vocal folds thick and completely vibrating
  • Wide range in F0 and intensity
  • Flat spectrum

22
Modal register
Vocal Intensity (dB SPL)
Fundamental frequency (Hz)
23
Falsetto register
  • Higher pitches
  • M. Vocalis passive, tense vocal ligaments through
    M.Cricothyroidus
  • Edge vibration of vocal volds
  • Sound poor in higher harmonics (in untrained
    subjects)

24
Falsetto register
Vocal Intensity (dB SPL)
Fundamental frequency (Hz)
25
Register overlap
Vocal Inensity (dB SPL)
Fundamental frequency (Hz)
26
Chest- en head voice
  • Refer to secundary vibratory sensations in the
    body
  • Chest voice loud modal register
  • Head voice
  • males higher, softer modal register in overlap
    area with falsetto register
  • women falsetto register

27
Chest voice and Head voice
chest
Vocal Intensity (dB SPL)
head
Fundamental frequency (Hz)
28
Registers and voice profiles
  • With a description using
  • Iso-crest factor lines
  • Iso-jitter lines

29
Iso-crest factor lines
6 dB
Vocal Intensity (dB SPL)
Crest factor
4 dB
Fundamental frequency (Hz)
30
Iso-jitter lines
Vocal Intensity (dB SPL)
Jitter ()
3
Fundamental frequency (Hz)
31
New representation
  • Areas defined by iso-parameter lines
  • crest factor lt 4 dB
  • crest factor gt 4 dB, lt 6 dB
  • crest factor gt 6 dB
  • jitter lt 3
  • relative rise time lt 6

32
Areas in the phonetogram
RRT lt 6 pressed-like
Crest factor lt 4 dB sine-like
Vocal Intensity (dB SPL)
Jitter gt 3, unstable
Fundamental frequency (Hz)
33
Vocal registers in the phonetogram
Chest voice boundary
Vocal Intensity (dB SPL)
Falsetto upper boundary
Modal lower boundary
Fundamental frequency (Hz)
34
Comparison of voice profiles
  • Characterisation of
  • Voice pathologies
  • Voice classifications
  • Reuse stored voice profiles of subjects with
    known voice history

35
Important features
  • Contour has limited value
  • but most research goes into that direction (norm
    profiles)
  • Distribution of acoustical parameters across the
    voice profile tells much more

36
We need
  • Unit for comparison
  • Voice profile unit defined by small range of F0
    and Vocal Intensity
  • Distributions of acoustic voice parameters per
    unit
  • Probability density function per parameter
  • Model
  • Hidden Markov Model

37
Unit model
  • two unconnected states per phonetogram unit
  • vocal registers
  • start and end of phonetion

38
Correspondences
  • Speech Voice Profile
  • phoneme model F0/I unit model
  • not labeled labeled by F0 and I
  • spectral envelope acoustic voice parameters
  • language model unrestricted transitions
  • forced alignment
  • recognition

39
Crest factor distributions
 
40
Most distinctive states
Vocal Intensity (dB SPL)
Distinctiveness
Fundamental frequency (Hz)
41
Conclusions
  • Voice profiles can enhance our understanding of
    vocal behaviour in a visually attractive way
  • Current data storage opens a series of important
    research topics
  • Market opportunities for light versions
Write a Comment
User Comments (0)
About PowerShow.com