Title: Voice source characterisation
1Voice source characterisation
- Gerrit Bloothooft
- UiL-OTS Utrecht University
2Voice research
- To describe and model the properties of the
vocal sound source from view points of - Physiology
- Acoustics
- Perception
3Importance of the voice
- Speech synthesis
- Towards natural sounding synthesis
- Speech recognition
- Using source properties in recognition
- Speaker recognition/identification
- Voice source characteristics are essential
- Diagnosis
- Pathologies, voice classifications
4Voice possibilities
- Limited use of voice in speech
- Range of the fundamental frequency
- Vocal intensity range
- Spectral variation
5Focus in this presentation
- How do acoustic voice source characteristics
vary as a functionof F0 and vocal intensity
6Voice profile measurement
- Thirties Intensity range as function of various
pitches - manual measurement
- Eighties Automatic computation ofF0 and
Intensity - computer measurement
- visual feedback
- additional parameters
7Measurement unit
- One decibel
- One semi-tone
8Measurement procedure
- Subject in front of computer screen
- Microphone on head set (30 cm)
- Just phonate, sing, and see the result
immediately - Best results with recording protocol
- Feed back stimulates extreme phonations
9Voice profile / density
Vocal Intensity (dB SPL)
Sample density
Fundamental frequency (Hz)
10Voice profile / speech area
Vocal Intensity (dB SPL)
Sample density
Fundamental frequency (Hz)
11Acoustic voice quality parameters
- Jitter
- Stability of periodicity
- Asymmetry in vocal folds
- Crest factor
- Max amplitude divided by average energy
- Relates to spectral slope
- Many more
12Crest factor
Vocal Intensity (dB SPL)
Crest factor
Fundamental frequency (Hz)
13(No Transcript)
14Real time presentation
- Screen presentation
- One data point per F0-I cell
- Advanced data storage new
- Full audio signal
- Full distribution of data per F0-I cell
- Data for screen presentation
15Advantages
- Reusability of recordings
- Statistical analysis per F0-I cell
- Study of time-varying behavior
16Crest factor
Vocal Intensity (dB SPL)
Crest factor
Fundamental frequency (Hz)
17Crest factor median smoothed
Median smoothing of crest factor
Vocal Intensity (dB SPL)
Crest factor
Fundamental frequency (Hz)
18Vocal Registers
- Different movement patterns of the vocal folds
- Pulse register (creaky voice)
- Modal register
- Falsetto register
19Pulse register
- Less than 50 Hz
- Irregular
- Long closed period
20 Pulse register
Vocal Intensity (dB SPL)
Fundamental Frequency (Hz)
21Modal register
- Normal use of voice
- Active role of M. Vocalis
- Vocal folds thick and completely vibrating
- Wide range in F0 and intensity
- Flat spectrum
22Modal register
Vocal Intensity (dB SPL)
Fundamental frequency (Hz)
23Falsetto register
- Higher pitches
- M. Vocalis passive, tense vocal ligaments through
M.Cricothyroidus - Edge vibration of vocal volds
- Sound poor in higher harmonics (in untrained
subjects)
24Falsetto register
Vocal Intensity (dB SPL)
Fundamental frequency (Hz)
25Register overlap
Vocal Inensity (dB SPL)
Fundamental frequency (Hz)
26Chest- en head voice
- Refer to secundary vibratory sensations in the
body - Chest voice loud modal register
- Head voice
- males higher, softer modal register in overlap
area with falsetto register - women falsetto register
27Chest voice and Head voice
chest
Vocal Intensity (dB SPL)
head
Fundamental frequency (Hz)
28Registers and voice profiles
- With a description using
- Iso-crest factor lines
- Iso-jitter lines
29Iso-crest factor lines
6 dB
Vocal Intensity (dB SPL)
Crest factor
4 dB
Fundamental frequency (Hz)
30Iso-jitter lines
Vocal Intensity (dB SPL)
Jitter ()
3
Fundamental frequency (Hz)
31New representation
- Areas defined by iso-parameter lines
- crest factor lt 4 dB
- crest factor gt 4 dB, lt 6 dB
- crest factor gt 6 dB
- jitter lt 3
- relative rise time lt 6
32Areas in the phonetogram
RRT lt 6 pressed-like
Crest factor lt 4 dB sine-like
Vocal Intensity (dB SPL)
Jitter gt 3, unstable
Fundamental frequency (Hz)
33Vocal registers in the phonetogram
Chest voice boundary
Vocal Intensity (dB SPL)
Falsetto upper boundary
Modal lower boundary
Fundamental frequency (Hz)
34Comparison of voice profiles
- Characterisation of
- Voice pathologies
- Voice classifications
- Reuse stored voice profiles of subjects with
known voice history
35Important features
- Contour has limited value
- but most research goes into that direction (norm
profiles) - Distribution of acoustical parameters across the
voice profile tells much more
36We need
- Unit for comparison
- Voice profile unit defined by small range of F0
and Vocal Intensity - Distributions of acoustic voice parameters per
unit - Probability density function per parameter
- Model
- Hidden Markov Model
37Unit model
- two unconnected states per phonetogram unit
- vocal registers
- start and end of phonetion
38Correspondences
- Speech Voice Profile
- phoneme model F0/I unit model
- not labeled labeled by F0 and I
- spectral envelope acoustic voice parameters
- language model unrestricted transitions
- forced alignment
- recognition
39Crest factor distributions
40Most distinctive states
Vocal Intensity (dB SPL)
Distinctiveness
Fundamental frequency (Hz)
41Conclusions
- Voice profiles can enhance our understanding of
vocal behaviour in a visually attractive way - Current data storage opens a series of important
research topics - Market opportunities for light versions