Voice Activated Un-Lock Technology - PowerPoint PPT Presentation

1 / 41

About This Presentation

Title:

Voice Activated Un-Lock Technology

Description:

Voice Activated Un-Lock Technology V.A.U.L.T A Matlab based Simulation Robot Control via Voice 25 pin D-type Male Connector Parallel port of computer :3 registers ... – PowerPoint PPT presentation

Number of Views:420

Avg rating:3.0/5.0

Slides: 42

Provided by: Visha2

Category:

more less

Transcript and Presenter's Notes

Title: Voice Activated Un-Lock Technology

1
Voice Activated Un-Lock Technology

V.A.U.L.T
A Matlab based Simulation

2
By

Siddharth Advani B2213401
Anand Gokhale B2213420
Vishal Jain B2213426
Guided by
Dr. P.M. Patil

3
OBJECTIVE

Correct decision on a speakers identity claim
given a speech segment (password)

4
MOTIVATION

Speech contains speaker specific characteristics
Voiceprint as a biometric (distinguishing trait)
Natural economical way of identification

5
DEFINATIONS

Client speaker registered on the system
Impostor speaker who claims a false identity

Mel-filtering a frequency scaling that takes
into account the fact that the ear is sensitive
to linear changes in frequency below 1000 Hz and
logarithmic change in frequency above 1000 Hz
6
What is Simulation?

A simulation is the imitation of the operation of
a real world process or system over
time.
Using MATLAB as a tool, VAULT aims at simulating
a voice recognition system

7
Software Implementation
8
MATLAB

Features
Interpreter ? Meant for simulation in RD
High performance numerical computation
Signal Processing Toolbox

9
Visual Basic

Features
Easy to implement.
Very user friendly, interactive.
Compatible with MATLAB and any Windows version.
Less complicated than the GUI of MATLAB.
Any Microsoft application can be embedded in the
VB.

10
Zones Of VAULT
11
Phase 1 - Identification
FEATURE EXTRACTION
PATTERN RECOGNITION
USER ID
WORD
SYSTEM DATABASE
TRAINING
12
PHASE 1 - IDENTIFICATION
THE WORD IS DIVIDED INTO SEGMENTS 256 SAMPLES IN
EACH SEGMENT
13
PHASE 1 - IDENTIFICATION
PROCESS
FEATURE EXTRACTION
VECTOR QUANTIZER
DECISION
WORD
14
PHASE 1 - IDENTIFICATION
CEPSTRUM COEFFICIENTS ARE QUANTIZED USING A
CODEBOOK OF 128 VECTORS
15
PHASE 1 - IDENTIFICATION
1
2
CLIENT
3
3
DISTANCE12
4
16
Database
Identification
EVERY SPEAKER IS GIVEN A TAG
Zero
4
17
PHASE 2 - Authentication
ACCEPT REJECT
PATTERN RECOGNITION
FEATURE EXTRACTION
PASS-WORD
SYSTEM DATABASE
TRAINING
18
PHASE 2 - AUTHENTICATION
19
PHASE 2 - AUTHENTICATION
USER
20
PHASE 2 - AUTHENTICATION
THRESHOLD
PASSWORD
DECISION
ACCEPT/ REJECT
DISTANCE
USER
CLIENT
THRESHOLD DECIDES THE DECISION
21
Main Obstacle

How to define and extract the unique features of
human voice
CEPSTRUM
cepstrum(frame)IDFT(log(DFT(frame)))

22
PATTERN MATCHING
Dynamic Time Warping Vector Quantization Nearest
Neighbour
Hidden Markov Model Gaussian Mixture Model
23
VECTOR QUANTIZATION

Goal finding how the data is clustered
A (feature) vector space is broken into cells
Speaker model codebook
Codebook set of prototype vectors (codevectors)
Codevector vector computed from "similar" single
(feature) vectors (e.g. 8 cepstrum coefficients
makes 1 codevector)

24
CLUSTERING
25
RESULTS
THRESHOLD 5
REJECT
ACCEPT
26
PERFORMANCE EVALUATION

False Rejection (FR)
A client request as himself/herself is rejected
False Acceptance (FA)
An impostor request as a client is accepted
Genuine Acceptance (GA)
A client request as himself/herself is accepted

27
ACCURACY

FAR (False Acceptance Rate) Prob. of false
acceptance
Estimate false acceptances
----------------------------------------
false claims
FRR (False Rejection Rate) Prob. of false
rejection
Estimate false rejections
----------------------------------------
true claims
GAR (Genuine Acceptance Rate) Prob. of genuine
acceptance
Estimate true acceptances
----------------------------------------
true claims

28
GRAPHS
29
THRESHOLD

The threshold T can be determined by
1) choosing T to satisfy a fixed FA or FR
criterion
2) varying T to find different FA/FR ratios and
choosing T to give the desired FA/FR ratio.

30
SOURCES OF ERROR