FPMS TCTS - PowerPoint PPT Presentation

1 / 25

About This Presentation

Title:

FPMS TCTS

Description:

Communications immersives pour la narration tendue par les m langes audiovisuels ... Detection of embolism from tomography-based images. SYPOLE (RW/INITIATIVES) ... – PowerPoint PPT presentation

Number of Views:75

Avg rating:3.0/5.0

Slides: 26

Provided by: thie67

Category:

more less

Transcript and Presenter's Notes

Title: FPMS TCTS

1
FPMS - TCTS

Circuit Theory And Signal Processing Lab
Avenue Copernic 1, B-7000 Mons
http//www.tcts.fpms.ac.be

TCTS as a teaching unit
4 academics
2 technicians
courses
Circuit Theory
Switching Theory
(digital) Signal Processing
Filtering Theory
Information Theory
Biomedical Signal Processing Imaging
Computers Architectures
Automated Manufacturing Systems
Embedded Systems

TCTS as a teaching unit
4 academics
2 technicians
courses
Circuit Theory
Switching Theory
(digital) Signal Processing
Filtering Theory
Information Theory
Biomedical Signal Processing Imaging
Computers Architectures
Automated Manufacturing Systems
Embedded Systems

Important contribution in 3 certificates
(thematic modules) Multimedia ,
Bio-engineering Embedded systems
4

TCTS as a research unit
25 researchers
20 projects

TCTS partnership with MULTITEL asbl
15 admins
65 engineers

TCTS research tracks
Speech processing
Speech Synthesis (TTS)
Speech Recognition (ASR)
Voice quality assessment
Audio processing
Image processing
Biomedical data processing
Hardware for signal processing

TCTS research ongoing projects
CINEMA (RW/INIATIVES)Communications immersives
pour la narration étendue par les mélanges
audiovisuels
COST277 (EU/COST)Non linear speech processing
COST278 (EU/COST)Spoken language interaction in
telecommunication
DOMINI (RW/FIRST-EUROPE)Multimodal access and
managment of digital medical records
DREAMS (RW/WALEO-BIOMEDICAL)Development of
automatic method for polysomnogram (sleep
recording) diagnosis
F3M (RW/RESEAU-I)Multimodal solutions for using
wearable computers in wireless environments
GRA2PHO (RW/FIRST-EUROPE)Methods for multiple
grapheme-to-phoneme transcription (required for
ASR)
ICC (RW/WIST-IT)Multimodal interface for
automatic interpretation of natural sketch and
plan
iMED (RW/FIRST-EUROPE)Detection of embolism from
tomography-based images

TCTS research ongoing projects (contd)
MAIS (RW/WIST-IT)Low-cost, low-consumption smart
card for distant reading (windshield embedded)
MERCATOR (RW/WALEO-BIOMEDICAL)Image
coregistration for video-enhanced surgery
MOUSTIC (RW/FIRST-SPINOFF)New frameworks for
broadcasting traffic information (radio and GSM
networks)
MUSIQUE (RW/FIRST-SPINOFF)Fault detection in
production process by multiple signal analysis
PIST (RW/WIST-IT)Positionning in transport based
on heterogeneous sensors
PSOC-ASR (RW/FIRST-EUROPE)Speech recognition on
Programmable-System-On-a-Chip (dedicated
hardware)
STOP (RW/EUREKA)Speech dynamics and voice
quality analysis for improved speech synthesis
SIMILAR (EU/FP6-NoE)Detection of embolism from
tomography-based images
SYPOLE (RW/INITIATIVES)Portable device for
text/image recognition and speech synthesis to
blind people

TCTS resources for IRMA project
Long research record in ASR
COST249 (EU/COST, 1994-2000)Continuous speech
recognition over the telephone network
COST278 (EU/COST, 2001- )Spoken language
interaction in telecommunication
HIRMANET (EU/FP6-NoE, 1993-1995)Hidden Markov
models and neural networks for ASR
SPRACH (EU/ESPRIT, 1995-1998) Improvement of HMM
/ ANN speech recognition algorithms
THISL (EU/ESPRIT, 1997-2000)Thematic indexing of
spoken language
RESPITE (EU/ESPRIT, 1998-2001)Recognition of
speech by partial information techniques
DIVINES (EU/FP6-STREP, 2004-2007)Diagnostic and
intrinsic variabilities in natural speech

collaboration with MULTITEL
10

TCTS resources for IRMA project
Long research record in ASR
COST249 (EU/COST, 1994-2000)Continuous speech
recognition over the telephone network
COST278 (EU/COST, 2001- )Spoken language
interaction in telecommunication
HIRMANET (EU/FP6-NoE, 1993-1995)Hidden Markov
models and neural networks for ASR
SPRACH (EU/ESPRIT, 1995-1998) Improvement of HMM
/ ANN speech recognition algorithms
THISL (EU/ESPRIT, 1997-2000)Thematic indexing of
spoken language
RESPITE (EU/ESPRIT, 1998-2001)Recognition of
speech by partial information techniques
DIVINES (EU/FP6-STREP, 2004-2007)Diagnostic and
intrinsic variabilities in natural speech

strong expertise in speaker-independent robust
ASR for small to medium vocabulary
collaboration with MULTITEL
11

TCTS resources for IRMA project
Past research in speaker recognition
ARTHUR (RW, 2000-2003)Telecommunications for
emergency services
COST250 (EU/COST, 1995-2000)Speaker recognition
in telephony
NIST evaluation (USA/DARPA, 2000)Speaker
recognition track

collaboration with MULTITEL
12

TCTS resources for IRMA project
Past research in speaker recognition
ARTHUR (RW, 2000-2003)Telecommunications for
emergency services
COST250 (EU/COST, 1995-2000)Speaker recognition
in telephony
NIST evaluation (USA/DARPA, 2000)Speaker
recognition track

expertise in speaker identification and speaker
verification
collaboration with MULTITEL
13

TCTS resources for IRMA project
Long development experience
MODIVOC (RW/INITIATIVES, 2002-2004) Distributed
speech recognition
ICC, DOMINI, PSOC-ASR, F3M, SIMILAR
ALLADIN (EU/FP6-STREP, 2004-2007)Natural
language based decision support in
neuro-rehabilitation
MAJORCAL (RW/EUREKA, 2003-2005)Customer
relationship management call center
LASCOT (RW/EUREKA, 2003-2005)Large scale
collaborative decision support technology
WEARIT_at_WORK (EU/FP6-IP, 2004-2009)Empowering
the mobile worker by wearable computing

collaboration with MULTITEL
14

TCTS resources for IRMA project
Long development experience
MODIVOC (RW/INITIATIVES, 2002-2004) Distributed
speech recognition
ICC, DOMINI, PSOC-ASR, F3M, SIMILAR
ALLADIN (EU/FP6-STREP, 2004-2007)Natural
language based decision support in
neuro-rehabilitation
MAJORCAL (RW/EUREKA, 2003-2005)Customer
relationship management call center
LASCOT (RW/EUREKA, 2003-2005)Large scale
collaborative decision support technology
WEARIT_at_WORK (EU/FP6-IP, 2004-2009)Empowering
the mobile worker by wearable computing

strong expertise in software developpement
collaboration with MULTITEL
15

TCTS contributions in IRMA project
Automatic speech recognition for
Indexation of spoken material
Capture of spoken input in retrieval interface
Speech-based biometrics
Speaker-based document segmentation
User verification

TCTS contributions in IRMA project
ASR for spoken document indexation and
retrieval an important problem
ConferencesICASSP2004Topics in Speech
Understanding Systems
ICSLP2004Spoken and Natural Language
Understanding (I and II)Spoken Language
Identification, Translation and Retrieval (I and
II)
LREC2004Spoken Indexation and Retrieval
ICASSP2005Spoken Language Understanding and
Dialog
INTERSPEECH2005Spoken Language Extraction and
Retrieval (I and II)Spoken Language
Understanding (I and II)

TCTS contributions in IRMA project
ASR for spoken document indexation and
retrieval an important problem
EvaluationsNIST (DARPA)Broadcast news
transcription and indexation in English
ESTER (AFGP)Broadcast news transcription and
indexation in French
ProjectsTHISL www.dcs.shef.ac.uk/spandh/projects/
thisl/Thematic indexing of spoken language
AMI www.amiproject.orgMulti-modal interaction
in the contexte of meetings
M4 www.m4project.org Structuring, browsing and
querying of an archive of automatically analysed
meetings

TCTS contributions in IRMA project
ASR for spoken document indexation and
retrieval an important problem
PRODUCTSAURIX Speech Recognition
Systemwww.aurix.com/product/index.htm
NEXIDIA Speech Recognition Systemwww.nexidia.com
/product.asp
SCANSOFT Dragon AudioMiningwww.scansoft.com/audi
omining
BBN AudioIndexerwww.bbn.com/For_Government_Custo
mers/Data_Indexing_and_Mining/Audio_Indexer.htmlA
RTSUMwww.artsum.netAutonomy -
Viragewww.blinkx.tv

TCTS contributions in IRMA project
ASR for spoken document indexation and
retrieval an important problem
Open Software for LVCSRSphinxcmusphinx.sourcefor
ge.net/html/cmusphinx.php
HTKhtk.eng.cam.ac.uk
ISIPwww.cavs.msstate.edu/hse/ies/projects/speech
/software/
STRUTwww.tcts.fpms.ac.be/asr/

TCTS contributions in IRMA project
STRUT (Speech Training and Recognition
Toolkit)www.tcts.fpms.ac.be/asr/

- plate-forme logicielle pour le prototypage
rapide de systèmes de reconnaissance
automatique de parole
- flexibilité de la modélisation acoustique
(dépendant ou non du locuteur, modèles de mots
ou de phonèmes, etc)
- flexibilité de la modélisation grammaticale
(parole continue ou mots isolés, lexique
jusquà 10000 mots, grammaire détats finis, etc)
- analyses acoustiques robustes (bruit,
réverbération, microphone, etc)
- détection de parole, mesures de confiance, etc
- disponibilité dans la plupart des langues
européennes et laméricain
développement rapide de nouvelles langues /
lexiques / grammaires
EAR STRUT-based SDK

TCTS contributions in IRMA project
ASR for spoken document indexation and
retrieval where do we operate ?

Lexicon
Grammar
Feature Extraction
Acoustic Model
Decoder
Words(or N-best, lattices)
Acoustic vectors
Local classification(eg, phonemes)
Speech signal
I cost ltltltR cost gtgtgt
I cost ltltR cost gtgt
I cost gtgtR cost ltlt
I cost gtgtgtR cost ltltlt
Out-Of-Vocbulary (OOV) words ?
22