ArtificialBrainandOffice Matebasedon BrainInformationProcessingMechanism - PowerPoint PPT Presentation

1 / 29

About This Presentation

Title:

ArtificialBrainandOffice Matebasedon BrainInformationProcessingMechanism

Description:

(a) SNR weighted ITD histogram (b) local peaks of the histogram ... Assign each zero-crossing interval power to one of the nearest ITD source ... – PowerPoint PPT presentation

Number of Views:27

Avg rating:3.0/5.0

Slides: 30

Provided by: youngi3

Category:

more less

Transcript and Presenter's Notes

Title: ArtificialBrainandOffice Matebasedon BrainInformationProcessingMechanism

1
Artificial Brain and Office Mate based on Brain I
nformation Processing Mechanism

2007. 9. 14
Young-Ik Kim
Brain Science Research Center, KAIST

2
Contents

Introduction- About Brain Neuro-Informatics
Research Programof BSRC, KAIST- Main
functionalities of human brain
Implementation of artificial brain system and its
mechanisms - Auditory part- Vision part- Agent
(Service) part
Artificial Brain System Demo
Toward more challenging problems

3
Introduction

About Brain Neuro-Informatics Research Program-
The third phase project of Brain Science Research
Center (BSRC) in KAIST. - Funded by Korean
Ministry of Commerce, Industry, and Energy. -
Complete research period 2004. 7 2008. 3
Research focus - Understanding brain
information processing mechanism- Developing
brain-like intelligent systems (Artificial Brain)

4
Introduction

Motivations - We have achieved a great
development of computer technologies, but the
ability of machines is limited to simple tasks
which require human beings have to order what to
do. - We lack the specific and concrete
algorithms to solve practical problems in the
real world. - A human brain is the best model
in solving practical problems in the real world,
and we came up with neural networks based on the
human neural information processing

5
Main Functionalities of Human Brain
6
Artificial Brain System - Development Env.

The development team- 11 research groups in 6
universities- 3 parts auditory, vision,
agent (secretary)
Each group generates functional modules-
developed independently- integrated using the
de-centralized system service (DSS) on the
Microsoft .Net framework. - The common language
runtime (CLR) property in .Net framework enables
each module can be developed in any languages
like C, C, Java, etc.

7
Artificial Brain System Overall Configuration
Expression Recognition
Speech Separation
Face Recognition
Object Recognition
Sound Localization
Speaker Recognition
Speech Recognition
Attention Area
Vision Module
Auditory Module
Stereo- Camera
Stereo- Microphone
TCP/IP
Service Module (Agent)
Speaker
Robot Head Movement
Text-to- Speech
Robot Control
Response Sentence Generation
Knowledge-Base
Context Analysis
Dialog Manager
8
Auditory Part Module Diagram

Flow diagram for auditory perception

Speech
Active Noise Canceller
Auditory Filterbank
Voice Activity Detection
Stereo- Microphone
Noises
Speech Recognition
BSS (ICA)
Sound Localization
Speaker Recognition
Masking
Keyword Recognition
9
Auditory Part Mechanisms
I. Binaural pathways and sound localization

The superior olivary complex (SOC) receives
bilateral ascending input from the auditory
ventral cochlea nucleus (AVCN) and descending
input from the ipsilateral inferior colliculus
(IC).
The medial superior olive (MSO) cells are
sensitive to interaural time difference (ITD) and
the lateral superior olive (LSO) cells are
sensitive to interaural intensity difference
(IID).

The auditory signal is represented by the time at
which upward zero-crossing occurs and the peak
amplitude within the zero-crossing interval (D.
Kim et.al., 1999).
Binaural cue extraction- detect zero-crossing
times - measure zero-crossing interval powers
- The ITD and IID

SNR estimation (Y. Kim et.al, 2007)
Identification of reliable ITD samples (a)
filtered signal (b) measured ITDs(c) SNR
estimation(d) selected ITDs with SNRgt15 dB

Localization of multiple sound sources(a) SNR
weighted ITD histogram(b) local peaks of the
histogram(c) normalized by the largest peak (d)
selected dominant peaks with threshold value 0.3

13
Auditory Part Mechanisms
II. Masking of interfering sounds

Cocktail party problem- Human speech perception
is robust in the presence of diffusive noise and
interfering sounds. - But, machine speech
recognition remains problematic in such
conditions.
Auditory masking? - When a sound is masked, it
is eliminated from perception as if the sound
never reached the ear. - Sound source can be
segregated by identifying the segments of the
sources in the time-freq. domain.

Directional mask estimation (Y. Kim et.al., 2006)
- Assign each zero-crossing interval power to
one of the nearest ITD source- Mask based on the
target-to-interferers power ratio for each
time-freq. segments
Example mask estimation (a mixture of 3 sounds)-
Target and interfering speeches located at 0,
-30, 30 degrees.
(a) Ideal mask
(b) Estimated mask

15
Vision Part Module Diagram

Flow diagram of visual perception

16
Vision Part Mechanisms
Biological visual pathway of bottom-up and
top-down processing
17

The segmentation problem?- finding different
objects in the image.. - But what is the image
of a single object?- Is a nose an object? Is a
head one?
Finding salient regions in an image! - Human
brain draws attention to the salient object in
the image. - The saliency of an image may be
determined by the combination of local and global
aspects.

The architecture of bottom-up saliency map model

(Choi et.al,2006)-
I intensity, E edge, S symmetry- CSDN
center-surround difference and normalization-
ICA independent component analysis- SM
saliency map, SP saliency point- IOR
inhibition of return

Experimental results of bottom-up selective
attention- The saliency map model generates
candidates of interesting regions.

20
Service Part - Modules
21
Service Part - Scenarios

Service domains of the OffceMate- schedule
management- patent search - new knowledge
acquisition from the internet - object
perception in an office
A Demo for the schedule management

22
Toward More Challenging Problems

Keyword spotting model with top-down attention
Context-dependent information processing

23
Selective Attention with an HMM (C. Lee et.al.,
2007)

Train HMMs with training set
For testing pattern, calc. likelihood for all
classes
Choose Nc best for candidates
For each model,
Set attention filter to 0.
Update attention filter
Calc. new likelihood of changed input
Repeat 2)-3) until likelihood converges
Calc. confidence measure M
Choose maximum M

23
24
Keyword Spotting Model with Attention
FB
VAD
signal
Compare Likelihood Decision Making
Confidence Measure
OOV Rejection
Attention Filter
Keyword?
Confidence Measure
OOV Rejection
Attention Filter
Activation Attention
25
Keyword spotting performance with SA
26
Context-dependent information processing

What is a context? - In memory, our experiences
are represented in structure that cluster
together with related information.- Little is
known about the neural underpinnings of
contextual analysis and scene perception.
Searching for relevant mechanisms - K-Line by
M. Minsky, The Society of Mind, 1986. - Sequence
seeking and counter streams by S. Ullman, Cereb.
Cortex, 1995. - Proactive brain using analogies
and associations by M. Bar, TRENDS in Cog. Sci.,
2007.

27
Translating analogies to predictive association
(M. Bar, 2007)
28
Context-dependent information processing

Some Big Questions! - What are the computational
mechanisms mediating the transformation of a past
memory into a future thought? - How does the
brain handle completely novel situations where no
reliable predictions can be generated? - ...
Try a keyword-based context generation - In
our service area, there are 4 static domains. -
Using the keywords in the domain, we can change
the context in our service domains. - But in
real situations, the keyword cannot be
pre-determined! - More dynamic context
generation and management are needed for
efficient services.

29
Conclusions

Human brain is the best model in solving
practical problems in the real world.
Our artificial brain system OfficeMate
incorporates many current findings of information
processing mechanisms in human brain.
New and challenging research areas are waiting
for our attentions!
Thank youAny research idea or comments are
welcomed!youngik_at_kaist.ac.kr

Write a Comment

User Comments (0)