Title: Behaviors and Layers in Spoken Dialogue Systems
1Behaviors and Layers in Spoken Dialogue Systems
- Pontus Wärnestål
- Linköping University
- Speech Technology, KTH
- Jan 21, 2005
2Some Problems with Dialogue Systems
hard to design and build
low or no re-use
Practical Engineering Issues
complex architectures
symbol- grounding problem
Usability and Acceptance issues
Theoretical Issues
lack of harmony between theory and
architectures/models
lacking implementations of interaction-level
phenomena
long processing and delays
sense-think-act model fallacy
lack of robustness
3Solution?
- There might be
- theoretical,
- practical, and
- usability-related
- findings triggered by a behavior-based approach
to dialogue system construction
4A Glance at AI/Robotics Research
- Reactivity
- Autonomous adaptation to dynamic (non-discrete)
real-world problems - Less top-down planning and representation
- More low-level behaviors emergence
- The behavior-based paradigm
- Resolved similar/analogous problems...but in
limited scope
5Behaviors and Layers in Dialogue Systems
- Towards more realistic dialogue systems?
- Natural interaction ? higher usability
- Behavior design ? less complex development
- Sounder cognitive theory, and taking
psycholinguistic and communication theory into
account - So, what do we mean by layers and behaviors?
6Layers in Software Engineering
- e.g. TCP/IP stack, operating systems
- Three main benefits (Garlan Shaw, 1994)
- design on different levels of abstraction
- enhancement/scalability/maintenance
- reusable layer interface standards
7Layers as Behaviors
- The Subsumption architecture (Brooks, 1991)
- Low-level specification
- Emergent complex and adaptive behavior
- Mainly used for robots to avoid unknown obstacles
- Avoid obstacles ? dialogue
- a big leap
- what about top-down planning?
8Subsumption Architecture
9Reactive, Process Control and Content Layers
10Benefits
- Gestures and body language (including facial
gestures) are integrated with the communication
content, without artificial communication
protocols - Behaviors run concurrently at the expected times
and without unnatural delays (reactive layer in
the range of 150-500 ms) - Miscommunication and speech overlaps are handled
in a natural way, by using stops and restarts.
11Interaction and Content Layers
- Separates interaction-level phenomena from
content and context management (Lemon et al.,
2003) - Interaction-level phenomena
- realistic timing and turn-taking
- immediate grounding
- more realistic (continuous) feedback
- barge-in management
- NP selection (anaphora management).
12Language Engineering Conclusions
- Reuse
- Scalability and maintenance
- Robustness
- Design methodology
- interaction-level phenomena seem easier to handle
with low-level reactivity á la behavior-based
design
13Usability Conclusions
- Stress interaction and communication natural
dialogue - Increase dialogue system usability with better
timing, continuous feedback, grounding,
14Theoretical Conclusions
- Towards a theory of intelligence
- Incorporating Communication theory
- somewhat ignored in the classical approach(?)