A Presentation Manager Developed with the Communications-Oriented Programming and Routing Environment (CORE)
- Larry Rudolph
- Oxygen Research Group
- Laboratory for Computer Science
Goals
- Integrate Many Oxygen Technologies
- Application driven, one that:
- We understand
- We personally use often
- Should be more human-centric
- Develop Architectural Infrastructure
- Exposes new requirements
Application Scenario
Integration of Technologies
- Speech
- Vision
- Handhelds (H21)
- Search Engine
- Location Manager
- Intentional Names
- Ad-hoc networks
- SFS File System
- Microphone Array
- Camera Array
- Projector Array
- Meta-Glue
- Collaboration
Used in non-standard, challenging ways
Vision / Gesture Recognition
- Laser Pointer
- Great for drawing attention to content
- Audience is primary consumer
- Secondary use to control presentation
- But it is not a mouse
- Semantics are tied to slide context
- Differs from Intelligent-room use
- Small number of identified gestures
- Gestures easily punctuated
- Low computational overhead
- Soon will be handled with an H21 (a gesture-dispatch sketch follows this list)
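As a rough illustration of gesture semantics tied to slide context, here is a minimal Python sketch; the gesture names, slide kinds, and actions are assumptions for illustration, not the system's actual vocabulary.

```python
from typing import Optional

# A minimal sketch of slide-context gesture dispatch, assuming a small,
# easily punctuated gesture vocabulary. Gesture and action names are
# illustrative, not the actual system's.
SLIDE_GESTURE_MAP = {
    "slide-with-video": {"circle": "play_video", "flick_right": "next_slide"},
    "plain-slide":      {"flick_right": "next_slide", "flick_left": "prev_slide"},
}

def dispatch_gesture(slide_kind: str, gesture: str) -> Optional[str]:
    """Map a recognized laser gesture to an action for the current slide.

    Unrecognized gestures are simply ignored: the laser's primary job is
    drawing the audience's attention; control is only a secondary use.
    """
    return SLIDE_GESTURE_MAP.get(slide_kind, {}).get(gesture)

if __name__ == "__main__":
    print(dispatch_gesture("slide-with-video", "circle"))  # play_video
    print(dispatch_gesture("plain-slide", "circle"))       # None -> ignored
```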
Speech Recognition
- Galaxy is geared towards Dialog
- Dialog does not suit a presentation
- A prompt is an alienating distraction
- Navigation commands primarily for audience
- Different Use of Galaxy
- No need for audio feedback
- There is natural feedback.
- No false positives (a thresholding sketch follows this list)
- For dialog, it is better to guess than to ignore
- For us, an incorrect guess has a high cost
- Most words are not relevant to the speech system
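A minimal sketch of the no-false-positives policy, assuming the recognizer reports a confidence score per hypothesis; the threshold, command set, and function are illustrative assumptions rather than Galaxy's actual interface.

```python
from typing import Optional

# Accept an utterance only if it is a known command heard with high
# confidence; otherwise ignore it, since most speech is for the audience.
COMMANDS = {"next slide", "previous slide", "play video"}
CONFIDENCE_THRESHOLD = 0.9   # deliberately high: a wrong guess disrupts the talk

def gate_command(hypothesis: str, confidence: float) -> Optional[str]:
    phrase = hypothesis.lower().strip()
    if phrase in COMMANDS and confidence >= CONFIDENCE_THRESHOLD:
        return phrase
    return None

# A low-confidence "next slide" is ignored rather than guessed at.
assert gate_command("Next slide", 0.95) == "next slide"
assert gate_command("next slide", 0.60) is None
```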
Slide Tracking
- Vary sensitivity, e.g. "Next slide" (sketched after this list)
- Less likely to be meaningful at the start of a slide
- More likely after many words have been said
- Play video only relevant on slides with video
- Recognizer will follow along
- Keep track of what has been said
- Slide-dependent recognition (one domain per slide)
- Multi-level Commands
- Individual Slide Navigation
- Presentation Manager
- Command Manager
- Implemented via multiple recognizers
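A rough sketch of varying command sensitivity by slide context, assuming a per-slide description and a simple word-count heuristic; the field names and weights are assumptions, not the recognizer's real model.

```python
from dataclasses import dataclass

@dataclass
class SlideContext:
    has_video: bool
    expected_word_count: int   # rough length of the spoken commentary
    words_spoken: int = 0      # updated as the recognizer follows along

def command_weight(command: str, ctx: SlideContext) -> float:
    """Scale how willing we are to accept a command on this slide."""
    if command == "play video":
        return 1.0 if ctx.has_video else 0.0        # irrelevant without video
    if command == "next slide":
        # Unlikely right after the slide appears, more likely once most
        # of the slide's material has been said.
        return min(1.0, ctx.words_spoken / ctx.expected_word_count)
    return 0.5                                      # neutral default

ctx = SlideContext(has_video=False, expected_word_count=120, words_spoken=10)
print(command_weight("next slide", ctx))   # low: the slide just started
```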
Three Output Modes
- Speaker view (notes)
- Projection
- Also gets archived
- Just slides shown
- Associate audio/laser/questions with each slide (a record sketch follows this slide)
- Merge personal notes with slides
- Collaboration
- Projector
- Multiple projectors (no shadows!), coming this summer
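A hypothetical shape for the per-slide archive record implied above; the field names are assumptions based on the bullets (audio, laser strokes, questions, notes), not the actual archive format.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

@dataclass
class SlideArchive:
    slide_id: int
    audio_segments: List[str] = field(default_factory=list)   # recorded audio, e.g. file paths
    laser_strokes: List[List[Tuple[float, float, float]]] = field(default_factory=list)  # (x, y, t) points
    questions: List[str] = field(default_factory=list)        # transcribed audience questions
    speaker_notes: str = ""                                    # merged in later

archive = [SlideArchive(slide_id=i) for i in range(1, 25)]
archive[0].questions.append("How is the laser pointer tracked?")
```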
System Architecture Requirements
- Something simple and composable
- Communications-oriented
- Dynamic, rule-based
- Can add commands during run time
- Using ordinary speech
- Compatible with other Oxygen Techs.
- Easy to debug (even by naïve users)
CORE: Communication-Oriented Routing Environment
Larry Bear
CORE Essentials
- Nodes are specified via INS (a matching sketch follows this slide), e.g.:
- Cam: [device=web-cam][location=518]
- PTRvision: [device=process][OS=Linux][File=LaserVision, ...]
[Diagram: Laser and Vision nodes registered with CORE]
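A minimal sketch of INS-style attribute/value specifiers in the spirit of the specs above; representing them as Python dicts and the matches() helper are assumptions, not INS's or CORE's actual interface.

```python
# Node specs as attribute/value pairs, illustrative only.
Cam = {"device": "web-cam", "location": "518"}
PTRvision = {"device": "process", "OS": "Linux", "File": "LaserVision"}

def matches(request: dict, advertisement: dict) -> bool:
    """An advertised node satisfies a request if it carries every
    requested attribute with the requested value."""
    return all(advertisement.get(k) == v for k, v in request.items())

# Any web camera in room 518 satisfies the Cam request.
print(matches({"device": "web-cam"}, Cam))       # True
print(matches({"device": "microphone"}, Cam))    # False
```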
CORE Essentials
- Links are specified with nicknames (a sketch follows this slide), e.g.:
- L_camera,vision = (Cam, PTRvision)
[Diagram: Slide Speech, Presentation Speech, Command Speech, Laser, and Vision nodes connected to CORE]
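A small sketch of a nicknamed link tying two node specs together, mirroring L_camera,vision = (Cam, PTRvision) above; the Link class and its fields are assumptions, not CORE's real API.

```python
from dataclasses import dataclass

@dataclass
class Link:
    nickname: str
    source: dict     # spec of the node that produces messages
    sink: dict       # spec of the node that consumes them

Cam = {"device": "web-cam", "location": "518"}
PTRvision = {"device": "process", "OS": "Linux", "File": "LaserVision"}

L_camera_vision = Link("camera,vision", Cam, PTRvision)
```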
CORE Essentials
- Messages flow over the links
[Diagram: a "Next Slide!" message travels from a speech node toward CORE]
CORE Essentials
- Messages flow over the links
[Diagram: CORE forwards the "Next Slide!" message over the outgoing links]
CORE Essentials
- Messages flow over the links
[Diagram: the command arrives at the receiving nodes as "ADVANCE"]
CORE Essentials
- How do we change the output for questions?
[Diagram: the same configuration of speech, laser, and vision nodes around CORE]
CORE Essentials
[Diagram: a "Question?" message flows from a speech node into CORE]
CORE Essentials
- RULES are (trigger, action) pairs (a sketch follows this slide), e.g.:
- ( MESS=Question , L_slide,lcd → L_slide,qlcd )
[Diagram: a "Questions" message triggers the rule, rerouting slide output to the question display]
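A rough sketch of a (trigger, action) rule that reroutes a link when a Question message arrives, in the spirit of the rule above; all names and the dispatch loop are illustrative assumptions, not CORE's actual rule syntax.

```python
links = {("slide", "lcd")}                 # currently active links

def reroute_for_question() -> None:
    """Action for ( MESS=Question, L_slide,lcd -> L_slide,qlcd ): swap the
    plain projector link for the question-display link."""
    links.discard(("slide", "lcd"))
    links.add(("slide", "qlcd"))

RULES = [("Question", reroute_for_question)]   # (trigger, action) pairs

def deliver(message: str) -> None:
    """Fire every rule whose trigger matches an incoming message."""
    for trigger, action in RULES:
        if message == trigger:
            action()

deliver("Question")
print(links)   # {('slide', 'qlcd')}
```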
Deep Issue 1: Extract I/O Validation
- With numerous input modes, how do we ensure valid input?
- Pull the input specification out of the application
- WEB forms
- Galaxy grammar
- Vision ?
- General form?
- Similar issues for multi-modal output
Deep Issue 2: Extract Connectivity
- How does stuff get connected?
- Automatically
- Within code via URLs or URNs
- By agents
- By some higher-level program
- What is the right language style?
- Where to draw the line?
Deep Issue 3: Debugging
- Eternal Applications
- Environment is constantly changing
- Transient errors are the hardest to find
- Trace logs are not sufficient
- Run backwards as well as forwards (a logging sketch follows this list)
- Modify something and continue
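Reversible CORE appears under Status rather than being described here; purely as an illustration of the idea, this sketch logs every routed message so a run can be replayed forwards or inspected backwards past a transient error. Everything in it is an assumption.

```python
import time
from typing import Any, List, Tuple

LOG: List[Tuple[float, str, Any]] = []

def route(link: str, message: Any) -> None:
    """Record the message before delivering it over the link."""
    LOG.append((time.time(), link, message))
    # ... actual delivery over the link would happen here ...

def history(last_n: int = 10) -> List[Tuple[float, str, Any]]:
    """Step 'backwards': inspect the most recently routed messages."""
    return LOG[-last_n:]
```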
Status
- In Progress
- Reversible CORE
- Slide Tracking
- Dynamic Domains
- Output Modes
- Full Archive
- Fault Tolerance
- Optimizations
- Working
- CORE system
- Multiple Speech Recognizers
- Laser Tracking
- H21 Control
- Dynamic Reconfiguration
Conclusions
- Application interesting on its own
- Speakers → content delivery and control
- Students → archiving and questioning
- CORE language interesting
- Simple language
- Add rules via speech, vision, touch
- Not an all-Java, self-contained world
- Make progress by going backwards
- undo the damage caused by technology!