Title: Interaction
1Interaction Speech and Pen
2Natural input
- Universal design
- Take advantage of familiarity, existing knowledge
- Alternative input output
- Multi-modal interfaces
- Getting off the desktop
3Speech dialogue
- Why use it?
- Hands busy
- Mobility required
- Eyes occupied
- Conditions preclude use of keyboard
- Visual impairment
- Physical limitation
4Speech Input
- Speaker recognition
- Tell which person it is (voice print)
- Monitoring, recording
- Speech recognition
- Identify words
- IBM ViaVoice, Dragon Dictate, ...
- Natural language understanding
- (does not necessarily involve audio)
5Recognition Dimensions
- Speaker dependent/independent
- Parametric patterns are sensitive to speaker
- With training (dependent) can get better
- Vocabulary
- Some have 50,000 words
- Isolated word vs. continuous speech
- Continuous where words stop begin
- Typically a pattern match, no context used
Did youvs. Didja
6Recognition Example
- My flight experience uh,
- I started off as a private
- pilot and then I spent 20
- years flying as a navigator
- on C130s in the air force
- and I currently fly as a
- first officer on the SAAB
- 340 turboprop.
- my flight experience a and
- I started off as a private
- pilot and I spent 20
- years flying is a navigator
- on C-130s in the Air Force
- and I currently fly as a
- first officer on the sound
- 340 turboprop.
7And another example
- Here here's one of the
- thoughts I'd throw in on
- that. I'd agree with
- Howard that the
- combination of checklists
- and flows is the way way
- to manage things best.
- hair and here's one of the
- outside troodon and
- an eye to agree with
- power that the
- domination checklists
- and flows is the way
- that managed things best
8Errors
- Systems make four types of errors
- Substitution - one for another
- Rejection - detected, but not recognized
- Insertion - added
- Deletion - not detected
- So how do you recover from an error?
9Natural Language Understanding
- Putting meaning to the words
- Input might be spoken or typed
- Holy grail of Artificial Intelligence problems
10Eliza - a Faked Understanding
- Person Men are all alike.ELIZA In what
way?Person They're always bugging us about
something or other.ELIZA Can you think of a
specific example?Person Well, my boyfriend
made me come here.ELIZA Your boyfriend made
you come here?Person He says I'm depressed
much of the time.ELIZA I am sorry to hear that
you are depressed.Person It's true. I am
unhappy.ELIZA Do you think coming here will
help you not to be unhappy?Person I need some
help, that much seems certain.ELIZA What would
it mean to you if you got some help?Person
Perhaps I could learn to get along with my
mother.ELIZA Tell me more about your family - http//www-ai.ijs.si/eliza/eliza.html
- Weizenbaum, J., "ELIZA -- A computer program for
the study of natural language communication
between man and machine", Communications of the
ACM 9(1)36-45, 1966
11NL Factors/Terms
- Syntactic
- Grammar or structure
- Prosodic
- Inflection, stress, pitch, timing
- Pragmatic
- Situated context of utterance, location, time
- Semantic
- Meaning of words
12SR/NLU Issues
- Advantages
- Easy to learn and remember
- Powerful
- Fast, efficient (not always)
- Little screen real estate
- Disadvantages
- Assumes domain knowledge
- Doesnt work well enough yet
- Requires confirmation
- And recognition will always be error-prone
- Expensive to implement
- Unrealistic expectations can generate mistrust
13Speech Output
- Tradeoffs in speed, naturalness and
understandability - Male or female voice?
- Technical issues (freq. response of phone)
- User preference (depends on the application)
- Rate of speech
- Technically up to 550 wpm!
- Depends on listener
- Synthesized or Pre-recorded?
- Synthesized Better coverage, flexibility
- Recorded Better quality, acceptance
14Speech Output
- Synthesis
- Quality depends on software ()
- Influence of vocabulary and phrase choices
- http//www.research.att.com/ttsweb/tts/demo.phpt
op - Recorded segments
- Store tones, then put them together
- The transitions are difficult (e.g., numbers)
15Designing Speech Interaction
- Constrain vocabulary
- Limit valid commands
- Structure questions wisely (Yes/No)
- Manage the interaction
- Examples?
- Slow speech rate, but concise phrases
- Design for failsafe error recovery
- Visual record of input/output
- Design for the user Wizard of Oz
16Speech Tools/Toolkits
- Java Speech SDK
- FreeTTS 1.1.1 http//freetts.sourceforge.net/docs/
index.php - IBM JavaBeans for speech
- Microsoft speech SDK (Visual Basic, etc.)
- OS capabilities (speech recognition and synthesis
built in to OS) (TextEdit) - VoiceXML
17General Issues Speech/NL
- Who is in control - user or computer
- Initial training required
- Learning time to become proficient
- Speed of use
- Generality/flexibility/power
- Special skills - typing
- Screen space required
- Computational resources required
18Non-speech audio
- Good for indicating changes, since we ignore
continuous sounds - Traditionally used for warnings, alarms or status
information - Provides secondary representation
- Supports visual interface
- Provides information that helps reduce error
- Tradeoff in using natural (real) sounds vs.
synthesized noises.
19Non-speech audio examples
- Error ding
- Info beep
- Email arriving ding
- Recycle
- Battery critical
- Logoff
- Logon
- Others?
20Pen, Touch, Mobile interaction
21Pen, Touch, Mobile dialog
- Stylus or finger
- Tradeoffs of each?
- Pen as a standard mouse (doubleclick?)
- Variety of platforms
- Desktop touch screens or input pads (Wacom)
- Tablet PCs
- Handheld and Mobile devices
- Electronic whiteboards
- Platforms often involve variety of size and other
constraints
22Mobile devices
- More common as more platforms available
- PDA
- Cell phone
- Ultra mobile tablets
- GPS
- Smaller display (160x160), (320x240)
- Few buttons, different interactions
- Free-form ink
- Soft keyboard
- Numeric keyboard gt text
- Stroke recognition
- Hand printing / writing recognition
23http//www.blackberry.com/
http//www.oqo.com/
24Soft Keyboard
- Presents a small diagram of keyboard
- You click on buttons/keys with pen
- QWERTY vs. alphabetical
- Tradeoffs?
- Alternatives?
25Numeric Keypad
- You press out letters of your word, it matches
the most likely word, then gives optional choices - Faster than multiple presses per key
- Used in mobile phones
- http//www.t9.com/
26Other pen text input
- Graffiti Palm mobile devices
- Unistroke recognition
- Experimental
- Cirrin
- World level unistroke
- Quickwriting
- Harder to learn than graffiti
27Hand Printing / Writing Recognition
- Recognizing letters and numbers and special
symbols - Lots of systems (commercial too)
- English, kanji, etc.
- Not perfect, but people arent either!
- People - 96 handprinted single characters
- Computer - gt97 is really good
28Recognition Issues
- Boxed vs. Free-Form input
- Sometimes encounter boxes on forms
- Printed vs. Cursive
- Cursive is much more difficult
- Letters vs. Words
- Cursive is easier to do in words vs individual
letters, as words create more context - Usually requires existence of a dictionary
- Real-time vs. off-line
29Pen Gesture Commands
- Might mean delete
- Insert
- Paragraph
Define a series of (hopefully) simple drawing
gesturesthat mean different commands in a system
30Pen Use Modes
- Often, want a mix of free-form drawing and
special commands - How does user switch modes?
- Mode icon on screen
- Button on pen
- Button on device
31Error Correction
- Having to correct errors can slow input
tremendously - Strategies
- Erase and try again (repetition)
- When uncertain, system shows list of best guesses
(n-best list) - Others??
32Free-form Ink
- Ink is the data, take as is
- Human is responsible forunderstanding
andinterpretation - Often time-stamped
- Applications
- Signature verification
- Notetaking
- Electronic whiteboards
- Sketching
33Electronic whiteboards
- Smartboard and Mimio
- Can integrate with projection
- Large surface to interact with
- Issues?
http//www.mimio.com/
http//www.smarttech.com/
34Touch tables
- Which techniques might be similar to smaller
touchscreens? - Which would differ?
- How similar and different from interactive white
boards?
Microsoft Surface
35Real paper
- Anoto digital paper and pen technology
(http//www.anoto.com/) - Other pens available
- Issues?
http//www.logitech.com/
http//www.epos-ps.com/
36General Issues Pen input
- Who is in control - user or computer
- Initial training required
- Learning time to become proficient
- Speed of use
- Generality/flexibility/power
- Special skills - typing
- Screen space required
- Computational resources required
37Other interesting interactions
- Gesture input
- Wii
- Lots of other specialized hardware for tracking
- 3D interaction
- Stereoscopic displays
- Virtual reality
- Immersive displays such as glasses, caves
- Augmented reality
- Head trackers and vision based tracking
- Tangible interaction
- Use physical objects to express input