Title: Understanding Human Behavior from Sensor Data
1Understanding Human Behavior from Sensor Data
- Henry Kautz
- University of Washington
2Trends
3Growing Ubiquitous Sensing Infrastructure
- GPS
- Wi-Fi localization
- RFID tags
- Wearable sensors
4Advances in Artificial Intelligence
- Graphical models
- Particle filtering
- Belief propagation
- Statistical relational learning
5Crisis in Caring for the Cognitively Disabled
- Epidemic of Alzheimers
- Community integration of 7.5 million citizens
with MR - 100,000 _at_ year disabled by TBI
- Post-traumatic stress syndrome
- Caregiver burnout
6...An Opportunity
- Create methods for modeling and interpreting
human behavior from sensor data - In order to develop assistive technologies to
support independent living by people with
cognitive disabilities - Help people perform activities of daily living
- Monitor behavior to prevent health crises
7Outline
- Learning and reasoning about transportation
routines - ACCESS personal guidance system
- Understanding activities of daily living
- CARE monitoring system
- Further directions
8ACCESS
Assisted Cognition in Community, Employment,
Social Settings
9Motivation Community Access for the Cognitively
Disabled
10The Problem
- Using public transit is cognitively challenging
- Learning bus routes and numbers
- Transfers
- Recovering from mistakes
- Point to point shuttle service impractical
- Slow
- Expensive
- Current GPS units hard to use
- Require extensive user input
- Error-prone near buildings, inside buses
- No help with transfers, timing
11Goal
- A personal guidance system that
- Requires no explicit programming by user or
caregiver - Proactively assists user in completing
transportation plans - Recognizes user errors, and helps user recover
12Technical Problem
- Given a data stream from a wearable GPS unit...
- Infer the users location and mode of
transportation (foot, car, bus, bike, ...) - Predict the users destination
- Detect user errors
13GPS Receivers
- GeoStats GPS logger
- Data capacity 3 weeks _at_ 0.5 second frequency
- Battery life 72 hours
- Nokia 6600 Cell Phone
- Java
- Bluetooth GPS unit
- Internet connectivity for off-board processing
- Battery life 8 hours
14GIS Data
- Street map
- Census 2000
- Bus routes stops
- Seattle Metro
- Business / residential areas
- MapPoint
15Challenges of GPS Data
- Many dead zones in urban areas
- Sparse measurements inside cars and buses
- Systematic error ? 10 meters
- Multi-path propagation
- Mapping errors
16Representation
- Map is a directed graph G(V,E)
- Location
- Edge (block)
- Distance along edge
- Actual GPS reading
- Movement
- Mode foot, car, bus, indoors influences
velocity - Change edges at intersections
- Change mode at bus stops, parked car, buildings
- Tracking (filtering) Given some prior estimate,
- Update position mode according to motion model
- Correct according to next GPS reading
17Motion Model for Mode
18Dynamic Bayesian Network I
mk-1
mk
Transportation mode
xk-1
xk
Edge, velocity, position
qk-1
qk
Data (edge) association
zk-1
zk
GPS reading
Time k
Time k-1
19Rao-Blackwellised Particle Filtering
- Particle filtering
- Evolve approximation to state distribution using
samples (particles) - Supports multi-modal distributions and discrete
variables (mode, edge) - Rao-Blackwellisation
- Particles include distributions over variables
- Each particle is a Kalman filter (Gaussian along
edge) - Improved accuracy with fewer particles
20Learning to Predict User
- Prior knowledge general constraints on
transportation use - Vehicle speed range
- Bus stops
- Learning specialize model to particular user
- 30 days GPS readings of one user, logged every
second - Unlabeled data
- Learn edge and mode transition parameters using
expectation-maximization (EM)
21Mode Location Tracking
Measurements Projections Bus mode Car mode Foot
mode
Green Red Blue
22Predictive Accuracy
How to improve the models predictive power?
Probability of correctly predicting the future
5 blocks 50 accuracy
City Blocks
23Transportation Routines
B
A
Workplace
Home
- Goal intended destination
- Workplace, home, friends, restaurants,
- Trip segments ltstart, end, modegt
- Home to Bus stop A on Foot
- Bus stop A to Bus stop B on Bus
- Bus stop B to workplace on Foot
24Dynamic Bayesian Net II
gk-1
gk
Goal
tk-1
tk
Trip segment
mk-1
mk
Transportation mode
xk-1
xk
Edge, velocity, position
qk-1
qk
Data (edge) association
zk-1
zk
GPS reading
Time k
Time k-1
25Unsupervised Hierarchical Learning
- Use previous model to infer
- Goals - locations where user stays for
longperiods of time - Transition points - locations with high mode
transition probability - Trip segments paths connecting transition
points or goals - Perform EM learning on the hierarchical model
- Learn transition parameters
- Between goals
- Between trip segments, given the goal
- Between edges modes, given the trip segment
26High Probability Trip SegmentsConditioned on Goal
Goal Workplace
Goal Home
27Predicting Goal and Path
Predicted goal Predicted path
28Improvement in Predictive Accuracy
45 blocks 50 accuracy
29Detecting User Errors
- Learned model represents typical correct behavior
- Model is a poor fit to user errors
- We can use this fact to detect errors!
- Cognitive Mode
- Normal model functions as before
- Error switch in prior (untrained) parameters for
mode and edge transition
30Dynamic Bayesian Net III
Cognitive mode normal, error
Goal
Trip segment
Transportation mode
Edge, velocity, position
Data (edge) association
GPS reading
31Goal Clamping
- The users goal may be explicitly known
- Ask the user to confirm highest-probability goal
- Appointment calendar
- Incorporating such information clamps the goal
- Distinguishes novelty from errors
32Detecting User Errors
Untrained Trained
Clamped
33ACCESS Prototype
- Cell phone with GPS, camera, high-speed internet
access - Prompts when it infers that user is
34ACCESS Prototype
- Cell phone with GPS, camera, high-speed internet
access - Prompts when it infers that user is
- About to begin a transportation plan
- Confirm destination?
- Here is yourroute!
35ACCESS Prototype
- Cell phone with GPS, camera, high-speed internet
access - Prompts when it infers that user is
- About to change mode
- This is your bus!
- Your stop is next!
36ACCESS Prototype
- Cell phone with GPS, camera, high-speed internet
access - Prompts when it infers that user is
- Making an error
- You missed your stop!
- Here is how to getback on track
37ACCESS Prototype
- Cell phone with GPS, camera, high-speed internet
access - Prompts when it infers that user is
- Visiting a new destination
- Please take a picture!
38Status
- Medical partnerships
- Funding by National Institute of Disability
Rehabilitation Research (NIDRR) - Partnership with UW Center for Technology and
Disability Studies - User caregiver needs studies (TBI MR)
- Data collection by job coaches
- Extension to indoor navigation
- Hospitals, nursing homes, assisted care
communities - Wi-Fi localization
- Multi-modal interface
- Speech, graphics, text
- Guidance strategies
- Proactive / Just in time
- Coordinates / Landmarks
- WOZ design study
39WOZ
40Papers
- Patterson, Liao, Fox, Kautz, UBICOMP 2003
- Inferring High Level Behavior from Low Level
Sensors - Patterson et al, UBICOMP-2004
- Opportunity Knocks a System to Provide Cognitive
Assistance with Transportation Services - Liao, Fox, Kautz, AAAI 2004 (Best Paper)
- Learning and Inferring Transportation Routines
41CARE
Cognitive Assistance in Real-world Environments
42Goal
- A home monitoring system that
- Assists user in performing activities of daily
living - Tracks activities, and provides prompts and
warnings as needed - Can be deployed in an ordinary home
- Does not require the user to learn a different
way to perform the activities the system
adapts, not the user
43Short-Term Application
- Accurate, automated ADL logs
- Changes in routine often precursor to illness,
accidents - Human monitoring intrusive inaccurate
Image Courtesy Intel Research
44Technical Requirements
- Sensor hardware that can be practically deployed
in a ordinary home - Methods for activity tracking from sensor data
- Methods for automated prompting that consider
- Probability of user errors
- Probability of system errors
- Cost / benefit tradeoffs
45Object-Based Activity Recognition
- Activities of daily living involve the
manipulation of many physical objects - Kitchen stove, pans, dishes,
- Bathroom toothbrush, shampoo, towel,
- Bedroom linen, dresser, clock, clothing,
- We can recognize activities from a time-sequence
of object touches
46Sensing Object Manipulation
- RFID Radio-frequency identification tags
- Small
- Semi-passive
- Durable
- Cheap
- Near future use products own tags
47Wearable RFID Readers
- Designed by Intel Research Seattle
- Will be shared with other Intel partners later
this year - 13.56MHz reader, radio, power supply, antenna
- 12 inch range, 12-150 hour lifetime
48Experiment Morning Activities
- 10 days of data from the morning routine in an
experimenters home - 61 tagged objects
- 11 activities
- Often interleaved and interrupted
- Many shared objects
49Methodology
- Goal simplest model that can robustly track
activities - Comparison
- Hidden Markov Model
- Dynamic Bayesian Network with aggregate features
- DBN with aggregation and abstraction smoothing
50Hidden Markov Model
- Trained on labeled data
- 10-fold cross validation
- 88 accuracy
- 9.4 errors per 20 minute episode
51Cause of Errors
- Observations were types of objects
- Spoon, plate, fork
- Typical errors confusion between activities
- Using one object repeatedly
- Using different objects of same type
- Critical distinction in many ADLs
- Eating versus setting table
- Dressing versus putting away laundry
52Aggregate Features
- HMM with individual object observations fails
- No generalization!
- Solution add aggregate variables
- Bit-vector maintains history of objects touched
- Aggregate distribution nodes sum number of
distinct instances - Aggregate nodes treated as pseudo-observations
when an activity transitions - DBN encoding avoids explosion of HMM
53DBN with Aggregation
- Average number of errors reduced from 9.4 to 6.5
(31) - Deterministic nodes add minimal computational
overhead
54Improving Robustness
- Both HMM and DBN fail if novel (but reasonable)
objects are used - Solution smooth parameters over abstraction
hierarchy of object types
55(No Transcript)
56Abstraction Smoothing
- Methodology
- Train on 10 days data
- Test where one activity substitutes one object
- Change in error rate
- Without smoothing 26 increase
- With smoothing 1 increase
57Experiment ADL Form Filling
- Tagged real home with 108 tags
- 14 subjects each performed 12 of 14 ADLs in
arbitrary order - Used glove-based reader
- Given trace, recreate activities
58Results Detecting ADLs
RFID
Inferring ADLs from Interactions with
Objects Philipose, Fishkin, Perkowitz, Patterson,
Hähnel, Fox, and Kautz IEEE Pervasive Computing,
4(3), 2004
59Summary CARE
- Activities of daily living can be learned and
robustly tracked using RFID tag data - Simple, direct sensors can often replace (or
augment) general machine vision - Works for essentially all ADLs defined in
healthcare literature
60Next Steps
61Key Next Problems
- Decision-theoretic control of user interfaces
- Prompts helpful or distracting?
- User error, or user model error?
- Context-dependent costs
- Decision-theoretic natural language processing
62Key Next Steps
- Measures of effectiveness
- ACCESS user studies with potential clients
(controlled conditions) this spring - CARE Intel / UW / Wash Dept Social Services
developing trial for caregiver evaluation - Goal Long-term deployment in naturalistic
settings - Homes, nursing homes, assisted care facilities
63Future Research
- Physiological sensors
- Heart rate, respiration, temperature
- Heterogeneous sensors
- Environmental wearables machine vision
- Smart homes
- Systems for improving self-awareness
- Emotional self-regulation
- Social pragmatics
- Target populations
- Autism spectrum disorders
- Traumatic brain injury
64General Architecture
common-sense knowledge
decision making
user profile
physical behavior
userinterface
caregiveralerts
machinelearning
sensors
65Conclusion Why Now?
- An early goal of AI was to create programs that
could understand ordinary human experience - This goal proved elusive
- Missing tools for probabilistic inference
- Systems not grounded in real world
- Lacked compelling purpose
- Today we have the mathematical tools, the
sensors, and the motivation
66Other Research
67Building Social Network Models from Sensor
DataNSF Human Social Dynamics
Coded Database
codeidentifier
real-time audio feature extraction
audiofeatures
WiFistrength
68Modal Markov LogicApplied to Dialog
UnderstandingDARPA CALO
ASK_IF(S, H, P) ? B(S, B(H,P) v B(H,P))
69Planning as SatisfiabilityNSF Intelligent Systems
- Unless PNP, no polynomial time algorithm for SAT
- But great practical progress in recent years
- 1980 100 variable problems
- 2005 100,000 variable problems
- Can we use SAT as a engine for planning?
- 1996 competitive with state of the art
- ICAPS 2004 Planning Competition 1st prize,
optimal STRIPS planning - Inspired research on bounded model-checking
70Efficient Model CountingNSF Intelligent Systems
- SAT can a formula be satisfied?
- SAT how many ways can a formula be satisfied?
- Compact translation discrete Bayesian networks ?
SAT - Efficient model counting (Sang, Beame, Kautz
2004, 2005) - Formula caching
- Component analysis
- New branching heuristics
- Cachet fastest modelcounting algorithm
71Credits
- Graduate students
- Don Patterson, Lin Liao, Ashish Sabharwal,
Yongshao Ruan, Tian Sang, Harlan Hile, Alan Liu,
Bill Pentney, Brian Ferris - Colleagues
- Dieter Fox, Gaetano Borriello, Dan Weld, Matthai
Philipose, Tanzeem Choudhury - Funders
- NIDRR, Intel, NSF, DARPA