Title: Activity Analysis based on Human Pose Detection
1Activity Analysis based on Human Pose Detection
Anima Mishra Sharma Y8104001 Mtech, EE Thesis
Supervisors Dr. Amitabha Mukerjee Dr. KS
Venkatesh
2Outline
- Motivation
- Literature Survey
- Challenges
- Brief Overview of pose estimation
- Proposed Method for Activity Analysis
- Results
- Future Work
3Motivation
- Having vast applications
- Video Surveillance
- Tracking
- Activity recognition
- Gait analysis
- Classification
- Segmentation etc.
4Literature Survey
- A Cascade Of Feed-forward Classifier For Fast
Pedestrian Detection, Chen, ACCV 2007 - No articulated bodies, holistic approach
- Detecting Pedestrians by Learning Shapelet
Features, G Mori, CVPR 2007 - Stresses neck-head pattern, non-articulated
- Learning to parse images of articulated bodies,
D Ramanan, NIPS 2006 - Part based, articulated poses
5Datasets
- People dataset
- http//www.ics.uci.edu/dramanan/papers/parse/inde
x.html - Weizmann dataset
- http//www.wisdom.weizmann.ac.il/vision/SpaceTime
Actions.html
6Challenges
- Challenges for pose estimation
- Large variation in appearances
- Variety of clothing, skin color, shape
- Different illumination conditions
- Variation of posture
- Occlusions
- Challenges for activity analysis
- Variable action duration
- Variable actor speed and stride
7Brief Overview
- Learning to parse images of articulated bodies,
D Ramanan, NIPS 2006
8Images of articulated bodies
- Pose estimation from a static image
- Iterative parsing process Sequentially learns
better and better features tuned to a particular
image, simultaneously estimates pose of the
articulated object - No skin or face detection
- No explicit foreground segmentation
- Employs part based model
9Person Model
Tree structure of person model
Deformable model
- Model is parameterized by probability
distributions capturing geometric arrangements of
parts and local part appearances
10Iterative algorithm
- Match an edge based deformable model to the image
to obtain soft estimates of body part locations - Use estimated body part position to build a rough
region model for each body part and background - Soft estimates of body from the new model are
then used to build new region models
11Part Model
First Row Conditional Random Field Model (Edge
based template) Second Row Conditional Random
Field Model (Region based template)
12Figure1
Build a deformable pose model based on edges
13Figure2
Use the initial parse from the previous example
to build a region model for each part
14Figure3
15Activity Analysis
- Extending Learning to parse images of
articulated bodies to Videos
16Proposed Method
- Obtain foreground information
- Estimate pose for each frame
- Regularize noisy pose estimates
- Use temporal collection of centroid and
orientation of parts as raw features - Convert raw features into equal length feature
vectors - Train a classifier by feeding these feature
vectors
17Propose Method
18Pose Estimation
- Assumptions
- Video clips are captured by a static camera
- Extracted foregrounds are available
- The pose estimates are accurate
- Tools
- Dev Ramanans MATLAB code for pose
- Improved version for more iterations and color
coded part boxes
19Convergence of iterations
Normalized difference in consecutive parses
Number of iterations
20Pose Regularization
- Videos have advantage of presence of previous and
next frames - Filter temporally to correct noisy locations
across the frames - Constrain resulting boxes to be rectangular
21Feature Extraction
- Joint locations are difficult to calculate
- Obtain centre of box, and its orientation
- Stack centers and orientations temporally
- Obtain equal length feature vectors
22Results Pose Estimation
23Results Pose Estimation
24Results Pose Estimation
25Results Pose Estimation
26Results Video eli_jump
27Results Video eli_jack
28Results Video eli_run
29Results Video eli_walk
30Future Work
- Improve pose estimation
- Generating pose based action features
- Training classifier for supervised learning
- Clustering for unsupervised learning
31Thank You