Sequence Models in Modern AI

Slides: 45

Provided by: ginal5

Learn more at: https://www.classes.cs.uchicago.edu

Transcript and Presenter's Notes

1
Sequence Models in Modern AI

Probabilistic sequence models
HMMs, N-grams
Train from available data
Classification with contextual influence
Robust to noise/variability
E.g. Sentences vary in degrees of acceptability
Provides ranking of sequence quality
Exploits large scale data, storage, memory, CPU
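
As a minimal sketch of how an N-gram model trained from data can rank sequences, the following bigram scorer uses add-one smoothing. The toy corpus, the smoothing choice, and the names `train_bigram` and `log_prob` are illustrative assumptions, not part of the slides.

```python
import math
from collections import Counter

def train_bigram(sentences):
    """Count context words and bigrams over sentences padded with boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for sent in sentences:
        tokens = ["<s>"] + sent.split() + ["</s>"]
        unigrams.update(tokens[:-1])              # every token that acts as a context
        bigrams.update(zip(tokens, tokens[1:]))   # adjacent pairs
    vocab = {w for s in sentences for w in s.split()} | {"</s>"}
    return unigrams, bigrams, vocab

def log_prob(sentence, model):
    """Add-one smoothed log probability of a sentence under the bigram model."""
    unigrams, bigrams, vocab = model
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    v = len(vocab)
    return sum(
        math.log((bigrams[(a, b)] + 1) / (unigrams[a] + v))
        for a, b in zip(tokens, tokens[1:])
    )

# Hypothetical toy corpus; a real model would be trained on large-scale data.
corpus = ["the dog barks", "the cat sleeps", "a dog sleeps"]
model = train_bigram(corpus)
candidates = ["the dog sleeps", "sleeps dog the"]
ranked = sorted(candidates, key=lambda s: log_prob(s, model), reverse=True)
```

Neither candidate appears in the corpus, yet the model still assigns each a score, so the ranking reflects degrees of acceptability rather than a yes/no judgment.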

2
Computer Vision

CMSC 25000
Artificial Intelligence
March 1, 2007

3
Roadmap

Motivation
Computer vision applications
Is a Picture worth a thousand words?
Low-level features
Feature extraction: intensity, color
High-level features
Top-down constraints: shape from stereo, motion, ...
Case study: vision as modern AI
Fast, robust face detection (Viola & Jones 2002)

4
Perception

From observation to facts about world
Analogous to speech recognition
Stimulus (Percept) S, World W
S = g(W)
Recognition: derive world from percept
W = g^{-1}(S)
Is this possible?

5
Key Perception Problem

Massive ambiguity
Optical illusions
Occlusion
Depth perception
Objects are closer than they appear
Is it full-sized or a miniature model?

6
Image Ambiguity
7
Handling Uncertainty

Identify single perfect correct solution
Impossible!
Noise, ambiguity, complexity
Solution
Probabilistic model
P(W|S) = α P(S|W) P(W)
Maximize image probability and model probability
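
The rule above can be worked through on a small example. This is a sketch under stated assumptions: the two candidate worlds echo the "full-sized or miniature model" ambiguity, and every probability below is made up for illustration.

```python
def posterior(prior, likelihood, percept):
    """Return P(W | S=percept) = alpha * P(S=percept | W) * P(W) for each world W."""
    unnormalized = {w: likelihood[w][percept] * p for w, p in prior.items()}
    alpha = 1.0 / sum(unnormalized.values())   # normalizing constant
    return {w: alpha * u for w, u in unnormalized.items()}

# Hypothetical model probability P(W): most cars we see are full-sized.
prior = {"full_size_car": 0.9, "miniature_model": 0.1}

# Hypothetical image probability P(S | W) for two possible percepts.
likelihood = {
    "full_size_car": {"small_in_image": 0.3, "large_in_image": 0.7},
    "miniature_model": {"small_in_image": 0.8, "large_in_image": 0.2},
}

post = posterior(prior, likelihood, "small_in_image")
best = max(post, key=post.get)   # world maximizing P(S | W) * P(W)
```

With these numbers the percept alone favors the miniature, but the prior outweighs it, so the full-sized car remains the most probable world.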

8
Handling Complexity

Don't solve the whole problem
Don't recover every object/position/color
Solve restricted problem
Find all the faces
Recognize a person
Align two images

9
Modern Computer Vision Applications

Face / Object detection
Medical image registration
Face recognition
Object tracking

10
Vision Subsystems
11
Image Formation
12
Images and Representations

Initially pixel images
Image as NxM matrix of pixel values
Alternate image codings
Grey-scale intensity values
Color encoding: intensities of RGB values
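
A small sketch of the two codings: an image as a matrix of RGB triples, converted to a matrix of grey-scale intensities. The weights are the common ITU-R BT.601 luma coefficients, which is an assumption here since the slides do not specify a conversion.

```python
def to_grayscale(rgb_image):
    """Convert an N x M image of (R, G, B) tuples to an N x M matrix of intensities."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_image
    ]

# A 2 x 2 color image: red, green, blue, and white pixels (values 0..255).
rgb = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (255, 255, 255)],
]
gray = to_grayscale(rgb)
```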

13
Images
14
Grey-scale Images
15
Color Images
16
Image Features

Grey-scale and color intensities
Directly access image signal values
Large number of measures
Possibly noisy
Only care about intensities as cues to world
Image Features
Mid-level representation
Extract from raw intensities
Capture elements of interest for image understanding

17
Edge Detection
18
Edge Detection

Find sharp demarcations in intensity
1) Apply spatially oriented filters
E.g. vertical, horizontal, diagonal
2) Label above-threshold pixels with edge orientation
3) Combine edge segments with same orientation into a line
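
Steps 1 and 2 can be sketched as follows. The Sobel kernels are one common choice of oriented filter and are an assumption here, as are the threshold value and the "V"/"H"/"." labels; step 3 (linking segments into lines) is not shown.

```python
def convolve3x3(image, kernel):
    """Apply a 3x3 filter (cross-correlation) to interior pixels; borders stay 0."""
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            out[y][x] = sum(
                kernel[j][i] * image[y + j - 1][x + i - 1]
                for j in range(3) for i in range(3)
            )
    return out

SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]   # responds to vertical edges
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]   # responds to horizontal edges

def label_edges(image, threshold):
    """Label each pixel 'V', 'H', or '.' (no edge) from the stronger filter response."""
    gx = convolve3x3(image, SOBEL_X)
    gy = convolve3x3(image, SOBEL_Y)
    labels = []
    for row_x, row_y in zip(gx, gy):
        row = []
        for dx, dy in zip(row_x, row_y):
            if max(abs(dx), abs(dy)) <= threshold:
                row.append(".")          # below threshold: not an edge
            elif abs(dx) >= abs(dy):
                row.append("V")          # intensity changes left-to-right
            else:
                row.append("H")          # intensity changes top-to-bottom
        labels.append(row)
    return labels

# Dark left half, bright right half: a vertical edge down the middle.
image = [
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [0, 0, 9, 9],
]
labels = label_edges(image, threshold=10)
```

Only interior pixels are filtered in this sketch, so the border rows and columns are always labeled as non-edges.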

19
Top-down Constraints

Goal: extract objects from images
Approach: apply knowledge about how the world works to identify coherent objects, reconstruct 3D

20
Motion: Optical Flow

Find correspondences in sequential images
Units which move together represent objects
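
One simple way to find correspondences between sequential images is block matching: take a patch in the first frame and search nearby positions in the second frame for the best match. This is a sketch of that idea only; the patch size, search range, and sum-of-squared-differences score are assumptions, and practical optical flow methods are considerably more elaborate.

```python
def patch_ssd(f1, f2, y, x, dy, dx, size):
    """Sum of squared differences between a patch in f1 and a shifted patch in f2."""
    return sum(
        (f1[y + j][x + i] - f2[y + dy + j][x + dx + i]) ** 2
        for j in range(size) for i in range(size)
    )

def best_displacement(f1, f2, y, x, size=2, search=1):
    """Return the (dy, dx) within +/-search that best matches the patch at (y, x)."""
    rows, cols = len(f1), len(f1[0])
    candidates = [
        (dy, dx)
        for dy in range(-search, search + 1)
        for dx in range(-search, search + 1)
        # keep only shifts whose patch stays inside the second frame
        if 0 <= y + dy and y + dy + size <= rows
        and 0 <= x + dx and x + dx + size <= cols
    ]
    return min(candidates, key=lambda d: patch_ssd(f1, f2, y, x, d[0], d[1], size))

# A small bright object moves one pixel to the right between frames.
frame1 = [
    [0, 0, 0, 0, 0],
    [0, 5, 7, 0, 0],
    [0, 6, 8, 0, 0],
    [0, 0, 0, 0, 0],
]
frame2 = [
    [0, 0, 0, 0, 0],
    [0, 0, 5, 7, 0],
    [0, 0, 6, 8, 0],
    [0, 0, 0, 0, 0],
]
flow = best_displacement(frame1, frame2, y=1, x=1)
```

Patches that share the same displacement could then be grouped as candidates for a single moving object.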

21
Stereo
22
Stereo Depth Resolution
23
Texture and Shading
24
Edge-Based 2D-to-3D Reconstruction

Assume world of solid polyhedra with 3-edge vertices
Apply Waltz line labeling via constraint satisfaction
25
Basic Object Recognition

Simple idea
extract 3-D shapes from image
match against shape library
Problems
extracting curved surfaces from image
representing shape of extracted object
representing shape and variability of library object classes
improper segmentation, occlusion
unknown illumination, shadows, markings, noise, complexity, etc.
Approaches
index into library by measuring invariant properties of objects
alignment of image feature with projected library object feature
match image against multiple stored views (aspects) of library object
machine learning methods based on image statistics
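
The "multiple stored views" approach can be sketched as a nearest-neighbor lookup: each library object keeps several views, and the query image takes the label of the closest view. The 3x3 binary images, the object labels, and the squared-difference distance are all illustrative assumptions.

```python
def distance(a, b):
    """Sum of squared pixel differences between two equally sized images."""
    return sum((p - q) ** 2 for ra, rb in zip(a, b) for p, q in zip(ra, rb))

def recognize(image, library):
    """Return the label whose closest stored view is nearest to the image."""
    return min(
        library,
        key=lambda label: min(distance(image, view) for view in library[label]),
    )

# Hypothetical shape library: two views per object class.
library = {
    "bar_vertical": [
        [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
        [[1, 0, 0], [1, 0, 0], [1, 0, 0]],
    ],
    "bar_horizontal": [
        [[0, 0, 0], [1, 1, 1], [0, 0, 0]],
        [[1, 1, 1], [0, 0, 0], [0, 0, 0]],
    ],
}

# A noisy vertical bar: the top pixel is displaced by one column.
query = [[0, 0, 1], [0, 1, 0], [0, 1, 0]]
label = recognize(query, library)
```

Raw pixel matching like this is sensitive to exactly the problems listed above (illumination, occlusion, segmentation), which is part of the motivation for invariant features and learned models.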

26
Hand-written Digit Recognition
27
Summary

Vision is hard
Noise, ambiguity, complexity
Prior knowledge is essential to constrain problem
Cohesion of objects, optics, object features
Combine multiple cues
Motion, stereo, shading, texture, ...
Image/object matching
Library: features, lines, edges, etc.
Apply domain knowledge: optics
Apply machine learning: NN, NN, CSP, etc.

28
Computer Vision Case Study

"Rapid Object Detection using a Boosted Cascade of Simple Features", Viola/Jones 01
Challenge
Object detection
Find all faces in arbitrary images
Real-time execution
15 frames per second
Need simple features, classifiers
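
The cited paper builds its simple features from sums of pixels over rectangles, computed quickly from an integral image. The following is a sketch of that idea and not the authors' implementation; the function names, the sign convention of the feature, and the toy image are assumptions.

```python
def integral_image(image):
    """ii[y][x] = sum of image[0..y-1][0..x-1]; padded with a zero row and column."""
    rows, cols = len(image), len(image[0])
    ii = [[0] * (cols + 1) for _ in range(rows + 1)]
    for y in range(rows):
        row_sum = 0
        for x in range(cols):
            row_sum += image[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii

def rect_sum(ii, top, left, height, width):
    """Sum of pixels in a rectangle using four lookups, independent of its size."""
    bottom, right = top + height, left + width
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]

def two_rect_feature(ii, top, left, height, width):
    """Horizontal two-rectangle feature: right-half sum minus left-half sum."""
    half = width // 2
    left_sum = rect_sum(ii, top, left, height, half)
    right_sum = rect_sum(ii, top, left + half, height, half)
    return right_sum - left_sum

# Dark left half, bright right half.
image = [
    [1, 1, 5, 5],
    [1, 1, 5, 5],
]
ii = integral_image(image)
total = rect_sum(ii, 0, 0, 2, 4)
feature = two_rect_feature(ii, 0, 0, 2, 4)
```

Because each rectangle sum costs four array lookups regardless of rectangle size, many such features can be evaluated per window, which is what makes a real-time frame rate plausible.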

29
Rapid Object Detection Overview