Video Epitomes

About This Presentation

Title:

Video Epitomes

Description:

Patch size - large patches to get large structure and nice stitching, small to get details. Use small patches with prior on indices to stitch them together ... – PowerPoint PPT presentation

Number of Views:34

Avg rating:3.0/5.0

Slides: 34

Provided by: vincent61

Category:

more less

Transcript and Presenter's Notes

Title: Video Epitomes

1
Video Epitomes
IEEE CVPR 2005San Diego, CAJune 22, 2005

Vincent Cheung
Probabilistic and Statistical Inference Group
Electrical and Computer Engineering
University of Toronto
Joint Work Brendan J. Frey (U. Toronto)
and Nebojsa Jojic (Microsoft Research)

2
Image Epitome

Jojic, N., Frey, B., Kannan, A. (2003).
Epitomic analysis of appearance and shape. In
Proc. IEEE ICCV.
Miniature, condensed version of the image
Models the images textural components
Applications
object detection
texture segmentation
image retrieval
compression

3
Learning the Epitome
4
Reconstruction with the Epitome

Replace patches in an image / video with patches
from the epitome
Overlapping patches are averaged
Made to agree using a variational parameter

5
Missing Observations Scenarios
6
Shifted Cumulative Sum Algorithm

Compute
Distances between all patches in input video and
all patches in epitome (E-Step)
Sufficient statistics (M-Step)
Use cumulative sums to efficiently perform
computations that is invariant to patch size
Get computations for all patch sizes
simultaneously
Naïve O(Vep)
Convolution/FFT O(Veloge)
SCS O(Ve)

7
Video Super-Resolution

Super-resolve a low-resolution wide-angle video
given a high-resolution zoomed-in shot
Learn the video epitome of the zoomed-in sequence
Use the high-resolution epitome to reconstruct
the low-resolution sequence

1
8
Video Super-Resolution Result (1)
Low Res
Low Res
2
Epitome
9
Video Super-Resolution Result (2)
10
Learning from Videos with Missing Observations (1)

Fill in missing portions of a video
Learn the epitome only on observed values
Initialize the missing data with random values
Iteratively reconstruct, only updating the
missing values

11
Learning from Videos with Missing Observations (2)
12
Video Missing Channels Fill-in

Each RGB color channel for each pixel missing
with 50 probability
We know whichchannels are missing

13
Video Missing Channels Fill-in Result (1)

Learn the epitome only on observed values
Initialize the missing data with random values
Iteratively reconstruct, only updating the
missing values

2,3
1
14
Video Missing Channels Fill-in Result (2)
Epitome Result
Gaussian Filter Result
15
Dropped Frames Recovery

Streaming video with frames dropped
Recovery only using the epitome of the corrupted
video

16
Conclusion

Extended the concept of epitomes to video
sequences
Compactly representing spatial and temporal
features
Efficient algorithm for learning epitomes
Video epitome is a natural representation for
several applications
Super-resolution
Inpainting
Missing channels
Dropped frames
Videos available at

http//www.psi.toronto.edu/vincent/videoepitome.h
tml
17
http//www.psi.toronto.edu/vincent/videoepitome.h
tml
18
References

Cheu05 Cheung, V., Frey, B., and Nebojsa, N.
(2005). Video epitomes. In Proc. IEEE CVPR.
Jojic03 Jojic, N., Frey, B., Kannan, A.
(2003). Epitomic analysis of appearance and
shape. In Proc. IEEE ICCV.
Frey03 Frey, B. Jojic, N. (2003). Advances
in algorithms for inference and learning in
complex probability models. IEEE Trans. PAMI.

19
Future Work

Determining the size of the epitome
Dependent on the complexity of the image / video
Minimum description length
Variational Bayesian
Optimal patch size(s)
Problem specific
Additional transformations into the epitome
Rotation
Scale
Additional video epitome applications
Super-resolution
Layer separation
Object recognition

20
Image Epitome Examples
21
Video Epitome Example
22
Learning the Epitome
23
Image Epitome Issues

Patch size - large patches to get large structure
and nice stitching, small to get details
Use small patches with prior on indices to stitch
them together
Close patches in image should map to close areas
in epitome
Show screen shot of GUI
Early results with index prior
Epitome size
Currently arbitrarily chosen
Learn the size of the epitome
Dependent on problem, but can use a cost function
(MDL approach), Chinese restaurant process

24
Computational Issues

Learning the epitome under this generative model
is computationally expensive
Expectation step
Estimate the posterior
Compute a weighted Euclidean distance between
patches in the image and patches in the epitome
Maximization step
Update the epitome
Collect patch-based statistics using the
posterior
Want to implement these steps efficiently,eg.
reuse computations

25
Shifted Cumulative Sum Algorithm

Require to compute distances between all patches
in the input video with all patches in the
epitome for learning and reconstruction (estimate
the mapping posteriors)
Use cumulative sums to efficiently compute
distances between all-patches that is invariant
to patch size
Similar trick used in M-step to collect
sufficient statistics
Naïve O(Vep)
Convolution/FFT O(Veloge)
SCS O(Ve)

26
Shifted Cumulative Sum Algorithm
27
Shifted Cumulative Sum Algorithm
28
Shifted Cumulative Sum Algorithm
29
Shifted Cumulative Sum Algorithm
30
Collecting Sufficient Statistics
31
Image Missing Channels Fill-in
32
Missing Channels

Generalization of the video inpainting problem
Inpainting
Missing entire pixels
Missing Channels
Missing one or more of the red, green, or blue
(RGB) components of a given pixel
Epitome must consolidate multiple patches
together to piece together the missing channel
information
No training patch contains all the channel
information
Use the epitome to fill-in the missing data

33
Walking Video Epitome

Write a Comment

User Comments (0)