Title: Course Progress
1. Course Progress
- Image Compression
- Video Compression
- MPEG
- H.261
2. Video Compression
- Development of video compression standards by the International
  Telecommunication Union (ITU) and the Moving Picture Experts Group (MPEG)
- By ITU alone
  - H.261 → H.263 → H.263+, H.263++
- By ITU and MPEG jointly
  - H.262/MPEG-2 → H.264
- By MPEG alone
  - MPEG-1 → MPEG-4
3. MPEG
- MPEG-1
  - The first video compression standard from MPEG, in 1993
  - Compresses video into a sequence of image frames
- MPEG-2
  - Enhanced video compression standard, 1994
- MPEG-4
  - Object-based video compression standard, 1999
  - Compresses video into its composing objects
4. MPEG
- MPEG-7
  - An Extensible Markup Language (XML) based multimedia content
    description standard
- MPEG-21
  - An open multimedia framework standard
5. MPEG
- Two Source Intermediate Formats (SIF)
  - 352×240 pixels/frame, 4:2:0 subsampling, 30 frames/second,
    progressive scan, ~30 Mbps
  - 352×288 pixels/frame, 4:2:0 subsampling, 25 frames/second,
    progressive scan, ~30 Mbps
- MPEG-1 compresses SIF video from a raw data rate of about 30 Mbps
  to about 1.1 Mbps, at VHS VCR quality
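As a quick arithmetic check on the quoted rates, the raw bit rate of the NTSC-derived SIF format can be computed from the slide's parameters. This is a sketch assuming 8 bits per sample; the 1.5 factor comes from 4:2:0 subsampling (one luma sample per pixel plus quarter-resolution Cb and Cr):

```python
# Raw bit rate of 352x240, 30 fps SIF with 4:2:0 subsampling.
# Assumes 8 bits/sample; 4:2:0 carries 1.5 samples per pixel on average.
width, height, fps = 352, 240, 30
samples_per_pixel = 1.5
raw_bps = width * height * samples_per_pixel * 8 * fps
print(raw_bps / 1e6)        # about 30.4 Mbps, matching the slide's ~30 Mbps
print(raw_bps / 1.1e6)      # MPEG-1 compression ratio, roughly 28:1
```

The same arithmetic with 352×288 at 25 fps gives the same ~30 Mbps for the PAL-derived SIF.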
6. MPEG Compression
- Digital storage media and channels
- Asymmetric applications
  - Movies, video-on-demand, education-on-demand, point of sale
  - Compression is done once, at production
  - Frequent use of the decompression process
- Symmetric applications
  - Video mail, video conferencing
  - Similar compression and decompression frequency
7. MPEG-2 Compression
- Asymmetric compression
- Balance between intra-frame and inter-frame coding
- Two basic techniques
  - Inter-frame: block-based motion compensation to reduce temporal
    redundancy
  - Intra-frame: DCT-based compression to reduce spatial redundancy
8. MPEG Structure
- An MPEG stream consists of many Groups of Pictures (GOPs)
- Each GOP consists of 3 types of pictures: I-frames, P-frames, and
  B-frames
- I-frame: intra-coded pictures
  - Starting points for random access
  - Moderately compressed using JPEG-style coding
- P-frame: predicted pictures
  - Coded with reference to a past picture
  - Used as a reference for future predicted pictures
- B-frame: bi-directionally predicted pictures
  - Require past and future references for prediction
  - Highest compression
9. Inter-frame Structure
- The display sequence of frames:
  - P-frames are displayed between I-frames
  - B-frames are inserted between I/P-frames
- Dependency among frames:
  - P-frames are predicted from the previous I/P frame
  - B-frames are interpolated from the previous and future I/P frames
- (Figure: frames 1-10 in display order over time, I B B P B B P B B I,
  with prediction arrows from the reference frames)
10. Sequence of Frames
- Due to the dependency among frames, the order of frames is changed
  for transmission and storage
- Transmission order: each I/P reference frame is sent before the
  B-frames that depend on it
- Example
  - Display sequence: (IBBPBBPBB)(IBBPBBPBB)(IBB…
  - Transmit order: (IPBBPBB)(IBBPBBPBB)(IBB…
  - I:P:B ratio = 1:2:6
- At 25 frames/second, random access is provided within about
  9/25 second, i.e. about 360 msec
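The display-to-transmit reordering above can be sketched in a few lines. This is a simplified model assuming each B-frame needs only the next I/P reference; the function name is illustrative:

```python
def transmit_order(display):
    """Reorder frames from display order to transmit order: each I/P
    reference is sent first, followed by the B-frames that were
    displayed before it but depend on it for prediction."""
    out, pending_b = [], []
    for frame in display:
        if frame == "B":
            pending_b.append(frame)   # must wait for its future reference
        else:                         # I or P: a reference frame
            out.append(frame)
            out.extend(pending_b)
            pending_b.clear()
    return out + pending_b            # leftover B's (stream truncated)

# The slide's example: display (IBBPBBPBB)(I... becomes (IPBBPBB)(I...
print("".join(transmit_order("IBBPBBPBBI")))   # → IPBBPBBIBB
```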
11. I-Frame Encoder
- (Block diagram) I-frame → color space converter → FDCT → quantizer →
  entropy encoder → compressed I-frame
- Similar to JPEG image compression
12. P/B-Frame Encoder
- (Block diagram) P/B-frame → color space converter (RGB→YUV) →
  motion estimator, using the reference frames → error terms →
  FDCT → entropy encoder → compressed frame
13. Motion Estimation
- Motion compensation compensates for inter-frame differences due to
  motion
- Block matching technique: find the best-matching block in the
  previous frame
- Forward prediction
- (Figure: block A in the current frame and its best match in the
  previous frame)
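The block-matching search can be sketched as an exhaustive sum-of-absolute-differences (SAD) search. This is a pure-Python illustration; the function names, block size, and search range are illustrative choices, not values from any standard:

```python
def sad(a, b):
    """Sum of absolute differences between two equal-size blocks."""
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def best_match(prev, block, top, left, search=4):
    """Exhaustive block matching: slide `block` (located at (top, left)
    in the current frame) over a +/-search window in the previous frame
    and return the (dy, dx, cost) with the smallest SAD."""
    n, h, w = len(block), len(prev), len(prev[0])
    best = (0, 0, float("inf"))
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            y, x = top + dy, left + dx
            if 0 <= y <= h - n and 0 <= x <= w - n:
                candidate = [row[x:x + n] for row in prev[y:y + n]]
                cost = sad(block, candidate)
                if cost < best[2]:
                    best = (dy, dx, cost)
    return best

# A block that moved down 2 and right 1 between frames is found exactly:
prev = [[r * 7 + c * 3 for c in range(16)] for r in range(16)]
block = [row[5:13] for row in prev[6:14]]      # content now at (4, 4)
print(best_match(prev, block, 4, 4))           # → (2, 1, 0)
```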
14. Motion Compensated Interpolation
- Interpolate the blocks from the previous and future frames
- Bi-directional prediction
- (Figure: a block in the current frame is interpolated from its best
  matches in the previous and future frames)
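Bi-directional prediction can be sketched as a per-pel average of the two motion-compensated matches. This is a simplification: MPEG also allows pure forward or backward prediction per macroblock, and the helper name is illustrative:

```python
def interpolate(past_block, future_block):
    """Average the motion-compensated blocks from the past and future
    reference frames to predict the B-frame block (rounded average)."""
    return [[(p + f + 1) // 2 for p, f in zip(pr, fr)]
            for pr, fr in zip(past_block, future_block)]

prediction = interpolate([[10, 20], [30, 40]], [[20, 20], [10, 0]])
print(prediction)        # → [[15, 20], [20, 20]]
# The encoder then DCT-codes only the error terms: current - prediction.
```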
15. MPEG Compression Bit Rates
- (Table: typical MPEG-1 and MPEG-2 compression bit rates)
16. MPEG Compression
- Each MPEG video is divided into GOPs of I-, P-, and B-frames
- I-frames are independently compressed like JPEG images; they are
  starting points for random access
- P-frames are compressed using motion estimation with reference to
  previous frames
- B-frames are compressed using interpolation between previous and
  future I/P-frames
- The frames of the same GOP are stored or transmitted in IPB order
17. MPEG-4 System
- System requirements
  - Coded representation of interactive audiovisual scenes
  - Identification of elementary streams
  - Object content information
  - User interaction
- Source: Olivier Avaro, Carsten Herpel, Julien Signès, "MPEG-4 Tutorial"
18. Decoding
- (Figure: MPEG-4 decoding architecture, from the network, e.g. an
  MPEG-2 transport stream, to display and local user interaction)
19. Return Channel
- (Figure: the return channel passes through the DMIF Application
  Interface (DAI) and the TransMux layer back to the network)
20. Representation of Multimedia Content
- Identification of elementary streams: Object Descriptor
- Scene description: BIFS
- Animation of the scene: animation streams
- Object content information: OCI
- Identification of intellectual property
21. Object Descriptor
- Identification of elementary streams
  - Stream types and decoder configuration
  - Stream identification and location
  - Object content description
- Association between streams
  - Coding dependency between streams
  - Clock dependencies between streams
  - Association between scene streams and media streams
22. Binary Format for Scenes (BIFS)
- Composes MPEG-4 media types together
  - 2D/3D, natural/synthetic, audio/video, stored/streamed
  - in the same environment
- BIFS is based on VRML (Virtual Reality Modeling Language)
  - a set of nodes representing the primitive scene objects to be
    composed, the scene graph constructs, and the behavior and
    interactivity
23. BIFS
- In addition to VRML, BIFS defines
  - 2D capabilities,
  - integration of 2D and 3D,
  - advanced audio features,
  - a timing model,
  - a BIFS-Update protocol to modify the scene over time,
  - a BIFS-Animation protocol to animate the scene over time,
  - a binary encoding for the scene.
24. Summary of MPEG-4
- A coded representation of interactive audiovisual scenes based on
  composition and rendering
- Identification of elementary streams
- Animation of object scenes using object content information
- User interaction via a return channel
25. MPEG-7
- Multimedia objects may be described by descriptions called metadata
- Examples: <duration> of a video, <time point> within an audio clip,
  <author> of an image, <keyword> of an object, <TextAnnotation> of an
  object, etc.
- Different content-creation software/hardware may describe the same
  object type using different names and formats
  - → incompatibility among different software
  - → difficulty in searching/indexing/comparing objects from
    different databases
26. MPEG-7
- Provides core technologies for describing the content of
  audiovisual data
- May include still pictures, graphics, 3D models, audio, speech,
  video, and composition information
- Offers different granularities in its descriptions to allow
  different levels of discrimination
27. MPEG-7
- The main elements of the MPEG-7 standard are
  - Description tools: descriptors that define the syntax and
    semantics of each metadata element
  - DDL: defines the syntax of the description tools and allows the
    creation of new description schemes
  - Classification schemes: define lists of typical terms used in
    many applications, together with their meanings
  - Extensibility: supported through the MPEG-7 schema extension
    mechanism
  - System tools: support a binary coded representation for efficient
    storage and transmission
28. MPEG-7
- (Figure: the Data Definition Language (DDL) defines Description
  Schemes, e.g. VisualDSType, SegmentDS, MediaInfoDS, and Descriptors,
  e.g. VisualDType, MediaProfileD, MediaQualityD; these are extended
  via Classification Schemes and instantiated in an MPEG-7 document
  <Mpeg7>…</Mpeg7> for encoding and delivery)
29. MPEG-7
- (Figure: type derivation hierarchy of MPEG-7 descriptions: a complete
  description comprises content description and content management;
  content description covers content entities (video, audio, image)
  and content abstractions (semantic and summary descriptions), along
  with user and creation descriptions of multimedia content)
30. MPEG-7 Example
- A list of integer values can be defined as follows:

  <simpleType name="integerVector">
    <list itemType="integer"/>
  </simpleType>
31. MPEG-7 Example
- All MPEG-7 documents should have the following header:

  <?xml version="1.0" encoding="iso-8859-1"?>
  <Mpeg7 xmlns="urn:mpeg:mpeg7:schema:2001"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xmlns:mpeg7="urn:mpeg:mpeg7:schema:2001"
         xmlns:xml="http://www.w3.org/XML/1998/namespace"
         xsi:schemaLocation="urn:mpeg:mpeg7:schema:2001 Mpeg7-2001.xsd">
    <!-- Your MPEG-7 content -->
  </Mpeg7>
32. MPEG-21
- Many elements exist for building an infrastructure for the delivery
  and consumption of multimedia content
- There is no big picture describing how these elements relate to each
  other
33. MPEG-21
- MPEG-21 defines an open multimedia framework
  - Enables transparent and augmented use of multimedia resources
    across a wide range of networks and services
  - Covers the entire content delivery chain
    - content creation, production, delivery, personalization,
      consumption, presentation, and trade
  - Describes the framework as a generic architecture
34. MPEG-21
- MPEG-21 defines
  - Digital item: the fundamental unit of distribution and transaction
  - User model: the concept of users interacting with digital items (DIs)
35. MPEG-21
- Digital Item (DI)
  - a hierarchical container of resources, metadata, and other digital
    items
- User
  - any entity that interacts in the MPEG-21 environment or makes use
    of a DI
  - e.g. individuals, corporations, and consumers can be users
36. MPEG-21
- Relationship among users and digital items
- A user may use content in many ways (publish, deliver, consume, and
  so on)
- (Figure: users interact with each other and use digital items)
37. MPEG-21
- Seven key architectural elements
  - Digital item declaration: gives a precise description of what
    constitutes a DI
  - Digital item identification: identifies and describes a DI
  - Content management and usage: provides interfaces and protocols
    that enable creation, manipulation, search, access, storage, and
    (re)use of content across its delivery and consumption chain
  - Intellectual property management and protection: enables DIs and
    their rights to be persistently and reliably managed and protected
38. MPEG-21
- Seven key architectural elements (cont.)
  - Terminals and networks: provide tools that enable interoperable
    and transparent access to content across networks and terminals
  - Content representation: defines how the media resources are
    represented
  - Event reporting: supplies the metrics and interfaces that enable
    users to understand precisely the performance of all reportable
    events
39. MPEG-21 Example
- The following XML declares the root element DIDL; several photo
  albums are grouped together in a DI:

  <DIDL>
    <Container>
      <Descriptor>
        <Statement mimeType="text/plain">My Photo Albums in Items</Statement>
      </Descriptor>
      <Item id="Album1"> … </Item>
      <Item id="Album2"> … </Item>
    </Container>
  </DIDL>
40. ITU
- ITU: International Telecommunication Union
- 2 video formats
- Common Intermediate Format (CIF)
  - half the resolution of the BT.601 4:2:0 format in both the
    horizontal and vertical dimensions
  - 352×288 pixels/frame, 4:2:0 subsampling, 30 frames/second,
    progressive scan, ~37 Mbps
41. ITU
- 2 video formats (cont.)
- QCIF
  - half the resolution of CIF in both the horizontal and vertical
    directions
  - 176×144 pixels/frame, 4:2:0 subsampling, 30 frames/second,
    progressive scan, ~9 Mbps
- Both video formats are non-interlaced and were developed for video
  conferencing applications
42. H.261
- ISDN (Integrated Services Digital Network) lines only allow
  transmission rates in multiples of 64 kbps
- H.261 compresses CIF and QCIF to p × 64 kbps, where p = 1, 2, …, 30
- It compresses a CIF signal with a raw data rate of 37 Mbps to
  128-384 kbps with reasonable quality
- It compresses a QCIF signal with a raw data rate of 9 Mbps to
  64-128 kbps
43. H.261 Encoder
- Features of an H.261 encoder
  - H.261 subdivides the image into macroblocks of 16×16 pels
  - A macroblock consists of 4 luminance and 2 chrominance blocks
    (1 for Cb and 1 for Cr)
  - Each macroblock is transformed using an 8×8 DCT on each block to
    reduce spatial redundancy
  - Uses Differential Pulse Code Modulation (DPCM) to exploit temporal
    redundancy
  - Uses unidirectional integer-pel forward motion compensation
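The macroblock layout above can be sketched as follows. This is an illustrative helper assuming row-major lists of samples; with 4:2:0 subsampling each chroma plane is already a single 8×8 block per 16×16 luma macroblock:

```python
def split_macroblock(y16, cb8, cr8):
    """Split a 16x16 luminance macroblock into the four 8x8 blocks that
    are DCT-coded, then append the 8x8 Cb and Cr blocks (in 4:2:0, one
    chroma sample covers a 2x2 luma area, so each chroma component of a
    macroblock is a single 8x8 block)."""
    luma = [[row[c:c + 8] for row in y16[r:r + 8]]
            for r in (0, 8) for c in (0, 8)]
    return luma + [cb8, cr8]          # 4 luminance + 2 chrominance blocks

blocks = split_macroblock([[0] * 16 for _ in range(16)],
                          [[0] * 8 for _ in range(8)],
                          [[0] * 8 for _ in range(8)])
print(len(blocks))                    # → 6
```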
44H.261 Encoder
p
Mux
Coding Control
t
qz
q
Q
T
Video in
Q-1
Channel
T-1
d
P
F
f
45H.261 Encoder
- T Transform
- Q Quantizer
- P Picture memory with motion estimator and
motion compensation unit - F Loop filter
- p flag for Intra/Inter
- t flag for transmitted or not
- qz quantizer indication
- q quantizing index for transform coefficients
- d motion vector
- f switching on/off of the loop filter
46. H.261 Encoder
- Each macroblock is transformed using an 8×8 DCT (block T)
- The DCT coefficients are
  - scanned using a zigzag scan
  - quantized
  - converted into pairs of symbols
  - encoded using variable-length codewords
- (Figure: zigzag scan of the 8×8 coefficient block, from DC through
  AC01 … AC77)
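The zigzag scan order can be generated programmatically. This is a sketch: the traversal visits anti-diagonals of constant row+col, alternating direction so the path snakes from DC toward the high frequencies:

```python
def zigzag_order(n=8):
    """Return the (row, col) visiting order of the zigzag scan over an
    n x n coefficient block, from the DC term toward high frequencies.
    Cells are grouped by anti-diagonal (constant row+col); odd
    diagonals are walked downward, even ones upward."""
    return sorted(((r, c) for r in range(n) for c in range(n)),
                  key=lambda rc: (rc[0] + rc[1],
                                  rc[0] if (rc[0] + rc[1]) % 2 else rc[1]))

scan = zigzag_order()
print(scan[:6])   # → [(0, 0), (0, 1), (1, 0), (2, 0), (1, 1), (0, 2)]
```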
47. H.261 Encoder
- Differential Pulse Code Modulation (DPCM)
  - encodes the difference from the previous DC coefficient
- Two quantizers for the DCT coefficients (block Q)
  - a uniform quantizer with a stepsize of 8 is used in intra mode for
    the DC coefficients
  - a nearly uniform midtread quantizer with stepsizes of 2 to 62 for
    the AC coefficients
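The DC DPCM can be sketched as follows (an illustrative helper, assuming a zero predictor for the first block):

```python
def dpcm_encode(dc_values):
    """Encode each DC coefficient as the difference from the previous
    block's DC; the first block is differenced against 0."""
    diffs, prev = [], 0
    for dc in dc_values:
        diffs.append(dc - prev)
        prev = dc
    return diffs

def dpcm_decode(diffs):
    """Invert the DPCM by accumulating the transmitted differences."""
    out, prev = [], 0
    for d in diffs:
        prev += d
        out.append(prev)
    return out

print(dpcm_encode([100, 104, 98]))   # → [100, 4, -6]
```

Neighboring blocks tend to have similar DC values, so the differences are small and cheap to entropy-code.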
48. H.261 Encoder
- A nearly uniform midtread quantizer with input amplitude x and output
  amplitude Q(x)
- The dead zone, from -T to T, is quantized to zero; elsewhere the
  stepsize is uniform
- The dead zone avoids coding many small coefficients that would mainly
  contribute to coding noise
- (Figure: quantizer characteristic with a dead zone between -T and T)
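A dead-zone midtread quantizer can be sketched as follows. The level and threshold convention here is illustrative; H.261 itself defines the exact decision levels in its tables:

```python
def deadzone_quantize(x, step, dead_zone):
    """Midtread quantizer with a dead zone: inputs in (-dead_zone,
    +dead_zone) map to level 0; beyond the dead zone the stepsize is
    uniform. Returns the integer quantization level."""
    if abs(x) < dead_zone:
        return 0
    sign = 1 if x > 0 else -1
    return sign * (int((abs(x) - dead_zone) // step) + 1)

# Illustrative parameters: stepsize 8, dead zone T = 4.
levels = [deadzone_quantize(x, step=8, dead_zone=4) for x in (3, 5, 20, -12)]
print(levels)   # → [0, 1, 3, -2]
```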
49. H.261 Encoder
- Each symbol includes two fields:
  - the number of preceding coefficients quantized to zero (the run), and
  - the amplitude of the current nonzero coefficient
- The symbols are encoded using variable-length codewords (VLC), like
  Huffman coding
- The encoder sends an End of Block (EOB) symbol after the last nonzero
  coefficient
- Example: the quantized DCT coefficients
  5 0 0 2 3 0 0 4 0 0 0 0 0 0 1 0 0 0 0 0 0
  are converted into (0, 5), (2, 2), (0, 3), (2, 4), (6, 1), EOB
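The symbol conversion on this slide can be reproduced directly (a sketch; the VLC tables that encode these symbols are defined by the standard):

```python
def run_level_symbols(coeffs):
    """Convert quantized DCT coefficients (already in zigzag order)
    into (zero-run, amplitude) symbols, terminated by an EOB marker.
    Trailing zeros after the last nonzero coefficient are implied by
    the EOB and never coded explicitly."""
    symbols, run = [], 0
    for c in coeffs:
        if c == 0:
            run += 1
        else:
            symbols.append((run, c))
            run = 0
    return symbols + ["EOB"]

coeffs = [5, 0, 0, 2, 3, 0, 0, 4, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0]
print(run_level_symbols(coeffs))
# → [(0, 5), (2, 2), (0, 3), (2, 4), (6, 1), 'EOB']
```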
50. H.261 Encoder
- Uses unidirectional integer-pel forward motion compensation (block P)
- Uses a 2D loop filter to low-pass the motion-compensated prediction
  signal (block F)
  - decreases the prediction error and the blockiness of the prediction
    image
51. H.261 Encoder
- The encoder transmits mainly 2 classes of information for each
  macroblock:
  - DCT coefficients from the transform of the prediction error signal
  - motion vectors estimated by the motion estimator
- Motion vectors are limited to ±16 pels
52. H.261 Encoder
- In intra mode, the bit stream contains transform coefficients for
  each block
- In inter mode, the encoder can choose to send
  - a differentially coded motion vector (MVD), with or without the
    loop filter on
  - a coded block pattern (CBP) specifying the blocks for which
    transform coefficients are transmitted
53. Summary of Video Compression
- MPEG-2 compresses video into a collection of I/P/B frames
  - I-frames use the DCT
  - P-frames use motion estimation
  - B-frames use bi-directional interpolation
- MPEG-4 animates video as interactive scenes of elementary streams
- MPEG-7 describes multimedia objects with metadata
- MPEG-21 builds a framework for users to manipulate digital items,
  which are resources and metadata of multimedia objects
54. Summary of Video Compression
- H.261
  - divides images into 16×16-pel macroblocks
  - transforms each macroblock with 8×8 DCTs
  - encodes DCT coefficients using Differential Pulse Code Modulation
    and variable-length codewords
  - uses unidirectional integer-pel forward motion compensation