Title: Semantic Contentbased Access To Hypervideo Databases
1Semantic Content-based Access To Hypervideo
Databases
Haitao Jiang Major Professor Ahmed K.
Elmagarmid Computer Science Department Purdue
University 1998
2Organization Of The Talk
- Introduction And Review Of Related Work
- Logical Hypervideo Data Model (LHVDM)
- Semantic Content-based Video Queries
- A Web-based Logical Hypervideo Database (WLHVDB)
- Conclusion
3Unique Characteristics Of Video Data
- Semantics rich and ambiguous
- Relationship ill-defined
- Structure unclear
- Dimension spatial and temporal
- Volume huge
4Video Data Content
- Visual Content
- Audio Content
- Text Content
- Semantics Content
5Research Problems
- Video Data Modeling
- Video Data Indexing
- Video Data Query
- Video Browsing
6 Video Data Model Requirements
- Content-based Data Access
- Video Data Abstraction
- Variable Data Access Granularity
- Dynamic And Incremental Video Annotation
7Video Data Model Requirements (Con.)
- Video Data Independence
- Spatial And Temporal Characteristics
- Video And Meta-data Sharing And Reuse
8Related Work
- Video Data Modeling, Indexing, And Querying
- Video Objects
- Video Browsing
9Video Data Modeling, Indexing, and Querying
- Traditional Database Approach
- Visual Content Or Segmentation-based Approach
- Stratification Or Annotation Layering Approach
10 Traditional Database Approach
- Categorize And Predefine Video Data
Attributes/Values - Use Traditional Databases And SQL
- Inflexible And Limited
- Examples VISION, Video Database Browser
11 Segmentation-based Models
- Parse And Segment Video Streams
- Index On Visual Features Of RFrames
- Extract High Level Logical Structure And
Semantics By Classifying Against Domain Models
12Segmentation-based Models (con.)
- Can Be Fully Automated
- Lack Of Flexibility
- Limited Semantics
- Video Streams Need To Be Well-structured
- Examples JACOB, QBIC, Informedia
13Stratification
- Segment Video Semantics
- Concept Of Logical Video Data
- Allows For Semantic Content-based Video Access
- Annotation Can Be Tedious And Biased
- Examples VideoStar, Algebraic Video
14Stratification(con.)
- Existing Models
- Has Limited Temporal Queries
- Has Limited Video Browsing Mechanism
- Lack Multi-user Views And Data Sharing
- Lack Modeling Of Video Objects
- Lack Spatial And Spatial-Temporal Query
Capabilities
15Different Forms Of Video Annotation
- Multi-layer Icons - MediaStream
- Keywords
- Free Text Documents
- Other Types Of Annotation?
16Sources Of Video Annotations
- Closed Caption
- Text In Video Frames highlight detection and OCR
- Voice Recognition
- Manual Annotation
17Annotation Support In A Video Data Model
- Annotation of Arbitrary Sequence
- Incremental Creation, Deletion, And Modification
- Multi-user Annotation Sharing
- Arbitrary Overlap Of Annotations
18Video Objects
- Index On Spatial And Temporal Information
- MBR as the Spatial Representation
- Narrow Focus And Lack Of Data Abstraction
- Limited Video Queries
- Example AVIS, CVOT
19Video Browsing
- Visual Content-based Browsing
- Film Strips
- Salient Images
- Scene Clustering Graph
- Need Semantic Content-based Browsing
- Need Inter-Video Navigation
20Research Motivations
- Visual Content-based Video Access IS Important
BUT Lack Semantics - Users Often Prefer Semantic Content-based Video
Data Access - Lots Applications Digital Video Library And
Distance Learning etc. - Web Is An Emerging Way Of Information Sharing
21Research Goal
- Goal To Provide Effective And Flexible Semantic
Content-based Video Data Access In A Distributed
and Multi-user Sharing Environment - Both Spatial And Temporal Video Queries
- Heterogeneous Applications And User Views
- Semantic Content-based Browsing
22Logical Hypervideo Data Model (LHVDM)
- Definition
- Hierarchical Video Abstractions
- Hot Video Object Modeling
- Video Indexing
- Video Semantic Association And Hypervideo
- A Generic Video Database Architecture
23Logical Hypervideo Data Model (con.)
- (PV, PVS, LV, LVS, HO, CD, LINKS, UV, MAP)
- PV Set Of Physical Video Streams
- PVS Set Of Physical Video Segments
- LV Set Of Logical Video Streams
- LVS Set Of Logical Video Segments
- HO Set Of Hot Objects
- CD Set Of Content Descriptions
- LINK Set Of Video Hyperlinks
- UV Set Of User Views
- MAP Set Of Mapping Relations
24Logical Hypervideo Data Model (con.)
- MAP includes
- PV PVS Easy Data Manipulation
- PVS LV Data Independence And Data Reuse
- LV LVS Multi-user View
- LV,LVSHOEffective Query
- LV,LVS,HO,CDUV Multi-user View Sharing
- LV,LVS,HO,LINKSCD Semantic Content-based
Access - Video Hyperlinks Effective Video Browsing
25Hierarchical Video Abstractions
User Views (UVs)
Logical Hypervideo Data Model (LHVDM)
Logical Video Segments (LVSs)
Hot Objects (HOs)
Video Hyperlinks
Logical Video Streams (LVs)
Physical Video Segments (PVSs)
Physical Video Streams (PVs)
26Hot Video Objects
- What Is A Hot Video Object
- A Logical Video Abstraction
- A Sub-Frame Region That Is Hot In A Set Of
Logical Frame Sequence - Why Call Them Hot Object?
- Target Of Interest
- Hyperlink Property (Hot Video Spot)
27Hot Video Objects (con.)
- Implicit Hot Video Object
- Why Hot Object Modeling Is Important?
- More Precise Video Annotation And Query
- Capture Spatial Characteristics of Video Data
28Hot Video Object Model
29Hot Object Tracking
- Template Matching
- Active Contour
- E ? Eint (1- ?) Eext
- Eint(Vi) ?Vi 2 ?Vi 2
- Eext ?c (-N(V (t))?I(V (t)))dt
- Eext (Vi) 1 - (D(Vi)/255 1) (-N(Vi)?I(Vi ))
30Experiments With Hot Video Object Tracking
31Video Indexing
- Main Indexes Semantic Content Descriptions
- Content Descriptions
- Hot Objects v.s. LVSs v.s. LVs v.s. LINKSs
- Systems v.s. Users
- Sharing Access Control
- Auxiliary Indexes Various Mappings
32Video Semantic Association and Hypervideo
- Why Semantic Association Is Important?
- More Effective Video Data Access
- Video Hyperlinks Represent Semantic Associations
- Hypervideo And Hypervideo Databases
- Flexible And User Adaptive Hypervideo Databases
33A Generic Video Database System Architecture
Users
Video Indexes
Knowledge Inference Engine
GUI
Hot Objects
Spatial Information
Geometric Engine
Integrator
LVS
Information Retrieval Engine
Video Annotations
Video Hyperlinks
LV
Video Data
Video Database Engines
Video Database
34Semantic Content-based Video Query Language
- (Expr, Granularity, Scope, Space)
- Expr video query expression
- Granularity
- Logical Video Streams (v)
- Logical Video Segments (s)
- Hot Objects (o)
35Semantic Content-based Video Query Language
(con.)
- Scope
- System Annotations Only (s)
- Including User Annotations (u)
- Including Sharable Other Users Annotations (a)
36Semantic Content-based Video Query Language
(con.)
- Space
- Subset Of VDB
- Can Be Result Of Another Query
- Allows Recursive Query Refinement
- Example Q2 (expr, o, u, Q1)
37Boolean and IR Query Operators
- AND, OR, NOT
- NOT Is Not Safe
- ADJ (adjacent)
- Regular Expression And Approximate Matching
38Temporal Query Operators
- Thirteen Interval Relations Allen83
Before
After
Ends
Starts
Equal
Meets
Overlaps
During
39Temporal Query Operators (con.)
- Operators For LVS
- Interval Temporal Operators
- Operators For HO
- Instance (or Point) Temporal Operators more
precise query specification - Interval Temporal Operators
40Spatial Query Operators
- Directional
- Topological
- Distance
North
West
East
South
41Query Processing
- Recursive And Top-Down/Bottom-UP
- Support Distribute Evaluation
- Close World Assumption?
- Answer No (Raw Video Data) And Yes (Within
Users View) - Reason Video Data Is Semantically Rich
42Query Search Space
- User Definable
- System Owned Subset of VDB Are Always Searched
- Users Queries Are Processed Within Ones View
- Determined By A Querys Granularity, Scope,
Space, And User
43Efficient Query Processing
- Query Augmentation or Pre-filtering
- Query Evaluation Order
- Query Caching And Knowledge Base
44Query Examples
- Simple Queries
- Find video clips that has a red BMW Z3 in it
- Q1 ((red Ù BMW Z3), o, -, -)
45Query Examples (con.)
- Temporal Query
- Find video clips in which a scene with a bird
flying appears after the scene with a child
eating ice cream - Q3 ((bird Ù flying) Tafter (child Ù eating Ù
ice cream), o, -,-) - Q3 ((bird Ù flying) Iafter (child Ù eating Ù
ice cream), s, -,-)
46Query Examples (con.)
- Spatial Query
- Find video clips in which the Vice President Al
Gore standing to the right of President Clinton
who is giving his Union speech at Washington DC - Q2a (Union speech Ù Washington Ù DC, s, -,-)
- Q2b (((Vice President ? Gore) Sright (President
? Clinton)), o, -, Q2a)
47Query Examples (con.)
- Spatial Queries
- Find video clips in which a blue bird is flying
over a kids head - Q4 ((blue Ù bird Ù flying) Sabove ((child ? kid
? boy ? girl) Ù head), o, -,-)
48Query Examples (con.)
- Spatio-Temporal Query
- Find video clips in which a police car with
siren on is chasing a red Porsche and hit on it - Qa ((police car siren), o, -,-)
- Qb (red Prosche), o, -,-)
- Q ((Qa Sapproach Qb) Ibefore (Qa Stouch Qb),
o, -,-)
49Web-based Logical Hypervideo Database System
(WLHVDB)
- System Architecture
- Video Wrapper And Lazy Delivery
- Populating The Video Database
- Distributed Query Processing
- Access Control And User Profiling
50System Architecture
Physical Video Data
Video Parser
Logical Video Data
Account Manager
Video Annotations
Video Annotation Engine
Server
User Profile Manager
Video Hyperlinks
Query Processing Server
User Views
Access Control Manager
IR Engine
Video Indexes
User Profiles
Server Cache Manager
Various Tools and Scripts
Server Query Cache
Query Input
Client Query Cache
Query Result Presentation
Media Player
Client Cache Manager
Client
Data Editor
Client
Client
51Glimpse (GLobal IMPlicit SEarch)
- Small Index 2-4
- Full Text Boolean Queries
- Arbitrary Approximate And Regular Expression
Matching - Efficient (occurrences among 4500 files of total size of 69MB
52Video Wrapper And Lazy Delivery
- Why Need Them?
- Huge Date Volume v.s. Limited Bandwidth
- Why Lazy Delivery?
- Avoid Sending Video Data Information
- What is a Video Wrapper?
- Multi-resolution Video Representation
- Adaptive Local Refinement Based On Interest
53Current Video Wrapper Implementation
Video Poster
Clip Posters
RFrames
PVSs
54Populating the VDB
Video Parsing and Segmentation
Video Representation Construction
Video Wrapper
Video Capture
LVSs and Annotations
Video Indexes
LVs
Closed Caption Capture
Hot Objects
Object Tracking
DBA
Users
Analog Video
55Query Processing
Client
Server
Video Query
Query Parsing and Syntax Checking
Show Error Message
Processing IR Sub-queries
No Error
Search Users View
Sending IR Sub-queries
Processing Boolean and Spatio-temporal Operators
Sending Partial Results
Result
56Server and Client-Side Query Caching
IR Sub-queries
Server
Client
Miss
Miss
Client-side Cache
Server-side Cache
Query IR Engine
Hit
Hit
Update Cache
Get Results
Get Result
Send Results
Update Cache
Results
57Video Browsing
- Loop of Query-Browsing-Play
- Inter- And Intra-Video Browsing
- User Adaptive
- Video Wrapper Refinement Process
58Video Browsing (con.)
Video Query
Video Posters
Video Board
RFrames
Annotations
Audio Stream
Video Hyperlinks
Video Stream
59Access Control and User Profiling
- Different Categories Of Users And Groups
- Different Permissions On Video Data And Metadata
- Users Need To Authenticate Themselves
- User Activities And Local Environment Information
Are Recorded
60Summary of Major Contributions
- A Novel Video Data Model (LHVDM) That Supports
- Multi-level Video Abstraction
- Video Data Independence
- Multi-user Data Sharing
- Dynamic And Incremental View Update
- Variable Access Granularities
61Summary of Major Contributions (con.)
- A Novel Video Data Model (LHVDM) That Represents
- Both Spatial And Temporal Video Characteristics
- Hot Video Objects
- Video Semantics And Semantic Associations
62Summary of Major Contributions (con.)
- A Novel Video Data Model (LHVDM) That
- Supports User Adaptive Video Browsing
- Hyperlinks Video Entities For More Efficient
Browsing - Can Be Extended To Other Multimedia Data Such As
Audio Data
63Summary of Major Contributions (con.)
- A Video Query Language That Allows
- Easy Query Formulation
- Video Semantic Content-based Queries
- Both Spatial And Temporal Constraints
- Hot Object-based Video Queries
- User Selectable Granularity, Space, and Scope
64Summary of Major Contributions (con.)
- A Generic Video Database System Architecture
That Is - Modular, Flexible, And Scalable
- Readily To Be Distributed
- Easy To Be Implemented
65Summary of Major Contributions (con.)
- The Design And Implementation Of A Web-based
Prototype That Uses - A Novel Video Wrapper And Lazy Evaluation
Approach - Distributed Query Processing And Sub-query
Caching Schema - Multi-user Data Access Control And View Sharing
- User Profiling
66Future Work
- Identify New Applications And Perform More
Extensive Tests - Explore And Integrate Other Forms Of Video
Annotations Such As Visual Features - Extended To Other Multimedia Data Such As Slides,
Images, And Audio
67Future Work(con.)
- Knowledge-based Video Access
- Automatic Generation of Video Wrappers
- Video Data Security and Access Control
68Related Publications
- Survey and Books
- A. K. Elmagarmid and H. Jiang. Multimedia Video
(chapter), Encyclopedia of Electrical and
Electronics Engineering. John Wiley Sons. 1998,
In press. - A. K. Elmagarmid, H, Jiang and et al. Video
Database Systems Issues, Products and
Applications. Kluwer Academic Publishers, 1997. - H. Jiang, A. Helal, A. K. Elmagarmid, and A.
Joshi. Scene Change Detection Techniques for
Video Database Systems. ACM Multimedia Sys.,
6186-195, May 1998. - H. Jiang and A. K. Elmagarmid. Video Databases
State of the Art, State of the Market and State
of Practice. Proc. 2nd Intl. Workshop on
Multimedia Info. Sys., Page 87-91, West Point,
New York, September 26-28, 1996.
69Related Publications (con.)
- Video Analysis And Computer Vision
- H. Jiang and A. K. Elmagarmid. Extract Visual
Content Representation in Video Databases, Proc.
of Intel Conf. on Imaging Sci., Sys., and Tech.
(CISST'97), Las Vegas, Nevada, June 30 - July 3,
1997. - H. Jiang and J. Dailey. A Video Database System
for Studying Animal Behavior. Proc. SPIE
Photonics East'96 - Multimedia Storage and
Archiving Sys. Intl. Conf., Page 162-173, Volume
SPIE-2916, Boston, MA, November 18-19, 1996.
70Related Publications (con.)
- Video Data Model, Indexing, and Access
- H. Jiang, D. Montesi, and A. K. Elmagarmid.
Integrate Video and Text for Content-based
Accesses to Video Databases. J. of Multimedia
Sys. and Tools. 1998, accepted. - H. Jiang and A. K. Elmagarmid. WVTDB - A
Web-based VideoText Database System. Special
Issue on Data and Knowl. Management in Multimedia
Sys., IEEE Trans. on Data and Knowl. Eng.. 1998,
accepted.
71Related Publications (con.)
- Video Data Model, Indexing, and Access
- H. Jiang and A. K. Elmagarmid. Spatial and
Tempora Content-based Queries in Hypervideo
Databases. Special Issue on Multimedia Data
Management, The Very Large Database J.. 1998,
submitted. - F. Kokkoras, H. Jiang, I. Valhavas, A. K.
Elmagarmid, and E. N. Houstis. Smart VideoText
An Intelligent Video Database System, CSD-TR
97-049, Department of Computer Sciences, Purdue
University, West Lafayette, IN 47907, USA, 1997.
72END OF THE PRESENTATION