ConQuer: Efficient Management of Inconsistent Databases - PowerPoint PPT Presentation

1 / 15
About This Presentation
Title:

ConQuer: Efficient Management of Inconsistent Databases

Description:

ConQuer: Efficient Management of Inconsistent Databases. Presented by: ... Disagreement Between Sources. Which tuple for Peter should we delete? ... – PowerPoint PPT presentation

Number of Views:27
Avg rating:3.0/5.0
Slides: 16
Provided by: a15230
Category:

less

Transcript and Presenter's Notes

Title: ConQuer: Efficient Management of Inconsistent Databases


1
ConQuer Efficient Management of Inconsistent
Databases
  • Presented by
  • Ariel Fuxman (Univ. of Toronto)
  • Joint work with
  • Renée J. Miller (Univ of Toronto)
  • Diego Fuxman (Univ. Nacional del Sur)

2
ConQuer
  • A system designed to answer SQL queries over
    inconsistent databases

Income
Name
40K
Peter
200K
Peter
INCONSISTENT DATABASE
400K
Paul
110K
Mary
130K
Mary
name should be the key
3
One Application
  • Customer Relationship Management (CRM)

Sales
Integrated Customer Database
Shipping
Customer Support
Web Forms
Demographic Data
4
Disagreement Between Sources
  • Which tuple for Peter should we delete?
  • Removing both tuples loses consistent
    information
  • Deciding the correct income may require human
    intervention


address
income
name
.
276 College Street
40K
Peter
sales
400K

100 Bloor Street
Paul
110K

20 Union Street
Mary

address
income
name
.
276 College Street
200K
Peter
web
400K

100 Bloor Street
Paul
130K

20 Union Street
Mary
5
Inconsistent Integrated Database
Transfer all conflicting tuples to the integrated
database
Integrated Database
Sales
INCONSISTENT DATABASE
Web
6
Query Answering
Offering a Platinum credit card
  • qGet customers who make more than 100K

Peter,Paul,Mary
Peter should NOT be offered a Platinum card!!
income
name
40K
sales
Peter
200K
web
Peter
400K
sales/web
Paul
110K
sales
Mary
130K
web
Mary
7
Semantics of Query Answering
  • Get customers who possibly make more than 100K
  • Peter, Paul, Mary
  • Get customers who certainly make more than 100K
  • Paul, Mary

CONSISTENT ANSWER Arenas et al. 99
8
Repairs
Repairs
Inconsistent database
income
custid
40K
sales
Peter
200K
web
Peter
400K
sales/web
Paul
110K
sales
Mary
130K
web
Mary
Key custid
9
Consistent Query Answers
qGet customers who make more than 100K
Repairs
q
Paul
Mary
CONSISTENT ANSWER Paul,Mary
CONSISTENT ANSWERSAnswers obtained no matter
which repair we choose
q
Paul
Mary
Peter
q
Paul
Mary
Peter
q
Paul
Mary
10
Problem
  • Potentially HUGE number of repairs!

11
ConQuer
  • ConQuer is a system designed to compute
    consistent answers efficiently
  • avoids explicit construction of repairs
  • reuses commercial database technology

12
ConQuers Solution
Query q Keys
Consistent answer to q
ConQuers Rewriting Algorithm ICDT 05 SIGMOD
05
Commercial database engine
Inconsistent database
Rewritten Q
13
Contributions
  • Rewriting algorithm
  • From a large class of SPJ SQL queries
  • Into SQL queries
  • Rewriting for queries with grouping and
    aggregation
  • Optimized rewriting
  • Exploits precomputed information, if available
  • Experimental evaluation
  • Large databases
  • TPC-H queries

14
Demo
  • Present a case study of an inconsistent database
    about airports and cities
  • Explain the automatically generated rewritings
  • Deal with Select-Project-Join queries with
    grouping and aggregation

15
ConQuer papers
  • A. Fuxman, E. Fazli, and R. J. Miller. ConQuer
    Efficient Management of Inconsistent Databases,
    SIGMOD 2005.
  • A. Fuxman and R. J. Miller. First-Order Query
    Rewriting for Inconsistent Databases, ICDT 2005.
Write a Comment
User Comments (0)
About PowerShow.com