Model-independent Schema and Data Translation - PowerPoint PPT Presentation

About This Presentation
Title:

Model-independent Schema and Data Translation

Description:

Last Name. School. Dept. Name. Address. ID. ID. ID. Dept_ID. Emp_ID ... A common dictionary (for an ER design tool) Entity. OID. Schema. Name. 301. 1. Employees ... – PowerPoint PPT presentation

Number of Views:19
Avg rating:3.0/5.0
Slides: 35
Provided by: profpaol6
Category:

less

Transcript and Presenter's Notes

Title: Model-independent Schema and Data Translation


1
Model-independent Schema and Data Translation
  • Paolo Atzeni
  • 14/10/2009

2
References
  • P. Atzeni, G. Gianforme, and P. Cappellari. A
    Universal Metamodel and Its Dictionary.
    Transactions on Large-Scale Data-and
    Knowledge-Centered Systems I, 2009
  • P. Atzeni, P. Cappellari, R. Torlone, P. A.
    Bernstein, and G. Gianforme. Model-independent
    schema translation. VLDB Journal, 2008
  • P. Atzeni, P. Cappellari, and P. A. Bernstein.
    Model independent schema and data translation.
    EDBT 2006.

3
Schema and data translation
  • Schema translation
  • given schema S1 in model M1 and model M2
  • find a schema S2 in M2 that corresponds to S1
  • Schema and data translation
  • given also a database D1 for S1
  • find also a database D2 for S2 that contains the
    same data as D1

4
A wider perspective
  • (Generic) Model Management
  • A proposal by Bernstein et al (2000 )
  • Includes a set of operators on
  • schemas and mappings between schemas
  • The main operators
  • Match
  • Merge
  • Diff
  • ModelGen (schema translation)

5
A long standing issue
  • Translations from a model to another have been
    studied since the 1970s
  • Whenever a new model is defined, techniques and
    tools to generate translations are studied
  • However, proposals and solutions are usually
    model specific
  • Given an ER schema, find the suitable relational
    schema that implements it
  • the original paper (Chen 1976) contains the
    basics
  • further discussions by many (e.g. Markowitz and
    Shoshani 1989)
  • illustrated in every textbook
  • Similarly with
  • any other conceptual model and any other logical
    one
  • XML and relational (or object)

6
 A simple example
  • An object relational database, to be translated
    in a relational one
  • Source the OR-model
  • Target the relational model

7
 Example, 2
  • How do we traslate?
  • Two possibilities. Does the OR model allow for
    keys?
  • If YES (EmpNo and Name are keys)

8
 Example, 3
  • How do we traslate?
  • Two possibilities. Does the OR model allow for
    keys?
  • If NOT

9
Many different models (plus variants )
OO
ER
OR
XSD
Relational

10
Heterogeneity
  • We need to handle artifacts and data in various
    models
  • Data are defined wrt to schemas
  • Schemas wrt to models
  • How can models be defined? We need metamodels

Data
11
A metamodel approach
  • The constructs in the various models are rather
    similar
  • can be classified into a few categories (Hull
    King 1986)
  • Abstract (entity, class, )
  • Lexical set of printable values (domain)
  • Aggregation a construction based on (subsets
    of) cartesian products (relationship, table)
  • Function (attribute, property)
  • Hierarchies
  • We can fix a set of metaconstructs (each with
    variants)
  • abstract, lexical, aggregation, function, ...
  • the set can be extended if needed, but this will
    not be frequent
  • A model is defined in terms of the metaconstructs
    it uses

12
 The metamodel approach, example
  • The ER model
  • Abstract (called Entity)
  • Function from Abstract to Lexical (Attribute)
  • Aggregation of abstracts (Relationship) 
  • The OR model
  • Abstract (Table with ID)
  • Function from Abstract to Lexical (value-based
    Attribute)
  • Function from Abstract to Abstract (reference
    Attribute)
  • Aggregation of lexicals (value-based Table)
  • Component of Aggregation of Lexicals (Column)

13
 The supermodel
  • A model that includes all the meta-constructs (in
    their most general forms)
  • Each model is subsumed by the supermodel (modulo
    construct renaming)
  • Each schema for any model is also a schema for
    the supermodel (modulo construct renaming)
  • In the example, a model that generalizes OR and
    relational
  • Each translation from the supermodel SM to a
    target model M is also a translation from any
    other model to M
  • given n models, we need n translations, not n2 

14
Generic translations
Supermodel
2. Translation
1. Copy
3. Copy
Source model
Target model
Translation composition 1,2 3
15
Translations within the supermodel
  • We still have too many models
  • we have few constructs, but each has several
    independent features which give rise to variants
  • for example, within simple OR model versions,
  • Key may be specifiable or not
  • Generalizations may be allowed or not
  • Foreign keys may be used or not
  • Nesting may be used or not
  • Combining all these, we get hundreds of models!
  • The management of a specific translation for each
    model would be hopeless

16
The metamodel approach, translations
  • As we saw, the constructs in the various models
    are similar
  • can be classified according to the metaconstructs
  • translations can be defined on metaconstructs,
  • there are standard, known ways to deal with
    translations of constructs (or variants theoreof)
  • Elementary translation steps can be defined in
    this way
  • Each translation step handles a supermodel
    construct (or a feature thereof) "to be
    eliminated" or "transformed"
  • Then, elementary translation steps to be combined
  • A translation is the concatenation of elementary
    translation steps

17
Many different models
OO
ER
OR
XSD
Relational

18
Many different models (and variants )
OR w/ PK, gen, ref, FK
OR w/ PK, gen, ref
OR w/ PK, gen, FK
OR w/ PK, ref, FK
OR w/ PK, ref
OR w/ gen, ref
OR w/ PK, FK
OR w/ ref
Relational

19
A complex translation, example
(0,N)
(0,N)
  • Target simple object model
  • Eliminate N-ary relationships
  • Eliminate attributes from relationships
  • Eliminate many-to-many relationships
  • Replace relationships with references
  • Eliminate generalizations

20
Complex translations
N-ary ER w/ gen
Elim. N-ary relationships Elim. Relationship
attr.s Elim. MN relationships Replace
relationships with references Elim OO
generalizations Elim ER generalizations
Binary ERw/ gen
N-ary ER w/o gen
Bin ER w/ gen w/o attr on rel
Binary ER w/o gen
Bin ER w/o gen w/o attr on rel
Bin ER w/ gen w/o MN rel
OO w/ gen
Bin ER w/o gen w/o MN rel

Relational
OO w/o gen
21
 A more complex example
  • An object relational database, to be translated
    in a relational one
  • Source an OR-model
  • Target the relational model

22
A more complex example, 2
Dept
DEPT
EMP
ID
ID
Name
Last Name
Address
Dept_ID
Target relational model
Eliminate generalizations Add keys Replace refs
with FKs
ENG
ID
School
Emp_ID
23
A more complex example, 3
DEPT
EMP
ID
ID
Last Name
Name
Address
Dept_ID
Target relational model
Eliminate generalizations Add keys Replace refs
with FKs Replace objects with tables
ENG
ID
School
Emp_ID
24
Many different models (and variants )
OR w/ PK, gen, ref, FK
OR w/ PK, gen, ref
OR w/ PK, gen, FK
OR w/ PK, ref, FK
OR w/ PK, ref
OR w/ gen, ref
OR w/ PK, FK
Source
OR w/ ref
Eliminate generalizations Add keys Replace refs
with FKs Replace objects with tables
Relational
Target
25
Translations in MIDST (our tool)
  • Basic translations are written in a variant of
    Datalog, with OID invention
  • We specify them at the schema level
  • The tool "translates them down" to the data level
    (both in off-line and run-time manners, see
    later)
  • Some completion or tuning may be needed

26
A Multi-Level Dictionary
  • Handles models, schemas and data
  • Has both a model specific and a model independent
    component
  • Relational implementation, so Datalog rules can
    be easily specified

27
A common dictionary (for an ER design tool)
AttributeOfEntity AttributeOfEntity AttributeOfEntity AttributeOfEntity AttributeOfEntity AttributeOfEntity AttributeOfEntity
OID Schema Name isIdent isNullable Type Entity
401 1 EmpNo T F Int 301
402 1 Name F F Text 301
404 1 Name T F Char 302
405 1 Address F F Text 302
501 3 Code T F Int 201

Entity Entity Entity
OID Schema Name
301 1 Employees
302 1 Departments
201 3 Clerks
202 3 Offices
Relationship Relationship Relationship Relationship Relationship Relationship Relationship
OID Schema Name Entity1 Entity2
401 1 Memb 301 302

28
Multi-Level Dictionary
29
The supermodel description
MSM-Property MSM-Property MSM-Property MSM-Property
OID Name Constr. Type
11 Name 1 String
12 Name 2 String
13 IsKey 2 Boolean
14 IsNullable 2 Boolean
15 Type 2 String
16 Name 3 String
17 Name 4 String
18 IsIdentifier 4 Boolean
19 IsNullable 4 Boolean
20 Type 4 String
21 IsFunct1 5 Boolean
22 IsOptional1 5 Boolean
23 Role1 5 String
24 IsFunct2 5 Boolean
25 IsOptional2 5 Boolean
26 Role2 5 String
MSM-Construct MSM-Construct MSM-Construct
OID Name IsLex
1 AggregationOfLexicals F
2 ComponentOfAggrOfLex T
3 Abstract F
4 AttributeOfAbstract T
5 BinaryAggregationOfAbstracts F
MSM-Reference MSM-Reference MSM-Reference MSM-Reference
OID Name Construct Target
30 Aggregation 2 1
31 Abstract 4 3
32 Abstract1 5 3
33 Abstract2 5 3
30
Model descriptions
MM-Model MM-Model
OID Name
1 Relational
2 Entity-Relationship
3 Object
MSM-Construct MSM-Construct MSM-Construct
OID Name IsLex
1 AggregationOfLexicals F
2 ComponentOfAggrOfLex T
3 Abstract F
4 AttributeOfAbstract T
5 BinaryAggregationOfAbstracts F
MM-Construct MM-Construct MM-Construct MM-Construct
OID Model MSM-Constr Name
1 2 3 ER_Entity
2 2 4 ER_Attribute
3 2 5 ER_Relationship
4 1 1 Rel_Table
5 1 2 Rel_Column
6 3 3 OO_Class
MSM-Property MSM-Property MSM-Property MSM-Property
OID Name Construct Type

MM-Property MM-Property MM-Property MM-Property

MSM-Reference MSM-Reference MSM-Reference MSM-Reference
OID Name Construct

MM-Reference MM-Reference MM-Reference MM-Reference

31
Schemas in a model
EmpNo
Employees
SM-Construct SM-Construct SM-Construct
OID Name IsLex

3 ER-Entity F
4 ER-AttributeOfEntity T

Name
Name
Departments
Address
ER-AttributeOfEntity ER-AttributeOfEntity ER-AttributeOfEntity ER-AttributeOfEntity ER-AttributeOfEntity ER-AttributeOfEntity ER-AttributeOfEntity
OID Schema Name isIdent isNullable Type AbstrOID
401 1 EmpNo T F Int 301
402 1 Name F F Text 301
404 1 Name T F Char 302
405 1 Address F F Text 302
501 3 Code T F Int 201

ER-Entity ER-Entity ER-Entity
OID Schema Name
301 1 Employees
302 1 Departments
201 3 Clerks
202 3 Offices
ER schemas
32
Schemas in the supermodel
EmpNo
Employees
MSM-Construct MSM-Construct MSM-Construct
OID Name IsLex

3 Abstract F
4 AttributeOfAbstract T

Name
Name
Departments
Address
SM-AttributeOfAbstract SM-AttributeOfAbstract SM-AttributeOfAbstract SM-AttributeOfAbstract SM-AttributeOfAbstract SM-AttributeOfAbstract SM-AttributeOfAbstract
OID Schema Name isIdent isNullable Type AbstrOID
401 1 EmpNo T F Int 301
402 1 Name F F Text 301
404 1 Name T F Char 302
405 1 Address F F Text 302
501 3 Code T F Int 201

SM-Abstract SM-Abstract SM-Abstract
OID Schema Name
301 1 Employees
302 1 Departments
201 3 Clerks
202 3 Offices
Supermodel schemas
33
Multi-Level Repository, generation and use
description
Supermodel description (mSM)
Model descriptions (mM)
model
Supermodel schemas (SM)
Model specific schemas (M)
schema
Structure fixed, content provided by tool
designers
Structure fixed, content provided by model
designers out of mSM
Supermodel instances (i-SM)
Model specific instances (i-M)
data
Structure generated by the tool from the content
of mM
Structure generated by the tool from the content
of mSM
model independence
model generic
model specific
Structure generated by the tool from the content
of mSM
34
  • continued next week
Write a Comment
User Comments (0)
About PowerShow.com