Title: Slide 1
1 FONDAZIONE RINASCIMENTO DIGITALE
PREMIS Implementation Fair 2009
Foundation promoted by Ente Cassa di Risparmio of
Florence
Implementation in Italy
Angela Di Iorio
Presidio Officers' Club, San Francisco, October
7th, 2009
2 The consequence of double-layered interoperability issues
Organizational and technological interoperability
issues break the flow of resources among
repositories and limit the opportunities for
exchanging OAIS Archival Information Packages (AIPs)
TECHNOLOGICAL
- METADATA
- ARCHIVING TECHNOLOGIES
- CONTENTS
ORGANIZATIONAL
- METHODOLOGIES
- STRATEGIES
- POLICIES
- BUSINESS RULES
AIP exchange
3 ARTAT - Partnership and Application Context
The experimental application will start with three
repositories:
- The ICCU institutional repository, named MAGTECA,
which is the underlying archive of the Italian
national Digital Library Portal and
Cultural-Tourist Network (www.internetculturale.it).
It contains more than 2,000,000 digitized
images, with the corresponding metadata
consolidated in more than 29,000 documents. The
metadata framework is encoded in MAG
(http://www.iccu.sbn.it/genera.jsp?id=267).
- Magazzini Digitali, a project undertaken by
Fondazione Rinascimento Digitale and the National
Library of Florence
(http://www.rinascimento-digitale.it/index.php?SEZ=28).
The selected objects from the repository are
doctoral theses, which the repository harvests
from the institutional repositories of Italian
universities. The metadata framework is encoded
in MPEG21-DIDL
(http://www.chiariglione.org/mpeg/standards/mpeg-21/mpeg-21.htm).
- The digital repository of the Library Archive
of the British School at Rome, which holds digital
images of items from its collections of historic
photographs, prints and maps. The digitized
collections comprise around 40,000 images with
more than 13,900 metadata documents
(http://digitalcollections.bsrome.it/). The
metadata framework is encoded in METS
(www.loc.gov/standards/mets).
4 MAG
Legend: Mandatory, Optional, R = Repeatable
<metadigit>
<gen>
<bib>
<dis>
5 MPEG21-DIDL
container
item
descriptor
item
descriptor
component
descriptor
component
descriptor
resource
resource
component
descriptor
resource
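The nesting named on this slide (container, item, descriptor, component, resource) can be illustrated with a minimal sketch. It is not the Magazzini Digitali profile: the namespace URI, the capitalized element names, the `Statement` wrapper and the `mimeType`/`ref` attributes are assumptions based on the general MPEG-21 DIDL schema, and the helper names `q`, `add_descriptor` and `build_didl` are hypothetical. The two file references are the ones quoted in the PML example later in this deck.

```python
import xml.etree.ElementTree as ET

# Assumed DIDL namespace URI; check it against the schema actually in use.
DIDL_NS = "urn:mpeg:mpeg21:2002:02-DIDL-NS"
ET.register_namespace("didl", DIDL_NS)


def q(tag):
    """Return the namespace-qualified name of a DIDL element."""
    return f"{{{DIDL_NS}}}{tag}"


def add_descriptor(parent, text):
    """Attach a plain-text descriptor to an item or component."""
    descriptor = ET.SubElement(parent, q("Descriptor"))
    statement = ET.SubElement(descriptor, q("Statement"), {"mimeType": "text/plain"})
    statement.text = text
    return descriptor


def build_didl(content_ref, metadata_ref):
    """Build container > item > component > resource for one thesis."""
    root = ET.Element(q("DIDL"))
    container = ET.SubElement(root, q("Container"))
    item = ET.SubElement(container, q("Item"))
    add_descriptor(item, "doctoral thesis")
    parts = (
        ("content", content_ref, "application/pdf"),
        ("metadata", metadata_ref, "application/xml"),
    )
    for label, ref, mime in parts:
        component = ET.SubElement(item, q("Component"))
        add_descriptor(component, label)
        ET.SubElement(component, q("Resource"), {"mimeType": mime, "ref": ref})
    return root


didl = build_didl(
    "10077/2707/1/tesi.pdf",
    "b3b37810-3b73-4ac7-bc42-5754d64cbc03.xml",
)
print(ET.tostring(didl, encoding="unicode"))
```

Each component carries its own descriptor, so the content file and its metadata file stay separately addressable inside the same item.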
6 METS
7 ARTAT - Approach
Participants Inquiry Phase
PML Production Phase
Inquiry phase
PREMIS Data Dictionary
Repository Preservation Metadata Mapping
Semantic Units Roadmap
Getting information
- types of objects,
- management procedures
- preservation metadata status
Participating Repository Information Questionnaire
Preservation Metadata Status Questionnaire
Repository PML Model
Repository
Design Development Software Components
Getting outcomes
- Learning by doing the concepts and definitions that belong to the knowledge domain of preservation metadata management
- Improving the capacity for, and quality of, communication about digital preservation across the partner network, by adopting common semantics
- Self-documenting project backgrounds and developments
Repository Preservation Status Survey
PML
Participating Repository Documentation
PREMIS Implementation Documentation
PML Repository Documentation
ARTAT Outcomes Documentation
8 ARTAT - Approach
Inquiry phase
Repository information questionnaire
Ref. no. | File type | File extension | File name example | Total files | Total size | Storage system
BNCF_1 | Object file | pdf | 187263.pdf | 200 | 1200 MB | DB
BNCF_1 | XML file | xml | 187263.xml | 150 | 1.2 MB | DB
BNCF_2 | Object file - master | tif | 43572.tif | 300 | 20794 MB | File System
BNCF_2 | Object file - derivative | jpg | 43572.jpg | 300 | | DB
BNCF_2 | XML file | xml | 43572.xml | 91 | | DB
BNCF_3 | Object file - master | tif | wp723639.tif | 100 | | File System
BNCF_3 | Object file - derivative | jpg | wp723639.jpg | 100 | | DB
BNCF_3 | XML file | xml | wp723639.jpg | | | DB
9 ARTAT - Approach
Inquiry phase
Repository information questionnaire
Institution | Metadata standard type | Standard name | Version
ICCU | Wrapper | MAG | 2.01
ICCU | Descriptive | DC | simple
ICCU | Technical | MIX | 0.1 draft
BNCF | Wrapper | MPEG21-DIDL |
BNCF | Descriptive | DC | simple
BNCF | Technical | MIX (Jhove) |
BSR | Wrapper | METS | 1.4
BSR | Descriptive | MODS | 3.3
BSR | Technical | MIX (Jhove) | 1.0
10 ARTAT - Approach
Inquiry phase
Repository information questionnaire
OCLC/RLG PREMIS Working Group, 2004. Implementing Preservation Repositories for Digital Materials: Current Practice and Emerging Trends in the Cultural Heritage Community. Report by the joint OCLC/RLG Working Group Preservation Metadata: Implementation Strategies (PREMIS). Dublin, O.: OCLC Online Computer Library Center, Inc. http://www.oclc.org/research/projects/pmwg/surveyreport.pdf
4.3. How are metadata and materials stored within the preservation repository? For example, one repository might zip metadata together with content files and store the zip file as a single entity. Another repository might store metadata in relational database tables and content files as individual entities in a file system. A third repository might use multiple
METADATA
What categories of metadata are (or will be) stored by and used by your preservation repository? Please check all that apply.
__ rights and permissions
__ provenance (document history)
__ technical metadata
__ administrative and management information
__ bibliographic/descriptive
__ structural metadata
__ other
If you are using or planning to use metadata elements from one or more published schemes, which schemes are you using? Please check all that apply.
Does your repository record information about these types of entities? Please check all that apply. Describe the sort of metadata that is (or will be) recorded about each of these entities, giving a few specific metadata elements as examples.
__ collection
__ logical object such as a book or photograph
__ non-digital source object
__ file
__ bitstream (a bitstream may be equivalent to a file, a subset of a file such as a binary object embedded in a PDF, or greater than a file such as a digital video stored in three parts)
__ metadata
__ other
How is metadata obtained (or expected to be obtained) by the preservation repository? For example, is it submitted by depositors, extracted automatically by the repository's computer programs, other? If different methods are used for different sets of metadata, please note all of them.
5.6. How is metadata stored and updated in your preservation repository? If multiple methods are used, please explain.
__ in a relational database
__ in an XML database
__ in an object-oriented database
__ in a proprietary database or format
__ in flat files
__ bundled with related content files
11 ARTAT - Approach
Inquiry phase
Preservation Metadata Status questionnaire
Woodyard-Robinson, D., 2007. Implementing the PREMIS data dictionary: a survey of approaches. The PREMIS Maintenance Activity sponsored by the Library of Congress, 4 June 2007. http://www.loc.gov/standards/premis/implementation-report-woodyard.pdf
ARTAT PREMIS Semantic Unit Roadmap
12 Preservation Metadata Layer (PML)
Requirements
- PREMIS conformance requirement
- Comprehensiveness requirement
- PML independence from AIPs
Repository's AIP
- metadata file/s (Metafile.xml): descriptive, structural, administrative, technical
- object/s
PML
- metadata file/s: structural, administrative, technical
13 Preservation Metadata Layer (PML)
Structure
Repository's AIP
- metadata file/s (Metafile.xml): descriptive, structural, administrative, technical
- object/s
PML
- metadata file/s: structural, administrative, technical
The PML describes the content of the AIPs technically and structurally.
It populates a framework of PREMIS semantic units from the repository's preservation metadata and from other sources of information identified during the inquiry phase.
The target is not only the objects but also the metadata that describes the AIP.
14 Preservation Metadata Layer (PML)
Repository's AIP
- metadata file/s (Metafile.xml): descriptive, structural, administrative, technical
- object/s
The part describing the AIP metadata, in other words the meta-metadata, is considered the core of the PML.

objectIdentifier: metafile.xml
objectCategory
objectCharacteristics
significantProperties: coded=METS
significantProperties: content=MODS
significantProperties: content=MIX
storage
relationship: description, describing img01.xml
linkingEventIdentifier: artat_e000001

objectIdentifier: img01.jpg
objectCategory
objectCharacteristics
storage
relationship: description, described by metafile.xml
linkingEventIdentifier: artat_e000001

eventIdentifier: artat_e000001
eventType: PML production
eventDateTime
linkingAgentIdentifier: ARTAT-SW
linkingObjectIdentifier: metafile.xml
linkingObjectIdentifier: img01.jpg
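The cross-references on this slide (two object entries that point at each other and at one shared event) can be sketched as follows. This is a simplified illustration rather than the ARTAT software or schema-valid PREMIS: the flat element layout, the `type`/`subType` attributes, the `pml` root element, the `file` category value, the `=` separators in the property values and the function names are all assumptions. Only the semantic unit names and the identifier values come from the slide.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone


def premis_object(identifier, relation, related_to, event_id, properties=()):
    """Build a simplified object entry (not schema-valid PREMIS)."""
    obj = ET.Element("object")
    ET.SubElement(obj, "objectIdentifier").text = identifier
    ET.SubElement(obj, "objectCategory").text = "file"  # assumed value
    for prop in properties:
        ET.SubElement(obj, "significantProperties").text = prop
    rel = ET.SubElement(
        obj, "relationship", {"type": "description", "subType": relation}
    )
    rel.text = related_to
    ET.SubElement(obj, "linkingEventIdentifier").text = event_id
    return obj


def premis_event(event_id, agent, object_ids):
    """Build the event entry that links back to every object it touched."""
    event = ET.Element("event")
    ET.SubElement(event, "eventIdentifier").text = event_id
    ET.SubElement(event, "eventType").text = "PML production"
    now = datetime.now(timezone.utc).isoformat()
    ET.SubElement(event, "eventDateTime").text = now
    ET.SubElement(event, "linkingAgentIdentifier").text = agent
    for object_id in object_ids:
        ET.SubElement(event, "linkingObjectIdentifier").text = object_id
    return event


def build_pml(metadata_file, content_file, event_id="artat_e000001"):
    """Describe one metadata file, one content file and their shared event."""
    pml = ET.Element("pml")  # hypothetical wrapper element
    core_props = ("coded=METS", "content=MODS", "content=MIX")
    pml.append(
        premis_object(metadata_file, "describing", content_file, event_id, core_props)
    )
    pml.append(premis_object(content_file, "described by", metadata_file, event_id))
    pml.append(premis_event(event_id, "ARTAT-SW", [metadata_file, content_file]))
    return pml


pml = build_pml("metafile.xml", "img01.jpg")
print(ET.tostring(pml, encoding="unicode"))
```

Keeping the event identifier on both objects and both object identifiers on the event makes the links navigable in either direction.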
15 Preservation Metadata Layer (PML)

objectIdentifier: b3b37810-3b73-4ac7-bc42-5754d64cbc03.xml
objectCategory
objectCharacteristics
significantProperties: coded=MPEG21-DIDL
significantProperties: content=DC
significantProperties: content=MIX
storage
relationship: description, describing 10077/2707/1/tesi.pdf
linkingEventIdentifier: artat_e000001
This is the creation of the PML core, which describes the metadata part of the AIPs: it is the meta-metadata.

eventIdentifier: artat_e000001
eventType: PML production
eventDateTime
linkingAgentIdentifier: ARTAT-SW
linkingObjectIdentifier: b3b37810-3b73-4ac7-bc42-5754d64cbc03.xml
linkingObjectIdentifier: 10077/2707/1/tesi.pdf
This is the creation of the event entity metadata, created by the ARTAT tools.

objectIdentifier: 10077/2707/1/tesi.pdf
objectCategory
objectCharacteristics
storage
relationship: description, described by b3b37810-3b73-4ac7-bc42-5754d64cbc03.xml
linkingEventIdentifier: artat_e000001
This is the duplication of technical metadata in PREMIS code.
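The duplication step mentioned above, copying existing technical metadata into PREMIS semantic units, might look like the sketch below. The source field names (`mimeType`, `formatVersion`, `fileSize`, `md5`), the mapping table and the function name are hypothetical; the slides do not say which fields ARTAT duplicates. The target names follow the PREMIS objectCharacteristics units (format, size, fixity).

```python
# Hypothetical source field -> path of PREMIS semantic units.
FIELD_MAP = {
    "mimeType": ("objectCharacteristics", "format", "formatDesignation", "formatName"),
    "formatVersion": ("objectCharacteristics", "format", "formatDesignation", "formatVersion"),
    "fileSize": ("objectCharacteristics", "size"),
    "md5": ("objectCharacteristics", "fixity", "messageDigest"),
}


def duplicate_technical_metadata(source):
    """Copy mapped fields into a nested PREMIS-style dict.

    Returns the nested dict and the list of source fields with no mapping,
    so unmapped fields are reported instead of silently dropped.
    """
    premis = {}
    unmapped = []
    for name, value in source.items():
        path = FIELD_MAP.get(name)
        if path is None:
            unmapped.append(name)
            continue
        node = premis
        for unit in path[:-1]:
            node = node.setdefault(unit, {})
        node[path[-1]] = value
    if "md5" in source:
        # A digest is only meaningful together with its algorithm.
        premis["objectCharacteristics"]["fixity"]["messageDigestAlgorithm"] = "MD5"
    return premis, unmapped


# Placeholder values for illustration only.
source = {
    "mimeType": "application/pdf",
    "fileSize": 1024,
    "md5": "0" * 32,
    "pageCount": 10,
}
premis, unmapped = duplicate_technical_metadata(source)
print(premis)
print("not mapped:", unmapped)
```

Returning the unmapped fields makes it visible which parts of the repository's technical metadata stay outside the PML.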
16 Translation In Translation Out
STRUCTURAL METADATA
SELECTION
ADMINISTRATIVE METADATA
Participants Inquiry Phase
TECHNICAL METADATA
TECHNICAL METADATA
DUPLICATION
DESCRIPTION
INTEGRATION
CORE
17 A low impact digital objects lifecycle event
Alien AIP
low impact event
search objects
objects type
Event management
get objects
event detail
We can see that normal AIP management affects
only the PML. In general, we can expect the
preservation metadata to grow every time an
event happens
objects metadata
update metadata
updated metadata
CORE
PREMIS subset
redundant metadata
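A low impact event, as described on this slide, adds to the PML while the AIP itself stays untouched. The sketch below illustrates that idea under stated assumptions: the `PreservationLayer` class, its fields and the `fixity check` event type are hypothetical, and the AIP is modelled as a read-only mapping purely to show that recording the event never writes to it. The identifier pattern follows the `artat_e000001` value shown earlier in the deck.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from types import MappingProxyType


@dataclass
class PreservationLayer:
    """Hypothetical in-memory PML: object links plus a growing event list."""

    links: dict  # object identifier -> identifiers of linked events
    events: list = field(default_factory=list)

    def record_event(self, event_type, object_ids, detail=""):
        """Append one event and link it to the objects it concerns."""
        unknown = [oid for oid in object_ids if oid not in self.links]
        if unknown:
            raise KeyError(f"objects not described by the PML: {unknown}")
        event_id = f"artat_e{len(self.events) + 1:06d}"
        self.events.append(
            {
                "eventIdentifier": event_id,
                "eventType": event_type,
                "eventDateTime": datetime.now(timezone.utc).isoformat(),
                "eventDetail": detail,
                "linkingObjectIdentifier": list(object_ids),
            }
        )
        for oid in object_ids:
            self.links[oid].append(event_id)
        return event_id


# The AIP is exposed read-only: the event is recorded in the PML alone.
aip = MappingProxyType({"metafile.xml": b"<mets/>", "img01.jpg": b"\xff\xd8"})
pml = PreservationLayer(links={name: [] for name in aip})
first = pml.record_event("fixity check", ["img01.jpg"], detail="checksum verified")
print(first, len(pml.events))
```

Every call appends one entry, which matches the observation that the preservation metadata grows with each event.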
Making the existing resources supplied with a
preservation framework
Angela Di Iorio
PREMIS - PRESERVATION METADATA IMPLEMENTATION
STRATEGIES
Accademia dei Lincei, Rome February 6th, 2009
18 A high impact digital objects lifecycle event
Event management
migration event
objects type
event detail
Alien AIP
?
new objects
create
search objects
objects metadata
get objects
new metadata connections to migrated objects
inherited metadata
CORE
A high impact event is one that involves not only
the PML but also the objects and/or the metadata
objects. In the latter case, the redundant
metadata will probably be involved as well, and
will be updated by checking the differences found
when the original AIP metadata is compared with
the metadata duplicated in the PML. Changing the
original AIP objects makes it necessary to decide
how a new AIP should be created and how it
should be structured.
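The comparison step described above, checking the AIP metadata against the copy duplicated in the PML and updating only what differs, can be sketched with plain dictionaries. The function names, the flat key/value layout and the sample values are assumptions made for illustration; the slides do not specify how ARTAT performs the comparison.

```python
_MISSING = object()  # marks a key that is absent on one side


def diff_metadata(aip_metadata, pml_duplicate):
    """Return {key: (value in PML copy, value in AIP)} for every difference."""
    changes = {}
    for key in sorted(set(aip_metadata) | set(pml_duplicate)):
        old = pml_duplicate.get(key, _MISSING)
        new = aip_metadata.get(key, _MISSING)
        if old != new:
            changes[key] = (old, new)
    return changes


def refresh_duplicate(aip_metadata, pml_duplicate):
    """Apply the differences to a copy of the PML duplicate."""
    changes = diff_metadata(aip_metadata, pml_duplicate)
    refreshed = dict(pml_duplicate)
    for key, (_old, new) in changes.items():
        if new is _MISSING:
            del refreshed[key]  # the field no longer exists in the AIP
        else:
            refreshed[key] = new
    return refreshed, changes


# Illustrative values: metadata before and after a format migration.
pml_copy = {"formatName": "image/tiff", "size": 2048, "compression": "none"}
migrated = {"formatName": "image/jpeg", "size": 512}
refreshed, changes = refresh_duplicate(migrated, pml_copy)
print(sorted(changes))
print(refreshed)
```

The returned `changes` mapping could also feed the event detail recorded for the migration, so the PML keeps a trace of what was updated.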
19 Thanks
Thank you for your kind attention. Time for questions.
Contact information:
Angela Di Iorio, Fondazione Rinascimento Digitale, Researcher, angeladiiorio at gmail dot com
Maurizio Lunghi, Fondazione Rinascimento Digitale, Scientific Director, lunghi at rinascimento-digitale dot it