How to express MARC in XML - PowerPoint PPT Presentation

1 / 15
About This Presentation
Title:

How to express MARC in XML

Description:

Liv Aasa Holm, JBI-HIO, Norway; Christer Larsson, The Royal ... vs. XML is (just) fashionable ? Useful ! more flexible syntax, i.e. has more 'expressive power' ... – PowerPoint PPT presentation

Number of Views:61
Avg rating:3.0/5.0
Slides: 16
Provided by: irinaoberl
Category:

less

Transcript and Presenter's Notes

Title: How to express MARC in XML


1
How to express MARC in XML
  • ELAG 2004. Workshop 10 Report

2
Participants
  • Liv Aasa Holm, JBI-HIO, Norway
  • Christer Larsson, The Royal Library, LIBRIS
    Department, Sweden
  • Dan Matei, CIMEC - Institute for Cultural Memory,
    Romania
  • Anne Munkebyaune, BIBSYS, Norway
  • Mona-Lise Pedersen, BIBSYS, Norway
  • Nils Pharo, Oslo University College, Norway.

3
Why XML ?
  • XML is (really) useful ? vs. XML is (just)
    fashionable ? Useful !
  • more flexible syntax, i.e. has more expressive
    power
  • it allows more (and finer) syntactic constraints
  • it allows (unrestricted) hierarchies in a record
  • it is here to stay (?)
  • a lot of tools available.

4
Problems with MARCs
  • not too flexible (too flexible ! Ole)
  • only 2 (or 3 ?) hierarchical levels
  • some tags express two things the nature of the
    related entity and the kind of relationship with
    the record
  • 11 principle not observed (i.e. the records are
    not normalized) but they are self-contained
    !

5
Aim to devise an XML-based bibliographic format
  • Approach A mechanically express MARC in XML.
    Already done (by LC) marcxml, see
    www.loc.gov/standards/marcxml/
  • Approach B consider the synthesis of authority,
    bibliographic and holdings MARCs in a new format
    (i.e. a bibliographic MARC-up language ?),
    lets code-name-it MARCX (not Karl..., not ...
    Brothers !).

6
Use cases
  • A. internal (database) format
  • B. transportation (serialization) format
    transport scenarios
  • to/from union catalogues normalized files,
    i.e. records of instances of base entities
    records for their relationships
  • for presentation (i.e. display) un-normalized,
    self-contained (MARC-like) records
  • for ... something else (?) records with FRBR
    families of bibliographic objects, e.g. works
    with their expressions.

7
Integrating framework FRBR
  • schema (and/or DTD) including types for
  • works
  • expressions
  • manifestations
  • items
  • persons
  • ...
  • concepts
  • subject headings
  • relationships.

8
Relationships identifiers
  • need for unique identifiers within a file
  • need for global unique identifiers
  • need for large amounts of unique identifiers,
    i.e. automatic generation
  • options URIs, GUIDs Global Unique Identifiers.

9
Relationships options
  • reified (Topic Maps like)
  • ltrelation typesometypegt
  • ltsourcegtid-slt/sourcegt
  • lttargetgtid-tlt/targetgt
  • lt/relationgt
  • within source
  • ltinstance ....gt
  • ...
  • ltrelation typesometypegtid-tlt/relationgt
  • ...
  • lt/instancegt

10
Relationships the type problem
  • the type as attribute
  • ltrelation typeauthorgtidlt/relationgt
  • vs.
  • the type as element
  • ltrelationgt
  • lttypegtauthorlt/typegt
  • lttargetgtidlt/targetgt
  • lt/relationgt
  • which is more convenient for ontology
    controlled types ?

11
Types/elements inner structure
  • to conserve the MARC blocks ? No !
  • to re-group data elements by their nature, e.g.
    title and notes on title
  • to use as many hierarchical level as necessary
    (but not more).

12
Types/elements general pattern
  • ltidentifiersgt
  • ...
  • lt/identifiersgt
  • ltdescriptiongt
  • ...
  • lt/descriptiongt
  • ltrelationsgt
  • ...
  • lt/relationsgt

13
Language independence (1)
  • for multilingual records
  • element localized text (ltext), with
    attributes
  • language
  • script
  • transliteration standard.
  • e.g.
  • ltltext langen scriptlatingt
  • What the hell is going on ?
  • lt/ltextgt

14
Language independence (2)
  • cataloguing rule areas in the language of the
    material
  • ltbibliographicDescription langengt
  • lttitle typepropergt
  • Romeo and Juliet
  • lt/titlegt
  • lttitle typeparallel langfrgt
  • Romeo et Juliette
  • lt/titlegt
  • lttitle typetranslated langdegt
  • Romeo und Julietta
  • lt/titlegt
  • lt/bibliographicDescriptiongt

15
Conclusions ?
  • To tag or not to tag ? To tag !
  • In MARCX
  • finer (and more controllable) granularity
  • less redundancy
  • more compact records
  • more human-readable records
  • lots of ready-made tools.
  • Another lingua franca ?
  • INTERMARC redivivus !
Write a Comment
User Comments (0)
About PowerShow.com