Title: OCLC Research
1OCLC Research
- Lorcan DempseyVP Research, OCLC
- February 2004(see next slide for where this
presentation was given)
2- Different versions of this presentation were
given at the following meetings - OCLC Australian Advisory CouncilMelbourne,
February 1, 2004 - National Library of Australia, Canberra,
February 6, 2004 - OCLC Members CouncilDublin, Ohio, February 9,
2004
3Overview
- MARC 21
- MARC-XML, MODS, Dublin Core, Onix, LOM
- EAD, TEI, DC, MARC
- METS, SCORM, DIDL,
- DDI, FGDC, ..
- MARC AMC, EAD, DC, RSLP
- OAIS, METS, OCLC/RLG,
- Z39.50, SRU/W, Xquery,
- SOAP, WSDL, UDDI,
- GIF, TIFF, PNG, JPEG,
- XML, RDF, DAMLOIL, ..
- DDC, LCSH, LCC, TGN, AAT,
- PURL, DOI, ISTC, URN, ERROL, POI,
- XRML, ODRL, ..
- ZTHES, VDEX, TIF, ..
4Research possibilities
- .. are endless!
- Becoming more complex as more activities enter a
network space. - Focus
- on maximizing impact of a limited resource.
- on where can make an internal and external
impact. - on making valuable work more visible
- on engaging external partners in useful
collaboration.
5Overview
6(No Transcript)
7Collection and user analysis
The idea of the balancedbut unreadcollection is
disappearing.
Librarians cannot change user behavior so they
need to meet the user.
- Change creates demand for better data.
- Growing interest in knowing more about
- Characteristics
- Gaps and overlaps
- Use
- Tuning collections based on data.
- Focus collection spending where creates most
value.
8OR objectives
- Support better management decisions by
- Making data work
- Exploring user behaviors.
9Some projects
- Characteristics of collections
- WorldCat
- CIC
- Compare ILL, circulation and holdings data.
- Last copy what is irreplaceable?
- ARL Global Resources.
- Exploring coverage of overseas titles in ARL
libraries.
- Large scale user behavior study
- IMLS project with OSU and OCLC
10Comparing CIC Collection Profiles
11(No Transcript)
12Content management
- Digital asset management a growing concern
- Cultural heritage, special collections,
- Learning objects
- Institutional repositories
- Issues
- Repository selection and interoperability
- Securing long term access to digital assets
13Content management
- Digital preservation
- Economics of digital preservation
- Consensus making OCLC/RLG working groups
- Preservation metadata (PREMIS)
- Repository architectures
- Contributions to Dspace codebase to support its
interoperability - OAI
- SRW
- Reference models
- IMS repository interoperability
14(No Transcript)
15System and service architecture
- The library systems environment is getting more
complex - ILS
- Digital asset management
- Resolution
- Portal
- Resource sharing
- License management
- Auth
- Build, buy, opensource?
- Integration
- Integrated workflow
- Portal
- Cataloging
-
16Unplug and play
- In this model business functionality is unplugged
from large integrated centralized workflow
applications so that it can be more readily
integrated inside local applications and
workflow. In this way, functionality
potentially reaches a wider audience, provides
more value to existing audiences, and extends the
life of legacy applications.
17OR objectives
- Investigate new ways of structuring and viewing
WorldCat and associated knowledge structures - Exploit emerging technologies, open standards and
protocols to prototype new services
18Some projects
- Unplug and play
- Metadata schema transformation
- E-prints UK
- Terminology services
- Name authority services
- XISBN
- Text searching
- Fast searching on Beowulf clusters
- Harvesting
- NDLTD Union Catalog
19Metadata schema transformation
Crosswalk repository client
A metadata crosswalk
Web services layer
A record
Record translation client
Metadata schema translator
A transformed record
20(No Transcript)
21xISBN
- An experimental web service
- Give it an ISBN, it returns all related ISBNs
- Based on WorldCat
- Designed for machine-to-machine data exchange
- Examples
- Check user ILL requests against all
editions/versions in OPAC - Find librarys editions when user finds any
edition/version of item on Amazon - Check OPAC for all editions during
selection/acquisitions/gift book processing
22Searching for the book on Amazon
23LibraryLookup bookmarklet
Is the book at my library?
http//www.amazon.co.uk/exec/obidos/ASIN/186046495
5/qid1075134526/sr1-1/refsr_1_10_1/202-6426661-
8213436
24xISBN bookmarklet
Is the book at my library?
xISBN
http//www.amazon.co.uk/exec/obidos/ASIN/186046495
5/qid1075134526/sr1-1/refsr_1_10_1/202-6426661-
8213436
ADDED
ADDED
ADDED
ADDED
ADDED
25(No Transcript)
26(No Transcript)
27Knowledge organization and semantic web
"The Semantic Web is an extension of the current
web in which information is given well-defined
meaning, better enabling computers and people to
work in cooperation." -- Tim Berners-Lee, James
Hendler, Ora Lassila, The Semantic Web,
Scientific American, May 2001
28Mmmm.
29OR objectives
- To release the value of the historical library
investment in controlled vocabularies and
knowledge structures - Redeploy tools for accessing or assigning names,
subjects, and classification numbers - Make knowledge organization services more
accessible.
30Projects
- FAST
- Terminology services
- FRBR
- Automatic classification
- VIAF Virtual International Authority File
- Library of Congress, Die Deutsche Bibliothek
31FAST Geographic Search by Area
Avalon Lake Bellaire, Lake
Charlevoix, Lake Fletcher Pond Munro
Lake Ocqueoc Lake
Bar 1 Bay 5 Bridge
1 Channel 2 Civil 23 Forest
4 Island 4 Lake 6 Park
10 Ppl 92 Stream 10
32Knowledge org systems
- Plethora of vocabularies
- Incompatible approaches to encoding
- Few connections
- Education
- GEM Subjects, ERIC Thesaurus, LCSH, CIP
(Classification of instructional programs) - Cultural Heritage
- AAT, Thesaurus for Graphic Materials (TGM)
Subjects Genre Terms - Not built for the web
- Link to concepts
33Terminology servicesWebulating knowledge
organization
- The goal of this project is to offer accessible,
modular, web-based terminology services. - Make vocabularies more available for
- Metadata creation
- Searching
-
- Refine and extend mappings
- Represent vocabularies in major encoding
standards, e.g., MARC, Zthes, TIF - Prototype custom web services as appropriate
342.6 million fiction records from Worldcat,
clustered by OCLCs FRBR algorithm Make greater
use of data (genres, settings, imaginary
characters, etc)
35Work display
36Work/expression display
37Work/expression/manifestation
38(No Transcript)
39Interoperability
- Extract maximum value from investment in
- Metadata
- Content
- Services
- By ensuring that they are
- Sharable
- Reusable
- Recombinable
40OR objectives
- Provide leadership in Internet and information
standardization - Help to raise the visibility of the values and
value of librarianship
41Some examples
- Dublin Core
- Central to library, cultural heritage and related
communities. - Harvested data OAI
- 8 Governments
- Corporations and NGOs
- Protocols
- Z39.50, SRW/U, OAI, Zthes
- Identifiers
- INFO URI, PURL
- Registries
- DCMI, OpenURL, Info URI
- Everywhere !
Cliff Lynch on Info URI it represents an
important new step in collaboration ACROSS
standards organizations, and I think the work
is of real importance to the CNI community.
42(No Transcript)