wliu@libnet.sh.cn

1 / 108
About This Presentation
Title:

wliu@libnet.sh.cn

Description:

Title: Author: keven Last modified by: keven Created Date: 4/9/2005 10:52:24 PM Document presentation format: – PowerPoint PPT presentation

Number of Views:4
Avg rating:3.0/5.0
Slides: 109
Provided by: Kev59

less

Transcript and Presenter's Notes

Title: wliu@libnet.sh.cn


1
???????
???????????? 2005 ??
  • ??
  • wliu_at_libnet.sh.cn

2
????????????
  • ???????
  • ??????
  • ??????????????????
  • OpenURL??
  • Z39.50 ????
  • OAI????????

3
??????
  • ???????
  • ??????
  • ???????
  • ????,????(Harvesting)???,?OAI
  • ???????
  • ???????????????????,?Z39.50, SDLIP, OpenURL
  • ??????????,??????????????????

4
????????
  • ??????
  • Abstracting/Indexing Services
  • Publishers (e.g. Elsevier ScienceDirect)
    important to be able to pull togther
  • Aggregators
  • Electronic reserves
  • Local databases including digitization projects
  • OAI resources
  • Institutional repositories

5
??????
6
????
  • ????????????
  • ????????????????
  • ??????????????
  • ??????????????????
  • ?Silver Platters SilverLinker
  • ????????????,?????Access Denied???

7
????
8
????
  • on the fly
  • ??????????,???????URL
  • ??????????metadata???
  • Metadata??????????????
  • link-to
  • ???????????????URLs
  • deeper link

9
How to interlinking
  • ????????
  • OpenURL
  • ??????
  • OAI????????????
  • (Open Archives Initiative Protocol for
    Metadata Harvesting)
  • ?????????(??)???????

10
DOI - CrossRef
DOI
????
11
DOI - CrossRef
????
DOI
http//dx.doi.org/10.1093/toxsci/57.1.95
12
OpenURL
  • ?Web????????
  • ??
  • ???Web??????????????
  • ?????????? (Tag),??Web??????
  • ??
  • ??????INTERNET??????????
  • ??????(Target)???????????(Source)???????
  • ??????(Source)????????(Target)???????????

13
OpenURL
14
OpenURL
  • ????
  • ????????Internet?CGI?????HTTP GET?HTTP POST??
  • ???OPENURL??????
  • BASE-URL??????OPENURL???????,?http//www.sfx.co.i
    l/sfxmenu?
  • DESCRIPTION??????????????metadata?????
  • ??metadata?????????

15
OpenURL BASE URL QUERY
  • ??Metadata(Value),?ISSN?ISBN???URL
  • http//server.lanl.gov/menu?genrearticleissn123
    45678volume12issue3spage1epage8date1998
    aulastSmithaufirstPaul
  • ????????????????-DOI
  • http//server.lanl.gov/menu?iddoi10.1000/6382-1

16
??OpenURL?? ????????
17
?SFX???????? ???
18
SFX
  • ?OpenURL?????????????
  • ??????????????
  • ?????WEB???????????
  • ????????????OPAC?WEB??
  • ?????????
  • ??SFX?????
  • ??SFX???????

19
SFX Server
  • ???(source)???????????
  • ?Medline?Web of science?OPAC
  • ??SFX?hooks??,???????????
  • ?????(service)????OpenURL???
  • ????????metadata,???????
  • ??????????????
  • ??(targets)?????????
  • ???????OPAC?ILL????????????

20
Sourse Service - Target
Service
Source
Target
21
DOI CrossRef OpenURL - SFX
????
DOI
http//dx.doi.org/10.1093/toxsci/57.1.95
22
??????
23
??WOS???
http//sfxserver.uni.edu/sfxmenu?sidProvider2AI1
pid987641
24
Z39.50??? ??
  • Z39.50????? ????--???????????
  • Z39.50????????????, ?????????????????????????

25
????????????
  • ???????????????????????????
  • ??????????/?????????????
  • ?????????????????????

26
Z39.50?????? ????
  • ? ?
  • Z39.50--1988
  • Z39.50--1992
  • Z39.50--1995
  • ???????
  • Z39.50???
  • Z39.50???
  • Z39.50???????

27
?????????
  • Z39.50??????????/???????????
  • ?Z39.50?????,???????????????????????(Origin)???
    ?(Target)?
  • ???????? ?????????????????,??????????

28
Z39.50??????
????
???
???
??
??
????
?????
??
??
??
???
Z-server
????
29
??Z39.50????????????
??????????????
Z39.50????
BER??
TCP/IP?????
30
????????
  • Z39.50??????????????????
  • ????????????,????????????
  • ????????,???????????????
  • ????????,????????????,?????????????????????

31
Z39.50???????
  • ???? ????
  • ??? ???
  • ?? ??
  • ?? ?????
  • ????? ??
  • ?? ??
  • ?? ??
  • ???? ????
  • ??????? ????????????????
  • ?? (?????????????)
  • ???? ????
  • ?? ??

32
Z39.50????????
  • ??
  • ??/??????
  • ????
  • ???????
  • ????
  • ??????
  • ??????????????????????
  • ???????
  • ????

33
Z39.50???
Z39.50???????????
  • Scan(??)
  • Sort(??)
  • Resource-report(????)
  • Extended-services(????)?
  • Init(???)
  • Search(??)
  • Present(??)
  • Delete(??)

34
?????
  • ?Z39.50?????,bib-1????????,??6????????
  • ???????????????????
  • ?????????????????????????
  • ????????????????????????
  • ????????????????????????
  • ??????????????????????????????
  • ????????????

35
??????
  • ???????????(Simple Unstructured Text Record
    Syntax,??SUTRS)?
  • ??????(Generic Record Syntax,??GRS-1)?
  • ????????(OPAC Record Syntax)?
  • ??????(Summary Record Syntax,??Summary)?
  • ??????(EXtensible Markup Language,??XML)?

36
???????????
  • Z39.50??????????????,????????????????????(object
    identifier number)??????Z39.50???????????

Ausmarc Canmarc CCF Danmarc
Finmarc Ibermarc Intermarc Librismarc
MAB Normarc Picamarc SBN
UKmarc Unimarc USmarc
37
??Profile(????)
  • ??Z39.50??????????,????????????????profile(????)??
    ????????????????Bath Profile,?????????????????????
    ??????
  • ??????????????
  • ?????????
  • ?????????
  • ???

38
  • (???)
  • ????????????????????,????????????????????????
  • ?????????????????????,??????????????????????
  • ????????????
  • ??Bath?????????????,????????????????
  • ?????????????????????????????,???????????????????

39
????????????????
??? ?????? ???
???? ????
?????? ??????
?????? ??????
?????? ??????
?????? ??????
?????? ??????
???????? ????????
?????? ??????
???? ????
40
Z39.50?????
  • Z39.50??????????????,???????????????????????
  • ???(Initialization)
  • ??(Search)
  • ??(Present)
  • ??????Z39.50???,???????????????

41
?????????
  • ???????????????????????,????
  • ???????????
  • ??????????????????????
  • ???????????????

42
????????
  • ??????????????
  • ?????????????????????????????,?????????????(?????
    ????????)????
  • ????????????(Profiles)?????????????????,??????????
    ????????????????

43
????????
  • ???????????????????????????????,????
  • ?????????,??????????,????????????
  • ?????????,???????????????,??????????????

44
Z????
45
SRW/U
  • SRW
  • Search and Retrieve Web Service
  • SRU
  • Search and Retrieve URL Service
  • SRW/U
  • Search and Retrieve for the Web

46
ZING
ZINGZ39.50 (international) Next Generation
srw
sru
47
ZING
srw
sru
cql
Common Query Language
48
ZING
srw
sru
cql
49
ZING
srw
sru
zeeRex
cql
ez3950
ZOOM
50
ZING
srw
sru
zeeRex
cql
ez3950
ZOOM
51
ZING
Z39.50 Explain Explained and Re-engineered in XML
srw
sru
zeeRex
52
ZING
srw
sru
zeeRex
Z39.50
53
ZING
srw
sru
zeeRex
54
ZING
Z39.50 Object Oriented Model
srw
sru
ZOOM
55
ZING
srw
sru
ZOOM
Z39.50
56
ZING
srw
sru
ZOOM
57
ZING
srw
sru
58
SRW/U Features
  • SOAP and URL-based access
  • Common Query Language -- CQL
  • Retains several Z39.50 abstractions.
  • .. And discards a few

59
SRW/U Features
  • SOAP and URL-based access
  • Common Query Language -- CQL
  • Retains several Z39.50 abstractions.
  • .. And discards a few

60
SRW/U Features
  • SOAP and URL-based access
  • Common Query Language -- CQL
  • Retains several Z39.50 abstractions .
  • .. And discards a few

61
SRW/U Features
  • SOAP and URL-based access
  • Common Query Language -- CQL
  • Retains several Z39.50 abstractions .
  • .. And discards a few

62
SRW/U retains these Z39.50 concepts.
  • result sets
  • abstract access points
  • abstract record schemas
  • application level diagnostics
  • Explain

63
.. But differs from Z39.50 in these respects
  • Web-based
  • Connectionless
  • XML
  • Protocol (no ASN.1)
  • records (no record syntax)
  • CQL -- user-friendly query

64
The Weaknesses of Classic Z39.50
  • Not popular with the Web community
  • Connection-based Sessions
  • Binary Encoding
  • Transmitted directly over TCP/IP
  • Complicated

65
Z39.50 Complicated?
  • 11 Native Services
  • Init, Search, Present, DeleteResultSet,
    AccessControl, ResourceControl,
    TriggerResourceControl, ResourceReport, Scan,
    Sort, Segment.
  • 7 Extended Services
  • PersistentResultSet, PersistentQuery,
    PeriodicQuerySchedule, ItemOrder, DatabaseUpdate,
    ExportSpecification, ExportInvocation

66
Z39.50 Complicated?
  • 156 page standard
  • 147 in the 2002 draft
  • Vocabulary from the OSI community
  • Target, Origin, APDU, A-association,
    Z-association..
  • Records described with ASN.1
  • Attribute Sets

67
The Strengths of Classic Z39.50
  • Result Sets (a.k.a. Statefulness)
  • Abstraction
  • Abstract Access Points (Attribute Sets)
  • Abstract Record Schemas
  • Explain

68
SRW Search and Retrieve on the Web
  • SOAP (Simple Object Access Protocol) Based
  • HTTP
  • XML
  • Records Described in WSDL (Web Service
    Description Language)
  • 1 Service SearchAndRetrieve
  • No Documentation -)

69
Semantically equivalent to classic Z39.50
  • Gateways Trivial
  • Preserves the experience of the ZIG without the
    baggage of the standard
  • Syntactic equivalence discounted

70
SRW The Basics
  • Only one database per request
  • String (not structure) based queries
  • Index Sets, not Attribute Sets
  • One Record Syntax (XML)

71
The SRW Request
  • String Query
  • Integer StartRecord
  • Integer MaximumRecords
  • String RecordSchema

72
The SRW Response
  • String ResultSetReference
  • ResultSetName
  • ResultSetTimeToLive
  • Integer TotalHits
  • Records
  • Status (Status Code Diagnostic)

73
CQL Common Query Language
  • Loosely based on CCL Search
  • Boolean Proximity Operators
  • Index Sets Indexes
  • Truncation Characters , ?
  • Example
  • dc.titleharry potter or bib1.isbn123-456-78x

74
Explain Now Possible
  • Not practical for classic Z39.50
  • List of supported Schemas
  • List of supported Indexes

75
SRU Search and Retrieve with URLs
  • SRW Without the SOAP Wrapper
  • Intended for Thin Clients
  • Adds ResponseSchema Parameter
  • Example
  • http//deimos.oclc.org/SRW/etdcat?querynuclearma
    xRecords10

76
Outlook for SRW/SRU
  • A solution looking for a problem?
  • Not likely to be quickly adopted by the library
    community
  • Could show up first in other communities
    MetaSearch, Geospatial, Biodiversity, OAI(?)
  • Will be seen as a competitor with XML Query

77
Links
  • http//www.loc.gov/
  • LC Home Page
  • http//www.loc.gov/srw
  • SRW/U Home Page
  • http//www.loc.gov/cql
  • CQL Home Page
  • rden_at_loc.gov

78
OAI Open Archival Initiative
79
??
  • Open Archives Initiative Protocol for Metadata
    Harvesting,(OAI-PMH)??OAI??,??????????????????????
    ????,???????Web?????????????????????????
  • ????????????
  • ???????????
  • ????????????????

80
??
  • ?????????????HTTP??????,???metadata???
  • ??????????(Repository)????????????

81
????
82
????
  • DP????????????,??OAI????metadata?????????????????
    ???OAI????????metadata??????(Web???)?
  • SP???????OAI????????????,?????metadata??????????
  • ???(Repository)??HTTP,??OAI????????????????
  • ???(Set)???????????????????,?????????????????????
    ,?????????,???(node)????????,???????????????
  • ??(Record)??????????????OAI??,?????????XML???????
    ????

83
??????????
  • Header-????????????,???????
  • Unique identifier-????????????
  • Datestamp-?????????????????,???????????????
  • Metadata-??????metadata???
  • About-??????????????????,???????

84
????
  • ??OAI??1.0??,???OAI??????????Content-Type??,????te
    xt/xml?,?XML?????????Unicode?UTF-8????,?UTF-8??XML
    ?????,????8???Unicode?????,?XML????prolog????
  • lt?xml version1.0 encodingUTF-8?gt

85
????
  • definitions concepts
  • repository
  • record
  • identifier
  • datestamp
  • set
  • protocol features
  • HTTP encoding
  • metadata prefix schema
  • flow control
  • protocol requests
  • supporting requests
  • harvesting requests

86
Repository
87
Record
ltrecordgt ltheadergt ltidentifiergtoaieg001lt/ident
ifiergt ltdatestampgt1999-01-01lt/datestampgt lt/head
ergt ltmetadatagt ltdc xmlnshttp//purl.org/dcgt
lttitlegtMy Examplelt/titlegt lt/dcgt lt/metadatagt
ltaboutgt ltea xmlnshttp//www.arXiv.org/ea
ltusagegtNo restrictionslt/usagegt lt/eagt lt/aboutgtlt
/recordgt
88
Identifiers
oaincstrlncstrl.cornellcs/TR94-1418
locally unique key for extracting a record from a
repository
oai-identifier oaiarchive-identifierrecord-ide
ntifier
89
Selective Harvesting - Datestamps
90
Selective Harvesting - Sets
S2
91
Set Specifics
  • Repositories define hierarchical organization
  • Each item in a repository may be organized in one
    set, several sets, or no sets at all
  • Meaning of sets or of set hierarchy is not
    defined in protocol
  • Individual communities may formulate common set
    configurations

92
HTTP encoding - requests
BASE-URL -----------gt an.oa.org/OAI-scriptkeyword
arguments --gt verbListIdentiferssetS1
GET http//an.oa.org/OAI-script?verbListIdenti
ferssetS1
POST POST http//an.oa.org/OAI-script
HTTP/1.0 Content-Length 78 Content-Type
application/x-www-form-urlencoded
verbListIdentiferssetS1
93
HTTP encoding - responses
ltxml version1.0 encodingUTF-9
?gtltGetRecord xmlnshttp//oai.namespace.uri
xmlnsxsihttp//w3.namespace.uri xsischemaL
ocationhttp//oai.namespace.uri http//oai.sc
hemaURLgt ltresponseDategt2000-19-01T193030-0400
lt/responseDategt ltrequestURLgthttp//an.oa.org/OAI-
script?verbGetRecord ampidentifieroai3Aar
Xiv3A0001 ampmetadataPrefixoai_dclt/request
URLgt ltrecordgt record contents lt/record addit
ional recordslt/GetRecordgt
94
Metadata Prefix and Schema
  • Support for harvesting multiple metadata formats
  • Metadata schema each format must have a
    validating XML schema at a publicly accessible
    URL (communities may define shared formats and
    schema.
  • Metadata prefix each repository maps a prefix to
    the schema it supports, which is used in protocol
    requests.
  • Support for unqualified Dublin Core mandatory
  • Reserved schema URL at http//www.openarchives.org
    /OAI/dc.xsd
  • Reserved prefix oai_dc.

95
Flow Control
96
Flow Control Specifics
  • Applies to all protocol requests that return
    lists ListRecords, ListIdentifiers, ListSets
  • ResumptionToken is opaque
  • Semantics of partitioning of responses within
    resumption requests is undefined
  • Time-to-live of resumptionToken is not defined by
    the protocol

97
OAI harvesting tools
service provider
data provider
  • Supporting protocol requests
  • Identify
  • ListMetadataFormats
  • ListSets
  • Harvesting protocol requests
  • ListRecords
  • ListIdentifiers
  • GetRecord

98
Supporting Protocol Requests
service provider
data provider
Identify
  • Repository name
  • Base-URL
  • Admin e-mail
  • OAI protocol version
  • Description Container

99
Supporting Protocol Requests
service provider
data provider
ListMetadataFormats
  • REPEAT
  • Format prefix
  • Format XML schema
  • /REPEAT

100
Supporting Protocol Requests
service provider
data provider
ListSets
  • REPEAT
  • Set Specification
  • Set Name
  • /REPEAT

101
Harvesting Requests
service provider
data provider
froma
untilb
setklm ListRecords metadataPrefixoai_dc
  • REPEAT
  • Identifier
  • Datestamp
  • Metadata
  • About Container
  • /REPEAT

102
Harvesting Requests
service provider
data provider
froma

untilb ListIdentifiers setklm
  • REPEAT
  • Identifier
  • Datestamp
  • /REPEAT

103
Harvesting Requests
service provider
data provider
identifieroaimlib123a
GetRecord metadataPrefixoai_dc
  • Identifier
  • Datestamp
  • Metadata
  • About

104
?????????
105
?????????
106
?????????
107
Z39.50?OpenURL?OAI?????????
  • ????
  • ?????????????????
  • ????
  • Z39.50????MARC???,??????????
  • OpenURL??????????,???????????
  • OAI??????????????
  • ??????
  • OpenURL???????target,?????
  • OAI ????????,????????
  • ????
  • ??????????,?????????
  • ????????????,??
  • ?????????(?DOI)
  • ????????????
  • ????????(?CrossRef)
  • ???????(?SFX)

108
?????
  • ?????????????
  • ????????
  • ???????????
  • ???????????????????
  • DOI CrossRef?OpenURL SFX
  • OAI
  • Z39.50 ILL
Write a Comment
User Comments (0)