Title: wliu@libnet.sh.cn
1???????
???????????? 2005 ??
2????????????
- ???????
- ??????
- ??????????????????
- OpenURL??
- Z39.50 ????
- OAI????????
3??????
- ???????
- ??????
- ???????
- ????,????(Harvesting)???,?OAI
- ???????
- ???????????????????,?Z39.50, SDLIP, OpenURL
- ??????????,??????????????????
4????????
- ??????
- Abstracting/Indexing Services
- Publishers (e.g. Elsevier ScienceDirect)
important to be able to pull togther - Aggregators
- Electronic reserves
- Local databases including digitization projects
- OAI resources
- Institutional repositories
5??????
6????
- ????????????
- ????????????????
- ??????????????
- ??????????????????
- ?Silver Platters SilverLinker
- ????????????,?????Access Denied???
7????
8????
- on the fly
- ??????????,???????URL
- ??????????metadata???
- Metadata??????????????
- link-to
- ???????????????URLs
- deeper link
9How to interlinking
- ????????
- OpenURL
- ??????
- OAI????????????
- (Open Archives Initiative Protocol for
Metadata Harvesting) - ?????????(??)???????
10DOI - CrossRef
DOI
????
11DOI - CrossRef
????
DOI
http//dx.doi.org/10.1093/toxsci/57.1.95
12OpenURL
- ?Web????????
- ??
- ???Web??????????????
- ?????????? (Tag),??Web??????
- ??
- ??????INTERNET??????????
- ??????(Target)???????????(Source)???????
- ??????(Source)????????(Target)???????????
13OpenURL
14OpenURL
- ????
- ????????Internet?CGI?????HTTP GET?HTTP POST??
- ???OPENURL??????
- BASE-URL??????OPENURL???????,?http//www.sfx.co.i
l/sfxmenu? - DESCRIPTION??????????????metadata?????
- ??metadata?????????
15OpenURL BASE URL QUERY
- ??Metadata(Value),?ISSN?ISBN???URL
- http//server.lanl.gov/menu?genrearticleissn123
45678volume12issue3spage1epage8date1998
aulastSmithaufirstPaul - ????????????????-DOI
- http//server.lanl.gov/menu?iddoi10.1000/6382-1
16??OpenURL?? ????????
17?SFX???????? ???
18SFX
- ?OpenURL?????????????
- ??????????????
- ?????WEB???????????
- ????????????OPAC?WEB??
- ?????????
- ??SFX?????
- ??SFX???????
19SFX Server
- ???(source)???????????
- ?Medline?Web of science?OPAC
- ??SFX?hooks??,???????????
- ?????(service)????OpenURL???
- ????????metadata,???????
- ??????????????
- ??(targets)?????????
- ???????OPAC?ILL????????????
20Sourse Service - Target
Service
Source
Target
21DOI CrossRef OpenURL - SFX
????
DOI
http//dx.doi.org/10.1093/toxsci/57.1.95
22??????
23??WOS???
http//sfxserver.uni.edu/sfxmenu?sidProvider2AI1
pid987641
24Z39.50??? ??
- Z39.50????? ????--???????????
- Z39.50????????????, ?????????????????????????
25????????????
- ???????????????????????????
- ??????????/?????????????
- ?????????????????????
26Z39.50?????? ????
- ? ?
- Z39.50--1988
- Z39.50--1992
- Z39.50--1995
- ???????
- Z39.50???
- Z39.50???
- Z39.50???????
27?????????
- Z39.50??????????/???????????
- ?Z39.50?????,???????????????????????(Origin)???
?(Target)? - ???????? ?????????????????,??????????
28Z39.50??????
????
???
???
??
??
????
?????
??
??
??
???
Z-server
????
29??Z39.50????????????
??????????????
Z39.50????
BER??
TCP/IP?????
30????????
- Z39.50??????????????????
- ????????????,????????????
- ????????,???????????????
- ????????,????????????,?????????????????????
31Z39.50???????
- ???? ????
- ??? ???
- ?? ??
- ?? ?????
- ????? ??
- ?? ??
- ?? ??
- ???? ????
- ??????? ????????????????
- ?? (?????????????)
- ???? ????
- ?? ??
32Z39.50????????
- ??
- ??/??????
- ????
- ???????
- ????
- ??????
- ??????????????????????
- ???????
- ????
33Z39.50???
Z39.50???????????
- Scan(??)
- Sort(??)
- Resource-report(????)
- Extended-services(????)?
- Init(???)
- Search(??)
- Present(??)
- Delete(??)
34?????
- ?Z39.50?????,bib-1????????,??6????????
- ???????????????????
- ?????????????????????????
- ????????????????????????
- ????????????????????????
- ??????????????????????????????
- ????????????
35??????
- ???????????(Simple Unstructured Text Record
Syntax,??SUTRS)? - ??????(Generic Record Syntax,??GRS-1)?
- ????????(OPAC Record Syntax)?
- ??????(Summary Record Syntax,??Summary)?
- ??????(EXtensible Markup Language,??XML)?
36???????????
- Z39.50??????????????,????????????????????(object
identifier number)??????Z39.50???????????
Ausmarc Canmarc CCF Danmarc
Finmarc Ibermarc Intermarc Librismarc
MAB Normarc Picamarc SBN
UKmarc Unimarc USmarc
37??Profile(????)
- ??Z39.50??????????,????????????????profile(????)??
????????????????Bath Profile,?????????????????????
?????? - ??????????????
- ?????????
- ?????????
- ???
38- (???)
- ????????????????????,????????????????????????
- ?????????????????????,??????????????????????
- ????????????
- ??Bath?????????????,????????????????
- ?????????????????????????????,???????????????????
39????????????????
??? ?????? ???
???? ????
?????? ??????
?????? ??????
?????? ??????
?????? ??????
?????? ??????
???????? ????????
?????? ??????
???? ????
40Z39.50?????
- Z39.50??????????????,???????????????????????
- ???(Initialization)
- ??(Search)
- ??(Present)
- ??????Z39.50???,???????????????
41?????????
- ???????????????????????,????
- ???????????
- ??????????????????????
- ???????????????
42????????
- ??????????????
- ?????????????????????????????,?????????????(?????
????????)???? - ????????????(Profiles)?????????????????,??????????
????????????????
43????????
- ???????????????????????????????,????
- ?????????,??????????,????????????
- ?????????,???????????????,??????????????
44Z????
45SRW/U
- SRW
- Search and Retrieve Web Service
- SRU
- Search and Retrieve URL Service
- SRW/U
- Search and Retrieve for the Web
46ZING
ZINGZ39.50 (international) Next Generation
srw
sru
47ZING
srw
sru
cql
Common Query Language
48ZING
srw
sru
cql
49ZING
srw
sru
zeeRex
cql
ez3950
ZOOM
50ZING
srw
sru
zeeRex
cql
ez3950
ZOOM
51ZING
Z39.50 Explain Explained and Re-engineered in XML
srw
sru
zeeRex
52ZING
srw
sru
zeeRex
Z39.50
53ZING
srw
sru
zeeRex
54ZING
Z39.50 Object Oriented Model
srw
sru
ZOOM
55ZING
srw
sru
ZOOM
Z39.50
56ZING
srw
sru
ZOOM
57ZING
srw
sru
58SRW/U Features
- SOAP and URL-based access
- Common Query Language -- CQL
- Retains several Z39.50 abstractions.
- .. And discards a few
59SRW/U Features
- SOAP and URL-based access
- Common Query Language -- CQL
- Retains several Z39.50 abstractions.
- .. And discards a few
60SRW/U Features
- SOAP and URL-based access
- Common Query Language -- CQL
- Retains several Z39.50 abstractions .
- .. And discards a few
61SRW/U Features
- SOAP and URL-based access
- Common Query Language -- CQL
- Retains several Z39.50 abstractions .
- .. And discards a few
62SRW/U retains these Z39.50 concepts.
- result sets
- abstract access points
- abstract record schemas
- application level diagnostics
- Explain
63.. But differs from Z39.50 in these respects
- Web-based
- Connectionless
- XML
- Protocol (no ASN.1)
- records (no record syntax)
- CQL -- user-friendly query
64The Weaknesses of Classic Z39.50
- Not popular with the Web community
- Connection-based Sessions
- Binary Encoding
- Transmitted directly over TCP/IP
- Complicated
65Z39.50 Complicated?
- 11 Native Services
- Init, Search, Present, DeleteResultSet,
AccessControl, ResourceControl,
TriggerResourceControl, ResourceReport, Scan,
Sort, Segment. - 7 Extended Services
- PersistentResultSet, PersistentQuery,
PeriodicQuerySchedule, ItemOrder, DatabaseUpdate,
ExportSpecification, ExportInvocation
66Z39.50 Complicated?
- 156 page standard
- 147 in the 2002 draft
- Vocabulary from the OSI community
- Target, Origin, APDU, A-association,
Z-association.. - Records described with ASN.1
- Attribute Sets
67The Strengths of Classic Z39.50
- Result Sets (a.k.a. Statefulness)
- Abstraction
- Abstract Access Points (Attribute Sets)
- Abstract Record Schemas
- Explain
68SRW Search and Retrieve on the Web
- SOAP (Simple Object Access Protocol) Based
- HTTP
- XML
- Records Described in WSDL (Web Service
Description Language) - 1 Service SearchAndRetrieve
- No Documentation -)
69Semantically equivalent to classic Z39.50
- Gateways Trivial
- Preserves the experience of the ZIG without the
baggage of the standard - Syntactic equivalence discounted
70SRW The Basics
- Only one database per request
- String (not structure) based queries
- Index Sets, not Attribute Sets
- One Record Syntax (XML)
71The SRW Request
- String Query
- Integer StartRecord
- Integer MaximumRecords
- String RecordSchema
72The SRW Response
- String ResultSetReference
- ResultSetName
- ResultSetTimeToLive
- Integer TotalHits
- Records
- Status (Status Code Diagnostic)
73CQL Common Query Language
- Loosely based on CCL Search
- Boolean Proximity Operators
- Index Sets Indexes
- Truncation Characters , ?
- Example
- dc.titleharry potter or bib1.isbn123-456-78x
74Explain Now Possible
- Not practical for classic Z39.50
- List of supported Schemas
- List of supported Indexes
75SRU Search and Retrieve with URLs
- SRW Without the SOAP Wrapper
- Intended for Thin Clients
- Adds ResponseSchema Parameter
- Example
- http//deimos.oclc.org/SRW/etdcat?querynuclearma
xRecords10
76Outlook for SRW/SRU
- A solution looking for a problem?
- Not likely to be quickly adopted by the library
community - Could show up first in other communities
MetaSearch, Geospatial, Biodiversity, OAI(?) - Will be seen as a competitor with XML Query
77Links
- http//www.loc.gov/
- LC Home Page
- http//www.loc.gov/srw
- SRW/U Home Page
- http//www.loc.gov/cql
- CQL Home Page
- rden_at_loc.gov
78OAI Open Archival Initiative
79??
- Open Archives Initiative Protocol for Metadata
Harvesting,(OAI-PMH)??OAI??,??????????????????????
????,???????Web????????????????????????? - ????????????
- ???????????
- ????????????????
80??
- ?????????????HTTP??????,???metadata???
- ??????????(Repository)????????????
81????
82????
- DP????????????,??OAI????metadata?????????????????
???OAI????????metadata??????(Web???)? - SP???????OAI????????????,?????metadata??????????
- ???(Repository)??HTTP,??OAI????????????????
- ???(Set)???????????????????,?????????????????????
,?????????,???(node)????????,??????????????? - ??(Record)??????????????OAI??,?????????XML???????
????
83??????????
- Header-????????????,???????
- Unique identifier-????????????
- Datestamp-?????????????????,???????????????
- Metadata-??????metadata???
- About-??????????????????,???????
84????
- ??OAI??1.0??,???OAI??????????Content-Type??,????te
xt/xml?,?XML?????????Unicode?UTF-8????,?UTF-8??XML
?????,????8???Unicode?????,?XML????prolog???? - lt?xml version1.0 encodingUTF-8?gt
85????
- definitions concepts
- repository
- record
- identifier
- datestamp
- set
- protocol features
- HTTP encoding
- metadata prefix schema
- flow control
- protocol requests
- supporting requests
- harvesting requests
86Repository
87Record
ltrecordgt ltheadergt ltidentifiergtoaieg001lt/ident
ifiergt ltdatestampgt1999-01-01lt/datestampgt lt/head
ergt ltmetadatagt ltdc xmlnshttp//purl.org/dcgt
lttitlegtMy Examplelt/titlegt lt/dcgt lt/metadatagt
ltaboutgt ltea xmlnshttp//www.arXiv.org/ea
ltusagegtNo restrictionslt/usagegt lt/eagt lt/aboutgtlt
/recordgt
88Identifiers
oaincstrlncstrl.cornellcs/TR94-1418
locally unique key for extracting a record from a
repository
oai-identifier oaiarchive-identifierrecord-ide
ntifier
89Selective Harvesting - Datestamps
90Selective Harvesting - Sets
S2
91Set Specifics
- Repositories define hierarchical organization
- Each item in a repository may be organized in one
set, several sets, or no sets at all - Meaning of sets or of set hierarchy is not
defined in protocol - Individual communities may formulate common set
configurations
92HTTP encoding - requests
BASE-URL -----------gt an.oa.org/OAI-scriptkeyword
arguments --gt verbListIdentiferssetS1
GET http//an.oa.org/OAI-script?verbListIdenti
ferssetS1
POST POST http//an.oa.org/OAI-script
HTTP/1.0 Content-Length 78 Content-Type
application/x-www-form-urlencoded
verbListIdentiferssetS1
93HTTP encoding - responses
ltxml version1.0 encodingUTF-9
?gtltGetRecord xmlnshttp//oai.namespace.uri
xmlnsxsihttp//w3.namespace.uri xsischemaL
ocationhttp//oai.namespace.uri http//oai.sc
hemaURLgt ltresponseDategt2000-19-01T193030-0400
lt/responseDategt ltrequestURLgthttp//an.oa.org/OAI-
script?verbGetRecord ampidentifieroai3Aar
Xiv3A0001 ampmetadataPrefixoai_dclt/request
URLgt ltrecordgt record contents lt/record addit
ional recordslt/GetRecordgt
94Metadata Prefix and Schema
- Support for harvesting multiple metadata formats
- Metadata schema each format must have a
validating XML schema at a publicly accessible
URL (communities may define shared formats and
schema. - Metadata prefix each repository maps a prefix to
the schema it supports, which is used in protocol
requests. - Support for unqualified Dublin Core mandatory
- Reserved schema URL at http//www.openarchives.org
/OAI/dc.xsd - Reserved prefix oai_dc.
95Flow Control
96Flow Control Specifics
- Applies to all protocol requests that return
lists ListRecords, ListIdentifiers, ListSets - ResumptionToken is opaque
- Semantics of partitioning of responses within
resumption requests is undefined - Time-to-live of resumptionToken is not defined by
the protocol
97OAI harvesting tools
service provider
data provider
- Supporting protocol requests
- Identify
- ListMetadataFormats
- ListSets
- Harvesting protocol requests
- ListRecords
- ListIdentifiers
- GetRecord
98Supporting Protocol Requests
service provider
data provider
Identify
- Repository name
- Base-URL
- Admin e-mail
- OAI protocol version
- Description Container
99Supporting Protocol Requests
service provider
data provider
ListMetadataFormats
- REPEAT
- Format prefix
- Format XML schema
- /REPEAT
100Supporting Protocol Requests
service provider
data provider
ListSets
- REPEAT
- Set Specification
- Set Name
- /REPEAT
101Harvesting Requests
service provider
data provider
froma
untilb
setklm ListRecords metadataPrefixoai_dc
- REPEAT
- Identifier
- Datestamp
- Metadata
- About Container
- /REPEAT
102Harvesting Requests
service provider
data provider
froma
untilb ListIdentifiers setklm
- REPEAT
- Identifier
- Datestamp
- /REPEAT
103Harvesting Requests
service provider
data provider
identifieroaimlib123a
GetRecord metadataPrefixoai_dc
- Identifier
- Datestamp
- Metadata
- About
104?????????
105?????????
106?????????
107Z39.50?OpenURL?OAI?????????
- ????
- ?????????????????
- ????
- Z39.50????MARC???,??????????
- OpenURL??????????,???????????
- OAI??????????????
- ??????
- OpenURL???????target,?????
- OAI ????????,????????
- ????
- ??????????,?????????
- ????????????,??
- ?????????(?DOI)
- ????????????
- ????????(?CrossRef)
- ???????(?SFX)
108?????
- ?????????????
- ????????
- ???????????
- ???????????????????
- DOI CrossRef?OpenURL SFX
- OAI
- Z39.50 ILL