Title: Interoperability: architectures and connections
1Interoperabilityarchitectures and connections
"Hyper Clumps, Mini Clumps and National
Catalogues resource discovery for the 21st
century 11th November 2004, British Library,
London
John Gilby, M25 Systems Team, LSE Ashley Sanders,
Copac Team, MIMAS
2Contents
- Overview of technical architecture of union
catalogues (Copac InforM25) - Introduce Z39.50 to Z39.50 middleware issues to
consider - CC-interop and JAFER
- Installation, configuration testing
- Results set issues and searching times
3A reminder, Z39.50 is
- a standard for information retrieval
- a client/server relationship
- Z-client stand-alone in PC or associated with
web server/user interface - Z-server - generally a module in library systems
- a method for communication between disparate
computer systems (such as a library catalogue and
a users PC)
4Copac
- has 26 libraries (including large research,
academic and BL, NLS) - geographically covers whole of UK
- JISC funded, administered by MIMAS
- has control over indexes and searching process
- can be searched via Z39.50
- periodic data loads
- live circulation data via Z39.50 very
successful and popular with users - Copac V3 experimental Z39.50 searching of Copac
and National Library of Wales
5CURL/Copac database creation
Record pre-processingstandardisation problem
identification
Incoming MARC records from contributing
institutions
CURL database creationMARC21 UKMARC
Copac databaseZ-server, OpenURL web interface
Formation of consolidated and individual records
indexes
Duplicate checkspass/fail
web Z39.50
6Distributed catalogue
- typically has up to 40 library catalogues
(academic CAIRNS, InforM25, RIDING Public -
WiLL) - regionally based
- funded by regional organisation
- rely on institutional catalogues for record
standards, indexing and Z-server configurations - some control over Z39.50 searching process
- data is as up to date as library OPAC
- clump software combines result sets and
presents them to user - generally cannot accept queries outside of user
interface
7Union catalogues
User
network
Distributed catalogueZ-client software and user
interface
Copacsingle, large database
network
Z-server/institutional library systems
8Z to Z Middleware
Institution Z-serverA
Z39.50
Z39.50toZ39.50Middleware
Remote user Z-client
Institution Z-serverB
Z39.50
e.g. Copac V3
Local user web interface
e.g. M25 libraries
9 Connection Issues
- When to make connections ?
- Which Z-servers ?
- selecting some/all, landscaping
- Access Authentication
- handled by middleware
- Timing of middleware response
- users client is expecting single response
- middleware has to wait for Z-servers to respond
before it responds to client - automatic time-out advisable
10Search Result Set Issues
- Query transformation
- multiple Z-servers behave differently to an
incoming query - user sends query in their own format (attribute
set) - need to avoid failed searches
- middleware transforms query to form suitable for
individual Z-servers - Response aggregation
- users client cannot know hits/Z-server
- client must display origin of record
- various options
11and so to JAFER
- Middleware options for CC-interop
- graft Z39.50 server onto existing InforM25
software - develop completely new software
- use existing available software
- JAFER Toolkit Project (JISC 5/99 Programme)
- readily available supported
- could do most of what was required
12Working with JAFER
- JAFER http//www.jafer.org/
- increased the JAFER logging facilities
- established subsets of libraries for searching
- produced XSLT stylesheets
- Created new Copac Interface
- copy of standard Copac web interface tailored
for testing JAFER
13Copac searching InforM25
14Copac searching InforM25
15Copac searching InforM25
16Copac searching InforM25
17Search tests
- Search set 1 - Copac Z39.50 criteria
- no query transformations
- Search set 2 - M25 best practice settings
- query transforms applied
18Search test results 1
- Access failed
- variable always, sometimes, occasional, never
- Talis Aleph access problems
- firewall problems
- Access succeeded
- some searches received no response
19Search test results 2
- Response with Copac search settings
- 203 searches carried out
- 95 failed to return a result (0 or more records)
- Response with InforM25 settings
- 199 searches carried out
- 3 failed to return a result (0 or more records)Â
20Middleware benefits
- Simplifies access to range of catalogues
- Query transformation improves search success rate
- Virtual catalogue staff can
- provide centralised development and maintenance
- identify and investigate problems
- act as a central contact point
- Can interconnect the (JISC) Information
Environment - Potentially useful for a National Catalogue
21Search problems/solutions
- Users lose control of query
- Search consistency
- failure of catalogues to respond
- lowest common denominator or all options?
- catalogues searching different fields
- catalogues searching fields in different ways
- Standardisation
- profiles eg. Bath Profile
- work on index standardisation
22Response times
- Improved access to resources
- benefits end-user and library staff
- BUT
- impacts on local catalogue
- over-large result sets
- duplication of materialÂ
- Response times
- impact on local catalogue searcher
- impact on virtual catalogue searcher
23Response time test
- Hourly search for Austen
- record time taken to obtain search result
- does not include record collection or result
processing - Number of searches responding
- c.90 within 2 seconds
- c.4 within 4-27seconds
- Overall response time governed by slowest
catalogue - Timeouts for slow to- or non-responding catalogues
24Restricted searches
- Should all searches be sent to all catalogues?
- control where searches are sent initially
- pre-defined search groups - by location/subject?
- Better to deal with large result sets through
ranking and/or sorting? - which brings us back to response times
25Summary what next ?
- JAFER tests - middleware works
- Enables distributed catalogues to be plugged
into the IE - Dynamic resource selection is technically
feasible - Clump services interested
- Further investigations
- Response-time tests
- Results processing
26Further details
- Reports on the project website
- http//ccinterop.cdlr.strath.ac.uk/documents.htm
- Copac Team
- copac_at_mimas.ac.uk
- M25 Systems Team
- m25libadmin_at_lse.ac.uk