CASTOR SRM v1.1 experience

Provided by: sdm5
Learn more at: https://sdm.lbl.gov

1
CASTOR SRM v1.1 experience
  • Presentation at SRM meeting
  • 01/09/2004, Berkeley
  • Olof Bärring, CERN-IT

2
Outline
  • CASTOR SRM v1.1 implementation
  • Interoperability tests
  • Problems found
      • SRM specification
      • GSI
  • SRM @ GGF GSM WG
      • Input to the definition of SRM-Basic
  • Conclusions and outlook

3
CASTOR SRM v1.1
  • Implements the vital operations
      • get, put, getRequestStatus, setFileStatus, getProtocols
  • No-ops
      • pin, unPin, getEstGetTime, getEstPutTime
  • Implemented but optionally disabled (requested by LCG)
      • advisoryDelete
  • CASTOR GSI (CGSI) plug-in for gSOAP
      • Also used in GFAL
  • Evolution @ CERN
      • First prototype in summer 2003
      • First production version deployed in December 2003
  • Other sites having deployed the CASTOR SRM
      • CNAF (INFN/Bologna)
      • PIC (Barcelona)

4
CASTOR SRM v1.1

5
Deployment
  • castorgrid.cern.ch (DNS load balancing)
      • gridftp01 to gridftp06, each running srm and gridftp
  • Test/dev node
  • RFIOco
  • CASTOR (stager, nameserver, ...)

6
Interoperability tests
  • CASTOR SRM has been running interoperability tests with various
    clients, notably
      • GFAL (Jean-Philippe)
      • EDG replica manager (Peter)
      • FNAL/dCache SRM (Timur)

7
Problems found
  • The interoperability problems can be classified as
      • Due to problems with the SRM specification
      • Due to assumptions in SRM or SOAP implementations
      • Due to GSI incompatibilities
  • Debugging GSI incompatibilities is by far the most difficult and
    time-consuming

8
Problems with SRM spec (1)
  • Lack of enumeration
      • All enumeration-like types are strings
      • Client needs to find a common denominator (e.g. convert all
        strings to upper case)
  • Request and file state lifecycles
      • Concise for put or get
      • Draft proposal submitted by Timur for copy. Not yet adopted by
        the CASTOR SRM implementation.
      • Undefined for mkPermanent, pin, unpin (probably irrelevant for
        the latter two?)
  • Request history
      • What should an SRM do with requests that have reached the Done
        or Failed status?
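
The "common denominator" workaround for string-typed states can be sketched as follows. This is a minimal illustration, not part of CASTOR SRM or any SRM client: the names RequestState and parse_state are hypothetical, and only the Done and Failed states are taken from the slides (PENDING and ACTIVE are assumed placeholders).

```python
from enum import Enum


class RequestState(Enum):
    # DONE and FAILED are named in the slides; PENDING and ACTIVE are
    # assumed placeholders for the non-terminal states.
    PENDING = "PENDING"
    ACTIVE = "ACTIVE"
    DONE = "DONE"
    FAILED = "FAILED"


def parse_state(raw: str) -> RequestState:
    """Map a free-form state string to an enum member, ignoring case and padding."""
    key = raw.strip().upper()
    try:
        return RequestState(key)
    except ValueError:
        # Surface the original spelling so the mismatch is easy to debug.
        raise ValueError(f"unknown request state: {raw!r}") from None
```

With this, "done", "Done" and " DONE " from different servers all map to the same member, while an unexpected spelling fails loudly instead of being silently compared as a string.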

9
Problems with SRM spec (2)
  • Immutability of request identifier
      • Request id is a 32-bit word
      • Unspecified if an SRM can reuse request ids for finished (Done
        or Failed) requests
  • SURL (Site URL) semantics
      • Is it a URL or a URI?
      • If URL, does it support relative and absolute paths?
      • If URI, the name space is virtually flat for an arbitrary
        client
  • Pin lifetime
      • Pin lifetime is defined to be subject to site policy
      • No way to query the remaining pin lifetime for a particular
        file
      • Current definition appears useless for any practical purpose
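
The URL-versus-URI question changes what a client may do with a SURL. The sketch below contrasts the two readings using Python's standard urllib.parse. The function names, the port 8443 and the file path in the usage note are assumptions made for illustration; only the host name castorgrid.cern.ch appears in the slides.

```python
from urllib.parse import urlsplit


def surl_as_url(surl):
    """URL reading: the client may split the SURL into host, port and path."""
    parts = urlsplit(surl)
    if parts.scheme != "srm":
        raise ValueError(f"not an srm SURL: {surl!r}")
    return parts.hostname, parts.port, parts.path


def surl_as_opaque_uri(surl):
    """URI reading: everything after the scheme is an opaque name.

    The client cannot assume a directory hierarchy, so the name space
    looks flat from its point of view.
    """
    scheme, _, rest = surl.partition(":")
    return scheme, rest
```

For a hypothetical SURL such as srm://castorgrid.cern.ch:8443/castor/cern.ch/user/a/file.dat, the URL reading yields a host, a port and a path that can be walked directory by directory, whereas the URI reading yields a single opaque string that can only be compared for equality.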

10
Problems with SRM spec (3)
  • Exception handling and error propagation
      • Unspecified if a multi-file request should fail when a subset
        of the files got an error
      • Unspecified if and when an SRM can do retries
      • Only one error message, global for all files in a multi-file
        request, is available for reporting
      • Format and contents of error message undefined
  • advisoryDelete != delete
      • It may be vital to know what the effect is
          • No effect at all (if so, what happens if SURL is reused for
            a new file?)
          • Only remove disk resident copy (if so, when?)
          • Remove HSM file (if so, when?)
  • Directory creation on the fly for put requests
      • If a put request specifies a SURL corresponding to a path for
        which one or several sub-directory levels do not exist, should
        it create the missing dirs on the fly (provided the client has
        the appropriate permissions)?
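
The ambiguity around multi-file requests can be made concrete with a small sketch. It assumes every file has already reached Done or Failed, and it introduces a hypothetical strict flag to stand for the two readings the specification leaves open; neither policy is claimed to be what CASTOR SRM or any other implementation does.

```python
def overall_state(file_states, strict):
    """Derive a request-level state from per-file terminal states.

    strict=True : the request fails as soon as any file failed.
    strict=False: the request fails only if every file failed.
    """
    if not file_states:
        raise ValueError("a request needs at least one file")
    unfinished = [s for s in file_states if s not in ("Done", "Failed")]
    if unfinished:
        raise ValueError(f"files not in a terminal state: {unfinished}")

    failures = sum(1 for s in file_states if s == "Failed")
    if failures == len(file_states):
        return "Failed"
    if failures and strict:
        return "Failed"
    return "Done"
```

A request with one Done and one Failed file is reported as Failed under the strict reading and as Done under the lenient one, which is exactly the kind of divergence two independent implementations can end up with.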

11
Problems due to SRM or SOAP implementation details
  • SRM WSDL discovery
      • FNAL client puts severe constraints on the WSDL publication
      • Bug in gSOAP v2.3 WSDL importer
  • Various bugs in CASTOR SRM found but not reported here

12
GSI problems (1)
  • CASTOR (GSI) <-> EDG RC (Java TrustManager)
      • TrustManager does not use the GSI default of SSL handshake plus
        credential delegation, but just an SSL handshake
      • TrustManager client would not work with SSL 3.0, which is
        forced by GSI
      • Solution: EDG RC uses CoG (Globus Java security
        implementation) instead
  • CASTOR (GSI) <-> FNAL dCache (Java CoG)
      • FNAL client only used a limited set of encryption algorithms,
        which did not match those provided by standard GSI
      • Limited proxy certificate
      • GSI error reporting not working properly
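
The encryption-algorithm mismatch above amounts to an empty intersection during cipher negotiation. The following is a simplified model, not the actual SSL/GSI handshake: the function name negotiate and the suite labels in the usage note are hypothetical, and the server-preference ordering is an assumption.

```python
def negotiate(server_prefs, client_offer):
    """Return the first suite in the server's preference order that the
    client also offers, or None when the two sets do not overlap."""
    offered = set(client_offer)
    for suite in server_prefs:
        if suite in offered:
            return suite
    return None
```

For example, negotiate(["SUITE_A", "SUITE_B"], ["SUITE_C"]) returns None, which in a real handshake shows up as a connection failure with little indication of the cause. Adding one shared suite on either side is enough for the negotiation to succeed.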

13
GSI problems (2)
  • Administration and deployment issues
      • EDG Globus patch for supporting dynamic pool accounts requires
        the GRIDMAPDIR environment variable to be declared, even if the
        default location is used for the security files
      • Configuration problems (right root CA not trusted)
      • CERN CA changed the certificate naming scheme (number added at
        the end of the DN). New certificates were not automatically
        propagated (to, for instance, FNAL).
  • The effort for debugging GSI problems will scale with the number of
    SRM implementations
      • Establishing an SRM reference implementation for certifying new
        servers and clients would help
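
One way a DN naming change can surface at a remote site is sketched below, assuming authorization is done by exact DN match against a grid-mapfile style list of quoted DN and local account pairs. The DNs and account names in the usage note are invented for illustration, and parse_gridmap is a hypothetical helper, not a Globus API.

```python
import shlex


def parse_gridmap(text):
    """Parse lines of the form: "<quoted DN>" <account> into a dict."""
    mapping = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        dn, account = shlex.split(line)
        mapping[dn] = account
    return mapping
```

Given a map containing "/C=CH/O=CERN/CN=Jane Doe" jdoe, a lookup for the old DN succeeds, but a certificate reissued as /C=CH/O=CERN/CN=Jane Doe 1234 no longer matches any entry until the new DN has been propagated to that site.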

14
Conclusions and outlook
  • CASTOR SRM v1.1 has been in production for a couple of months at
    CERN and at some other CASTOR Tier-1 sites
  • SRM interoperability does not come for free
      • Definition not concise enough, room for too much site-specific
        interpretation
      • Is GSI interoperability an illusion and, if so, will it
        continue to be so?
  • We currently have no plans for a CASTOR SRM v2.1 implementation. We
    would rather tighten up SRM v1.1 in the context of the GGF GSM WG
    and the SRM-Basic definition