Title: An Introduction to the
1An Introduction to the
By Roman Olschanowsky roman2u_at_sdsc.edu
2Outline
- SDSC and History of SRB
- Example Project
- Introduction to SRB
- Discussion on SRB basics
- SRB Clients
- Overview of a Data Grid
- Infrastructure
- Topology
3www.sdsc.edu
Human infrastructure Experienced
multi-disciplinary staff support a broad spectrum
of national science, engineeringand technology
projects
Archival Systems
Blue Gene/L (Due 12/04)
6 PB
DataStar IBM Power4
Networking Visualization Storage and
Compute Resources
2.8/5.7 TF
10.4 TF
4Sites Using the SRB
5SDSC SRB Projects (60 million, .5 PB )
- Digital Libraries
- UCB, Umich, UCSB, Stanford,CDL
- NSF NSDL - UCAR / DLESE
- NASA Information Power Grid
- Astronomy
- National Virtual Observatory
- 2MASS Project (2 Micron All Sky Survey)
- Particle Physics
- Particle Physics Data Grid (DOE)
- GriPhyN
- SLAC Synchrotron Data Repository
- Medicine
- Digital Embryo (NLM)
- Earth Systems Sciences
- ESIPS
- LTER
- Persistent Archives
- NARA
- LOC
6The SCEC Project
- Southern California Earthquake Center
- 400 people, the best earthquake seismologists in
the country (33 states) and several from abroad
(9 countries). (Sep. 2004 SCEC AHM attendees) - Simulating a 7.7 earthquake in the L.A. basin
- 10 year effort
- 100 TB of input data ( soil conditions,
topography, grid coordinates, etc ) - 240 procs on SDSC Datastar cluster, 5 days, 1 TB
RAM, 2GB/sec IO - Thanks!
- SDSC, scientific applications group, with porting
the code parallelizing the calculation and the
IO and generalizing the code for scaling up to a
large run. Offered invaluable insights regarding
IO management. - SRB, took care of draining the GPFS cache
regularly, moving 43 TB of data safely to
archive storage. That task was completed a mere
36 hours after the end of the calculation. - The SRB was critical in this achievement.
7SDSC SRB Example
8Storage Resource Broker (SRB)
- A distributed file system (Data Grid)
- Client-server architecture.
- Abstracts physical
- SRB provides the ability to transparently share
data across remote sites. - Heterogeneous Resources
- Single sign on
- Single logical file hierarchy
9What we are familiar with
10What we are not familiar with, yet
11How do the file systems differ?
- Logical Abstraction
- Folders are NOT physical
- Files do NOT inherit physical location
- Everything is potentially distributed
- Access Control
- Permissions are NOT rwxrwxrwx
- Permissions ARE on a object by object basis
- Groups and permissions ARE more similar to NTFS
- Domains
- Geographical / logical grouping of users
- Namespace scalability john_at_harvard john_at_mit
- Also doubles as groups
12Interfaces to theStorage Resource Broker
- inQ Windows Client
- Scommands UNIX, DOS Command line Client
- Jargon Java API and GUI components
- mySRB Web Client
- Matrix WSDL, Data Grid Workflows
- C, C C and C API
- Python Python API
- Perl Perl API
13Common Scommands (69 total)
- Sinit
- Senv
- Spwd
- Sls
- Scd
- Sget
- Sput
- Ssh
- Scp
- Smv (logical)
- Sphymove (physical)
- Srm
- Smkdir
- Srmdir
- Serror
- Schmod
14mySRB
15BIRN Portal (perl based)
16NEEScentral Portal (php based)
17Biomedical Informatics Research Network (BIRN)
- Major collaboration with SDSC, several of the
projects Co-Investigators and Co-PIs are at
SDSC. - BIRNs purpose is to provide its consortium of
neuroscience laboratories the ability to share,
compute, and collaborate. - The Storage Resource Broker provides the ability
to transparently share data across remote sites.
18The BIRN SRB Data Grid
19Doing this Manually
20The BIRN Data Grid
21The grid is in the details
22File Replication
- Sls
- /home/Demo/SRB-Tutorial/files-2
- Doc.txt
- Sls -l
- /home/Demo/SRB-Tutorial/files-2
- romanoly 0 z-ucsd-ncmir-nas1 15
2003-07-09-05.15 Doc.txt - romanoly 1 z-jhu-cis-nas0
15 2003-07-09-05.16 Doc.txt - romanoly 2 z-stanford-lucas-nas 15
2003-07-09-05.16 Doc.txt - romanoly 3 z-umn-cmrr-nas0 15
2003-07-09-05.16 Doc.txt - romanoly 4 z-uci-bic-nas0
15 2003-07-09-05.17 Doc.txt
23SRB Location or Slave Server
SRB
Physical Resources z-jhu-cis-nas0
z-jhu-cis-nas1
Location
z-jhu-cis-nas2
SRB
Logical Resource
jhu-cis-nas
24Pooling physical resources
25Logical / Compound Resources
SRB
instant replication
fast archival
resource pooling
My-Resource
SRB
26Logical Resources
27Collaboration Reports
28Thanks!
- SRB handles large data and provides the ability
to share and collaborate on distributed
heterogeneous resources. - Questions?
-
- www.sdsc.edu/srb
- srb_at_sdsc.edu