An Introduction to the - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

An Introduction to the

Description:

An Introduction to the – PowerPoint PPT presentation

Number of Views:44
Avg rating:3.0/5.0
Slides: 29
Provided by: Rom566
Category:
Tags: introduction | yon

less

Transcript and Presenter's Notes

Title: An Introduction to the


1
An Introduction to the
By Roman Olschanowsky roman2u_at_sdsc.edu
2
Outline
  • SDSC and History of SRB
  • Example Project
  • Introduction to SRB
  • Discussion on SRB basics
  • SRB Clients
  • Overview of a Data Grid
  • Infrastructure
  • Topology

3
www.sdsc.edu
Human infrastructure Experienced
multi-disciplinary staff support a broad spectrum
of national science, engineeringand technology
projects
Archival Systems
Blue Gene/L (Due 12/04)
6 PB
DataStar IBM Power4
Networking Visualization Storage and
Compute Resources
2.8/5.7 TF
10.4 TF
4
Sites Using the SRB
5
SDSC SRB Projects (60 million, .5 PB )
  • Digital Libraries
  • UCB, Umich, UCSB, Stanford,CDL
  • NSF NSDL - UCAR / DLESE
  • NASA Information Power Grid
  • Astronomy
  • National Virtual Observatory
  • 2MASS Project (2 Micron All Sky Survey)
  • Particle Physics
  • Particle Physics Data Grid (DOE)
  • GriPhyN
  • SLAC Synchrotron Data Repository
  • Medicine
  • Digital Embryo (NLM)
  • Earth Systems Sciences
  • ESIPS
  • LTER
  • Persistent Archives
  • NARA
  • LOC

6
The SCEC Project
  • Southern California Earthquake Center
  • 400 people, the best earthquake seismologists in
    the country (33 states) and several from abroad
    (9 countries). (Sep. 2004 SCEC AHM attendees)
  • Simulating a 7.7 earthquake in the L.A. basin
  • 10 year effort
  • 100 TB of input data ( soil conditions,
    topography, grid coordinates, etc )
  • 240 procs on SDSC Datastar cluster, 5 days, 1 TB
    RAM, 2GB/sec IO
  • Thanks!
  • SDSC, scientific applications group, with porting
    the code parallelizing the calculation and the
    IO and generalizing the code for scaling up to a
    large run. Offered invaluable insights regarding
    IO management. 
  • SRB, took care of draining the GPFS cache
    regularly,  moving 43 TB of data safely to
    archive storage.   That task was completed a mere
    36 hours after the end of the calculation.
  • The SRB was critical in this achievement.

7
SDSC SRB Example
8
Storage Resource Broker (SRB)
  • A distributed file system (Data Grid)
  • Client-server architecture.
  • Abstracts physical
  • SRB provides the ability to transparently share
    data across remote sites.
  • Heterogeneous Resources
  • Single sign on
  • Single logical file hierarchy

9
What we are familiar with
10
What we are not familiar with, yet
11
How do the file systems differ?
  • Logical Abstraction
  • Folders are NOT physical
  • Files do NOT inherit physical location
  • Everything is potentially distributed
  • Access Control
  • Permissions are NOT rwxrwxrwx
  • Permissions ARE on a object by object basis
  • Groups and permissions ARE more similar to NTFS
  • Domains
  • Geographical / logical grouping of users
  • Namespace scalability john_at_harvard john_at_mit
  • Also doubles as groups

12
Interfaces to theStorage Resource Broker
  • inQ Windows Client
  • Scommands UNIX, DOS Command line Client
  • Jargon Java API and GUI components
  • mySRB Web Client
  • Matrix WSDL, Data Grid Workflows
  • C, C C and C API
  • Python Python API
  • Perl Perl API

13
Common Scommands (69 total)
  • Sinit
  • Senv
  • Spwd
  • Sls
  • Scd
  • Sget
  • Sput
  • Ssh
  • Scp
  • Smv (logical)
  • Sphymove (physical)
  • Srm
  • Smkdir
  • Srmdir
  • Serror
  • Schmod
  • Dont forget to Sexit!

14
mySRB
15
BIRN Portal (perl based)
16
NEEScentral Portal (php based)
17
Biomedical Informatics Research Network (BIRN)
  • Major collaboration with SDSC, several of the
    projects Co-Investigators and Co-PIs are at
    SDSC.
  • BIRNs purpose is to provide its consortium of
    neuroscience laboratories the ability to share,
    compute, and collaborate.
  • The Storage Resource Broker provides the ability
    to transparently share data across remote sites.

18
The BIRN SRB Data Grid
19
Doing this Manually
20
The BIRN Data Grid
21
The grid is in the details
22
File Replication
  • Sls
  • /home/Demo/SRB-Tutorial/files-2
  • Doc.txt
  • Sls -l
  • /home/Demo/SRB-Tutorial/files-2
  • romanoly 0 z-ucsd-ncmir-nas1 15
    2003-07-09-05.15 Doc.txt
  • romanoly 1 z-jhu-cis-nas0
    15 2003-07-09-05.16 Doc.txt
  • romanoly 2 z-stanford-lucas-nas 15
    2003-07-09-05.16 Doc.txt
  • romanoly 3 z-umn-cmrr-nas0 15
    2003-07-09-05.16 Doc.txt
  • romanoly 4 z-uci-bic-nas0
    15 2003-07-09-05.17 Doc.txt

23
SRB Location or Slave Server
SRB
Physical Resources z-jhu-cis-nas0
z-jhu-cis-nas1
Location
z-jhu-cis-nas2
SRB
Logical Resource
jhu-cis-nas
24
Pooling physical resources
25
Logical / Compound Resources
SRB
instant replication
fast archival
resource pooling
My-Resource
SRB
26
Logical Resources
27
Collaboration Reports
28
Thanks!
  • SRB handles large data and provides the ability
    to share and collaborate on distributed
    heterogeneous resources.
  • Questions?
  • www.sdsc.edu/srb
  • srb_at_sdsc.edu
Write a Comment
User Comments (0)
About PowerShow.com