Title: Course Introduction (cont.) and Introduction to Globus
1Course Introduction (cont.) and Introduction to
Globus
- Joint work of USC Information Sciences Institute
- and Argonne National Laboratory
2Last timeDefined computational grids
- Emerging computational and networking
infrastructure - pervasive, uniform, and reliable access to
remote data, computational, sensor, and human
resources - Enable entirely new approaches to applications
and problem solving - remote resources the rule, not the exception
3Example Aeronautic Design
Collaboration
Simulation
Instrumentation
Design data
4Why Now?
- The Internet as infrastructure
- Increasing bandwidth, advanced services
- Advances in storage capacity
- Terabyte store is 150,000
- Increased availability of compute resources
- clusters, supercomputers, etc.
- Advanced applications
- simulation based design, advanced scientific
instruments, ...
5Tomorrows InfrastructureNot Just Faster and
More Reliable
O(109) nodes
Caching
Resource Discovery
QoS
- Application-centric heterogeneous, mobile
end-systems many embedded capabilities rich
services user-level quality of service
6Today
- How does grid computing differ from traditional
distributed computing? - Where do grids get their names?
- What basic services must be provided by a grid
infrastructure? - Introduction to Globus
7A Grid Application Scenario
- A distributed simulation involving 10
supercomputers at 10 different locations - How do you know where they are?
- How do you identify yourself to each?
- How do you get permission to use them?
- How do you submit remote jobs?
- How do you get access to resources on all the
machines simultaneously? - What happens if a machine fails?
- How are input/output files managed?
8Basic Grid Services
- Security
- Authentication both client and server
- Authorization what privileges does the client
have? - Access control Sites want local control of
operations that remote users are allowed to
perform - Confidential data transfer using encryption
9Basic Grid Services (cont.)
- Resource management
- Mechanism for submitting jobs to remote locations
- Local policies for use, management, resource
configuration - Scheduling of important resources
- Coordinating scarce, expensive resources
(e.g., cooperating supercomputers) - Advanced reservations to guarantee
- Quality of service
- Completion of operations (e.g., reserve disk
space for a large data transfer)
10Basic Grid Services (cont.)
- Information Services
- Register and query information about grid
resources - Where are all the Cray T3Es in the grid?
- Where is a storage system with 250 gigabytes of
free space that transfers data at 1 gigabit/sec? - Centerpiece for many Grid components
- Performance measurement services
- What is the current bandwidth of the link from
jupiter.isi.edu to apogee.sdsc.edu? - Dynamic environment assume the information
service contains old information
11Basic Grid Services (cont.)
- Efficient Data Transfers
- Secure (authentication, encryption)
- Parallel transfers
- Partial file transfers
- Third-party transfers
- Reliable transfers
- Replica Management Service
- Large (petabyte-scale) datasets
- Multiple stored and cached copies
- Select the best copy with best performance
12Basic Grid Services (cont.)
- Fault detection
- Detect and report failure of component of a
computation - Limited by ability to distinguish between network
partition and system failure - Goal make low-level operations reliable
- No libraries for checkpoint and restart
- Cant checkpoint a socket
- Only application knows how to checkpoint and
restart - Likewise, storage system must do logging
13Major Grid ComputingInfrastructure Projects
- The Globus Project
- Bag of services model for grid computing
- USC Information Sciences Institute and Argonne
National Laboratory (Chicago) - We will use Globus for most of the examples in
this class - The Legion Project
- Object-oriented approach to grid computing
- The Condor Project
- Schedule computations on pool of resources
14Today
- How does grid computing differ from traditional
distributed computing? - Where do grids get their names?
- What basic services must be provided by a grid
infrastructure? - Introduction to Globus
15Grid Services Architecture
High-energy physics data analysis
Collaborative engineering
On-line instrumentation
Applications
Regional climate studies
Parameter studies
16The Globus Approach
- The Globus toolkit provides a range of basic Grid
services - Security, information, fault detection,
communication, resource management, ... - These services are simple and orthogonal
- Can be used independently, mix and match
- Programming model independent
- For each there are well-defined APIs
- Standards are used extensively
- E.g., LDAP, GSS-API, X.509, ...
17Grid Security Infrastructure
- Single-sign on, run anywhere if authorized
- Standards based (GSS, SSL, X.509)
- GSS-API Interface
- Identity/credential mapping at each resource
- Limited delegation of rights
- Integrated into wide variety of tools
- Globus Resource Management
- Secure shell, FTP,
- Storage Resource Broker
18Authentication Model
- Authentication is done on a user basis
- Single authentication step allows access to all
grid resources - No communication of plaintext passwords
- Most sites will use conventional account
mechanisms - You must have an account on a resource to use
that resource - Sites may use generic Grid accounts
- Not common, but Globus can deal with it
19Grid Security Infrastructure
- Based on public key technology
- Standard X.509 certificate, same as certificates
used for the Web - Each user has
- a Grid user id (called a Subject Name)
- /CUS/OGlobus/OUniversity of Southern
California/OUInformation Sciences
Institute/CNAnn Chervenak - a private key (like a password)
- a certificate signed by a Certificate Authority
(CA) - A gridmap file at each site specifiesgrid-id
to local-id mapping
20Certificate Based Authentication
- User has a certificate, signed by a trusted
certificate authority (CA) - Certificate contains users name and public key
- Globus project operates a CA
- Users private key is used to encode a challenge
string - Public key is used to decode the challenge
- If you can decode it, you know the user
- Treat your private key carefully!!
- Private key is stored in encrypted form
21User Proxies
- Minimize exposure of users private key
- A temporary credential for use by our
computations - We call this a user proxy certificate
- Allows process to act on behalf of user
- User-signed user proxy certificate stored in
local file - Proxys private key is not encrypted
- Rely on file system security, proxy certificate
file must be readable only by the owner
22Delegation
- Remote creation of a user proxy
- Allows remote process to act on behalf of the
user - Avoids sending passwords or private keys across
the network
23Single sign-onvia grid-id
CREDENTIAL
Assignment of credentials to user proxies
Globus Credential
Mutual user-resource authentication
Site 2
Mapping to local ids
Authenticated interprocess communication
GSSAPI multiple low-level mechanisms
Certificate
24Resource Management
- Globus Resource Allocation Manager (GRAM)
- Uniform interface to resource management
- Globus Arch. for Reservation and Allocation
- Co-allocation of compute resources
- Immediate and advance reservation of network and
computers in prototype form - Fault detection service
- Network measurement tools
- Code management and distribution infrastructure
25Resource Management
- Resource Specification Language (RSL) is used to
communicate requirements - The Globus Resource Allocation Manager (GRAM) API
allows programs to be started on remote
resources, despite local heterogeneity - A layered architecture allows application-specific
resource brokers and co-allocators to be defined
in terms of GRAM services
26Resource Management Architecture
RSL specialization
RSL
Application
Information Service
Queries
Info
Ground RSL
Simple ground RSL
Local resource managers
GRAM
GRAM
GRAM
LSF
EASY-LL
NQE
27Resource Specification Language
- Common notation for exchange of information
between components - RSL provides two types of information
- Resource requirements Machine type, number of
nodes, memory, etc. - Job configuration Directory, executable, args,
environment - API provided for manipulating RSL
28RSL Syntax
- Elementary form parenthesis clauses
- (attribute op value value )
- Operators Supported
- lt, lt, , gt, gt , !
- Some supported attributes
- executable, arguments, environment, stdin,
stdout, stderr, resourceManagerContact,resourceMa
nagerName - Unknown attributes are passed through
- May be handled by subsequent tools
29Constraints
- For example
- (countgt5) (countlt10)
- (max_time240) (memorygt64)
- (executablemyprog)
- Create 5-10 instances of myprog, each on a
machine with at least 64 MB memory that is
available to me for 4 hours
30Multirequest
- A multirequest allows us to specify multiple
resource needs, for example - ( (count5)(memorygt64)
- (executablep1))
- ((networkatm) (executablep2))
- Execute 5 instances of p1 on a machine with at
least 64M of memory - Execute p2 on a machine with an ATM connection
- Multirequests are central to co-allocation
31Co-allocation
- Simultaneous allocation of a resource set
- Handled via optimistic co-allocation based on
free nodes or queue prediction - In the future, advance reservations will also be
supported
32A Co-allocation Multirequest
( (resourceManagerContact
flash.isi.edu754/CUS//CNflash.isi.edu-fork)
(count1) (label"subjob A")
(executable my_app1) ) (
(resourceManagerContact
sp139.sdsc.edu8711/CUS//CNsp097.sdsc.edu-lsf
") (count2) (label"subjob B")
(executablemy_app2) )
33Job Submission Interfaces
- Globus Toolkit includes several command line
programs for job submission - globus-job-run Interactive jobs
- globus-job-submit Batch/offline jobs
- globusrun Flexible scripting infrastructure
- Others are building better interfaces
- General purpose
- Condor-G, PBS, GRD, Hotpage, etc
- Application specific
- ECCE, Cactus, Web portals
34Grid Information Services
- Publish and retrieve information about system
elements - Used for discovery, configuration, scheduling
- Distributed collection of information servers and
index nodes - LDAP V3 as wire protocol and API
35Examples of Useful Information
- Characteristics of a compute resource
- IP address, software available, system
administrator, networks connected to, OS version,
load - Characteristics of a network
- Bandwidth and latency, protocols, logical
topology - Characteristics of the Globus infrastructure
- Hosts, resource managers
36Grid Information Service
- Provide access to static and dynamic information
regarding system components - A basis for configuration and adaptation in
heterogeneous, dynamic environments - Requirements and characteristics
- Uniform, flexible access to information
- Scalable, efficient access to dynamic data
- Access to multiple information sources
- Decentralized maintenance
37The Globus ToolkitMetacomputing Directory Service
- Store information in a distributed directory
- Directory stored in collection of LDAP servers
- Directory can be updated by
- Information providers and tools
- Applications (i.e., users)
- Backend tools which generate info on demand
- Information dynamically available to
- Tools
- Applications
38Directory Service Functions
- White Pages
- Look up the IP number, amount of memory, etc.,
associated with a particular machine - Yellow Pages
- Find all the computers of a particular class or
with a particular property - Temporary inconsistencies are often considered
okay - In a distributed system, you often do not know
the state of a resource until you actually use it - Information is often used as hints
- Information itself can contain ttl, etc.
39MDS Approach
Application
- Based on LDAP
- Lightweight Directory Access Protocol v3 (LDAPv3)
- Standard data model
- Standard query protocol
- Globus specific schema
- Host-centric representation
- Globus specific tools
- GRIS, GIIS
- Data discovery, publication,
Middleware
LDAP API
GRIS
GIIS
SNMP
NWS
NIS
LDAP
40Grid Resource Information Service
- Server which runs on each resource
- Given the resource DNS name, you can find the
GRIS server (well known port 2135) - Provides resource specific information
- Much of this information may be dynamic
- Load, process information, storage information,
etc. - GRIS gathers this information on demand
- White pages lookup of resource information
- Ex How much memory does machine have?
- Yellow pages lookup of resource options
- Ex Which queues on machine allow large jobs?
41Grid Index Information Service
- GIIS describes a class of servers
- Gathers information from multiple GRIS servers
- Each GIIS is optimized for particular queries
- Ex1 Which Alliance machines are gt16 process
SGIs? - Ex2 Which Alliance storage servers have gt100Mbps
bandwidth to host X? - Akin to web search engines
- Organization GIIS
- The Globus Toolkit ships with one GIIS
- Caches GRIS info with long update frequency
- Useful for queries across an organization that
rely on relatively static information
42Referral Service
- Links together multiple GRIS and/or GIIS servers
into a single LDAP namespace - Referral servers contain no actual content
43Data Grid Services
- Access to remote data
- Uniform access to diverse, remote storage
management systems - Cache management
- Transport services
- Standards based (GSI, FTP protocol)
- Client API, Extensible server, support for third
party transfer - Replica Management
44Data Intensive Issues Include
- High-speed, reliable access to remote data
- Automated discovery of best copy of data
- Manage replication to improve performance
- Co-schedule compute, storage, network
- Enforce access control on data
45The Globus Data Grid
- Two major components
- 1. Data Transport and Access
- Common protocol
- Secure, efficient, flexible, extensible data
movement - Family of tools supporting this protocol
- 2. Replica Management Architecture
- Simple scheme for managing
- multiple copies of files
- collections of files
46Motivation for a Common Data Access Protocol
- Existing distributed data storage systems
- DPSS, HPSS focus on high-performance access,
utilize parallel data transfer, striping - DFS focus on high-volume usage, dataset
replication, local caching - SRB connects heterogeneous data collections,
uniform client interface, metadata queries - Problems
- Incompatible protocols
- Each require custom client
- Partitions available data sets and storage
devices - Each protocol has subset of desired functionality
47A Common, Secure, EfficientData Access Protocol
- Common, extensible transfer protocol
- Decouple low-level data transfer mechanisms from
the storage service - Advantages
- New, specialized storage systems are
automatically compatible with existing systems - Existing systems have richer data transfer
functionality - Interface to many storage systems
- HPSS, DPSS, file systems
- Plan for SRB integration
48A UniversalAccess/Transport Protocol
- Suite of communication libraries and related
tools that support - GSI security
- Third-party transfers
- Parameter set/negotiate
- Partial file access
- Reliability/restart
- Logging/audit trail
- All based on a standard, widely deployed protocol
- Integrated instrumentation
- Parallel transfers
- Striping (cf DPSS)
- Policy-based access control
- Server-side computation
- later
49And the Universal Protocol is GSI-FTP
- Why FTP?
- Ubiquity enables interoperation with many
commodity tools - Already supports many desired features, easily
extended to support others - Well understood and supported
- We use the term GSI-FTP to refer to
- Transfer protocol which meets requirements
- Family of tools which implement the protocol
- Note GSI-FTP gt FTP
- Note that despite name, GSI-FTP is not restricted
to file transfer!
50Replica Management
- Maintain a mapping between logical names for
files and collections and one or more physical
locations - Important for many applications
- Example CERN HLT data
- Multiple petabytes of data per year
- Copy of everything at CERN (Tier 0)
- Subsets at national centers (Tier 1)
- Smaller regional centers (Tier 2)
- Individual researchers will have copies
51Our Approach to Replica Management
- Identify replica cataloging and reliable
replication as two fundamental services - Layer on other Grid services GSI, transport,
information service - Use LDAP as catalog format and protocol, for
consistency - Use as a building block for other tools
- Advantage
- These services can be used in a wide variety of
situations
52Replica Manager Components
- Replica catalog definition
- LDAP object classes for representing
logical-to-physical mappings in an LDAP catalog - Low-level replica catalog API
- globus_replica_catalog library
- Manipulates replica catalog add, delete, etc.
- High-level reliable replication API
- globus_replica_manager library
- Combines calls to file transfer operations and
calls to low-level API functions create,
destroy, etc.
53Replica Catalog Structure A Climate Modeling
Example
Replica Catalog
Logical Collection C02 measurements 1998
Logical Collection C02 measurements 1999
Filename Jan 1998 Filename Feb 1998
Logical File Parent
Location jupiter.isi.edu
Location sprite.llnl.gov
Filename Mar 1998 Filename Jun 1998 Filename
Oct 1998 Protocol gsiftp UrlConstructor
gsiftp//jupiter.isi.edu/ nfs/v6/climate
Filename Jan 1998 Filename Dec 1998 Protocol
ftp UrlConstructor ftp//sprite.llnl.gov/
pub/pcmdi
Logical File Jan 1998
Logical File Feb 1998
Size 1468762
54Replica Catalog Servicesas Building Blocks
Examples
- Combine with information service to build replica
selection services - E.g. find best replica using performance info
from NWS and MDS - Use of LDAP as common protocol for info and
replica services makes this easier - Combine with application managers to build data
distribution services - E.g., build new replicas in response to frequent
accesses
55Relationship to Metadata Catalogs
- Metadata services describe data contents
- Have defined a simple set of object classes
- Must support a variety of metadata catalogs
- MCAT being one important example
- Others include LDAP catalogs, HDF
- Community metadata catalogs
- Agree on set of attributes
- Produce names needed by replica catalog
- Logical collection name
- Logical file name
56A Model Architecture for Data Grids
Attribute Specification
Replica Catalog
Metadata Catalog
Application
Multiple Locations
NWS
Logical Collection and Logical File Name
Selected Replica
Replica Selection
MDS
gsiftp commands
Performance Information and Predictions
Disk Cache
Tape Library
Disk Array
Disk Cache
Replica Location 1
Replica Location 2
Replica Location 3
57Fault Detection Globus Heartbeat Monitor
- Detect and report failure of component of a
computation - Limited by ability to distinguish between network
partition and system failure - Optionally used within Globus Toolkit to monitor
status of system processes - Can also be used to construct special fault
monitors for applications - Example Netsolve
58Fault Detection (cont.)
- Goal make low-level operations reliable
- No libraries for checkpoint and restart
- Cant checkpoint a socket
- Only application knows how to checkpoint and
restart - Likewise, storage system must do logging
59Heartbeat Monitor
Application Level Fault Handler
!
System Monitoring Tools
Process and Host Heartbeat
Process and Host Heartbeat
Host 2
Host 1
Process Status Inquiry
Process Status Inquiry
Register/ Unregister
Register/ Unregister
60Grid Enabled Tools
- Message Passing Interface
- Multi-method communication, specialized
- CAVERNsoft
- Shared state for collaborative environments
- Condor, Nimrod-G
- High-throughput computing
- User level tools
- FTP, SSH
61Thursday, September 7
- How does grid computing differ from traditional
distributed computing? - Where do grids get their names?
- Grid hardware
- Grid applications
62Distributed Computing A Quick Review
- Andrew Tannenbaum
- A distributed system is a collection of
independent computers that appear to the users of
the system as a single computer.
63Distributed Systems Hardware
- Distributed in the local area
- Memory organization
- Shared-memory multiprocessors
- Single virtual address space shared by all CPUs
- Multicomputers with private memories
- Separate address spaces
- Interconnection network organization
- Bus-based
- A single shared network, backplane, bus or cable
- Switch-based
- Individual connections between machines
64Simplest Hardware A Bus-based Shared-Memory
Multiprocessor
Processor
Processor
Processor
Memory
Cache
Cache
Cache
Bus
- Shared memory
- Caches must be kept consistent
- Bus bandwidth limits to 64 processors
65Bus-based Distributed Shared-Memory
(DSM)Multiprocessor
Memory
Memory
Memory
Memory
Cache
Cache
Cache
Cache
Processor
Processor
Processor
Processor
Bus
- Each processor contains portion of shared
memory - Local accesses fast, remote accesses slow
- NUMA non-uniform memory access
66Switch-Based Multicomputer Workstation Cluster
Work-station
Work-station
Ethernet Switch
Work-station
Work-station
Work-station
Work-station
- Workstations share resources
file servers, printers, storage archives - Schedule jobs
- Use idle workstations
67HardwareWhat is different in a grid?
- Heterogeneous hardware environment
- computing platforms
- network connections
- storage systems and caches
- Wide-area distribution
- Wide-area network latency and bandwidth
- Resources in different administration domains
- Dynamic environment
- Resources enter and leave grid
68Software Issues in Distributed Operating
Systems
- Communication models
- Client-Server Model
- Remote procedure call
- Group communication
- In a grid
- Algorithms must tolerate wide-area latency for
message transfers - Avoid large numbers of messages
- Typically perform larger transfers, initiate
remote jobs rather than procedure calls
69Software Issues in Distributed Operating
Systems
- Synchronization
- Clock synchronization
- Election algorithms determine a coordinator
- Atomic transactions
- In a grid
- With wide-area latencies, typically perform
synchronization on larger grain - Can implement atomic operations
70Software Issues inDistributed Operating Systems
- Processes and Processors
- Threads
- Allocating Processors
- Scheduling and co-scheduling resources
- Fault tolerance
- In a grid scheduling, allocation, fault
tolerance issues get more complicated in the wide
area environment
71Software Issues in a Distributed Operating
System
- Distributed file systems
- File service that reads and writes file, controls
access - Creating, deleting managing directories
- Naming
- Sharing
- Caching and consistency
- Replication and updates
- In a grid, same issues complicated by wide area
distribution, different administrative domains,
enormous data sets
72Software Issues for a Distributed Operating
System
- Distributed Shared Memory
- Generally applies to machines in a LAN
- Each processor contains memory corresponding to
part of the shared memory address space - Each processor caches data from other processors
- Many consistency algorithms
- In a grid EASIER! Globus does not support a
shared address space - Legion has a single shared object space
73Summary Heterogeneity makes things harder in a
grid
- Heterogeneous software and hardware
- Different administrative domains
- Different policies for use and management of
local resources - Must do coordinated scheduling
- Different security policies
- Dynamic environment
- Must discover resources
- Robust in the presence of network, resource
failures
74Today
- How does grid computing differ from traditional
distributed computing? - Where do grids get their names?
- Grid Hardware
- Grid Applications
75Where do computational grids get their names?
- A computational grid is a hardware and software
infrastructure that provides dependable,
consistent, pervasive, and inexpensive access to
high-end computational capabilities. - Name (and definition) imply an analogy to the
electric power grid - Power inexpensive, universally available
- Enabled new devices and industries
76An Infrastructure AnalogyThe Electric Power Grid
- Revolutionary development transmission and
distribution of electricity - Before power accessible in crude forms
- human work
- horses
- water power
- steam engines
- Today cheap, reliable power universally
available
77Electric Power Grid (cont.)
- Power to billions of devices
- Efficient
- Low-cost
- Reliable
- North America 10,000 generators linked to
billions of outlets - Heterogeneous components, distributed ownership
- Interconnections between regions
share reserve capacity, trade excess power
78Electric Power Grid (cont.)
- Required more than just technology
- Regulatory, political and institutional
development - Infrastructure for monitoring and management
- Huge social impact
- Fundamentally changed work and home life
- Huge environmental impact
- Consume resources, generate pollution, global
warming,
79Another Infrastructure Analogy Railroads and
the Rise of Chicago
- Early 1800s Chicago was a small field of onions
on a very large lake - Impact of railroad infrastructure
- Trains used for shipment of goods
- Chicago a cache for agricultural products
- New financial institutions, technologies and
industries - Board of Trade
- Stockyards, refrigerated rail cars
- Midwests native ecosystems destroyed
- Bison replaced by cattle
- Prairie replaced by wheat and corn fields
80New Infrastructure Has Serious Social
Consequences
- More examples highway system, telephone
network, banking system - What changes will the Grid infrastructure bring
about? - Enable unimagined applications
- Likely to have positive and negative effects
- Are we ready to deal with the rate of change?
- Processing power, bandwidth, storage all growing
exponentially
81Based on Infrastructure Analogies Desired
Characteristics of Grids
- Pooling of resources
- Compute cycles, data, people, sensors
- Dependable service
- Predictable
- Sustained performance
- Often high-performance
82Grid Characteristics (cont.)
- Consistent service
- Standard services available
- Via standard interfaces
- Enable application development
- Pervasive
- Services always available
- Inexpensive
- Otherwise not widely accepted and used
83Application Examples
- Online instrumentation
- Distributed supercomputing
- Collaborative engineering
- High-throughput computing
- Remote job submission, meta-queueing
84Online Instrumentation
Advanced Photon Source
wide-area dissemination
desktop VR clients with shared controls
real-time collection
archival storage
tomographic reconstruction
DOE X-ray grand challenge ANL, USC/ISI, NIST,
U.Chicago
85Grid ApplicationsDistributed Supercomputing
- Solve problems that cannot be solved using a
single system - Example applicationdistributed, interactive
simulation involving 100,000s of entities - Difficult issues
- Co-scheduling of scarce, expensive resources
- Algorithms that scale to many nodes and tolerate
latency - Achieving and sustaining high performance across
heterogeneous systems
86Globus ExampleDistributed Supercomputing
- SF-Express distributed, interactive simulation
- 100K vehicles (2002 goal) using 13 computers,
1386 nodes, 9 sites - Largest DIS ever done
- Globus mechanisms for
- Resource allocation
- Distributed startup
- I/O and configuration
- Fault detection
NCSA Origin
Caltech Exemplar
CEWES SP
Maui SP
P. Messina et al., Caltech
87Grid ApplicationsHigh-Throughput Computing
- Schedule large numbers of loosely-coupled or
independent tasks - Tie together idle workstations
- Put unused cycles to work
- Example applications chip design, solving
cryptographic problems - Systems
- Condor manages pool of hundreds of workstations
around the world - Entropia startup company
88High-Throughput ComputingSETI_at_home
89Grid ApplicationsData-Intensive Computing
- Geographically distributed data repositories,
digital libraries and databases - Up to petabytes of data
- Example applications High-energy physics
experiments, climate modeling, human genome
project databases - Challenging Issues
- High-performance data transfers in wide-area
environments - Management of caching and replication
90Globus Data-Intensive Computing
How do midwest flood frequencies under 2xCO2
scenario compare with historical data?
91Grid ApplicationsCollaborative Computing
- Enabling and enhancing human interactions
- Virtual shared spaces
- Shared resources data archives, simulations
- Example applications collaborative design or
collaborative exploration of data sets - Challenges
- Real-time requirements of human perception
- Rich interactions
92Globus ExampleCollaborative Engineering
Manipulate shared virtual space Simulation
components Multiple flows Control, Text, Video,
Audio, Database, Simulation, Tracking,
Haptics, Rendering Uses Globus communication
CAVERNsoft UIC Electronic Visualization
Laboratory
93Grids Changing Science
- NSF National Earthquake Engineering Center
- Integrated instrumentation, collaboration,
simulation environment - National Environmental
- High-energy Physics Grid (GriPhyn)
- CERN Data Grid
94Current and Future Applications
- Interesting applications exist today
- More sophisticated applications will follow
- Characteristics
- Appetite for resources (CPU, memory, storage)
- Synchronization
- Only satisfied by multiple systems
- Need high availability of resources
95Who will use grids?
- 1. Governments
- Disaster response, national defense, national
collaboratory, strategic
computing reserve - 2. Private grids for institutions
- Relatively low-cost, small-scale
- Central management
- Example hospitals and medical personnel
96Who will use grids? (cont.)
- 3. Virtual Grid
Multi-institution collaboration - Large, fluid, highly-distributed community
- Hundreds of researchers and students around the
world - Share instruments, data archives, software,
computers - Public Grid
- Enormous community
- Consumers, service providers, resource providers,
network providers
97Summary
- Grids will change the way we do science and
engineering - Transition of services and application to
production use - Future will see increases sophistication and
scope of services, tools, and applications