Globus - PowerPoint PPT Presentation

About This Presentation
Title:

Globus

Description:

Title: Globus Project Future Directions Author: Ian Foster Last modified by: Ian Foster Created Date: 7/22/1999 10:07:28 PM Document presentation format – PowerPoint PPT presentation

Number of Views:246
Avg rating:3.0/5.0
Slides: 62
Provided by: IanF235
Learn more at: https://www.mcs.anl.gov
Category:

less

Transcript and Presenter's Notes

Title: Globus


1
GlobusBridging the Gap
Ian Foster Argonne National Laboratory University
of Chicago Univa Corporation
Steve Tuecke CEO, Univa Corporation
2
Overview
  • Why Globus
  • New partners
  • Whats next
  • The larger context
  • The Globus community

3
Overview
  • Why Globus
  • Motivation
  • Uses
  • New partners
  • Whats next
  • The larger context
  • The Globus community

4
The Application-Infrastructure Gap
  • Dynamicand/orDistributedApplications

5
Bridging the GapService-Oriented Infrastructure
Users
  • Service-oriented applications
  • Wrap applications asservices
  • Compose applicationsinto workflows
  • Service-orientedinfrastructure
  • Provision physicalresources to support
    application workloads

Composition
Workflows
Invocation
ApplnService
ApplnService
Provisioning
6
Globus is Service-Oriented Infrastructure
Technology
  • Software for service-oriented infrastructure
  • Service enable new existing resources
  • E.g., GRAM on computer, GridFTP on storage
    system, custom application service
  • Uniform abstractions mechanisms
  • Tools to build applications that exploit
    service-oriented infrastructure
  • Registries, security, data management,
  • Open source open standards
  • Each empowers the other
  • Enabler of a rich tool service ecosystem

7
Globus asService-Oriented Infrastructure
User Application
User Application
User Application
Database
Specialized resource
Computers
Storage
8
A Typical eScience Use of GlobusNetwork for
Earthquake Eng. Simulation
Links instruments, data, computers, people
9
An eBusiness Use of GlobusSAP Demonstration _at_
GlobusWorld
  • 3 Globus-enabled applns
  • CRM Internet Pricing Configurator (IPC)
  • CRM Workforce Management (WFM)
  • SCM Advanced Planner Optimizer (APO)
  • Applications modified to
  • Adjust to varying demand resources
  • Use Globus to discover provision resources

SAP AG R/3 Internet Pricing Configurator (IPC)
10
Cheaper and/or Faster ...
1
9
1
9
11
Overview
  • Why Globus
  • New partners
  • Globus Alliance
  • Globus Consortium
  • Univa
  • Whats next
  • The larger context
  • The Globus community

12
How Globus Works
  • Globus is a distributed open source community
    with many contributors users
  • CVS, documentation, bugzilla, email lists
  • Modular structure allows many to contribute
  • Globus Alliance Board provides governance when
    needed
  • Meritocracy individuals who demonstrate ongoing
    contributions commitment
  • Primarily what to include, when to release
  • Globus Alliance is an informal partnership of
    organizations led by Board members

13
Evolution of the Globus Alliance
  • Argonne/U.Chicago (Childers, Foster) 1995
  • USC/ISI (Kesselman) 1995
  • Edinburgh (Atkinson, Parsons) 2003
  • Swedish PDC (Johnsson, Mulmo) 2003
  • NCSA (Welch) 2004
  • Univa (Czajkowski, Tuecke) 2004
  • Other contributors will surely be added

14
From eScience to eBusiness
  • Since 2001, growing interest in Globus for
    commercial use
  • Enterprises, IT vendors, ISVs asking Globus
    leaders to address commercial needs
  • But hard to do in a research laboratory
  • In response, we have created two new
    organizations
  • Globus Consortium
  • Univa

15
Globus Consortium(www.globusconsortium.com)
  • Nonprofit organization funded by companies to
    advance Globus Toolkit for enterprise use
  • Initial sponsor members HP, IBM, Intel, Sun
  • Initial contributors Nortel, Univa
  • First two projects already identified
  • Member-driven software quality improvements
  • Contributions to job submission standards
  • Other projects to be defined, e.g.
  • Develop new features key to enterprise use
  • Education outreach

16
  • Provider of commercial support, services,
    products around open source Globus
  • Commercial distribution of GT4 beyond
  • Integration with enterprise systems
  • Committed to open source open standards
  • Founded by Tuecke, Foster, Kesselman
  • Tuecke left Argonne to be CEO
  • Foster, Kesselman remain at Argonne, ISI
  • Experienced management team
  • Rich Miller, Vas Vasiliadis, Paul Davé, Bob
    Mandel

17
Overview
  • Why Globus
  • New partners
  • Whats next
  • Globus Toolkit v4
  • The larger context
  • The Globus community

18
Globus asService-Oriented Infrastructure
User Application
User Application
User Application
Tool
Reliable File Transfer
Tool
Uniform interfaces, security mechanisms, Web
service transport, monitoring
MDS-Index
MyProxy
DAIS
GRAM
GridFTP
Database
Specialized resource
Computers
Storage
19
Globus Toolkit
  • Core Web services
  • Infrastructure for building new services
  • Security
  • Apply uniform policy across distinct systems
  • Execution management
  • Provision, deploy, manage services
  • Data management
  • Discover, transfer, access large data
  • Monitoring
  • Discover monitor dynamic services

20
Globus Open Source Grid Software
Web ServicesComponents
Non-WS Components
Data Management
Security
CommonRuntime
Execution Management
Information Services
21
GT4 Components
Your C Client
Your Python Client
Your Java Client
Your Python Client
Your Python Client
Your C Client
Your C Client
CLIENT
Your Java Client
Your Java Client
Your Python Client
Your C Client
Your Java Client
Interoperable WS-I-compliant SOAP messaging
X.509 credentials common authentication
RFT
GRAM
Delegation
Index
Trigger
Archiver
Your C Service
CAS
OGSA-DAI
Your Python Service
GTCP
Your Java Service
Your Java Service
RLS
Pre-WS MDS
SimpleCA
MyProxy
GridFTP
Pre-WS GRAM
C WS Core
pyGlobus WS Core
Java Services in Apache Axis Plus GT Libraries
and Handlers
C Services using GT Libraries and Handlers
Python hosting, GT Libraries
SERVER
22
Our Goals for GT4
  • Usability, reliability, scalability,
  • Web service components have quality equal or
    superior to pre-WS components
  • Documentation at acceptable quality level
  • Consistency with latest standards (WS-, WSRF,
    WS-N, etc.) and Apache platform
  • WS-I Basic (Security) Profile compliant
  • New components, platforms, languages
  • And links to larger Globus ecosystem

23
GT4 Web Services Core
  • Supports both Globus services (GRAM, RFT,
    Delegation, etc.) user-developed services
  • Redesign to enhance scalability, modularity,
    performance, usability
  • Leverages existing WS standards
  • WS-I Basic Profile WSDL, SOAP, etc.
  • WS-Security, WS-Addressing
  • Adds support for emerging WS standards
  • WS-Resource Framework, WS-Notification
  • Java, Python, C hosting environments

24
GT4 Web Services Core
25
Grid Security
  • Control access to shared services
  • Address autonomous management, e.g., different
    policy in different work-groups
  • Support multi-user collaborations
  • Federate through mutually trusted services
  • Local policy authorities rule
  • Allow users to set up dynamic trust domains
  • Personal collection of resources working together
    based on trust of user

26
GT4 Security
Users
27
GT4 Security
  • Extensible authorization framework based on Web
    services standards
  • SAML-based authorization callout
  • As specified in GGF OGSA-Authz WG
  • Integrated policy decision engine
  • XACML policy language, per-operation policies,
    pluggable
  • Credential management services
  • MyProxy (One time password support)
  • Community Authorization Service
  • Standalone delegation service
  • Implements IETF RFC

28
GT4s Use of Security Standards
29
GT4 Data Management
  • Stage large data to/from nodes
  • Replicate data for performance reliability
  • Locate data of interest
  • Provide access to diverse data sources
  • File systems, parallel file systems, hierarchical
    storage (GridFTP)
  • Databases (OGSA DAI)

30
GT4 Data Functions
  • Find your data Replica Location Service
  • Managing 40M files in production settings
  • Move/access your data GridFTP, RFT
  • High-performance striped data movement
  • 27 Gbit/s memory-to-memory on a 30 Gbit/s link
    (90 utilization) with 32 IBM TeraGrid nodes.
  • 17.5 Gbit/s disk-to-disk limited by the storage
    system
  • Reliable movement of 120,000 files (so far)
  • Couple data execution management
  • GRAM uses GridFTP and RFT for staging

31
GridFTP in GT4
Disk-to-disk onTeraGrid
  • 100 Globus code
  • No licensing issues
  • Stable, extensible
  • IPv6 Support
  • XIO for different transports
  • Striping ? multi-Gb/sec wide area transport
  • Pluggable
  • Front-end e.g., future WS control channel
  • Back-end e.g., HPSS, cluster file systems
  • Transfer e.g., UDP, NetBLT transport

32
Reliable File TransferThird Party Transfer
  • Fire-and-forget transfer
  • Web services interface
  • Many files directories
  • Integrated failure recovery

RFT Client
SOAP Messages
Notifications(Optional)
RFT Service
GridFTP Server
GridFTP Server
33
Replica Location Service
  • Identify location of files via logical to
    physical name map
  • Distributed indexing of names, fault tolerant
    update protocols
  • GT4 version scalable stable
  • Managing 40 million files across 10 sites

Index
Index
Local DB Update send (secs) Bloom filter (secs) Bloom filter (bits)
10K lt1 2 1 M
1 M 2 24 10 M
5 M 7 175 50 M
34
Execution Management (GRAM)
  • Common WS interface to schedulers
  • Unix, Condor, LSF, PBS, SGE,
  • More generally interface for process execution
    management
  • Lay down execution environment
  • Stage data
  • Monitor manage lifecycle
  • Kill it, clean up
  • A basis for application-driven provisioning

35
GT4 GRAM
  • 2nd-generation WS implementation optimized for
    performance, flexibility, stability, scalability
  • Streamlined critical path
  • Use only what you need
  • Flexible credential management
  • Credential cache delegation service
  • GridFTP RFT used for data operations
  • Data staging streaming output
  • Eliminates redundant GASS code

36
GT4 GRAM StructureWSRF/WSN Poster Child
Service host(s) and compute element(s)
GT4 Java Container
Compute element
Local job control
GRAM services
GRAM services
Local scheduler
Job functions
sudo
GRAM adapter
Delegate
Transfer request
Delegation
Client
Delegate
GridFTP
User job
RFT File Transfer
FTP control
FTP data
Remote storage element(s)
GridFTP
37
Monitoring and Discovery
  • Every service should be monitorable and
    discoverable using common mechanisms
  • WSRF/WSN provides those mechanisms
  • A common aggregator framework for collecting
    information from services, thus
  • MDS-Index Xpath queries, with caching
  • MDS-Trigger perform action on condition
  • MDS-Archiver Xpath on historical data
  • Deep integration with Globus containers
    services every GT4 service is discoverable
  • GRAM, RFT, GridFTP, CAS,

38
GT4 Monitoring Discovery
Clients (e.g., WebMDS)
GT4 Container
WS-ServiceGroup
MDS-Index
Registration WSRF/WSN Access

adapter
GT4 Cont.
GT4 Container
MDS-Index
MDS-Index
Custom protocols for non-WSRF entities
Automated registration in container
GridFTP
RFT
GRAM
User
39
GT4 Documentationis Much Improved!
40
Working with GT4
  • Download and use the software, and provide
    feedback
  • Join gt4friends_at_globus.org mail list
  • Review, critique, add to documentation
  • Globus Doc Project http//gdp.globus.org
  • Tell us about your GT4-related tool, service, or
    application
  • Email foster_at_mcs.anl.gov

41
Overview
  • Why Globus
  • New partners
  • Whats next
  • The larger context
  • Web services, utility computing, virtualization,
  • The Globus community

42
Infrastructure ComplexityWhats the Answer?
  • Web services
  • Utility computing
  • Virtualization
  • Data center automation
  • Adaptive enterprise

Recall Service-oriented applications
demand service-oriented infrastructure
43
Overview
  • Why Globus
  • New partners
  • Whats next
  • The larger context
  • The Globus community
  • Whos using Globus?
  • What are they using it for?
  • The Globus ecosystem
  • Growing the Globus community

44
  • 26776 U.S.
  • 2753 China
  • 1318 Japan
  • 1017 India
  • 750 U.K.
  • 495 Italy
  • 488 Germany
  • 391 Brazil
  • 328 S. Korea
  • 306 Taiwan
  • 268 France
  • 241 Canada
  • 211 Viet Nam
  • 211 Spain
  • 202 Russia
  • 187 Pakistan
  • 159 Australia
  • 142 Singapore
  • 131 Greece

68 Belgium 66 Venezuela 66 Romania 64
Indonesia 62 Mexico 61 Turkey 60 Malaysia 58
Portugal 57 Austria 54 Ireland 44 Hong Kong
40 Hungary 38 Egypt 38 Argentina 34 Uruguay
31 Ukraine 29 Slovakia 25 Israel 23
Yugoslavia 23 Iran 22 Bulgaria 22 Uzbekistan
22 Czech Rep. 22 N. Korea 21 Lithuania 21
Croatia
20 Finland 20 New Zealand 17 Nigeria 17 South
Africa 16 Jordan 16 Slovenia 16 Afghanistan
15 Denmark 15 Philippines 14 Vanuatu 14
Luxembourg 14 Tunisia 12 Virgin Is. (U.K.) 12
Peru 12 Yemen 11 Norway 11 Algeria 11 Armenia
10 Iceland 9 Zambia 9 Virgin Is. (U.S.) 9
Uganda 9 Bosnia Herz. 8 Kenya 7 Zimbabwe 7
Saudi Arabia
7 Ecuador 7 Macedonia 6 Bolivia 6 Comoros 6
Zaire 6 Lebanon 5 Puerto Rico 5 Namibia 5
Togo 5 Tajikistan 5 Paraguay 5 Albania 5
Sudan 4 Estonia 4 Camaroon 4 Ghana 4 Tuvalu
4 Costa Rica 4 Cuba 4 UAE 4 Tonga 4 W. Samoa
4 Tanzania 3 Syria 3 Bahamas 3 Ethiopia
3 Mongolia 3 Sri Lanka 3 Wallis Futuna Is. 3
Belarus 3 Bangladesh 2 Falkland Islands 2
Kuwait 2 Sierra Leone 2 Trinidad Tobago 2
Guyana 2 American Samoa 2 Andorra 2 Georgia 2
Cook Islands 2 Turkmenistan 2 Gabon 2 The
Gambia 2 Kazakhstan 2 Macau 2 Malta 2 Jamaica
2 Latvia 2 Turks Caicos 1 Bhutan 1 Ascension
Island 1 Cyprus
1 Mozambique 1 Tokelau 1 Greenland 1 Nepal 1
Swaziland 1 Iraq 1 Serbia 1 Barbados 1
Cambodia 1 Qatar 1 Saint Vincent 1 Laos 1 San
Marino 1 Libya 1 Benin 1 Angola 1 Chad 1
Gibraltar 1 Haiti 1 Guatemala 1 Malawi 1
Equatorial Guinea 1 Palau 1 Bermuda 1 Botswana
1 Suriname
38669 downloads in 2004 from globus.org
45
  • 26776 U.S.
  • 2753 China
  • 1318 Japan
  • 1017 India
  • 750 U.K.
  • 495 Italy
  • 488 Germany
  • 391 Brazil
  • 328 S. Korea
  • 306 Taiwan
  • 268 France
  • 241 Canada
  • 211 Viet Nam
  • 211 Spain
  • 202 Russia
  • 187 Pakistan
  • 159 Australia
  • 142 Singapore
  • 131 Greece

68 Belgium 66 Venezuela 66 Romania 64
Indonesia 62 Mexico 61 Turkey 60 Malaysia 58
Portugal 57 Austria 54 Ireland 44 Hong Kong
40 Hungary 38 Egypt 38 Argentina 34 Uruguay
31 Ukraine 29 Slovakia 25 Israel 23
Yugoslavia 23 Iran 22 Bulgaria 22 Uzbekistan
22 Czech Rep. 22 N. Korea 21 Lithuania 21
Croatia
20 Finland 20 New Zealand 17 Nigeria 17 South
Africa 16 Jordan 16 Slovenia 16 Afghanistan
15 Denmark 15 Philippines 14 Vanuatu 14
Luxembourg 14 Tunisia 12 Virgin Is. (U.K.) 12
Peru 12 Yemen 11 Norway 11 Algeria 11 Armenia
10 Iceland 9 Zambia 9 Virgin Is. (U.S.) 9
Uganda 9 Bosnia Herz. 8 Kenya 7 Zimbabwe 7
Saudi Arabia
7 Ecuador 7 Macedonia 6 Bolivia 6 Comoros 6
Zaire 6 Lebanon 5 Puerto Rico 5 Namibia 5
Togo 5 Tajikistan 5 Paraguay 5 Albania 5
Sudan 4 Estonia 4 Camaroon 4 Ghana 4 Tuvalu
4 Costa Rica 4 Cuba 4 UAE 4 Tonga 4 W. Samoa
4 Tanzania 3 Syria 3 Bahamas 3 Ethiopia
3 Mongolia 3 Sri Lanka 3 Wallis Futuna Is. 3
Belarus 3 Bangladesh 2 Falkland Islands 2
Kuwait 2 Sierra Leone 2 Trinidad Tobago 2
Guyana 2 American Samoa 2 Andorra 2 Georgia 2
Cook Islands 2 Turkmenistan 2 Gabon 2 The
Gambia 2 Kazakhstan 2 Macau 2 Malta 2 Jamaica
2 Latvia 2 Turks Caicos 1 Bhutan 1 Ascension
Island 1 Cyprus
1 Mozambique 1 Tokelau 1 Greenland 1 Nepal 1
Swaziland 1 Iraq 1 Serbia 1 Barbados 1
Cambodia 1 Qatar 1 Saint Vincent 1 Laos 1 San
Marino 1 Libya 1 Benin 1 Angola 1 Chad 1
Gibraltar 1 Haiti 1 Guatemala 1 Malawi 1
Equatorial Guinea 1 Palau 1 Bermuda 1 Botswana
1 Suriname
Top 10
26776 U.S. 2753 China 1318 Japan 1017
India 750 U.K. 495 Italy 488
Germany 391 Brazil 328 S. Korea 306
Taiwan
38669 downloads in 2004 from globus.org
46
GlobalCommunity
47
Grids for Physics LHC Computing Grid
48
Paradigm Shift Transition to Grid
424 CPU Years
Jun 8020 25 of DC04
May 8911 11 of DC04
Aug 2773 42 of DC04
Jul 7723 22 of DC04
Slide Tony Doyle, University of Glasgow
49
  • Grid2003 ? Open Science Grid
  • 30 sites (2100-2800 CPUs) growing
  • 400-1300 concurrent jobs
  • 8 substantial applications CS experiments
  • Running since October 2003

Korea
http//www.ivdgl.org/grid2003
50
ExampleOSGWorkflows
Genome sequence analysis
Sloan digital sky survey
Physics data analysis
51
Further Application Examples
Fusion Grid 1000s of jobs
STAR 5 TB transfer (SRM, GridFTP)
Earth System Grid O(100TB) online data
NASA/NVO Mosaics from multiple sources
52
???? DartGrid?????
Weve used GT3 to build the largest database
grid system for Traditional Chinese Medicine,
integrating about 50 TCM-relevant databases.
(Zhejiang U.)
53
Nimrod-G, Australia
Earth Sciences
Astrophysics
Air Pollution
Antenna Design
Computational Chemistry
Circuit Design
Monte Carlo
Cardiac Modelling
CFD
Airfoil Design
Climate
Public Health Policy
54
The Globus Ecosystem
  • Globus components address core issues relating to
    resource access, monitoring, discovery, security,
    data movement, etc.
  • GT4 being the latest version
  • A larger Globus ecosystem of open source and
    proprietary components provide complementary
    components
  • A growing list of components
  • These components can be combined to produce
    solutions to Grid problems
  • Were building a list of such solutions

55
Many Tools Build on, or Can Contribute to,
GT4-Based Grids
  • Condor-G, DAGman
  • MPICH-G2
  • GRMS
  • Nimrod-G
  • Ninf-G
  • Open Grid Computing Env.
  • Commodity Grid Toolkit
  • GriPhyN Virtual Data System
  • Virtual Data Toolkit
  • GridXpert Synergy
  • Platform Globus Toolkit
  • VOMS
  • PERMIS
  • GT4IDE
  • Sun Grid Engine
  • PBS scheduler
  • LSF scheduler
  • GridBus
  • TeraGrid CTSS
  • NEES
  • IBM Grid Toolbox

56
DocumentingThe Grid Ecosystem
The Grid Ecosystem Software Components for Grid
Systems And Applications
www.grids-center.org
57
Example Solutions
  • Portal-based User Reg. System (PURSE)
  • VO Management Registration Service
  • Service Monitoring Service
  • TeraGrid TGCP Tool
  • Lightweight Data Replicator
  • GriPhyN Virtual Data System

58
The Globus Developer Community
  • We want to facilitate community contributions to
    the Globus software
  • Increasingly open governance mechanisms
  • Documentation of larger ecosystem
  • What else can we do?
  • Tell us how you think we should evolve
  • E.g., adopt Apache processes?
  • Contribute more of Globus to Apache?
  • Weve already made major contributions to Apache
    Axis, WS-Addressing, and WS-Security
  • Also WSRF/WSN ? Apache (Apollo, Hermes)

59
Globus and its User Community
  • How can we best support you?
  • We try to provide the best software we can
  • We use bugzilla other community tools
  • We work to grow the set of contributors
  • How can you best support us?
  • Become a contributor of software, bug fixes,
    answers to questions, documentation
  • Provide us with success stories that can justify
    continued Globus development
  • Promote Globus within your communities
  • Contact me at foster_at_mcs.anl.gov

60
2005 and Beyond
  • We have a solid Web services base
  • We now want to build, on that base, a open source
    service-oriented infrastructure
  • Virtualization
  • New services for provisioning, data management,
    security, VO management
  • End-user tools for application development
  • Etc., etc.
  • See Carl Kesselmans talk for more details!

61
SummaryGlobus is Bridging the Gap
  • Application to infrastructure
  • Science to commercial
  • Web services to Grid
  • Data to computation
  • Silo to silo
  • Organization to organization
  • Country to country
  • Developer to developer
  • Developer to user

62
Acknowledgments
  • Carl Kesselman, our long-time Globus
    co-conspirators
  • Gregor von Laszewski, Kate Keahey, Jennifer
    Schopf, Mike Wilde, other Argonne colleagues
  • Globus Alliance members at Argonne, U.Chicago,
    USC/ISI, Edinburgh, PDC
  • Miron Livny, U.Wisconsin Condor project, Rick
    Stevens, Argonne U.Chicago
  • Other partners in Grid technology, application,
    infrastructure projects
  • DOE SciDAC program, NSF Middleware Initiative,
    NASA, IBM for generous support

63
For More Information
  • Globus Alliance
  • www.globus.org
  • Globus Consortium
  • www.globusconsortium.com
  • Global Grid Forum
  • www.ggf.org
  • Background information
  • www.mcs.anl.gov/foster

2nd Edition www.mkp.com/grid2
Write a Comment
User Comments (0)
About PowerShow.com