Title: PowerPoint-Pr
1Sun in Scientific Engineering Computing
Grid Computing for Life Sciences
Wolfgang Gentzsch
Director Grid Computing
BioGrid Symposium, Singapore, October, 2001
2The BIG Challenges Reality !
- Computing Reality Moore's Law
- CPU "power" 2X/18-24 months, constant cost
- Genomics Reality
- Information now 2X/6 months (Genbank), 4/01
12.4B 11/mo - Interoperability Post-Genomics is the BIG
PROBLEM - Biopharma/Economic Reality
- NCE's/year dropping RD increasing
- Academic Reality
- My favorite URLs
- Run my algorithms 24x7 !
3Computing Reality The Net Effect Take it to the
nth
CPU Density/Power, Connectivity/Bandwidth,
Node/Value
Net Effect
MetcalfesLaw
1,000,000X
100,000X
GildersLaw
10,000X
1,000X
MooresLaw
100X
10X
1X
1980
1990
2000
4Genomics Reality Without IT, Data is
Just....Data!
Data
Information
Knowledge
Action
5Economic Reality Drug Discovery is Frustrating!
- Industry Expectations
- 7 Market Growth
- 3-5 NCEs/annum
- RD Costs 350M-500M
- Average sales 265M/annum/drug
- Realities
- Only realizable in areas of therapeutic and
geographical strength - Currently 0.6 NCE/annum
- Best estimates are approx 700M/drug
- Only 10 achieve gt180M in annual sales
6Key Bioinformatics Software on Solaris
- BLASTx, FASTA, SMITH-WATERMAN
- Phred/Phrap/Consed, Cross_Match, LASSAP
- HMMx, CLUSTALx, FrameMatch, D2, NCBI Toolkit
- EMBOSS, Artemis, Phylip, Darwin, MAGPIE
- BioSCOUT, SRS, BIOPENDIUM, GCG
- DoubleTwist, InforMax, ExPASy, ISYS
- Oracle, SQLGT/LIMS, ... Etc.!
- xmultiple versions
7Selected Major Accounts
- Academic
- U MN, U WI, SDSC/NPACI, NCGR, Wash U (St Louis),
Harvard, Rockefeller, CBR-RBC (Canada), UCL,
Cambridge U, Humboldt U (Ger), Sydney U, U
Queensland, NHRI (Taiwan), Weizmann, InfoBiogen,
U. Tokyo, Virginia Bioinformatics Institute,
Delaware Bioinformatics Institute, Beijing
Genomics Institute, etc. - Commercial
- Most BioPharma, Monsanto, Genset, Gene-IT,
Keygene, Incyte, OGS, MGW Biotech, DNA Print
Genomics, etc.
8Executive Support for Life Sciences
"Sun is committed to working with the life
science community to identify and tackle
computing/informatics challenges and requirements
in the post-genomics era."
Dr Greg Papadopoulos Sr VP CTO, 2/01
9SE Computational Biology Initiative
Vertical Initiatives
Geoscience GIS Weather/Climate Seismic
Engineering MCAE EE eEngineering
BioX/Comp. Bio. Bioinformatics Proteomics Pgx
Technology Desktop (Scientific Desktop,
Visualization, DCC, Thin Clients and Development
of solution stacks)
Grid Computing (Showcase Implementations of
iPlanet-Portal, Sun Grid Engine, SMC, Sun
Clustertools for Tier 1 Tier 3 Grid Computing)
10- Sun Community Support
- HPCGrid
- HPC Consortium
- Computational Biology Special Interest Group
- COE
- Informatics Advisory Council (SDN)
- Events
11Network of Excellence Centers
- Steering Board for every Section
- Regional/Global Events
- COEs around the world
- Industry Collaborations
Company 1
Center A
Center B
Center F
BioComputing Section
Company 2
Center C
Center E
Center D
12COE in Computational Biology
- U. Wisconsin Madison
- Virginia Bioinformatics Institute
- Beijing Genomics Institute
- Delaware Biotechnology Institute
- ...
- Other COEs with CB Components
- Ohio Supercomputer Center/Children's Hosp.
Cincinnati
13 Summary Sun's Grid Computing Offerings
- Sun's existing scalable Grid Computing software
stack - Open source building blocks (SGE, Broker,
ClusterTools, TCP Portal, Jxta, ...) - Encourage Your research contribution to open
source (community) - Integration with Globus etc. (SGE/Broker-Globus-S
GEs) - Sun Center of Excellence Program (cooperation ! )
- Collaboration, joint Grid projects, Sun GridSIG,
. . .
14Different Levels of Grids
- Stage 1 - 1 Owner / 1
ClusterCluster Grid Domain of SGE
Technical Computing Stack - Stage 2 - Multiple Owners, 1 Clusters, 1
Enterprise, 1 SiteCampus Grid Domain of
SGE/EE Multicluster Solutions - Stage 3 - Multiple Sites, Multiple
EnterprisesGlobal Grid Domain
of SGE/EE plus Grid Frameworks
15(No Transcript)
16Grid Computing _at_ Sun
- "The Network is the Computer"
- Java, Jini, Jxta, . . .
- July'00 Acquisition of Gridware
- "Grid" projects since 1995, Julius, Medusa,
Eroppa, Unicore, Autobench, . . . - Grid Engine, free, open source, ubiquitous, open
API - Department for Grid Computing (inSun VSP)
Cluster SW/Stack, Grid SW/Stack, Grid Computing
Lab, customer pilots - Sun Grid Computing Council
17(No Transcript)
18 Sun Technical Computing Portal "prototype"The
only (soon) commercially available hw/sw
solution that...
- Enables quick deployment of tech apps over
Internet, similar to mail and calendaring - Combines light-weight architecture with
- Industry-proven security and system management
- Based on iPlanet and Sun Grid Engine
19(No Transcript)
20 Sun Distributed Resource Management
- Load balancing maximizes resource utilization
- Transparent job submission machine selection
- Monitoring and accounting
gt SGE Sun Grid Engine, open source - Guaranteeing required resources
- Full control over resource utilization
- Fair and share based resource usage
- Implementation of management policies
gt
Sun Grid Engine Broker, open source
21Managing Compute Resources with Sun
Grid Engine Broker
Project C
Team B-4
User 1
Department 5
Department 4
Contractor X
User 2
Department 3
Project A
Department 2
Manage the full matrix of demand - Users
-Teams -Projects
Department 1
Department resource access
Campus wide resource demand
22Sun Grid Engine Status
- Ubiquitous, free, open source, open APIs
- Current Release SGE 5.2.3 (July 2001)
- Over 12,000 downloads (Sept 2001)
(1 Mio downloads in 2084)
gtgt SGE The Leading
RMS - OpenSource (July 2001) 500,000 lines,
Sept 2001
1000 downloads - Today Grid Computing everywhere in Sun !!
www.sun.com/gridware .../hpc .../edu
/...
23(No Transcript)
24 Globus SGE/Broker on Top of SGE
- Demo'd at Argonne National Lab ANL, ARL Army
Research Lab, Raytheon, and San Diego SDSC - On 2 SGE clusters (eg SDSC, 30 cpus and 70
cpus) - Globus/SGE interaction through GRAM scripts
- Globus jobs from ANL submitted to ARL cluster
- Next step SGE/EE on top of Globus
25 SGE/Broker as Part of Globus
- SGE Scheduling decisions to select remote site
- SGE acting as the resource broker for Globus
- Globus multi-site communication, authentication,
security, file transfers,... - SGE/Globus interface to be developed
- SGE/Broker submits and tracks jobs to remote
systems using Globus services
26 Sun and Open Grid Standards
- Example DRMAA Distributed Resource Management
Application API - "The Glue" between Distributed Resource
Management and Applications/Tools - gt Makes resource management transparent
- Proposed new Working Group at Global Grid Forum
in Frascati/Rome, October 2001 - Presented by Veridian, Intel, and Sun
27 Sun's Grid Strategy
- Strong Grid core team for developing and
productizing core components (like SGE, Grid
Broker,TCP) - Sun Grid Computing Council Integrate Sun
technologies and products and port the
environment to all Sun platforms - Sun's partners take care of other computing
platforms - Collaborate with the Grid community, IT partners
and our customers to build all kinds of different
Grids - Sun currently is proposing, designing and
building some 50 Grids with research labs,
universities and industry
Sun Grid software stack available TODAY