Henri Casanova - PowerPoint PPT Presentation

Title: Henri Casanova


1

The NPACI MCell project: applications, software,
research, and impact
  • Henri Casanova
  • Grid Research And Innovation Laboratory (GRAIL)
  • San Diego Supercomputer Center
  • Computer Science and Engineering Dept.
  • University of California, San Diego

2
The MCell Project
  • MCell: Monte Carlo Cell simulator
  • Developed at Salk and PSC
  • Gain knowledge about neuro-transmission
  • Fundamental for drug design (psychiatry)
  • Large user base (yearly MCell workshop)
  • Parallel MC simulations at the molecular level

3
Activated receptors
4
The MCell application
5
A General Model
Input data
Tasks
Raw Output
Post-processing
Final Output
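The general model above (input data, independent tasks, raw output, post-processing, final output) can be sketched in a few lines. This is only an illustration under assumptions: the task is a toy Monte Carlo estimate of pi standing in for a real simulation run, and the names `run_task`, `post_process`, and `sweep` are hypothetical, not part of MCell.

```python
import random
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def run_task(seed, n_samples=10_000):
    """One independent task: a toy Monte Carlo estimate of pi.

    Stand-in for a single simulation run; the seed plays the role of
    the task's input data.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            hits += 1
    return 4.0 * hits / n_samples


def post_process(raw_outputs):
    """Post-processing: combine the raw per-task outputs into one result."""
    return mean(raw_outputs)


def sweep(seeds):
    """Run every task independently, then post-process the raw outputs."""
    with ThreadPoolExecutor() as pool:
        raw = list(pool.map(run_task, seeds))
    return raw, post_process(raw)


if __name__ == "__main__":
    raw, final = sweep(range(8))
    print(len(raw), "raw outputs, final estimate:", final)
```

Because the tasks share no state, they can run in any order and on any resource; only the post-processing step needs all raw outputs.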
6
Grid Computing
Grid Software Infrastructure
Enable Resource Sharing among users,
applications, institutions
7
MCell on the Grid
  • Feasible
    • Loosely coupled application
    • Can exploit enormous amounts of resources
  • Challenges
    • Scheduling?
      • How to make decisions to assign computation/data
        to resources?
    • Logistics of application deployment
      • Deployment by hand and with ad-hoc scripts will
        not scale

8
Scheduling MCell
9
Scheduling of PSAs
10
List Scheduling with Dynamic Priorities
  • We leverage previous work on list scheduling with
    dynamic priorities
  • We added a notion of adaptivity
  • We added a notion of data locality
  • We developed a new heuristic (XSufferage)
  • We evaluated heuristics in simulation
  • We demonstrated
    • Effective use of data replication/locality for
      performance
    • Robustness, thanks to adaptivity, to performance
      prediction errors and performance fluctuations
  • Casanova et al., HCW00
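To make "list scheduling with dynamic priorities" concrete, here is a minimal sketch of a simplified Sufferage-style heuristic: at every step each unscheduled task's priority is recomputed as the gap between its best and second-best completion time, and the task that would "suffer" most is scheduled first. This is not XSufferage, whose details the slide does not give, and it ignores data locality and adaptivity. The function name, the cost model (work divided by host speed), and the example values are assumptions.

```python
def sufferage_schedule(task_costs, host_speeds):
    """Simplified Sufferage list scheduling (illustrative only).

    task_costs:  dict mapping task name -> amount of work
    host_speeds: dict mapping host name -> work units per second
    Returns (assignment, makespan).
    """
    ready = {h: 0.0 for h in host_speeds}  # time at which each host is free
    assignment = {}
    unscheduled = set(task_costs)

    while unscheduled:
        best = None  # (sufferage, task, host, completion_time)
        for task in sorted(unscheduled):
            # Completion time of this task on every host, given current load.
            times = sorted(
                (ready[h] + task_costs[task] / host_speeds[h], h)
                for h in host_speeds
            )
            best_time, best_host = times[0]
            # Dynamic priority: how much the task loses if it misses its best host.
            sufferage = times[1][0] - best_time if len(times) > 1 else 0.0
            if best is None or sufferage > best[0]:
                best = (sufferage, task, best_host, best_time)

        _, task, host, done = best
        assignment[task] = host
        ready[host] = done
        unscheduled.remove(task)

    return assignment, max(ready.values())


if __name__ == "__main__":
    plan, makespan = sufferage_schedule(
        {"a": 10, "b": 2, "c": 2}, {"fast": 2.0, "slow": 1.0}
    )
    print(plan, makespan)
```

The priorities are "dynamic" because they are recomputed after every assignment, so a task's priority depends on the load already placed on each host.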

11
Experimental Validation
Casanova et al., SC01
12
APST Deployment Software
13
Parameter Sweep Applications
  • MCell is just one representative example
  • Large number of computational tasks
  • Little synchronization
  • High performance
  • Potentially large data-sets
  • Potentially parallel sub-tasks
  • PSAs arise in many fields of Science and
    Engineering

14
APST Prototype software
  • Transparent Deployment: Make it easy for users to
    launch/monitor PSAs over common Grid
    infrastructure (Globus, GridFTP, MDS, Condor,
    etc.)
  • Automatic Scheduling: Achieve high performance
    with available resources
  • Simple Interface: XML-based, to describe
    application and resources
  • APST has been used for MCell in pseudo-production
    since 2001 (supported by NPACI)
  • Casanova et al., IJHPCA01; Casanova et al.,
    Grid02

15
APST Resource Description (1)
  • Descriptions of sites and storage
    <storage>
      <disk id="disk1" datadir="/home/data/">
        <gridftp server="storage.site1.edu" />
      </disk>
      <disk id="disk2">
        <cp server="storage.site2.edu" />
      </disk>
    </storage>

16
APST Resource Description (2)
  • Descriptions of compute hosts
    <compute>
      <host id="host1" disk="disk1">
        <globus server="host1.site1.edu" />
      </host>
      <host id="host2" disk="disk2">
        <globus server="host2.site2.edu" procs="40" />
        </condor>
      </host>
    </compute>

17
APST Resource Description (3)
  • Descriptions of information sources
    <gridinfo>
      <nws server="nws.site1.edu" />
      <mds server="mds.site2.edu" />
      <mds server="mds.globus.org" />
    </gridinfo>

18
APST App. Description
  • Description of application tasks
    <tasks>
      <task executable="mcell" arguments="dfp"
            input="dfp.mdl" output="dfp.out"
            stderr="dfp.err" cost="10" />
      <task executable="mcell" arguments="hbtx"
            input="hbtx.mdl" output="hbtx.out"
            stderr="hbtx.err" cost="2" />
    </tasks>
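As an illustration of how a tool might consume such a task description, the sketch below parses it with Python's standard `xml.etree.ElementTree`. It assumes the description is well-formed XML with quoted attributes (`executable`, `arguments`, `input`, `output`, `stderr`, `cost`), as the slide appears to show. How APST itself interprets `cost` is not stated; here it is treated as a relative task cost, and sorting by it is purely for demonstration. `load_tasks` is a hypothetical helper, not an APST function.

```python
import xml.etree.ElementTree as ET

# Task description in the form shown on the slide (attribute quoting assumed).
TASKS_XML = """
<tasks>
  <task executable="mcell" arguments="dfp" input="dfp.mdl"
        output="dfp.out" stderr="dfp.err" cost="10" />
  <task executable="mcell" arguments="hbtx" input="hbtx.mdl"
        output="hbtx.out" stderr="hbtx.err" cost="2" />
</tasks>
"""


def load_tasks(xml_text):
    """Parse a <tasks> document into a list of dicts, most costly task first."""
    root = ET.fromstring(xml_text.strip())
    tasks = []
    for elem in root.findall("task"):
        task = dict(elem.attrib)
        # Assumption: cost is a relative number; default to 1 when absent.
        task["cost"] = float(task.get("cost", 1))
        tasks.append(task)
    return sorted(tasks, key=lambda t: t["cost"], reverse=True)


if __name__ == "__main__":
    for task in load_tasks(TASKS_XML):
        print(task["executable"], task["arguments"], task["cost"])
```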

19
APST Implementation
20
The Virtual Instrument Project
21
VI Goals
  • To build a Grid execution environment for MCell
    that provides
  • Computational Steering
  • Database for managing application data
  • User interface / portal

22
VI Software Architecture
Tomography
Create MCell/VI Project
Electron Microscopy
23
VI Software Architecture
compute
Grid Services
process
data
storage
Grid Storage and Compute Resources
24
VI Software Architecture
DReAMM (OpenDX)
25
SC02 Demo
BlueHorizon SP SDSC (California)
My Laptop SC02 (Baltimore)
Presto III TITECH (Japan)
GRAIL Lab UCSD (California)
Meteor Cluster SDSC (California)
26
APST in use
27
APST Broader Impact
  • APST provides an easy way to deploy applications
    on the Grid and is being used by an ever-larger
    user community

28
New Research Question?
29
Divisible Workload
Why can't APST partition the app workload by
itself?
  • Divisible Load Scheduling (Robertazzi 1996): How
    to partition the workload to maximize
    performance?
  • Trade-off
    • Large chunks
      • low overhead
      • low communication/computation overlap
      • sensitivity to performance prediction errors
    • Small chunks
      • high overhead
      • high communication/computation overlap
      • robustness to performance prediction errors
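The overhead/overlap side of this trade-off can be seen in a toy model. The sketch below is an assumption-laden illustration, not a model from the talk. A master sends equal-size chunks one at a time, round-robin, to identical workers. Each transfer and each computation pays a fixed latency plus a size-proportional cost, and a worker can receive its next chunk while computing. All parameter names and default values are hypothetical, and prediction errors are not modeled.

```python
def makespan(total_work, n_workers, n_chunks, bandwidth=10.0, speed=1.0,
             net_latency=0.5, comp_latency=0.5):
    """Makespan of a round-robin, equal-chunk schedule in a toy model.

    The master's link is used by one transfer at a time; a worker starts a
    chunk once it has fully arrived and the previous chunk is finished.
    """
    chunk = total_work / n_chunks
    link_free = 0.0                      # when the master's link is next idle
    worker_free = [0.0] * n_workers      # when each worker is next idle
    for i in range(n_chunks):
        w = i % n_workers
        link_free += net_latency + chunk / bandwidth  # chunk arrives at this time
        start = max(link_free, worker_free[w])
        worker_free[w] = start + comp_latency + chunk / speed
    return max(worker_free)


if __name__ == "__main__":
    for n_chunks in (4, 40, 400):
        print(n_chunks, "chunks ->", makespan(100.0, 4, n_chunks))
```

With these assumed parameters, a few large chunks leave workers idle while they wait for their first transfer, many tiny chunks are dominated by per-chunk latency, and an intermediate chunk size does best.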

30
Our Contribution
  • UMR algorithm (Yang, Casanova, IPDPS03)
    • Increases chunk size throughout execution
    • Uses a more realistic model than previously
      proposed algorithms
    • Uses a number of restrictions to cope with the
      model
    • And yet outperforms previously proposed
      algorithms
  • Robust UMR: RUMR (Yang, Casanova, HPDC03?)
    • Increases and then decreases chunk size
      throughout execution
    • Outperforms previously proposed algorithms in the
      presence of performance prediction errors
  • Currently being implemented as part of APST
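The idea of increasing the chunk size from round to round can be illustrated with a small sketch. It is not the published UMR algorithm: the slide does not say how UMR picks the first chunk size or the number of rounds, and the decreasing phase of RUMR is not shown. The sketch assumes identical workers, a master that sends one chunk at a time, and a fixed latency plus a size-proportional cost for both transfers and computations. The next round's size is chosen so that sending it to all workers takes exactly as long as computing the current round: `n_workers * (net_latency + next / bandwidth) = comp_latency + chunk / speed`. All names and default values are hypothetical.

```python
def increasing_rounds(total_work, n_workers, first_chunk, bandwidth=10.0,
                      speed=1.0, net_latency=0.5, comp_latency=0.5):
    """Per-worker chunk size for each round, growing until the work runs out.

    The last round is truncated so the rounds add up to total_work.
    """
    if first_chunk <= 0 or total_work <= 0:
        raise ValueError("first_chunk and total_work must be positive")
    rounds = []
    remaining = total_work
    chunk = first_chunk
    while True:
        chunk = min(chunk, remaining / n_workers)
        rounds.append(chunk)
        remaining -= chunk * n_workers
        if remaining <= 1e-9:
            return rounds
        # Size whose transfer to all workers overlaps exactly with the
        # computation of the round just dispatched.
        nxt = (bandwidth / n_workers) * (comp_latency + chunk / speed) \
            - bandwidth * net_latency
        if nxt <= chunk:
            raise ValueError("chunk sizes would not grow with these parameters")
        chunk = nxt


if __name__ == "__main__":
    print(increasing_rounds(100.0, 4, first_chunk=3.0))
```

Small early rounds get every worker started quickly, and later rounds grow so that fewer per-chunk latencies are paid overall.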

31
Conclusion and Futures
32
Summary Progress Flow
[Timeline diagram, 1999 to 2003: CS Research (Scheduling); Prototype Software (APST 1.0); Application (MCell); EOL, Viztools, etc.; CS Research (Steering); Production Software (APST 2.0); CS Research (Divisible Load Scheduling); Prototype Software (VI)]
33
Future Work
  • Deployment of EOL
    • Prototype EOL/APST version in place
    • Large-scale demo for SC03
  • Releases of APST (v2.0 just released)
    • NPACKAGE release
    • NMI release

http://grail.sdsc.edu