Remote Production and Regional Analysis Centers - PowerPoint PPT Presentation

About This Presentation
Title:

Remote Production and Regional Analysis Centers

Description:

Remote Analysis Centers(This picture needs to be changed. I've put in a new after this one) ... Centers. Provide various services. IAC. Institutional ... – PowerPoint PPT presentation

Number of Views:24
Avg rating:3.0/5.0
Slides: 24
Provided by: wwwhe
Learn more at: http://www-hep.uta.edu
Category:

less

Transcript and Presenter's Notes

Title: Remote Production and Regional Analysis Centers


1
Remote Production and Regional Analysis Centers
Iain Bertram 24 May 2002 Draft 1 Lancaster
University
2
Overview
  • Definitions
  • What is Remote Analysis
  • MC Production
  • Status
  • Future
  • Remote Analysis
  • Remote Code Development
  • DØ-Grid
  • Conclusions

3
Definitions
  • What is Remote?
  • We have defined it as follows Local d0mino,
    FNAL Farms, OnlineRemote clued0, club,
    non-FNAL.
  • 76 Institutions - 670 people on MastheadMost of
    these work remoteley

4
  • What is Remote?
  • We have defined it as follows Local d0mino,
    FNAL Farms, OnlineRemote clued0, club,
    non-FNAL.
  • 76 Institutions - 670 people on MastheadMost of
    these work remoteley

5
Goals
  • Remote Production Facilities will provide the
    bulk of processing power and storage for the DØ
    collaboration.
  • Production Tasks are
  • Monte Carlo (MC) Production
  • Reconstruction and Secondary reprocessing of the
    data.
  • CPU intensive user Analysis Jobs.
  • Data storage
  • MC Production
  • Run IIB Data Rate ? 50 Hz
  • Aim MC rate of 25 Hz

6
Monte Carlo Production - Current
  • All Production Monte Carlo Running Off-Site
  • Full Chain, generation, simulation, digitization,
    reconstruction, standard analysis are run.
  • A fully functional SAM station, capable of
    storing and retrieving data from any location.
  • MC_Runjob allows fully parameterized running of
    MC using macros
  • Redhat Linux 6.2 or 7.1
  • Tarballs created from Official Releases
  • In process of implementing integrated request
    system using SAM (what do you mean by this?)

7
Monte Carlo Production Rates
Monte Carlo Production Total 17.2 M Events Full
Geant Simulation
8
MC Requirements RunIIB
  • Requirements for 25 Hz Monte Carlo Rate
  • 25 Event full Geant - Rest Parameterized MC
  • Time Per Geant Event on 500 MHz PIII 216
    seconds
  • Digitize, Reconstruct and Analyze each event 6
    times
  • Time per Geant event 550 seconds
  • Meet 25 Hz Goal requires 5k 500 MHz CPU's

9
MC Production Cost requirements
  • Same Cost Assumptions as Fermi Farms
  • Dual CPU Node 2,500
  • One I/O Node per 100 node's
  • No remote mass storage costs included
  • Three Year Purchase Profile

10
Software Requirements
  • Operating Systems
  • Current RH 7.1
  • Future DØ Fermi Redhat Linux or a similar Redhat
    release
  • DØ will support official builds for the Fermi
    Redhat Releases
  • Remote production sites could be shared
    facilities
  • Will not be able to upgrade operating systems
    purely to meet DØ software requirements.

11
Remote Analysis
  • LHC and Grid Devlopments lead to many new large
    Computing Resources.
  • Need to access data software, and database
    information remote from FNAL
  • Integrate Software with new GRID Tools
  • Propose to set up Regional Analysis Centers
  • A series of locations that offer centralized
    access points for data analysis for remote users.

12
(No Transcript)
13
DØRACE Deployment Map (US and EU only)
Processing Center
Analysis Site w/ SAM
Analysis Site w/o SAM
No DØRACE
14
Remote Analysis Centers(This picture needs to be
changed. Ive put in a new slide after this one)
Normal Interaction Communication Path
Occasional Interaction Communication Path
Regional Analysis Centers
Provide Various Services
Institutional Analysis Centers
?.
Desktop Analysis Stations
15
Remote Analysis Centers
Central Analysis Center (CAC)
.
Regional Analysis Centers
Provide various services
RAC
RAC
Institutional Analysis Centers
.
.
IAC
IAC
IAC
IAC
.
Desktop Analysis Stations
.
DAS
DAS
16
RAC Requirements
  • RAC will provide
  • Significant CPU Power
  • Access to complete DØ Thumbnails and some DSTs
  • Access to DØ Codes
  • Storage Space
  • An institute with large concentrated and
    available computing resources
  • 100s of CPUs
  • 10s of TBs of disk cache
  • 100s Mbytes of network bandwidth
  • Possibly equipped with Mass Storage

17
Costs for RAC
  • Assume Shared Facility
  • Costs Based on Lancaster Farm
  • Year 00 200 CPU, 2TB disks, 30 TB expandable
    Mass Storage
  • Cost 600 k.
  • Will need larger facility.....
  • Each RAC 1M...... (For run IIa only)
  • Need to put better numbers in here,,,
  • Required disk storage is about 50TB for Run IIa
    alone.
  • The compute resource should be taken from the
    overall CS plan document.

18
Remote Analysis Requirements
  • Off Site Data Reconstruction
  • Access the appropriate DØ calibration, and
    luminosity databases. It is assumed that these
    will be accessed via a server interface and not
    require a local copy of the database.
  • Transfer of relevant tier of data
  • Well synchronized reconstruction executable
  • Generic User Analysis Jobs
  • The user will be required to meet several
    mandatory conditions
  • The software must be built using Dørte to ensure
    that the package can run without the full DØ
    release.
  • Any input/output data must be stored in SAM.
    (depending on the user. If the user is
    representing a physics group, this is correct but
    if the user is doing the analysis for his own
    analysis, this should not be the case.)
  • That the jobs can be fully described and
    submitted to a SAM queue.
  • Grid Enhancements
  • It is assumed that Remote Production Jobs will
    make full use of the current DØGrid project. Use
    of the Grid is not specific to remote analysis,
    but rather is a coherent part of the overall
    computing plan. (Remote analysis also is a
    coherent part of the computing plan. This
    statement sort of implicates that the remote
    analysis is not really in the overall plan)

19
CLUB
  • Analysis Backend to Cluedo
  • For analyzing 1TB data samples
  • Similar to MC farm's
  • Essentially a batch engine.
  • Fermilab to provide seed and infrastructure.
  • This cluster is a RAC for IACs around Fermilab
    and for Fermilab physicist. Should be included
    in the pilot RAC program.
  • Assume 200 CPU system (100 nodes 250k)
  • 15 TB data storage (10 servers) (50k)
  • Looks like 300k per CLUB type installation

20
Open Issues
  • Remote Mass Storage Costs.
  • Network Requirements
  • Depend on development of grid (Still there are
    basic requirements to just simply transfer data
    from CAC to RAC. 14MB/sec aggregated at FNAL for
    Run IIa.)
  • Data to processors or program to data
  • Available Resources Incomplete List Not Easy to
    Plan
  • Expect minimum 600 processors.

21
DØRAC Implementation Timescale
  • Implement First RAC by Oct. 1, 2002
  • CLUBs cluster at FNAL and Karlsruhe, Germany
  • Cluster associated IACs
  • Transfer TMB (10kB/evt) data set constantly from
    CAC to the RACs
  • Workshop on RAC in Nov., 2002
  • Implement the next set of RAC by Apr. 1, 2003
  • Implement and test DØGridware as they become
    available
  • The DØGrid should work by the end of Run IIa
    (2004), retaining the DØRAM architecture
  • The next generation DØGrid, a truly gridfied
    network without

22
Pilot DØRAC Program
  • RAC Pilot sites
  • Karlsruhe, Germany ? Already agreed to do this
  • CLUBS, Fermilab ? Need to verify
  • What should we accomplish?
  • Transfer TMB files as they get produced
  • A File server (both hardware and software) at CAC
    for this job
  • Request driven or constant push?
  • Network monitoring tools
  • To observe network occupation and stability
  • From CAC to RAC
  • From RAC to IAC
  • Allow IAC users to access the TMB
  • Observe
  • Use of the data set
  • Accessing pattern
  • Performance of the accessing system
  • SAM system performance for locating

23
  • The user account assignment?
  • Resource (CPU and Disk space) need?
  • What are the needed Grid software functionality?
  • To interface with the users
  • To locate the input and necessary resources
  • To gauge the resources
  • To package the job requests
Write a Comment
User Comments (0)
About PowerShow.com