Plans for Integrated Simulation and Reconstruction Framework - PowerPoint PPT Presentation

1 / 11
About This Presentation
Title:

Plans for Integrated Simulation and Reconstruction Framework

Description:

Two simulations packages, based on Geant3 and Geant4 ... L-Tracker Pattern recognition, fitting integrated into GMC. Standalone T-Tracker PatRec and Fitter ... – PowerPoint PPT presentation

Number of Views:75
Avg rating:3.0/5.0
Slides: 12
Provided by: yury
Category:

less

Transcript and Presenter's Notes

Title: Plans for Integrated Simulation and Reconstruction Framework


1
Plans for Integrated Simulation and
Reconstruction Framework
  • Yury KolomenskyUC Berkeley

2
Existing Software Infrastructure
  • Much progress made on MECO simulations thus far
  • Two simulations packages, based on Geant3 and
    Geant4
  • Beamline simulations, backgrounds, detector
    resolutions and efficiency
  • Reconstruction code
  • L-Tracker Pattern recognition, fitting
    integrated into GMC
  • Standalone T-Tracker PatRec and Fitter
  • Calorimeter response
  • Covered in the previous talks
  • A handful of developers, somewhat disjoint
    package structure

3
The Next Steps
  • Key questions for the experiment
  • Magnet design and impact on physics capabilities
  • E.g. field uniformity
  • Longitudinal vs Transverse Tracker design
  • Trigger and DAQ development
  • Calorimeter reconstruction, tracking
  • These will require increasingly more
    sophisticated software capabilities
  • Detailed simulations
  • Integrated reconstruction algorithms
  • Standard benchmarks
  • Signal and backgrounds
  • Geometry
  • Fields

4
Integrated Simulation/Analysis
  • As the sophistication of the software increase,
    and more people get involved in the project,
    overall design issues become important
  • Integration of simulation and reconstruction/anal
    ysis
  • Flexibility (various detector packages, physics
    signatures, backgrounds)
  • Code maintenance and portability
  • Documentation
  • Ultimately, would like a system that can be
    migrated to online and offline operation w/o
    major redesign

5
The Framework
  • A fairly flexible online/offline system was
    designed by CDF/BaBar
  • Based on C and tcl scripting language
  • Modular structure to accommodate different
    inputs, outputs, execution sequence
  • Extensive and extendable set of build tools for
    various architectures (Linux Solaris,
    historically supported OSF and HP-UX)
  • In various flavors exists in major HEP
    experiments
  • BaBar, CDF
  • I have a lot of experience with the internals of
    this Framework, having ported it at least twice
    (E158, ILC nanoBPM project)
  • No wheels invented here

6
The Framework Best Features
  • Execution is organized in modules chunks of code
    which perform specific tasks
  • Abstract interface for each module beginJob(),
    endJob(), beginRun(), endRun(), event()
  • Each module is independent of each other (code
    dependence management code in parallel) but
    modules pass data to each other
  • Execution sequence (which modules are run and in
    what order) can be changed at run time with tcl
    scripts
  • Online, simulation, offline is handled that way
  • Event structure is extensible
  • Type-safe interfaces to add/get data to/from
    event only modules that directly use particular
    data objects need to know about them
  • Extensible Run-dependent environment
  • Handle constants that change slowly in a
    type-safe manner
  • Tcl run-time interface
  • Change parameters of modules, add/remove modules,
    change inputs/outputs, etc.

7
The Framework
8
Code Management
  • Code organized in packages
  • Each package is responsible for specific task
    (e.g. calorimeter digitization, PatRec, etc.)
  • Corresponds to a linkable library
  • Assigned to a responsible person
  • By default, code base is in CVS
  • Accessed either by AFS or ssh
  • Revision system allow parallel development,
    version control
  • Handles merges, creation/deletion of new files
    and packages, fallback mechanism

9
Code Management (cont)
  • Regular software releases
  • Snapshots of software, taken periodically
  • Every few months
  • Copies of releases can be either installed
    locally, or accessed (AFS) from the central
    location
  • Each user downloads a small snapshot of the
    release, checking out only packages they need to
    recompile
  • Build tools SoftRelTools (SRT) from BaBar
  • Several OS/compiler architectures
  • Solaris, RedHat/SL Linux fully functional
  • Language support for Fortran, C, C, Java, ROOT
    shared libraries
  • Again, no need to reinvent the wheel

10
Where Do We Start ?
  • Im starting on porting the MECO simulation into
    The Framework
  • Inputs GMC and G4 simulations up to hit
    creation
  • GMC inputs through disk files, G4 through either
    disk files or Framework input modules
  • Backgrounds through disk files
  • Digitization from hits to digital signatures
    (digis)
  • Reconstruction
  • Fortran-based PatRec and C-based TTracker
    PatRec
  • Outputs ROOT/HBOOK
  • Other simulation and reconstruction packages to
    be incorporated (e.g. calorimeter, trigger)

11
Manpower
  • Most of the hard work is actually in subsystems
  • Software infrastructure 1 FTE
  • Code port/maintenance, optimization, release
    building, QC/QA
  • 1 FTE at the start of the project, tail off
    when operational
  • The rest in subsystem code
  • Estimate 1 FTE for simulations/reconstruction
    for each major subsystem (L-tracker, T-tracker,
    Calorimeter and Trigger, CR shield, magnet),
    mostly committed
  • Plenty of opportunity for new blood

12
Scale and Performance
  • The Framework has been deployed in a variety of
    experiments and applications
  • BaBar simulation, reconstruction, analysis, L3
    trigger, online
  • O(10M) lines of code, performance of 100 Hz (L3)
    to 0.1 Hz (integrated sim/reco executable)
  • Inputs proprietary binary files, Objectivity
    database, ROOT files, network
  • E158 online and offline analysis
  • O(400k) lines of code, 120 Hz
  • Inputs proprietary binary and ascii files,
    network
  • ILC nanoBPM project online
  • O(10k) lines of code, 10 Hz
  • Inputs ascii files, EPICS channels

13
Documentation
  • Extremely important !
  • This is somewhat of a cultural issue. E.g. BaBar
    started off poorly, still very little code
    documentation
  • Several levels of detail
  • Tutorials for new users
  • Should be self-contained, e.g. allow an
    undergrad to get up and running with an example
    in finite amount of time
  • Code documentation in README files, comments
  • Can use automatic documentation tools
  • MECO notes for algorithms, performance, etc

14
QA and QC
  • Quality assurance good programming practices
  • Training
  • Code inspection
  • Quality control
  • Physics performance tests (e.g. standard
    simulation)
  • CPU performance and memory leaks
  • Standard tools from BaBar exist
Write a Comment
User Comments (0)
About PowerShow.com