Title: Operational Machines: ASCI White
1Operational Machines ASCI White
- Presented to SOS7
- Mark Seager
- seager_at_llnl.gov
- 925-423-3141
- ICCD ADH for Advanced Technology
- Lawrence Livermore National Laboratory
This work was performed under the auspices of the
U.S. Department of Energy by the University of
California, Lawrence Livermore National
Laboratory under Contract No. W-7405-Eng-48.
2Q1 Is your machine living up to the performance
expectations? If yes, how? If not, what is the
root cause?
- ASCI White is providing robust cycles to the
tri-laboratory community - Application performance relative to peak is less
than expected - FMA 5-16 of floating point arithmetic
instructions issued - Some users sacrifice (turn off) compiler
optimization to have strict reproducibility - Modern coding techniques lead to poor memory
bandwidth utilization - Low cache-line payload utilization
- OOP and non-uniform grids ? several memory
references per floating point operation
3Q2 What is the MTBI?
4What are the topmost reasons for HW interrupts?
5What are the topmost reasons for SW interrupts?
6What is the average utilization rate?
7Q3 What is the primary complaint, if any, from
the users?
- Not enough time on the machine
- Users want more access
- Scalability of MPI
- MPI_ALLREDUCE
- MPI_BARRIER
- Extremely long job startup