One-sided Communication Implementation in FMO Method - PowerPoint PPT Presentation

1 / 25
About This Presentation
Title:

One-sided Communication Implementation in FMO Method

Description:

... in FMO Method. J. Maki, Y. Inadomi, T. Takami, R. Susukita , H. Honda, ... matrix element. of (needed in SCF calculation) involves 4 center tow electron integrals ... – PowerPoint PPT presentation

Number of Views:27
Avg rating:3.0/5.0
Slides: 26
Provided by: mak95
Category:

less

Transcript and Presenter's Notes

Title: One-sided Communication Implementation in FMO Method


1
One-sided Communication Implementation in FMO
Method
  • J. Maki, Y. Inadomi, T. Takami, R. Susukita, H.
    Honda,J. Ooba, T. Kobayashi, R. Nogita, K. Inoue
    and M. AoyagiComputing and Communications
    Center, Kyushu UniversityFukuoka Industry,
    Science Technology Foundation

2
Overview of FMO method (1)
  • FMO(Fragment Molecular Orbital method) has been
    developed by Kitaura (AIST) and co-workers to
    calculate electronic states of a macromolecule
    such as a protein and nucleotide
  • The target molecule is divided into fragments
    (monomers)
  • ab initio molecular orbital (MO) calculation is
    carried out on each fragment and fragment pair
    (dimer).
  • Usually executed with high parallel efficiency
    (gt90)
  • The method can execute all-electron calculation
    on a protein molecule with 10,000 atoms.

3
Example of fragmentation
4
Overview of FMO method (2)
monomer calculation
Hamiltonian
environmentalelectrostatic potential
(potential from other monomers)
electron density of monomer J
monomer calculation depends on other monomers
electron density
iterated until all electron densities
unchanged(SCC procedure)
5
Overview of FMO method (3)
Schrødinger equation
RHF (Restricted Hartree-Fock) SCF method
Molecular Orbital (MO)
MO coefficients
basis functions
electron density
density matrix elements
number of basis functions
6
Overview of FMO method (4)
dimer calculation
Hamiltonian
environmental electrostatic potential
Schrødinger equation
Total energy
not all dimers are calculated by SCF
7
dimer-es approximation
  • For distant dimers, SCF calculation are not
    carried out
  • Energy is calculated by electrostatic
    approximation

ES (non-SCF)dimer
monomer
SCFdimer
8
Flow chart of FMO calculation
electronic structure calculation for monomer
not yet converged
convergence check of SCC procedure
already converged
electronic structure calculation for SCF-dimer
ES dimer calculation
total energy calculation
9
Density requirements and update in FMO calculation
electronic structure calculation for monomer
not yet converged
convergence check of SCC procedure
already converged
electronic structure calculation for SCF-dimer
ES-dimer calculation
total energy calculation
10
Approximation of
calculation (1)
potential from the electrons of fragment K
involves 4 center tow electron integrals
number of basis functions
order of calculation costs
4-center two electron integral calculationsfor
all environment monomers are prohibitive
11
Approximation of
calculation (2)
esp-aoc approximation
overlapmatrix of fragment K
( )
Mulliken AO population of
12
Approximation of
calculation (3)
esp-ptc approximation
point charge approx. of
Mulliken atomic charge of the nucleus A
number of environment monomers for which 4-center
two electron integral calculations are carried out
13
Hypothetical petascale computing environment
Number of CPUs
1PFlops 100,000 CPU
(current one CPU peak performance 10GFlops)
14
Memory requirement in FMO
RIJ
1,000 350MB 4MB
5,000 1.7GB 95MB
10,000 3.4GB 380MB
50,000 17GB 9.3GB
100,000 34GB 37GB
It is difficult for each process to store all
necessary data
15
OSC implementation in OpenFMO (1)
  • In its execution, only the process of rank 0 is
    used as a server process for dynamic load
    balancing. All the other processes are worker
    processes and are divided into groups and the
    two-level parallelization is used as the other
    implementations.

Group00
Group01
Group02
node00
node01
node02
node03
node04
0
1
2
3
4
5
6
7
8
9
16
OSC implementation in OpenFMO (2)
Group00
Group01
Group02
node00
node01
node02
node03
node04
0
1
2
3
4
5
6
7
8
9
MPI_Get
memory window created by MPI_Win
density matrix data
17
Estimation of communication cost (1)
Assumptions
  1. The sizes of all density matrices are the
    same.All groups have the same number of worker
    processes.
  2. The time to put or get one density matrix is
    equal to the average point-to-point communication
    time .
  3. MPI_Bcast implements a binomial tree algorithm so
    that the time to broadcast one density matrix
    over processes is obtained by .
  4. And for OSC scheme, dynamic load balancing works
    well and the delay due to competing put or get
    requests is ignorable. Therefore all groups
    execute the same number of monomer and dimer jobs
    and this means that all groups have the same

18
Estimation of communication cost (2)
10,000-100,000
96,000
32
SCC iterations
20
neighboring monomers (average)
7
neighboring monomers (average)
12
monomers with which a monomerforms a SCF dimer
17
SCF dimers
point-to-point communication time to transfer one
density matrix
19
Estimation of communication cost (3)
OSC scheme
number of monomerget requests
total number of dimers
number of SCF dimers
number of SCFdimer get requests
number of ES dimers
number of ES dimer get requests
total number of get requests
20
Estimation of communication cost (4)
OSC scheme
cost of MPI_Bcast
cost for one get requests
total cost of put requests
total cost per one group
Bcast scheme
total cost per one group
21
Estimation of communication cost (5)
22
Simulation using skelton code (1)
  • FMO calculation was simulated using skelton code
  • quantum calculation parts are removed
  • Skelton code accumulates estimated time of
  • molecular integral calculation
  • Fock matrix builging, SCF procedure
  • Time of each communication are estimated and
    accumulated
  • MPI_Send, MPI_recv, MPI_get, MPI_putestimated
    from measurements
  • MPI_Bcast, MPI_Allreduceestimated from point to
    point communication timeby assuming binomial
    tree, butterfly algorithm

23
Simulation using skelton code (2)
  • integral calculation time estimation formula

integrals required for environmental potential
1 electron
A1.12072E-05 B6.03297E-08
2 electron (3center)
A5.28555E-05 B2.60718E-07
2 electron (4center)
A0.00050 B3.26399E-06
integrals required for SCF
A7.57123E-05 B9.33117E-07
  • Fock builging time estimation

A5.84313E-05 B1.12603E-06
24
simulation using skelton codes (3)
Aquaporin protein
PDBID 2F2B,
492, 14,492 atoms
6-31G basis set
OSC cost lt Bcast cost
Linux cluster in RIKEN Super Combined Cluster
(RSCC) was used
25
Summary
  • OSC of MPI-2 standard has been implemented in a
    new FMO code to reduce the memory requirement per
    process.
  • Evaluation of communication costs shows that OSC
    scheme has an advantage over the Bcast scheme for
  • These results show OSC scheme is prospective for
    the petascale computing environment.
Write a Comment
User Comments (0)
About PowerShow.com