2D-PME method and REX-MS method - Application of computational chemistry -

advertisement
2D-PME method and REX-MS method
- Application of computational chemistry -
Masaaki Kawata
Grid Technology Research Center, AIST, JAPAN
m.kawata@aist.go.jp
National Institute of Advanced Industrial Science and Technology
Outline
Introduction
What is 2D-PME method ?
- how fast 2D-PME method is ! What is REX-MS method ?
- how efficient REX-MS method is ! Perspective
- combination of two methods -
National Institute of Advanced Industrial Science and Technology
Introduction
Task parallel calculation
ex. Parallel molecular dynamics or Monte
Carlo simulations.
Data parallel calculation
ex. optimum-pairing-search of drug
compounds (parameter survey)
National Institute of Advanced Industrial Science and Technology
Introduction(2)
Task parallel calculation
Given a conformation of target compound,
accurate estimation of physical quantities,
whose accuracy are comparable to
experiment, requires large amount of
computational resources.
⇒ parallel algorithm of molecular simulation
→ Parallel calculation of Coulomb interaction
=2D-PME method
National Institute of Advanced Industrial Science and Technology
Introduction(3)
Data parallel calculation
Statistical interpretation should be included
in molecular simulation. But now we can’t do
statistical operation due to limited
computational resources.
Statistical operation requires extensive
survey of parameters space.
⇒ New strategy dealing with statistical
ensemble
→ REX-MS method
National Institute of Advanced Industrial Science and Technology
What is 2D-PME method ?
2D-PME method stands for Two-Dimensional
Particle Mesh Ewald method.
Fast and accurate method to calculate
Coulomb interaction in three-dimensional
systems with two-dimensional periodicity
(quasi-2D systems).
z
x
y
National Institute of Advanced Industrial Science and Technology
Why quasi-2D systems ?
Nano application
⇒surface of nano-structures
Bio application
⇒ membrane protein for
drug design
should be treated as
Quasi-2D system
National Institute of Advanced Industrial Science and Technology
What is quasi-2D system
H2O
H2O
Au
CH3(CH2)nS
z
y
x
(Left) Quasi two-dimensional simulation box, i.e., three-dimensional box with two-dimensional
periodicity in the (x, y) directions and with non-periodicity in the z direction. Original
particles are contained in the central box with lengths of the sides, Lx, Ly, and Lz in the x, y,
and z directions, respectively. (Right) Self-assembled monlayer membrane system. Images of
the simulation box are repeated in the (x,y) directions.
National Institute of Advanced Industrial Science and Technology
Coulomb interaction
7
kcal/mol
Coulomb interaction has longer tail
more than size of simulation box
→ Ewald method
qNa+ qCl−
6
5
r
4
3
2
More than 90 % of CPU time is
1
consumed for calculation of Coulomb 0
interactions
→ fast Ewald method
100
Acceleration of molecular simulations
⇒ Acceleration of Ewald method
National Institute of Advanced Industrial Science and Technology
200
300
400
Angstrom
500
2D Ewald method
1
Φ= ∑
2 h x ,h y
'
N
qi q j
N
∑∑ r − r
i =1 j =1
i
j
Φ kk ≠0
+ hx + h y
1
=
hx ×h y
Gx −1Gy −1 ∞
∑∑ ∑Q(t , t
t x =0 t y =0 t z =−∞
x
y
, t z ) × Ψ • Q(t x , t y , t z )
≈ Φr + Φkk ≠0 + Φkk =0 + Φs
α
Φ =−
π
s
Φ
k
k =0
N
[
N
∑q
2
i
i =1
= ∑ qi B Φkk =0 g , zi
]
i =1
(
N N
qi q j
1
'
Φ = ∑ ∑∑
erfc α rij + h x + h y
2 h x ,h y i =1 j =1 rij + h x + h y
r
)
National Institute of Advanced Industrial Science and Technology
2D-PME method
Φ
k
k ≠0
1
=
hx ×h y
Gx −1Gy −1 ∞
∑∑ ∑Q(t , t
t x =0 t y =0 t z =−∞
x
y
, t z ) × Ψ • Q(t x , t y , t z )
S (m ) =
N
∑q
j =1
Charge q
j
exp (− 2π i m ⋅ r j )
Contribution from each grid point: Q
(k1,k2,k3)
S (m) ≅ C(m) ⋅ F(Q)(m)
National Institute of Advanced Industrial Science and Technology
Results(1)
System 1
System 2
System 3
National Institute of Advanced Industrial Science and Technology
Results(2)
CPU timea for a single-step MD calculation for a given accuracy, by using the
original method and the 2D-PME method.
System 1
System 2
System 3
Number of
charges
2928
2955
2817
Lx=Ly (Å)
32.15
22.29
46.35
Lz (Å)
32.15
66.87
15.45
Original Ewald
method (s)
1098.29
1455.58
1828.77
2D-PME
method(s)
2.38
6.52
2.86
Speedup
461.5
223.2
639.4
CPU time on Compaq Alpha Station XP1000 (Alpha21264 667MHz).
National Institute of Advanced Industrial Science and Technology
Flow chart of parallel MD calculations with
the 2D-PME method.
Update and redistribute r j and r& j
Calculate bonded interactions
r
r
Calculate Φ and F
together with van der Waals interactions
k
k
Calculate Φ k = 0and F k = 0
Construct u jfrom r ′j
Construct
M λ ( λ = x , y , z ) and their derivatives
Set Q ( t x , t y , t z )
Data transform as illustrated (forward)
forward
2D FFT and 1D Fourier integral
k
Calculate Φ k ≠ 0
~
~
Calculate Ψ ( m x , m y , m z ) ⋅ Q ( m x , m y , m z )
2D FFT and 1D Fourier integral
Data transform as illustrated (backward)
k
Calculate F k ≠ 0
Sum of all interactions
National Institute of Advanced Industrial Science and Technology
backward
Results (3)
Speedup factor of the parallel
MD calculations for the quasi-2D
systems by using the 2D-PME
method. The solid line is for
calculations with an SP switch
(300MB/sec BI-Direction) with
user space protocol, and the
dashed line is for calculations
with internet protocol. The
dotted line is the ideal speedup
factor, assuming an infinitely fast
network connecting the nodes.
60
Speedup factor
50
40
30
20
10
10
20
30
40
50
Number of processors
60
National Institute of Advanced Industrial Science and Technology
What is REX-MS method?
REX-MS stands for Replica Exchange
Molecular Simulation method.
Computationally efficient sampling method in
the phase space.
⇒ global optimization problem
suitable for grid environment
National Institute of Advanced Industrial Science and Technology
In REX-MS method
Replica1
Replica2
ReplicaN
1. Run N independent
simulation (N replicas) with
N different parameters,
respectively.
2. Exchange information
among replicas during the
simulation.
3. Search optimum solution
among N simulations.
Extended statistical ensemble
Sampling by using REX strategy is more efficient than
that by the sum of N independent simulations (not REX).
National Institute of Advanced Industrial Science and Technology
Two type of REX-MS methods
REX-MC method (by t.ikegami & h.takemiya)
Replica exchange Monte Carlo calculation
REX-MD method (by m.ito)
Replica exchange Molecular Dynamics toolkit
National Institute of Advanced Industrial Science and Technology
REX-MC (by t.ikegami)
Potential energy survey of molecules using
direct method (Combination of REX-MC with
ab-initio MO calculation)
⇒Superior to random walk survey of
complicated potential surface
National Institute of Advanced Industrial Science and Technology
Gridifying the program
Two levels of parallelization
Coarse grained: parallel montecarlo sampling
Fine grained: parallel ab-initio
energy calculation
Dynamic task scheduling,
machine reconfiguration
Bookkeeper
Reconfiguration request
Task scheduling for balancing Monitoring
Reconfiguration
load on a heterogeneous
computing resources
Machine scheduling for
reconfiguring machine sets on
the fly
Dynamic scheduling
REXMC client
Task allocation
Servers
meta-computing test bed
10 institutes/20 Supercomputers
ab initio calculation
National Institute of Advanced Industrial Science and Technology
HPC Challenge in SC2002
Metacomputing Test-bed
AIST
10 institutions (3 continentals) / 20 parallel computer (7 types)
High Performance Computing Center Stuttgart (HLRS),
Sandia National Laboratories (SNL),
Pittsburgh Supercomputing Center (PSC),
REXMC
Client Center (AIST),
REXMC Client
Grid Technology
Research
For C20 triplet
For C20 singlet
Manchester Computing Centre (MCC),
National Center for High Performance Computing (NCHC),
Japan Atomic Energy Research Institute (JAERI),
Korea Institute of Science and Technology Information (KISTI),
European Center of Parallelism in Barcelona (CEPBA/CIRI),
Bookkeeper
Finnish IT center for Science (CSC).
National Institute of Advanced Industrial Science and Technology
Potential Survey on the Metacomputing
testbed
Parallel execution of 16(32)
MC samples
Running over 150 hours with changing
machine configurations
Calculating 145 MC samples/hour using
860 CPU’s (at maximum)
Potential survey for C20 might be
completed in a month
(cf. > ~30 years on a single CPU)
Negligible Communication cost
< 1% of total time
Dynamic scheduling/configuration
mechanism is useful for
long time simulation
Simulation on the unstable environment
National Institute of Advanced Industrial Science and Technology
REMD toolkit (by m.ito)
A toolkit was developed to build a replica-exchange method
program suitable for solving the multiple-minima problem.
It is designed as an object-oriented framework to generate
variants of simulation programs by assembling the toolkit
components and force field programs.
The toolkit components provides the parallelization
mechanisms for various computational environments and the
sampling methods.
An arbitrary force field implementation can be plugged into
the toolkit to generate an executable program.
REMD toolkit
MPI
Serial
Grid
MC
Mindy(NAMD)
MD
New
Model
National Institute of Advanced Industrial Science and Technology
Potential
Estimation of thermal distributions for molecular
structures is essential for elucidating biological
functionalities, e.g. a ligand-receptor binding.
Such estimation, however, is notoriously difficult
at low temperature because of a multiple-minima
problem.
Replica exchange method (REM) and replicaexchange molecular dynamics method (REMD) can
overcome the difficulty.
Each component of an REM/REMD algorithm, a
statistical ensemble, conformational sampling
method, and potential energy function has to be
customized to suit a particular biomolecular system.
Simulation programs are likely to adjust to various
parallelization environments.
An object-oriented framework can facilitate to
generate a variants of REM/REMD programs
suitable for various molecular systems and
different computational environments such as PC
clusters and Grid.
Component-based software development
Energy
Why do we need a software framework for
REM/REMD simulations?
National Institute of Advanced Industrial Science and Technology
構造座標
Efficiency of the REMD toolkit
The efficiency of the
generated program
(toolkit +
Mindy(NAMD)) was
examined by
estimzating the heat
capacity.
The error was found
to decay naturally
according to the
central limit theorem.
National Institute of Advanced Industrial Science and Technology
Perspective
Combination of 2D-PME method and REX-MS
method leads to fast, accurate and extensive
survey of the screening.
⇒higher throughput in drug design and in
material design of nano-structure.
Implementation of those methods on grid
environment brings new stage of the design
processes.
⇒ a promising application of grid
technology.
National Institute of Advanced Industrial Science and Technology
Reference
For more details about:
2D-PME, ask me or m.kawata@aist.go.jp
REX-MC, ask h.takemiya (here)
or mail to t-ikegami@aist.go.jp
REMD toolkit, mail to masakatsu-ito@aist.go.jp
National Institute of Advanced Industrial Science and Technology
Download