Discovering Unique, Low-energy Transition States of Small, Non-cyclic Molecules Using Evolutionary

advertisement
Discovering Unique, Low-energy
Transition States of Small, Non-cyclic
Molecules Using Evolutionary
Molecular Memetic Computing
M.M.H. Ellabaan1, Y.S. Ong2, S.D. Handoko2, C.K. Kwoh3, and H. Man4
1
Center for Systems Microbiology, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
Centre for Computational Intelligence, School of Computer Engineering, Nanyang Technological University, Singapore
3
BioInformatics Research Centre, School of Computer Engineering, Nanyang Technological University, Singapore
4
Department of Biology, Boston University, Boston, MA, USA
2
Abstract—In the last few decades, identification of the transition states has experienced significant
growth in research interests from various scientific communities. As per the transition states theory,
reaction paths and landscape analysis as well as many thermodynamic properties of biochemical
systems can be accurately identified through the transition states. Transition states describe the
paths of molecular systems in transiting across stable states. In this article, we present a novel
Molecular Memetic Computing (MMC) methodology for the discovery of unique, low-energy
transition states and showcase the efficacy of identifying the transition states using the evolutionary
nature of the memetic computing paradigm. In essence, the MMC is equipped with the tree-based
representation of non-cyclic molecules and the covalent-bond-driven evolutionary operators in
addition to the typical backbone of memetic algorithms. Herein, we employ genetic algorithm for
the global search, Berny algorithm for the individual learning, and make use of the valley-adaptive
clearing scheme as the niching strategy in the spirit of Lamarckian learning. Experiments with a
number of small non-cyclic molecules demonstrated excellent efficacy of the MMC compared to
recent advances of several state-of-the-art algorithms. Not only did the MMC uncover the largest
number of transition states, but it also incurred the least amount of computational costs.
1. Introduction
Small molecules with a few dozens of atoms play important roles in biology, chemistry, and
medicine. Alongside their usage as the combinatorial building blocks in chemical synthesis, small
molecules have become essential for screening, designing, discovering, and synthesizing new drugs
[1]. Discovery of the low-energy stereoisomers of small molecules has gained growing attention of
the scientific community in the past decades as these stereoisomers may lead to the discovery of
more effective drug molecules compared to their traditional counterparts [2–4]. Such stereoisomers
are not always available in nature. They may require special synthesis, for which knowledge on their
transition states are often required in order to identify the minimum energy pathway to achieve the
synthesis. The transition states describe molecular configurations assumed during the synthesis of a
stereoisomer from another stereoisomer. Transient nature of the transition states makes
computational study a necessity as it is difficult—if not impossible—to observe them through wetlab experiments. Knowledge on the transition states helps scientists not only in the synthesis of drug
isomers, but also in the identification of drug sensitivities to environmental changes. The lower the
energy of the transition states associated with a given drug isomer, the more vulnerable will the
drug be to the environmental changes. Such information may be used to measure the relative ranks
of drug analogs. Analogs that are less sensitive to environmental changes can then be recommended
subject to pathological conditions.
Identification of the transition states is a computationally challenging task. Given a small molecule,
its energy landscape is generally highly constrained and involves vast search space. While no explicit
constraint function is normally formulated, the energy terms that serve as the objective function can
easily approach infinity, suggesting the nonsensical or the naturally impossible atomic configuration
of the molecule. Furthermore, the energy calculation of the molecule is generally computationally
expensive. With a single evaluation requiring minutes to hours of CPU time, the challenge is hence to
maximize the production of meaningful conformations so as not to waste the precious
computational resources evaluating many nonsensical structures. To-date, many optimization
approaches have been proposed to identify the transition states. These can be classified into the
conventional and the stochastic approaches [5].
Conventional approaches assume the availability of domain knowledge to produce reasonably good
initial guesses of the transition states. Methods in this category can be further segregated into
double-ended and single-ended techniques [6]. Given pairs of known stereoisomers (reactants and
products), the double-ended methods find transition states that bridges them [7–9]. In contrast, the
single-ended methods find transition states in the vicinity of some initial guesses [10]. Upon
availability of such domain-specific a priori information, conventional approaches converge
efficiently to precise transition states. Otherwise, considerable efforts would be spent attempting to
“repair” the “bad” initial guesses.
In contrast to conventional approaches, significantly fewer stochastic population-based methods
have been explored for locating the transition states, making these fertile area for research
investigations. Among the related stochastic optimization techniques proposed to-date, Chaudhury
et al. studied the simulated-annealing approach for locating saddle points [11] and extended their
efforts with the inclusion of gradients information [12]. Bungay et al. considered coupling the
genetic algorithm with eigenvalues information to bias the evolutionary search towards transition
states [13]. Despite their stochastic nature, these methods incidentally also require domain-specific
a priori information for their proper operations due to the vast search space involved. Furthermore,
it is worth noting that while stochastic search methods are able to reliably identify the regions of
interest where transition states may be located, they generally suffer from slow and inefficient
convergence to high-quality solutions.
Taking these cues, we present in this article a novel evolutionary Molecular Memetic Computing
(MMC) methodology requiring zero domain knowledge on the initial guesses of the transition states.
The essence of MMC lies on the tree-based representation of non-cyclic molecules and the covalentbond-driven evolutionary operators in addition to the typical backbone of memetic algorithms—the
population-based global search method and the individual-based life-time learning procedure. In this
article, we employ genetic algorithm [14] for the global search, Berny algorithm [15] for the
individual learning, and make use of the valley-adaptive clearing scheme [16] as the niching strategy
in the spirit of Lamarckian learning [17, 18]. Using the specialized representation and operators, the
MMC is capable of producing offspring that represent meaningful atomic configurations of a
molecule most—if not all—of the time. These in turn shall serve as good initial guesses of the
transition states. Additionally, we formulate fitness function that biases the memetic search towards
finding low-energy transition states. Experiments with five real non-cyclic molecules at the HartreeFock level of ab initio calculation employing the STO-3G basis set have been conducted to assess the
efficacy of the MMC against the recent advances of several state-of-the-art algorithms. Not only
does the MMC uncover the largest set of transition states, but it also incurs the least amount of
computational costs. Furthermore, the evolutionary memetic computing paradigm allows
knowledge discovery that would be valuable for subsequent studies [19].
In the remaining of this article, mathematical formulation of the problem of searching for the
transition states will be briefly discussed in Section 2. Algorithmic details of the MMC will then be
presented in Section 3. Experimental results and in-depth discussions will follow next in Section 4.
These include the presentation of knowledge discovery from the evolutionary data in the forms of
factors affecting the number of transition states, landscape analysis, as well as density distribution of
the transition states. Section 5 finally concludes this article and provides plausible immediate future
works.
2. Problem Formulation
Energy landscape has proven to be a useful conceptual framework in the field of protein folding and
molecular optimization [20–22]. The landscape can be formally defined as an ordered set of three
* |
+ is the set of all possible structural
(
), where
components, i.e.
configurations of a molecule with atoms,
represents the energy terms that serve as the
fitness function, and
denotes the distance measure between two structural
configurations in . The structural configuration space is therefore a set of physically consistent
three-dimensional molecular structures. Each structure
is a vector of
real numbers in
angstroms (Å), representing the Cartesian coordinates of each of the atoms of the molecule. The
potential energy function gives an indication of the height of the energy landscape. Herein, we
work with the first-principle calculations at the Hartree-Fock level using the STO-3G basis set [23]. To
measure the similarity between two structural configurations in , the Ultrafast Shape Recognition
(USR) [24, 25] is considered in this article.
Transition state denotes the configuration where the minimum energy barrier is needed to cross
from the low-energy vicinity of the reactant to the low-energy vicinity of the product. Transition
states play a key role in theoretical and physical chemistry. Their molecular structures and energy
values help in the identification of reaction rates [26] and minimum energy paths [27].
Mathematically, transition state can be defined as a stationary point on the landscape with only
one negative eigenvalue in the Hessian matrix [6], i.e.
* |
( )
+
( ). Scientists are generally interested on
where is an eigenvalue of the Hessian matrix
specific transition states with the following properties:

low-energy, i.e.
( )
where
is some user-defined threshold


| ( )
( )|
( )
unique, i.e.
where and are the maximum
acceptable similarities in the potential energy and the structural configuration spaces,
respectively
precise, i.e. ( )
3. Molecular Memetic Computing
As an emerging field, memetic computing has gained an increasing research attention in the recent
few decades with growing number of successes [28–36]. Inspired by Dawkins’ notion of a meme, it
was first introduced as the Memetic Algorithm (MA)—a marriage between population-based global
search and individual-based local search which respectively facilitate exploration and exploitation of
the search space. Not only MA is capable of escaping local optimum trap, but also it is converging
rapidly to some optimum with sufficient precision due to the life-time learning of the individual. Todate, MAs have been crafted for solving real-world problems in science and engineering more
efficiently [29, 31–32, 37–42].
In this section, specialized representation and operators for the evolution of molecular structure will
be presented. These are crucial for efficient discovery of the low-energy transition states of small
non-cyclic molecules using the molecular memetic computing (MMC) methodology. Specialized
fitness function will also be formulated to enable efficient selection using the stochastic roulettewheel procedure. Finally, the valley-adaptive clearing scheme [16] and Berny algorithm [15] for
maintaining diversity and enabling life-time individual learning, respectively, will be discussed briefly.
3.1. Tree-based Representation
The most straightforward representation of a molecular structure is undoubtedly the string of
concatenated Cartesian coordinates of all its atoms. However, various constraints of a molecular
system, such as covalent bonding, make such representation prone to evolving into nonsensical
structures. To alleviate this problem, we propose a tree-based representation of the molecular
structure where nodes represent atoms, links describes covalent bonds, and subtrees translates into
substructures as depicted in Figure 1. Furthermore, the tree-based representation allows the design
of specialized evolutionary operators that produces meaningful evolved structures most of the time
as will be discussed next.
xC1,yC1,zC1
xN,yN,zN
xH1,yH1,zH1
xH3,yH3,zH3
xH2,yH2,zH2
xH4,yH4,zH4
xC2,yC2,zC2
xH5,yH5,zH5
xH6,yH6,zH6
xC3,yC3,zC3
xH7,yH7,zH7
xH8,yH8,zH8
xC4,yC4,zC4
xO1,yO1,zO1
xO2,yO2,zO2
xH9,yH9,zH9
Figure 1. Tree-based Representation of Non-cyclic Molecule
3.2. Covalent-bond-driven Molecular Evolutionary Operators
3.2.1. Initialization
Constructing the tree-based representation given a seed molecular structure using the procedure
described above produces only one individual. To initialize a population, random sets of mutation
are applied to the seed individual such that multiple structures with different configurations are
achieved as depicted in Figure 2.
Figure 2. Initial Population of MMC
3.2.2. Crossover
Crossover between two molecular configurations aims at interchanging the substructures of
different molecular configurations such that favourable substructures are allowed to replicate across
generations. With reference to the tree-based representation, crossover can be easily performed by
choosing a random link as the crossover point following the alignment of the two molecular
configurations using Kabsch procedure [43] as illustrated in Figure 3. The resulted structures will
therefore most likely observe the constraints imposed by the covalent bonds. Even when the
constraints are not very strictly observed, the evolved structures could have been easily repaired.
chromosomes aligned
Figure 3. Covalent-bond-driven Crossover
3.2.3. Mutation
Mutation of a molecular configuration aims at effecting random changes to covalent bonds about
which rotation and along which stretching may be performed. With reference to the tree-based
representation, mutation can be easily attained by carrying out random rotation about or translation
(subject to minimum and maximum allowable bond lengths [44]) along a randomly chosen link as
shown in Figure 4. In this manner, the resulted structures will always observe the constraints
imposed by the covalent bonds.
(a) bond rotation
(b) bond stretching
Figure 4. Covalent-bond-driven Mutation
3.3. Transition State Fitness Function
The fitness function
is designed to evaluate quantitatively how suitable a generated structural
configuration of a molecule satisfies as a candidate solution of a low-energy transition state. It is
defined mathematically as
( )
( )
(
( )
)(|| |
|
)
where | | and denote the number of negative eigenvalues and a significant small value to prevent
division-by-zero error, respectively. The numerator serves to favour solutions that meet the userdefined low-energy threshold of
. The denominator, on the other hand, is designed to bias the
search towards transition states.
3.4. Valley-adaptive Clearing Scheme
To maintain diversity of the evolving populations, a valley-adaptive clearing scheme [16] is
incorporated to adapt to the non-uniform width of valleys in the energy landscape. Individuals in the
population are clustered into groups that share common valleys. The least fit individuals from each
group are then cleared by relocating them to random regions on the search space. The elite
individual per valley group will then be refined using Berny algorithm [15] described next in the spirit
of Lamarckian learning [17, 18].
3.5. Berny Algorithm
With its capability of converging to the transition states precisely, Berny algorithm enables life-time
individual learning of the elite individual in each valley group. Berny geometry optimization
algorithm is a modified version of the Schlegel algorithm [15] that operates by adjusting the signs of
inadequate eigenvalues. When multiple negative eigenvalues are encountered, all negative
eigenvalues are replaced by their absolute value except the one with the smallest magnitude. When
no negative eigenvalue is present, the sign of the least positive eigenvalue is negated. Following the
life-time individual learning, valley elites that satisfy as transition states are archived and checked
against possible duplicates.
4. Results and Discussions
To showcase the efficacy of the evolutionary MMC methodology detailed above, an experimental
study was performed at the Hartree-Fock level of ab initio calculation of the energy, derivatives, and
eigenvalues with STO-3G basis set. A population size of 10 individuals and an evolution length of
1,000 generations were assumed. As the molecules of interest are the following small non-cyclic
compounds (also shown in Figure 5):





GABA (gamma-aminobutyric acid): a chief inhibitory neurotransmitter that regulates
neuronal excitability in the mammalian central nervous system.
GABOB (gamma-amino-beta-hydroxybutyric acid or buxamin): a derivative of the
neurotransmitter GABA that works mainly as the anti-epileptic drug, offering significant destressing and anti-aging benefits.
Leucine: an essential amino acid important for haemoglobin formation that potently
activates the mammalian target of rapamycin kinase for cell growth regulation.
Tromethamine: an alkalizing agent and buffer in enzymatic reactions that affects the
balance of water and electrolytes in the body; also used in the synthesis of surface-active
agents and pharmaceuticals, in particular, the treatment of metabolic acidosis.
Valine: another essential amino acid characterized with stimulant activity that regulates the
immune function of the body and helps in the development and growth of the muscle as
well as tissue repair; also a main precursor in the penicillin biosynthetic pathway; has been
found useful in treatments involving muscle, mental, and emotional upsets and for insomnia
and nervousness.
(a) GABA
(c) Leucine
(b) GABOB
(d) Tromethamine
(e) Valine
Figure 5. Structural View of the Small Non-cyclic Molecules of Interest
Carbon atoms are represented in green, Nitrogen in blue, Oxygen in red, and Hydrogen in silver.
Figure 6 depicts the number of transition states discovered and the computational cost incurred by
different algorithms, namely the Stochastic Multi-start Local Search using Berny algorithm (SMLS)
[45] as the representative of single-ended methods, the GRRM [46] as the representative of doubleended methods, the Sequential-Niching Memetic Algorithm (SNMA) [30] that represents the recent
advances in niching and memetic algorithms, and finally the proposed MMC methodology with
specialized representation and evolutionary operators. As a double-ended method, the GRRM
requires the reactant and product information to uncover the transition states between them. In
contrast, the SMLS, the SNMA, and the MMC only need to provide initial guesses of the transition
states prior to executing the life-time individual learning procedure. Intuitively, the GRRM shall incur
less computational costs than the other three methods. However, it is witnessed from Figure 6 that
the GRRM is only more efficient than the SMLS and the SNMA. Without the tree-based
representation, initial guesses produced by the SMLS and the SNMA can easily be nonsensical
configurations that require considerable efforts to repair. With tree-based representation and
specialized evolutionary operators, the MMC is observed to be not only more efficient, but also
more effective than the GRRM as well as the SMLS and the SNMA in uncovering the transition states.
In term of number of transition states uncovered, the SNMA actually performs comparably to the
GRRM. The use of niching algorithm in SNMA allows selective life-time individual learning which in
turn gives way to a better exploration of the search space than in the case of SMLS. This
improvement is achievable without requiring the reactant and product information as with the case
of GRRM. In MMC, the use of valley-adaptive clearing scheme as the niching algorithm coupled with
the tree-based representation and the covalent-bond-driven evolutionary operators have further
improved the efficacy of using memetic computing methodology in the discovery of unique, lowenergy transition states of small, non-cyclic molecules.
Figure 6. Performance of SMLS, SNMA, GRMM, and MMC on GABA, GABOB, Leucine, Tromethamine, and Valine in
Terms of Number of Transition States Uncovered and Number of Energy Calculations Performed
With the significantly larger number of uncovered transition states, the use of evolutionary MMC
has furthermore allowed knowledge discovery from data generated over the course of the evolution
rather than requiring prior knowledge as with the case of the other methods. Table 1 tabulates the
characteristics of the uncovered transition states in terms of number of oxygen, carbon, and
hydrogen atoms as well as number of the total atoms and the rotatable bonds. Observation suggests
oxygen atoms play a key role in the flexibility of the molecule, allowing larger number of transition
states to be uncovered. Carbon atoms, in contrast, seem to limit the flexibility of the molecule,
decreasing the number of transition states to be uncovered. Meanwhile, hydrogen atoms are
observed not to play important role in the number of transition states uncovered. Additionally, it is
observed that the number of rotatable bonds play an obvious role in the number of transition states
uncovered while the total number of atoms has not shown any significant impact. The large number
of transition states of GABOB and thromethamine indicates the flexibility of the molecules, allowing
them to interact with broader range of molecular systems. GABOB as an anti-epileptic drug, for
example, is a more effective neuro-inhibitor than GABA from which it is derived. The flexibility that
GABOB can adopt allows it to interact more efficiently with larger set of GABA-ergic system
components in the central nervous systems.
Table 1. Characteristics of Uncovered Transition States in Terms of Number of Atoms, Number of Rotatable Bonds, and
Fitness-Distance Correlation
#transition_state
#oxygen
#carbon
#hydrogen
#total_atoms
#rotatable_bonds
FDC
GABA
256
2
4
9
16
3
26.57
GABOB
643
3
4
9
17
3
3.37
Leucine
305
2
6
13
22
3
54.48
Thrometamine
519
3
4
11
19
3
31.83
Valine
159
2
5
11
19
2
92.20
In Table 1, characteristics of the energy landscape of the molecules are also tabulated as FitnessDistance Correlation (FDC) [13] to measure the difficulty of the landscape. FDC is essentially the
Pearson product moment correlation between the energy differences and the structural differences
of the solutions to the lowest-energy transition states as shown below
(
(
) (
)
)
(
) and ( ) are the covariance and the standard deviation, respectively, while
where
and
represent the energy difference and the USR structure dissimilarity between each transition
state and the lowest-energy transition state, respectively. The FDC categorizes energy landscapes
into well-ordered, rough, or deceptive. High correlation with FDC value above 60 indicates wellordered landscape such that optimization methods can locate the transition states quite easily
whereas low correlation indicates rough landscape in which optimization techniques could be misled
to sub-optimal solutions. Negative correlation, meanwhile, indicates a deceptive landscape where
the lowest-energy transition state is located among the high energy transition states. From Table 1,
the non-cyclic molecules we investigate, except valine, are found to possess rough landscapes. Of
remarkable difficulty is the landscape of GABOB with FDC value of 3.37. The MMC has however
successfully uncovered around six to eight times more of its transition states than those found by
the other methods, owing the success to the tree-based representation and the covalent-bonddriven evolutionary operators. This further showcases the strengths of the evolutionary memetic
computing paradigm when coupled with the appropriate representation and operators such that
little or no effort would be spent exploring the region where no solution may be located.
With the large number of transition states uncovered, we further our analysis to include the
distribution of uncovered transition states by the MMC across the relative energy and the structure
dissimilarity, making reference to the lowest-energy transition states, as shown in Figure 7.
Distribution of the uncovered transition states of the molecules under investigation are depicted in
separate subgraphs using the scatter plots. Of remarkable achievement is the broad range of
transition states uncovered by the MMC as can be concluded from observing the x-axis of the scatter
plots. In other words, the MMC has been capable of discovering unique transition states with high
degree of structural dissimilarities. Observing the y-axis, on the other hand, it is noted that the
uncovered transition states are distinguishable into low- and high-energy transition states. The
presence of high-energy transition states in the region of low dissimilarities could easily hinder the
discovery of low-energy transition states in the region of high dissimilarities. Fortunately, this is not
the case with the MMC. The valley-adaptive clearing scheme used to maintain the population
diversity has indeed been an effective measure for the successful and efficacious exploration of the
energy landscape. Together with the tree-based representation and the covalent-bond-driven
evolutionary operators, the MMC has incurred the least amount of computational costs in addition
to uncovering the largest set of transition states. This shows the evolutionary memetic computing
paradigm is indeed a powerful and efficient tool for solving complex problems when equipped with
appropriate representation and operators.
(a) GABA
(c) Leucine
(b) GABOB
(d) Thrometamine
(e) Valine
Figure 7. Distribution of Uncovered Transition States across Relative Energy and Structure Dissimilarity with reference to
the Lowest-energy Transition State
5. Conclusion and Future Works
With the importance of small molecules in biology, chemistry, and medicine, this article has been
utterly dedicated to the discovery of the transition states of small non-cyclic molecules. The
mathematical formulation and a brief description of the nonlinear programming problem of finding
the transition states have been presented. A novel Molecular Memetic Computing (MMC)
methodology that requires no prior knowledge for the discovery of transition states has been
proposed. At the heart of the method are the tree-based representation of non-cyclic molecules and
the covalent-bond-driven evolutionary operators. In the spirit of Lamarckian learning [17, 18], the
specialized representation and operators are used to complement the memetic computing
technique comprising a genetic algorithm [14] for the population-based global search, the Berny
algorithm [15] for the life-time individual learning, and the valley-adaptive clearing scheme [16] for
maintaining the population diversity.
The evolutionary MMC methodology shows promising application of memetic computing techniques
in the field of molecular optimization by demonstrating excellent efficacy against several state-ofthe-art approaches. Not only did the MMC uncover the largest number of transition states, but it
also incurred the least amount of computational costs. The use of evolutionary MMC, furthermore,
allows knowledge discovery from data generated over the course of the evolution rather than
requiring prior knowledge as with the case of many earlier methods [6–13]. It has been found out
that oxygen atoms play a key role in the flexibility of the molecule, allowing larger number of
transition states to be uncovered. In contrast, carbon atoms have been observed to limit the
flexibility of the molecule, decreasing the number of transition states to be uncovered. Meanwhile,
hydrogen atoms have not been observed to play any important role in the number of transition
states uncovered. Additionally, it has also been found out—as one would expected—that the
number of rotatable bonds play an obvious role in the number of transition states uncovered while
the total number of atoms has not shown any significant impact. On a side note, large number of the
transition states of GABOB and thromethamine suggests the flexibility of the molecules, allowing
them to interact with broader range of molecular systems. GABOB as an anti-epileptic drug, for
instance, is a more effective neuro-inhibitor than GABA, from which it is derived. The flexibility that
GABOB can adopt allows it to interact more efficiently with larger set of GABA-ergic system
components in the central nervous systems.
With the generic tree-based representation, the MMC can be easily adapted for subsequent studies
of other small non-cyclic molecules that are important in biology, chemistry, and medicine. To cover
molecules that involve ring structures, an extension to the tree-based representation shall be
studied. Extensions to the current covalent-bond-driven evolutionary operators shall also be
devised. To allow working with larger molecules, techniques to intelligently restrict the execution of
individual learning procedure [47–50] shall be explored. Finally, Web Service that would allow
scholars to search for the transition states of molecular systems of their choices shall be developed.
In the end, our aim shall be to paves the way to establish the landscape connectivity graph, the
master piece of the comprehensive chemical reactions.
References
1. Chen, J., S.J. Swamidass, Y. Dou, J. Bruand, and P. Baldi, “ChemDB: A Public Database of
Small Molecules and Related Chemoinformatics Resources,” Bioinformatics, 21(22),
pp.4133–4139, 2005.
2. Lagorce, D., T. Pencheva, B.O. Villoutreix, and M.A. Miteva, “DG-AMMOS: A New Tool to
Generate 3D Conformation of Small Molecules Using Distance Geometry and Automated
Molecular Mechanics Optimization for in silico Screening,” BMC Chemical Biology, 9(1),
2009.
3. Sperandio, O., M. Souaille, F. Delfaud, M.A. Miteva, and B.O.Villoutreix, “MED-3DMC: A New
Tool to Generate 3D Conformation Ensembles of Small Molecules with A Monte Carlo
Sampling of the Conformational Space,” European Journal of Medicinal Chemistry, 44(4),
pp.1405–1409, 2009.
4. Miteva, M.A., F. Guyon, and P. Tufféry, “Frog2: Efficient 3D Conformation Ensemble
Generator for Small Compounds,” Nucleic Acids Research, 38, pp.W622–W627, 2010.
5. Ellabaan, M.M.H., Y.S. Ong, M.H. Lim, and J.L. Kuo, “Finding Multiple First-order Saddle
Points Using A Valley-adaptive Clearing Genetic Algorithm,” in Proceedings of 2009 IEEE
International Symposium on Computational Intelligence in Robotics and Automation,
pp.457–462, 2009.
6. Trygubenko, S.A. and D.J. Wales, “A Doubly Nudged Elastic Band Method for Finding
Transition States,” Journal of Chemical Physics, 120(5), pp.2082–2094, 2004.
7. Carr, J.M., S.A. Trygubenko, and D.J. Wales, “Finding Pathways Between Distant Local
Minima,” Journal of Chemical Physics, 122(23), pp.234903–234909, 2005.
8. Koslover, E.S. and D.J. Wales, “Comparison of Double-ended Transition State Search
Methods,” Journal of Chemical Physics, 127(13), pp.134102–134113, 2007.
9. Sheppard, D., R. Terrell, and G. Henkelman, “Optimization Methods for Finding Minimum
Energy Paths,” Journal of Chemical Physics, 128(13), pp.134106–134115, 2008.
10. Olsen, R.A., G.J. Kroes, G. Henkelman, A. Arnaldsson, and H. Jónsson, “Comparison of
Methods for Finding Saddle Points without Knowledge of the Final States,” Journal of
Chemical Physics, 121(20), pp.9776–9792, 2004.
11. Chaudhury, P. and S.P. Bhattacharyya, “A Simulated Annealing Based Technique for Locating
First-order Saddle Points on Multidimensional Surfaces and Constructing Reaction Paths:
Several Model Studies,” Journal of Molecular Structure: THEOCHEM, 429, pp.175–186, 1998.
12. Chaudhury, P., S.P. Bhattacharyya, and W. Quapp, “A Genetic Algorithm Based Technique for
Locating First-order Saddle Point Using A Gradient-dominated Recipe,” Chemical Physics,
253(2–3), pp.295–303, 2000.
13. Bungay, S.D., R.A. Poirier, and R.J. Charron, “Optimization of Transition State Structures
Using Genetic Algorithms,” Journal of Mathematical Chemistry, 28(4), pp.389–401, 2000.
14. Holland, J.H., Adaptation in Natural and Artificial Systems, MIT Press, 1992.
15. Schlegel, H.B., “Optimization of Equilibrium Geometries and Transition Structures,” Journal
of Computational Chemistry, 3(2), pp.214–218. 1982.
16. Ellabaan, M.M.H. and Y.S. Ong, “Valley-adaptive Clearing Scheme for Multimodal
Optimization Evolutionary Search,” in Proceedings of the Ninth International Conference on
Intelligent Systems Design and Applications, pp.1–6, 2009.
17. Ong, Y.S. and A.J. Keane, “Meta-lamarckian Learning in Memetic Algorithm,” IEEE
Transactions on Evolutionary Computation, 8(2), pp.99–110, 2004.
18. Le, M.N., Y.S. Ong, Y. Jin, and B. Sendhoff, “Lamarckian Memetic Algorithms: Local Optimum
and Connectivity Structure Analysis,” Memetic Computing, 1(3), pp.175–190, 2009.
19. Ang, J.H., K.C. Tan, and A.A. Mamun, “An Evolutionary Memetic Algorithm for Rule
Extraction,” Expert Systems with Applications, 37(2), pp.1302–1315, 2010.
20. Brooks, C.L. 3rd, J.N. Onuchic, and D.J. Wales, “Statistical Thermodynamics. Taking A Walk on
A Landscape,” Science, 293(5530), pp.612–613, 2001.
21. Wales, D.J., Energy Landscapes: Applications to Clusters, Biomolecules and Glasses,
Cambridge University Press, 2003.
22. Soh, H., Y.S. Ong, Q.C. Nguyen, Q.H. Nguyen, M.S. Habibullah, T. Hung, and J.L. Kuo,
“Discovering Unique, Low-Energy Pure Water Isomers: Memetic Exploration, Optimization,
and Landscape Analysis,” IEEE Transactions on Evolutionary Computation, 14(3), pp.419–
437, 2010.
23. Jensen, F., Introduction to Computational Chemistry, Wiley, 1999.
24. Zhou, T., K. Lafleur, A. Caflisch, “Complementing Ultrafast Shape Recognition with An Optical
Isomerism Descriptor,” Journal of Molecular Graphics and Modelling, 29(3), pp.443–449,
2010.
25. Ballester, P.J. “Ultrafast Shape Recognition: Method and Applications,” Future Medicinal
Chemistry, 3(1), pp.65–78, 2011.
26. Truong, T.N. and D.G. Truhlar “Ab initio Transition State Theory Calculations of the Reaction
Rate for OH+CH4→H2O+CH3,” Journal of Chemical Physics, 93(3), pp.1761–1769, 1990.
27. Henkelman, G. and G. Jóhannesson, “Methods for Finding Saddle Points and Minimum
Energy Paths,” Progress in Theoretical Chemistry and Physics, 5, pp.269–302, 2002.
28. Moscato, P., “On Evolution, Search, Optimization, Genetic Algorithms, and Martial Arts:
Towards Memetic Algorithms,” Caltech Concurrent Computation Program, 1989.
29. Neri, F., J. Toivanen, G. Cascella, and Y.S. Ong, “An Adaptive Multimeme Algorithm for
Designing HIV Multidrug Therapies,” IEEE/ACM Transactions on Computational Biology and
Bioinformatics, 4(2), pp.264–278, 2007.
30. Vitela, J.E. and O. Castanos, “A Real-coded Niching Memetic Algorithm for Continuous
Multimodal Function Optimization,” in Proceedings of 2008 IEEE Congress on Evolutionary
Computation, pp.2170–2177, 2008.
31. Ong, Y.S., M.H. Lim, and X.S. Chen, “Memetic Computation—Past, Present, & Future,” IEEE
Computational Intelligence Magazine, 5(2), pp.24–36, 2010.
32. Chen, X.S., Y.S. Ong, M.H. Lim, and K.C. Tan, “A Multi-facet Survey on Memetic
Computation,” IEEE Transactions on Evolutionary Computation, 15(5), pp.591–607, 2011.
33. Cheng, H.C., T.C. Chiang, and L.C. Fu, “A Two-stage Hybrid Memetic Algorithm for
Multiobjective Job Shop Scheduling,” Expert Systems with Applications, 38(9), pp.10983–
10998, 2011.
34. Zhang, J., Z. Zhan, Y. Lin, N. Chen, Y. Gong, J. Zhong, H.S.H. Chung, Y. Li, and Y. Shi,
"Evolutionary Computation Meets Machine Learning: A Survey," IEEE Computational
Intelligence Magazine, 6(4), pp.68–75, 2011.
35. Jin, Y., K. Tang, X. Yu, B. Sendhoff, and X. Yao, “A Framework for Finding Robust Optimal
Solutions Over Time,” Memetic Computing, in Press.
36. Le, M.N., Y.S. Ong, S. Menzel, Y. Jin, and B. Sendhoff, "Evolution by Adapting Surrogates,"
Evolutionary Computation, in Press.
37. Chia, J.Y., C.K. Goh, K.C. Tan, and V.A. Shim, "Memetic Informed Evolutionary Optimization
via Data Mining," Memetic Computing, 3(2), pp.73–87, 2011.
38. Damas, ., O. Cord n, and . antamar a, edi a Ima e Re istration sin
o tionary
Computation: An Experimental Survey," IEEE Computational Intelligence Magazine, 6(4),
pp.26–42, 2011.
39. Meng, Y., Y. Zheng, and Y. Jin, "Autonomous Self-reconfiguration of Modular Robots by
Evolving A Hierarchical Mechanochemical Model," IEEE Computational Intelligence
Magazine, 6(1), pp.43–54, 2011.
40. Shim, V.A., K.C. Tan, J.Y. Chia, and J.K. Chong, "Evolutionary Algorithms for Solving Multiobjective Travelling Salesman Problems," Flexible Services and Manufacturing, 23(2),
pp.207–241, 2011.
41. Le, M.N., Y.S. Ong, Y. Jin, and B. Sendhoff, "Gene Meets Meme in Computational
Intelligence: A Theoretic Modeling of Symbiotic Evolution," IEEE Computational Intelligence
Magazine, 7(1), pp.20–35, 2012.
42. Thomas S.A. and Y. Jin, "Evolving Connectivity Between Genetic Oscillators and Switches
Using Evolutionary Algorithms," Journal of Bioinformatics and Computational Biology, in
Press.
43. Kabsch, W., “A Solution for the Best Rotation to Relate Two Sets of Vectors,” Acta
Crystallographica, 32(5), pp.922–923, 1976.
44. Weast, R.C., Handbook of Chemistry and Physics, CRC Press, 1984.
45. Kok, S. and C. Sandrock, “Locating and Characterizing the Stationary Points of the Extended
Rosenbrock Function,” Evolutionary Computation, 17(3), pp.437–453, 2009.
46. Maeda, S., Y. Matsuda, S. Mizutani, A. Fujii, and K. Ohno, “Long-range Migration of A Water
Molecule to Catalyze A Tautomerization in Photoionization of the Hydrated Formamide
Cluster,” Journal of Physical Chemistry A, 114(44), pp.11896–11899, 2010.
47. Handoko, S.D., C.K. Kwoh, and Y.S. Ong, “Using Classification for Constrained Memetic
Algorithm: A New Paradigm,” in Proceedings of IEEE International Conference on Systems,
Man and Cybernetics, pp.547–552, 2008.
48. Handoko, S.D., C.K. Kwoh, and Y.S. Ong, “Classification-assisted Memetic Algorithms for
Equality-constrained Optimization Problems,” Lecture Notes in Computer Science, 5866,
pp.391–400, 2009.
49. Handoko, S.D., C.K. Kwoh, and Y.S. Ong, “Feasibility Structure Modeling: An Effective
Chaperone for Constrained Memetic Algorithms,” IEEE Transactions on Evolutionary
Computation, 14(5), pp.740–758, 2010.
50. Handoko, S.D., C.K. Kwoh, Y.S. Ong, and J. Chan, “Classification-assisted Memetic Algorithms
for Solving Optimization Problems with Restricted Equality Constraint Function Mapping,” in
Proceedings of 2011 IEEE Congress on Evolutionary Computation, pp.1209–1216, 2011.
Download