Document 10842988

advertisement
Hindawi Publishing Corporation
Computational and Mathematical Methods in Medicine
Volume 2012, Article ID 469769, 9 pages
doi:10.1155/2012/469769
Research Article
Predictive Models for Maximum Recommended Therapeutic Dose
of Antiretroviral Drugs
Michael Lee Branham,1 Edward A. Ross,2 and Thirumala Govender1
1 School
2 School
of Pharmacy and Pharmacology, University of KwaZulu-Natal, Durban 4001, South Africa
of Medicine, University of Florida, Gainesville, FL 32601, USA
Correspondence should be addressed to Michael Lee Branham, malimahweh@gmail.com
Received 11 September 2011; Revised 4 November 2011; Accepted 18 November 2011
Academic Editor: John Hotchkiss
Copyright © 2012 Michael Lee Branham et al. This is an open access article distributed under the Creative Commons Attribution
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly
cited.
A novel method for predicting maximum recommended therapeutic dose (MRTD) is presented using quantitative structure
property relationships (QSPRs) and artificial neural networks (ANNs). MRTD data of 31 structurally diverse Antiretroviral drugs
(ARVs) were collected from FDA MRTD Database or package inserts. Molecular property descriptors of each compound, that is,
molecular mass, aqueous solubility, lipophilicity, biotransformation half life, oxidation half life, and biodegradation probability
were calculated from their SMILES codes. A training set (n = 23) was used to construct multiple linear regression and back
propagation neural network models. The models were validated using an external test set (n = 8) which demonstrated that MRTD
values may be predicted with reasonable accuracy. Model predictability was described by root mean squared errors (RMSEs),
Kendall’s correlation coefficients (tau), P-values, and Bland Altman plots for method comparisons. MRTD was predicted by a
6-3-1 neural network model (RMSE = 13.67, tau = 0.643, P = 0.035) more accurately than by the multiple linear regression
(RMSE = 27.27, tau = 0.714, P = 0.019) model. Both models illustrated a moderate correlation between aqueous solubility of
antiretroviral drugs and maximum therapeutic dose. MRTD prediction may assist in the design of safer, more effective treatments
for HIV infection.
1. Introduction
Acquired immunodeficiency syndrome (AIDS) is a degenerative disease of the immune and central nervous systems caused by the human immunodeficiency virus (HIV).
There are an estimated 33.2 million people living with
HIV/AIDS globally [1–3]. Of this number, 22.5 million
are in Sub-Saharan Africa, which represents 67.8% of
the global number [3]. Antiretroviral drugs (ARVs) may
be classified as nucleoside reverse transcriptase inhibitors
(NRTIs), nucleotide reverse transcriptase inhibitors (NtRTI),
non-nucleoside reverse transcriptase inhibitors (NNRTIs),
protease inhibitors (PI), and more recently as fusion or integrase inhibitors [4, 5]. Since most ARVs have low aqueous
solubility and poor bioavailability, several alternative drug
delivery strategies have been proposed to optimize systemic
concentrations [6, 7]. The important biopharmaceutical
properties that need to be considered for effective ARV
delivery systems might include solubility, pKa, lipophilicity,
permeability, stability in biological fluids, gastrointestinal
metabolism, and where possible viral reservoir targeting [6,
8, 9]. To overcome suboptimal biopharmaceutical properties,
ARVs are often prescribed at high daily doses which increase
the occurrence of adverse side effects and toxicities [10, 11].
Combination therapy, comprising at least three anti-HIV
drugs, has become a standard treatment of AIDS [12], but
here again the potential for adverse side effects and drugrelated noncompliance increases. To address these issues,
computational methods have been used to predict doselimiting toxicities of a few antiretroviral drugs [13, 14] or to
optimize ARV formulations [15, 16]. The ability to predict
maximum therapeutic dose directly from molecular structure is both clinically and scientifically attractive in terms
of treatment management and reducing drug development
costs [17]. Unfortunately, such models for drugs used in
the treatment of AIDS do not yet exist. Accurate prediction
2
Computational and Mathematical Methods in Medicine
of the MRTD for antiretroviral type compounds would be
particularly useful in formulation studies so that clinically
relevant extrapolations on drug dissolution and permeability
can be made earlier in the drug development process [17–
20]. Several recent studies have been conducted to define
a relationship between the dose and physicochemical properties of the drug [20, 21], or to investigate the underlying
mechanisms of drug toxicity and bioaccumulation [20, 22].
Still, the prediction of optimal dose continues to challenge
pharmaceutical scientists because of its complexity and
variability between different organisms. Artificial neural networks (ANNs) have emerged as a powerful tool suitable for
processing complex relationships between molecular stimuli
and biological system responses [23]. Examples include
prediction of warfarin maintenance dose [24], gentamicin
steady-state plasma concentrations [25], skin permeability
[26], and prediction of HIV drug resistance [27]; supporting
data for these and other studies suggest the utility of neural
network modeling for predicting maximum therapeutic
doses.
We believed that since the MRTD estimates are derived
from human data, they would provide a more relevant,
accurate, and specific estimate for toxic dose levels compared
to risk assessment models based on animal data alone. In this
article, we predict the MRTD of antiretroviral drugs from
their molecular structures using relevant molecular property
descriptors and neural network software as a data mining
tool. Predictive performance of the models were evaluated
and statistically compared with the results obtained clinically
or reported in the literature. The application of predictive
models in the design of safe, effective antiretroviral drug
delivery systems is discussed.
Bias
2. Materials and Methods
2.1. Chemoinformatic Software and Modeling Tools. The
physicochemical descriptors, molecular weight (MW), aqueous solubility (ASol), and lipophilicity (AlogP) were determined using ALOGPS 2.1. Virtual Computational Chemistry
Laboratory, (http://www.vcclab.org/) [28, 29]. Bioaccumulation descriptors, log biotransformation half life (logBioHL), oxidation half life (OxidHL), and biodegradation probability (P[BD]) were determined using EPI Suite
v.410 (http://www.epa.gov/oppt/sf/tools/methods.htm) [30,
31]. All inferential statistics and MRTD data analysis was
performed using MedCalc v.12 (MedCalc Software bvba,
Belgium). Artificial neural network analysis was performed
using Tiberius Data Mining Software v.6.1.9 (Tiberius Data
Mining Software Ltd. Pty, UK).
2.2. MRTD Training and Validation Datasets. The MRTD
of 31 structurally diverse antiretroviral drugs were taken
from the FDA MRTD database or package inserts. This
“clinical MRTD” dataset was randomly split into training
and validation subsets as shown in Table 1. Subsequently,
each of the calculated descriptors and clinical MRTD values
were correlated by multiple linear regression analysis and
the results used to identify statistically significant property
descriptors. An error back propagation algorithm was used
MW
ASol
AlogP
logBioHL
Clinical MRTD
P[BD]
OxidHL
Figure 1: A neural network model was constructed with (2) hidden
neuron, (1) output variable that is Clinical MRTD, and (6) input
variables that is, OxidHL, P[BD], logBioHL, AlogP, ASol, MW. Total
number of patterns (23) were loaded in the data of which 23 were
complete and available for training. Two nonlinear neurons were
used and the model error minimization was stable for 20 minutes.
for network training, with learning rate set at 0.7. A tangent
sigmoid transfer function on the first layer and two neurons
with a nonlinear transfer function on the hidden layer were
minimalistic structures used to reduce over-fitting. Results of
MRTD versus molecular property descriptors with multiple
correlation coefficients are listed in Table 2. For the neural
network model training, correlation and error statistics are
listed in Table 3.
2.3. Model Validation and Statistical Comparisons. The predictability of each of the multiple linear regression (MLR)
and neural network (TNN) models was evaluated by a crossvalidation procedure [32]. Each model was constructed on
the basis of the same training dataset and was subsequently
used to predict the excluded test data. Statistical comparisons
were performed between the clinical MRTD and predicted
MRTD values using the root mean squared error (RMSE),
Kendall’s correlation coefficient (tau), P-value, and Bland
Altman plots for methods comparisons.
3. Results
3.1. Model Predicted versus Actual MRTD Estimates. In this
multivariable system, a quantitative relationship between
certain molecular property descriptors and maximum recommended therapeutic dose was characterized using two
datasets (i.e., MRTD and TEST). A multiple linear regression
(Table 2) and a 6-2-1-neural network model (Figure 1) for
the prediction ARVs maximum dose were constructed.
Table 1 lists summary statistics for the molecular property
descriptors and corresponding MRTD values in each dataset.
The ARVs in this study, although few in number, covered
a broad range (CV equal to 40% or greater) in terms of
the physicochemical (MW, ASol, AlogP) and bioaccumulation properties (logBioHL, OxidHL, P[BD]) included. A
significant difference in mean ASol between training set
(6.1489 g/L) and TEST set (0.436 g/L) was noted although
corresponding changes in lipophilic character were not as
great. Multiple linear regression analysis was performed with
results listed in Table 2. A statistically significant but modest
correlation between two of the six descriptors, that is, P[BD]
Computational and Mathematical Methods in Medicine
3
Table 1: Clinical MRTD data consisted of 23 training set compounds (italic) and 8 test set compounds (bold). Mean and standard deviation
statistics are shown indicating the distribution and central tendency of each molecular property descriptor.
Drug
Acyclovir
Ancitabine
Delaviridine
Didanosine
Famciclovir
Foscarnet
Indinavir
Lofexidine
Lamivudine
Rimantadine
Ribavirin
Valacyclovir
Zalcitibine
Zanamivir
Zidovudine
Saquinavir
Darunavir
Tipranavir
Ritonavir
Maraviroc
Tenofovir
Nelfinavir
Stavudine
n
mean
SD
CV%
Abacavir
Emtricitabine
Raltegravir
Nevirapine
Efavirenz
Fosamprenavir
Atazanavir
Lopiravir
n
mean
SD
CV%
MRTD (mg/kg/day)
13.30
20.00
6.67
6.67
25.00
120.00
16.70
0.040
5.00
3.33
200.00
50.00
0.0375
0.333
10.00
33.330
53.00
6.670
20.00
20.00
10.00
25.00
1.333
23.00
28.54
45.53
159.52
10.00
4.00
30.07
3.000
10.10
46.70
5.35
53.33
8.00
20.32
20.29
99.86
OxidHL
(days)
0.135
0.139
0.034
0.140
0.051
13.37
0.038
0.123
0.06
0.171
0.264
0.044
0.096
0.038
0.139
0.052
0.094
0.043
0.105
0.129
0.048
0.055
0.088
23.00
0.672
2.769
412.00
0.040
0.08
0.094
0.167
0.254
0.067
0.109
0.103
8.00
0.114
0.067
58.99
P[BD] (%)
(Rz = 0.427, P = 0.035), ASol (Rz = 0.476, P = 0.014)
and actual MRTD value is noted. MRTD values appear
to increase with P[BD], ASol, and OxidHL; but decrease
with increasing logBioHL, AlogP, or MW. The multiple
correlation coefficient (MCC = 0.7727), residual error (RSD
= 33.89), and ANOVA (P = 0.013) indicate acceptable
predictability of the multiple regression model for ARVs
0.5475
0.2005
0.0014
0.5085
0.991
0.772
0.0501
0.0792
0.0719
0.4624
0.8963
0.944
0.0909
0.8247
0.0432
0.9780
0.0001
0.0000
0.9488
0.0700
0.0016
0.6868
0.0768
23.00
0.402
0.394
97.00
0.0229
0.0566
0.0006
0.0473
0.0001
0.0007
0.0202
0.9933
8.00
0.143
0.344
241.300
logBioHL
(days)
−2.2874
−2.418
−2.3428
−1.9956
−3.6235
−2.4547
−4.6478
0.9861
−3.9261
0.4539
−2.8715
−3.3309
−3.5083
−4.4141
−3.0445
−3.9394
−1.8175
−0.0265
−4.61
−0.2247
−3.0098
−1.6709
−3.082
23.00
−2.5133
1.5738
−62.620
−2.1757
−3.7784
−2.6157
−3.7336
0.1111
−2.4135
−3.4392
−3.3281
8.00
−2.672
1.278
−47.830
AlogP (:)
ASol (g/L)
MW (Da)
−1.45
8.65
3.20
0.086
6.43
1.32
16.76
0.048
0.15
2.76
0.009
33.17
1.49
7.05
1.49
16.35
0.002
0.067
0.0002
0.0012
0.0106
1.87
0.0002
40.51
23.00
6.15
10.90
177.25
1.21
2.00
0.095
0.10
0.008
0.068
0.003
0.002
8.00
0.436
0.753
173.000
225.21
225.21
456.57
236.23
321.38
126.01
613.81
259.14
229.26
179.31
244.21
326.41
211.22
332.32
267.28
670.84
547.66
602.66
720.94
513.66
287.21
567.78
224.21
23.00
362.72
178.20
48.86
286.38
247.28
444.47
266.33
315.67
585.68
704.96
614.86
8.00
433.20
180.48
41.66
−2.62
2.77
−1.26
0.13
−1.63
3.26
3.31
−1.29
3.28
−1.92
−1.03
−1.29
−2.29
−0.1
4.04
1.76
5.71
4.24
4.3
−1.51
6.00
−0.8
23.00
0.939
2.809
299.04
0.61
−0.8
1.7
1.75
3.88
0.84
4.37
4.07
8.00
2.05
1.878
91.50
maximum dose. A multiple linear regression equation used
for the prediction of MRTD is as follows:
−34.3303 − 12.20 ∗ AlogP + 2.1249 ∗ (ASol)
+15.90 ∗ logBioHL + 0.2159 ∗ (MW)
+5.659 ∗ (OxidHL) + 46.5133 ∗ (P[BD]).
(1)
4
Computational and Mathematical Methods in Medicine
Table 2: The results of the multiple linear regression analysis for antiretroviral drugs versus MRTD. In the training, dataset only two of
the six molecular descriptors showed a statistically significant correlation with therapeutic dose, that is, P[BD] (P = 0.0349) and ASol (P =
0.0140). A multiple regression model equation coefficients are listed with there standard errors, coefficient of determination (R2 = 0.5970),
multiple correlation coefficient (MCC = 0.7727) and residual standard deviation (RSD = 33.89).
Multiple linear regression
independent variables
Constant
AlogP
ASol
logBioHL
MW
OxidHL
P[BD]
ANOVA
n = 23
Coefficient
−34.3303
−12.1995
2.1239
15.9000
0.2159
5.6589
46.5133
F-ratio = 3.9507
R2 = 0.5970
SE
6.6976
0.7697
8.0377
0.1015
2.8478
20.1746
RSD = 33.89
P-value
MCC = 0.7727
Rz
0.0873
0.0140
0.0654
0.0494
0.0643
0.0349
P = 0.013
−0.221
0.476
−0.071
−0.116
0.448
0.427
Table 3: The results of the neural network analysis for antiretroviral drugs versus MRTDs. All molecular descriptors show weak correlation
with MRTD except for ASol, OxidHL, and P[BD]. The 6-2-1 neural network model predicted training set MRTD values with high accuracy
(R2 = 0.992, MAX = 13.64, P < 0.001). No multicollinearity between independent variables was observed in the training set.
n = 23
R2 = 0.992
P < 0.001
Correlation coefficients
−0.221
0.476
−0.071
−0.116
0.448
0.427
F-ratio = 1340.73
6-2-1 neural network
Model versus clinical MRTD
Independent variables
AlogP
ASol
logBioHL
MW
OxidHL
P[BD]
ANOVA
The results of the neural network model for antiretroviral
MRTDs is shown in Table 3. All molecular descriptors show
weak correlation with MRTD except ASol, OxidHL, and
P[BD]. The 6-2-1 neural network predicted training set
MRTD values with high accuracy (R2 = 0.992, MAX =
13.64, P < 0.001). No multicollinearity between independent
variables was observed in the training set. SPSS code for the
6-2-1 neural network may be executed as follows:
COMPUTE Var1 = ((OxidHL
(−0.543086558961242)) +
3.63976611815824)
∗
+ ((Pr BD ∗ (−5.67413921537257)) +
2.81153598121711)
+ ((LogBioHL ∗ (0.40828801067156)) +
0.747514104338025)
MAX = 13.64
RMSE = 5.53
Learning rate = 0.700
R2
0.049
0.226
0.005
0.013
0.201
0.182
P < 0.001
COMPUTE Var2 = ((OxidHL ∗
(8.23491975851945E−02)) 0.551904322215974)
+ ((Pr BD ∗ (3.12529587401663)) 1.54858410557524)
+ ((LogBioHL ∗ (0.431493610228662)) +
0.790000076287146)
+ ((ALogP ∗ (−0.04107624611641)) +
6.94188559367329E−02)
+ ((Sol ∗ (4.12997852313067E−02)) −
0.836531279838639)
+ ((MW ∗ (1.05078568602304E−03)) −
0.444983569959981)
+ 9.28741349960935E-02.
+ ((ALogP ∗ (−0.848110602763694)) +
1.43330691867064)
COMPUTE Var3 = −0.535249116383622.
+ ((Sol ∗ (−2.55399028293542E−02)) +
0.517313285798853)
COMPUTE Var1 = 0.108990429907616
Execute.
∗
∗
Var1.
+ ((MW ∗ (1.40343495376914E−02)) −
5.94322423917294)
COMPUTE Var2 = 1.44376333322051
(Exp(Var2) − Exp(-Var2)) / (Exp(Var2) +
Exp(−Var2)).
+ 3.86733992096867.
COMPUTE Var3 = Var3.
Computational and Mathematical Methods in Medicine
5
Table 4: Test dataset Goodness of fit comparisons. Clinical MRTD values from 8 ARVs were using as and external test dataset the validate
model predictability. Model performance is characterized here in terms of root means squared error (RMSE), Kendall’s correlation coefficient
(tau), and Type II error probability (P-value).
MRTD
10.00
4.00
30.07
3.00
10.10
46.70
5.35
53.33
20.32
MLR
−10.6738
−23.9239
0.0628
−54.1862
−10.2863
44.0515
11.4360
42.6361
−0.1105
0.714
0.019
Execute.
COMPUTE Tiberius MRTD = ((((Var1 +
Var2 + Var3)/2.0) + 0.5) ∗ 199.9625)
+ 0.0375.
Execute.
3.2. Model Validation and Statistical Comparisons. Each of
the models was then validated using external TEST MRTD
dataset and a cross-validation procedure. Model “goodness
of fit” and predictability are summarized in Table 4. RMSE of
the 6-2-1 neural network (RMSE = 13.67) was substantially
less the multiple linear regression model (RMSE = 27.27).
Comparison of model predictivity was confirmed with TNN
having maximum squared error (SE = 601.23) compared to
MLR (SE = 3270.26). Model Figure 3 correlation with the
clinical MRTD values using Kendall’s correlation coefficient
was, however, greater for the MLR (tau = 0.714, P = 0.019)
than the resultant 6-2-1 neural network model (tau = 0.643,
P = 0.035). Bland Altman plots for methods comparison
further illustrate the predictive value of the neural network
(TNN) and multiple linear regression (MLR) models. The
upper and lower limits of agreement for the MLR (58.3,
−17.4) and TNN (29.8, −27.3) are illustrated in Figure 2.
While it can be seen that all of the differences lie between
these limits, MLR model limits of agreement were wider
than the TNN model, which were more narrow and nearly
symmetrical.
4. Discussion
MRTD values and SMILES codes for antiretroviral drugs
were collected from the FDA MRTD database which
is a highly reliable source pharmacologic activity based
on extensive clinical evidence (http://www.fda.gov/cder/).
The ALOGPS+ program was used for the calculation of
molecular mass (MW), solubility (ASol), and lipophilicity
(AlogP) without alteration. The precision and robustness of
these tools are well established in the pharmaceutical and
modeling community and its predictive power concerning
SE
427.4060
779.7441
900.4320
3270.2614
415.6012
7.0145
37.0393
114.3594
RMSE = 27.27
TNN
6.4861
3.3678
19.7700
−16.5700
−5.7782
50.7500
29.8700
64.5900
19.06
0.643
0.035
SE
12.3474
0.3996
106.0900
382.9849
252.1172
16.4025
601.2304
126.7876
RMSE = 13.67
10
10.1 30.07
Clinical MRTD
46.7
80
60
40
Predicted MRTD
Drug
Abacavir
Emtricitabine
Raltegravir
Nevirapine
Efavirenz
Fosamprenavir
Atazanavir
Lopiravir
Mean
Kendall’s tau
P-value
20
0
−20
−40
−60
3
4
5.35
53.33
MLR
TNN
Figure 2: Method comparisons graph. MRTD values predicted by
multiple linear regression (hollow squares, ) nearly traced those
values predicted by the 6-2-1 neural network (solid squares, ) as
shown. Multiple linear regression estimates for MRTD were consistently lower that those predicted by the neural network model.
the input parameters in question are published on the
company website (http://www.vcclab.com/) and cited in the
manuscript [28, 29, 33]. Independent validation of the
parameters calculated with ALOGPS has been published
[33, 34] and the estimates were found to be appropriately
accurate for the use intended. The bioaccumulation input
parameters, oxidation half life (oxidHL), log biotransformation half life (logBioHL), and biodegradation probability
(P[BD]) were calculated using the EPI Suite software which
is publically available and has been validated in hundreds
of modeling experiments for the estimated parameters. EPI
Suite software is continuously updated and the predictive
power of its latest version is always available at the website
(http://www.epa.gov/oppt/sf/tools/methods.htm). None of
6
Computational and Mathematical Methods in Medicine
60
100
+1.96 SD
58.3
60
40
Mean
20.4
20
0
−1.96 SD
−17.4
−20
−40
Clinical MRTD—predicted MRTD
Clinical MRTD—predicted MRTD
80
40
+1.96 SD
29.8
20
Mean
1.3
0
−20
−1.96 SD
−27.3
−40
−60
−60
−40
−20
0
20
40
Average of clinical MRTD and predicted MRTD
60
(a)
−20
0
20
40
60
Average of clinical MRTD and predicted MRTD
80
(b)
Figure 3: Bland Altman plot for method comparisons. Bland Altman plots are shown for (a) multiple linear regression predicted versus
clinical MRTD, and (b) neural network model predicted versus clinical MRTD. Horizontal lines are drawn at the mean difference, and at the
limits of agreement. All predicted values were within limits of agreement for both models, althought these limits were more narrow for the
neural network model estimates. The plots are useful revealing the relationship between the differences and the averages, the slight deviation
from symmetry in the multiple regression indicates some systematic bias but no possible outliers were identified.
the parameters were log-transformations of the software
output but instead used as direct input values for the training
set. Although some of the predictor values ranged over 3
orders of magnitude, this did not appear to adversely affect
comparisons of the MLR or ANN methods. Furthermore,
because of the highly diverse nature of antiretroviral compounds in terms of physicochemical descriptors and splitdata validation procedure with randomly selected training
and test sets instead of cross-validating scheme. It is our
intention to adapt the method as new ARV drugs become
listed in evolving FDA database.
We began our study looking at dose-related adverse effects of commercial antiretroviral drugs or new ARVs in
development. Although the appearance of serious longterm metabolic complications, such as cardiovascular disturbances [35], hyperlipidemia [35, 36], and diabetes, have been
extensively reported [35, 37], few reports on computational
models to predict these dose-limiting toxicities [38, 39]
can be found in the literature. A mathematical model to
predict the optimal dosing regimen for AIDS therapy has
been reported [40], but in this case, CD4+ cell counts and
knowledge of the adherence interval of individual patients is
required to adjust the dose. While the model was effective at
reducing dose-limiting toxicities in an AIDS patient population, it cannot be applied to nonapproved ARV formulations or to drugs in development. Since the overwhelming
majority of anti-HIV drugs demonstrate efficacy over a small
range of treatment doses, MRTD predicting models would be
beneficial to the drug delivery scientist who needs to design
experiments based on the most effective therapeutic dose
of the medication [41]. The MRTD is empirically derived
from human clinical trials, thus it is a direct measure of
the threshold for dose-related adverse effects in humans.
Prediction of the MRTD from molecular structure is of
importance for both new and existing drugs which may
require modifications to improve their aqueous solubility
or bioavailability in vivo. ARVs fit this description also
and are excellent subject molecules for predictive modeling
to estimate maximum dose. Unfortunately, the number of
antiretroviral drugs with established MRTD is yet small,
so the development of models specifically of ARVs is both
tedious and rare.
The molecular descriptors used in this study were selected to represent physicochemical (MW, AlogP, ASol) and
bioaccumulation (OxidHL, P[BD], logBioHL) property
influences therapeutic dose. Although molecular weight does
not strongly correlate with toxicity of most compounds, the
larger the molecular size of a compound, the smaller its
membrane permeability and diffusion coefficient become
[34]. Therefore, compounds with higher weights are less
likely to be absorbed, which limits their systemic toxicity.
Results of this study predict no correlation between molecular weight and maximum ARV dose. In contrast, bioactivity
and drug toxicity almost always increase with increasing
lipophilicity. This is due in part to the fact that lipophilic
molecules tend to cross cellular membranes more readily
increasing exposure and residence in the body. In addition,
lipophilic drugs are characterized by increased plasma protein binding, thus an assessment of lipophilicity is almost
always included in the physicochemical evaluation of a
drug because of its close association with pharmacologic,
permeability and potential bioaccumulation [42, 43]. Our
Computational and Mathematical Methods in Medicine
results here did not indicate AlogP or lipophilic character as
having any influence on ARV dose. The aqueous solubility
of a compound significantly affects its absorption and
distribution characteristics. Typically, low solubility goes
along with a poor absorption and, therefore, its systemic
toxicity reduced, however, local irritability may develop
and/or reduced elimination rates, both are characteristic of
drugs with low aqueous solubility. For highly potent drugs,
increasing solubility usually enhances the elimination rate
and lowers systemic half life, [44]. This means that for
ARVs with low aqueous solubility, some correlation with
maximum therapeutic dose (as we has shown) is likely to be
observed.
Any chemical (even water) can produce toxic side effects
in the body if allowed to accumulate to sufficiently large
concentrations. While much of the effort in bioaccumulation
modeling [43, 45] has been initiated by scientists estimating
the equilibrium distribution of chemicals between organisms and their environments (e.g., fish-water, plants-soil),
effective physicochemical property estimation routines (i.e.,
PERs) have resulted from these studies that may be applied to
similar biodistribution problems in pharmaceutical research.
For example, oxidation half life (OxidHL) is an estimate
of the molecules ability to form stable hydroxyl radicals
or to interact with such moiety under ambient conditions.
Formation of hydroxyl-radicals is often associated with doselimiting toxicities. Although in our studies, oxidation half life
was not statistically significant in the prediction of MRTD.
Another bioaccumulation descriptor, biotransformation half
life (logBioHL), is the (linearized) fraction of drug mass
in the whole body that has been metabolized per day.
Our estimates do not account for the formation of specific
metabolites which may be toxic nor do they identify specific
pathways in the process (i.e., phase I redox reactions or phase
II type conjugation reactions). Consequently, logBioHL was
not correlated with MRTD in the present study and was a
weaker bioaccumulation property descriptor in comparison
to oxidation half life or biodegradation probability P[BD].
The probability of biodegradation attempts to combine
both oxidative and biotransformation susceptibility of the
structure to give an estimate of overall persistence. P[BD]
estimates are based upon molecular fragments [46–48]
and in our investigation showed only moderate correlation
with MRTD in both multiple linear regression and neural
network models. Each of the models was evaluated for
predictive accuracy and by statistical comparison. In terms of
predictability, root mean squared errors were larger for the
multiple linear regression than the neural network model.
Some advantages of ANN over MLR models were illustrated
in the current study.
Artificial neural networks (ANNs) are biologically inspired data-mining algorithms which work by detecting the
patterns and relationships in data. We used the back propagation rule in which the neural network is trained to map
a set of input data by iterative adjustment of the weights. A
tangent sigmoid transfer function on the first layer and two
neurons with a nonlinear transfer function on the hidden
layer were minimalistic structures used to reduce overfitting.
Our training processes for TNN were allowed to run until no
7
change in RMSE was observed for 20 minutes, at which point
the model was saved. This learning method is commonly
used for neural network predictive models given doseresponse type data. However, ANNs have several limitations,
a major theoretical concern is the “black box” nature of the
output, that is, conclusions are generated without mechanistic explanations. ANNs also are limited by the quality
of their data and may need to be retrained periodically if
its performance changes over time. This is not necessarily
counterproductive, since it indicates robustness in the model
which adapts to changes in the predictive criteria. Real-time
monitoring of the training process is also important since
overtraining can easily occur, especially when the datasets are
small in size. This is may be one of the unique advantages of
real-time visualization of the data-mining process allowing
the investigator to make “intermediate evaluations” of model
predictability and then continue training until the reliability
and accuracy required of the predictions are met.
In conclusion, antiretroviral drugs are a chemically diverse class of compounds in terms of both physicochemical
properties and bioaccumulation potential. However, commercial ARVs may be categorized for predictive modeling
purposes into two groups based on aqueous solubility and
lipophilic character, in which hydrophilic compounds may
be administered at higher doses (MRTD) and prediction
of their MRTD value may be possible using simple multiple linear regression models. In contrast, the prediction
of MRTD values for antiretrovirals with poorer aqueous
solubility would be the most effective when the neural
network approach is used and when both physicochemical
and bioaccumulation property descriptors are available for
training. With regard to future studies, ANN represents a
promising tool for predicting maximum therapeutic dose,
especially for antiretroviral drugs with narrow therapeutic
index in the treatment of AIDS.
Abbreviations
AlogP:
ASol:
logBioHL:
CV:
R2 :
MLR:
MRTD:
MAX:
MW:
MCC:
OxidHL:
P[BD]:
TNN:
QSPR:
RSD:
RMSE:
SMILES:
SE:
SD:
Rz :
Calculated octanaol/water partition coefficient
Aqueous solubility
Biotransformation half life
Coefficient of variation
Coefficient of determination
Multiple linear regression
Maximum recommended therapeutic dose
maximum absolute error
Molecular weight
Multiple correlation coefficient
Oxidation half life
Biodegradation probability
Tiberius Neural Network
Quantitative Structure Property Relationships
Residual standard deviation
Root mean squared error
Simplified molecular input line entry system
Squared error
Standard deviation
Zero order correlation coefficient.
8
References
[1] K. A. Pkowitz, “AIDS: the first 20 years,” New England Journal
of Medicine, vol. 344, no. 23, pp. 1764–1772, 2001.
[2] P. Piot, M. Bartos, P. D. Ghys, and N. Walker, “The global
impact of HIV/AIDS,” Nature, vol. 410, no. 6831, pp. 968–973,
2001.
[3] K. H. Mayer and K. K. Venkatesh, “Antiretroviral therapy as
HIV prevention: status and prospects,” American Journal of
Public Health, vol. 100, no. 10, pp. 1867–1876, 2010.
[4] E. de Clercq, “Anti-HIV drugs: 25 compounds approved
within 25 years after the discovery of HIV,” International
Journal of Antimicrobial Agents, vol. 33, no. 4, pp. 307–320,
2009.
[5] E. de Clercq, “Antiretroviral drugs,” Current Opinion in
Pharmacology, vol. 10, no. 5, pp. 507–515, 2010.
[6] G. C. Williams and P. J. Sinko, “Oral absorption of the HIV
protease inhibitors: a current update,” Advanced Drug Delivery
Reviews, vol. 39, no. 1–3, pp. 211–238, 1999.
[7] E. Ojewole, I. Mackraj, P. Naidoo, and T. Govender, “Exploring
the use of novel drug delivery systems for antiretroviral drugs,”
European Journal of Pharmaceutics and Biopharmaceutics, vol.
70, no. 3, pp. 697–710, 2008.
[8] C. Y. Wu and L. Z. Benet, “Predicting drug disposition via
application of BCS: transport/absorption/ elimination interplay and development of a biopharmaceutics drug disposition
classification system,” Pharmaceutical Research, vol. 22, no. 1,
pp. 11–23, 2005.
[9] M. A. Cosenza, M. L. Zhao, and S. C. Lee, “HIV-1 expression
protects macrophages and microglia from apoptotic death,”
Neuropathology and Applied Neurobiology, vol. 30, no. 5, pp.
478–490, 2004.
[10] C. M. Bates, “HIV medicine: drug side effects and interactions,” Postgraduate Medical Journal, vol. 72, no. 843, pp. 30–
36, 1996.
[11] D. D. Richman, D. M. Margolis, M. Delaney, W. C. Greene, D.
Hazuda, and R. J. Pomerantz, “The challenge of finding a cure
for HIV infection,” Science, vol. 323, no. 5919, pp. 1304–1307,
2009.
[12] C. Flexner, “HIV drug development: the next 25 years,” Nature
Reviews Drug Discovery, vol. 6, no. 12, pp. 959–966, 2007.
[13] R. D. Benz, “Toxicological and clinical computational analysis
and the US FDA/CDER,” Expert Opinion on Drug Metabolism
and Toxicology, vol. 3, no. 1, pp. 109–124, 2007.
[14] P. Palumbo, H. Wu, E. Chadwick et al., “Virologic response to
potent antiretroviral therapy and modeling of HIV dynamics
in early pediatric infection,” Journal of Infectious Diseases, vol.
196, no. 1, pp. 23–29, 2007.
[15] D. Woolfson, M. L. Umrethia, V. L. Kett, and R. K. Malcolm,
“Freeze-dried, mucoadhesive system for vaginal delivery of
the HIV microbicide, dapivirine: optimisation by an ANN,”
International Journal of Pharmaceutics, vol. 388, no. 1-2, pp.
136–143, 2010.
[16] N. Subramanian, A. Yajnik, and R. S. Murthy, “Artificial neural
network as an alternative to multiple regression analysis in
optimizing formulation parameters of cytarabine liposomes,”
AAPS PharmSciTech, vol. 5, no. 1, p. E4, 2004.
[17] J. F. Contrera, E. J. Matthews, N. L. Kruhlak, and R. D. Benz,
“Assessment of the health effects of chemicals in humans: I.
QSAR estimation of the maximum recommended therapeutic
dose (MRTD) and no effect level (NOEL) of organic chemicals
based on clinical trial data,” Current Drug Discovery Technologies, vol. 1, no. 1, pp. 61–76, 2004.
Computational and Mathematical Methods in Medicine
[18] J. Röling, H. Schmid, M. Fischereder, R. Draenert, and
F. D. Goebel, “HIV-associated renal diseases and highly
active antiretroviral therapy-induced nephropathy,” Clinical
Infectious Diseases, vol. 42, no. 10, pp. 1488–1495, 2006.
[19] Y. Sun, Y. Peng, Y. Chen, and A. J. Shukla, “Application of artificial neural networks in the design of controlled release drug
delivery systems,” Advanced Drug Delivery Reviews, vol. 55, no.
9, pp. 1201–1215, 2003.
[20] E. J. Matthews, N. L. Kruhlak, R. D. Benz, and J. F. Contrera,
“Assessment of the health effects of chemicals in humans: I.
QSAR estimation of the maximum recommended therapeutic
dose (MRTD) and no effect level (NOEL) of organic chemicals
based on clinical trial data,” Current Drug Discovery Technologies, vol. 1, no. 1, pp. 61–76, 2004.
[21] J. F. Blake, “Chemoinformatics—predicting the physicochemical properties of ”drug-like” molecules,” Current Opinion in
Biotechnology, vol. 11, no. 1, pp. 104–107, 2000.
[22] D. Butina, M. D. Segall, and K. Frankcombe, “Predicting
ADME properties in silico: methods and models,” Drug Discovery Today, vol. 7, no. 11, pp. S83–S88, 2002.
[23] S. Agatonovic-Kustrin and R. Beresford, “Basic concepts of
artificial neural network (ANN) modeling and its application
in pharmaceutical research,” Journal of Pharmaceutical and
Biomedical Analysis, vol. 22, no. 5, pp. 717–727, 2000.
[24] I. Solomon, N. Maharshak, G. Chechik et al., “Applying
an artificial neural network to warfarin maintenance dose
prediction,” Israel Medical Association Journal, vol. 6, no. 12,
pp. 732–735, 2004.
[25] B. W. Corrigan, P. R. Mayo, and F. Jamali, “Application of
a neural network for gentamicin concentration prediction in
a general hospital population,” Therapeutic Drug Monitoring,
vol. 19, no. 1, pp. 25–28, 1997.
[26] C. W. Lim, S. Fujiwara, F. Yamashita, and M. Hashida, “Prediction of human skin permeability using a combination of
molecular orbital calculations and artificial neural network,”
Biological and Pharmaceutical Bulletin, vol. 25, no. 3, pp. 361–
366, 2002.
[27] S. Drăghici and R. B. Potter, “Predicting HIV drug resistance
with neural networks,” Bioinformatics, vol. 19, no. 1, pp. 98–
107, 2003.
[28] I. V. Tetko, “The WWW as a tool to obtain molecular
parameters,” Mini Reviews in Medicinal Chemistry, vol. 3, no.
8, pp. 809–820, 2003.
[29] I. V. Tetko, V. Y. Tanchuk, and A. E. P. Villa, “Prediction
of n-octanol/water partition coefficients from PHYSPROP
database using artificial neural networks and E-state indices,”
Journal of Chemical Information and Computer Sciences, vol.
41, no. 3–6, pp. 1407–1421, 2001.
[30] A. Arnot, W. Meylan, J. Tunkel et al., “A quantitative structureactivity relationship for predicting metabolic biotransformation rates for organic chemicals in fish,” Environmental
Toxicology and Chemistry, vol. 28, no. 6, pp. 1168–1177, 2009.
[31] A. Arnot and F. A. P. C. Gobas, “A review of bioconcentration
factor (BCF) and bioaccumulation factor (BAF) assessments
for organic chemicals in fish,” Environmental Reviews, vol. 14,
no. 4, pp. 257–297, 2006.
[32] C. Cobelli, E. R. Carson, L. Finkelstein, and M. S. Leaning,
“Validation of simple and complex models in physiology and
medicine,” The American Journal of Physiology, vol. 246, no. 2,
pp. R259–266, 1984.
[33] R. Mannhold, G. I. Poda, C. Ostermann, and I. V. Tetko,
“Calculation of molecular lipophilicity: state-of-the-art and
Computational and Mathematical Methods in Medicine
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
[42]
[43]
[44]
[45]
[46]
[47]
[48]
comparison of logP methods on more than 96,000 compounds,” Journal of Pharmaceutical Sciences, vol. 98, no. 3, pp.
861–893, 2009.
M. Yazdanian, S. L. Glynn, J. L. Wright, and A. Hawi, “Correlating partitioning and Caco-2 cell permeability of structurally
diverse small molecular weight compounds,” Pharmaceutical
Research, vol. 15, no. 9, pp. 1490–1494, 1998.
R. G. Jain, E. S. Furfine, L. Pedneault, A. J. White, and J. M.
Lenhard, “Metabolic complications associated with antiretroviral therapy,” Antiviral Research, vol. 51, no. 3, pp. 151–177,
2001.
S. K. Gan, K. Samaras, A. Carr, and D. Chisholm, “Antiretroviral therapy, insulin resistance and lipodystrophy,” Diabetes, Obesity and Metabolism, vol. 3, no. 2, pp. 67–71, 2001.
M. J. Glesby, “Toxicities and adverse findings associated with
protease inhibitors,” in Protease Inhibitors in AIDS Therapy, R.
C. Ogden and C. W. Flexner, Eds., pp. 237–256, Marcel Dekker,
Inc., New York, NY, USA, 2001.
P. A. Pham and C. Flexner, “Emerging antiretroviral drug
interactions,” Journal of Antimicrobial Chemotherapy, vol. 66,
no. 2, pp. 235–239, 2011.
D. Wang, B. Larder, A. Revell et al., “A comparison of
three computational modelling methods for the prediction of
virological response to combination HIV therapy,” Artificial
Intelligence in Medicine, vol. 47, no. 1, pp. 63–74, 2009.
O. Krakovska and L. M. Wahl, “Optimal drug treatment
regimens for HIV depend on adherence,” Journal of Theoretical
Biology, vol. 246, no. 3, pp. 499–509, 2007.
S. Hongzong, Y. Shuping, Z. Kejun, F. Aiping, D. Yun-Bo, and
H. Zhide, “Quantitative structure activity relationship study
on EC50 of anti-HIV drugs,” Chemometrics and Intelligent
Laboratory Systems, vol. 90, no. 1, pp. 15–24, 2008.
B. Testa, P. Crivori, M. Reist, and P. A. Carrupt, “The influence
of lipophilicity on the pharmacokinetic behavior of drugs:
concepts and examples,” Perspectives in Drug Discovery and
Design, vol. 19, pp. 179–211, 2000.
J. W. Nichols, M. Bonnell, S. D. Dimitrov, B. I. Escher, X.
Han, and N. I. Kramer, “Bioaccumulation assessment using
predictive approaches,” Integrated Environmental Assessment
and Management, vol. 5, no. 4, pp. 577–597, 2009.
C. A. Lipinski, F. Lombardo, B. W. Dominy, and P. J. Feeney,
“Experimental and computational approaches to estimate solubility and permeability in drug discovery and development
settings,” Advanced Drug Delivery Reviews, vol. 23, no. 1–3, pp.
3–25, 1997.
W. B. Neely, D. R. Branson, and G. E. Blau, “Partition
coefficient to measure bioconcentration potential of organic
chemicals in fish,” Environmental Science and Technology, vol.
8, no. 13, pp. 1113–1115, 1974.
S. Dimitrov, T. Pavlov, D. Nedelcheva et al., “A kinetic
model for predicting biodegradation,” SAR and QSAR in
Environmental Research, vol. 18, no. 5-6, pp. 443–457, 2007.
S. D. Dimitrov, N. C. Dimitrova, J. D. Walker, G. D. Veith,
and O. G. Mekenyan, “Predicting BCFs of highly hydrophobic
chemicals: effects of molecular size,” Pure and Applied Chemistry, vol. 74, no. 10, pp. 1823–1830, 2002.
P. Howard, W. Meylan, D. Aronson et al., “A new biodegradation prediction model specific to petroleum hydrocarbons,”
Environmental Toxicology and Chemistry, vol. 24, no. 8, pp.
1847–1860, 2005.
9
MEDIATORS
of
INFLAMMATION
The Scientific
World Journal
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Gastroenterology
Research and Practice
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Journal of
Hindawi Publishing Corporation
http://www.hindawi.com
Diabetes Research
Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
International Journal of
Journal of
Endocrinology
Immunology Research
Hindawi Publishing Corporation
http://www.hindawi.com
Disease Markers
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Volume 2014
Submit your manuscripts at
http://www.hindawi.com
BioMed
Research International
PPAR Research
Hindawi Publishing Corporation
http://www.hindawi.com
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Volume 2014
Journal of
Obesity
Journal of
Ophthalmology
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Evidence-Based
Complementary and
Alternative Medicine
Stem Cells
International
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Journal of
Oncology
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Parkinson’s
Disease
Computational and
Mathematical Methods
in Medicine
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
AIDS
Behavioural
Neurology
Hindawi Publishing Corporation
http://www.hindawi.com
Research and Treatment
Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Oxidative Medicine and
Cellular Longevity
Hindawi Publishing Corporation
http://www.hindawi.com
Volume 2014
Download