Support Vector Machines and Extensions for Toxicological Evaluation of Nanoparticles Erhun Kundakcioglu

advertisement
Support Vector Machines and Extensions
for Toxicological Evaluation of Nanoparticles
Erhun Kundakcioglu
Department of Industrial Engineering
University of Houston
Texas A&M University
College Station, TX
March 30, 2011
Outline
1
Introduction
2
Support Vector Machine Classifiers
3
Heat Induced Cellular Death
4
Toxicological Evaluation of Titania Nanoparticles
5
Current & Future Research
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
1 / 45
Introduction
Motivation
Cancer causes about 13% of all human deaths
7.6 million deaths worldwide due to cancer in 2007
Current Treatments
Radiation
Toxic for other tissues
Chemotherapy
High toxicity
Surgery
Complication and high cost
Gene/bone marrow transplantation
High cost
The cost of the therapy is estimated to $18,000 - $37,000 per patient in a
total of $72B annually.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
2 / 45
Introduction
Motivation
Toxicity
A shop in Beijing, China: Polyacrylic coating application on
polystyrene boards
Coatings used by the workers contain nanoparticles
Seven women workers (aged 18-47) were identified with shortness of
breath
Clinical examinations reveal hypoxemia (low oxygen saturation in
blood), pleural effusion (fluid in thoracic cavity), and pericardial
effusion (fluid around heart)
Severe skin rash from intense itching of their faces, hands, and forearms
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
3 / 45
Introduction
Motivation
Toxicity
A shop in Beijing, China: Polyacrylic coating application on
polystyrene boards
Coatings used by the workers contain nanoparticles
Seven women workers (aged 18-47) were identified with shortness of
breath
Clinical examinations reveal hypoxemia (low oxygen saturation in
blood), pleural effusion (fluid in thoracic cavity), and pericardial
effusion (fluid around heart)
Severe skin rash from intense itching of their faces, hands, and forearms
Two other women from the same shop died (ages 19 and 29)
Pathology samples from the workers lungs identified 30 nm
(nanometer) in diameter particles
Song Y, Li X, Du X. Exposure to nanoparticles is related to pleural effusion, pulmonary fibrosis and granuloma.
European Respiratory Journal, Aug 2009.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
3 / 45
Introduction
Motivation
Criticizms
“I think the paper should never have been published without characterizing the
exposure and the toxicological reactivity of the nanoparticles before blaming the
effects on them.”
“The real tragedy here is that these workers could have been protected if a
conventional chemical hygiene plan had been implemented that included a
working ventilation system and personal protective equipment. Preventing
inhalation of 30-nm nanoparticles can be as simple as the proper use of an
inexpensive mask sold by your neighborhood home improvement store.”
These researchers are funded by DuPont, Intel, L’oreal, Lockheed Martin,
Proctor&Gamble, Unidym.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
4 / 45
Introduction
Cell Death Discrimination
Investigate in-situ the mechanism that the particles are inducing by
Titania nanoparticles and heat
Lung epithelia cancer cells (A549 from ATCC) are grown for 24 hours
785nm high power near infrared diode laser for spectroscopy analysis
(this light does not damage cells even after 60min at 115mW power)
Cells have been grown on MgF2 crystal for minimizing background
radiation
After reaching 80% confluency, cells are treated with the testing
material
Drug or heat exposure
Recording for the next 24 hours
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
5 / 45
Introduction
Cell Death Discrimination
Helps understanding pathological processes
induced by a disease
induced by pharmaceutical treatments such as anti-cancer drugs
Raman spectroscopy is employed to assess the potential toxicity of
chemical substances
noninvasive and does not require chemicals or markers
measurements can be taken rapidly and in real time
it is possible to analyze the health of either a single cell or the entire
population
has no interference with water and CO2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
6 / 45
Introduction
Raman Spectroscopy
Observes vibrational, rotational, and other low-frequency modes
Relies on Raman scattering of monochromatic light from a laser
Laser light interacts with molecular vibrations, phonons or other excitations
The energy levels of the laser photons are shifted up or down
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
7 / 45
Introduction
Raman Spectroscopy (cont’d)
Drawback: complexity of the obtained spectra
Traditionally, peak fitting has been used to analyze Raman spectra
time consuming due to large number of overlapping peaks
series of mathematical procedures to remove the baseline, the
fluorescence, to normalize the spectra, and calculate the average and
the standard deviation
prior knowledge needed on which peaks are discriminant with limited
interference from background noise
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
8 / 45
Support Vector Machine Classifiers
Outline
1
Introduction
2
Support Vector Machine Classifiers
3
Heat Induced Cellular Death
4
Toxicological Evaluation of Titania Nanoparticles
5
Current & Future Research
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
9 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
Age
Gender
Blood type
Body Temperature
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Flu
March 30, 2011
10 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
1
Age
25
Gender
F
Blood type
A(-)
Body Temperature
99
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Flu
-
March 30, 2011
10 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
1
2
Age
25
26
Gender
F
M
Blood type
A(-)
AB(+)
Body Temperature
99
104
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Flu
+
March 30, 2011
10 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
1
2
3
Age
25
26
30
Gender
F
M
M
Blood type
A(-)
AB(+)
B(+)
Body Temperature
99
104
103
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Flu
+
+
March 30, 2011
10 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
1
2
3
4
Age
25
26
30
32
Gender
F
M
M
M
Blood type
A(-)
AB(+)
B(+)
0(+)
Body Temperature
99
104
103
98
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Flu
+
+
-
March 30, 2011
10 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
1
2
3
4
5
Age
25
26
30
32
23
Gender
F
M
M
M
F
Blood type
A(-)
AB(+)
B(+)
0(+)
B(-)
Body Temperature
99
104
103
98
105
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Flu
+
+
?
March 30, 2011
10 / 45
Support Vector Machine Classifiers
An Example: Flu Patients
Patient
1
2
3
4
5
Age
25
26
30
32
23
Gender
F
M
M
M
F
Blood type
A(-)
AB(+)
B(+)
0(+)
B(-)
Body Temperature
99
104
103
98
105
Flu
+
+
?
Patients 1-4: Training Set
Patient 5: Test Set
Goal: Minimize the generalization error
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
10 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
?
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
?
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
?
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
margin
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
n
rgi
ma
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
?
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
11 / 45
Support Vector Machine Classifiers
Support Vector Machines (cont’d)
Developed by Vladimir Vapnik in 1995
State-of-the-art supervised learning methods
Classify two linearly/nonlinearly separable sets of pattern vectors
Kernel trick
Quadratic convex optimization problem
Efficiently solved
Can also be formulated as a SOCP problem
Minimizes the generalization error
Widely used; e.g., pattern recognition, text categorization,
biomedicine, brain-computer interface, financial applications.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
12 / 45
Support Vector Machine Classifiers
Hard Margin Classifiers
Given the data xi ∈ Rd and labels yi ∈ {−1, +1} for i = 1, . . . , n
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
13 / 45
Support Vector Machine Classifiers
Hard Margin Classifiers
Given the data xi ∈ Rd and labels yi ∈ {−1, +1} for i = 1, . . . , n
Separating hyperplane is defined as hw, xi + b = 0
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
13 / 45
Support Vector Machine Classifiers
Hard Margin Classifiers
Given the data xi ∈ Rd and labels yi ∈ {−1, +1} for i = 1, . . . , n
Separating hyperplane is defined as hw, xi + b = 0
The geometric margin between the hyperplane and the nearest data
point x∗ is
|hw, x∗ i + b|
kwk
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
13 / 45
Support Vector Machine Classifiers
Hard Margin Classifiers
Given the data xi ∈ Rd and labels yi ∈ {−1, +1} for i = 1, . . . , n
Separating hyperplane is defined as hw, xi + b = 0
The geometric margin between the hyperplane and the nearest data
point x∗ is
|hw, x∗ i + b| → functional margin
kwk
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
13 / 45
Support Vector Machine Classifiers
Hard Margin Classifiers
Given the data xi ∈ Rd and labels yi ∈ {−1, +1} for i = 1, . . . , n
Separating hyperplane is defined as hw, xi + b = 0
The geometric margin between the hyperplane and the nearest data
point x∗ is
|hw, x∗ i + b| → functional margin
kwk
⇓
max
subject to
geometric margin
functional margin = 1
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
13 / 45
Support Vector Machine Classifiers
Hard Margin Classifiers
Given the data xi ∈ Rd and labels yi ∈ {−1, +1} for i = 1, . . . , n
Separating hyperplane is defined as hw, xi + b = 0
The geometric margin between the hyperplane and the nearest data
point x∗ is
|hw, x∗ i + b| → functional margin
kwk
⇓
max
subject to
min
w,b
subject to
geometric margin
functional margin = 1
⇓
1
kwk2
2
yi (hw, xi i + b) ≥ 1
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
i = 1, . . . , n
March 30, 2011
13 / 45
Support Vector Machine Classifiers
Soft Margin Classifiers
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
0000
1111
000
111
0000
000 1111
111
0000
1111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
14 / 45
Support Vector Machine Classifiers
Soft Margin Classifiers
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
0000
1111
000
111
0000
000 1111
111
0000
1111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
er
1111
0000
0000
1111
0000
1111
r(
ξ)
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
ro
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
14 / 45
Support Vector Machine Classifiers
Soft Margin Classifiers
111
000
000
111
000
111
000
111
000
111
111
000
000
111
000
111
000
111
000
111
0000
1111
000
111
0000
000 1111
111
0000
1111
000
111
000
111
000
111
000
111
000
111
000
111
Feature 1
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
error
(ξ )
111
000
000
111
0000
1111
000
111
0000
1111
0000
1111
Control Cell
Death Cell
1111
0000
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
14 / 45
Support Vector Machine Classifiers
Soft Margin Classifiers
min
w,b,ξ
subject to
n
1
CX 2
2
kwk +
ξi
2
2
i=1
yi (hw, xi i + b) ≥ 1 − ξi
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
i = 1, . . . , n
March 30, 2011
15 / 45
Heat Induced Cellular Death
Outline
1
Introduction
2
Support Vector Machine Classifiers
3
Heat Induced Cellular Death
4
Toxicological Evaluation of Titania Nanoparticles
5
Current & Future Research
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
16 / 45
Heat Induced Cellular Death
Experiment / Data Set
Each row represents a lung epithelia cell
Each column represents the Raman outputs at different energy levels
There are ∼ 1300 columns
Some cells are subject to either Etoposide or Triton X-100
Etoposide is a strong chemotherapeutic drug, used for Ewing’s
sarcoma, lung cancer, testicular cancer, lymphoma, non-lymphocytic
leukemia, and glioblastoma multiforme
Triton X-100 raptures the cellular membrane and results in the necrotic
death of the cells
The effect of abnormal heat (45 ℃ for 2 hours)
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
17 / 45
Heat Induced Cellular Death
Effect of Abnormal Heat - Heat Induced Cellular Death
Training Set:
Etoposide vs. control cells
Triton X-100 vs. control cells
Triton X-100 vs. Etoposide
Test Set: Cells that are subject to 45 ℃ for 2 hours
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
18 / 45
Heat Induced Cellular Death
Distance from the Hyperplane
Computational Results for Death Cell Discrimination
Sample ID’s
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
19 / 45
Heat Induced Cellular Death
Distance from the Hyperplane
Computational Results for Death Cell Discrimination
Sample ID’s
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
20 / 45
Heat Induced Cellular Death
Distance from the Hyperplane
Computational Results for Death Cell Discrimination
Sample ID’s
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
21 / 45
Heat Induced Cellular Death
Remarks
Heat treatment can be used as a form of programmed cell death
Foundation for developing diagnostic tools for cancer or other genetic
diseases
Powerful monitoring of the cellular response and disease treatment
with chemotherapy and radiation
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
22 / 45
Toxicological Evaluation of Titania Nanoparticles
Outline
1
Introduction
2
Support Vector Machine Classifiers
3
Heat Induced Cellular Death
4
Toxicological Evaluation of Titania Nanoparticles
5
Current & Future Research
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
23 / 45
Toxicological Evaluation of Titania Nanoparticles
Experiment / Data Set
Each row represents a lung epithelia cell
Each column represents the Raman outputs at different energy levels
There are ∼ 1300 columns
Some cells are subject to either Etoposide or Triton X-100
Etoposide is a strong chemotherapeutic drug, used for Ewing’s
sarcoma, lung cancer, testicular cancer, lymphoma, non-lymphocytic
leukemia, and glioblastoma multiforme
Triton X-100 raptures the cellular membrane and results in the necrotic
death of the cells
The effect of TiO2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
24 / 45
Toxicological Evaluation of Titania Nanoparticles
Titania Nanoparticles
Claim: Titania Nanoparticles / Anatase (i.e., TiO2 ) is potentially
toxic
Training Set:
Etoposide vs. control cells
Triton X-100 vs. control cells
Test set: Cells that are subject to TiO2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
25 / 45
Toxicological Evaluation of Titania Nanoparticles
Results: Cross Validation
4
Apoptotic
Healthy
Functional Distance / Arbit. Units
3.5
3
2.5
2
1.5
1
0.5
0
ï0.5
ï1
ï1.5
0
2
4
6
8
10
12
14
16
18
20
Sample ID
Figure: Cross-validation results with leave-one-out method: apoptotic vs. healthy.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
26 / 45
Toxicological Evaluation of Titania Nanoparticles
Results: Cross Validation
5
Necrotic
Healthy
Functional Distance / Arbit. Units
4
3
2
1
0
ï1
ï2
ï3
ï4
0
2
4
6
8
10
12
14
16
18
20
Sample ID
Figure: Cross-validation results with leave-one-out method: necrotic vs. healthy.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
27 / 45
Toxicological Evaluation of Titania Nanoparticles
Results: Apoptotic-Healthy
Figure: Progression from healthy to apoptotic along the course of the 36 hrs.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
28 / 45
Toxicological Evaluation of Titania Nanoparticles
Results: Necrotic-Healthy
Figure: Inconclusive since the distances from the separation plane are scattered along
both regions.
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
29 / 45
Toxicological Evaluation of Titania Nanoparticles
Remarks
Classification between apoptotic and healthy cells is in agreement
with previous literature
There is a progression from healthy to apoptotic for Titania exposed
cells in 36 hrs
Classification between necrotic and healthy cells did not show any
particular trend
The MTT assay shows the cytotoxicity of Titania nanoparticles,
which is in agreement with other reports claiming that titania
nanoparticles are potentially toxic
MTT assay: Laboratory test for measuring the activity of enzymes that
reduce MTT to formazan giving a purple color
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
30 / 45
Current & Future Research
Outline
1
Introduction
2
Support Vector Machine Classifiers
3
Heat Induced Cellular Death
4
Toxicological Evaluation of Titania Nanoparticles
5
Current & Future Research
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
31 / 45
Current & Future Research
Research Directions
Observation: SVM classifier is extremely sensitive to outliers
Robust techniques: SVMs with hard margin loss
Minimize the number of misclassified points
n
min
w,b,z
subject to
X
1
kwk2 + C
zi
2
i=1
yi (hw, xi i + b) ≥ 1
if zi = 0, i = 1, . . . , n
zi ∈ {0, 1}
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
i = 1, . . . , n
March 30, 2011
32 / 45
Current & Future Research
SVMs with Hard Margin Loss
The formulation can accommodate nonlinear kernel functions
Convex quadratic IP formulation
Proven to be N P-hard
More robust (i.e., less sensitive to outliers compared to Hinge loss
function)
Universally consistent
A classifier is consistent if the probability of misclassification converges
in expectation to a Bayes optimal rule as sample size is increased
A classifier is universally consistent if it is consistent for all distributions
of data
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
33 / 45
Current & Future Research
SVMs with Hinge Loss
min
w,b,ξ
subject to
n
C X 2
1
2
kwk +
ξi
2
2
i=1
yi (hw, xi i + b) ≥ 1 − ξi
i = 1, . . . , n
Feature 1
Control Cell
Death Cell
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
34 / 45
Current & Future Research
SVMs with Hinge Loss
min
w,b,ξ
subject to
n
C X 2
1
2
kwk +
ξi
2
2
i=1
yi (hw, xi i + b) ≥ 1 − ξi
i = 1, . . . , n
Feature 1
Control Cell
Death Cell
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
34 / 45
Current & Future Research
SVMs with Hinge Loss
min
w,b,ξ
subject to
n
C X 2
1
2
kwk +
ξi
2
2
i=1
yi (hw, xi i + b) ≥ 1 − ξi
i = 1, . . . , n
Feature 1
Control Cell
Death Cell
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
34 / 45
Current & Future Research
SVMs with Hinge Loss
min
w,b,ξ
subject to
n
C X 2
1
2
kwk +
ξi
2
2
i=1
yi (hw, xi i + b) ≥ 1 − ξi
i = 1, . . . , n
Feature 1
Control Cell
Death Cell
Feature 2
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
34 / 45
Current & Future Research
SVMs with Hard Margin Loss
n
min
w,b,z
subject to
X
1
kwk2 + C
zi
2
i=1
yi (hw, xi i + b) ≥ 1
if zi = 0, i = 1, . . . , n
zi ∈ {0, 1}
i = 1, . . . , n
Feature 1
Control Cell
Death Cell
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Feature 2
March 30, 2011
35 / 45
Current & Future Research
SVMs with Hard Margin Loss
n
min
w,b,z
subject to
X
1
kwk2 + C
zi
2
i=1
yi (hw, xi i + b) ≥ 1
if zi = 0, i = 1, . . . , n
zi ∈ {0, 1}
i = 1, . . . , n
Feature 1
Control Cell
Death Cell
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
Feature 2
March 30, 2011
35 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Drug
Conformation
c1
c2
c3
c4
c5
c6
c7
c8
c9
c10
c11
c12
c13
Naproxen
50
20
30
20
30
60
5
10
40
60
10
10
50
Sodium
50
30
20
40
30
45
5
40
10
50
10
50
10
Label
Effective
Effective
Effective
Ineffective
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
36 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Drug
Conformation
c1
c2
c3
c4
c5
c6
c7
c8
c9
c10
c11
c12
c13
Naproxen
50
20
30
20
30
60
5
10
40
60
10
10
50
Sodium
50
30
20
40
30
45
5
40
10
50
10
50
10
Label
Effective
Effective
Effective
Ineffective
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
36 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
}
Effective (+)
Ineffective (−)
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
37 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Naproxen
}
Effective (+)
Ineffective (−)
Sodium
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
37 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Naproxen
Actual Positives
}
Effective (+)
Ineffective (−)
Sodium
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
37 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Naproxen
}
Effective (+)
Ineffective (−)
Select at least one
from each positive bag
such that the margin
is maximized.
Sodium
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
37 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Naproxen
Keep all negative instances
}
Effective (+)
Ineffective (−)
Select at least one
from each positive bag
such that the margin
is maximized.
Sodium
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
37 / 45
Current & Future Research
Multiple Instance Learning - Drug Activity Prediction
Naproxen
}
Effective (+)
Ineffective (−)
Sodium
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
37 / 45
Current & Future Research
Other Applications of Multiple Instance Learning
Drug/molecular activity prediction
Medical image annotation
Analysis of noisy time series data (e.g., local field potential, EEG)
Nanotoxicity
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
38 / 45
Current & Future Research
Hard Margin Multiple Instance Classification:
Formulation 1
ηi =
1
0
if instance i is selected
otherwise
min
X
1
kwk2 + C
vi
2
s.t.
− hw, xi i − b ≥ 1 − Mvi
∀i ∈ I −
hw, xi i + b ≥ 1 − Mvi − M(1 − ηi )
X
ηi = 1
∀i ∈ I +
i
∀j ∈ J +
i∈Ij
vi ∈ {0, 1}
∀i
ηi ∈ {0, 1}
∀i ∈ I +
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
39 / 45
Current & Future Research
Hard Margin Multiple Instance Classification:
Formulation 2
v̂j : Minimum misclassification for each positive bag.
min
X
X
1
kwk2 + C
vi + C
v̂j
2
+
−
i∈I
s.t.
j∈J
− (hw, xi i + b) ≥ 1 − Mvi ;
∀i ∈ I −
hw, xi i + b ≥ 1 − Mvi ;
X
v̂j =
ηi vi ;
∀i ∈ I +
∀j ∈ J +
i∈Ij
X
ηi = 1;
∀j ∈ J +
0 ≤ ηi ≤ 1
∀i ∈ I +
v̂j ∈ {0, 1}
∀j ∈ J +
vi ∈ {0, 1}
∀i
i∈Ij
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
40 / 45
Current & Future Research
Hard Margin Multiple Instance Classification:
Linearized Formulation 2
min
X
X
1
kwk2 + C
vi + C
v̂j
2
+
−
i∈I
s.t.
j∈J
− (hw, xi i + b) ≥ 1 − Mvi ;
∀i ∈ I −
hw, xi i + b ≥ 1 − Mvi ;
X
v̂j =
ẑi ;
∀i ∈ I +
∀j ∈ J +
i∈Ij
ẑi ≥ −1 + ηi + vi ;
∀i ∈ I +
ẑi ≤ ηi ;
∀i ∈ I +
ẑi ≤ vi ;
X
ηi = 1;
∀i ∈ I +
∀j ∈ J +
i∈Ij
0 ≤ ẑi ≤ 1
∀i ∈ I +
v̂j ∈ {0, 1}
∀j ∈ J +
Erhun Kundakcioglu (University of η
Houston)
1} for Toxic Evaluation of Nanoparticles
i ∈ {0,SVMs
∀i ∈March
I + 30, 2011
41 / 45
Current & Future Research
Hard Margin Multiple Instance Classification:
Formulation 3
min
X
X
1
kwk2 + C
vi + C
v̂j
2
+
−
i∈I
s.t.
j∈J
− (hw, xi i + b) ≥ 1 − Mvi ;
∀i ∈ I −
hw, xi i + b ≥ 1 − Mvi ;
X
v̂j ≥
vi − |Ij | + 1;
∀i ∈ I +
∀j ∈ J +
i∈Ij
vi ∈ {0, 1}
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
∀i
March 30, 2011
42 / 45
Current & Future Research
Results
Molecular activity prediction
Musk Data: A. Asuncion, D. Newman, UCI machine learning repository. http://mlearn.ics.uci.edu
# of feat.
10
20
40
80
166
10
20
40
80
166
10
20
40
80
166
# of ins.
40
40
40
40
40
80
80
80
80
80
160
160
160
160
160
F1
0.052
0.064
0.128
0.371
0.777
0.728
0.414
1.164
3.606
14.846
N/A
62.455
42.65
59.655
152.17
F2
0.092
0.091
0.157
0.328
0.681
0.744
0.722
1.618
4.648
9.142
127.44
5.51
18.76
66.14
114.505
F3
0.056
0.068
0.135
0.302
0.661
0.644
0.506
1.476
3.028
17.548
91.31
45.555
53.975
45.035
242.31
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
CP1
0.368
0.218
0.28
0.625
1.075
294.418
2.676
4.916
8.23
13.384
N/A
128.82
71.695
1767.635
583.665
CP3
0.074
0.057
0.107
0.283
0.786
1.412
0.788
0.454
1.586
4.444
N/A
10.555
9.825
8.025
20.395
March 30, 2011
43 / 45
Current & Future Research
Results
Molecular activity prediction
Musk Data: A. Asuncion, D. Newman, UCI machine learning repository. http://mlearn.ics.uci.edu
# of feat.
10
20
40
80
166
10
20
40
80
166
10
20
40
80
166
# of ins.
200
200
200
200
200
300
300
300
300
300
400
400
400
400
400
F1
N/A
65.14
30.035
191.87
677.84
N/A
95.54
808.54
N/A
N/A
N/A
N/A
N/A
N/A
N/A
F2
371.2
13.625
805.17
105.38
584.21
N/A
124.45
357.99
N/A
N/A
N/A
N/A
435.8633
N/A
N/A
F3
70.35
6.265
58.815
N/A
572.89
N/A
87.48
N/A
N/A
N/A
N/A
N/A
304.36
N/A
N/A
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
CP1
N/A
N/A
87.33
N/A
770.43
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
CP3
N/A
251.465
12.72
21.805
35.055
N/A
N/A
331.77
375.79
152
N/A
N/A
644.69
N/A
469.43
March 30, 2011
44 / 45
Acknowledgments
Georgios Pyrgiotakis
Particle Engineering Research Center, University of Florida
Mohammad H. Poursaeidi
Department of Industrial Engineering, University of Houston
Q&A
Erhun Kundakcioglu (University of Houston) SVMs for Toxic Evaluation of Nanoparticles
March 30, 2011
45 / 45
Download