Craniopharyngioma Neoplasm Incarnation Analysis of Functional MR Medical Image Classification

advertisement
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
Craniopharyngioma Neoplasm
Incarnation Analysis of Functional MR
Medical Image Classification
J.Vignesh#1, Dr. K.P. Yadav*2
#
Research Scholar, Department of Computer Science and Engineering,
SunRise University, Alwar, India.
*
Professor & Director, Mangalmay Institute of Engineering & Technology ,
Greater Noida,U.P, India.
Abstract— There has been increasing interest in pattern
classification methods and neuroimaging studies using
permutation tests to estimate the statistical significance of a
classifier (p-value). Permutation tests usually use the test error
as a dataset statistic to estimate the p-value(s)by measuring the
dissimilarity between two or more number of populations.
Using the test error as a dataset statistics; however, may
camouflage the lowest recognizable classes,and the resulting pvalue will be biased toward better values because of the highly
recognizable classes;thus,lower p-values could sometimes be
the result of under coverage.In this study,we investigate this
problem and propose the implementation of permutation tests
based on a per-class test error as a data set statistic. We also
propose a model that is based on partially scrambling the
testing samples when computing the non-permuted statistic in
order to judge the p-value’s tolerance and to draw conclusions
regarding, which permutation test procedures are more
reliable. For the same purpose, we propose an other model
that is based on chance-level shifting of the permuted statistic.
We tested the set we proposed models on functional magnetic
resonance imaging data that we recollected while human
subjects responded to visual stimulation paradigms, and our
results showed that these models can aid in determining, which
permutation test procedure is superior. We also found that
permutation tests that use a per class test error as a data set
statistic are more reliable in addressing the null hypothesis
that all classes in the problem domain are drawn from the
same distribution.
Keywords—: ROI, Permutation tests, MRI, fMRI,Multi voxel
pattern analysis and brain tumor
I.
I. INTRODUCTION
The Craniopharyngiomas develop in the area of the brain
called the hypothalamus, which is close to the pituitary
gland. It is usually found in children or young adults and
accounts for around 1% of all brain tumours. The mixed
solid and cystic nature of the tumor is clear on MR images.
By MRI examination, the tumor is of variable T1 signal,
often hyperintense. The T1 hyper-intensity is usually
secondary to high protein content in the cyst fluid. On T2weighted sequences, including FLAIR, the solid portion is
ISSN: 2231-5381
again usually het-erogeneous, whereas the cysts are
invariably hyperintense. Following contrast agent, there is
almost invariable enhancement of the solid portion and the
peripheral rim of the cystic portion on MR image. The
enhancement of the solid portion may be either uniform or
heterogeneous.
Fig 1.1 Craniopharyngioma Neoplasm in the MRI
Machine
variable behavior of protons within different tissues leads to
differences in tissue appearance. The amount of signal
produced by specific tissue types is determined by their
number of mobile hydrogen protons, the speed at which
they are moving, and the tissue’s T1 and T2 relaxation
times [Armstrong et al., 2004], [Lee et al., 2004] (Table 1.1
summarizes the terms used to describe MRI techniques). As
T1 and T2 relaxation times are time dependent, the timing
of the RF pulse and the reading of the radiated RF energy
change the appearance of the image. The repetition time
(TR) describes the time between successive applications of
RF pulse sequences. The echo time (TE) describes the delay
http://www.ijettjournal.org
Page 77
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
before the RF energy radiated by the tissue in question is
measured.
The pulse sequence, which is described by the TR and TE
and indicates the technique used to administer the RF
energy, can be chosen to maximize the effect of differences
in T1 or T2. This gives rise to the description of an MRI
image as T1 or T2 weighted [Stark and Bradley, 1999].The
standard MRI pulse sequence for anatomic and pathologic
detail is a spin echo sequence. T1-weighted images (short
TR, short TE) provide better anatomic detail, while T2
weighted images (long TR, long TE), which are more
sensitive to water content, are more sensitive to pathology.
The intermediate or proton density images (long TR, short
TE) provide improved contrast between lesions and
cerebrospinal fluid.Because of its simplicity, flexibility, and
ease of implementa- tion, pattern classification is
increasingly appearing in many scientific fields.
Classification methods, for example, have been used in
modeling diseases from gene expression data [1,2], in
inferring ethnicity from mitochondrial DNA sequences [3],
in robotic navigation problems [4], and in the decoding of
mental states [5–7]. An important issue in pattern
classification, however, is determining whether the
classifier (and, hence, the classifica- tion) is significant.
This task is usually performed by estimating how much the
test error deviates from chance. There has been recent
interest in this issue, especially in neuroimaging applications and genome analysis studies, where researchers
usually work on constrained datasets that suffer from low
sample sizes and high dimensionality. Because class
distributional properties are unknown and distributional
properties of test statistics are complex, permutation tests
can be used to test for classification significance. These
methods are based on the null hypothesis that classes have
identical distributions; in these cases, an attempt is made to
reject the hypothesis and prove otherwise.
To construct the empirical cumulative distribution, we
com pute, using a dataset statistic, the value t 0 ¼ T ðx1 ,y1
,x2 ,y2 ,. . .,xL ,yL Þ for the actual labels; then, we find the
corresponding p-value (p0) under the empirical distribution
P. If p0 rp, we reject the null hypothesis.
We shall denote the methodology that makes use of the
OATE dataset statistic shown in (1)–(4) as the T1procedure and the p-value calculated with it as p0. The
value of p0 states how likely the observed error would be
obtained by chance
Assume that we choose T as an
appropriate dataset statistic. Let a be the acceptable
significance level, and let PL be the set of all of the
ISSN: 2231-5381
permutations of the indices 1,y, L that are derived from S.
The permutation test pseudo code may be given as follows:
For m=1:M do:
Let permuted ¼{t1,t2,y,tM}, where M is the number of
permutation iterations.
II.
LITERATURE SURVEY
Genetic algorithms offer a powerful search method
for a variety of learning tasks, and there are different
approaches in which they have been applied to learning
processes. Structural learning algorithm on vague
environment (SLAVE) is a genetic learning algorithm that
uses the iterative approach to learn fuzzy rules. SLAVE can
select the relevant features of the domain, but when
working with large databases the search space is too large
and the running time can sometimes be excessive. We
propose to improve SLAVE by including a feature selection
model in which the genetic algorithm works with
individuals (representing individual rules) composed of two
structures. In order to achieve an adequate segmentation,
appropriate initializations for each considered tissue class
are an important requirement. Commonly applied methods
include manual placing of example tissue points6 or
registration of an atlas dataset onto a patient dataset.11
However, different anatomies as well as different shapes,
sizes, and locations between atlas and patient dataset
complicate this task, and a precise registration can take
a significant amount of time. Furthermore, the employed
brain atlas may lead to incorrect initializations, e.g. in case
of large deformations due to tumor growth. Especially,
information about pathological tissue such as tumor is
generally missing in current brain atlases. A common
approach for the modelling of tumor tissue is to provide an
intensity model, assuming a mixture of density functions.
Popovic et al. proposed an a priori tumor intensity model
(or believe map) for the segmentation of calvarial tumors
from CT images.10 In their work, a mixture of Gaussians
generated from a number of training datasets is incorporated
into a level set framework.
III. METHODOLOGY
.
3.1 EXISTING METHOD
Fisher [8] was the first to introduce permutation tests, which
he proposed in the 1930s and called exact tests. Recent
permuta- tion test procedures in classification [9–11]
calculate the classi- fication significance by finding the
cumulative distribution after estimating how much the test
error of the classifier (the non- permuted test error) deviates
from the permuted test error values of the same classifier.
http://www.ijettjournal.org
Page 78
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
yields approximately $2.03eþ59 permutations for the 72
timepoints in each run.
3.2 DISADVANTAGES
Unfortunately, this cumulative distribution does not take
into account the fine details that are related to the error in
predicting each class. It is possible in a two-class problem,
for example, that the classification error is very high for
samples that belong to one class while the error of
classifying samples that belong to the other class is low, yet
permutation tests show significance. This issue would have
additional side effects when determining the significance of
classifying more than two classes. We should mention that
[12] made use of permutation tests to perform statistical
analysis of functional neuroimaging data (i.e., without using
classifiers). We should mention that [12] made use of
permutation tests to perform statistical analysis of
functional neuroimaging data (i.e., without using
classifiers). Many other studies were based on [11], using
permutation tests to estimate the significance of a
classification.
3.3 PROPOSED METHOD
The study presented in [13] emphasizes that studying the
classifier performance via permutation tests is effective;
however, the authors provided a technique that is based on
scrambling the features within the classes to study whether
the classifier exploits the dependency between the features
in the classification. In [10], the authors made use of the
wavelet transform [14] to speed up the permutation tests.
Aside from the research that is presented in [13], all of the
other studies used permutation tests that were similar to the
method discussed in [11].
3.4 ALGORITHM
Permutation scheme
The following are a few points that describe the practical
issues that are related to the permutation scheme that we
used in this study:
1. Instead of permuting the samples (timepoints), we
permuted only the labels, which is more feasible and was
used in all of the previous studies.
Thus, permutation tests were used as a tool, and the
resulting p-values, especially when they showed that the
poorly performing classifier was significant, were
sometimes taken as vital findings. To offer an example, it
was shown in [2] that using a regularized least-squares
classifier may significantly classify healthy controls from
Crohn’s disease subjects from profiles of single nucleotide
polymorphisms (the classification accuracy was $ 0.612,
p¼0.011). It is not possible to judge this p-value because of
differences in sensitivity and specificity, and the situation
would be more complicated in a multiclass classification
problem. Thus, we think that permutation tests may better
reflect the classification significance by taking into account
the error for each class instead of the overall test error. In
this study, we investigate how the per-class classification
error would affect the estimation of the classification
significance and how it may be more sensitive to p-value
changes than using the test error. Therefore, we propose a
dataset statistic that is based on the per-class test error; in
addition, we propose a method for adapting the permutation
test to handle this statistic.
3.5 ADVANTAGES
Comparing two permutation test procedures is not an easy
task because of the randomness of the procedures. To
compare the proposed permutation test procedure, we will
need a model to judge, which of the permutation tests
procedures outperform the procedure most commonly used
in the literature. Thus, for the purpose of comparison, we
propose two models. The first model is based on shifting the
chance level by some value and observing how this shift
affects the characteristics of the p-value, and the second
model is based on partial scrambling the testing samples of
the non-permuted statistic. We run several classification
experiments on functional magnetic resonance imaging
(fMRI) data that are collected while subjects respond to
visual stimulation paradigms
IV.
2. Uniformly distributed pseudorandom numbers were
gener ated for all of the permutations samples that we
used. Then,
these numbers were used to scramble the
labels.
3. For each subject in the dataset that we used, there were
12
runs; each run had eight blocks, with one block for
each
stimulus type (class), and each block had nine
timepoints.
4. We scrambled the labels for each run (e.g., within run
permutations). In this case, we needed to use a multiset
permutation, which, when using a multinomial coefficient,
ISSN: 2231-5381
PERMUTATION ANALYSIS
List of Modules
1.
Permutation Test For Classification
2.
Permutation test pseudocode
3.
The multiclass effect on permutation tests
4.1 Permutation Test For Classification
http://www.ijettjournal.org
Page 79
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
To construct the empirical cumulative distribution, we
com pute, using a dataset statistic, the value t 0 ¼ T ðx1 ,y1
,x2 ,y2 ,. . .,
xL ,yL Þ for the actual labels; then, we find the
corresponding p-value (p0) under the empirical distribution
P. If p0 rp, we reject the null hypothesis.
We shall denote the methodology that makes use of the
OATE dataset statistic shown in (1)–(4) as the T1procedure and the p-value calculated with it as p0. The
value of p0 states how likely the observed error would be
obtained by chance
4.2 Permutation test pseudocode
Assume that we choose T as an appropriate dataset
statistic. Let a be the acceptable significance level, and let
PL be the set of all of the permutations of the indices 1,y, L
that are derived from S. The permutation test pseudo code
may be given as follows:
For m=1:M do:
Let permuted ¼{t1,t2,y,tM}, where M is the number of
permutation iterations.
Construct an empirical cumulative distribution.
4.3 The multiclass effect on permutation tests
When the above OATE dataset statistic is used to estimate
the classification significance via permutation tests, it
suffers from the undercoverage problem because it neglects
the per-class test error, and thus, the p-value sensitivity to
the per-class error is not satisfactory. For example, a twoclass classifier that has an error of 40% will occasionally
yield a rather good p-value because chance level is 50%;
however, the 40% might result from the average error
between class-I error (say 10%) and class-II error (say
70%). Thus, despite the fact that the class-II’s classification
error was 70%, the permutation tests may still show that the
classification is highly significant. The same problem might
also occur when classification problems with more than two
classes are under investigation. A better alternative to
ISSN: 2231-5381
OATE that measures the overall test error would be based
on the Per-Class Test Error dataset statistic (PCTE).
For this case, we propose the PCTE dataset statistic, as
follows:
V. EXPERIMENT RESULT
The name “MATLAB” is derived from “matrix laboratory.”
MATLAB is an application that allows you to create
mathematically oriented programs. It is generally
recommended that you can get the most out of it if you have
familiarity with linear algebra. However, as it is, MATLAB
can be used in many ways, an almost any context, ranging
from the most elementary forms of math to projects pursued
at a doctoral level. It is a calculator, a plotter, and a
programming tool. It has many modules that can be added
to it that address different specializations, such as biology,
physics, and electrical engineering, among many others. At
CU, MATLAB is used in Calculus III labs and other upper
division courses, such as Differential Equations and Linear
Algebra. MATLAB was first developed in the 1970’s by
Cleve Moler, who was on the faculty of the University of
New Mexico.The programming language used with
MATLAB is usually referred to as MATLAB script or Mscript. After becoming familiar with the basic syntax of the
M-script, a number of useful utilities are available to you
that allow you to make extended uses of MATLAB. You
can, for example, write programs that involve simulation.
You can also create graphics, web pages, and GUI
applications. When you develop programs using MATLAB,
you can output the results to a number of media, including
graphics files, HTML pages, PDF files, and Word
documents. You can also connect up MATLAB with other
applications, such as Excel or LabView to make extended
uses of it. Since it is programmed in part using Java, you
can modify it in the background using Java.The feasibility
of the project is analyzed in this phase and business
proposal is put forth with a very general plan for the project
and some cost estimates. During system analysis the
feasibility study of the proposed system is to be carried out.
This is to ensure that the proposed system is not a burden to
the company. For feasibility analysis, some understanding
of the major requirements for the system is essential
http://www.ijettjournal.org
Page 80
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
SCREENSHOTS
INPUT IMAGE
Fig: 5.2c Segmented Image Based on Permutation
(Elapsed time is 9.434162 seconds.)
Fig: 5.1 Input Image
7.2 PROCESSING IMAGE
FIGURE 1
SECOND ITERATION
Fig: 5.2d Segmented Image Based on Permutation
(Elapsed time is 9.434162 seconds.)
Fig: 5.2 Segmented Image Based on Permutation
(Elapsed time is 19.558152 seconds.)
Fig 5.3 Permutation Result Based on Cluster Analysis
Fig: 5.2 Segmented Image Based on Permutation
(Elapsed time is 38.995707 seconds.)
ISSN: 2231-5381
http://www.ijettjournal.org
Page 81
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
VI.
CONCLUSION
6.1 CONCLUSION
We showed that permutation tests that use the showed that
permutation tests that use the (overall) test error as a dataset
statistic to estimate the significance of a classification might
suffer from an undercoverage problem. Permu- tation tests
that use a dataset statistic that is based on the per test
error(e.g.,the value of the test error for each class) result in
more reliable p-values and would be more sensitive to slight
changes in classification errors than the commonly used
methods in the literature.The partial label scrambling model
that we proposed is very robust when used in comparing
different permutation test procedures.
6.2 FUTURE WORK
An other model that we proposed in this study is based on
shifting the chance level to verify the significance of the pvalue estimation; this model gave results that lead to similar
conclusions to those of partial label scrambling.The
proposed permutation test procedure and its related dataset
statistic have the same computational complexity order of
the method that is commonly used in the literature. The
same conclusion applies to partial label scrambling and to
chance level shifting models.
REFERENCE
1. Bezdek, J.C., Hall, J.L., Clarke, L., 1993. Review
of MR image segmentation using pattern recognition.
Medical Physics 20, 1033–1048.
2. Borgelt, C., Kruse, R., 2003. Speeding up fuzzy
clustering with neural network techniques. In: Proc. 12th
IEEE International Conference on Fuzzy Systems
(FUZZ-IEEE’03, St. Louis, MO, USA) IEEE Press,
Piscataway, NJ, USA.
3. Boudraa, A.E., Champier, J., Cinotti, L., Bordet,
J.C., Lavenne, F., Mallet, J.J., 1996. Delineation and
quantization of brain lesions by fuzzy clustering in
positron emission tomography computerized. Medical
Imaging Graphics 20, 31–41.
4. Boudraa, A.O., Dehak, S.M., Zhu, Y.M., Pachai,
C., Bao, Y.G., Grimaud, J., 2000. Automated
segmentation of multiple sclerosis lesions in
multispectral MR imaging using fuzzy clustering.
Computers in Biology and Medicine 30, 23–40.
ISSN: 2231-5381
5. Chen, S., Zhang, D., 2004. Robust image
segmentation using FCM with spatial constraints based
on new kernel-induced distance measure. IEEE
Transactions on Systems Man and Cybernetics – Part B:
Cybernetics 34 (4), 1907–1916.
6. Chen, W., Maryellen, L.G., Bick, U., 2006. A
fuzzy c-means (FCM)-based approach for computerized
segmentation of breast lesions in dynamic contrastenhanced MR images. Academic Radiology 13 (1), 63–
72.
7. Chiu, S.L., 1994. Fuzzy model identification based
on cluster estimation. Journal of Intelligent and Fuzzy
Systems 2, 267–278.
8. Chuang, K.-S., Tzeng, H.-L., Chen, S., Wu, J.,
Chen, T.-J., 2006. Fuzzy c-means clus-tering with
spatial
information
for
image
segmentation.
Computerized Medical Imaging and Graphics 30, 9–15.
9. Clark, M.C., Hall, L.O., Goldgof, D.B.,
Velthuizen, R., Murtagh, F.R., Silbiger, M.S., 1998.
Automatic tumor-segmentation using knowledge-based
techniques. IEEE Transactions on Medical Imaging 117,
187–201.
10. Dubes, R.C., Jain, A.K., 1980. Clustering
methodology in exploratory data analysis. In: Advances
in Computers. Academic press, New York, pp. 113–225.
11. Fletcher-Heath, L.M., Hall, L.O., Goldgof, D.B.,
Murtagh, F.R., 2001. Automatic seg-mentation of nonenhancing brain tumors in magnetic resonance images.
Artificial Intelligence in Medicine 21, 43–63.
12. Gelbard, R., Goldman, O., Spiegler, I., 2007.
Investigating diversity of clustering meth-ods: an
empirical comparison. Source, Data & Knowledge
Engineering Archive 63 (October (1)), Iris-wine.
13. Gering, D.T., Grimson, W.E.L., Kikinis, R., 2002.
Recognizing deviations from nor-malcy for brain tumor
segmentation. In: Medical Image Computing and
Computer-Assisted Intervention – MICCAI. Springer,
2488, pp. 388–395.
14. Girolami, M., 2002. Mercer kernel-based
clustering in feature space. IEEE Transac-tions on
Neural Networks 13 (3), 780–784.
15. Gustafson, D.E., Kessel, W., 1979. Fuzzy
clustering with a fuzzy covariance matrix. In:
http://www.ijettjournal.org
Page 82
International Journal of Engineering Trends and Technology(IJETT) - Volume 4 Issue 1-Jan 2013
Proceedings of IEEE Conference on Decision Control,
San Diego, CA.
16. Hathaway, R.J., Bezdek, J.C., 2006. Extending
fuzzy and probabilistic cluster-ing to very large data
sets. Computational Statistical and Data Analysis 51,
215–234.
25. B. Vijayakumar, Ashish Chaturvedi and K. Muthu
Kumar, Effective Classification of Anaplastic Neoplasm
in Huddling Stain Image by Fuzzy Clustering Method,
International Journal of Scientific Research, Issue 3
volume 2, March-April 2013,
17. Hathaway, R.J., Hu, Y., 2009. Density-weighted
fuzzy c-means clustering. IEEE Trans-actions on Fuzzy
Systems 17 (February (1)).
18. Ichihashi, H., Honda, K., 2005. Fuzzy c-means
classifier for incomplete data sets with outliers and
missing values. Proceedings of the International
Conference on Computational Intelligence for
Modelling Control and Automation, and Inter-national
Conference on Intelligent Agents 2, 457–464.
19. Kannan, S.R., 2008. A new segmentation system
for brain MR images based on fuzzy techniques.
Applied Soft Computing 8 (4), 1599–1606.
20. Vijayakumar, B., and Ashish Chaturvedi.
"Automatic Brain Tumors Segmentation of MR Images
using Fluid Vector Flow and Support Vector Machine."
Research Journal of Information Technology 4.
21. Vijayakumar, B., and Ashish Chaturvedi. "Tumor
Cut-Segmentation and Classification of MR Images
using Texture Features and Feed Forward Neural
Networks." European Journal of Scientific Research
85.3 (2012): 363-372Krishnapuram, R., Kim, J., 1999.
A note on the Gustafson–Kessel and adaptive fuzzy
clustering algorithms. IEEE Transactions on Fuzzy
Systems.
22. Vijayakumar, B., and Ashish Chaturvedi,
“Idiosyncrasy Dissection and Assorting of Brain MR
Images Instigating Kernel Corroborated Support Vector
Machine,” Archive Des Science
23. Vijayakumar,
B.,
and
Ashish
Chaturvedi,“Abnormality
segmentation
and
classification of brain MR images using Kernel based
Support vector machine,” Archive Des Science,Volume
66,Issue 4
24. B. Vijayakumar, Ashish Chaturvedi and K. Muthu
Kumar, Brain Tumor in Three Dimenesional Magnetic
Resonance
Images
and
Concavity
Analysis,
International Journal of Computer Application, Issue 3,
Volume 1 (February 2013),
ISSN: 2231-5381
http://www.ijettjournal.org
Page 83
Download