Uploaded by Mujeeb khan

assignment # 04 term paper

advertisement
A Novel Segmentation Method for Optic Disc and
Optic Cup
Zeeshan Mughal
BEEE, -Abasyn Uniuversity Islamabad ,
Park Road Chak Shahzad Islamabad, Pakistan
shanmughal794@gmail.com
Abstract— Glaucoma is one of the major leading causes of
blindness all over the world, and it is a kind of chronic eye
fundus disease. Precise detection of the early stage of
glaucoma is crucial for glaucoma treatment. The vertical
cup-to-disc ratio (VCDR) is a key parameter for glaucoma
detection. Thus, the accurate segmentation of optic disc and
cup on the retina is the critical step for glaucoma detection.
Most of existing segmentation methods either have
unsatisfactory segmentation results or consume amounts of
time costs. In this paper, we propose an efficient
segmentation method based on deformable u-net for optic
disc and optic cup. We combine the deformable convolution
with the original u-net and design a novelar chitecture for
optic disc and cup segmentation. Finally, wee valuate our
method on several publicly datasets DRIONS-DB,RIM-ONE
v3, DRISHTIGS. The experiments show that our method
achieves satisfactory results, which also show promising
performance for glaucoma detection.
Keywords- Deformable convolution; glaucoma detection ;optic
disc and cup; segmentation;
I. INTRODUCTION
Glaucoma is the second biggest ocular disease in theworld which
can lead to permanent blindness [1]. It is predicted that the
number of people with glaucoma worldwide will increase from
64.3 million in 2013 to 111.8 million in 2040 [2]. As a chronic
fundus disease, there is no obvious symptom in the early stage,
but with the degeneration of disease, the structure of optic nerve
head will change gradually, optic nerve cells will die, and optic
nerve fibers will also be degenerated, finally, it will lead to
permanent blindness. So Glaucoma is also known as “the thief of
vision”. Due to the irreversible damage caused by glaucoma, the
early diagnosis of glaucoma is crucial to help patients to treat the
disease and control the progression of Glaucoma. There are three
main different sets of examinations for glaucoma detection: (1)
assessment of intraocular pressure (IOP), (2) assessment of
visual field, (3) assessment of optic nerve head. The assessment
of IOP is not sufficient enough, because different types of
glaucoma have different IOP conditions, glaucoma can also be
present without raised intraocular pressure. Meanwhile,
evaluation of abnormal visual field requires expensive medical
devices which can only be found in large hospitals, thus it is not
suitable for feature, and the other is based on automatic extracted
feature. The former refers to the traditional image process
techniques, such as the level-set based approach, thresholding
based approach, active shape modeling approach, and the later
mainly refers to the popular deep learning techniques. With the
great development of deep learning, methods based on deep
learning have made a great progress in many computer visions
tasks. Especially, after the occurrence of Fully Connected
Network
In order to further improve the performance of segmentation of
OD and OC for glaucoma detection, we propose a new method
which is based on a deformable u-net for segmentation of OD
and OC. Our method is inspired by deformable convolutional
networks [5] and deformable u-net for Sickle Cell Disease
diagnosis [6]. The main contributions of our work are as follows:
(1) We apply the deformable convolutional u-net to automatic
glaucoma detection, to the best of our knowledge, it is the first
time to apply deformable convolution network to glaucoma
detection. Besides, we modify the deformable U-net [6] and make
it have a better performance on glaucoma Detection.
(2) Due to the deformable u-net, our framework is more robust
for glaucoma detection. The experiment result shows that our
method has improved the OD segmentation result from 0.95 to
0.97 (in terms of the Dice metrics and RIM-ONE v.3), and the
OC segmentation result is also improved from 0.82 to 0.84
compared to the best reported result, which shows the potential
performance of deformable U-net in glaucoma detection.
The rest of paper is organized as follows. Section II gives a brief
introduction of our proposed method and architecture. Section IV
introduces the OD & OC segmentation process in our method.
Section V evaluates our proposed method. Finally, we conclude
our work in section VI
II. RELATED WORK
For automatic feature extract methods, Maninis et al. [7]
presented a deep retinal image understanding (DRIU) method
which was based on CNN architecture.
large scale screening of glaucoma. Optic nerve head can be
evaluated in the form of digital fundus image (DFI), DFI gives us
an intuitive display of the structure of the optic disc (OD) and the
optic cup (OC) on the retina, glaucoma can be diagnosed by the
assessment of OD and OC in a noninvasive manner. Compared to
the two aforementioned methods, assessment of optic nerve head
is both efficient and promising, digital fundus image is also
recognized as the main modality to diagnose glaucoma in many
automatic glaucoma detection methods. The optic disc (AKA.
optic nerve head) is the location where ganglion cell axons leave
the eye to form the optic nerve [3]. In digital fundus image, the
OD appears as a bright oval area, and the OC is the brighter oval
area in the center of the OD. The vertical cup-to-disc ratio
(VCDR) is a key parameter for glaucoma detection which is
calculated by a vertical cup to disc ratio. The illustration of OD
and OC of glaucoma case and non-glaucoma case is shown in
Figure 1. In order to get VCDR parameter, the area of OD and
OC has to be accurately segmented. Afterwards, the VCDR is
calculated based on segmented OD and OC for glaucoma
screening. Therefore accurate segmentations of OD and OC are
crucial to diagnosing glaucoma. The former refers to the
traditional image process techniques, such as the level-set based
approach, thresholding based approach, active shape modeling
approach, and the later mainly refers to the popular deep learning
techniques. With the great development of deep learning, methods
based on deep learning have made a great progress in many
computer visions tasks. Especially, after the occurrence of Fully
Connected Network
Fig
ure 1. Left is glaucoma case, right is non-glaucoma case, the blue
dotted line refers to optic cup, and the green dotted line denotes
optic disc. The sample images are derived from DRISHTI-GS.
They modified the CNN and combined a volume of fine-tocoarse feature maps into a regressed result for the optic disc
segmentation. However, this method is designed for OD
segmentation and it is not evaluated on OC segmentation. Zilly
et al. [8] utilized entropy sampling and ensemble learning for OC
and OD segmentation. They designed a multi-scale CNN for OD
and OC segmentation, during the training process, they used
B. Deformable U-net
1) U-net: In our paper, we implement a typical U-net as the
baseline model, which is based on fully convolutional networks
[12]. We modify the U-shape convolutional network (U-net) in
Ref. [4] as the main body of our deep architecture. U-net is a
kind of efficient fully convolutional neural network which was
first proposed in 2015 ISBI cell tracking challenge for the cell
image segmentation. Unlike the other segmentation networks, Unet can achieve great performance without large training data on
many tasks, especially on medical image segmentation tasks. The
U-net architecture consists of two path: the encoder path (left
side) and decoder path (right side).
Similar to the original U-net architecture, our architecture is
shown in Figure 2. We add the deformable convolution operation
at different layers, and inspired by the proposed architecture in
Ref. [9], we also reduce the number of convolution kernel in all
the convolution layers, and keep the fixed kernel number during
some certain layers. The results in Ref. [9] show that properly
reducing filters does not affect the segmentation performance.
Meanwhile, and less filters will lead to less computational costs.
2) Deformable convolution: As previously mentioned, U-net is
based on fully convolutional networks. Further, as standard
convolution is limited to dealing with object shape
transformations inherently due to its regular square receptive
field and fixed geometric structures in their building modules
[5]. The traditional convolution units usually sample at a fixed
position. In general, there are two normal ways to adapt the
geometric variations. One is to augment the dataset, another is
feature invariance algorithms such as SIFT algorithm [13].
However, large data augmentation will lead to huge time costs
during the model training process and still be unable to avoid the
defect in dealing with some special unknown geometric
transformation; Manually designed features are not feasible for
complicated model variation.
Deformable convolution can sample the input feature map in a
local and dense way and adapt to locating different shape
objects, which is exactly what we need. As Figure 5 shows, the
receptive field of standard convolution is fixed, while the
deformable convolution is self-adaptive. In this paper, we replace
the standard convolution kernel with deformable convolution in
the certain layers of the U-net
Fig
ure 4. Illustration of deformable convolution
Deformable convolution can sample the input feature map in a
local and dense way and adapt to locating different shape objects,
which is exactly what we need. As Figure 5 shows, the receptive
field of standard convolution is fixed, while the deformable
convolution is self-adaptive. In this paper, we replace the standard
convolution kernel with deformable convolution in the certain
layers of the U-net.
peat and needs much time to train the model. Sevastopolsky [9]
applied a modified u-net to segment OD and OC. The modified unet has less convolution filters leading to less parameters and it is
more efficient to train the model. Thus, it performs well on OD
segmentation, however it is not ideal on OC segmentation. Fu et
al. [10] proposed a method based on M-net (a modified u-net) for
joint segmentation of OD and OC. This method combines the
segmentation of OD with the segmentation of OC and utilizes the
polar transformation to pre-process the input images, which deals
well with the imbalanced data problem of pixelwise segmentation
for fundus image. However, the center coordinate of OD must be
acquired for polar transformation. Locating the center of OD is
also another challenge for glaucoma detection, and it will
consume more time costs.
III. METHODOLOGY
In this paper, we propose a deep learning architecture based on
deformable U-net for OD and OC segmentation, the illustration of
our method is shown in Figure 2, our architecture can be divided
into two main parts: Data Preprocess, Deformable U-net.
A. Data Preprocess
In our method, we downsample our input image to a small fixed
scale such as 256 X 256 px, then we apply the ContrastLimited
Adaptive Histogram Equalization (CLAHE) [11] to preprocess
the rescaled image. CLAHE is an enhancement operation that is
widely applied in retinal image analysis due to its capability to
improve the contrast and illumination of the fundus. Additionally,
it is also known to characterize the retinal nerve fiber. Many
existing glaucoma detection methods have both indicated the
advantages of CLAHE.
Medical image analysis is different from the natural image
analysis. Despite of the difference between their morphological
interpretability, the biggest difference is the sample size of
dataset. Unlike the natural image datasets, medical image datasets
are usually small-scale, it will bring several drawbacks, such as
overfitting, imbalanced class. In order to reduce the negative
effects, we apply data augmentation to the dataset, by means of
cropping and zooming the image, flipping the image horizontally
or vertically, the size of dataset has been enlarged several times.
Besides, we apply an extra data preprocess for optic cup
segmentation, due to the low contrast boundary
Fig
ure 5. Illustration of receptive field in standard convolution(c)
and deformable convolution(d)
denotes number, OD denotes whether dataset contains the optic
disc segmentation ground-truth, OC denotes whether dataset
contains optic cup segmentation ground-truth, DRIONS-DB
contains 110 full eye fundus images with OD segmentation;
RIM-ONE v.3 contains 159 images cropped by OD area, all the
images contain the OD and OC segmentations; DRISHTIGS
contains 50 full eye fundus images with OD and OC
segmentation
D.
Experimental Results The comparison between our proposed
method and the existing methods is shown in Table II and Table
III. We have five methods to evaluate on the datasets for OD and
huge. Besides, the full deformable U-net means replacing all
convolution in original U-net as deformable convolution. Zhang
et al. [6] proposed the full deformable U-net for Sickle Cell
Disease detection. The deformable Unet just replaces a part of
the convolution operations as deformable convolution, as shown
in Fig. 2. From the result, we notice that deformable U-net gets
the comparative performance with full deformable U-net. The
proposed method has less parameters and spends less
computational costs. From the Table III, we can notice the
similar results with OD segmentation, while the difference is that
optic cup segmentation is more challenging than OD
segmentation.
The visualization of OD segmentation on DRIONSDB is
shown in Figure 6, and the OD and OC segmentation results for
deformable U-net on RIM-ONE v.3 and DRISHTI-GS are shown
in Figure 7.
difference between OD and OC, in order to get a better
segmentation performance, we first extract the region of interest
(AKA.ROI).
In fact, we can use the result of OD segmentation to acquire
the ROI, then resize the image. After above operations, we will
obtain the inputs for optic cup
segmentation.
Fig
ure 3. Deformable convolution kernel
The main idea of deformable convolution is adding 2 offsets on
the regular convolution kernels, just as shown in Figure 3
(modified from Ref. [5]). The left is the regular kernel and the
right is the result of adding offsets. The key for deformable
convolution network is how to acquire the offsets, then the offsets
will be utilized to change the sampling locations during the
training process. The general convolution process can be denoted
as Eq. 1 (from Ref. [5]).
(1) where pi denotes the receptive field in the input feature map,
and pi png, w denotes the weights of kernel. For the deformable
convolution, an extra offset _pi will be added to get the Eq. 2
(from Ref. [5]).and offset _pi is generated by another general
convolution process. In order to change the input feature map
smoothly and slowly, the offset should be small. We denote the
results
IV. OPTIC DISC & CUP SEGMENTATION
Optic disc and optic cup segmentations are the goals of this paper.
In order to produce more accurate segmentation results, we
choose to train two deformable U-net orderly for OD and OC
segmentations. For the OD segmentation, we first preprocess the
raw images, and train the deformable Unet with our input images,
then output the OD segmentation results, meanwhile, ellipse
fitting technique is also applied to process the raw segmentation
results. After OD segmentation, the ROI of input image can be
easily and precisely extracted as the input of optic cup
segmentation. Due to relatively vague boundary of optic cup,
precise ROI extraction is essential for eyes.
VI. CONCLUSION
In this paper, a novel segmentation method for optic disc and
optic cup was proposed, our architecture was based on
deformable U-net. The deformable convolution makes our
architecture more robust for segmentation, and it can achieve
satisfactory results in spite of the small dataset. We evaluated our
proposed method on different publicly datasets and achieved
promising results compared with conventional methods.
optic cup segmentation. The raw optic cup segmentation results
also need to be processed by ellipse fitting. So far, we get the
segmentations of both OD and OC, then we can use the results to
calculate the VCDR of digital fundus image for further glaucoma
detection
V. EXPERIMENT
In this section, we evaluate our method on several publicly
available digital fundus image datasets DRIONSDB [14], RIMONE v.3 [15], and DRISHTI-GS [16]. The comparison with
other methods is also evaluated. Besides, we design several
internal comparison experiments for deformable convolution Unet. The performances of different number of deformable
convolution in u-net is evaluated. (1) Original U-net, (2)
deformable U-net (with different deformable layers), (3)
segmentation method based on other optic cup/disc segmentation
algorithms.
A. Evaluation Criteria
For evaluation of the segmentation results and comparison with
other segmentation methods, we take the real-valued dice score d
(A, B) [9] as our evaluation criteria
B. Datasets
Our experiment is based on three publicly available datasets
DRIONS-DB [14], RIM-ONE v.3 [15], and DRISHTI-GS [16],
which contain ground-truth segmentation for OD, and some for
optic cup as well.
OC segmentation, and the disappeared number means the result is
not reported. From the Table II, we can easily observe the
improvement of our method in Segmentation.
REFERENCES
[1] S. Kingman, “Glaucoma is second leading cause of blindness
globally,” Bulletin of the World Health Organization, vol. 82,
pp.887–888, 2004.
[2] Y.-C. Tham, X. Li, T. Y. Wong, H. A. Quigley, T. Aung, and
C.-Y. Cheng, “Global prevalence of glaucoma and projections of
glaucoma burden through 2040: a systematic review and
metanalysis,” Ophthalmology, vol. 121, no. 11, pp. 2081–2090,
2014.
[3] X. Chen, Y. Xu, D. W. K. Wong, T. Y. Wong, and J. Liu,
“Glaucoma detection based on deep convolutional neural
network,” in Proc. IEEE EMBC, Aug. 2015, pp. 715–718.
[4] O. Ranneberger, P. Fischer, and T. Brox, “U-net:
Convolutional networks for biomedical image segmentation,” in
Proc. Springer MICCAI, Nov. 2015, pp. 234–241.
[5] J. Dai, H. Qi, Y. Xiong, Y. Li, G. Zhang, H. Hu, and Yewei,
“Deformable convolutional networks,” CoRR, abs/1703.06211,
vol. 1, no. 2, p. 3, 2017.
[6] M. Zhang, X. Li, M. Xu, and Q. Li, “Image segmentation
and classification for sickle cell disease using deformable u-net,”
arXiv preprint arXiv:1710.08149, 2017.
[7] K. Maninis, J. Pont-Tuset, P. Arbel´aez, and L. V. Gool,
“Deep retinal image understanding,” in Proc. Springer MICCAI,
Oct. 2016, pp. 140– 148.
[8] J. Zilly, J. M. Buhmann, and D. Mahapatra, “Glaucoma
detection using entropy sampling and ensemble learning for
automatic optic cup and disc segmentation,” Computerized
Medical Imaging and Graphics, vol. 55, pp. 28–41, 2017.
[9] A. Sevastopolsky, “Optic disc and cup segmentation methods
for glaucoma detection with modification of u-net convolutional
neural network,” Pattern Recognition and Image Analysis, vol.
27, no. 3, pp. 618–624, 2017.
[10] H. Fu, J. Cheng, Y. Xu, D. W. K. Wong, J. Liu, and X. Cao,
“Joint optic disc and cup segmentation based on multi-label deep
network and polar transformation,” IEEE Trans. on Medical
Imaging, 2018.
Download