Distinguishing prostate cancer from benign confounders via a

Distinguishing prostate cancer from benign confounders via a cascaded classifier on multi-parametric MRI G.J.S. Litjensa , R. Elliottb , N. Shihc , M. Feldmanc , J.O. Barentsza , C.A. Hulsbergen - van de Kaaa , I. Kovacsa , H.J. Huismana and A. Madabhushib a Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands; b Case Western Reserve University, Cleveland, USA; c University of Pennsylvania, Philadelphia, USA; ABSTRACT Learning how to separate benign confounders from prostate cancer is important because the imaging characteristics of these confounders are poorly understood. Furthermore, the typical representations of the MRI parameters might not be enough to allow discrimination. The diagnostic uncertainty this causes leads to a lower diagnostic accuracy. In this paper a new cascaded classifier is introduced to separate prostate cancer and benign confounders on MRI in conjunction with specific computer-extracted features to distinguish each of the benign classes (benign prostatic hyperplasia (BPH), inflammation, atrophy or prostatic intra-epithelial neoplasia (PIN). In this study we tried to (1) calculate different mathematical representations of the MRI parameters which more clearly express subtle differences between different classes, (2) learn which of the MRI image features will allow to distinguish specific benign confounders from prostate cancer, and (2) find the combination of computer-extracted MRI features to best discriminate cancer from the confounding classes using a cascaded classifier. One of the most important requirements for identifying MRI signatures for adenocarcinoma, BPH, atrophy, inflammation, and PIN is accurate mapping of the location and spatial extent of the confounder and cancer categories from ex vivo histopathology to MRI. Towards this end we employed an annotated prostatectomy data set of 31 patients, all of whom underwent a multi-parametric 3 Tesla MRI prior to radical prostatectomy. The prostatectomy slides were carefully co-registered to the corresponding MRI slices using an elastic registration technique. We extracted texture features from the T2-weighted imaging, pharmacokinetic features from the dynamic contrast enhanced imaging and diffusion features from the diffusion-weighted imaging for each of the confounder classes and prostate cancer. These features were selected because they form the mainstay of clinical diagnosis. Relevant features for each of the classes were selected using maximum relevance minimum redundancy feature selection, allowing us to perform classifier independent feature selection. The selected features were then incorporated in a cascading classifier, which can focus on easier sub-tasks at each stage, leaving the more difficult classification tasks for later stages. Results show that distinct features are relevant for each of the benign classes, for example the fraction of extra-vascular, extra-cellular space in a voxel is a clear discriminator for inflammation. Furthermore, the cascaded classifier outperforms both multi-class and one-shot classifiers in overall accuracy for discriminating confounders from cancer: 0.76 versus 0.71 and 0.62. Keywords: CAD, feature selection, cascading classification, prostate cancer, MRI, confounders 1. INTRODUCTION One of the most difficult tasks in diagnosing prostate cancer on MRI is separation between prostate cancer and benign confounders like atrophy, inflammation, benign prostatic hyperplasia (BPH) and prostatic intra-epithilial neoplasia (PIN).1, 2 Atrophy occurs generally in the aging prostate and results in the loss of the normal lobular characteristics of the prostate glands; resulting in different structures in pathology. Inflammation of the prostate occurs for various reasons, for example urinary retention due a blocked urethra or biopsy. Inflammation tends to cause vaso-dilation and apoptotic or necrotic processes. Benign prostatic hyperplasia is the benign growth of the prostate transition zone (i.e. proliferation of epithelial and stromal cells), which occurs in many older men, although the exact cause is unknown. This results in a nodular, high density area of either stromal or glandular cells. Finally, PIN is essentially a carcinoma in situ, an abnormal proliferation of epithelial cells which yet remain contained inside the gland. Some PIN progresses to cancer, some doesn’t. One reason that it is hard to separate Medical Imaging 2014: Computer-Aided Diagnosis, edited by Stephen Aylward, Lubomir M. Hadjiiski, Proc. of SPIE Vol. 9035, 903512 · © 2014 SPIE · CCC code: 1605-7422/14/$18 · doi: 10.1117/12.2043751 Proc. of SPIE Vol. 9035 903512-1 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms Parameter Description T2W High-resolution allows texture characterization, cancer exhibits so-called ’erased charcoal’ sign. Different confounding classes might also exhibit different textures.4 DWI Characterizes tissue properties like cell density. Due to fast proliferation of cells prostate cancer tends to exhibit low diffusivity.9 Benign prostatic hyperplasia is also known to have low diffusivity.4 DCE Enhancement over time is related to micro-vessel density and permeability. Prostate cancer tends to have a high vascularization with leaky vessels. Confounders like inflammation are also known to exhibit a vascular response. citeLove92 Table 1: Description of overlap in imaging attributes on multi-parametric MRI for adenocarcinoma and confounding tissue classes. these confounders from cancer is that the original MRI images might not be best to visualize subtle differences that exist between the confounding classes and prostate cancer. Radiologists also face the additional challenge of attempting to mentally combine parameters from different MRI protocols to render an accurate diagnosis, a formidable task. The current Prostate Imaging Reporting and Data Standard (PI-RADS) for reporting MRI does not yet offer guidelines which allow discrimination between specific confounders and prostate cancer.3 Improved understanding of the image characteristics of these confounders across the different MRI sequences (T2-weighted (T2w), dynamic contrast enhanced (DCE) and diffusion-weighted (DWI)) might improve the PI-RADS standard and subsequently the detection and diagnosis of prostate cancer.4 Additionally, the image characteristics can be used to identify features which could increase the performance of computer-aided detection and diagnosis systems for prostate cancer.5–7 Figure 1 illustrates the challenges in distinguishing between cancer and all the various confounding categories in ”one shot”. In Figure 1(a) we show what happens when we group all confounders as a single class. A linear discriminant classifier attempting to identify every image location as belonging to either cancer or benign confounding classes, ends up assigning every sample to the non-cancer class. In Figure 1(b), we show what happens when you use the same features, and attempt a pair-wise classification (in this case atrophy versus cancer). We see that in this 2-class separation problem, the LDA classifier is able to create an accurate decision boundary. Figure 1(c)-(e) further illustrate the utility of the pair-wise classification approach. However, it is important to note that the choice of features for distinguishing the classes of interest is as important as the specific choice of hierarchy within the cascade for separating adenocarcinoma from benign confounding classes. The types of features we use to characterize the benign confounders are based on the diagnostic guidelines presented in.3 A short summary is also presented in Table 1. The T2-weighted imaging is mostly used for its high resolution and contrast, allowing detailed visualization of the tissue anatomy. From clinical guidelines we know that the T2-weighted images are especially useful to assess the texture of prostate lesions.3 Prostate cancer exhibits a so-called ’erased charcoal sign’, a smudge-like dark texture on T2-weighted images.3 Different benign confounders might also express different types of textures. Diffusion weighted imaging is specifically useful for characterizing tissue at a microscopic level, enabling us to assess traits like cellular density at a macro level. Prostate cancer has a high cellular density compared to the normal glandular structure of the prostate.8–10 This results in a reduced diffusivity in cancerous tissue and thus a high signal in high b-value images and subsequently a low apparent diffusion coefficient values. Finally, dynamic contrast-enhanced MRI result in signal over time curves and shows the uptake of contrast agent in tissue. This allows us to measure attributes of the tissue vasculature, like the relative fraction of extra-vascular, extra-cellular space in each voxel and micro-vessel permeability. Prostate cancer lesions tend to have leaky micro-vasculature, which results in fast initial enhancement and wash-out.11, 12 It is known that for example inflammation also causes a vascular response, however, it is unknown in which way this differs from prostate cancer. Proc. of SPIE Vol. 9035 903512-2 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms 0 1 2 2 Gabor θ=1.18, λ=4 Gauss. Deriv XX σ=4.1 3 0 1 4 0 -2 -4 1 0 -1 -2 -6 -3 -8 -1 0 1 ADC -4 -1 2 0 1 2 (Pseudo)T2-Map (a) 5 4 (b) 0.8 1 3 1 2 0.6 3 0.4 Time-to-peak Gabor θ=0.78, λ=1.5 3 0.2 0.0 -0.2 2 1 0 -0.4 -0.6 -0.8 -4 -3 -2 -1 0 1 2 3 Gauss. Deriv. XX σ=2.8 1 5 4 2 0 2 4 Maximum enhancement (c) 6 (d) 4 1 4 Gauss. Deriv. XX σ=4.1 2 0 -2 -4 -6 -8 -10 -2 -1 0 ADC 1 2 3 (e) Figure 1: Figure (a) shows the results for training a linear discriminant classifier to separate all the benign confounding classes (0) and cancer (1) at once given two features. As it is unable to discriminate the classes, it assigns every sample to class 0. The other figures show that in a cascaded setting, separating out one benign confounder per step, gives a reasonable decision boundary given discriminative features. In figures (b)-(e) class 0 is atrophy, class 1 is cancer, class 2 is BPH, class 3 is inflammation and class 4 is HGPIN. An explanation of the features used in these plots can be found in section 3.3. However, as previously suggested the scanner derived MRI parameters by themselves might not possess sufficient discriminability to distinguish adenocarcinoma from the confounding tissue classes, motivating the need for computer extracted image features. The first goal of this study was (1) to obtain computer-extracted features which are designed to better capture specific subtle differences between the classes. For the second goal we need to (2) establish which combination of regular and computer-extracted features is the most suitable for each benign confounding class. Proc. of SPIE Vol. 9035 903512-3 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms To be able to assess whether these identified features are useful in discriminating the different confounders and cancer, we can integrate them into different classification strategies. One option is to use a single shot classification approach, considering all the benign confounders as a single class and trying to differentiate this class from prostate cancer.7, 11, 13 The second option is to use a multi-class classification approach in which each benign confounder is a separate class. In this paper we will also investigate a third strategy: the use of a cascaded classifier to separate the confounding classes and prostate cancer step-by-step. Similar studies have shown cascaded approaches to outperform regular one-shot classification or multi-class classification approaches because it sub-divides the difficult task of separating confounders and cancer into easier sub-problems, thus circumventing issues with similar classes.14 In the paper by Doyle et al.14 they showed that cascaded approaches resulted in better discrimination of histopathologic tissue classes compared to multi-class classification. In Figure 1 we presented an illustrative example of such an approach. Furthermore, it allows us to perform feature selection for each problem separately. As such the third goal of this paper is (3) to establish if cascaded classifiers outperform regular single-shot or multi-class classifiers. The rest of the structure of this paper is as follows: first we will discuss previous work and our novel contributions. Second, we will discuss the methodology and experimental design. This will cover histology/MRI co-registration, feature extraction and selection and subsequently classification. The next section will present and discuss the results after which we will end with concluding remarks and future work. 2. PREVIOUS WORK AND NOVEL CONTRIBUTIONS Only little work has been done on accurately characterizing the appearance of different benign confounders in prostate MRI, most likely due to difficulty of obtaining accurate annotations. Several groups have investigated specific confounding classes using a single modality, for example benign prostatic hyperplasia on diffusionweighted imaging (Liu et al.,15 Oto et al.16 ). Another example is the differentiation of prostatitis using diffusionweighted imaging (Nagel et al.17 ). None of these groups have looked at all the benign confounders or all the modalities. There has been some previous work on computer-extracted features for the detection of prostate cancer.6 investigated the use of magnetic resonance spectroscopy in combination with T2-weighted imaging to identify the voxels that are affected by prostate cancer. They also introduced the use of wavelet embedding to map MRS and T2-W texture features into a common space. This work was further expanded and evaluated in.7 Niaf et al.18 presented the use of computer-aided diagnosis in the peripheral zone of the prostate using DCE features (similar to Vos et al.19 ). They confirmed the results in discriminating prostate cancer from normal regions (area under the ROC curve (AUC)=0.89) and discriminating prostate cancer from suspicious benign regions (AUC of 0.82). Singanamalli et al.20 investigated the use of DCE-MRI/histopathology fusion to assess feature correlations between prostate cancer of different grades. Lastly, Viswanath et al.21 investigated the use of texture features to discriminate prostate cancer from normal and benign regions. They also found different texture features were important depending on the originating prostatic zone of the cancer. In this paper we have several novel contributions: 1. Use of pathology annotated benign confounder classes to identify corresponding regions on the MRI 2. Identifying important imaging features per confounding class over all prostate MRI parameters 3. Establishing computer-extracted features which better capture subtle differences between the confounding classes 4. Improving computerized prostate cancer diagnosis using a cascaded classifier which incorporates per-stage feature selection Proc. of SPIE Vol. 9035 903512-4 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms 3. METHODOLOGY 3.1 Brief overview These methods are evaluated in a unique MRI/histology data set with annotations of cancer and the benign confounding classes on the histopathologic slides from 31 patients with biopsy-confirmed prostate cancer. The prostatectomy slides were then carefully co-registered to the MRI to obtain the MR regions corresponding to the different classes. The first goal was to (1) extract texture, pharmacokinetic and intensity features for each of the classes to establish the imaging signatures, after which we (2) used maximum relevance minimum redundancy (mRMR) feature selection22 to identify the best feature combination for each class. Using this information a cascading classifier which will remove each confounding class in a step-by-step approach was created. Subsequently, (3) the cascaded classifier was evaluated with respect to accuracy of classifying cancer and compared to single-shot two-class and multi-class classification. 3.2 Annotation and co-registration of histopathology A pathologist contoured distinct examples of each benign confounding class and prostate cancer on the histopathology slides using the Aperio ImageScope software, if present. The pathology annotations were transferred to the MRI by registering the whole-mount slide using a thin plate spline registration technique.21 The process, in a step-by-step fashion, goes as follows: 1. The slice in the MRI which corresponds to the prostatectomy slide is established by an image analysis researcher under the supervision of a radiologist by comparing landmarks on the pathology and the MRI.23 2. Corresponding points are indicated on the prostate boundary for both the prostatectomy slide and the MRI slice. 3. A b-spline transformation is calculated to move from the prostatectomy coordinate space to MRI coordinate space.24 4. The histopathology image is transformed to the MRI space using this b-spline transformation. A visual assessment of whether the registration is accurate is made. 5. The annotations of the pathologist are transferred to the MRI using this b-spline transformation. 3.3 Feature extraction and feature selection Correction of intensity drift Intensity drift and bias field are distinct issues that are well known in MRI.21 Intensity drift means that intensities differ from scanner to scanner and even from protocol to protocol on the same scanner. The bias field caused by the coil causes signal intensity fluctuation of the same tissue across the field of view. To circumvent both issues in T2-weighted images we can calculate a (pseudo)T2-map using the transverse T2W-image and the proton density-weighted image as described in.13 This approach uses MR signal equations and a muscle reference region of interest to reduce intensity drift between the different studies. T2-weighted imaging For the T2-weighted texture features we calculated several often used filter types: 13 Haralick texture features using 3 kernel sizes (3,5 and 7 voxels), Gabor texture features using 4 different angles and 3 different wavelengths between 1mm and 6mm and Gaussian derivatives up to second order using 4 different scales between 1mm and 6mm.21 The texture features were all calculated on the (pseudo)T2-map. Diffusion-weighted imaging From the diffusion-weighted imaging we directly incorporate the apparent diffusion coefficient and b800 image. Furthermore, as prostate cancer lesions tend to have a focal appearance on diffusion weighted imaging, we implemented the multi-scale blobness filter proposed by Li et al.25 and calculated the filter using 4 scales between 1 and 6 mm on the b800 and ADC images. Proc. of SPIE Vol. 9035 903512-5 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms Category Intensity Feature name (Pseudo)T2-map ADC b-800 Gaussian tives27 Texture Pharmacokinetic MR sequence 27 Deriva- Feature settings T2W DWI DWI T2Map Captures Molecular structure Cellular density Cellular density Up to 2nd order, σ=2.0, 2.67, 4.1 and 6.0 mm Kernel sizes 3, 5, 7 voxels Haralick21 T2Map Gabor21 T2Map Multi-scale blobness25 All Curve fitting parameters5 DCE Time-to-peak, maximum enhancement, wash-out rate Std. Tofts PK model5 DCE Ktrans, Kep, Ve Four angles: 0, π4 , π2 , 3π , λ=1.5, 2 and 4 4 voxels σ= 2.0, 2.67, 4.1 and 6 Edge strengths, stability of edges Homogeneity, quantitative texture measures edge direction, frequency Focality and compactness Semi-quantitative Micro-vasculature permeability, extracellular space Quantitative microvasculature permeability, extra-cellular space Table 2: Table of feature and feature settings calculated on the different MR sequences and used in the different classifiers. Features are chosen such that they represent physiological quantities which we hypothesize are different in the different confounding classes. For example, inflammation is known to cause vaso-dilation and as such result in a different response compared to the other confounders. Dynamic contrast-enhanced imaging Dynamic contrast-enhanced MRI also tends to suffer from scanner and protocol dependency.12, 19, 26 To remove this dependency and extract the most useful information from these curves we implemented curve fitting and pharmacokinetic modeling routines as presented in.5, 12, 26 The curve fitting routine uses a bi-exponential curve model and was implemented to enable faster and more robust pharmacokinetic modeling. The pharmacokinetic model that was implemented was the standard Tofts model,26 which ignores the vascular component in each voxel. The temporal resolution of the DCE time series was 4 seconds. To capture characteristics on the microvasculature we included a total of 3 curve features (time-to-peak, washout rate and maximum enhancement)12 and 3 pharmacokinetic features (Ktrans, Kep, Ve).5 Furthermore, as cancer also tends to have a focal appearance on DCE MRI3 we also calculate the Li blobness filter on the Ktrans, Kep, Ve, maximum enhancement, time-topeak and wash-out rate images. Feature selection The maximum relevance, minimum redundancy (mRMR) feature selection technique22 was used to determine features for distinguishing each of the following two-class classification tasks (BPH vs. cancer, inflammation vs. cancer, HGPIN vs. cancer and atrophy vs. cancer). This mRMR feature selection tries to maximize the mutual information between the individual features and the class labels and minimize the mutual information between features to select the most relevant features without redundancy. After mRMR feature selection, unique features discriminating each benign class from cancer were identified. mRMR feature selection was selected because it tends to give good results at reasonable computation time and does not need parameter optimization or nested cross-validation loops like for example sparse coding.28 Proc. of SPIE Vol. 9035 903512-6 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms Prostate cancer BPH HGPIN Atrophy Inflammation Figure 2: Schematic representation of the cascade classification strategy. Each stage shows a traffic light map for that specific confounding class obtained from the cascaded classifier. In this heat map red is high likelihood, green is low likelihood. The contours indicate cancer (pink) and the confounding class (blue). Cascading Classifier The feature selection step can be used to improve the classification of unknown samples in prostate cancer and benign disease by using a cascaded classification scheme. At each step of the cascade the classifier attempts to separate out one type of benign disease. This is schematically shown in Figure 2. The order of the cascade was empirically determined. At each step of the cascade, mRMR feature selection is used. Training data at that step consists of the benign confounder to be detected at that stage (in step 1 inflammation), which is given class label 0 and all classes that are upstream in the cascade (given class label 1). The training folds were balanced before feature selection. A linear discriminant classifier is then trained with the selected features. In each step either a sample is identified as either belonging to the target benign confounding class and removed from further classification, or it moves on to the next stage. At the final stage of the classifier cascade, the remaining samples are either classified as atrophy or as prostate cancer. 4. EXPERIMENTAL RESULTS AND DISCUSSION 4.1 Patient data, annotation, and co-registration For this study we included 31 patients with an MRI including T2-weighted, dynamic contrast-enhanced, diffusionweighted and proton density-weighted image sequences. Inclusion criteria were persistently high PSA, initial negative TRUS biopsy, multi-parametric MRI positive for prostate cancer and subsequent prostactectomy. MRI acquisition was performed using a 3 Tesla MRI scanner (either a TrioTim or a Skyra; Siemens, Erlangen, Germany). Cases were acquired both with and without an endorectal coil. A pelvic phased array coil was always used. The multi-parametric protocol consisted of three orthogonal T2-weighted volumes, diffusion-weighted imaging (three b-values averaged over three orthogonal directions, 50, 400-500 and 800) and dynamic contrastenhanced imaging (15 mL of Dotarem; Guerbet, France). After prostatectomy, in total 44 histological H&E stained slides where digitized using a digital slide scanner (VS120-S5, Olympus, Japan) at 10x or 20x, corresponding to a resolution of 0.6 um and 0.3 um respectively, with at least 1 slide per patient; sometimes two slides per patient were digitized. The slides selected contained the largest tumor volume, based on the interpretation by an experienced pathologist. Areas of prostate cancer, Proc. of SPIE Vol. 9035 903512-7 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms 4.4 (a) (d) (g) (b) * (c) (e) (f) (h) (i) Figure 3: Representative images of a T2-weighted MRI ((a),(d),(g)) and a H&E stained prostatectomy slide ((b),(e),(h)) and the result of the subsequent MRI/histology registration with ((c),(f),(i)) annotations. In this example prostate cancer is indicated in yellow, a BPH nodule in red, inflammation in blue and atrophy in green. Proc. of SPIE Vol. 9035 903512-8 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms Rank 1 Rank 2 Single-shot classifier Cascaded Classifier Likelihood (TC) Likelihood (Cas) Blobness b800 Likelihood (TC) Likelihood (Cas) b800 Likelihood (TC) Likelihood (Cas) Har. Meas. Correl. Likelihood (TC) Likelihood (Cas) 1. 0 Inflammation Ve HGPIN 4. di \ .80 BPH GD XX σ = 4.1] - itidI9 0 Atrophy GD YY σ = 4.1] Hara. Sum Var. GD σ = 2.8 0. 0 Table 3: The top 2 selected features are shown in the first two columns for each of the benign class versus cancer classification tasks. Each row is a separate patient. The last two columns show likelihood maps for the single-shot classification (column 3) and the cascaded classification (column 4). The overlays indicate likelihood value: transparent, green and red are a zero, low and high value respectively between 0 and 1. The confounding class is indicated with a dark blue contour and prostate cancer with a purple contour. The corresponding over-expressing features (first, second column) are described in Table 4 BPH, atrophy, inflammation and HGPIN were subsequently annotated on each slide by one of two pathologists using the Aperio ImageScope software. The annotations made on the histopathologic slides where transferred to the MRI using the methodology explained in subsection 3.2. Slice correspondences were manually established by an image analysis researcher based on landmarks and the relative position within the prostatectomy data set (e.g. if the middle slide in the prostatectomy was selected it most likely does not correspond to the first prostate slice in the MRI). This resulted in a total of 68 atrophy, 25 BPH, 58 PIN, 47 inflammation and 93 prostate cancer lesions on MRI. An example of co-registered annotations is shown in Figure 3. 4.2 Experiment 1: Feature Selection via mRmR In Table 4, the top 5 selected features for each of the different benign vs. cancer classification tasks are presented. Each task has a number of distinct unique features (indicated with the gray background). We also show the top Proc. of SPIE Vol. 9035 903512-9 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms Rank 1 2 3 4 5 Inflammation BPH Ve Haralick Sum Variance (ω=7) Blobness ADC Gauss. Deriv (o = xx, σ = 4.1) Max. Enhancement Gauss. Deriv (o = yy, σ = 4 .1 ) b800 intensity Time-to-peak Blobness ADC Gabor (λ = 1 .5 , θ = 0 .0 ) Atrophy HGPIN Gauss. Deriv (o = −, σ = 2.8) Haralick Measure of Correlation1 (ω=3) Blobness (Pseudo)T2Map Blobness Kep Gabor (λ = 2.0, θ = 0.0) Gauss. Deriv (o = xx, σ = 4.1) Blobness b800 ADC intensity Haralick Contrast (ω=3) Blobness time-to-peak Rank 1 2 3 4 5 Table 4: Selected features for each of the benign classes during the feature selection for each of the classification tasks (BPH vs. cancer, atrophy vs. cancer). Feature that are only selected in one classification task, and thus are specific for that specific confounder, are indicated with gray backgrounds. In the table o is the order of the Gaussian derivative, σ the scale, θ the angle of the Gabor filter, λ the bandwidth and ω the kernel size for the Haralick texture features. 2 selected features in Table 3. 4.2.1 Distinguishing inflammation from cancer Inflammation often results in marked vascular changes like increased blood flow and permeability (to allow leukocytes to move to the inflamed tissue) and increased extra-cellular, extra-vascular space due to dying (apoptotic or necrotic) cells. From Table 4 we can see that this has been picked up by the feature selection strategy (selected features 1 and 5, Ve and maximum enhancement) and that these features can help discriminate between inflammation and prostate cancer. 4.2.2 Distinguishing atrophy from cancer Atrophy of the prostate results in a loss of the regular structure of the glands in the prostate (reduced infolding, cytoplasm volume and presence of corpora amylacea).29 Atrophy tends to be focal and looks texturally different from prostate cancer on pathology. This can also be recovered on the MRI if we look at Table 4, where 3 out of the top 5 features are related to textural attributes. Finally, an excellent discriminator of atrophy versus prostate cancer would be the blobness feature on the (pseudo)T2-map, as prostate cancer tends to have a diffuse appearance with irregular border, whereas the focal appearance of atrophy on pathology carries over to the T2-weighted MRI. 4.2.3 Distinguishing BPH from cancer Benign prostatic hyperplasia is benign glandular or stromal proliferation in the transition zone of the prostate. Typically BPH is a difficult confounder to separate from prostate cancer, as features like ADC intensity tend to show similar changes.30 From our feature selection results we can appreciate that instead of using ADC intensity to discriminate prostate cancer and BPH, it is more useful to look at the textured appearance on T2-weighted imaging, in combination with a focal appearance on the ADC map. BPH tends to have a more nodular, compact shape compared to prostate cancer which is picked up in our experiments. 4.2.4 Distinguishing PIN from cancer Prostatic intra-epithelial neoplasia is a cancer pre-cursor, sometimes it will progress into cancer, sometimes it will not. As such, it shares similar imaging characteristics with cancer. However, in our experiments we can still identify features which can specifically help discriminate this confounder. Directly using ADC intensity works well in discriminating PIN, as it tends not yet to exhibit the high cellular density which is apparent in aggressive prostate cancer.17 Furthermore, as it still localized and usually well-defined, it exhibits a more compact and focal appearance compared to prostate cancer on the time-to-peak images. Proc. of SPIE Vol. 9035 903512-10 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms Accuracy 0.85 0.80 0.75 0.70 0.65 0.60 0.55 0.50 0.450 5 10 15 Number of selected features 20 Figure 4: Two-class accuracies over the number of selected features for the simple two-class strategy (green), multi-class classification (blue) and the cascade classification (red) including 95% confidence intervals. The cascaded classifier has higher accuracy for almost the entire range of features. 4.3 Cascaded classifier based evaluation of discriminating features separating cancer from benign confounders The evaluation is based on the overall accuracy in separating benign disease and prostate cancer on a lesion level and compared to direct two-class classification of benign disease and prostate cancer and multi-class classification with and without feature selection. In the multi-class and cascaded classification the resulting benign classes are grouped as one benign class after classification, after which accuracy is calculated. The number of features for the classifiers and in each stage of the cascade was varied between 2 and 20 in steps of 2 to allow a comparison across the number of selected features. The validation of the cascaded classifier was performed in a five-fold patient-based cross-validation (20 repeated runs). The results are shown in Figure 4. The cascaded classifier outperforms the single-shot classifier and the multiclass classifier in the overall accuracy of correctly classifying voxels as either containing cancer or not containing cancer over almost the entire range of features. The overall maximum accuracy is 76.4% when using 4 features for the cascading classifier and 63.0% and 71.0% for the single-shot classification and the multi-class classifier at 10 and 8 features respectively. Overall accuracy is limited because some lesions annotated on pathology ended up being only a couple of voxels on the MRI and are indistinguishable from normal tissue. 5. CONCLUDING REMARKS A major issue in rendering a cancer diagnosis on prostate MRI is the presence of confounding benign classes which have similar image characteristics as prostate cancer on multi-parametric MRI. Since many of these confounder categories (e.g BPH, PIN, atrophy, inflammation) have overlapping appearance to prostate cancer, they can mask the appearance of true disease on MRI. Hence it can be difficult for radiologists to assess whether a suspicious region is benign or malignant. Computerized feature extraction might allow better assessment of subtle differences between classes by changing the representation of the multi-parametric MRI in such a way that accentuates these subtle differences. By leveraging ex vivo histopathology from patients undergoing radical prostatectomy with a pre-operative MRI we were able to map annotations of benign confounders from ex vivo pathology onto MRI. This allowed us to identify features which might help in discriminating between specific confounders and prostate cancer. Furthermore, we combined mRMR feature selection with a cascaded classifier, which can step-by-step learn features for each of the confounders, thus simplifying the classification task for the following steps, as opposed to single-shot or multi-class classifiers, which are restricted to only learning features common to all benign classes Proc. of SPIE Vol. 9035 903512-11 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms or having to learn all features at once. We applied these techniques to a data set of 31 patient for which we obtained whole mount prostatectomy and full multi-parametric MRI. The main results include a unique set of image attributes for each of the benign classes. These selected features can be related to the underlying pathology to achieve a better understanding of which pathological processes cause certain image characteristics on MRI. An example here is the fraction of extra-cellular, extravascular space, Ve, which is discriminative for inflammation. Our results suggest that only very specific computer extracted image features are able to accurately discriminate between the different benign confounder and prostate cancer. Our results also indicate that the computer extracted features, as opposed to the original scanner derived parameters (e.g. T2w intensities) can better express the subtle differences between classes than the original MR images is valid. The fact that computer extracted features from all 3 imaging protocols are differentially expressed in discrimination of one or more confounding tissue classes and cancer suggests that multi-parametric MRI is needed for improved prostate cancer diagnosis. This study also has some limitations. We did not discriminate on lesion size when converting the pathology annotations to the MRI. This resulted in some very small lesions (< 5 voxels) which might get misclassified due to issues like partial volume effect or small co-registration errors. We also did not take into account combined classes, i.e. sometimes areas of atrophy or cancer also show active inflammation. Finally, the splits in the hierarchical classifier were determined empirically. A future avenue of work is to automatically learn the optimal cascade hierarchy to distinguish the confounder and cancer classes. In the future we would like to validate our approach on a larger cohort and experiment with more complex classifiers for each of the steps in the cascade. Furthermore, we would like to look into a soft per-class classification to take into account class overlap (e.g. inflammation occurring together with atrophy). 5.1 Acknowledgements Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under award numbers R01CA136535-01, R01CA140772-01, and R21CA167811-01; the National Institute of Diabetes and Digestive and Kidney Diseases under award number R01DK098503-02, the DOD Prostate Cancer Synergistic Idea Development Award (PC120857); the QED award from the University City Science Center and Rutgers University, the Ohio Third Frontier Technology development Grant. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. REFERENCES [1] Hoeks, C. M. A., Hambrock, T., Yakar, D., Hulsbergen-van de Kaa, C. A., Feuth, T., Witjes, J. A., Fütterer, J. J., and Barentsz, J. O., “Transition zone prostate cancer: Detection and localization with 3-t multiparametric MR imaging,” Radiology 266, 207–217 (2013). [2] Bratan, F., Niaf, E., Melodelima, C., Chesnais, A. L., Souchon, R., Mège-Lechevallier, F., Colombel, M., and Rouvière, O., “Influence of imaging and histological factors on prostate cancer detection and localisation on multiparametric MRI: a prospective study,” Eur Radiol , 1–11 (2013). [3] Barentsz, J. O., Richenberg, J., Clements, R., Choyke, P., Verma, S., Villeirs, G., Rouviere, O., Logager, V., Fütterer, J. J., and European Society of Urogenital Radiology, “ESUR prostate MR guidelines 2012,” Eur Radiol 22, 746–757 (2012). [4] Lovett, K., Rifkin, M. D., McCue, P. A., and Choi, H., “MR imaging characteristics of noncancerous lesions of the prostate,” J Magn Reson Imaging 2, 35–39 (1992). [5] Vos, P. C., Barentsz, J. O., Karssemeijer, N., and Huisman, H. J., “Automatic computer-aided detection of prostate cancer based on multiparametric magnetic resonance image analysis,” Phys Med Biol 57, 1527–1542 (2012). [6] Tiwari, P., Viswanath, S., Kurhanewicz, J., Sridhar, A., and Madabhushi, A., “Multimodal wavelet embedding representation for data combination (maweric): integrating magnetic resonance imaging and spectroscopy for prostate cancer detection,” NMR Biomed (2011). Proc. of SPIE Vol. 9035 903512-12 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms [7] Tiwari, P., Kurhanewicz, J., and Madabhushi, A., “Multi-kernel graph embedding for detection, gleason grading of prostate cancer via MRI/mrs,” Med Image Anal 17, 219–235 (2013). [8] Haider, M. A., van der Kwast, T. H., Tanguay, J., Evans, A. J., Hashmi, A.-T., Lockwood, G., and Trachtenberg, J., “Combined T2-weighted and diffusion-weighted MRI for localization of prostate cancer,” AJR Am J Roentgenol 189, 323–328 (2007). [9] Hambrock, T., Somford, D. M., Huisman, H. J., van Oort, I. M., Witjes, J. A., Hulsbergen-van de Kaa, C. A., Scheenen, T., and Barentsz, J. O., “Relationship between apparent diffusion coefficients at 3.0-T MR imaging and Gleason grade in peripheral zone prostate cancer,” Radiology 259, 453–461 (2011). [10] Kobus, T., Hambrock, T., Hulsbergen-van de Kaa, C. A., Wright, A. J., Barentsz, J. O., Heerschap, A., and Scheenen, T. W. J., “In vivo assessment of prostate cancer aggressiveness using magnetic resonance spectroscopic imaging at 3 t with an endorectal coil,” Eur Urol 60, 1074–1080 (2011). [11] Artan, Y., Haider, M. A., Langer, D. L., van der Kwast, T. H., Evans, A. J., Yang, Y., Wernick, M. N., Trachtenberg, J., and Yetik, I. S., “Prostate cancer localization with multispectral MRI using cost-sensitive support vector machines and conditional random fields,” IEEE Trans Image Process 19, 2444–2455 (2010). [12] Huisman, H. J., Engelbrecht, M. R., and Barentsz, J. O., “Accurate estimation of pharmacokinetic contrastenhanced dynamic MRI parameters of the prostate,” J Magn Reson Imaging 13, 607–614 (2001). [13] Litjens, G., Barentsz, J., Karssemeijer, N., and Huisman, H., “Automated computer-aided detection of prostate cancer in MR images: from a whole-organ to a zone-based approach,” in [Medical Imaging], Proceedings of the SPIE 8315, 83150G–83150G–6 (2012). [14] Doyle, S., Feldman, M. D., Shih, N., Tomaszewski, J., and Madabhushi, A., “Cascaded discrimination of normal, abnormal, and confounder classes in histopathology: Gleason grading of prostate cancer,” BMC Bioinformatics 13(1), 282 (2012). [15] Liu, X., Zhou, L., Peng, W., Wang, C., and Wang, H., “Differentiation of central gland prostate cancer from benign prostatic hyperplasia using monoexponential and biexponential diffusion-weighted imaging,” Magn Reson Imaging 31, 1318–1324 (2013). [16] Oto, A., Kayhan, A., Jiang, Y., Tretiakova, M., Yang, C., Antic, T., Dahi, F., Shalhav, A. L., Karczmar, G., and Stadler, W. M., “Prostate cancer: differentiation of central gland cancer from benign prostatic hyperplasia by using diffusion-weighted and dynamic contrast-enhanced MR imaging,” Radiology 257, 715– 723 (2010). [17] Nagel, K. N. A., Schouten, M. G., Hambrock, T., Litjens, G. J. S., Hoeks, C. M. A., Haken, B. T., Barentsz, J. O., and Fütterer, J. J., “Differentiation of prostatitis and prostate cancer by using diffusion-weighted MR imaging and MR-guided biopsy at 3 t,” Radiology 267, 164–172 (2013). [18] Niaf, E., Rouvière, O., Mège-Lechevallier, F., Bratan, F., and Lartizien, C., “Computer-aided diagnosis of prostate cancer in the peripheral zone using multiparametric MRI,” Phys Med Biol 57, 3833 (2012). [19] Vos, P. C., Hambrock, T., Barentsz, J. O., and Huisman, H. J., “Computer-assisted analysis of peripheral zone prostate lesions using T2-weighted and dynamic contrast enhanced T1-weighted MRI,” Phys Med Biol 55, 1719–1734 (2010). [20] Singanamalli, A., Sparks, R., Rusu, M., Shih, N., Ziober, A., Tomaszewski, J., Rosen, M., Feldman, M., and Madabhushi, A., [Identifying in vivo DCE MRI parameters correlated with ex vivo quantitative microvessel architecture: A radiohistomorphometric approach], 867604, SPIE-Intl Soc Optical Eng (Mar 2013). [21] Viswanath, S. E., Bloch, N. B., Chappelow, J. C., Toth, R., Rofsky, N. M., Genega, E. M., Lenkinski, R. E., and Madabhushi, A., “Central gland and peripheral zone prostate tumors have significantly different quantitative imaging signatures on 3 tesla endorectal, in vivo T2-weighted MR imagery,” J Magn Reson Imaging 36, 213–224 (2012). [22] Peng, H., Long, F., and Ding, C., “Feature selection based on mutual information criteria of maxdependency, max-relevance, and min-redundancy,” IEEE Trans Pattern Anal Mach Intell 27, 1226–1238 (2005). [23] Xiao, G., Bloch, B. N., Chappelow, J., Genega, E. M., Rofsky, N. M., Lenkinski, R. E., Tomaszewski, J., Feldman, M. D., Rosen, M., and Madabhushi, A., “Determining histology-mri slice correspondences for defining mri-based disease signatures of prostate cancer.,” Comput Med Imaging Graph 35(7-8), 568–578 (2011). Proc. of SPIE Vol. 9035 903512-13 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms [24] Chappelow, J., Tomaszewski, J. E., Feldman, M., Shih, N., and Madabhushi, A., “Histostitcher(): an interactive program for accurate and rapid reconstruction of digitized whole histological sections from tissue fragments.,” Comput Med Imaging Graph 35(7-8), 557–567 (2011). [25] Li, B., Christensen, G. E., Hoffman, E. A., McLennan, G., and Reinhardt, J. M., “Establishing a normative atlas of the human lung: Intersubject warping and registration of volumetric CT images,” Acad Radiol 10, 255–265 (2003). [26] Tofts, P. S., Brix, G., Buckley, D. L., Evelhoch, J. L., Henderson, E., Knopp, M. V., Larsson, H. B., Lee, T. Y., Mayr, N. A., Parker, G. J., Port, R. E., Taylor, J., and Weisskoff, R. M., “Estimating kinetic parameters from dynamic contrast-enhanced t(1)-weighted MRI of a diffusable tracer: standardized quantities and symbols,” J Magn Reson Imaging 10, 223–232 (1999). [27] Litjens, G. J. S., Debats, O. A., van de Ven, W. J. M., Karssemeijer, N., and Huisman, H. J., “A pattern recognition approach to zonal segmentation of the prostate on MRI,” in [Med Image Comput Comput Assist Interv], Lect Notes Comput Sci 7511, 413–420 (2012). [28] Olshausen, B. A. and Field, D. J., “Sparse coding with an overcomplete basis set: A strategy employed by v1?,” Vision Research 37, 33113325 (Dec 1997). [29] Gardner, Jr, W. and Culberson, D. E., “Atrophy and proliferation in the young adult prostate.,” J Urol 137, 53–56 (Jan 1987). [30] Rosenkrantz, A. B. and Taneja, S. S., “Radiologist, be aware: ten pitfalls that confound the interpretation of multiparametric prostate MRI,” AJR Am J Roentgenol 202, 109–120 (2014). Proc. of SPIE Vol. 9035 903512-14 Downloaded From: http://proceedings.spiedigitallibrary.org/ on 09/30/2014 Terms of Use: http://spiedl.org/terms

Distinguishing prostate cancer from benign confounders via a

Related documents

Products

Support

Distinguishing prostate cancer from benign confounders via a

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib