sequential analysis of medical images using neural networks

Sequential Analysis of Medical Images Using Neural Networks Marek Czeleń, Grzegorz Wojtas, Zbigniew Mikrut1 Institute of Automatics, University of Mining and Metallurgy Mickiewicza 30, 30-059 Kraków, Poland e-mail: zibi@biocyb.ia.agh.edu.pl Abstract. A problem frequently encountered in the analysis of medical images is reaching a correct and reliable diagnosis of the examined organ, particularly by persons with little experience, e.g. first year medical students. That fact has become an inspiration for construction of an assisting program, based on application of three neural networks: 1) moving the attention point along the selected object, 2) elaborating the diagnosis for the area in the current viewport, 3) reporting the end of the examination. In the present work a summary is done for almost a hundred of experiments, testing the efficiency of all those neural networks in the analysis of raw medical images. The final version of the algorithm uses backpropagation type networks: it operates on square areas which are 31x31 pixels in size for the first and third networks and 91x91 pixels in size for the second (diagnosing) network. The achieved level of correct recognition (about 80%) is a very good result, if one takes into account rather low quality (dynamic range) of the analysed images and the observed natural interferences. 1 Introduction With the increasing presence of the equipment for digital recording and visualisation of medical images more and more needs are expressed, concerning not only the quality increase of the registered images but also their intelligent analysis. Such systems are narrowly specialised applications, the role of which is to suggest the answer, leaving the final decision to the human staff. Most of such applications performs an overall analysis of the selected object, applying various preprocessing methods, segmentation, detection and classification of preselected features, both by neural [3, 8] and non-neural [5] methods. An interesting alternative is provided by a sequential object analysis, for which a distant and unequal led ideal is the human vision system. It seems the neural networks are one of the few possibilities for realisation of that technique [6, 1]. This concerns both implementation of moving the so called "attention point" and the analysis (diagnosing) of the currently "seen" image window. In order to preserve a consistent approach to the problem for the present study it was decided that raw data or slightly preprocessed medical image data will be 1 The work was partially supported by the Polish Committee for Scientific Research under grant No. T11E 007 17. used as input. It reflects the authors' belief that an adequately complicated neural network will be able to extract the interesting image areas by itself. Other concepts of recognition based on raw images can be found in papers [7, 2, 4]. The material selected for the study consisted of ERCP1 X-ray images, registered for several patients (see Fig. 3). The task for the created application was a preliminary examination of the pancreatic duct area preselected by the human operator. During the pancreas area tracking the application should be able to provide a current diagnosis, looking for the following pathological changes:  branches characteristic for the acute pancreas inflammation  swellings and constrictions characteristic for the pancreas cancer  presence of cysts. 2 Methods and Experiments In order to select an appropriate form of the input data and the neural network structures, several experiments have been carried out, during which the networks were learned and their structure was being adjusted. Therefore it is not possible to strictly divide the description of the study into the classical stages, i.e. design of the input data form, choice of the network structure and the experiments themselves. The concepts presented below are results from positively completed tests. 2.1 Task Distribution The artificial neural networks operating on raw images are able to perform well in only one learned task. That is a limitation for the sequential image analysis procedure, because for a complete analysis of the pancreatic duct image in each step it is necessary to determine the motion direction, elaborate the local diagnosis and possibly evaluate the condition for the algorithm termination. Therefore it has been decided that three independent networks will be used: 1) the first network determined the motion direction for the "focus of attention" point using a small (31x31 pixels) area of the image, what was equivalent to determination of a new actual position 2) the second network elaborated the local diagnosis at the actual position using a bigger image area than the previous network (91x91 pixels), which was reduced to the area 31x31 pixels in size 3) the third network, operating on the same image area as the diagnosing network, determined whether any part of the pancreatic duct can be seen. It was one of the conditions required for program termination, in the case when all the objects has been analysed or in a situation when the first network lost the tracked object. 1 Endoscopic Retrograde CholangioPancreatography 2.2 Construction of the Learning Sequences During the construction of the learning sequences the main objective was to ensure that they are properly representative. For the network setting the motion direction the images and the respective motion directions have been selected in such a way that the direction distribution was reasonably uniform or the forward motion was slightly preferred. Obviously the images should be consistent (noncontradicting) what has been finally ensured by proposing a concept of "virtual vehicle" (see sect. 2.4). Another problem was the necessity of making a decision concerning the choice of the algorithm used for determination of the motion direction for the "narrow" and "wide" objects (see Fig. 3a and 3b respectively). When the pancreatic duct is narrow, then bright areas can be seen on both sides. On the other hand when it is wide, then the bright area can be detected on one side only. Frequently other objects are visible in the image window, which can interfere in the process of direction determination (e.g. in almost every image a duodenoscope tube - a device for dosage the contrast medium - can be seen). Such situations have also been taken into account during the construction of the learning sequence. It was also ensured that the numbers of images representing particular diseases (pathological conditions) were approximately the same. The images have been registered at different angles, so that after the learning stage the network would be able to carry out a proper evaluation. In the construction of the learning set for the network responsible for analysis termination both the images containing parts of the pancreatic duct and images without such parts have been taken into account. The ratio of the respective images was kept close to the 1:1. 2.3 Preprocessing The ERCP image express particularly low dynamic range (see Fig. 1 upper row). The preliminary experiments have shown that the neural network was not able to correct the dynamic range on its own, and - what is a consequence of the above that execution of a simple preprocessing essentially affects the obtained results. From several examined methods of contrast enhancement based on the histogram analysis one was selected, consisting of histogram stretching to the full brightness range accompanied by aggregation of 10% brightest and 10% darkest pixels. Several examples of application of the above mentioned operation are shown in Fig.1. Fig. 1. Image contrasting using the histogram analysis 2.4 Moving the Focus of Attention The best results have been obtained by implementing the "virtual vehicle" concept, in which the window to be analysed behaves like a small vehicle, moving along the pancreatic duct as if it was a road. During consecutive steps the window is turned in such a way that its centre is kept in the same location, while the lateral edge (with respect to the motion direction) is kept parallel to the segment connecting the actual analysis window centre with its previous location (Fig.2) previous locations Previous location Fig.2.The virtual vehicle concept For such a situation the network "sees" similarly oriented images and is able to determine the direction of motion for the attention point with its proper orientation ("forward" direction). The rotation angle is determined using the actual and previous location of the window centre by the program. Therefore the neural network should be able to recognise relative direction (with respect to the previous motion direction) within the [-90,90] degrees. The tests have not confirmed the utility of methods based on the log-polar transform and similar concepts. It was a direct consequence of considerable information averaging near the edges of the input image area, what is an advantage only when the tracked object is within the window. The attempts of application of "receptor fields", detecting lines of various inclinations have lead to better results than the transforms mentioned above, but still the results were worse then for the pure contrast enhancement. An attempt of applying a Kohonen network, as a feature detecting network, did not work at all. Also the combining of contrasted image with the information taken from the receptor field or with the Kohonen network did not improve the correctness of the network operation (see Tab. 1). 3 Results and Discussion The structures of all the networks tested are listed in Table 1, while several examples of analysis are presented in Fig.3. In the cases when the pancreatic duct was located too close to other objects entering the "region of interest" of the direction determining network or when it crossed other objects (Fig.3a) the network was not able to determine properly the motion direction because the information about context (surrounding) was not sufficient. Possible extension of the window on one hand would increase the amount of available information, but on the other hand it would complicate the analysis by introducing more visible organs. a) b) c) Fig. 3. (a) Analysis of a regular pancreas; NN diagnosis: 84.6% of the pancreas length does not exhibit any pathological changes. (b) Pancreatic duct with cysts; NN diagnosis: 38.5% of the pancreas length exhibits cysts, 22.9% exhibits constrictions. (c) Regular pancreatic duct; NN diagnosis: 78.1% of the pancreas length does not exhibit pathological changes. Table 1. Summary of data representations and network architectures tested (X) Data representation Log-Polar Network structure [type, inputs, structure] Bp, (8x5), (100:100:2) Bp, (12x6), (100:100:8) Retina Bp, (68), (15:15:5) Bp, (130), (15:15:5) Bp, (68), (100:100:16) Square window Bp, (31x31), (15:15:5) of analysis Bp, (31x31), (100:100:2) Bp, (31x31), (100:100:5) Bp, (31x31), (25:25:5) Bp, (31x31), (10:2) Receptor fields Bp, (var), (15:15:5) Hybrid netKh, (31x31), (5:5) + Bp, (25), (15:15:5) work Kh, (31x31), (5:5) + Bp, (25), (5:5:5) Kh, (31x31), (5:5) + Bp, (31x31+5), (15:15:5) Bp – backpropagation Kh – Kohonen SOM 1 X X X X X X X X NN 2 3 X X X X X X The pancreases of regular shapes are better suited for the analysis than the pathological ones (see Fig. 3). Particularly difficult cases are the ducts with cancer symptoms, because the constrictions in their sections are seen as discontinuities, what may lead to errors in the procedure of determination of motion direction. For the diagnosing network the interference from another objects (e.g. the spine) has lead to erroneous detection of branching. However the network used for determination of the conditions of analysis termination correctly reported the fact of absence of the pancreatic duct on its ends or in the case of deviation from the proper route within the duct area. The errors were generated mainly on discontinuities, which the network was not able to distinguish from the situation when two various objects were present. 4 Conclusions The obtained results prove that the analysis of raw images using neural networks is possible and effective. The diagnostic recognition level about 80% presents a very good result, if the low quality of the ERCP images and the presence of natural interference are taken into account. The result is comparable to the 85% recognition level reported in the paper [5], whereas it should be mentioned that in the cited work structural descriptions (grammars) have been applied to a preselected (binary) object. The presented method can be also applied to the analysis of medical images containing objects of classes similar to the ERCP images, e.g. coronary vessels or eyeball vessels. Having this in mind refinement of the whole algorithm is planned, by introduction of feedback worked-out in the diagnosing network. The feedback's task would include a temporary reduction of the step length between consecutive attention shifts (at present the step length is constant) in order to provide a possibility of more detailed analysis by the recognising network in the cases which are difficult to diagnose. That would be a next step towards the functional similarity of the proposed algorithm to the movements of the human eyeball, executed during realisation of the diagnostic task, which have been registered in our Biocybernetic Laboratory. References 1. van de Laar P., Heskes T., Gielen S. (1997) Task-Dependent Learning of Attention. Neural Networks vol.10 no 6, Pergamon 2. Lindblad T., Lindsey C., Padgett M., Vanyachine A., Zanevsky I. (1998) Digital X-Ray image processing using biologically inspired methods. Proc. IJCNN, Anchorage, Alaska. 3. Mikrut Z., Tadeusiewicz R. (2000) Neural Networks applied to digital image processing and recognition, in: Biocybernetics and Biomedical Engineering, Vol. 6, Ed. Nałęcz M. AOW EXIT, Warsaw (in Polish) 4. Neubauer C. (1998) Evaluation of Convolutional Neural Networks for Visual Recognition. IEEE Trans. Neural Networks, Vol. 9, No. 4 5. Ogiela M., Tadeusiewicz R. (2000) Syntactic Pattern Recognition for X-Ray Diagnosis of Pancreatic Duct. Eng. in Medicine and Biology, IEEE Magazine, 19, 6. 6. Postma E., Herik H., Hudson P. (1997) SCAN: A Scalable Model of Attentional Selection. Neural Networks vol.10 No. 6, Pergamon 7. Sjostrom P. J., Wahlberd L.U. (1999) Automated cell recognition and counting based on combination of artificial neural networks and standard image analysis methods. Proc. 3rd Int. Conf. Cognitive and Neural Systems, Boston 8. Żurada J. M. (1992) Introduction to artificial neural networks. West Publ., St. Paul New York.

sequential analysis of medical images using neural networks

Related documents

Products

Support

sequential analysis of medical images using neural networks

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib