www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 5 Issue 8 August 2016 Page No. 17658-17661 A Review on Plant Leaf Classification and Segmentation Lakhvir Kaur1, Dr. Vijay Laxmi2 ¹Research Scholar M-Tech, Computer Science and Engineering, Guru Kashi University, India ²Dean, UCCA, Guru Kashi University, India 1 lakhvirgrewal92@gmail.com, 2cse_vijay2003@yahoo.co.in Abstract: A leaf is an organ of vascular plant and is the principal lateral appendage of the stem. Each leaf has a set of features that differentiate it from the other leaves, such as margin and shape. This paper proposes a comparison of supervised plant leaves classification using different approaches, based on different representations of these leaves, and the chosen algorithm. Beginning with the representation of leaves, we presented leaves by a fine-scale margin feature histogram, by a Centroid Contour Distance Curve shape signature, or by an interior texture feature histogram in 64 element vector for each one, after we tried different combination among these features to optimize results. We classified the obtained vectors. Then we evaluate the classification using cross validation. The obtained results are very interesting and show the importance of each feature. the classification of plant leaf images with biometric features. Traditionally, the trained taxonomic perform this process by following various tasks. The taxonomic usually classify the plants based on flowering and associative phenomenon. It was found that this process was time consuming and difficult. The biometric features of plants leaf like venation make this classification easy. Leaf biometric feature are analyzed using computer based method like morphological feature analysis and artificial neural network based classifier. KNN model take input as the leaf venation morphological feature and classify them into four different species. Keyword - Artificial neural network, Canny edge detection, kNN Classification, Leaf venation pattern, Morphological Features I. INTRODUCTION Plant diseases have turned into a dilemma as it can cause significant reduction in both quality and quantity of agricultural products. In India 70% of the population depend on agriculture. Farmers have wide range of diversity to select suitable Fruit and Vegetable crops. However, the cultivation of these crops for optimum yield and quality produce is highly technical. It can be improved by the aid of technological support. The management of perennial fruit crops requires close monitoring especially for the management of diseases that can affect production significantly and subsequently the postharvest life. In [2] the authors have worked on the development of methods for the automatic classification of leaf diseases based on high resolution multispectral and stereo images. Leaves of sugar beet are used for evaluating their approach. Sugar beet leaves might be infected by several diseases, such as rusts (Uromyces betae), powdery mildew (Erysiphe betae). Disease is caused by pathogen which is any agent causing disease. In most of the cases pests or diseases are seen on the leaves or stems of the plant. Therefore identification of plants, leaves, stems and finding out the pest or diseases, percentage of the pest or disease incidence , symptoms of the pest or disease attack, plays a key role in successful cultivation of crops. It is found that diseases cause heavy crop losses amounting to several billion dollars annually. Disease management is a challenging task. Mostly diseases are seen on the leaves or stems of the plant. Precise quantification of these visually observed diseases, pests, traits has not studied yet because of the complexity of visual patterns. Hence there has been increasing demand for more specific and sophisticated image pattern understanding. In biological science, sometimes thousands of images are generated in a single experiment. These images can be required for further studies like classifying lesion, scoring quantitative traits, calculating area eaten by insects, etc. Almost all of these tasks are processed manually or with distinct software packages. It is not only tremendous amount of work but also suffers from two major issues: excessive processing time and subjectiveness rising from different individuals .Hence to conduct high throughput experiments, plant biologist need efficient computer software to automatically extract and analyze significant content. Here image processing plays important role. Lakhvir Kaur1, IJECS Volume 05 Issue 08 August 2016 Page No.17658-17661 Page 17658 DOI: 10.18535/ijecs/v5i8.41 II. IMPORTANCE In object recognition research a lot have been done about general features extraction or recognition between different classes of objects. In case of a species domain recognition, 10 taking into account the unique characteristics that belong to this category, improves the performance of the system. Despite the high technical aspect of this project, dealing with leaves, gives it a biological connotation. Some basic knowledge about leaves have to be learned and concepts about how the biologists themselves recognize leaves has to be studied. The next two paragraphs are devoted to these experiences. Precision Botany (PB) refers to the application of new technologies in plant identification. Computer vision can be used in PB to distinguish plants from its species level, so that an identification can be applied on the size and number of plants detected for the classification purpose. This is focused on the application of computer vision for identification purposes of species in Stemonoporus genus. Surveys reveal that there are 3711 flowering plant species in Sri Lanka [4]. Out of these, 926 are endemic [5, 6]. Since some of these have minute variations, identification of these species has become difficult. Accurate and speedy identification of plants has become a time consuming and a fuzzy work due to nonavailability of a computerized scientific plant identification system. Design and implementation of image-based plant classification system is a long felt. Biologists receive a large number of requests to identify plants for people, many species of plants look very similar on their leaves, and botanists will turn to identifying the species based on their structure or other morphologies. There are three main parts to a leaf: 1. The base which is the point at which the leaf is joined to the stem. 2. The stalk or petiole is the thin section joining the base to the lamina - it is generally cylindrical or semicircular in form. 3. The lamina or leaf blade is the wide part of the leaf Leaves can be of many different shapes: Primarily, leaves are divided into simple - a single leaf blade with a bud at the base of the leafstem; or compound - a leaf with more than one blade. All blades are attached to a single leafstem. Where the leafstem attaches to the twig there is a bud. Leaves may be arranged on the stem either in an alternate arrangement – leaves that are not places directly across from each other on the twig; or in an opposite arrangement – 2 or 3 leaves that are directly across from each other on the same twig. The margin (the edge of a leaf) as seen in Figure 3.2 may be entire, singlytoothed, doubly-toothed, or lobed. Compound leaves may be palmate – having the leaflets arranged round a single point like fingers On the palm of a hand; or pinnate – when the leaves are joined on the two sides of the stalk, like the vanes of a feather as seen in the Figure 1.3. Leaf arrangements are pretty straightforward to figure out. Need to look for the nodes and then determine how many leaves are coming off each node. If there’s only one leaf per node, then need only to determine whether the arrangement is alternate or spiral, and it’s usually pretty obvious. Figure 1. Simple leaves – Margin structure. III. LITERATURE SURVEY Smita Naikwadi et.al.[2013] We propose and experimentally evaluate a software solution for automatic detection and classification of plant leaf diseases. Studies of plant trait/disease refer to the studies of visually observable patterns of a particular plant. Nowadays crops face many traits/diseases. Damage of the insect is one of the major trait/disease. Insecticides are not always proved efficient because insecticides may be toxic to some kind of birds. It also damages natural animal food chains. The following two steps are added successively after th segmentation phase. In the first step we identify the mostly green colored pixels. Next, these pixels are masked based on specific threshold values that are computed using Otsu's method, then those mostly green pixels are masked. The other additional step is that the pixels with zeros red, green and blue values and the pixels on the boundaries of the infected cluster (object) were completely removed. The experimental results demonstrate that the proposed technique is a robust technique for the detection of plant leaves diseases. The developed algorithm’s efficiency can successfully detect and classify the examined diseases with a precision between 83% and 94%, and can achieve 20% speedup over the approach proposed in [1]. Abdul Kadir et.al. [2012]This paper reports the results of experiments in improving performance of leaf identification system using Principal Component Analysis (PCA). The system involved combination of features derived from shape, vein, color, and texture of leaf. PCA was incorporated to the identification system to convert the features into orthogonal features and then the results were inputted to the classifier that used Probabilistic Neural Network (PNN). This approach has been tested on two datasets, Foliage and Flavia, that contain various color leaves (foliage plants) and green leaves respectively. The results showed that PCA can increase the accuracy of the leaf identification system on both datasets.[2] Samuel E. Buttrey et.al. [2002] We construct a hybrid(composite) classi&er by combining two classi&ers in common use— classi&cation trees and k-nearest-neighbor (kNN). In our scheme we divide the feature space up by a classi&cation tree, andthen classify test set items using the k-NN rule just among those training items in the same leaf as the test item. This reduces somewhat the computational loadassociatedwith k-NN, andit produces a classi&cation rule that performs better than either trees or the usual k-NN in a number of well-known data sets. [3] Prof. Meeta Kumar et al. [2013]In this paper we present survey on various classification techniques which can be used for plant leaf classification. A classification problem deals with Lakhvir Kaur1, IJECS Volume 05 Issue 08 August 2016 Page No.17658-17661 Page 17659 DOI: 10.18535/ijecs/v5i8.41 associating a given input pattern with one of the distinct classes. Plant leaf classification is a technique where leaf is classified based on its different morphological features. There are various successful classification techniques like kNearest Neighbor Classifier, Probabilistic Neural Network, Genetic Algorithm, Support Vector Machine, and Principal Component Analalysis. Deciding on the method for classification is often a difficult task because the quality of the results can be different for different input data. Plant leaf classifications has wide applications in various fields such as botany, Ayurveda, Agriculture etc. The goal of this survey is to provide an overview of different classification techniques for plant leaf classification.[4] Gurpreet kaur et.al.[2012] Plants play an important role in human life and provide required information for the development of human society. The urgent situation is that due to environmental degradation and lack of awareness, many rare plant species are at the risk of extinction so it is necessary to keep record for plant protection. We believe that the first step is to buildup a database for protecting plants. So the need arises to teach a computer how to classify plants. Despite the great advances made in Botany, there are many plants which are still unknown.This research focuses on using digital image processing for the purpose of automate classification and recognition of plants based on the images of the leaves. In this paper we review leaf architecture and various techniques for automated plant classification and recognition. [5] Comparison Table : Methods Advantages of classifica tion K-NN 1. Simpler classifier as exclusion of any training process. 2. Applicable in case of a small dataset which is not trained. SVM 1. Simple geometric interpretation and a sparse solution. 2. Can be robust, even when training sample has some bias. ANN 1. Tolerant of noisy inputs. 2. Instances classified by more than one output. 1. Easy to implement. 2. Applicable to wide range of problems. Backprop agation Network (BPN) Disadvantages 1. Speed of computing distance increases according to numbers available in training samples. 2. Expensive testing of each instance and sensitive to irrelevant inputs. 1. Slow training. 2. Difficult to understand structure of algorithm. 3. Large no. support vectors are needed from training set to perform classification task 1. Long training time. 2. Large complexity of network structure. 3. Need lot of memory for training data 1. Learning can be slow. 2. It is hard to know how many neurons as well as layers are required. 3. Able to form arbitrarily complex nonlinear mappings Radial Basis Function (RBF) or Naive Bayesian 1. Training phase is faster. 2. Hidden layer is easier to interpret. 1. It is slower in execution when speed is a factor. IV. PROBLEM STATEMENT Plant leaf classification is the major problem and also their diseases are also major problem. Most of plant leaf have attacks by snail, worm and fungi. Furthermore, when the leaf had been infected or attacked, the others areas had been exposed to be infected. Thus, it will decrease the life of plant. There is the overlapping of the plant diseases problem. Plants play an important role in our lives, without plants there will not be the existence of the ecology of the earth. The large amount of leaf types now makes the human being in a front of some problems in the specification of the use of plants, the first need to know the use of a plant is the identification of the plant leaf. The problem with a number of these techniques is that they require some manual intervention such as correctly orienting the image or identifying the end points of the leaf’s main vein. S. R. Deokar (2013) has worked on leaf recognition by extracting 28 and 60 Feature point. These features extracted by vertical and horizontal splitting the leaf images. ANN is used to compare performance of leaf recognition. • • V. SCOPE The users of this system are paddy farmers. The prototype will be develop by using MATLAB 2013 a. 10 samples each of the normal, brown spot disease, narrow brown spot disease and blast disease will be used in this project. VI. CONCLUSION Plants play an important role in our lives, without plants there will not be the existence of the ecology of the earth. The large amount of leaf types now makes the human being in a front of some problems in the specification of the use of plants, the first need to know the use of a plant is the identification of the plant leaf. This work proposed a comparison of supervised classification of plant leaves, where we used to represent species in seven different representations, using three features extracted from binary masks of these leaves: a finescale margin feature histogram, by a Centroid Contour Distance Curve shape signature, and by an interior texture feature histogram. Results were very interesting in a way that gives as clear ideas: In term of representation: we can differentiate leaves by its margin better than shape or texture, but, experiments shown in this study prove our idea: the more we combine these features, the more precise the difference between samples is and that is what gives better results in classification. In term of classification: distance based algorithms give the best result for plant leaves classification. So, we can conclude that these algorithms are the most suitable for that task. On the other hand, the approach based on decision tree Lakhvir Kaur1, IJECS Volume 05 Issue 08 August 2016 Page No.17658-17661 Page 17660 DOI: 10.18535/ijecs/v5i8.41 gives the worst results because of the overfitting problem. In general, a learning algorithm is said to overfit relative to a simpler one if it is more accurate in fitting known data but less accurate in predicting new data. Use of the three features proved that there is some information more important than other. We discovered that margin representation can affect results more than the shape of the leaf. However, the combination of the three features gives the best result. To solve this problem, we plan, as future work, use of feature extraction algorithms, like PSO, to clean dataset and keep the important information in order to optimize the obtained results and avoid overfitting problem posed by decision tree algorithm. We plan also to use bioinspired algorithms. They are part of a new research domain that is becoming more important due to its results in different areas. REFERENCES [1]. Smita Naikwadi et.al.―ADVANCES IN IMAGE PROCESSING FOR DETECTION OF PLANT DISEASES‖ International Journal of Application or Innovation in Engineering & Management (IJAIEM) Web Site: www.ijaiem.org Email: editor@ijaiem.org, editorijaiem@gmail.com Volume 2, Issue 11, November 2013. [11]. C. Mallah, J. Cope, and J. Orwell ,"Plant leaf classification using probabilistic integration of shape, texture and margin features," Signal Processing, Pattern Recognition and Applications, 2013 [12]. S. Zhang, and K. W. Chau, "Dimension reduction using semisupervised locally linear embedding for plant leaf classification," Emerging Intelligent Computing Technology and Applications, Springer Berlin Heidelberg, 2009, pp. 948-955 [13]. A. Kadir, L. E. Nugroho, A. Susanto, and P. I. Santosa, "Leaf classification using shape, color, and texture features , arXiv preprint arXiv:1401.4447, 2013 [14]. A. H. M. Amin, and A. I. Khan, "One-shot Classification of 2-D Leaf Shapes Using Distributed Hierarchical Graph Neuron (DHGN) Scheme with k-NN Classifier," Procedia Computer Science, 24, 2013, pp.84-96 [2]. Abdul Kadir et.al. ―Performance Improvement of Leaf Identification System Using Principal Component Analysis‖ International Journal of Advanced Science and Technology Vol. 44, July, 2012 [3]. Samuel E. Buttrey et.al. ―Using k-nearest-neighbor classi&cation in the leaves of a tree‖ Computational Statistics & Data Analysis 40 (2002) 27 – 37 www.elsevier.com/locate/csda. [4]. Prof. Meeta Kumar et al. ―Survey on Techniques for Plant Leaf Classification‖ International Journal of Modern Engineering Research (IJMER) www.ijmer.com Vol.1, Issue.2, pp-538-544 ISSN: 2249-6645. [5]. Gurpreet kaur et.al. ―Classification of Biological Species Based on Leaf Architecture–A review‖ IRACST - International Journal of Computer Science and Information Technology & Security (IJCSITS), ISSN: 2249-9555 Vol. 2, No.2, April 2012. [6]. Amlekar Manisha, Manza R.R, Yannawar Pravin,(2013), Leaf classification based on leaf dimension biometric features of leaf shape using k-means classifier, NCAC, Jalgoan. [7]. Amlekar Manisha, Manza R.R, Yannawar Pravin, Gaikwad B.P,(2013), Image data mining for classifying leaf dimension biometric features of leaf shape using KNN classification technique, CMS, [8]. Aurangabad. Beghin T., Cope J. S., Remagnino P. and Barman S., (2010), Shape and texture based plant leaf Classification, ACIVS, 2, 345–353. [9]. J. S., Remagnino P., Barman S., and Wilkin P., (2010), Plant texture classification using gabor co-occurrences,‖ in Proceedings of the 6 th international conference on Advances in visual computing,2, 669– 677. [10]. Deokar S. R., Zope P. H. Suralkar S. R ,(2013), Leaf Recognition Using Feature Point Extraction and Artificial Neural Network, International Journal of Engineering Research & Technology (IJERT), ISSN: 2278-0181, 2(1). Ehsanirad, (2010), Plant classification based on leaf recognition, International Journal of Computer Science and Information Security, 8(4), 78–81. Lakhvir Kaur1, IJECS Volume 05 Issue 08 August 2016 Page No.17658-17661 Page 17661