International Journal of Engineering Trends and Technology (IJETT) – Volume 11 Number 7 - May 2014 Handwriting Verification : An Approach to Verify Handwriting Using Feature Extraction of Orientation and Curvature V Swetha D. Rao #1 , Rashmi Welekar *2 1 2 M.Tech Scholar, CSE Department, SRCOEM, Nagpur, India Assistant Professor , CSE Department, SRCOEM, Nagpur, India Abstract—The writer identification task lies in the definition of a feature space common to all the handwritten documents. In this study we have extended this principle to the whole document database Identifying handwritten documents is a herculuies task. This paper consists approach used to identify an individual writer’s handwriting by extracting features of orientation , curvature and width of characters. It is the approach to study individual’s characteristics . In this approach we train the classifier with sample words from an writer . Classifier identifies the trainned samples and displays writer’s name present in database otherwise it displays that writer’s name who writing stlye matches to the sample. by segmenting the word into those subunits, and then identifies the units. As an example to identify any word using this approach the letters are to be identified first then they are used for the total word recognition. Holistic approach: It treats the word as a single, indivisible entity and attempts to recognize it using features of the word as a whole. So, in this approach the whole word image features are used for the recognition purposes. III. RELATED WORK Researchers have utilized many different approaches for both the segmentation and Keywords— knn classifier, orientation, curvature, feature recognition tasks of word recognition. Some extraction. researchers have used conventional, heuristic techniques for both character segmentation and I. INTRODUCTION Optical character recognition (OCR), is a recognition some have used a convex hull based program that translates scanned or printed image and recursive contour following algorithms, while document into a text document. Once it is others have used heuristic techniques for translated into text, it can be stored in ASCII or segmentation followed by ANN based methods for UNICODE format. There are several applications the character/word recognition process.Hidden with OCR. Some of the practical applications are as Markov Model based techniques are also used reading aid for the blind, automatic text entry into widely for both offline and online hand-written the computer for desktop publication, library document recognition. cataloging, ledgering, etc. automatic reading for IV. PROPOSED METHOD sorting of postal mail, bank cheques and other The method proposed covers basically five phases documents, document data compression: from image acquisition, pre processing, segmentation, document image to ASCII format, language feature extraction and classification. processing such as indexing, spell checking, grammar checking , multi-media system design. A. Image Acquisition II. CONCEPT OVERVIEW In image scanning recognition system acquires a There can be mainly two approaches for the word scanned image as an input image through a digital recognition purpose for any handwritten documents. scanner or any other suitable digital input device. Analytical approach: It treats words as a collection The input captured may be in gray, color or binary of simpler subunits such as characters and proceeds ISSN: 2231-5381 http://www.ijettjournal.org Page 322 International Journal of Engineering Trends and Technology (IJETT) – Volume 11 Number 7 - May 2014 from scanner or digital camera. This image should E. Classification have a specific format such as JPEG, BMP etc. Image is classified using features that are extracted ie writers are identified in the database. B. Pre Processing As for the known writer we know all features that is The output of the image acquisition is fed as slant, orientation along with area used by writer. As input to the preprocessing step. The raw data of for unknown writer , firstly when image is acquired handwritten characters will be subjected to a it processed and features are extracted. Features number of preprocessing steps to make it useable extracted are matched with database enteries and for the next steps. The preprocessing phase aims to accordingly near matching writer’s name is popped extract the relevant textual parts and prepares them up in menu. for segmentation and recognition. Pre-processing of the image means applying a number of procedures for thresholding, filtering, and normalizing so that final classification can be made simple and more accurate. C. Segmentation An image is present in the form of sequence of characters that is decomposed into sub-images of individual character in the segmentation stage. In this system, labeling process is used for segmentation of pre-processed input image into isolated characters by assigning a number to each character. This labeling provides information about number of characters in the image. D. Feature Extraction Each character has its own differential features, which play an important role in pattern recognition. Feature extraction describes the relevant shape information contained in a pattern so that the task of classifying the pattern is made easy by a formal procedure. The main goal of feature extraction is to obtain the most relevant information from the original data and represent that information in a lower dimensionality space. When the input data to an algorithm is too large and also may be redundant (much data, but not much information) then the input data will be transformed into a reduced representation set of features (also named features vector). A term feature extraction is termed that transforms the input data into the set of features. The features set will be used to extract the relevant information from the input data in order to perform the desired task using this reduced representation instead of the full size input. ISSN: 2231-5381 Fig. 1 Flowchart showing the proposed system V. PROPOSED APPROACH In our approach writer is asked to write statement. First word is segmented from the statement and into characters. Fig.2 Word “ The “ is written by writer (trained sample) When this word is analyed through the classifier features are extracted like curvature, orientation and area along with writer’s name and stored in database. Fig.3 Word “Hello” is written by writer(untrained sample) When classifier checks features of new word to that of existing word if writer is known (already in database) then name is popped up otherwise near to match writer’s name is popped up. http://www.ijettjournal.org Page 323 International Journal of Engineering Trends and Technology (IJETT) – Volume 11 Number 7 - May 2014 REFERENCES [1] [2] [3] Fig.4 Menu window showing writer’s name [4] VI. CONCLUSIONS In our proposed method we are able to identify known writer from trained sample as from untrained samples, similarly we are also able identify near about matching of handwriting from unknown writer. [5] [6] [7] ACKNOWLEDGMENT The writers V. Swetha D. Rao and Rashmi Welekar would like to thank all staff members and authorities of Shri Ramdeobaba College Of Engineering & Management Nagpur, India for their support and motivation. [8] [9] [10] ISSN: 2231-5381 Roberto J. Rodrigues and Antonio Carlos Gay Thomé , “Cursive character recognition – a character segmentation method using projection profile-based technique”, 2010 A. Papandreou and B. Gatos, “A Novel Skew Detection Technique Based on Vertical Projections “, International Conference on Document Analysis and Recognition, 2011 Rafael M. O. Cruz, George D. C. Cavalcanti and Tsang Ing Ren, “ Handwritten Digit Recognition Using Multiple Feature Extraction Techniques and Classifier Ensemble “, 17th International Conference on Systems, Signals and Image Processing, IWSSIP 2010 Sandhya Arora. Debotosh Bhattacharjee, Mita Nasipuri, “Combining Multiple Feature Extraction Techniques for Handwritten Devnagari Character Recognition “,IEEE Region 10 Colloquium and the Third ICIIS, Kharagpur, INDIA December 8-10, 2008 Cheng-Lin Liu, Hiroshi Sako, Hiromichi Fujisawa , “Performance evaluation of pattern classifiers for handwritten character recognition” , International Journal on Document Analysis and Recognition, 2002 Yousri Kessentini, Thierry Paquet and AbdelMajid Benhamadou . “Off-Line Handwritten Word Recognition Using Multi-Stream Hidden Markov Models“,"Pattern Recognition Letters 31, 1 (2010) 60 – 70 Aiquan Yuan, Gang Bai, Po Yang, Yanni Guo, Xinting Zhao, “Handwritten EnglishWord Recognition based on Convolutional Neural Networks”, International Conference on Frontiers in Handwriting Recognition, 2012 Th´eodore Bluche and Hermann Ney and Christopher Kermorvant, “Feature extraction with convolutional neural networks for handwritten word recognition “,ICDAR , 2013 Alessandro Vinciarelli, Samy Bengio, and Horst Bunke, “ Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models “,IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 26, NO. 6, JUNE 2004 V. Swetha D. Rao and Rashmi Welekar, “Survey on Existing Techniques for Writer Verification”, COMPUSOFT , An International Journal of Advanced Computer Technology (IJCET), Volume III, Issue V, May (2014) , ISSN 2320 – 0790 (Online) Pg. 773-776 http://www.ijettjournal.org Page 324