vii ii iii iv

advertisement
vii
TABLE OF CONTENTS
CHAPTER
1
TITLE
PAGE
DECLARATION
ii
DEDICATION
iii
ACKNOWLEDGEMENT
iv
ABSTRACT
v
ABSTRAK
vi
TABLE OF CONTENTS
vii
LIST OF TABLES
xi
LIST OF FIGURES
xii
LIST OF SYMBOLS
xv
LIST OF APPENDICES
xvii
INTRODUCTION
1
1.1
Introduction
1
1.2
Problem Background
2
1.3
Problem Statement
4
1.4
Project Objectives
5
1.5
Scope of the Project
6
1.6
Significant of the Study
6
1.7
Report Organization
7
viii
2
LITERATURE REVIEW
8
2.1
Introduction
8
2.2
Segmentation Categories
9
2.2.1. Threshold Based Segmentation
9
2.2.2. Clustering Techniques
10
2.2.3. Matching
10
2.2.4. Edge Based Segmentation
10
2.2.5. Region Based Segmentation
11
Categories of Variance Text
11
2.3.1.
Lighting Variance
12
2.3.2.
Scale Variance
12
2.3.3.
Orientation Variance
12
2.3
2.4
2.5
Recognition Text
13
2.4.1.
Text Detection
15
2.4.2.
Text Area Identification
15
2.4.3.
Text Region Localization
15
2.4.4.
Text Extraction and Binary Image
16
Analytic Segmentation
17
2.5.1.
Pattern Recognition
17
2.5.2.
Statistical Pattern Recognition
18
2.5.3.
Data Clustering
18
2.5.4.
Fuzzy Logic
19
2.5.5.
Neural Networks
19
2.5.6.
Structural Pattern Recognition
20
2.5.7.
Syntactic Pattern Recognition
20
2.5.8.
Approximate Reasoning Approach to
Pattern Recognition
2.5.9.
Application of Support Vector Machine
(SVM)
2.6
21
21
Pattern Recognition System
21
2.6.1.
22
The Structure of Pattern Recognition
ix
2.7
2.8
2.9
2.6.2.
Application of Pattern Recognition
23
2.6.3.
Character Recognition
23
Run-Length Coding Algorithm
24
2.7.1.
Neighbors
25
2.7.2.
Path
26
2.7.3.
Foreground
26
2.7.4.
Connectivity
27
2.7.5.
Connected Component
27
2.7.6.
Background
28
2.7.7.
Boundary
29
2.7.8.
Interior
29
2.7.9.
Surrounds
30
2.7.10. Component Labeling
30
Properties Text
32
2.8.1.
Removing the Borders
32
2.8.2.
Divide the Text into Rows
32
2.8.3.
Divide the Row “Lines” into the Words
32
2.8.4.
Divide the Word into Characters
34
Identify Character
2.10 Fuzzy Logic
35
35
2.10.1. What Fuzzy Logic?
37
2.10.2. What is the Fuzzy Logic Toolbox?
38
2.10.3. Fuzzy Sets
38
2.10.4. Membership Function
39
2.10.5. If-Then Rules
40
2.10.6. Fuzzy Inference System
41
2.10.7. Rule Review
41
2.10.8. Surface Review
42
2.11 Summary
43
x
3
METHODOLOGY
44
3.1. Introduction
44
3.2. Problem Statement and Literature Review
46
3.3. System Development
46
3.4. Performance Evaluation
47
3.5. General Steps of Proposed Techniques
47
3.6. Proposed Algorithm for Edge Based Text
Region Extraction
3.7. Detection
48
49
3.8. Feature Map and Candidate Text Region
4
Detection
55
3.8.1. Directional Filtering
55
3.8.2. Edge Selection
55
3.8.3. Feature Map Generation
58
3.8.4. Localization
59
3.8.5. Character Extraction
59
3.9. Connection Component
60
3.10.
Fuzzy Logic
65
3.11.
Summary
67
IMPLEMENTATION
68
4.1. Introduction
68
4.2. Input Image
69
4.3. Complement Edge Detect with them
83
4.4. Eight Edge Detection
85
4.5. Image Localization
85
4.6. Separate Text From Background
86
4.7. Reduce Size
88
4.7.1. Determine Borders
88
4.7.2. Divide Text into Rows
89
4.8. Determine Character by Run-Length
90
xi
5
6
REFERENCES
Appendices
RESULTS DISCUSION
95
5.1. Introduction
95
5.2. Discussion on Results
96
5.3. Experimental results and discussion
98
5.4. Project Advantage
108
5.5. Suggestion and Future Works
109
5.6. Conclusion
110
CONCLUSION
111
113-115
116
xi
LIST OF TABLES
TITLE
TABLE NO.
3.1
Results to object to rows
4.2
Results after image scan ,where ST=start, EN=end and
PAGE
63
RW=row
64
4.1
Running time of major step
67
5.1
Performance evaluation 1
105
5.2
Performance evaluation 2
107
5.3
Performance evaluation 3
108
xii
LIST OF FIGURES
TITLE
FIGURE NO.
PAGE
2.1
General model of extraction text
13
2.2
The composition of PR system
22
2.3
Horizontal projection calculated from run-length code
24
2.4
4-and 8-neighborhood for rectangular image location
Pixel [i,j] is located in center
25
2.5
4-path and 8-path
26
2.6
Border of an image
28
2.7
Ambiguous border
28
2.8
A binary image with its boundary ,interior and
surrounds
30
2.9
An image (a) and its connected component image (b)
31
2.10
Divide the text into rows
33
2.11
Divide the rows into the words
34
2.12
Divide the word into characters
35
2.13
Identify character
35
2.14
A classical set and fuzzy set representation of “warm
room temperature”
37
2.15
(a) input of pixel (b) input of location for pixel
39
2.16
Output variable “letter”
40
2.17
Building the system with fuzzy logic
42
xiii
3.1
Proposed method
45
3.2
Block diagram of general steps of proposed approach
48
3.3
Gaussian filter
49
3.4
Sample gaussian pyramid with 8 levels
50
3.5
Extraction Operation
50
3.6
Edge detection
53
3.7
U shape object with runs after pixeltoruns
63
3.8
8-neighborhoods for rectangular image location pixel
[i,j] is located in center of each figure
65
3.9
Identify the character
66
3.10
(a) example of fuzzy input (b) example of fuzzy output
4.1
Original image
56
4.2
Structure 3x3(filter)
70
4.3
Our example of convolution operation
71
4.4
Kernel used
73
4.5
Directions of edge-detection
74
4.6
Structure of convolution
4.7
Operation of kernel 0
4.8
Edge detection
4.9
Effect of adding two edge
84
4.10
Total of edges detection
85
4.11
Localized of text
86
4.12
Separate text from background
87
4.13
Test image1 (a)image (b)localization(c)result
87
4.14
Test image2 (a)image (b)localization(c) result
88
4.15
Determine borders
88
4.16
(a)row one (b) row two
89
4.17
Identified character
90
4.18
Ten input and one output
91
4.19
Input one n1
92
4.20
Output
92
54-55
75-76
77
78-83
xiv
4.21
Output of extracted text
93
5.1
Sample 1
98
5.2
Sample 2
99
5.3
Sample 3
99
5.4
Sample 4
100
5.5
Sample 5
100
5.6
Sample 6
101
5.7
Sample 7
101
5.8
Sample 8
102
5.9
Sample 9
102
5.10
Sample 10
103
xv
LIST OF SYMBOLS
OCR
-
Optical character recognition
CC
-
Connected components
BAG
-
Black adjacency graph
AMA
-
Aligning-and merging analysis
SVM
-
Support vector machine
RLC
-
Run-length code
PR
-
Pattern recognition
SE
-
Structuring element
MFs
-
membership functions
FIS
-
Fuzzy Inference System
xvii
LIST OF APPENDICES
TITLE
APPENDIX
PAGE
A1
Matlab command to find binary image
116
A2
Matlab command used fuzzy logic for identify
116
character
Download