Fast Retrieval of Images Using Filtered HSV Color Level

advertisement
International Journal of Emerging Technology and Advanced Engineering
Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 3, Issue 8, August 2013)
Fast Retrieval of Images Using Filtered HSV Color Level
Detection
Durgesh Nirapure, Prof. Udaypal Reddy
Department of Computer Science Engineering, IES College of Technology Bhopal, India
Models of Color Levels for Feature Extraction
The extraction of the color features for each of the four
methods is performed in the HSV (hue, saturation and
value) which is converted from RGB color space hsv, to
perceptual color space, where Euclidean distance
corresponds to the human visual system’s notion of
distance or similarity between colors.
Abstract-- Image searching is the wide area of research in
the field of data mining and in web mining also. This work
proposes the method which efficiently searches the images
from database with high speed as well as high performance.
The major goal of retrieval of images from a large database is
the process of searching images based on color levels in HSV
color space. The HSV color space based retrieval of images
has wide area of application in the field of data mining. In this
work the method works on the higher color level in the
database and retrieves the most appropriate results in
descending order. The aim of this work is to explore efficient
way based on color to search images in database with
considerable speed.
(a) Color Level Histogram
The conventional color histogram (CCH) of an image
indicates the frequency of occurrence of every color in the
image. From a probabilistic perspective, it refers to the
probability mass function of the image intensities. It
captures the joint probabilities of the intensities of the color
channels.
The CCH can be represented as
Keywords-- Image Retrieval, HSV Color space.
I. INTRODUCTION
The evolution of technology has become the wonder of
universe and this leads the faster and more efficiently
exchange of information in the form of images in parallel
the advancement with the large storage technology the
collection of images has the fashion and facilitates the wide
areas to execute the applications. As the storage is large
there is need to fetch images from large collection for our
concern only and this is the point where we need the data
mining algorithms to perform such things. Consequently,
the search for relevant information in the large storage of
image databases has become more challenging. The main
challenge lies in the advancement of retrieval speed with
considerable accuracy. How to achieve accurate retrieval
results are still a challenging and an unsolved research
problem. A typical image retrieval system includes three
major components: feature extraction, high dimensional
indexing and system design [2]. In this work, we study the
first component; that of low-level feature extraction, and
we attempt to answer the following question: What are the
color features that need to be extracted from an image, in
order to achieve the highest retrieval performance, at a
relatively low computational speed? The main contribution
of this work is a comprehensive comparison of color
feature extraction approaches.
h A,B,C(a,b,c) = N. Prob(A=a, B=b, C=c)
Where A, B and C are the three color channels and N is
the number of pixels in the image [3].
Computationally, it is constructed by counting the
number of pixels of each color (in the quantized color
space).
(b) Color histogram based on Fuzzy Logic
In the fuzzy color histogram (FCH) approach, a pixel
belongs to all histogram bins with different degrees of
membership to each bin. More formally, given a color
space with K color bins, the FCH of an image I is defined
as F(I)=[f1,f2,…fk] where
Where N is the number of pixels in the image and µij is
the membership value of the jth pixel to the ith color bin,
and it is given by µij = 1/(1 + dij/ ς ), where dij
is the Euclidean distance between the color of pixel j(a 3‐
dimensional vector of the H, S and V components),
and the ith color bin, and ς is the average distance
between the colors in the quantized color space.
414
International Journal of Emerging Technology and Advanced Engineering
Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 3, Issue 8, August 2013)
(c) The Color Correlogram
The color correlogram (CC) expresses how the spatial
correlation of pairs of colors changes with distance. A CC
for an image is defined as a table indexed by color pairs,
where the dth entry at location (i,j) is computed by counting
number of pixels of color j at a distance d from a pixel of
color i in the image, divided by the total number of pixels
in the image.
its colour and texture provides a better result than using a
single colour feature since two features are now used as
indicator during matching process.
In this research we focus on the method and strategies to
retrieve images by using both colour and texture feature
vectors to produce more accurate results.
III. SYSTEM MODEL
Color feature of HSV
We evaluate the content based image retrieval HSV
color space of the images in the database. The HSV
stands for the Hue, Saturation and Value, provides the
perception representation according with human visual
feature. The HSV model, defines a color space in terms
of three constituent components: Hue, the color type Range
from 0 to 360. Saturation, the "vibrancy" of the color:
Ranges from 0 to 100%, and occasionally is called the
"purity". Value, the brightness of the color: Ranges from
0 to 100%. HSV is cylindrical geometries, with hue, their
angular dimension, starting at the red primary at 0°,
passing through the green primary at 120° and the blue
primary at 240°, and then back to red at 360° [8, 9]. The
HSV planes are shown as Figure 1.
(d) Color Indexing
In color indexing, given a query image, the goal is to
retrieve all the images whose color compositions are
similar to the color composition of the query image.
Typically, the color content is characterized by color
histograms, which are compared using the histogram
intersection distance measure[3].
Content-based image retrieval system attempts a tradeoff between the two techniques and combines the features
of both. The attributes commonly used in CBIR are colour,
texture, shape and motion of which shape is the key
attribute. The images are searched from a multimedia
database by searching the content of the stored images and
retrieving images similar to the input query image. The
database can be considered as a cluster of databases and
each cluster having maximum similarity between images
and least inter-cluster similarity [7]. It proposes to search
within each cluster for the query image. The relevance
feedback strategy is used in displaying search results. The
results are usually returned to the user in the form of
multimedia presentation as proposed in Ref [6]. The
relevance feedback strategy refines the output according to
the similarity with the query image. In a typical CBIR
system object boundaries are searched for the query image.
This can be done using active contour model with which
one can easily detect an object boundary.
II. CURRENT IMAGE RETRIEVAL APPROACHES
CBIR is basically a two step process which is Feature
Extraction and Image Matching (also known as feature
matching). Feature Extraction is the process to extract
image features to a distinguishable extent. Information
extracted from images such as colour, texture and shape are
known as feature vectors. The extraction process is done on
both query images and images in the database. Image
matching involves using the features of both images and
comparing them to search for similar features of the images
in the database [5]. Using multiple feature vectors to
describe an image during retrieval process increases the
accuracy when compared to the retrieval using single
feature vector. For example, searching of image based on
Figure 1. The Different planes of HSV color space
The quantization of the number of colors into several
bins is done in order to decrease the number of colors used
in image retrieval, J.R. Smith [5] designs the scheme to
quantize the color space into 166 colors. Li [12] design
the non-uniform scheme to quantize into 72 colors. We
propose the scheme to produce 15 non-uniform colors.
415
International Journal of Emerging Technology and Advanced Engineering
Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 3, Issue 8, August 2013)
The formula that transfers from RGB to HSV is defined as
below:
not complete image than prepare another database which
contains features only.
Than doing comparison with the query image makes
comparison faster. After that similarity matrix preparation
order it with descending similarity, fetch indices of the
most similar records and then display the retrieved images.
The flow of algorithm is explained in the flow chart, it also
explain the how the one test process or retrieval of images
from large database is done. The below flow chart showing
the steps to search images in figure 3:
Start
The R, G, B represent red, green and blue components
respectively with value between 0-255. In order to obtain
the value of H from 0o to 360 o, the value of S and V
from 0 to 1, we do execute the following formula:
Remove noise from RGB images and
calculate HSV values
H= ((H/255*360) mod 360
V= V/255
S= S/255
Prepare Database of HSV values of
Images
Proposed Approach:
Find Levels of Hue, Saturation and Value
in image and normalize it
Choose test image to search from similar
images from database and preprocess it
Compare with database records
Fig.2 Block diagram of proposed methodology
The proposed approach to retrieve the images from
database using color level is the method which is based on
the color level present search of things which has the
practical aspects in many of the day to day applications in
the security, traffic, surveillance as well as in the data
mining or in web mining of images needed. The best part
of this proposed approach to extract color features from the
images of the database and then search accordingly. That
gives us the way faster classification of images from large
stake.
The block diagram of proposed methodology is showing
the major blocks of proposed system. The first block is the
input query image i.e. test image we want to search similar
images from databases. Than preprocess the test image to
compare with the database images that is previously
processed. The comparison from database with lots of
images takes time so there is a need which reduces the size
of database to compare so the preprocessing of database is
the actual process of extracting features only from images
Arrange similar results in descending
order
Display results
End
Figure 3. Flow chart of Proposed System
IV. S IMULATION RESULTS
The simulation a result of the proposed system is given
below the whole simulation process is completed in the
following experiments. The experiments work as explained
in the flow chart of proposed approach. In the results first
image is the query image and the next one is the retrieved
results.
416
International Journal of Emerging Technology and Advanced Engineering
Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 3, Issue 8, August 2013)
Experiment 1:
In this experiment we have taken query image which
contains green color of grass gray color of water, and blue
color of sky than the algorithm searches for these color
levels in the database and gives results in the most similar
images.
Figure 4. Query Image
Fig. 7. Searched Results of experiment 2
In this experiment we have taken query image which
contains tulip flower of red and yellowish green color its
root and leaf and black colored background. Than the
algorithm searches for these color levels in the database
and gives results in the most similar images.
The experiment performed on the query images with
different color combinations and their recall and precision
values are given below in the table.
Table 1.1
Precision-Recall Curve for the Query Images
SNo.
Figure 5. Searched Results of experiment 1
Experiment 2:
Figure 6 Query image
417
Query Images
Precision
Recall
69.58%
99.49%
1
'1 dam.jpg'
2
'2 Red Flower.jpg'
0.09%
0.17%
3
'3 lake.jpg'
33.01%
94.84%
4
'4 nature.jpg'
32.33%
79.06%
5
'5 red Flower.jpg'
25.79%
83.57%
6
'6 Tulip.jpg'
14.15%
45.05%
7
'7 sun flower.jpg'
11.08%
16.94%
8
'8 Blue Flower.jpg'
5.31%
59.18%
9
'9 Water.jpg'
41.85%
95.08%
10
'10 Tawa Restaurent.jpg'
38.08%
76.72%
11
'11 tawa resort.jpg'
19.35%
87.67%
International Journal of Emerging Technology and Advanced Engineering
Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 3, Issue 8, August 2013)
After performing the retrieval process on more different
query images we get the the values of recall and precision
for each experiment and after plotting these values we get
recall-precision curve of our system which is given below.
Precision Recall Curve of Proposed Methodology
[4]
Ying Liu, Dengsheng Zhang, Guojun Lu, Wei-Ying Ma, “A survey
of content-based image retrieval with high-level semantics”,
Pattern Recognition, Volume: 40, Issue: 1, pp. 262-282, January,
2007.
[5]
J R Smith, “Integrated spatial and feature image system:
Retrieval, analysis and compression “[Ph D dissertation], Columbia
University, New York, 1997.
[6]
Jain, A & Vailaya,A ,(1996) ” Image retrieval using colour and
shape”, Pattern Recognition, Vol. 29, pp1233-1244.
[7]
M. S. Lew, N. Sebe, C. Djeraba, and et al,“Content based
multimedia information retrieval: State of the art and challenges”,
ACM Trans. Multimedia Comput. Commun. Appl., Vol.2, No. 1, 119, 2006.
[8]
D. Feng, W. C. Siu, and H. J. Zhang, “Fundamentals of
Content-Based Image Retrieval, in Multimedia Information
Retrieval and Management Technological Fundamentals and
Applications.”New York: Springer, 2003.
[9]
Nezamabadi-pour, H. & Kabir, E., (2004) “Image retrieval
using histograms of uni-colour and bicolour blocks and directional
changes in intensity gradient”, Pattern Recognition Letters, Vol.
25, pp1547- 1557.
1
0.9
0.8
0.7
Precision
0.6
0.5
0.4
0.3
0.2
0.1
0
0
0.1
0.2
0.3
0.4
0.5
Recall
0.6
0.7
0.8
0.9
1
Fig.8. Precision-Recall Curve
[10] Vadivel A. ,Majumdar A. K. & Sural Shamik, (2003)
"Perceptually Smooth Histogram Generation from the HSV Colour
Space for Content Based Image Retrieval", International Conference
on Advances in Pattern Recognition, Kolkata, pp 248-251.
V. CONCLUSIONS A ND FUTURE SCOPE
The proposed methodology of image retrieval in this
paper is fast but there is always a provision of
improvements in the existing system. The current algorithm
works on HSV color model of images for the retrieval from
database, but there are other color models are also available
which may be efficiently works for retrieval applications.
The future of this method will be the improvement in speed
of retrieval as well as precision in results. The color based
retrieval has wide area of implementation in the field of
data mining and web mining where color is the important
classification criteria.
[11] J R Smith, “Integrated spatial and feature image system:
Retrieval, analysis and compression “[Ph D dissertation], Columbia
University,New York, 1997.
[12] Li, Liu and Cao, “An Image Retrieval Method Based on Color
Perceived Feature”, Journal of Image and Graphics, 1999.
[13] Jun Tang, “A Color Image Segmentation algorithm Based on Region
Growing” 2010.
[14] Yoichi MIYAKE and Kimiyoshi MIYATA “Color Image Processing
Based on Spectral Information and Its Application” 1999.
REFERENCES
[1]
[15] V.S.V.S. Murthy, E.Vamsidhar, J.N.V.R. Swarup Kumar “Content
Based Image Retrieval using Hierarchical and K-Means Clustering
Techniques “ 2010.
Sharma,N.,Rawat,p.,and singh J.,Efficient CBIR using color
Histogram Processing, Signal and image processing an
international Journal (SIPIJ) Vol.2,No.1,(March 2011).
[2]
Deb & Y. Zhang, (2004) “An overview of content-based Image
retrieval techniques,” Proc. on 18th Int. Conf. on Advanced
Information Networking and Applications, Vol. 1, pp59-64.
[3]
Sameer Antani, Rangachar Kasturi, Ramesh Jain, “A survey on the
use of pattern recognition methods for abstraction, indexing and
retrieval of images and video”, Pattern Recognition Volume:35,
Issue: 4, April, 2002, pp.945-96.
[16] Sangoh Jeong, “Histogram-Based Color Image Retrieval”, 2001.
[17] N. K. Kamila “Image retrieval using equalized histogram image bins
moments”, 2010.
[18] V. Vijaya Kumar “Integrated Histogram Bin Matching For
Similarity Measures of Color Image Retrieval” 2009.
[19] A. K. Jain “Fundamentals of Digital Image Processing”2006.
[20] Rafael C. Gonzalez and Richard E. Woods “Digital image
processing Using MATLAB”, 2010.
418
International Journal of Emerging Technology and Advanced Engineering
Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 3, Issue 8, August 2013)
[21] Weisheng Li “New Color Cluster Algorithm For Image Retrieval”
2009.
[22] Song Mailing, Li Huan, “An Image Retrieval Technology Based on
HSV Colour Space”, Computer Knowledge and Technology, No.
3,pp.200-201, 2007.
[23] D.C. He and Li Wang. Texture filters based on texture spectrum.
Pattern Recognition, 24(12):1187–1195, 1991.
AUTHOR’S PROFILE
Durgesh Nirapure is research scholar and
pursuing his Master of Technology
Computer Science Engineering from IES
College of Technology Bhopal, India. He is
very keen to study the image processing
techniques and its retrieval processes.
419
Download