International Journal of Engineering Trends and Technology (IJETT) – Volume 4 Issue 10 - Oct 2013 A Novel Geo-coding and Cache Based Approaches for Spatial Queries R.Subbarao1, K.Srikanth2 1 1,2 M.Tech Scholar,2 Associate Professor Dept of CSE, Pydah College of Engineering and Technology Visakhapatnam Abstract: This paper proposes an efficient algorithm for spatial databases, various approaches are delivered by the various researchers for finding the result based on the keywords, and usually spatial query is a combination of a location and set of features. In our approach we are handling the spatial queries jointly and returns the only user specified number of optimal results, we implemented a cache based approach for efficient results. the nearest neighbors and for each neighbor an inverted index is used to check if the query keywords are contained. We show that such two-phase approaches are inefficient. In this paper are proposing a cache based approach for handling the queries jointly with corresponding geocoding parameters. I. INTRODUCTION In a nutshell, given a user query consisting of several keywords, a Standard search engine ranks the pages in its collection in terms of .Their relevance to the keywords. This is done by using a text index Structure called an inverted index to retrieve the IDs of pages containing the keywords, and then evaluating a term-based ranking Function on these pages to determine the k highest-scoring pages. (Other factors such as hyperlink structure and user behavior are Also often used, as discussed later). Query processing is highly optimized To exploit the properties of inverted index structures, stored In an optimized compressed format, fetched from disk using efficient Scan operations, and cached in main memory. Let D be a dataset in which each object p ∈ D is a pair (λ, ψ) of a spatial location p.λ and a textual description p.ψ (e.g., the facilities and menu of a restaurant). Similarly, a spatial keyword query [3] q = _λ, ψ_ has two components, where q.λ is a spatial location and q.ψ is a set of keywords. The answer to query q is a list of k objects that are in ascending order of their distance to the query location q.λ and whose descriptions contain the set of query keywords q.ψ Formally, let the function dist( ・ , ・ ) denotes the Euclidean distance between its argument locations, and let D(q.ψ) = {p ∈ D | q.ψ ⊆ p.ψ} be the objects in D that contain all the keywords in q. The result of the top-k spatial keyword query q, q(D), is a subset of D(q.ψ) containing k objects such that ∀p ∈ q(D) (∀p_ ∈ D(q.ψ)−q(D) (dist(q.λ, p.λ) ≤ dist(q.λ, p_.λ))). The joint top-k spatial keyword query Q is a set {qi} of such queries. We introduce the following notion to capture useful information on a joint query Q: (i) Q.λ = MBRqi∈Q qi.λ is the minimum bounding rectangle (MBR) of the locations of the subqueries in Q, (ii) Q.ψ = ∪qi∈Q qi.ψ is the union of the keyword sets of the subqueries in Q, and (iii) Q.m = minqi∈Q |qi.ψ| is the smallest keyword set size of a The World-Wide Web has reached a size where it is becoming increasingly challenging to satisfy certain information needs. While search engines are still able to index a reasonable subset of the (surface) web, the pages a user is really looking for are often buried under hundreds of thousands of less interesting results. Thus, search engine users are in danger of drowning in information. Adding additional terms to standard keyword searches often fails to narrow down results in the desired direction. A natural approach is to add advanced features that allow users to express other constraints or preferences in an intuitive manner, resulting in the desired documents to be returned among the first results. In fact, search engines have added a variety of such features, often under a special advanced search interface, but mostly limited to fairly simple conditions on domain, link structure, or modification date. A spatial keyword query consists of a query area and a set of keywords shown in below figure. The answer is a list of objects ranked according to a combination of their distance to the query area and the relevance of their text description to the query keywords. A simple yet popular variant, which is used in our running example, is the distance-first spatial keyword query, where objects are ranked by distance and keywords are applied as a conjunctive filter to eliminate objects that do not contain them. Unfortunately there is no efficient support for topk spatial keyword queries, where a prefix of the results list is required. Instead, current systems use ad-hoc combinations of nearest neighbor (NN) and keyword search techniques to tackle the problem. For instance, an R-Tree is used to find ISSN: 2231-5381 II. RELATED WORK http://www.ijettjournal.org Page 4408 International Journal of Engineering Trends and Technology (IJETT) – Volume 4 Issue 10 - Oct 2013 subquery in Q. We later define a variable qi.τ that captures the upper bound kth nearest neighbor distance of subquery qi. The value Q.τ = maxqi∈Q qi.τ then represents the maximum upper bound kth nearest neighbor distance of all the subqueries in Q. 3Referring to Figure 1, the joint query Q contains three subqueries q1, q2, and q3 (shown as shaded dots). The objects (e.g., restaurants) are shown as white dots. We have that: q1.ψ = {curry,sushi}, q2.ψ = {seafood, sushi}, q3.ψ = {curry, seafood}. Note that Q.λ denotes the MBR of the shaded dots in the figure. We also have: Q.ψ = {curry, seafood, sushi} and Q.m = 2. Figure1: A Dataset of Spatial Keyword Objects III. PROPOSED SYSTEM In this paper we proposed an efficient approach for handling the spatial query based on geocodings(latitude and longitudes) and for optimal performance we introduced Cache mechanism. we implemented both approaches with cache and without cache. Our entire process is divided in to two phases based on cache and geocodes. Geocoding process receives the query and find and the geocodings of the respective user from where he made a query and get the geocoding values from the all the queries from the tree. Find the distance with all the query nodes and input query node and compare all features of the query node with all node features until it meets the leaf node and add the results to the list frequently and returns the result Input: Query, Cache Queries Output: Result set generated for query Procedure: If Query available in cache Result related to query: = ForwardToTreeprocess (Query) Else Result related to query: = GeocodingtreeProcess (Query) Geocoding process(Query): Parameters ISSN: 2231-5381 Qi—Input Spatial Query Qj (j=1…n) ---Set of Queries contains same Location Dist[j] (j=1…..n)-----Array for set of distances Procedure: (xi,yi)---Geocodings of Qi (xj,yj)--- Geocodings of all queries with respect to location Dist[i]=Euclidean distance between the geocodes While not leafnode Read nodes from tree For Q.features If Q.features[i]==Q.features[j] Add to list End while Sort list by feature and distance Return list. ForwardToTreeprocess () 1. Build an empty list 2 .Make a root node 3. if Qi in cache and status=false For j=0 to n Compare features(Qi,Qj) status=true; For Each child in tree If(status==true) Getnodebyfeature (Qi); Getnodebyfeature (Qj); End Else Empty list () End For Each 4.Add nodes to list 5.Return list IV. CONCLUSION Finally we proposed an efficient a novel search implementation on spatial databases with simple implementation than the complex tree constructions like R trees, in both cache based and non cache based(with geocodings),our algorithms shows an optimal results than the traditional approaches. REFERENCES [1] Y.-Y. Chen, T. Suel, and A. Markowetz. Efficient query processing in geographic web search engines. In SIGMOD, pp. 277–288, 2006. [2] G. Cong, C. S. Jensen, and D. Wu. Efficient retrieval of the top-k most relevant spatial web objects. In VLDB, pp. 337–348, 2009. [3] I. De Felipe, V. Hristidis, and N. Rishe. Keyword search on spatial databases. In ICDE, pp. 656–665, 2008. [4] M. Duckham and L. Kulik. A formal model of http://www.ijettjournal.org Page 4409 International Journal of Engineering Trends and Technology (IJETT) – Volume 4 Issue 10 - Oct 2013 obfuscation and negotiation for location privacy. In PERVASIVE, pp. 152–170, 2005. [5] A. Guttman. R-trees: a dynamic index structure for spatial searching. In SIGMOD, pp. 47–57, 1984. [6] R. Hariharan, B. Hore, C. Li, and S. Mehrotra. Processing spatial-keyword (SK) queries in geographic information retrieval (GIR) systems. In SSDBM, p. 16, 2007. [7] T. Brinkhoff, H. Kriegel, and B. Seeger. Efficient processing of spatial joins using R-trees. Proc. SIGMOD, pages 237–246, 1993. [8] G. Cong, B. Ooi, K. Tan, and A. Tung. Go green: recycle andreuse frequent patterns. In Data Engineering, 2004. Proceedings. 20th International Conference on, pages 128–139. [9] A. Corral, Y. Manolopoulos, Y. Theodoridis, and M. Vassilakopoulos. Closest pair queries in spatial databases. Proc. SIGMOD, pages 189–200, 2000. [10] I. D. Felipe, V. Hristidis, and N. Rishe. Keyword search on spatial databases. In Proc. ICDE International Conference on Data Engineering,2008. [11] A. Guttman. R-trees: A dynamic index structure for spatial searching. Proc. SIGMOD, pages 47–57, 1984. [12] R. Hariharan, B. Hore, C. Li, and S. Mehrotra. Processing spatial keyword(sk) queries in geographic information retrieval (gir) systems. In SSDBM, page 16, 2007. [13] G. Hjaltason and H. Samet. Incremental distance join algorithms forspatial databases. Proc. SIGMOD, pages 237–248, 1998. [14] H. Jagadish, R. Ng, B. Ooi, and A. Tung. ItCompress: An IterativeSemantic Compression Algorithm. In Proceedings of the 20th International Conference on Data Engineering (ICDE04), volume 1063, pages 20–00. [15] H. V. Jagadish, N. Koudas, and D. Srivastava. On effective multidimensional indexing for strings. Proc. SIGMOD, pages 403–414, 2000. [16] K. Koperski and J. Han. Discovery of spatial association rules ingeographic information databases. Proc. SSD, pages 47–66, 1995. [17] N. Mamoulis and D. Papadias. Multiway spatial joins. Proc. TODS,26(4):424–475, 2001. [18] B.-U. Pagel, H.-W. Six, H. Toben, and P. Widmayer. Towards an analysis of range query performance in spatial data structures. In PODS, pages 214–221, New York, NY, USA, 1993. ACM. [19] D. Papadias and D. Arkoumanis. Approximate processing of multiway spatial joins in very large databases. Proc. EDBT, pages 179–196, 2002. [20] D. Papadias, N. Mamoulis, and B. Delis. Algorithms for querying by spatial structure. Proc. VLDB, pages 546– 557, 1998. [21] D. Papadias, N. Mamoulis, and Y. Theodoridis. Processing and optimization of multiway spatial joins using R-trees. Proc. PODS, pages 44–55, 1999. ISSN: 2231-5381 BIOGRAPHIES Mr.K.Srikanth completed his M.Tech at Gitam University.Visakhapatnam in the year 2008 . He is having 15 years of experience in teaching to engineering .Currently he is working as Associate Professor , department of CSE at Pydah college of engineering and technology. JNTUK University. His areas of interest include Computer Organization, Computer Networks, Operating Systems , Software Engineering , Advanced Information Systems Mr.R.Subbarao completed his B.tech at Sarada Institute of Technology and Sciences in 2010. He is pursuing M.Tech in Computer Science and Engineering from Pydah college of Engineering and Technology . JNTUK University .His areas of interest include C Programming, JAVA , C++. http://www.ijettjournal.org Page 4410