GEOHYBRID: A HIERARCHICAL APPROACH FOR ACCURATE AND SCALABLE GEOGRAPHIC LOCALIZATION

advertisement
ITU-T Kaleidoscope 2010
Beyond the Internet? - Innovations for
future networks and services
GEOHYBRID: A HIERARCHICAL
APPROACH FOR ACCURATE AND
SCALABLE GEOGRAPHIC
LOCALIZATION
Dr. Bamba Gueye
Joint work with Ibrahima Niang and Bassirou Kasse
University Cheikh Anta Diop of Dakar
bamba.gueye@ucad.edu.sn
http://edmi.ucad.sn/~gueye
Pune, India, 13 – 15 December 2010
Motivations
New class of location-aware applications
Web services: targeted advertising, locating
cyber-criminality, restricted content delivery
Location-based security check
Geographic information of the Internet
routes
Analyze the geographic behavior of network
routing, internet topology mapping
Optimization of the decision taking
process of a Grid Resource Broker
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
2
Problem statement
IP-location mapping: Given the IP address of an
Internet host, can we estimate its geographic location?
AS1
AS2
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
AS3
3
How to locate Internet host ?
Passive measurements approach
Use databases and exhaustive tabulation
Ex: GeoBytes, GeoURL, GeoIP
Active measurements approach
Round-trip time-based or/and Traceroutebased
Ex: GeoPing, CBG, GeoBuD, GeoTrack,
SarangWorld project
Hybrid approach
Ex: GeoHybrid
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
4
Passive measurements approach
DNS-based
Incorporating location information in
DNS records
RFC 1876
Whois-based approach
Use location information recorded in
the Whois database
141.152.24.9: where am I?
Response = the location of the ISP Verizon
(141.149.0.0/13), Reston, VA
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
5
Drawbacks
Information recorded in Whois
database may be inaccurate or stale
Large ISPs advertise only aggregate
prefix for reasons of scalability
Single prefix with multiple locations
[Feamster et al. IMC05, Gueye et al. PAM08]
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
6
Active measurement approach
DNS-based approach
Observation:
Recognizable host: geographically meaningful
names
Ex: bcr1-so-2-0-0.Paris.cw.net
Use the reports of “traceroute”
Location of a target host = location of the last
recognizable router on the path
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
7
DNS-based approach drawbacks
No rules for naming the routers
[Rexford et al. USENIX06]
charlotte.ucsd.edu – San Diego, CA (not Charlotte, NC)
dnverng-kscyng.abilene.ucaid.edu – Denver, CO (not
Kansas, KS)
The last recognizable router can be
located far
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
8
Active measurements approach
GeoPing-like
[Padmanabhan et al. SIGCOMM01]
The number of possible locations of a given host is
equal to the number of landmarks (discrete space
of answers)
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
9
CBG: A continuous response space via
multilateration
Multilateration
[Gueye et al. ToN06]
Estimates position using distance estimates
from some fixed points
Similar to GPS
Active measurements
From a set of landmarks to a target host
Select minimum RTT
Transformation of RTT measurements into
distance
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
10
Constraint-Based Geolocation (CBG) :
locating Internet host
Multilateration using
distance constraints
Distance constraints are
overestimated
Assigns confidence
region to each location
estimate
d2 + d’2
d2
d1 + d’1
d1
P2
d3
d3 + d’3
P1
P3
Intersection
Estimated location of
target host
Confidence region
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
11
Contributions
GeoHybrid
Combination of active and passive
measurements
Reduce the number of measurements
injected in the network
Geolocation service for grid computing
middleware
Useful for the optimization of the decision
taking process of a Grid Resource Broker
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
12
GeoHybrid: A comprehensive
technique for geolocalization
GeoHybrid is based on:
active measurements
CBG approach
Geolocation
queries
Database
(IP to location
mappings)
Server
Geolocation
answers
Scripts for handlings
active measurements
passive measurements
Database with exhaustive
tabulation
Heuristic implemented
Find the nearest set of
landmarks for a given
target host
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
13
Experimental setup
74 PlanetLab nodes as landmarks
127 hosts (AMP and RIPE nodes) as
targets
GeoIP’s database
[MaxMind LLC]
1,876,596 blocks of IP addresses
Each block has its own geographic location
such as country, region, city or
latitude/longitude
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
14
Results
Heuristic approach
Median error: 175 km
Random approach
Median error: 400 km
Random approach: 30 samples for each landmarks
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
15
Summary of GeoHybrid
GeoHybrid reduces:
Number of measurements
Response time
A set of 20 nearest landmarks are
sufficient to locate Internet hosts
Exhaustive tabulation is difficult to
manage and keep updated
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
16
Geolocation-based Grid Optimizer
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
17
Conclusions
GeoHybrid allows to reduce the number
of measurements
The proposed measurement middleware
service brings benefits for the area of
grid computing
Mitigates the amount of traffic exchanged
across the grid
We plan to implement this middleware
in the Research Education Network of
Senegal
Pune, India, 13 – 15 Dec 2010:
ITU-T Kaleidoscope 2010 – Beyond the Internet? Innovations for future networks and services
18
Download