Building Ontology for the national map

advertisement
Building Ontology for
The National Map
Andrew Bulen, Jonathan Carter, Dalia Varanka
3rd Annual SOCoP Workshop, Reston, Virginia
December 3, 2010
U.S. Department of the Interior
U.S. Geological Survey
Objectives

To build a framework to more explicitly
articulate detailed information about features
contained in The National Map based on the
semantics of feature types

The flexible exchange of feature semantics
enables more specific information access

Richer data models based on ontology will
Increase potential data applications
Project Description



Develop a conceptual framework for data
handling
Develop algorithms for triples and ontology
pattern concepts
Build infrastructure and program digital
products
SOCoP Workshop, Nov. 2010
Outline




Topographic data conversion to triples
Geospatial relations for topographic data
Topographic feature ontology patterns
Conclusions
Data Conversion: Challenges



Point data: the Geographic Names Information
System (GNIS) gazetteer
Vector data: hydrography, structures, transportation,
divisions
Challenges:
 Retrieving data from The National Map database
formats
 Creating GML that is valid for any GML processing
programs
 Linking data to features from other sources
 Converting large amounts of data
Data Conversion: Solutions

Create an automated tool to translate existing format
files

Quantum GIS conversion to GML v2 with spatial
reference system included in each geometry
 Store URI and point to resources replacing literals

Parallelize conversion and spatial relation
comparison
Conversion Tool:
Jena and GeoTools libraries to convert to RDF
Configuration Editor
Data Conversion: Outcomes





The semantic content of the converted data is
identical to the original data.
Increased openness, availability, and
interoperability
Data is simpler to extract
Increased data richness
Database size is very large
SPARQL Endpoint
Challenges
 Create an endpoint so the public can access the data
 Must be fast, secure, easy to use
Solutions
 User Virtuoso to create and serve the endpoint
 Virtuoso is capable of scaling to a large size
Outcomes
 Data for converted areas is publicly accessible
 Data can be retrieved quickly
 Data is securely hosted
Spatial Relation Predicates:
Challenges:
 Describe relation predicates between currently


converted GIS data
Build a vocabulary of relations for triples that can
effectively model topography and topographic
science
Define relational predicates to meet standards
Spatial Relation Predicates
Solutions:
 Define relations based on current USGS data


models (Hydrologic Unit Codes, flow tables, etc.)
Apply Open Geospatial Consortium (OGC)
standard spatial relation terms
 Terms based on the 9-intersection model
Determine new cognitive/linguistic spatial
relations for topographic data
Vocabulary of Spatial Relations
 Topographic spatial relations and prepositions extracted from

feature definitions
Classified by logic types and spatial reference systems: usercentered, object-centered, and environment-centered
FLOW
Water
water
Underground water
CAUSED
Crater (Circular-shaped depression
at the summit of a volcanic cone or
one on the surface of the land)
Crater (a manmade depression)
FORM
Crossing (A place where two or
more routes of transportation)
REMOVED
Mine (place where commercial
minerals)
Oilfield (area where petroleum
is/was)
flowTHROUGH
flowTHROUGH
flowTO
Arroyo (Watercourse or channel)
Channel (Linear deep part of a body
of water)
The surface of the Earth
causedBY
the impact of a meteorite
causedBY
an explosion
form
a junction or intersection (overpass,
underpass)
removedFROM
Earth
removedFROM
Earth
Feature Primitives: Properties that Meet
Necessary and Sufficient Conditions
Metals
RESOURCE
Industrial
Minerals
Surface
Mine
EXTRACTION
Underground
Mine
Required Relations Reflect Primitives
powers
Power lines
Buildings
powers
• offices
• maintenance sheds
• head frame (shaft) *
• ore processing
carriesTo
Conveyors
connects
Large vehicles
• haulers
• front-end loaders
• scoops
• dump trucks
carriesTo
Roads
• dirt / gravel
carriesTo/From
Disturbed ground
connects
• ore piles
• tailings
• quarry / pit **
• mountaintop removed **
Railroad
* Underground mining ** Surface mining
Complex Topographic Features

Component assemblages are supported by resource
systems and are embedded in the near-by landscape.
LANDSCAPE
SYSTEMS
COMPLEX
FEATURE
Complex Features, Systems, and
Landscapes
Material Services Corporation, Thornton, IL
Ü
1:24,000
0
0.5
1 Kilometers
Topographic Science Modules
Complex Features and the
Geosemantic Web

Complex feature ontology
 saved to an ontology repository for re-use and

customized by others
queried and linked to other data for environmental
applications
Topographic Ontology: Challenges
 Create ontology patterns so that necessary data


can be linked using RDF and OWL
Adhere to RDF and comparable research
standards
Build logical reasoning: template of definitions
added for testing
Conclusions
Our approach for semantic topographic data
 Converts features to RDF
 Identifies spatial relations that reflect feature
primitives
 Uses a taxonomic structure that adds
semantic specifics and offers relative scale
 Accounts for three stages of topographic
representation
Outlook for 2011



Data made available to be queried and linked
to other data for environmental applications
Ontology saved to a repository for re-use and
customized by others
Gazetteer interface for data retrieval
 Use the GNIS data with spatial relations to
advance gazetteer functions
Publications

Varanka, D. and Usery, E.L., 2010, Special Section: Ontological Issues for The National
Map: Cartographica: The International Journal for Geographic Information and Visualization,
v. 45, n. 2, p. 103-104.

Varanka, D., Carter, J., Shoberg, T., and Usery, E.L., in press, Topographic Mapping Data
Semantics; Data Conversion and Enhancement, in Sheth, Amit and Ashish, Naveen, Eds.,
Geospatial Semantics and the Semantic Web. Semantic Web and Beyond: Computing for
Human Experience, Springer.

Varanka, D. and Jerris, T., 2010, Ontology Design Patterns for Complex Topographic
Features. AutoCarto 2010, Orlando FL, November 15 – 18, 2010.

Caro, H., and Varanka, D., Analysis of Spatial Relation Predicates in U.S. Geological Survey
Feature Definitions. U.S. Geological Survey Open File Report.
Project Web Page
Building Ontology for The National Map
http://cegis.usgs.gov/ontology.html
Principle Investigators
E. Lynn Usery
usery@usgs.gov
Dalia Varanka
dvaranka@usgs.gov
Download