• Conceptualizations of real world
• Often derived in Consensus processes or enforced by entities
• Variety of content and representations
– Thesauri, Dictionary, Taxonomies, DB Schema
– XML Schema, DTDs, UML, RDF Schema
– Might contain is_a, classes, partof, operations,
Behaviour axioms, synonyms, hyponym
• Classes, attributes
• Relationships
– Is-a, part-of, non-standard
• Events with Spatio-temporal characteristics
• Uncertainties ?
• Visual/Iconic constructs
• Multiple Languages
• Standards
– UMLS Metathesaurus
– Yahoo, Open Directory
– Business Process Modeling Initiative (BPMI)
– XML-HR Initiative (Human Resouces)
– PapiNet (Paper Industry)
• Application
– Genom Research Exchange
– B2B Exchange (product catalog interoperation, business process interoperation)
– Mediation across multi-lingual ontologies
• Challenges for the Database Community
– Storing, retrieval, querying
– Browsing, interoperation
• Of heterogeneous Ontologies
• Support for Ontologies
• Acquiring Ontologies
• Machine Learning
• Learning from User Practices
• Reusing existing Ontologies
• Ontology Merging
(resolution of differences/mismatch in representing same or similar things)
• Support technology depends on the tasks to perform
• Comprehensive Data Management support requires the identification of the ontology life cycle
Requirements/
Analysis
Ontology
Learning
Consistency
Checking
Ontology Search
Compare/Similarity
Merge/
Refine/Assemble
Evaluation
Maintenance
Versioning
Creation/
Change
Deployment
(e.g., Hypothesis Generation, Query)
• Operations to compare Models/Ontologies
• Scalability/Storage Indexing of Ontologies
– DB approaches data model specific
– Need to support graph based data models
• Temporal Query Languages
• Schema Mapping
– Meta Model specific
– Representation of exceptions, e.g., tweety
– Specification of Inexact Schema Correspondences
• E.g., 40% of animals are 30% of humans
• Meta Model Transformations/Mappings (e.g.,
UML to RDF Schema)
• Ontology Versioning
– Collaborative editing
– Meta Model specific versioning
– Version of Schema/Meta Model
Transformations
• Inference v/s Query Rewriting/Processing for Semantic Integration:
• E.g., RichPerson = (AND Person (> Salary 100))
• Can Query Processing/Concept Rewriting provide the same functionality as inferences ? More efficiently ?
• Distributed Inferences and Loss of Information
• Query Languages for combining metadata and data queries
• Graph-based data models and query languages
• Schema Correspondences/Mappings (Repeat from previous slide)
•Intensional Answers (Answers are descriptions, e.g. (AND Person (>
Salary 100)) instead of a list of all rich people)
•Semantic Associations (identification of meaningful relationships between different types of instances)