Logics for Data and Knowledge Representation Towards Infrastructure, Methodology and Principles for Ontology Development Fausto Giunchiglia and Biswanath Dutta Fall - 2011 Outline Introduction Knowledge Representation (KR) Ontology Domain Facet DERA Mapping Methodology Principle Demo 2 : Domain Modeling INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Knowledge Representation (KR) “…is about building models of the world, of a particular domain or problem, which allow automatic reasoning and interpretation” Fundamental 3 Goal is to represent knowledge in a manner that facilitates inferencing new knowledge (i.e. drawing conclusions) from the knowledge base INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Knowledge Representation Properties According to (Crawford & Kuipers, 1990): It must have a reasonably compact syntax. It must have a well defined semantics so that one can say precisely what is being represented. It must have sufficient expressive power to represent human knowledge. It must have an efficient, powerful, and understandable reasoning mechanism It must be usable to build large knowledge bases. Crawford, J. M. & Kuipers, B. (1990). ALL: Formalizing Access Limited Reasoning. Principles of semantic networks: Explorations in the representation of knowledge, Morgan Kaufmann Pub., 299-330. 4 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Knowledge Representation Issues KR issues: How do people represent knowledge? What is the nature of knowledge? Do we have domain specific schema or generic, domain independent schema? How much it needs to be expressive? 5 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Ontology “formal, explicit specification of a shared conceptualisation” – Gruber, 1993 Models a domain consisting of shared vocabulary with the definition of objects and/or concepts and their properties and relations A structural used as a form of KR in the fields like, AI, SW, Lib. Sc., Inf. Architecture, etc. Ontology 6 framework for organizing information, and as a language resource INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Ontology Properties Some of the ontological properties are: Extendable Reusable Flexible Robust … 7 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Domain An area of knowledge or field of study that we are interested in or that we are communicating about Example: Computer science, Artificial Intelligence, Soft computing, Social networks, …Library science, Mathematics, Physics, Chemistry, Agriculture, Geography, … Music, Movie, Sculpture, Painting, …Food, Wine, Cheese, …Space,… 8 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Domain (contd…) A domain can be decomposed into its several constituents, and Each of them denotes a different aspect of entities An example from Space domain: by region, by body of water, by landform, by populated places, by administrative division, by land, by agricultural land, by facility, by altitude, by climate,… Each 9 of these aspects is called facet INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Facet A hierarchy of homogeneous terms describing an aspect of the domain, where each term in the hierarchy denotes a different concept E.g., Body of water(e.g., River, Lake, Pond, Canal), Landform (e.g., mountain, hill, ridge), facility (e.g., house, hut, farmhouse, hotel, resort), etc. language facet (e.g., English, Hindi, Italian,), property facet, author facet, religion facet (e.g., Christian, Hindu, Muslim), commodity facet, etc. INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO DERA Is a Facet based knowledge organization framework It is is independent from any specific domain Allows building domain specific ontologies Is logically sound Has mapping to Description Logic Decidable Designed 11 by the UniTn KnowDive group INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO DERA Surface Structure In the surface level, it has the following components: D – Domain E – Entity R – Relation A – Attribute 12 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Domain (D) A DERA domain is a tuple of, D = <E, R, A> 13 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Entity (E) an elementary component that consist of entity classes and their instances, having either perceptual correlates or only conceptual existence in a domain in context E = <{C} + {E'}> Where, C = Entity class – consists of core classes representing the entities; E' = Entity (also called object) – consists of real world named entities, that is the instantiation of the entity classes C. 14 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Entity (E) (contd…) Entity class (C) : Represents the essence of the domain under consideration; Consists of the core classes representing a domain in context E.g., Consider the following classes in context of Space domain: 15 Mountain, Hill, Lake, River, Canal, Province, City, Hotel,... INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Entity (E) (contd…) Entity (E') : Are the real world named entities An extension of the real world entities E.g., 16 The Himalaya, Monte Bondone, Lake Garda, Trento, Povo, Hotel America,... INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Entity (E) (contd….) E.g., An example from a domain Space 17 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Relation (R) An elementary component consists of classes defining the relations between entities R = <{r}> A relation is a link between two entities (E') Builds a semantic relation between the entities E.g., 18 Some relations (spatial) from Space domain: near, adjacent, inside, before, center, sideways, etc. INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Attribute (A) An elementary component consists of classes expressing the characteristics of entities A = <{A'}, {C}> An attribute is any property, qualitative, quantitative or descriptive measure of an entity 19 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Attribute (A) (contd…) Datatype Attribute (A'): A datatype attribute includes the attribute classes that accounts the quality or quantity of an entity within a domain E.g., latitude, longitude (of a place): altitude (of a mountain): 8000ft, 2400m. high, low depth (of a lake): 20 450 N, 180 S deep, shallow 100ft., 20m. INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Attribute (A) (contd…) Descriptive Attribute It includes the attribute classes that describe the entities under a domain in consideration The value of a descriptive attribute could consists of a single string or a set of strings E.g., natural resource (of a place): {Classical architecture, Greek architecture, Roman architecture, Bauhaus, etc.} history (of a place) coal, natural gas, oil, … architectural style (of a castle): 21 (C): ………. climbing route (to a mountain) ………………. INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Mapping From DERA to DL Entity classes (C) -> Concepts Relation (R) -> Roles Datatype attribute (A') -> Roles Descriptive attribute (C) -> Roles Entity (E') -> Individuals 22 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Methodology Step 1: Identification of the atomic concepts Step 2: Analysis (per genus et differentiam) Step 3: Synthesis Step 4: Standardization Step 5: Ordering Following the above steps leads to the creation of a set of facets. They constitute a faceted representation scheme for a domain 23 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Ontological Principle Relevance (e.g.,breed is more realistic to classify the universe of cows instead of by grade) Ascertainability (e.g., flowing body of water) Permanence (e.g., Spring- a natural flow of ground water) Exhaustiveness (e.g., to classify the universe of people, we need both male and female) Exclusiveness (e.g., age and date of birth, both produce the same divisions) Context (e.g., bank, a bank of a river, OR, a building of a financial institution) Important: helps in reducing the homographs Currency (e.g., metro station vs. subway station) Reticence (e.g., minority author, black man) Ordering 24 Important: ordering carries semantics as it provides implicit relations between the coordinate terms INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Step 1: identification of the atomic concepts Sources of the concepts WordNet GeoNames (357/663 classes) TGN Literature 25 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Step 1: identification of the atomic concepts (2) Some of the relevant sub-trees in WordNet are: location artifact, artefact body of water, water geological formation, formation land, ground, soil land, dry land, earth, ground, solid ground, terra firma Note: not necessarily all the nodes in these sub-trees need to be part of the space domain. For example, the descendants of artifact, like, article, anachronism, block, etc. are not. 26 INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Analysis Mountain • • • the well elevated land defined formed by the geological formation (where geological formation is a natural phenomenon) altitude >500m 27 in general Hill • • • the well defined elevated land formed by the geological formation, where geological formation is a natural phenomenon altitude <500m in River Stream general • a body of water • a body of water • a flowing water • a flowing water • no fixed boundary • no fixed boundary • confined within a bed and stream banks • confined within a bed and stream banks • larger than a brook body of body of INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Synthesis Body of water Landform Flowing body of water Stream Brook River Stagnant body of water Pond Natural depression Oceanic depression Oceanic valley Oceanic trough Continental depression Trough Valley Natural elevation Oceanic elevation Seamount Submarine hill Continental elevation Hill Mountain 28 * each term in the above has gloss and is linked to synonym(ous) terms in the knowledge base INTRODUCTION :: DOMAN :: FACET :: DERA :: MAPPING :: METHODOLOGY :: PRINCIPLE :: DEMO Facets and sub-facets Space [Domain] by geographical feature [Entity class] by water formation by land formation by land by administrative division … by relations [Relation] spatial relation functional relation (e.g., primary inflow, primary outflow) … by attribute [Datatype attribute] latitude Longitude dimension … [Descriptive attribute] 29 direction, internal, external, longitudinal, sideways, etc. Natural resource Architectural style Time zone ph History … Log-in: http://uk.disi.unitn.it/resources/html/UKDomain.html