Ontologies - clic

advertisement
INTRODUCTION TO ARTIFICIAL
INTELLIGENCE
Massimo Poesio
LECTURE 5: Ontologies and Formal
Ontologies
LOGIC vs ONTOLOGIES
• Logic is not ‘knowledge’: is just a language for
encoding knowledge and inferences
• Already Aristotle realized that we need a
separate theory of what types of objects there
are, what properties they have, and how they
are related, and made a first attempt in his
CATEGORIES
• Now this area of research is known as
(Formal) ontology
PHILOSOPHICAL BACKGROUND
• Aristotle’s Metaphysics:
– A list of 10 categories
– Criteria for definition of categories
• Tree of Porphiry: Organize categories under
SUBSTANCE in a hierarchy
• Brentano: all ten categories
• Kant: neutral as to whether these categories
really reflect the world or merely our
conception of it
ARISTOTLE’s CATEGORIES
•
•
•
•
•
•
•
•
•
•
SUBSTANCE (man, horse)
QUALITY (white, heavy)
QUANTITY (four-foot, five-foot)
ACTIVITY (cutting, burning)
NB: simultaneously a
PASSIVITY (being cut, being burned)
classification of what is
SPATIALITY / LOCATION (in the there
Lyceum)
and what
TEMPORALITY / LOCATION (yesterday,
propertieslast
thereyear)
may be
RELATION (double, half)
HAVING / STATE (has shoes on)
SITUATEDNESS / POSTURE (is lying, is sitting)
DEFINITION OF CONCEPTS
(Aristotle’s Metaphysics, Book Z)
“a definition is an account, and every account
has parts, and part of the account stands to
part of the thing in just the same way that
the whole account stands to the whole thing”
= Most concepts encode necessary and
sufficient conditions for their own application
DEFINITIONS BY GENUS AND
DIFFERENTIA
MAN = RATIONAL ANIMAL
DEFINIENDUM
DEFINIENS
Definition by genus and differentia
• The ‘method of division’:
– Begin with the broadest genus containing the
species to be defined (‘ANIMAL’)
– Divide the genus in two sub-parts by some
differentia (‘FOOTED’)
– Then divide the two sub-types again (CLOVENFOOTED)
THE TREE OF PORPHIRY
Other philosophers
The greatest part of the Ideas, that make our
complex Idea of GOLD, are YELLOWNESS, great
WEIGHT, FUSIBILITY, and SOLUBILITY IN AQUA
REGIA (Locke)
In the case of many words … it is possible to
specify their meaning by reference to other
words. E.g., “ARTHROPODES” are ANIMALS
with SEGMENTED BODIES and JOINTED LEGS.
(Carnap)
`BOTTOM-UP’ ONTOLOGIES IN AI
• The interest of Artificial Intelligence
researchers in these ideas was born out of
attempts to model knowledge in specific
domains
THE BLOCKS WORLD
A BLOCKS WORLD ONTOLOGY
OBJECT
GRASPABLE
grasp(arm,x)
NON- GRASPABLE
~grasp(arm,x)
STACKABLE
stack(x,y)
CUBE
PYRAMID
~stack(x,y)
TODAY’s DOMAIN-SPECIFIC
ONTOLOGIES
• Protein Ontology: developed to codify in a
systematic way our knowledge about proteins
– http://pir.georgetown.edu/pro/
• Other ontologies listed on OPEN BIOMEDICAL
ONTOLOGY
– http://www.obofoundry.org/
– Gene ontology, C. elegans, etc
• Medical domain: UMLS
PROTEIN ONTOLOGY
UPPER ONTOLOGIES
• The work on domain-specific ontologies
eventually led to the desire to develop ontologies
that could ‘connect’ formalizations in one domain
with formalizations in other domains
– E.g., an ontology for biology with an ontology for
medicine
– But also an ontology of art with an ontology for
tourism
• Whether this is actually possible is a deep
philosophical question
CYC
• One of the first attempts in AI to produce such
an overarching ontology was done in the CYC
project – an effort to produce an
enCYClopedia of commonsense knowledge
THE CYC ONTOLOGY
http://www.cyc.com/
PROBLEMS ENCOUNTERED IN CYC
• The researchers working on CYC found
themselves confronting every single issue in
knowledge representation
• E.g., how to define VIDEOTAPE?
– A strip of coated plastic? (concrete)
– The information contained on that strip?
(abstract)
FORMAL ONTOLOGIES
• Work on formal ontologies is concerned with
providing an inferential characterization of
categories in terms of logic
• A simple example of inference:
– if X is a PHYSICAL OBJECT, then moving X from L1 to L2
implies that the LOCATION of X after the movement is
L2
• A more complex inference:
– Moving X with mass M from L1 to L2 implies that the
total mass at L1 is reduced by M, whereas the total
mass at L2 is increased by M (this is not true if X is an
abstract object)
UPPER ONTOLOGIES: DOLCE
• Work on specifying the ‘categories of
existence’ is exemplified by DOLCE, an upper
ontology developed by the Lab for Applied
Ontology of CNR (Povo)
DOLCE’S TAXONOMY
PT
Particular
ED
Endurant
PED
Physical
Endurant
M
Amount of
Matter
F
Feature
PD
Perdurant
NPED
Non-physical
Endurant
POB
Physical
Object
…
AS
Arbitrary
Sum
NPOB
Non-physical
Object
Q
Quality
EV
Event
STV
Stative
ACH
ACC
Achievement Accomplishment
…
…
TQ
Temporal
Quality
ST
State
PRO
Process
…
…
… TL
Temporal
Location
PQ
Physical
Quality
AB
Abstract
… Fact
AQ
Abstract
Quality
… SL
Spatial
Location
…
TR
Temporal
Region
…
APO
Agentive
Physical
Object
NAPO
Non-agentive
Physical
Object
MOB
Mental Object
SOB
Social Object
ASO
Agentive
Social Object
SAG
Social Agent
NASO
Non-agentive
Social Object
SC
Society
T
Time
Interval
Set
PR
Physical
Region
… S
Space
Region
R
Region
AR
Abstract
Region
…
FUNDAMENTAL ONTOLOGICAL
CHOICES IN DOLCE
• CONCRETE: ‘rock’
– Exists in space / time
• ABSTRACT: ‘law’
– Does not exists in space time
FUNDAMENTAL ONTOLOGICAL
CHOICES IN DOLCE
• ENDURANT vs PERDURANT
– ENDURANT OBJECTS: have a stable identity over a
period of time (e.g., concrete objects)
– PERDURANT OBJECTS: events that occur and then
exist no more
FUNDAMENTAL ONTOLOGICAL
CHOICES IN DOLCE
• QUALITIES
– The particular qualities of specific objects (e.g.,
the specific color of this specific slide, the
particular weight of this particular laptop, etc)
– Each quality associated with a QUALITY SPACE that
specifies the range of values that quality may take
FORMALIZATIONS OF RELATIONS
• Arguably most of the work on formal ontology
is concerned with the formalization of
RELATIONS
– PARTS
– SPACE
– TIME
PART-OF RELATION(S)
• A great variety of relations between objects
could be called ‘part’:
– My hand is part of my body
– The handle of the door
– The top of the cupboard
– This dish is made up of pepper and cod
– This atom has one electron
A SINGLE PART-OF RELATION?
• In MEREOLOGY (Lesniewski, 1927-31; Link,
1983; Simons, 1987); a single transitive part-of
relation is proposed
• Problems: intransitivity
– Marguerite’s tail is part of Marguerite the cow
– Marguerite the cow is part of the herd
– But: Marguerite’s tail is not part of the herd
Winston et al’s classification
• Winston et al (1987) distinguish between six
types of part relation:
– COMPONENT-INTEGRAL OBJECT (handle / cup)
– PORTION-WHOLE(slice / pie)
– SUBSTANCE-WHOLE(steel / bike)
– MEMBER-COLLECTION (tree / forest)
– FEATURE-ACTIVITY (paying / shopping)
– PLACE-AREA (oasis / desert)
VIEU & ARNAGUE 2007
• Vieu & Arnague show that many of the ‘part-of’
relations can be distinguished using a limited number
of categories:
– PLURALITY
• Both ELEMENT-COLLECTION and SUB-COLLECTION –COLLECTION
require one (or two) of the relata to be collections
– SUBSTANCE
• PORTION-WHOLE and SUBSTANCE-WHOLE require one of the
relata to be a substance
• This leaves out
– ‘PART’ proper, Component-Integral Whole (CIW)
– Temporal and spatial part
COMPONENT-INTEGRAL-WHOLE
• Main claim: an account of the ‘proper’ part
relation requires an account of
FUNCTIONALITY
– Part of what makes a object a ‘hand’ or a ‘wheel’
is the function it performs
– Previous accounts: Wright, Cummins, Searle
• Wright: ‘proper function’ analyzed in terms of evolution
– Problem: doesn’t apply to non-biological entities
• Cummins: the function of a pigeon’s wing with respect
to some analytical account of the pigeon’s capacity to
fly is to generate lift and propulsion
LEXICAL TYPES
• Contrasts such as
– The motor is part of the car
– The motor is part of the vehicle
– ?? The motor is part of the ARTEFACT
• Suggest to Vieu & Arnague that CIW is a
relation between LEXICAL TYPES not
denotations
– CIW-direct(x,X,y,Y,t)
CIW: DEFINITION
PHYSICAL PART
INDIVIDUAL FUNCTIONAL DEPENDENCE
GENERIC FUNCTIONAL DEPENDENCE
CLASSIFIED AS
OTHER AREAS OF RESEARCH IN
FORMAL ONTOLOGY
• Time
• Space
• Causality
ONTOLOGIES ON THE WEB: THE
SEMANTIC WEB
• The Semantic Web (Berners-Lee et al, 2001) is
a proposal to specify the type of objects
mentioned in a Web page
AN EXAMPLE OF SEMANTICALLY MARKED PAGE
JIM HENDLER’S PAGE, SEMANTIC WEB
INFO
<BODY>
<INSTANCE KEY="http://www.cs.umd.edu/users/hendler/">
<USE-ONTOLOGY ID="cs-dept-ontology" VERSION="1.0" PREFIX="cs" URL=
"http://www.cs.umd.edu/projects/plus/SHOE/cs.html" />
<CATEGORY NAME="cs.Professor" FOR="http://www.cs.umd.edu/users/hendler/"/>
<RELATION NAME="cs.member">
<ARG POS=1 VALUE="http://www.cs.umd.edu/projects/plus/">
<ARG POS=2 VALUE="http://www.cs.umd.edu/users/hendler/">
</RELATION>
<RELATION NAME="cs.name">
<ARG POS=2 VALUE="Dr. James Hendler">
</RELATION>
<RELATION NAME="cs.doctoralDegreeFrom">
<ARG POS=1 VALUE="http://www.cs.umd.edu/users/hendler/">
<ARG POS=2 VALUE="http://www.brown.edu">
</RELATION>
<RELATION NAME="cs.emailAddress">
<ARG POS=2 VALUE="hendler@cs.umd.edu">
</RELATION>
…..
</INSTANCE>
<b>As of January 1, 2007 Professor Hendler has moved from the University of Maryland to <a
href="http://www.rpi.edu">Rensselaer Polytechnic Institute</a></b>.
SEMANTIC WEB INGREDIENTS
• XML as a language of markup
• RDF as the basic tool for representing
information
• OWL (Web Ontology Language) to describe
concepts, attributes, and relations
• One or more ontologies
RESOURCE DESCRIPTION FRAMEWORK
(RDF)
• A language to describe statements of the
form: <RESOURCE, PROPERTY, VALUE>
‘Il presidente Ciampi vive a Roma’
RDF EXAMPLE
<?xml version='1.0'?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:wikipedia="http://it.wikipedia.org/wiki/"
xmlns:wikidizionario="http://it.wiktionary.org/wiki/">
<rdf:Description rdf:about="http://www.quirinale.it/presidente/ciampi.htm">
<wikidizionario:vivere
rdf:resource="http://www.comune.roma.it/index.asp"/>
<wikipedia:codice_fiscale> CMPCLZ20T09E625V
</wikipedia:codice_fiscale>
</rdf:Description>
</rdf:RDF>
OWL: A LANGUAGE TO DESCRIBE
ONTOLOGIES
• A series of languages allowing increasingly
more complex descriptions
– OWL-LITE: taxonomies, restrictions
– OWL-DL: Description Logics (see next week)
– OWL-FULL: Maximum expressivity
OWL
<owl:Class rdf:ID="ProteinComplex">
<owl:disjointWith> <owl:Class rdf:ID="SiteGroup"/>
</owl:disjointWith>
<owl:disjointWith> <owl:Class rdf:about="#Chains"/>
</owl:disjointWith>
<owl:disjointWith> <owl:Class rdf:about="#Residues"/>
</owl:disjointWith>
READINGS
• Sowa, Knowledge Representation, Brooks &
Cole, chapter 2
• Vieu & Arnague (2007), Part-of Relations,
Functionality and Dependence, In Aurnague,
M.; Hickmann, M. and Vieu, L. (eds.), The
Categorization of Spatial Entities in Language
and Cognition, John Benjamins, p. 307-337.
Download