Introducing the Semantic Web Professor James Hendler

advertisement
Introducing the Semantic Web
Professor James Hendler
http://www.cs.umd.edu/~hendler
Co-Director, Maryland Information and Network Dynamics
Laboratory
Semantic Web Agents Project
http://www.mindswap.org
The Evolving Web
Web of
Knowledge
Proof, Logic and
Ontology Languages
DATA/PROGRAMS
Shared terms/terminology
Machine-Machine communication
2010
Resource Description Framework
eXtensible Markup Language
HyperText Markup Language
HyperText Transfer Protocol
Self-Describing Documents
2000
DOCUMENTS
Foundation of the Current Web
1990
J. Hendler, 2002
Berners-Lee, Hendler; Nature, 2001
3
Web Semantics
Semantic Web LayerCake (Berners-Lee, 99;Swartz-Hendler, 2001)
J. Hendler, 2002
4
Can’t we just use XML?
This is what a web-page in natural language
looks like for a machine
XML helps
XML allows “meaningful tags” to be added to
parts of the text
< name >
< education>
< CV >
< work>
< private >
XML  machine accessible meaning
But to your machine,
the tags look like this….
name >
< name
<education>
< education>
< CV
CV >
<work>
< work>
<private>
< private >
Schemas take a step in the right direction
Schemas help….
< CV >
private
…by relating
common terms
between documents
But other people use other schemas
Someone else has one like this….
<name>
name >
<<educ>
education>
<> >
< work
<<>
private >
CV> >
<<CV
The “semantics” isn’t there
< CV >
private
…which don’t fit in
KR provides “external” referents to merge on
nme
CV
CV
work
vate
educ
CV
ed
uc
<name
< name >>
<<educ
education> >
SW languages add mappings
And structure.
<> >
< work
<<
private>>
<< CV
CV >
Which is what the web was meant to be!!

"This is a pity, as in fact documents on the web describe real objects and imaginary concepts, and
give particular relationships between them... For example, a document might describe a person.
The title document to a house describes a house and also the ownership relation with a person. ...
This means that machines, as well as people operating on the web of information, can do real
things. For example, a program could search for a house and negotiate transfer of ownership of
the house to a new owner. The land registry guarantees that the title actually represents reality.”
 Tim Berners-Lee plenary presentation at WWW Geneva, 1994
J. Hendler, 2002
12
Putting semantics on the web
J. Hendler, 2002
13
(and making it machine-readable)
J. Hendler, 2002
14
J. Hendler, 2002
15
Event:title
<daml:ObjectProperty rdf:ID="photograph">
<rdfs:domain rdf:resource="#Picture"/>
<rdfs:range rdf:resource= …#person"/>
</daml:ObjectProperty>
Event:WebPage
< > rdf:type photo:Photograph,
Photo:File http://…/images#image1,
Photo:topic :event1#event:speaker.
Event1 a Event:event;
date “May 7-11”,
speaker http://…#timbl.html
Title “WWW 2002…”
TimBL rdf:type w3c-ont:person;
name “Tim Berners-Lee”
…
J. Hendler, 2002
<s:Class
rdf:about="http://www.semanticweb.org/o
ntologies/swrc-onto-2000-0910.daml#Conference">
<s:comment>
describes a generic conceptabout events
</s:comment>
<s:subClassOf
rdf:resource="http://www.semanticweb.or
g/ontologies/swrc-onto-2000-0910.daml#Event"/>
<a:disjointFrom
rdf:resource="http://www.semanticweb.or
g/ontologies/swrc-onto-2000-0910.daml#Workshop"/>
<a:restrictedBy
rdf:resource="http://www.semanticweb.or
g/ontologies/swrc-onto-2000-0910.daml#genid18"/>
<rdf:Description rdf:about="http://www.w3.org/2001/03/earl/0.95#Person">
<rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/>
<rdfs:subClassOf rdf:resource="http://www.w3.org/2001/03/earl/0.95#Assertor"/>
</rdf:Description>
16
On the Web -- links are critical!
Web page
Any Web Resource
<a href=
URI>
HTML
<a href=“http://…”>
On the Semantic WEB -- links are critical!
URI
URI
RDF
J. Hendler, 2002
URI
RDF is like the web! And…
17
RDF graphs resemble semantic nets
DOC1
<mind:Person rdf:id=“Hendler”>
<mind:title jobs:Professor>
<jobs:placeOfWork http://www.cs.umd.edu>
</mind:Person>
Jobs: Professor
Mind:
Mind:title
DOC1
Jobs:
J. Hendler, 2002
Hendler
Jobs:placeOfWork
Web Page
http://www…
18
Semantics on the WEB
 RDF, like the WWW itself, is not “separable”
 Thinking about the ontologies, without considering
 The links to other terms
 The instances that link to them
 The crawling and collecting of ontological terminologues
Is like thinking about the Web without the links!!
Other
titles
Mind:
Other
URIs
Jobs:
J. Hendler, 2002
Other
Professors
Jobs: Professor
Mind:title
DOC1
Other
Pages
Hendler
Jobs:placeOfWork
Other
descriptions
Web Page
http://www…
19
Radically new view of Semantics
= some partial mapping
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
uses
Distributed,partially
mapped,
inconsistent -- but SCALEABLE!
uses
uses
J. Hendler, 2002
20
Real examples
 Examples from
http://dormouse.cs.umd.edu:8080/wiki/cmsc498wiki.wiki
 Students violated every rule in the KR book



Extended existing ontologies
Linked instances directly to terms from multiple ontologies
Mixed “real KR” and NL
 We can learn from their lessons

J. Hendler, 2002
http://dormouse.cs.umd.edu:8080/wiki/assignment1_collected_les.wiki
21
Current Activities
You
are
here
Semantic Web LayerCake (Berners-Lee, 99;Swartz-Hendler, 2001)
J. Hendler, 2002
22
W3C Web Ontology Working Group
 Web Ontology Working Group in the W3C Semantic Web Activity
aimed at “extending the semantic reach of current XML and RDF
meta-data efforts. “
 History





J. Hendler, 2002
DAML+OIL is submitted as a joint committee effort published as a W3C note .
W3C WG Announcement in November 2001 http://lists.w3.org/Archives/Public/www-rdf-logic/2001Nov/0000.html
Weekly teleconferences started in November 2001
First Face to Face Meeting - New Jersey (Lucent), Jan ‘02; 2nd - Amsterdam
April (W3C); 3rd - CA (Fujitsu/Stanford host) July; 4th in Bristol UK (HP Host)
Oct.
Four Working Drafts to date
 Requirements/Use cases - March 2002
 3 Technical Documents - July 2002 (Language renamed OWL)
23
Membership
 Current Working Group includes over 50 members from over 30 organizations.

Chairs
 J. Hendler, MIND Lab UMCP
 G. Schreiber, Univ. of Amsterdam

Industry including:
 Large companies - Daimler Chrysler, IBM, HP, Intel, EDS, Fujitsu, Lucent, Motorola, Nokia,
Philips Electronics, Sun, Unisys
 Newer/smaller companies - IVIS Group, Network Inference, Stilo Technology, Unicorn
Solutions

Government and Not-For-Profits:
 US Defense Information Systems Agency, Interoperability Technology Association for
Information Processing, Japan (INTAP) , Electricite De France, Mitre, NIST

Universities and Research Centers:
 University of Bristol, University of Maryland, University of Southamptom, Stanford
University
 DFKI (German Research Center for Artificial Intelligence), Forschungszentrum Informatik,
Ontoweb

Invited Experts
 Well-known academics from non-W3C members (Hayes, Heflin, Stein, Borden)
J. Hendler, 2002
24
But will it fly?
 DAML+OIL is already the most used ontology language ever!!

http://www.daml.org (3.5M statements on 25,000 web pages)
 Gaining acceptance by web players


Semantic Web Track being offered at WWW 2002
3x more people attended WWW2002 Developer Day on SW than attended KR
 Significant (international) Govt Support

US DARPA/NSF; EU IST Framework 5,6

Japan, Germany, Australia considering significant investments
US National Cancer Institute to publish cancer vocabulary in DAML+OIL

 Much New Startup activity (even in this economic climate)
 Many tools being developed

J. Hendler, 2002
Many of them aimed at developers, not just AI literate types
25
Making Markup Easier
J. Hendler, 2002
26
Machine worries about the syntax
J. Hendler, 2002
27
Use that markup in query/portal interfaces
J. Hendler, 2002
28
Extending ontologies on the fly
J. Hendler, 2002
29
Semantic Web Portals:
The Mosaic of the semantic web?
<XSLT/>
<Oncogene rdf:ID="Oncogene,
MYB"><code>C3682</code><id>3683</id>
<Found_In_Organism
rdf:ID="Human"></Found_In_Organism>
<Gene_Has_Function rdf:ID="Gene
Transcription"></Gene_Has_Function>
<Gene_Has_Function
rdf:ID="Transcriptional
Regulation"></Gene_Has_Function>
<In_Chromosomal_Location rdf:ID="6q22q23"/>
</Oncogene>
J. Hendler, 2002
<Oncogene rdf:ID="Oncogene NMYC">
<code>C17656</code><id>17657</id><Found_In_Organism
rdf:ID="Human"></Found_In_Organism>
<In_Chromosomal_Location rdf:ID="2p24.1"/>
<Gene_Has_Function rdf:ID="Transcriptional Regulation">
</Gene_Has_Function><Gene_Associated_With_Disease
rdf:ID="Neuroblastoma">
</Gene_Associated_With_Disease></Oncogene>
30
Moving to the futureof the web
Semantic Web LayerCake (Berners-Lee, 99;Swartz-Hendler, 2001)
J. Hendler, 2002
31
Web “travel agents”
How many cows are there in Texas?
Query processed: 73 answers found

Google document search finds 235,312 possible page hits.

Http://www…/CowTexas.html claims the answer is 289,921,836

A database entitled “Texas Cattle Association” can be queried for the
answer, but you will need “authorization as a state employee.”

A computer program that can compute that number is offered by the State of
Texas Cattleman’s Cooperative, click here to run program.
...
The “sex network” can answer anything that troubles you, click here for
relief...



J. Hendler, 2002
The “UFO network” claims the “all cows in Texas have been replaced by
aliens
32
Allows new capabilities
J. Hendler, 2002
33
Services off the desktop
J. Hendler, 2002
34
Or perhaps on different desktops…
J. Hendler, 2002
35
Web Agents need Service
Descriptions
J. Hendler, 2002
36
Semantic Web Service Description
- <daml:Class rdf:ID="CreateAcct">
<rdfs:subClassOf
rdf:resource="http://www.daml.org/services/damls/2001 /10/Process.daml#AtomicProcess" />
- <rdfs:subClassOf>
- <daml:Restriction daml: cardinality="1">
<daml:onProperty rdf:resource="#createAcctInfo" />
</ daml:Restriction>
</ rdfs:subClassOf>
- <rdfs:subClassOf>
- <daml:Restriction daml: cardinality="1">
<daml:onProperty rdf:resource="#createAcctOutput" />
</ daml:Restriction>
</ rdfs:subClassOf>
</ daml:Class>
- <rdf:Property rdf:ID="createAcctInf o">
<rdfs:subPropertyOf
rdf:resource="http://www.daml.org/services/damls/2001 /10/Process.daml#input" />
<rdfs:domai n rdf:resource="#CreateAcct" />
<rdfs:range rdf:resource="#AcctInf o" />
</ rdf:Property>
- <rdf:Property rdf:ID="createAcctOutput">
<rdfs:subPropertyOf
rdf:resource="http://www.daml.org/services/damls/2001 /10/Process.daml#output" />
<rdfs:domai n rdf:resource="#CreateAcct" />
<rdfs:range rdf:resource="#CreateAcctOutputType" />
</ rdf:Property>
J. Hendler, 2002
37
Use Semantics for Composition
Translate my symptoms from
French and find me a pharmacy
that has the necessary medicine
(then compute how to get there
and print the directions)
J. Hendler, 2002
Print the directions to a pharmacy
which has a medicine that cures
the symptoms that I will tell you
(in French)
38
Or, translate to Planning Operators
- <daml:Class rdf:ID="DeliveryType">
- <daml:oneOf rdf:parseType="daml:collection">
Ź <DeliveryType rdf:ID="FedExOneDay" />
Ź <DeliveryType rdf:ID="FedEx2-3day" />
Ź <DeliveryType rdf:ID="UPS" />
Ź <DeliveryType rdf:ID="Ordin aryMail" />
Ź </ daml:oneOf>
Ź </ daml:Class>
The atomic process SpecifyDeliveryD etailsΣ can be transla ted into a SH OP2 operator
with cor responding pr econd itions as follows:
(:operator (!SpecifyDeliveryDetails ?deli veryAddress ?pa ckagingSelection
?deliveryTypeSelection)
((not (same ?deliveryAdd ress ni l))
(not (same ?packagingSelection nil))
(not (same ?deli veryTypeSelection nil))
(eval (stringp ?del iveryAdd ress))
(eval (member ‘?packagingSelection ‘(Gif twrap Ordinary)))
(eval (member ‘?deliveryTypeSelection ‘(FedExOneDay FedEx23day UPS OrdinaryMail))))
()
())
J. Hendler, 2002
39
For goal-based service composition
J. Hendler, 2002
Buy the French version of a book from amazon.fr and
have it sent to Mom’s address
40
Services need Web Logics
J. Hendler, 2002
41
Web of Trust
 Claims can be verified if there is supporting evidence from
another (trusted) source
 We only believe that someone is a professor at a university if
the university also claims that person is a professor, and the
university is on a list I trust.
believe(c1) :- claims(x, c1) ^ predicate(c1, professorAt) ^
arg1(c1, x) ^ arg2(c1, y) ^ claims(c2, y) ^
predicate(c2, professorAt) ^ arg1(c2, x) ^
arg2(c2, y) ^ AccreditedUniversity(y)
AcknowledgedUniversity(u) :- link-from(“http://www.cs.umd.edu/university-list”,u)
Notice this one
J. Hendler, 2002
42
Distributed Trust
J. Hendler, 2002
43
Conclusion
 It is no longer a question of whether the semantic web will
come into being, it is already here!
 We’re already well past the starting gate



Web ontologies, term languages, “shims” to DB and services, research in
proofs/rules/trust
Standardization providing a common denominator for KR researchers as
well as web developers
Small companies starting to form, Big companies starting to move
 The current environment is open, encouraging, moving fast,
and exciting as heck

Come play!
J. Hendler, 2002
44
Download