Data and Knowledge Representation Lecture 6 Harvard-MIT Division of Health Sciences and Technology

advertisement
Harvard-MIT Division of Health Sciences and Technology
HST.952: Computing for Biomedical Scientists
Data and Knowledge
Representation
Lecture 6
Last Time We Talked About
z Medical
z UMLS
Coding Systems
Today We Will Talk About
z Major
KR schemes
z Semantic
Network
z Frame-based Representation
z Production Rules
Semantic Network
z An
long existing notion: there are different
pieces of knowledge of world, and they
are all linked together through certain
semantics.
Basic Components
z Nodes
z Represent
concepts
z Arcs
z Represent
z Labels
relations
for nodes and arcs
Little Constraint
patient
Interact
Interact
Nurse
physician
Interact
Little Constraint
DSG Site
Link
Link
Instructors’
Homepage
Course Site
Link
Web
Relation
z Directed
or non-directed
z Multiple relations between two concepts
z Can have different properties
z Reflexive
(e.g. concurrence)
z Transitive (e.g. causal)
z Symmetric (e.g. sibling)
z ………..
Some Often Used Relations in
Biomedical Domain
z IS
A
z IS PART OF
z CAUSE OF
z MEASURES
z CO-OCCURS
z …………
Major Limitation
z Lack
z No
of Semantics
formal semantic of the relations
z E.g.
Does “ISA” mean subclass, member, etc?
z Possible
multiple interpretations
z Restricted expressiveness
z E.g.
can not distinguish between instance and
class
Extension
z Extending
expressivity (distinguish
different types of concepts and relations”
z Distinguish
between “some” and “all”
z Distinguish between “existence” and
“intension”
z Distinguish between “definition” and
“assertion”
z Add
semantic rigor
z Map
to logic (Sowa – CG)
UMLS Semantic Network – Concept
Hierarchy
http://www.nlm.nih.gov/research/umls/archive/2002AA/META3.HTML Figure 1
UMLS Semantic Network – Relation
Hierarchy
http://www.nlm.nih.gov/research/umls/archive/2002AA/META3.HTML Figure 2
UMLS Semantic Network – Relation Constraints
http://www.nlm.nih.gov/research/umls/archive/2002AA/META3.HTML Figure 3
Frame-based Network
z Distinguish
instance vs. class
z Hierarchical structure (superclass and
subclass)
z Multiple hierarchy
z Slots
z Member
slot
z Own slot
Slot
z Frame
identifying information
z Relationship between frames
z Descriptors of requirements for frame
match
z Procedural information
z Default information
z Restrictions and constraints
z New instance information
Strength
z Help
organize knowledge hierarchically
z Procedure information
z Support multiple inheritance
Weakness
z Expressiveness
(e.g. quantifier)
z Inheritance
z Sub
classing (override slot value)
z Multiple inheritance
z Large
complex knowledge system
Example: MED
http://www.cpmc.columbia.edu/homepages/dab7001/topics/index.html
Example: Protégé
http://protege.stanford.edu/
Production Rules
z Also
called IF-THEN rules
z Many forms:
z IF
condition THEN action
z IF premise THEN conclusion
z IF proposition p1 and proposition p2 are true
THEN proposition p3 is true
Components
z Rule
base
z Inference engine
z Working memory
Inference
z Modus
ponens
z Forward
z Modus
chaining
tollens
z Background
chaining
Example: MYCIN
IF the identity of the germ is not known
with certainty
AND the germ is gram-positive
AND the morphology of the organism is
"rod"
AND the germ is aerobic
THEN there is a strong probability (0.8) that
the germ is of type enterobacteriacae
Pro and Con
z Pro
z Modular
z Natural
z Con
z Not
efficient
z Not expressive
Exercise
“Mrs Z.N. : 35 yr old farmer labourer from the Kwazulu-
Natal province. She presented with a one week history
of right sided pleuritic chest pain. The onset was sudden.
This was associated with a warm flushing feeling and
dizziness. There was also a pain in the right upper
quadrant of her abdomen. There was no history of
shortness of breath, cough or wheezing. There were no
cardiac symptoms, no symptoms of malaise or loss of
weight. No history of fevers. She was not a smoker and
drank no alcohol. Of note is that she admitted to eating
the entrails of the goats that she keeps.” ­ http://www.wits.ac.za/fac/med/pulmonology/case1.htm
Exercise
z Represent
the information in semantic
network
z Represent the information in frame-based
network
Exercise
The thyroid gland is located at the base of your neck in front of your
trachea (or windpipe). It has two sides and is shaped like a
butterfly.
z The thyroid gland makes, stores, and releases two hormones - T4
(thyroxine) and T3 (triiodothyronine). Thyroid hormones control the
rate at which every part of your body works. This is called your
metabolism. Your metabolism controls whether you feel hot or cold
or tired or rested. When your thyroid gland is working the way it
should, your metabolism stays at a steady pace -not too fast or too
slow.
z If no cancer cells are found, your doctor may prescribe a thyroid
hormone to decrease the size of your nodule. Or, your doctor may
suggest surgery to remove it. If cancer cells are found, further
treatment will be needed. Thyroid cancer usually can be treated
with success.
z
Excise
z Which
representation scheme to choose?
Knowledge Representation
Process
z Identify
Needs
z Conceptualization
z Formalization
z Implementation
z Evaluation
Reading
z Sowa
Chap. 4
Download