jan26and31

advertisement
CS 4100 Artificial Intelligence
Prof. C. Hafner
Class Notes Jan 26, 2012
Topics
• More about assignment 3
• Negation by failure and Horn Clause databases
– Closed world assumption
• First order logic continued
• Wumpus world model using FOL
Jan 31
• A few more details about hw3
– Test data available
– loadInitialKB and processPercepts functions
•
•
•
•
Return an discuss assignments 1 and 2
Converting FOL sentences to Normal Form
Unification
Automated reasoning in FOL: resolution, forward chaining,
backward chaining
Sketch of Forward Chaining Algorithm (version 1
assumes the initial KB is fully expanded)
ALGORITHM (recursive):
processPercepts(‘percepts file’)
uses KBase -- a knowledge base of definite clauses
for each new percept p
PLForwardChain(p) #use a recursive "helper function"
PLForwardChain(percept)
if percept is already in KBase, return
else
add percept to KBase
for r in rules where conclusion of r is not already in KBase
if percept is a premise of r and all the other
premises of r are known
PLForwardChain(conclusion of r)
Sketch of Forward Chaining Algorithm (version 2
drops this assumption)
processPercepts(‘percepts file’)
uses KBase -- a knowledge base of definite clauses
for each new percept p
PLForwardChain(p) #use a recursive "helper function"
PLForwardChain(percept)
if percept is already in KBase, return
else
add percept to KBase
for r in rules where conclusion of r is not already in KBase
if percept is a premise of r and allTrue(premises of r)
#all premises are known or provable
PLForwardChain(conclusion of r)
allTrue(premises)
for p in premises
if p is already in Kbase continue
if p is not provable return false
return true
Return to FOL: Meaning and truth (review)
• Sentences of FOL are true with respect to a model and an
interpretation
• A model for a FOL language is a “world” of objects (domain elements)
and relations among them (compare with propositional logic model)
• Interpretation I specifies referents for
constant symbols
→
objects
predicate symbols
→
relations
function symbols
→
functions
• For an atomic sentence, the interpretation I(P(term1,...,termn) )=
true iff the objects I(term1)..., I(termn) are in the relation I(P)
•
Meaning and truth in first-order logic (cont.)
• Complex sentences: truth is defined using the same
truth tables: I(S1  S2) = true iff I (S1) = true and
I (S2) = true.
• I(x [S]) = true iff for every object o in the model
I (S[x/C]) = true whenI (C) = o
• I( x [S]) = is true iff there is at least one object o in
the model such that:
I (S[x/C]) = true whenI (C) = o
Models for FOL: Example
symbols: constant
relation
function
Quantification examples:
• <variables> <sentence>
Everyone at NU is smart:
x [ At(x,NU)  Smart(x) ]
• x P is true in a model m iff P is true with x being
each possible object in the model
• Roughly speaking, equivalent to the conjunction of
all instantiations of P
At(KingJohn,NU)  Smart(KingJohn)
 At(Richard,NU)  Smart(Richard)
 At(NUS,NU)  Smart(NUS)
A common mistake to avoid
• Typically,  is the main connective with 
• Common mistake: using  as the main connective
with :
x At(x,NU)  Smart(x)
means “Everyone is at NU and everyone is smart”
Quantification examples (cont.)
• <variables> <sentence>
• Someone at NU is smart:
• x [At(x,NU)  Smart(x)]
• x P is true in a model m iff P is true with x being
some possible object in the model
• Roughly speaking, equivalent to the disjunction of all
instantiations of P
At(KingJohn,NU)  Smart(KingJohn)
 At(Richard,NU)  Smart(Richard)
 At(NU,NU)  Smart(NU)
 ...
Another common mistake to avoid
• Typically,  is the main connective with 
• Common mistake: using  as the main connective
with :
x At(x,NU)  Smart(x)
is true if there is no one who at NU!
•
•
•
•
Properties of quantifiers
x y is the same as y x
x y is the same as y x
x y is not the same as y x
x y Loves(y, x)
– “There is a person who is loved by everyone in the world”
• x y Loves(y, x)
– “Everyone in the world is loved by at least one person”
• Quantifier duality: each can be expressed using the
other(cf DeMorgan laws)
• x Likes(x,IceCream) == x Likes(x,IceCream)
• x Likes(x,Broccoli) == x Likes(x,Broccoli)
Equality
• term1 = term2 is true under a given interpretation if
and only if term1 and term2 refer to the same object
• E.g., definition of Sibling in terms of Parent:
x,y Sibling(x,y)  [(x = y)  m,f [  (m = f) 
Parent(m,x)  Parent(f,x)  Parent(m,y)  Parent(f,y)]]
• We will use a different notation for equality: =(x, y)
– makes programming simpler
A model M for the kinship domain
• Individuals: J K L M N O P Q R
• Functions: mom[1] : mom(N)  M
• Relations[arity]
– fem[1] = {M, Q}
– par[2] = {[M, N], [N, R] . . }
– sib[2] = {[M, O], [P, J], [J, P]}
--------------------Interpretation I ----------------------• Constants: John, Mary, Sue, Tom .. ..
– I(Mary) = M, I(Sue) = Q, . . .
• Function symbol: Mother, I(Mother) = mom
• Relation symbols: Female, Parent, Sibling
I(Female) = fem, I(Parent) = par, I(Sibling) = sib
Using FOL
The kinship domain:
• Brothers are siblings
x,y Brother(x,y)  Sibling(x,y)
• “Sibling” is symmetric
x,y Sibling(x,y)  Sibling(y,x)
• One's mother is one's female parent
m,c =(Mother(c) , m)  (Female(m)  Parent(m,c))
• Some mothers are over 40 years old
 m, x [=(Mother(x), m) ^ > (Age(m), 40) ]
Knowledge Engineering: Choice of
Representations
• Human(Bob) vs. ISA(Bob, Human)
• Green(B21) vs. Color(B21, Green)
The choice affects the generality at which concepts can
be expressed
Inheritance rule:
x,y,z ISA(x, y) ^ ISA(y, z)  ISA(x, z)
Two blocks are the same color:
x Color(B21, x) ^ Color(B22, x)
Informal quiz on use of FOL to represent
“common sense” knowledge
•
•
•
•
•
•
All apples are red
Some apples are red (“some” means at least one)
All apples contain (some) worms
Some apples contain (some) worms
Every person is mortal
Every person is male or female (but not both)
Wumpus world in FOL
• First step: define constants, function symbols,
predicate symbols to express the facts
• Percept(data, t) means: at step t, the agent perceived
the data where data is a vector:
– [Stench, Breeze, Glitter]
– Ex: Percept([None, Breeze, None],2]
• At(Agent, s, t) means: agent is at square s at step t
– Ex: At(Agent, [2,1], 2]
Some Wumpus axioms
Axiom to interpreting perceptions in context
x,t At(Agent, x, t) ^ Breeze(t)  Breezy(x)
Definitional axiom:
s,g,t Percept([s, Breeze, g], t)  Breeze(t)
Diagnostic Axiom
x Breezy(x)  z Adjacent(z, x) ^ Pit(z)
Causal Axiom:
z Pit(z)  (x Adjacent(z, x)  Breezy(x))
World model axioms: Adjacent([1,1],[2,1]) etc.
x,y Adjacent(x, y)  Adjacent(y,x)
Interacting with FOL KBs
• Suppose a wumpus-world agent is using an FOL KB and
perceives a smell and a breeze (but no glitter) at t=5:
Tell(KB,Percept([Smell,Breeze,None],5)) – use forward chaining
Ask(KB,a BestAction(a,5)) - use backward chaining
BC Query: does the KB entail some best action at t=5?
• Answer: {a/Shoot} ← substitution (binding list)
• Given a sentence S and a substitution σ,
• Sσ denotes the result of plugging σ into S; e.g.,
S = Smarter(x,y)
σ = {x/Sue,y/Bill}
Sσ = S {x/Sue,y/Bill} = Smarter(Sue,Bill)
• Ask(KB,S) returns σ such that KB╞ Sσ
Knowledge engineering in FOL
1. Identify the task
2. Assemble the relevant knowledge
3. Decide on a vocabulary of predicates, functions,
and constants (a logical language L)
4. Encode general knowledge about the domain
5. Encode a description of the specific problem
instance
6. Pose queries to the inference procedure and get
answers
7. Debug the knowledge base
Negation by failure and CWA
A closed world is a world where every fact that is not
known is false.
Real-world examples: databases
Query: Does American Airlines fly from Boston to Tampa?
If no DB records of such flights answer NO
(consider a Horn clause KB of food “likes”)
Query: Does Sam like cheeseburgers?
not a known “fact”
not provable by “rules”
no way to prove he does not, but we say “NO”
Conversion to CNF
• Everyone who loves all animals is loved by someone:
x [ y [Animal(y)  Loves(x,y)]  y Loves(y,x) ]
• 1. Eliminate biconditionals and implications
x [ y [Animal(y)  Loves(x,y) ]  y Loves(y,x) ]
• 2. Move  inwards: x p ≡ x p,  x p ≡ x p
x [y [ (Animal(y)  Loves(x,y))]  y Loves(y,x) ]
x [y [Animal(y)  Loves(x,y) ]  y Loves(y,x) ]
No more negated quantifiers
Conversion to CNF contd.
3. Rename variables: each quantifier should use a
different one
x [y [Animal(y)  Loves(x,y) ]  y Loves(y,x) ]
x1 [ y1 [ Animal(y1)  Loves(x1,y1) ]  y2 Loves(y2,x1) ]
4. Skolemize: Each existential variable is replaced by a Skolem
function of the enclosing universally quantified variables:
x1 [ (Animal(F1(x1))  Loves(x1,F1(x1)))  Loves(F2(x1),x1) ]
5. Drop universal quantifiers:
(Animal(F1(x))  Loves(x,F1(x)))  Loves(F2(x),x)
6. Distribute  over 
Animal(F1(x))  Loves(F2(x),x)
Loves(x,F1(x) )  Loves(F2(x),x)
Class exercise
Inference in FOL – Chapter 9
• Theoretical foundations
– Inference by universal and existential instantiation
– Unification
– Resolution viewed as Generalized Modus Ponens
• Practical implementation (forward and backward
chaining)
Notation
A substitution is a set of variable-term pairs:
{x/term, y/term, . . . }, often referred to using the
symbol θ [theta]. No variable can occur more than once.
For any term or formula A:
Subst(θ, A) also written Aθ == the result of replacing each
variable in A with the corresponding term. A term is a
constant symbol, a variable symbol, or a function symbol
applied to 0 or more terms.
Def: A ground term is a term with no variables
Def: A ground sentence is a sentence with no free variables
Inference by Universal instantiation (UI)
• Every instantiation of a universally quantified sentence is entailed
by it:
v α
Subst({v/g}, α)
for any variable v and ground term g
• E.g., x King(x)  Greedy(x)  Evil(x) yields:
King(John)  Greedy(John)  Evil(John)
King(Richard)  Greedy(Richard)  Evil(Richard)
King(Father(John))  Greedy(Father(John))  Evil(Father(John))
.
Inference by Existential instantiation (EI)
• For any sentence α, variable v, and constant symbol k
that does not appear elsewhere in the knowledge
base:
v α
Subst({v/k}, α)
• E.g., x Crown(x)  OnHead(x,John) yields:
Crown(C1)  OnHead(C1,John)
provided C1 is a new constant symbol, called a
Skolem constant
To implement universal instantiation
• All humans are mortal
• Jack is human
---------------------------• Jack is mortal
R1. x Human(x)  Mortal(x)
F1. Mortal(Jack)
Let R1 be p  q, Let F1 be be p’
Modus ponens for FOL: If p and p’ unify, conclude q’
Unify means we can match p and p’ by a substitution.
We then apply the same substitution to q to get q’
Unification
• Def: Two formulas A and B unify if there is a
substitution θ such that Aθ = Bθ.
• Ex: To unify A = Knows(John, x) and B = Knows(y, Mary)
θ = {y/John, x/Mary }
• θ is not unique!
• To unify A = Knows(John,x) and B = Knows(y,z),
θ1 = {y/John, x/z } or θ2 = {y/John, x/Sue, z/Sue}
• The first unifier is more general than the second.
• There is a single most general unifier (MGU) that is
unique up to renaming of variables.
MGU = { y/John, x/z } or {y/John, z/x }
Most general unifier
• Def: If θ1 is a unifier for formulas A and B, it is a
MOST GENERAL UNIFIER (MGU) iff:
– There is no other unifier θ2 for A and B such that
Aθ1 subsumes Aθ2
• A formula F subsumes a formula G if there is a nontrivial substitution  such that F = G
• A θ1 = Knows(John, z)
Aθ2 = Knows(John, Sue)
Aθ1 subsumes Aθ2 therefore θ2 = {y/John, x/Sue, z/Sue} is not
a MGU.
Note: what is  ??
Class Exercise: Unification Examples
Unify(α,β) = θ if αθ = βθ
p
Knows(John,x)
Knows(John,x)
Knows(John,x)
Knows(John,x)
q
Knows(John,Jane)
Knows(y,Barak)
Knows(y,Mother(y))
Knows(x,Barak)
θ
Unification
• Unify(α,β) = θ if αθ = βθ
p
q
Knows(John,x) Knows(John,Jane)
Knows(John,x) Knows(y,Barak)
Knows(John,x) Knows(y,Mother(y))
Knows(John,x) Knows(x,Barak)
θ
{x/Jane}}
Unification
• Unify(α,β) = θ if αθ = βθ
p
q
Knows(John,x) Knows(John,Jane)
Knows(John,x) Knows(y,Barak)
Knows(John,x) Knows(y,Mother(y))
Knows(John,x) Knows(x,Barak)
θ
{x/Jane}}
{x/Barak,y/John}}
Unification
•
•
• Unify(α,β) = θ if αθ = βθ
p
Knows(John,x)
Knows(John,x)
Knows(John,x)
Knows(John,x)
q
Knows(John,Jane)
Knows(y,Barak)
Knows(y,Mother(y))
Knows(x,Barak)
θ
{x/Jane}}
{x/Barak,y/John}}
{y/John,x/Mother(John)}}
Unification
•
•
• Unify(α,β) = θ if αθ = βθ
p
Knows(John,x)
Knows(John,x)
Knows(John,x)
Knows(John,x)
q
Knows(John,Jane)
Knows(y,Barak)
Knows(y,Mother(y))
Knows(x,Barak)
θ
{x/Jane}}
{x/Barak,y/John}}
{y/John,x/Mother(John)}}
{fail}
The unification algorithm (Fig. 9.1)
The unification algorithm (cont.)
Application to reasoning
Modus ponens says:
Given p q and p
Conclude: q
In FOL:
Given p  q and p’ (where p and p’ unify by θ)
Conclude: qθ
Suppose KB includes:
x King(x)  Greedy(x)  Evil(x)
King(John)
This won’t quite work since we have p1 ^ p2  q
Generalized Modus Ponens (GMP)
(follows from the resolution rule for FOL)
( p1  p2  …  pn q), p1', p2', … , pn'
qθ
p1' is King(John)
p2' is Greedy(y)
θ is {x/John,y/John}
where pi'θ = pi θ for all i
p1 is King(x)
p2 is Greedy(x)
q is Evil(x)
q θ is Evil(John)
• GMP used with KB of definite clauses (exactly one positive
literal)
• All variables assumed universally quantified
• How do we get Greedy(y) in our KB ?
Soundness of GMP
•
Need to show that
p1', …, pn', (p1  …  pn  q) ╞ qθ
provided that pi'θ = piθ for all p
•
Lemma: For any sentence p, we have p ╞ pθ by UI
1. (p1  …  pn  q) ╞ (p1  …  pn  q)θ = (p1θ  …  pnθ  qθ)
2. p1', …, pn' ╞ (p1'  …  pn‘ ) θ ╞ p1'θ  …  pn'θ
3. From 1 and 2, qθ follows by ordinary Modus Ponens
Note: you should know the definitions of a sound inference
procedures and a complete inference procedure.
Forward Chaining in FOL
(with the “explicit knowledge” assumption)
• Assume percepts do not contain variables (may
contain “generated symbol” constants)
• Example: you see an unfamiliar dog in the building:
Percept: Isa(G33, Dog)
Assume KB includes: Isa(x, Dog)  Isa(x, Animal)
• Add new percept to KB if not already believed
• If percept UNIFIES with a rule premise (by some θ)
and if all the other premises pθ are believed, add qθ
to KB.
• θ is {x/G33) and qθ is isa(G33, Animal)
Backward Chaining
• Given a definite clause KB and a query q’:
– For any fact q in KB that unifies with q’, return θ (or
“YES” if θ = { } )
– For any rule in KB whose conclusion q unifies with q’:
• If the rule’s premises p1 θ . . . pn θ can all be proved with
a resulting substitution θ’, return COMPOSE(θ, θ’)
– If no facts or rules result in a substitution, return “NO”
• Note this can return multiple answers!
Simple examples
• KB: Likes(John, Pizza)
Likes(Mary, Pizza)
Likes(Sam, IceCream)
• Query: Likes(John, Pizza) return YES
• Query: Likes(Sam, Pizza) return NO (justified by CWA)
• Query: Likes(x, Pizza) return a list of substitutions
{ {x/John} , {x/Mary} }
More complex examples
• Add to KB: Likes(y, Pizza)  Likes(y, Spaghetti)
• Query: Likes(John, Spaghetti)
• Query: Likes(Sam, Spaghetti)
• Query: Likes(x, Spaghetti)
• Query: Likes(x, y)
Set of substitutions:
{ {x/John, y/Pizza}, {x/Mary, y/Pizza}, {x/Sam, y/IceCream},
{x/John, y/Spaghetti}, {x/Mary, y/Spaghetti} }
Backward chaining algorithm
SUBST(COMPOSE(θ1, θ2), p) = SUBST(θ2, SUBST(θ1, p))
Backward Chaining: Example knowledge base
• The law says that it is a crime for an American to sell weapons
to hostile nations. The country Nono, is an enemy of America,
has some missiles, and all of its missiles were sold to it by
Colonel West, who is American.
• Prove that Col. West is a criminal
Example knowledge base contd.
... it is a crime for an American to sell weapons to hostile nations:
American(x)  Weapon(y)  Sells(x,y,z)  Hostile(z)  Criminal(x)
Nono … has some missiles, i.e., x Owns(Nono,x)  Missile(x):
Owns(Nono,M1) and Missile(M1)
… all of its missiles were sold to it by Colonel West
Missile(x)  Owns(Nono,x)  Sells(West,x,Nono)
Missiles are weapons:
Missile(x)  Weapon(x)
An enemy of America counts as "hostile“:
Enemy(x,America)  Hostile(x)
West, who is American …
American(West)
The country Nono, an enemy of America …
Enemy(Nono,America)
Backward chaining example
Backward chaining example
Backward chaining example
Backward chaining example
Backward chaining example
Backward chaining example
Backward chaining example
Backward chaining example
Properties of backward chaining
• Depth-first recursive proof search: space is linear in
size of proof
• Incomplete due to infinite loops
–  fix by checking current goal against every goal on stack
• Inefficient due to repeated subgoals (both success
and failure)
–  fix using caching of previous results (extra space)
• Widely used for logic programming
Resolution: brief summary
• Full first-order version:
l1  ···  lk,
m1  ···  mn
(l1  ···  li-1  li+1  ···  lk  m1  ···  mj-1  mj+1  ···  mn) θ
where Unify(li, mj) = θ for some i, j
• The two clauses are assumed to be standardized apart so that they share
no variables. For example,
Rich(x)  Unhappy(x)
Rich(Ken)
Unhappy(Ken)
with θ = {x/Ken}
• Apply resolution steps to CNF(KB  α); complete for FOL
Resolution proof: definite clauses
Download