Quantified Boolean Formula (QBF) Reasoning Bart Selman, Carla Gomes, Ashish Sabharwal

Quantified Boolean Formula (QBF) Reasoning Bart Selman, Carla Gomes, Ashish Sabharwal Cornell University Feb 13, 2007 Tutorial for Dr. Charles Holland and Dr. Tom Wagner Tutorial Roadmap 1. Automated reasoning – The complexity challenge – State of the art in Boolean reasoning 2. SAT-based reasoning – – – – Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 2 Tutorial Roadmap 1. Automated reasoning  The complexity challenge – State of the art in Boolean reasoning 2. SAT-based reasoning – – – – Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 3 The Quest for Machine Reasoning Objective: Develop foundations and technology to enable effective, practical, large-scale automated reasoning. Machine Reasoning (1960-90s) Computational complexity of reasoning appears to severely limit real-world applications Current reasoning technology Revisiting the challenge: Significant progress with new ideas / tools for dealing with complexity (scale-up), uncertainty, and multi-agent reasoning 4 General Automated Reasoning Problem instance Domain-specific e.g. logistics, war games, chess, space missions, planning, scheduling, ... Model Generator (Encoder) General Inference Engine Solution Generic applicable to all domains within range of modeling language Research objective Impact Better reasoning and modeling technology Faster solutions in several domains 5 Reasoning Complexity • EXPONENTIAL COMPLEXITY: INHERENT AN worst case N= No. of Variables/Objects A= Object states Simple Example: Knowledge Base Variables (binary) X1 = email_ received X2 = in_ meeting X3 = urgent X4 = respond_to_email X5 = near_deadline X6 = postpone X7 = air_ticket_info_request X8 = travel_ request X9 = info_request Rules: 1. X1 & (not X2) & X3  X4 2. X2  not X4 3. 4. 5. 6. 7. X5  X3 or X6 X7  X8 X8  X9 X8  X5 X6  not X9 • TIME/SPACE Granularity   Object states • Current implementations trade time with soundness Question: Given: X1= true; X2 = false; X7=true. What is X4 = ? Answer Development: Inference Chain Step 1: X7  X8 (rule 4) Step 2: X8  X5 (rule 6) Step 3: X5  X3 or X6 (rule 3) Search for rules to apply M For N variables: 2N cases drive complexity! Case A: X6 = true Step 4: X6  not X9 Step 5: X9  not X8 Step 6: Contradiction Backtrack to M Case B: X3 = true X1 & (not X2) & X3  X4 Step 7: X4 = true (Rule 1) Check Contradictions 6 Exponential Complexity Growth: The Challenge of Complex Domains Case complexity Note: rough estimates, for propositional reasoning 1M War Gaming 5M 10301,020 0.5M VLSI 1M Verification 10150,500 100K 450K 106020 Military Logistics 20K Chess (20 steps deep) 100K No. of atoms on the earth 103010 Seconds until heat death of sun 1047 Protein folding Calculation (petaflop-year) 10K 50K Deep space mission control 100 Car repair diagnosis 200 1030 100 10K 20K 100K 1M Variables Rules (Constraints) [Credit: Kumar, DARPA; Cited in Computer World magazine] 7 Tutorial Roadmap 1. Automated reasoning  The complexity challenge  State of the art in Boolean reasoning 2. SAT-based reasoning – – – – Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 8 Progress in Last 15 Years Focus: Combinatorial Search Spaces Specifically, the Boolean satisfiability problem, SAT Significant progress since the 1990’s. How much? • Problem size: We went from 100 variables, 200 constraints (early 90’s) to 1,000,000 vars. and 5,000,000 constraints in 15 years. Search space: from 10^15 to 10^300,000. [Aside: “one can encode quite a bit in 1M variables.”] • Tools: 50+ competitive SAT solvers available Overview of state of the art: Plenary talk at IJCAI-05 (Selman); Gomes and Selman, Nature ’05 9 How Large are the Problems? A bounded model checking problem: 10 SAT Encoding (automatically generated from problem specification) i.e., ((not x1) or x7) ((not x1) or x6) etc. x1, x2, x3, etc. are our Boolean variables (to be set to True or False) Should x1 be set to False?? 11 10 Pages Later: … i.e., (x177 or x169 or x161 or x153 … x33 or x25 or x17 or x9 or x1 or (not x185)) clauses / constraints are getting more interesting… Note x1 … 12 4,000 Pages Later: … 13 Finally, 15,000 Pages Later: Search space of truth assignments: Current SAT solvers solve this instance in under 30 seconds! 14 SAT Solver Progress Solvers have continually improved over time Instance Posit' 94 Grasp' 96 Sato' 98 Chaff' 01 40.66s 1.20s 0.95s 0.02s bf1355-638 1805.21s 0.11s 0.04s 0.01s pret150_25 >3000s 0.21s 0.09s 0.01s dubois100 >3000s 11.85s 0.08s 0.01s aim200-2_0-no-1 >3000s 0.01s < 0.01s < 0.01s 2dlx_..._bug005 >3000s >3000s >3000s 2.90s c6288 >3000s >3000s >3000s >3000s ssa2670-136 Source: Marques-Silva 2002 15 How do SAT Solvers Keep Improving? From academically interesting to practically relevant. We now have regular SAT solver competitions. (Germany ’89, Dimacs ’93, China ’96, SAT-02, SAT-03, SAT-04, SAT-05, SAT-06) E.g. at SAT-2006 (Seattle, Aug ’06): • 35+ solvers submitted, most of them open source • 500+ industrial benchmarks • 50,000+ benchmark instances available on the www This constant improvement in SAT solvers is the key to making, e.g., SAT-based planning very successful. 16 Current Automated Reasoning Tools Most-successful fully automated methods: based on Boolean Satisfiability (SAT) / Propositional Reasoning – Problems modeled as rules / constraints over Boolean variables – “SAT solver” used as the inference engine Applications: single-agent search • AI planning SATPLAN-06, fastest optimal planner; ICAPS-06 competition (Kautz & Selman ’06) • Verification – hardware and software Major groups at Intel, IBM, Microsoft, and universities such as CMU, Cornell, and Princeton. SAT has become the dominant technology. • Many other domains: Test pattern generation, Scheduling, Optimal Control, Protocol Design, Routers, Multi-agent systems, E-Commerce (E-auctions and electronic trading agents), etc. 17 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning 2. SAT-based reasoning  – – – Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 18 Boolean Logic Defined over Boolean (binary) variables a, b, c, … Each of these can be True (1) or False (0) Variables connected together with logic operators: and, or, not (denoted ) E.g. (a or b) is True iff at least one of a and b is True ((c and d)) or f) is True iff either c is True and d is False, or f is True Fact: All other Boolean logic operators can be expressed with and, or, not E.g. (a  b) same as (a or b) Boolean formula, e.g. F = (a or b) and (a and (b or c)) (Truth) Assignment: any setting of the variables to True or False Satisfying assignment: assignment where the formula evaluates to True 19 Boolean Logic: Example F = (a or b) and (a and (b or c)) Note: True often written as 1, False as 0 • There are 23 = 8 possible truth assignments to a, b, c – (a=0,b=1,c=0) – (a=0,b=0,c=1) – … representing (a=False, b=True, c=False) • Exactly 3 truth assignments satisfy F – (a=0,b=1,c=0) – (a=0,b=1,c=1) – (a=1,b=0,c=0) Truth Table for F a b c F 0 0 0 0 0 0 1 0 0 1 0 1 0 1 1 1 1 0 0 1 1 0 1 0 1 1 0 0 1 1 1 0 20 Boolean Logic: Expressivity All discrete single-agent search problems can be cast as a Boolean formula! Variables a, b, c, … often represent “states” of the system, “events”, “actions”, etc. (more on this later, using Planning as a example) Very general encoding language. E.g. can handle • Numbers (k-bit binary representation) • Floating-point numbers • Arithmetic operators like +, x, exp(), log() • … SAT encodings (generated automatically from high level languages) routinely used in domains like planning, scheduling, verification, e-commerce, network design, … Recall Example: Variables X1 = email_ received X2 = in_ meeting X3 = urgent X4 = respond_to_email “event” “state” “action” X5 = near_deadline X6 = postpone X7 = air_ticket_info_request X8 = travel_ request X9 = info_request Rules: 1. X1 & (not X2) & X3  X4 2. X2  not X4 constraint 3. 4. 5. 6. 7. X5  X3 or X6 X7  X8 X8  X9 X8  X5 X6  not X9 21 Boolean Logic: Standard Representations Each problem constraint typically specified as (a set of) clauses: E.g. (a or b), (c or d or f), (a or c or d), … clauses (only “or”, “not”) Formula in conjunctive normal form, or CNF: a conjunction of clauses E.g. F = (a or b) and (a and (b or c)) changes to FCNF = (a or b) and (a or b) and (b or c) Alternative [useful for QBF]: specify each constraint as a term (only “and”, “not”): E.g. (a and d), (b and a and f), (b and d and e), … Formula in disjunctive normal form, or DNF: a disjunction of terms E.g. FDNF = (a and b) or (a and b and c) 22 Boolean Satisfiability Testing The Boolean Satisfiability Problem, or SAT: Given a Boolean formula F, • find a satisfying assignment for F • or prove that no such assignment exists. • • • A wide range of applications Relatively easy to test for small formulas (e.g. with a Truth Table) However, very quickly becomes hard to solve – Search space grows exponentially with formula size (more on this next) SAT technology has been very successful in taming this exponential blow up! 23 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning 2. SAT-based reasoning   – – Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 24 SAT Search Space All vars free Fix one variable to True or False Fix another var Fix a 3rd var Fix a 4th var True False False False True SAT Problem: Find a path to a True leaf node. For N Boolean variables, the raw search space is of size 2N • Grows very quickly with N • Brute-force exhaustive search unrealistic without efficient heuristics, etc. 25 Worst-Case Complexity SAT is an NP-complete problem • Worst-case believed to be exponential (roughly 2N for N variables) • 10,000+ problems in CS are NPcomplete (e.g. planning, scheduling, protein folding, reasoning) • P vs. NP --- $1M Clay Prize However, real-world instances are usually not pathological and can often be solved very quickly with the latest technology! Typical-case complexity provides a more detailed understanding and a more positive picture. exponential polynomial 27 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning 2. SAT-based reasoning    – Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 28 Typical-Case Complexity A key hardness parameter for k-SAT: the ratio of clauses to variables Delete Constraints Add Constraints Problems that are not critically constrained tend to be much easier in practice than the relatively few critically constrained ones [Mitchell, Selman, and Levesque ’92; Kirkpatrick and Selman – Science ’94] 29 Typical-Case Complexity Random 3-SAT as of 2004 Phase transition Linear time algs. Random Walk DP DP ’ GSAT Walksat SP SAT solvers continually getting close to tackling problems in the hardest region! SP (survey propagation) now handles 1,000,000 variables very near the phase transition region 30 Tractable Sub-Structure Can Dominate and Drastically Reduce Solution Cost! Median runtime 2+p-SAT model: mix 2-SAT (tractable) and 3-SAT (intractable) clauses > 40% 3-SAT: exponential scaling  40% 3-SAT: linear scaling! Number of variables (Monasson, Selman et al. – Nature ’99; Achlioptas ’00) 31 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning 2. SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT • Example domain: planning 3. QBF reasoning (extends SAT) – A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 32 SAT Encoding Example: Planning Domain Planning Problem  Propositional CNF formula by axiom schemas Discrete time, modeled by integers • state predicates: indexed by time at which they hold E.g. at_location(x,,loc,i), free(x,i+1), route(cityA,cityB,i) • action predicates: indexed by time at which action begins E.g. fly(cityA,cityB,i), pickup(x,loc,i), drive_truck(loc1,loc2,i) – each action takes 1 time step – many actions may occur at the same step 33 Encoding Rules • Actions imply preconditions and effects fly(x,y,i) •  at(x,i) and route(x,y,i) and at(y,i+1) Conflicting actions cannot occur at same time (A deletes a precondition of B) fly(x,y,i) and yz •  If something changes, an action must have caused it (Explanatory Frame Axioms) at(x,i) and not at(x,i+1) • not fly(x,z,i)  y . route(x,y) and fly(x,y,i) Initial and final states hold at(NY,0) and ... and at(LA,9) and ... 34 Using SAT Solvers for Planning Modeling and Solving a Planning Problem Problem description in high level language axiom schemas instantiate instantiated propositional clauses (manual) length mapping plan interpret satisfying model (fully automatic) SAT engine(s) 36 Planning Benchmark Complexity Logistics domain – a complex, highly-parallel transportation domain E.g. logistics.d problem: o 2,165 possible actions per time slot o optimal solution contains 74 distinct actions over 14 time slots (out of 5 x 10^46 possible sequential plans of length 14) Satplan [Selman et al.] approach is currently fastest optimal planning approach. Winner ICAPS-05 & ICAPS-06 international planning competitions. 39 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning 3. QBF reasoning (extends SAT)  A new range of applications – Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 40 The Next Challenge in Reasoning Technology Multi-Agent Reasoning: Quantified Boolean Formulae (QBF) – Allow use of Forall and Exists quantifiers over Boolean variables – QBF significantly more expressive than SAT: from single-person puzzles to competitive games! New application domains: • • • • Unbounded length planning and verification Multi-agent scenarios, strategic decision making Adversarial settings, contingency situations Incomplete / probabilistic information war gaming But, computationally *much* harder (formally PSPACE-complete rather than NP-complete) Key challenge: Can we do for QBF what was done for SAT solving in the last decade? Would open up a tremendous range of advanced automated reasoning capabilities! 41 Multi-Agent Reasoning A Wide Range of Applications • Logistics planning and scheduling under adversarial / uncertain conditions – Multi-agent contingency planning with performance guarantees • Analysis and validation of distributed agent strategies and coordination – Event logic • Reasoning in rich multi-player (adversarial) settings – War gaming, designing secure data and communication networks QBF Technology for Multi-Agent Reasoning offers: • Performance guarantee / optimality / “worst-case” scenario analysis – NOTE: Worst-case may be rare but can have catastrophic consequences! The Challenge: Overcoming the high computational complexity 42 SAT Reasoning vs. QBF Reasoning SAT Reasoning Scope of technology Worst-case complexity Application areas Research status     Combinatorial search for optimal and nearoptimal solutions NP-complete (hard) planning, scheduling, verification, model checking, … From 200 vars in early ’90s to 1M vars. Now a commercially viable technology. QBF Reasoning     Combinatorial search for optimal and nearoptimal solutions in multi-agent, uncertain, or hostile environments PSPACE-complete (harder) adversarial planning, gaming, security protocols, contingency planning, … From 200 vars in late 90’s to 100K vars currently. Still rapidly moving. 43 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning 3. QBF reasoning (extends SAT)  A new range of applications  Two motivating examples • – – – – 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary network planning, logistics planning Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 44 The Need for QBF Technology SAT technology, while very successful for single-agent search, is not suitable for adversarial reasoning. Must model the adversary and incorporate his actions into reasoning • SAT does not provide a framework for this • In fact, it cannot (more on this later) Two examples next: 1. Network planning: create a data/communication network between N nodes which is robust under failures during and after network creation 2. Logistics planning: achieve a transportation goal in uncertain environments 45 Adversarial Planning: Motivating Example Network Planning Problem: – – – – Input: 5 nodes, 9 available edges that can be placed between any two nodes Goal: all nodes finally connected to each other (directly or indirectly) Requirement (A): final network must be robust against 2 node failures Requirement (B): network creation process must be robust against 1 node failure E.g. a sample robust final configuration: (uses only 8 edges) Side note: Mathematical structure of the problem: 1. (A) implies every node must have degree ≥ 3 (otherwise it can easily be “isolated”) 2. At least one node must have degree ≥ 4 (follows from 1. and that not all 5 nodes can have odd degree in any graph) 3. Need at least 8 edges total (follows from 1. and 2.) 4. If one node fails during creation, the remaining 4 must be connected with 6 edges to satisfy (A) 5. Actually need 9 edges to guarantee construction (follows from 4. because a node may fail as soon as its degree becomes 3) 46 Example: A SAT-Based Sequential Plan Create edge Next move if no failures Final network robust against 2 failures The plan goes smoothly and we end up with the target network, which is robust against any 2 node failures 47 Example: A SAT-Based Sequential Plan Node failures may render the original plan ineffective, but re-planning could help make the remaining network robust. What if the left node fails? Create edge Node failure during network creation Next move if a particular node fails Next move if no failures Final network robust against 2 more failures Can still make the remaining 4 nodes robust using 2 more edges (total 8 used) • Feasible, but must re-plan to find a different final configuration 48 Example: A SAT-Based Sequential Plan Trouble! Can get stuck if • Resources are limited What if the top node fails? (only 9 edges) • Adversary is smart (takes out node with degree 4) • Poor decisions were made early on in the network plan Need to create 4 more edges to make the remaining 4 nodes robust • Stuck! Have already used up 6 of the 9 available edges! 49 Example: A QBF-Based Contingency Plan A QBF solver will return a robust contingency plan (a tree) • Will consider all relevant failure modes and responses (only some “interesting” parts of the plan tree are shown here) …. …. …. …. …. …. …. …. only 8 edges used Create edge Node failure during network creation Next move if a particular node fails Next move if no failures 9 edges needed Final networks robust against 2 more failures only 8 edges used 9 edges needed 50 Another Example: Logistics Planning • Blue nodes are cities, green nodes are military bases (1) (2) SatPlan QbPlan City-1 *At any step commercial player can transport up to 80 civilians (s1)  (s1) City-2 (s1) City-4 (s2) 60p (s1) (s2) City-3 • • operator: “transport” t(who, amount, from, to, step) • parallel actions can be taken at each step • Goal: Send 60 personal from Base-1 to Base-2 in at most 3 steps (20p) + (up 80c) <= 100 (60p) + (80c) > to 100 (civilian transport capacity) Re-planning needed !!! One player: military player, deterministic classic planning, SatPlan • • • Blue edges are commercial transports, green edges are military • Green edges (transports) have a capacity of 60 people, blue edges have a capacity of 100 people (s3) (s3) (s2) Base 1 20p 20p 60p 20p Base 2 (s2) (1) Sat-Plan: t(m, 60, base-1, city-3, 1), t(m, 60, city-3, city-4, 2), t(m, 60, city-4, base-2, 3) Two players: deterministic adversarial planning QB Plan • Military Player (m) is “white player”, Commercial Player (c) is “black player” (Chess analogy). Commercial player can move up to 80 civilians between cities. Commercial moves can not be invalidated. Goal can be read as: “send 60 personal from Base-1 to Base-2 in at most 3 steps whatever commercial needs (moves) are” • If commercial player decides to move 80 civilians from city-3 to city-4 at the second step, we should replan (1). Indeed, the goal can not be achieved if we have already taken the first action of (1) • (2) QB-Plan: t(m, 20, base-1, city-1, 1), t(m, 20, base-1, city-2, 1), t(m, 20, base-1, city-3, 1), t(m, 20, city-1, city-4, 2), t(m, 20, city-2, city-4, 2), t(m, 20, city-3, city-4, 2), t(m, 60, city-4, base-2, 3) 51 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning 3. QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning  – – – Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 52 Quantified Boolean Logic Boolean logic extended with “quantifiers” on the variables – “there exists a value of x in {True,False}”, represented by x – “for every value of y in {True,False}”, represented by y – The rest of the Boolean formula structure similar to SAT, usually specified in CNF form E.g. QBF formula F(v,w,x,y) = v w x y : (v or w or x) and (v or w) and (v or y) Quantified Boolean variables constraints (as before) 53 Quantified Boolean Logic: Semantics F(v,w,x,y,z) = v w x y : (v or w or x) and (v or w) and (v or y) What does this QBF formula mean? Semantic interpretation: F is True iff “There exists a value of v s.t. for both values of w there exists a value of x s.t. for both values of y (v or w or x) and (v or w) and (v or y) is True” 54 Quantified Boolean Logic: Example F(v,w,x,y,z) = v w x y : (v or w or x) and (v or w) and (v or y) Truth Table for F as a SAT formula v w x y F 0 0 0 0 0 0 0 0 1 1 0 0 1 0 0 0 0 1 1 1 0 1 0 0 0 0 1 0 1 0 0 1 1 0 0 0 1 1 1 0 1 0 0 0 0 1 0 0 1 0 1 0 1 0 1 1 0 1 1 1 1 1 0 0 1 1 1 0 1 1 1 1 1 0 1 1 1 1 1 1 Is F True as a QBF formula? Without quantifiers (as SAT): have many satisfying assignments e.g. (v=0, w=0, x=0, y=1) With quantifiers (as QBF): many of these don’t work e.g. no solution with v=0 F does have a QBF solution with v=1 and x set depending on w 55 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning 3. QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning   – – Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 56 QBF Modeling: Building Blocks Similar to SAT • • Boolean Variables: states + actions w-king_at_(2,1)_3 = “white king is at cell (1,2) after time step 3” move_b-rook_(1,1)_(2,1)_2 = “move b-rook from (1,1) to (2,1) at step 2” truck1_at_cityA_2 load_box20_truck1_4 = = “truck #1 is at city A after time step 2” “load box #20 onto truck #1 at time step 4” Constraints: initial state + goal / query + rules of the domain w-king_at_(4,1)_start = “w-king starts out at cell (4,1)” (move_b-rook_(1,1)_(3,1)_2 implies (b-rook_at_(1,1)_2 and empty_(2,1)_2)) = “if b-rook moves from (1,1) to (3,1) at time step 2, then it must be at cell (1,1) then, and cell (2,1) must be empty ” box12_at_cityC_end = “box12 must reach cityC at the end” (drive_truck1_cityA_cityB_1 implies truck1_at_cityA_1) 57 QBF Modeling: The Semantics Example 1: a 4-move chess game There exists a move of the white s.t. for every move of the black there exists a move of the white s.t. for every move of the black the white player wins Example 2: contingency planning for disaster relief There exist preparatory steps s.t. for every disaster scenario within limits there exists a sequence of actions s.t. necessary food and shelter can be guaranteed within two days 58 QBF and Uncertainty Adversarial planning requires reasoning about uncertainty in the actions of the adversary (A) Uncertainty may be captured as probabilities over adversarial actions : probabilistic planning results good in expectation or with high probability (B) Uncertainty may alternatively be modeled as a set of contingencies : discrete contingency planning guaranteed results robust against all modeled modes of failure QBF in the basic form takes approach (B) Can be extended to weighted QBF in order to capture (A) 59 Adversarial Uncertainty Modeled as QBF • • • • Two agents: self and adversary Both have their own set of actions, rules, etc. Self performs actions at time steps 1, 3, 5, …, T Adversary performs actions at time steps 2, 4, 6, …, T-1 The following QBF formulation is True if and only if self can achieve the goal no matter what actions adversary takes There exists a self action at step 1 s.t. for every adversary action at step 2 there exists a self action at step 3 s.t. for every adversary action at step 4 … there exists a self action at step T s.t. ( (initialState(time=1) and self-respects-modeled-behavior(1,3,5,…,T) and goal(T)) OR (NOT adversary-respects-modeled-behavior(2,4,…,T-1)) ) 60 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning 3. QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning    – Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 61 QBF Search Space Initial state  : self Self action  : adversary Adversary action Self action Adversary action No goal Goal No goal Goal Initial state Self action Recall traditional SAT-type search space Self action Goal No goal Goal 62 QBF Solution: A Policy or Strategy Initial state Self action Adversary action Self action Adversary action Goal No goal Goal Goal Goal Initial state Contingency plan • A policy / strategy of actions for self • A subtree of the QBF search tree (contrast with a linear sequence of actions in SAT-based planning) Self action Self action No goal Goal 63 Exponential Complexity Growth Planning (single-agent): find the right sequence of actions HARD: 10 actions, 10! = 3 x 106 possible plans Contingency planning (multi-agent): actions may or may not produce the desired effect! … 1 out of 10 4 out 2 out of 8 of 9 REALLY HARD: 10 x 92 x 84 x 78 x … x 2256 = 10224 possible contingency plans! exponential polynomial 64 Exponential Complexity Growth: Chess Problems (benchmark domain for REAL) Board Parameters No. of vars. QBF model size (no. of constraints) Boolean search space RAW SPACE (legal moves) 3 600 ~ 4K 10180 107 5 4 1.3K ~ 10K 10390 1012 5x5 5 5 3K ~ 25K 10900 1014 6x6 8 6 10K ~ 100K 103,000 1019 7x7 10 7 24K ~ 200K 107,200 1025 8x8 16 11 90K ~ 500K 1027,000 1043 bxb p m O(pb3m) [highly board dependent] O(2pb3m) O(pmb3m) Board size No. of pieces No. of moves 4x4 3 4x4 (such problems solved in under 20 seconds by Qbf-Cornell) 65 Computational Complexity Hierarchy Hard EXP-complete: games like Go, … EXP PSPACE-complete: QBF, adversarial planning, … NP-complete: PSPACE SAT, scheduling, graph coloring, … NP P-complete: circuit-SAT, … P In P: sorting, shortest path… Easy Note: widely believed hierarchy; know P≠EXP for sure 66 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning 3. QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning – Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 67 QBF Scaling Behavior in Practice Although hard in the worst-case, QBF technology can scale well in practice. • Real-world problems often have built-in structure and contain hidden tractable sub-problems that make QBF solvers much faster in practice. (Note: This has helped make SAT-based single-agent reasoning scale to large problems with 1,000,000+ variables. We are pushing QBF toward this goal.) • Qbf-Cornell and Qbf-CornellD can automatically exploit many hidden tractable sub-structures and regularities in the problem (scaling plots on the next few slides) On many chess instances, we observe – either low-polynomial (or even near-linear) scaling – or exponential scaling but with a much lower exponent than raw search 68 Scaling: Number of Pieces Qbf-CornellD compared with the raw space of legal chess moves searched at 100,000,000 moves per second: Polynomial scaling with number of pieces (log-log plot) 8K vars, 150K constraints Runtime in seconds (logscale) 1e+08 Raw Search at 100M moves/sec Cornell QBF Solver (1 year) 1e+07 1e+06 100000 (1 day) 10000 1000 100 (1 minute) 10 17K vars, 400K constraints 1 8 9 10 11 12 13 14 15 16 17 18 19 Number of pieces (logscale) 69 Scaling: Number of Steps The worst-case scaling is exponential in the number of steps. However, in practice, the exponent is much smaller for the Qbf-CornellD than for raw search. Exponential speed-up over the raw search time Runtime in seconds (logscale) Scaling with number of steps (log-plot) Raw Search at 100M moves/sec Cornell QBF Solver 1e+14 1e+12  Problems remain feasible much longer 1e+10 5-18K vars, 100-370K constraints 1e+08 (1 year) 1e+06 (1 day) 10000 100 (1 minute) 1 3 5 7 9 11 Number of steps 70 Scaling: Number of Steps Many real-world problems contain tractable (but hidden) sub-problems. Qbf-CornellD can often automatically learn and exploit these. Reachability-based boards: Scaling with no. of pieces, Hidden tractable sub-structure (log-plot) Runtime in seconds (logscale) 10000 Near-linear scaling with number of steps! Raw Search at 100M moves/sec Cornell QBF Solver y = x, for comparison (1 hour) 1000 77K vars, 2.2M constraints 100 (1 minute) 10 1 3 5 7 9 11 13 15 Number of steps 71 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning  Key research advances – The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 72 Key Technical Advances 1. Good scaling with increasing number of quantifier alternations 2. New problem modeling techniques • natural, generic, automated • exploited by new solver techniques (Qbf-Cornell [’05], Qbf-CornellD [’06]) 3. Effective constraint propagation across quantifiers • essential for SAT solver success; challenging for QBF 4. Addressing efficiency drop caused by auxiliary variables • the “illegal search space issue” specific to QBF [more on these next] 73 Key Advance #1 • Most QBF benchmarks have only 2-3 quantifier levels – – – • Might as well translate into SAT (it often works!) Early QBF solvers focused on such instances Benchmarks with many quantifier levels are often the hardest Practical issues in both modeling and solving become much more apparent with many quantifier levels QBF solvers can now scale well with 10+ quantifier alternations Achieved through better modeling and learning-while-reasoning techniques. [details to follow] 74 Key Advance #2 QBF solvers are extremely sensitive to encoding! – Especially with many quantifier levels, e.g., evader-pursuer chess instances [Madhusudan et al. 2003] Instance (N, steps) Model X [Madhusudan et al. 2003] Model A [Cornell 2005] QuBEJ [Italy] Semprop [Germany] Quaffle [Princeton] Best other solver Model B [Cornell 2005] QbfCornell Best other solver QbfCornell 4 7 2030 >2030 >2030 7497 3 0.03 0.03 4 9 -- -- -- -- 28 0.06 0.04 8 7 -- -- -- -- 800 5 5 We now have generic QBF modeling techniques that are simple and efficient for solvers [details to follow] 75 Key Advance #3 For QBF, traditional encodings hinder unit propagation – – – E.g. unsatisfiable “reachability” queries A SAT solver would have simply unit propagated Most QBF solvers need 1000’s of backtracks and relatively complex mechanisms like learning to achieve simple propagation Best solver with only unit propagation Best solver (Qbf-Cornell) with learning conf-r1 2.5 0.2 conf-r5 8603 5.4 conf-r6 >21600 7.1 ? q-unsat: too few steps for White QBF solvers now achieve effective propagation across quantifiers New solver (Qbf-CornellD) exploiting new modeling techniques [details to follow] 76 Example: Lack of Effective Propagation (in Traditional QBF Solvers) Question: Can White reach the pink square without being captured? Impossible! White has one too few available moves [ click image for video ] This instance should ideally be easy even with many additional (irrelevant) pieces! Unfortunately, all CNF-based QBF solvers scale exponentially  Good news: Qbf-CornellD based on dual encoding technology resolves this issue!77 Key Advance #4 QBF solvers suffer from the “illegal search space issue” [Cornell 2005] – – Auxiliary variables needed for conversion into Boolean form Can push solver into large irrelevant parts of search space – – Note: negligible impact on SAT solvers due to effective propagation Our first solution for QBF: Qbf-Cornell [2005] • Pass “flags” to the solver, which detect this event and trigger backtracking Dual encoding based QBF solvers can completely avoid the illegal search space issue Achieved through better modeling and solving techniques (Qbf-CornellD). [details to follow] 78 Intuition for Illegal Search Space: Search Space for SAT Approaches Search Space SAT Encoding 2N+M Space Searched Original by SAT Solvers Search Space 2N/C ; Nlog(N) 2N; Poly(N) Original 2N In practice, for many real-world applications, polytime scaling. 79 Search Space of QBF Search Space QBF Encoding 2N+M’ Search Space Standard QBF Encoding 2N+M’’ Space Searched Original by Qbf-Cornell Search Space with Streamlining 2N Original 2N 80 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning  Key research advances  The technology behind QBF – A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 81 Progress in QBF Reasoning QBF technology has grown rapidly and can now scale to 100,000 variables, compared to 200 variables 5 years ago. (Advances largely due to research supported by Darpa’s REAL program.) What were the key techniques that made these advances possible? 82 High-Performance Multi-Agent Reasoning #2. Learning while Reasoning #1. New QBF Modeling Techniques gain factor > 102 gain factor > 104 (w.r.t. best QBF solver, 2004) #3. Structure Exploitation: gain factor > 103 Streamlining Technology behind Qbf-Cornell [’05] and Qbf-CornellD [’06] Goal: solve 80% chess boards in < 5 secs [12 pieces, 5 moves, 10K vars, 40K rules] Result: achieved 88.5% success! [10-12 pieces, 7 moves, 12K vars, 100K rules] Overall performance gain over state of the art in 2004 = 109x Reveals: • Power of QBF for developing and analyzing strategies in multi-agent settings • As tool for augmented cognition, e.g. in military adversarial analysis 83 Qbf-Cornell on a Chess Instance 16 pieces 11 steps [ click image for video of a sample run ] Raw search space size ~ 1043 Solved in ~ 20 seconds QBF solution gives a strategy with which White king can provably reach the goal square – no matter what Black tries to do! 84 Generality of the QBF Approach General reasoning very different from specialized programs. 1. Generality: Improved general reasoning  faster solutions for all domains – If you can model a domain (logistics, security protocols, network configuration, etc.), you can solve it with general reasoning 2. Flexibility: End-user can ask different (sub)-queries – “Can two black pieces get within two squares from white king in 4 moves?” – “Can my troops capture that bridge in one day without using heavy artillery? – Provides strategic analysis capabilities QBF Reasoning Technology offers even more: 3. Optimality: Guarantees optimality in any multi-agent scenario – Can analyze worst-case e.g. “Can my troops capture that bridge no matter what the enemy does?” 4. Complex internal reasoning: – – Exploits interplay between representations (e.g. move-based & location-based) Can go beyond a human programmer 85 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning  Key research advances – The technology behind QBF  A. New modeling techniques – B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 86 A. New QBF Modeling Techniques a) b) Logarithmic encoding • Succinctly represents collective behavior of objects e.g. “in chess, only one move can be taken at each time step” • Technique extended to QBF and our setting Both moves and locations used as building blocks • • c) Trigger-based pruning of search space • • d) Allows interplay between move-based and location-based reasoning / representation Most heuristic chess programs use only move-based search Automatically generated Indicator variables flag illegal actions of the universal agent e.g. in chess, illegal moves of the black player when white plays first QBF-Cornell implements flag-based pruning Dual-encoding, combining conjunctive and disjunctive rules (CNF and DNF) • • Exploit a dichotomy between the existential and universal agents Qbf-CornellD exploits this dual format to further boost QBF-Cornell RESULT: Over six orders of magnitude improvement 87 Encoding: The Traditional Approach Problem of interest CNF-based QBF encoding QBF Solver e.g. circuit minimization Any discrete adversarial task Solution! 88 Encoding: Our Game-Based Approach Game G: Adversarial Task players E & U, states, actions, rules, goal “Planning as Satisfiability” framework [SelmanKautz ’96] e.g. circuit minimization Create CNF encoding separately for E and U: Solution! QBF Solver Qbf-Cornell [2005] Flag-based CNF encoding QBF Solver Qbf-CornellD [2006] Dual (split) CNF-DNF encoding initial state axioms, action implies precondition, fact implies achieving action, frame axioms, goal condition Solution! Negate CNF part for U (creates DNF) 89 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning  Key research advances – The technology behind QBF  A. New modeling techniques  B. Learning while reasoning – C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 96 B. Learning while Reasoning A key component of the QBF solver is the ability to learn new constraints during the search. • The learned information is captured by new clauses that are added to the formula. These clauses can dramatically prune the remaining search. Recall: Computational cost (secs) • 4000 50 var 40 var 20 var 3000 2000 Delete Rules Add Rules 1000 0 Ratio of clauses to variables Hardness complexity curves 97 Encoding: Before Learning Before learning, constraints in the encoding specify the rules of the game (i.e. what it means for a move to be legal) and the query as Boolean logic. E.g. Rule : “if White Rook moves from a2 to a5 at step 7, then a4 must be empty at 7.” Encoding : (move_wRook_a2_a5_7  empty_a4_7) True/False variables Rule : “Only one piece can move at each step t.” Encoding : Several constraints of the form: (move_pieceA_cell1_cell2_t  NOT move_pieceB_cell3_cell4_t) a b c d e f 6 : “Can White King reach cell e2 in at most 6 steps?” Encoding : (location_wKing_e2_1 OR location_wKing_e2_2 OR … OR location_wKing_e2_6) Query 5 4 3 2 1 98 Encoding: After Learning After learning, new constraints specify 1000’s of more complex observations logically deduced from the board. These observations typically combine spatial and temporal knowledge, often in terms of “if-then” facts. Some of these are used subsequently in the search, and the rest are gradually discarded. E.g. Knowledge : “if White King is at c5 and White Pawn is at d3 initially, then White King cannot be at d3 after making 2 moves (its own piece is blocking d3).” Encoding : ((location_wKing_c5_1 AND location_wPawn_d3_1)  NOT location_wKing_d3_5) Note: White moves on odd steps, Black on even steps. a b c d e f 6 Knowledge : “if White moves the bottom Rook at step 1, then wKing cannot be in row 2 at step 7.” 5 Encoding 3 : For each cell X in {a2,b2,c2,d2,e2,f2}, a constraint: (NOT location_wRook_f2_2)  (NOT location_wKing_X_7) 4 2 1 99 Learning while Reasoning QBF reasoning engine has no hard-coded knowledge of chess or distance functions! – still learns automatically, e.g., that a square is not reachable! – continuously adds learned/inferred constraints from failure, simplifying future search NOTE: Such knowledge must be hand-programmed into conventional solvers E.g. can w-king reach top-left cell in 5 moves? Without Learning explores > 20K possibilities With Learning explores < 5 possibilities 100 Example: Learning “Reachability” Query: Can the white king reach the top-left cell in 5 moves? Without Learning (> 20K possibilities explored) [ click images for video ] With Learning (< 5 possibilities explored) RECALL: No hard-coded knowledge. Learns automatically about reachability 101 Learning while Reasoning: A Visualization Solver learns, e.g., to focus the search to the most relevant areas of the board without explicitly being told anything about it! RESULT: Over two orders of magnitude improvement 102 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning  Key research advances – The technology behind QBF  A. New modeling techniques  B. Learning while reasoning  C. Structure discovery – Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 103 C. Structure Exploitation: Streamlining Search space R2 R1 Exploit domain and problem structure to narrow down search to small regions very likely to have a solution! E.g in chess, solutions (white player wins) Try region R1: white never moves a piece away from the goal : no solution found Try region R2: white doesn’t use the two of its far away pawns : solution found! Key invariant: if an agent’s actions are restricted and it still succeeds, it will succeed in the original setting as well The art lies in choosing “good” restrictions – not too strong, not too weak! RESULT: Over three orders of magnitude improvement 104 Structure Exploitation: Structure Discovery Constraint graph of a reasoning problem One node per variable: edge between two variables if they share a constraint. Planning problem encoding 843 vars, 7301 constraints, approx. min backdoor 16 (“backdoor” set = reasoning shortcut) (Gomes et al. ’03, ’04) 105 Structure Exploitation: Structure Discovery After setting 5 backdoor vars (out of 800) After setting just 12 backdoor vars – problem almost solved! 106 Structure Exploitation: Structure Discovery MAP-6-7.cnf infeasible planning instances. Strong backdoor of size 3. 392 vars, 2,578 clauses. Structure Exploitation: Structure Discovery After setting 2 (out of 392) backdoor vars -- complexity almost gone! Dynamic view: Running SAT solver (no backdoor detection) [ click image for video ] 109 SAT solver detects backdoor set [ click image for video ] 110 Tractable Sub-Structure Can Lead to Near-Linear Scaling in QBF as Well! Many real-world problems contain tractable (but hidden) sub-problems. Qbf-CornellD can often automatically learn and exploit these. Reachability-based boards: Scaling with no. of pieces, Hidden tractable sub-structure (log-plot) Runtime in seconds (logscale) 10000 Near-linear scaling with number of steps! Raw Search at 100M moves/sec Cornell QBF Solver y = x, for comparison (1 hour) 1000 77K vars, 2.2M constraints 100 (1 minute) 10 1 3 5 7 9 11 13 15 Number of steps 111 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples 4. High-Performance QBF reasoning  Key research advances  The technology behind QBF  A. New modeling techniques  B. Learning while reasoning  C. Structure discovery  Experimental Results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 112 Experimental Results 5-15 quantifier levels (reachability) 7-9 quantifier levels Dual Encoding Pure CNF Encoding xChess instance name T/F Semprop [Germany] sKizzo [France] Quaffle [Princeton] Qbf-Cornell [2005] Qbf-CornellD [2006] conf-r1 F 12 4.0 15 1.3 0.01 conf-r2 F 25 5.86 33 2.5 0.02 conf-r3 F 55 9.3 62 4.1 0.03 conf-r4 F 85 26 124 6.4 0.04 conf-r5 F 985 84 676 34 0.08 conf-r6 F 2042 73 713 49 0.10 conf01 F 1225 492 -- 539 6.4 conf02 F 93 30 6.0 1.0 0.0 conf03 T -- 1532 -- 83 1.4 conf04 T -- -- 2352 100 3.5 conf05 F 3290 448 510 196 0.1 conf06 F -- memout -- 633 30.6 conf07 F 261 42 78 3.5 0.0 conf08 T -- 1509 -- 1088 31.2 113 Experimental Results, contd. 7-9 quantifier levels Dual Encoding Pure CNF Encoding xChess instance name T/F Semprop [Germany] sKizzo [France] Quaffle [Princeton] Qbf-Cornell [2005] Qbf-CornellD [2006] conf1a T 627 83 -- 161 1.8 conf1b F 682 176 2939 124 1.3 conf1c T 659 804 -- 156 2.1 conf1d F 706 1930 1473 148 2.2 conf2a T -- -- -- 438 65.9 conf2b F -- -- -- 275 56.9 conf3a T -- memout -- 653 5.2 conf3b F -- -- 2128 327 2.2 conf4 F -- -- -- 274 32.0 conf5 F 1018 427 142 11 0.1 Qbf-Cornell and Qbf-CornellD dramatically outperform all leading QBF solvers on these challenging instances 114 Tutorial Roadmap  Automated reasoning  The complexity challenge  State of the art in Boolean reasoning  SAT-based reasoning     Boolean logic Search space, worst-case complexity Hardness profiles, scaling in practice Modeling problems as SAT  Example domain: planning  QBF reasoning (extends SAT)  A new range of applications  Two motivating examples  High-Performance QBF reasoning  Key research advances  The technology behind QBF  A. New modeling techniques  B. Learning while reasoning  C. Structure discovery  Experimental results 5. Summary  network planning, logistics planning     Quantified Boolean logic Modeling problems as QBF Search space, worst-case complexity Scaling in practice 115 Where Does QBF Reasoning Stand? We have come a long way since the first QBF solvers 5 years ago! (thanks to the Darpa REAL program) • From 200 variable problems to 100,000 variable problems (Qbf-Cornell and Qbf-CornellD) • From 2-3 quantifier alternations to 10+ quantifiers • New techniques for modeling and solving • A better understanding of issues like propagation across quantifiers and illegal search space • Many more benchmarks and test suites 116 Summary QBF Reasoning: a promising new automated reasoning technology! On the road to a whole new range of applications: • Strategic decision making • Performance guarantees in complex multi-agent scenarios • Secure communication and data networks in hostile environments • Robust logistics planning in adversarial settings • Large scale contingency planning • Provably robust and secure software and hardware 117 Creating the Next Generation Reasoning Technology at the Intelligent Information Systems Institute (IISI), Cornell University Director: Dr. Carla Gomes Connections and Collaborations Branching Processes K. Athreya (Cornell) Power laws vs. Small-world S. Strogatz (Cornell) T. Walsh (U. New South Wales) Approximations and Randomization Lucian Leahu (Cornell) David Shmoys (Cornell) Random CSP Models C. Fernandez, M. Valls (U. Lleida) C. Bessiere (LIRMM-CNRS) C. Moore (U. New Mexico) Learning Dynamic Restart Strategies Formal Models. Problem structure, Backdoors Robustness vs. Fragility John Doyle (Caltech) Walter Willinger (AT&T Labs) E. Horvitz (Micrsoft Research) H. Kautz and Y. Ruan (U. Washington) Nudelman and Shoham (Stanford) H. Chen (Cornell) John Hopcroft (Cornell) Jon Kleinberg (Cornell) R. Williams (CMU) Joerg Hoffman (Max-Planck Inst.) Information Theory S. Wicker (Cornell) 118 Computer Science Engineering Mathematics Operations Research Physics Cross-fertilization of ideas for the study and design of Intelligent Systems Economics Cognitive Science Research part of Cornell’s Intelligent Information Systems Institute. Director: Carla Gomes. Additional Material Ease of Use / Portability Q: Can someone else easily use our techniques for problems other than chess? A: Yes, it is straightforward once they understand the concept of a QBF encoding. – Solver: they can use the Qbf-Cornell and Qbf-CornellD off-the-shelf as a black box. – Encoder: A general encoder can be written for each application domain. Our chess encoder will serve as a useful starting point. – Our technique is described in research publications [AAAI-05, SAT-06] and has already been explored and extended by others in various contexts [e.g. Zhang AAAI06, Tang-Malik SAT-06, Bessiere-Verger CP-06]. (Our related work received best paper awards at the AAAI-06 and CP-06 conferences.) – We have an initial proposal for a generic automated QBF encoder for a higher-level adversarial planning language. This language will be similar to commonly used single-agent planning specification languages like STRIPS. Q: Are we creating a general solution? A: Yes, the approach provides a fully general platform for adversarial reasoning. – Solver: generic; no chess-based or other specific knowledge embedded. – Encoder technology: the core is applicable to any discrete adversarial reasoning task. – Chess encoder: can be adapted manually for other tasks. 121 Encoding: Generation • The encoding process is automatic and very efficient. – – – – – The “encoding” is a re-statement of the problem in the language of the solver. We use a fully automated chess-to-QBF encoder. Each of the 600 go/no-go boards required < 0.3 sec to encode. The hardest instances we have tried required < 2 sec to encode. For chess and most future applications of interest, encoder time will be polynomial in the problem parameters. The challenge is always the solver time. The encoding process takes a fraction of the time compared to the solver. 122 Encoding: Effect of Number of Steps The number of steps is simply a parameter to the encoder. If a different number of steps is desired, the new encoding can be generated within seconds by re-running the encoder. Additional information: – Decreasing the number of steps (and possibly also changing the query itself) : Requires changing only the few constraints that encode the new query (< 1% of the encoding). The rest of the encoding (> 99%) remains untouched. – Increasing the number of steps in the query: Requires additional variables and constraints for the extra steps, which are simply appended to the original encoding. – Transfer learning technology can be exploited to re-use knowledge gained from the original encoding when the query is changed; we are exploring this. 123 From Adversarial Tasks To Games Example #2: The Chromatic Number Problem: Given a graph G and a positive number k, does G have chromatic number k? – Chromatic number: minimum number of colors needed to color G so that every two adjacent vertices get different colors – A game with 2 turns – • Moves : First, E produces a coloring S of G; second, U produces a coloring T of G • Rules : S must be a legal k-coloring of G; T must be a legal (k-1)-coloring of G • Goal : E wins if S is valid and T is not “E wins” iff graph G has chromatic number k 124

Quantified Boolean Formula (QBF) Reasoning Bart Selman, Carla Gomes, Ashish Sabharwal

Related documents

Products

Support

Quantified Boolean Formula (QBF) Reasoning Bart Selman, Carla Gomes, Ashish Sabharwal

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib