Introduction to Mathematical Programming

advertisement
Explorations in Artificial Intelligence
Prof. Carla P. Gomes
gomes@cs.cornell.edu
Module 7
Part 3
Integer Programming
Divisibility
Decision variables in an LP model are allowed to have any values,
including noninteger values, that satisfy the functional and
nonnegativity constraints. i.e., activities can be run at fractional levels.
What to do when divisibility assumption violated:
realm of integer programming!!!
Revisiting the TBA Airlines Problem
An Example where Integrality Matters
The TBA Airlines Problem
•
TBA Airlines is a small regional company that specializes in short flights in small
airplanes.
•
The company has been doing well and has decided to expand its operations.
•
The basic issue facing management is whether to purchase more small airplanes to add
some new short flights, or start moving into the national market by purchasing some
large airplanes, or both.
Question: How many airplanes of each type should be purchased to maximize their
total net annual profit?
Data for the TBA Airlines Problem
Small
Airplane
Large
Airplane
Net annual profit per airplane
$1 million
$5 million
Purchase cost per airplane
5 million
50 million
2
—
Maximum purchase quantity
Capital
Available
$100 million
Linear Programming Formulation
Let S = Number of small airplanes to purchase
L = Number of large airplanes to purchase
Maximize Profit = S + 5L ($millions)
subject to
Capital Available:
5S + 50L ≤ 100 ($millions)
Max Small Planes: S ≤ 2
and
S ≥ 0, L ≥ 0.
Graphical Method for Linear Programming
Number
of large
airplanes
purchased
L
3
2
(2, 1.8) = Optimal solution
Profit = 11 = S + 5 L
1
0
Feasible
region
(2, 1) = Rounded solution
(Profit = 7)
1
2
3
Number of small airplanes purchased
S
Violates Divisibility Assumption of LP
•
Divisibility Assumption of Linear Programming: Decision variables in a linear
programming model are allowed to have any values, including fractional values, that
satisfy the functional and nonnegativity constraints. Thus, these variables are not
restricted to just integer values.
•
Since the number of airplanes purchased by TBA must have an integer value, the
divisibility assumption is violated.
Integer Programming Formulation
Let S = Number of small airplanes to purchase
L = Number of large airplanes to purchase
Maximize Profit = S + 5L ($millions)
subject to
Capital Available:
5S + 50L ≤ 100 ($millions)
Max Small Planes: S ≤ 2
and
S ≥ 0, L ≥ 0
S, L are integers.
Graphical Method for Integer Programming
L
Number
of large
3
airplanes
purchased
2
1
0
(0, 2) = Optimal solution for the integer programming problem (Profit = 10)
(2, 1.8) = Optimal solution for the
LP relaxation (Profit = 11)
Profit = 10 = S + 5 L
(2, 1) = Rounded solution (Profit = 7)
1
2
3
Number of small airplanes purchased
S
Graphical Method for Integer Programming
•
When an integer programming problem has just two decision variables, its optimal solution can be found
by applying the graphical method for linear programming with just one change at the end.
•
We begin as usual by graphing the feasible region for the LP relaxation, determining the slope of the
objective function lines, and moving a straight edge with this slope through this feasible region in the
direction of improving values of the objective function.
•
However, rather than stopping at the last instant the straight edge passes through this feasible region, we
now stop at the last instant the straight edge passes through an integer point that lies within this feasible
region.
•
This integer point is the optimal solution.
Why integer programs?
•
Advantages of restricting variables to take on integer values
– More realistic
– More flexibility
•
Disadvantages
– More difficult to model
– Can be much more difficult to solve
Integer Programming
•
When are “non-integer” solutions okay?
– Solution is naturally divisible
• e.g., $, pounds, hours
– Solution represents a rate
• e.g., units per week
– Solution only for planning purposes
•
When is rounding okay?
– When numbers are large
• e.g., rounding 114.286 to 114 is probably okay.
•
When is rounding not okay?
– When numbers are small
• e.g., rounding 2.6 to 2 or 3 may be a problem.
– Binary variables
• yes-or-no decisions
Types of Integer Programming Problems
•
Pure integer programming problems are those where all the decision
variables must be integers.
•
Mixed integer programming problems only require some of the variables
(the “integer variables”) to have integer values so the divisibility assumption
holds for the rest (the “continuous variables”).
•
Binary variables are variables whose only possible values are 0 and 1.
•
Binary integer programming (BIP) problems are those where all the
decision variables restricted to integer values are further restricted to be binary
variables.
– Such problems can be further characterized as either pure BIP problems or mixed
BIP problems, depending on whether all the decision variables or only some of
them are binary variables.
Examples of Applications of Binary Variables
•
Making “yes-or-no” type decisions
–
–
–
–
•
Logical constraints
–
–
•
Alternative constraints
Conditional constraints
Representing non-linear functions
–
–
–
–
•
Build a factory?
Manufacture a product?
Do a project?
Assign a person to a task?
Fixed Charge Problem
• If a product is produced, must incur a fixed setup cost.
• If a warehouse is operated, must incur a fixed cost.
Piecewise linear representation
Diseconomies of scale
Approximation of nonlinear functions
Set-covering, and set partitioning
–
–
Make a set of assignments that “cover” a set of requirements.
Partition a set into subsets meeting given requirements
StockCompany Example
Capital Budgeting Allocation Problem
StockCompany is considering 6 investments. The cash required from each
investment as well as the NPV of the investment is given next. The cash available
for the investments is $14,000. Stockco wants to maximize its NPV. What is the
optimal strategy?
An investment can be selected or not. One cannot select a fraction of an investment.
Data for the StockCompany Problem
Investment budget = $14,000
Investment 1
Cash
Required
(1000s)
NPV
added
(1000s)
$5
2
3
4
5
6
$7
$4
$3
$4
$6
$16 $22 $12 $8
$11 $19
Integer Programming Formulation
What are the decision variables?
1, if we invest in i  1,...,6,
xi  
else
0,
Objective and Constraints?
Max 16x1+ 22x2+ 12x3+ 8x4+ 11x5+ 19x6
5x1+ 7x2+ 4x3+ 3x4+ 4x5+ 6x6  14
xj e {0,1} for each j = 1 to 6
Capital Budgeting Allocation Problem (one resource)
Knapsack Problem
• Why is a problem with the characteristics of the previous problem
called the Knapsack Problem?
• It is an abstraction, considering the simple problem:
A hiker trying to fill her knapsack to maximum total value.
Each item she considers taking with her has a certain value and a
certain weight. An overall weight limitation gives the single
constraint.
Practical applications:
Project selection and capital budgeting allocation problems
Storing a warehouse to maximum value given the indivisibility of
goods and space limitations
Sub-problem of other problems e.g., generation of columns for a given
model in the course of optimization – cutting stock problem (beyond
the scope of this course)
• The previous constraints represent “economic indivisibilities”, either a
project is selected, or it is not. There is no selecting of a fraction of a
project.
• Similarly, integer variables can model logical requirements (e.g., if
stock 2 is selected, then so is stock 1.)
How to model “logical” constraints
•
Exactly 3 stocks are selected.
•
If stock 2 is selected, then so is stock 1.
•
If stock 1 is selected, then stock 3 is not selected.
•
Either stock 4 is selected or stock 5 is selected, but not both.
Formulating Constraints
•
Exactly 3 stocks are selected
x1+ x2+ x3+ x4+ x5+ x6=3
If stock 2 is selected then so is stock 1
The integer programming constraint:
A 2-dimensional
representation
x1  x2
Stock 2
Stock 1
If stock 1 is selected then stock 3 is not selected
The integer programming constraint:
A 2-dimensional
representation
x1 + x3  1
Stock 3
Stock 1
Either stock 4 is selected or stock 5 is selected, but
not both.
The integer programming constraint:
A 2-dimensional
representation
x4 + x5 = 1
stock 5
stock 4
California Manufacturing Company
•
The California Manufacturing Company is a diversified company with several
factories and warehouses throughout California, but none yet in Los Angeles
or San Francisco.
•
A basic issue is whether to build a new factory in Los Angeles or San
Francisco, or perhaps even both.
•
Management is also considering building at most one new warehouse, but will
restrict the choice to a city where a new factory is being built.
Question: Should the California Manufacturing Company expand with
factories and/or warehouses in Los Angeles and/or San Francisco?
Data for California Manufacturing
Decision
Number
Yes-or-No
Question
Decision
Variable
Net Present
Value
(Millions)
Capital
Required
(Millions)
1
Build a factory in Los Angeles?
x1
$8
$6
2
Build a factory in San Francisco?
x2
5
3
3
Build a warehouse in Los Angeles?
x3
6
5
4
Build a warehouse in San Francisco?
x4
4
2
Capital Available: $10 million
Binary Decision Variables
Decision
Number
Decision
Variable
Possible
Value
Interpretation
of a Value of 1
Interpretation
of a Value of 0
1
x1
0 or 1
Build a factory in
Los Angeles
Do not build
this factory
2
x2
0 or 1
Build a factory in
San Francisco
Do not build
this factory
3
x3
0 or 1
Build a warehouse in
Los Angeles
Do not build
this warehouse
4
x4
0 or 1
Build a warehouse in
San Francisco
Do not build
this warehouse
Algebraic Formulation
Let x1 = 1 if build a factory in L.A.; 0 otherwise
x2 = 1 if build a factory in S.F.; 0 otherwise
x3 = 1 if build a warehouse in Los Angeles; 0 otherwise
x4 = 1 if build a warehouse in San Francisco; 0 otherwise
Maximize NPV = 8x1 + 5x2 + 6x3 + 4x4 ($millions)
subject to
Capital Spent:
6x1 + 3x2 + 5x3 + 2x4 ≤ 10 ($millions) Resource Availability
Max 1 Warehouse:
x3 + x4 ≤ 1 Mutually exclusive decisions
Warehouse only if Factory:
x3 ≤ x1
Contingent decisions
x4 ≤ x2
and
x1, x2, x3, x4 are binary variables.
Mpl Model
Max 8x1 + 5x2 + 6x3 + 4x4;
subject to
6x1 + 3x2 + 5x3 + 2x4 <= 10;
x3 + x4 <= 1;
x3 <= x1;
x4 <= x2;
BINARY
x1;
x2;
x3;
x4;
Modeling Fixed Charge Problems
If a product is produced, must incur a fixed setup cost.
If a warehouse is operated, must incur a fixed cost.
 The problem is non-linear.
x – quantity of product to be manufactured
x = 0  cost =0;
x > 0  cost = C1x + C2
 How to model it? Using an indicator variable y
y = 1  x is produced; y = 0  x is not produced
Objective function becomes  C1x + C2y
Additional Constraint  x ≤ My
Wyndor with Setup Costs (Variation 1)
Suppose that two changes are made to the original Wyndor problem:
1. For each product, producing any units requires a substantial one-time setup
cost for setting up the production facilities.
2. The production runs for these products will be ended after one week, so D and
W in the original model now represent the total number of doors and windows
produced, respectively, rather than production rates. Therefore, these two
variables need to be restricted to integer values.
Graphical Solution to Original Wyndor Problem
W
Production rate
for windows
8
Optimal solution
(2, 6)
6
4
Feasible
Region
P = 3,600 = 300 D + 500 W
2
0
4
2
Production rate for doors
6
8
10 D
Net Profit for Wyndor Problem with Setup Costs
Net Profit ($)
Number of
Units Produced
Doors
Windows
0
0(300) – 0 = 0
0 (500) – 0 = 0
1
1(300) – 700 = –400
1(500) – 1,300 = –800
2
2(300) – 700 = –100
2(500) – 1,300 = –300
3
3(300) – 700 = 200
3(500) – 1,300 = 200
4
4(300) – 700 = 500
4(500) – 1,300 = 700
5
Not feasible
5(500) – 1,300 = 1,200
6
Not feasible
6(500) – 1,300 = 1,700
Feasible Solutions for Wyndor with Setup Costs
W
Production
quantity for
windows
Optimal solution
8
(0, 6) gives P = 1700
6
(2, 6) gives P = -100 + 1700
= 1600
4
(4, 3) gives P = 500 + 200
= 700
2
(0, 0)
gives P = 0
(4, 0) gives P = 500
0
6
2
4
Production quantity for doors
8 D
Algebraic Formulation
Let D = Number of doors to produce,
W = Number of windows to produce,
y1 = 1 if perform setup to produce doors; 0 otherwise,
y2 = 1 if perform setup to produce windows; 0 otherwise .
Maximize P = 300D + 500W – 700y1 – 1,300y2
subject to
Original Constraints:
Plant 1:
D≤4
Plant 2:
2W ≤ 12
Plant 3:
3D + 2W ≤ 18
Produce only if Setup:
Doors:
D ≤ My1
Windows: W ≤ My2
and
D ≥ 0, W ≥ 0, y1 and y2 are binary.
Wyndor with Mutually Exclusive Products
(Variation 2)
Suppose that now the only change from the original Wyndor problem is:
•
The two potential new products (doors and windows) would compete for the
same customers. Therefore, management has decided not to produce both of
them together.
–
At most one can be chosen for production, so either D = 0 or W = 0, or both.
Feasible Solution for
Wyndor with Mutually Exclusive Products
(for non-binary variables)
W
Production
rate for
windows
P = 300 D + 500 W
Either D = 0 or W = 0
8
(0, 6) gives P = 3,000
6
4
2
(0, 0) gives P = 0
0
(4, 0) gives P = 1,200
2
4
Production rate for doors
6
8 D
Algebraic Formulation
Let D = Number of doors to produce,
W = Number of windows to produce,
y1 = 1 if produce doors; 0 otherwise,
y2 = 1 if produce windows; 0 otherwise.
Maximize P = 300D + 500W
subject to
Original Constraints:
Plant 1:
D≤4
Plant 2:
2W ≤ 12
Plant 3:
3D + 2W ≤ 18
Auxiliary variables must =1 if produce any:
Doors:
D ≤ My1
Windows:
W ≤ My2
Mutually Exclusive: y1 + y2 ≤ 1
and
D ≥ 0, W ≥ 0, y1 and y2 are binary.
Wyndor with Either-Or Constraints
(Variation 3)
Suppose that now the only change from the original Wyndor problem is:
•
The company has just opened a new plant (plant 4) that is similar to plant 3, so
the new plant can perform the same operations as plant 3 to help produce the
two new products (doors and windows).
•
However, management wants just one of the plants to be chosen to work on
these new products. The plant chosen should be the one that provides the most
profitable product mix.
Data for Wyndor with Either-Or Constraints
(Variation 3)
Production Time Used for
Each Unit Produced (Hours)
Plant
Doors
Windows
Production Time
Available
per Week (Hours)
1
1
0
4
2
0
2
12
3
3
2
18
4
2
4
28
Unit Profit
$300
$500
Graphical Solution with Plant 3 or Plant 4
(a) Choose Plant 3
(b) Choose Plant 4
W
W
8
8
(2, 6) gives
P = 3,600
6
6
4
4
Feasible
region
Feasible
region
2
0
(4, 5) gives
P = 3,700
2
2
4
D
0
2
4
D
Algebraic Formulation
Let D = Number of doors to produce,
W = Number of windows to produce,
y = 1 if plant 4 is used; 0 if plant 3 is used
Maximize P = 300D + 500W
subject to
Plant 1:
D≤4
Plant 2:
2W ≤ 12
Plant 3:
Plant 4:
3D + 2W ≤ 18 + My
2D + 4W ≤ 28 + M(1 – y)
and
D ≥ 0, W ≥ 0, y is binary.
Applications of Binary Variables
•
Making “yes-or-no” type decisions
–
–
–
–
•
Build a factory?
Manufacture a product?
Do a project?
Assign a person to a task?
Fixed costs
– If a product is produced, must incur a fixed setup cost.
– If a warehouse is operated, must incur a fixed cost.
•
Either-or constraints
– Production must either be 0 or ≥ 100.
•
Subset of constraints
– meet 3 out of 4 constraints.
Special Kinds of Integer Programming Models
• Knapsack Problem
• Set Covering Problem
• Set Partitioning Problem
• Set Packing Problem
• The Traveling Salesman Problem
• The Quadratic Assignment Problem
Set Covering Problem
• We are given a set of objects S = {1, 2, 3, …, n}.
• We are also given a set of subsets of S, S. Each subset has
a cost associated with it.
• Problem:
– to “cover” all the members of S at the minimum cost using
members of S.
• Properties:
– The problem is a minimization and all constraints are >=;
– All RHS coefficients are 1;
– All other matrix coefficients are 0 or 1.
Fire Station Problem
Set Covering Problem
1
2
5
Locate fire stations
so that each district
has a fire station in
it, or next to it.
3
6
7
4
8
11
Minimize the
number of fire
stations needed.
9
12
13
10
14
15
16
Representation as Set Covering Problem
1
2
5
7
8
11
Covers
1
1, 2, 4, 5
2
1, 2, 3, 5, 6
3
2, 3, 6, 7
16
13, 15, 16
3
6
4
Set
9
12
13
10
14
15
16
Representation as Integer program
11
2
xj = 1 if node j is selected
xj = 0 otherwise
3
Minimize x1 + x2 + … + x16
5
6
4
8
11
7
x1 + x2 + x3 + x5 + x6  1
9
12
s.t. x1 + x2 + x4 + x5  1
13
10
14
15
16
x13 + x15 + x16  1
xj  {0, 1} for each j.
Southwestern Airways Crew Scheduling
•
Southwestern Airways needs to assign crews to cover all its upcoming flights.
•
We will focus on assigning 3 crews based in San Francisco (SFO) to 11 flights.
Question: How should the 3 crews be assigned 3 sequences of flights so that
every one of the 11 flights is covered?
Southwestern Airways Flights
Seat tl e
(SEA)
San Francis co
(SFO)
Los Angel es
(LAX)
Denver
(DEN)
Chi cago
ORD)
Data for the Southwestern Airways Problem
Feasible Sequence of Flights (pairings)
Flights
1
1. SFO–LAX
1
2. SFO–DEN
2
3
5
6
1
1
3. SFO–SEA
1
2
3
3
3
4
9. DEN–ORD
3
4
4
4
6
7
3
5
5
3
3
4
5
7
2
4
2
3
2
2
2
11. SEA–LAX
1
5
2
10. SEA–SFO
12
4
7. ORD–SEA
2
11
1
3
3
10
1
1
2
2
9
1
2
8. DEN–SFO
8
1
1
6. ORD–DEN
Cost, $1,000s
7
1
4. LAX–ORD
5. LAX–SFO
4
8
5
2
4
4
2
9
9
8
9
Algebraic Formulation
Let xj = 1 if flight sequence (paring) j is assigned to a crew; 0 otherwise. (j = 1, 2, … , 12).
Minimize Cost = 2x1 + 3x2 + 4x3 + 6x4 + 7x5 + 5x6 + 7x7 + 8x8 + 9x9 + 9x10 + 8x11 + 9x12
(in $thousands)
pairings
subject to
Flight 1 covered:
Flight 2 covered:
:
Flight 11 covered:
x1 + x4 + x7 + x10 ≥ 1
x2 + x5 + x8 + x11 ≥ 1
:
x6 + x9 + x10 + x11 + x12 ≥ 1
Three Crews:
x1 + x2 + x3 + x4 + x5 + x6 + x7 + x8 + x9 + x10 + x11 + x12 ≤ 3
and
xj are binary (j = 1, 2, … , 12).
Set Covering Problem
• We are given a set of objects S = {1, 2, 3, …, n}.
• We are also given S, a set of subsets of S. Each subset has
a cost associated with it.
• Problem:
– to “cover” all the members of S at the minimum cost using
members of S.
• Properties:
– The problem is a minimization and all constraints are >=;
– All RHS coefficients are 1;
– All other matrix coefficients are 0 or 1.
Some Comments on IP models
• There are often multiple ways of modeling the
same integer program.
• Solvers for integer programs are extremely
sensitive to the formulation. (not true for LPs)
Summary on Integer Programming
• Dramatically improves the modeling capability
– Economic indivisibilities
– Logical constraints
– Modeling nonlinearities (e.g., fixed cost)
– classical problems in capital budgeting and in
supply chain management
– Lots of other applications and models
• Not as easy to model
• Not as easy to solve.
• Solving LP
The Challenges of Rounding
•
Rounded Solution may not be
feasible.
•
Rounded solution may not be
close to optimal.
•
There can be many rounded
solutions.
– Example: Consider a problem
with 30 variables that are noninteger in the LP-solution.
How many possible rounded
solutions are there?
x2
5
4
3
2
1
1
2
3
4
5
x1
Overview of Techniques for Solving Integer
Programs
• Enumeration Techniques
– Complete Enumeration
• list all “solutions” and choose the best
– Branch and Bound
• Implicitly search all solutions, but cleverly eliminate the vast majority
before they are even searched
– Implicit Enumeration
• Branch and Bound applied to binary variables
• Cutting Plane Techniques
– Use LP to solve integer programs by adding constraints to eliminate the
fractional solutions.
How Integer Programs are Solved
x2
5
4
3
2
1
1
2
3
4
5
x1
How Integer Programs are Solved
x2
5
4
3
2
1
1
2
3
4
5
x1
Branch and Bound
– It is the starting point for all solution techniques for
integer programming
– Lots of research has been carried out over the past 40
years to make it more and more efficient
– But, it is an art form to make it efficient. (We shall get
a sense why.)
– Integer programming is intrinsically difficult.
Capital Budgeting Example
Investment budget = $14,000
Investment
Cash
Required
(1000s)
NPV
added
(1000s)
1
2
3
4
5
6
$5
$7
$4
$3
$4
$6
$16 $22 $12 $8
$11 $19
maximize 16x1 + 22x2 + 12x3 + 8x4 +11x5 + 19x6
subject to
5x1 + 7x2 + 4x3 + 3x4 +4x5 + 6x6  14
xj binary for j = 1 to 6
Complete Enumeration
• Systematically considers all possible values of the decision
variables.
– If there are n binary variables, there are 2n different
ways.
• Usual idea: iteratively break the problem in two. At the
first iteration, we consider separately the case that x1 = 0
and x1 = 1.
An Enumeration Tree
Original
problem
x1 = 0
x2 = 0
x3 = 0
x3 = 1
x1 = 1
x3 = 0
x2 = 1
x2 = 0
x2 = 1
x3 = 1
x3 = 0
x3 = 1
x3 = 0
x3 = 1
On complete enumeration
• Suppose that we could evaluate 1 billion solutions per
second.
• Let n = number of binary variables
• Solutions times (approx.)
– n = 30,
1 second
– n = 40,
18 minutes
– n = 50
13 days
– n = 60
31 years
On complete enumeration
• Suppose that we could evaluate 1 trillion solutions per
second, and instantaneously eliminate 99.9999999% of all
solutions as not worth considering
• Let n = number of binary variables
• Solutions times
– n = 70,
– n = 80,
– n = 90
– n = 100
1 second
17 minutes
11.6 days
31 years
Branch and Bound
The essential idea: search the enumeration tree, but at
each node
1.
2.
Solve the linear program at the node
Eliminate the subtree (fathom it) if
1. The solution is integer (there is no need to go furtherwhy?) or
2. The best solution in the subtree cannot be as good as
the best available solution (the incumbent- how does
that happen?) or
3. There is no feasible solution
Branch and Bound
1
44 3/7
Node 1 is the
original LP
Relaxation
maximize 16x1 + 22x2 + 12x3 + 8x4 +11x5 + 19x6
subject to
5x1 + 7x2 + 4x3 + 3x4 +4x5 + 6x6  14
0  xj  1 for j = 1 to 6
Solution at node 1:
x1 =1 x2 = 3/7
x3 = x4 = x5 = 0 x6 =1
z = 44 3/7
The IP cannot have value higher than 44 3/7.
Branch and Bound
1
44 3/7
x1 = 0
44
Node 2 is the
original LP
Relaxation plus
the constraint
x1 = 0.
2
maximize 16x1 + 22x2 + 12x3 + 8x4 +11x5 + 19x6
subject to
5x1 + 7x2 + 4x3 + 3x4 +4x5 + 6x6  14
0  xj  1 for j = 1 to 6, x1 = 0
Solution at node 2:
x1 = 0 x2 = 1 x3 = 1/4 x4 = x5 = 0 x6 = 1
z = 44
Branch and Bound
1
x1 = 0
44
Node 3 is the
original LP
Relaxation plus
the constraint
x1 = 1.
44 3/7
x1 = 1
3
2
44 3/7
The solution at node 1 was
x1 =1 x2 = 3/7
x3 = x4 = x5 = 0
x6 =1
z = 44 3/7
Note: it was the best solution with no constraint on x1.
So, it is also the solution for node 3. (If you add a
constraint, and the old optimal solution is feasible, then
it is still optimal.)
Branch and Bound
44 3/7
1
x1 = 0
44
Node 4 is the
original LP
Relaxation plus
the constraints
x1 = 0, x2 = 0.
x1 = 1
2
3
44 3/7
x2 = 0
42
44
Solution at node 4:
0
0
1
0
1
Our first incumbent solution!
No further searching from node 4 because
there cannot be a better integer solution.
1
z = 42
No solution in
the subtree can
have a value
better than 42.
Branch and Bound
The incumbent
is the best
solution on
hand.
44
1
44
44 3/7
x1 = 0
x1 = 1
2
3
x2 = 0
x2 = 1
x2 = 0
42
The
incumbent
solution has
value 42
44
5
44
6
44 3/7
x2 = 1
44 1/3 7
We next solved the LP’s associated with nodes 5, 6, and 7
No new integer solutions were found.
We would eliminate (fatham) a subtree if we were guaranteed
that no solution in the subtree were better than the incumbent.
Branch and Bound
1
44 3/7
x1 = 0
44
44
x1 = 1
2
3
x2 = 0
x2 = 1
x2 = 0
42
The
incumbent
solution has
value 42
44
x3 = 0
43.75 8
5
44
x3 = 1
43.5 9
x3 = 0
43.25 10
6
44 3/7
x2 = 1
44 1/3 7
x3 = 1
43.8 11
x3 = 0
44.3 12
x3 = 1
-
We next solved the LP’s associated with nodes 8 -13
13
13
Summary so far
• We have solved 13 different linear programs so far.
– One integer solution found
– One subtree fathomed (pruned) because the solution
was integer (node 4)
– One subtree fathomed because the solution was
infeasible (node 13)
– No subtrees fathomed because of the bound
Branch and Bound
1
x1 = 0
44
4
x1 = 1
2
3
x2 = 0
x2 = 1
x2 = 0
42
The
incumbent
solution has
value 42
44 3/7
5
44
x3 = 0
43.75 8
44
x3 = 1
43.5 9
x3 = 0
43.25 10
6
44 3/7
x2 = 1
44 1/3 7
x3 = 1
43.8 11
x3 = 0
44.3 12
x3 = 1
- 13
43.75 42.66
We next solved the LP’s associated with the next nodes.
We can fathom the node with z = 42.66. Why?
Getting a better bound
• The bound at each node is obtained by solving an LP.
• But all costs are integer, and so the objective value of each
integer solution is integer. So, the best integer solution has an
integer objective value.
• If the best integer valued solution for a node is at most 42.66,
then we know the best bound is at most 42.
• Other bounds can also be rounded down.
Branch and Bound
1
x1 = 0
44
4
x1 = 1
2
3
x2 = 0
x2 = 1
x2 = 0
42
The
incumbent
solution has
value 42
44 3/7
5
44
x3 = 0
43.75 8
44
x3 = 1
43.5 9
43.75 42.66
x3 = 0
43.25 10
6
44 3/7
x2 = 1
44 1/3 7
x3 = 1
43.8 11
x3 = 0
44.3 12
x3 = 1
-. 13
Branch and Bound
1
x1 = 0
44
x1 = 1
2
3
4
44
5
43 8
43
x3 = 1
43 9
42
43
x2 = 1
6
44
x3 = 0
44
x2 = 0
x2 = 1
x2 = 0
42
The
incumbent
solution has
value 42
44
x3 = 0
x3 = 1
43 10
42
43
44 7
43 11
x3 = 0
44 12
x3 = 1
- 13
43
We found a new incumbent solution!
x1 = 1, x2 = x3 = 0, x4 = 1, x5 = 0, x6 = 1 z = 43
Branch and Bound
1
x1 = 0
44
x1 = 1
2
3
4
44
5
43 88
43
x3 = 1
43 99
42
43
x2 = 1
6
44
x3 = 0
44
x2 = 0
x2 = 1
x2 = 0
42
The new
incumbent
solution has
value 43
44
x3 = 0
x3 = 1
10
43 10
42
43
44 7
11
43 11
x3 = 0
44 12
x3 = 1
- 13
43
We found a new incumbent solution!
x1 = 1, x2 = x3 = 0, x4 = 1, x5 = 0, x6 = 1 z = 43
Branch and Bound
1
x1 = 0
44
4
x1 = 1
2
3
44
x3 = 0
43 88
44
x2 = 1
x2 = 0
x2 = 1
x2 = 0
42
The new
incumbent
solution has
value 43
44
5
44
x3 = 1
43 99
x3 = 0
10
43 10
6
44 7
x3 = 1
11
43 11
x3 = 0
44 12
x3 = 1
- 13
If we had found this incumbent earlier, we could have
saved some searching.
Finishing Up
1
x1 = 0
44
4
x1 = 1
2
3
44
x3 = 0
43 88
44
x2 = 1
x2 = 0
x2 = 1
x2 = 0
42
The new
incumbent
solution has
value 43
44
5
44
x3 = 1
43 99
x3 = 0
10
43 10
6
44 7
x3 = 1
11
43 11
x3 = 0
44 12
44 14
44 16
38 18
x3 = 1
- 13
15 -
17 -
19
-
Lessons Learned
• Branch and Bound can speed up the search
• Only 25 nodes (linear programs) were evaluated
• Other nodes were fathomed
• Obtaining a good incumbent earlier can be valuable
• only 19 nodes would have been evaluated.
• Solve linear programs faster, because we start with an excellent or
optimal solution
• uses a technique called the dual simplex method
• Obtaining better bounds can be valuable.
Branch and Bound
Notation:
– z* = optimal integer solution value
– Subdivision: a node of the B&B Tree
– Incumbent: the best solution on hand
– zI: value of the incumbent
– zLP: value of the LP relaxation of the current node
– Children of a node: the two problems created for a
node, e.g., by saying xj = 1 or xj = 0.
– LIST: the collection of active (not fathomed) nodes,
with no active children.
NOTE: zI  z*
Illustrating the definitions
1
44 3/7
x1 = 0
44
x1 = 1
2
3
x2 = 0
x2 = 1
x2 = 0
42 44
The incumbent
solution has
value 42
44
x3 = 0
43.75 8
5
44
x3 = 1
43.5 9
x3 = 0
43.25 10
6
44 3/7
x2 = 1
44 1/3 7
x3 = 1
43.8 11
x3 = 0
44.3 12
x3 = 1
13
- 13
z* = 43 = optimal integer solution value. (We
found it later in the search)
Incumbent is 0 0 1 0 1 1 zI = 42.
It is the optimal solution for the subdivision 4.
The zLP values for each subdivision are next to the nodes.
Illustrating the definitions
1
44 3/7
x1 = 0
44
x1 = 1
2
3
x2 = 0
x2 = 1
x2 = 0
42 44
The incumbent
solution has
value 42
44
x3 = 0
43.75 8
5
44
x3 = 1
43.5 9
x3 = 0
43.25 10
6
44 3/7
x2 = 1
44 1/3 7
x3 = 1
43.8 11
x3 = 0
44.3 12
x3 = 1
13
- 13
The children of node (subdivision) 1 are nodes 2 and 3.
The children of node 3 are nodes 6 and 7.
LIST = { 8, 9, 10, 11, 12 } = unfathomed nodes with no
active children
Branch and Bound Algorithm
INITIALIZE LIST =
{original problem}
Incumbent: = 
zI = -
SELECT:
If LIST = , then the Incumbent is optimal if it exists,
and the problem is infeasible if no incumbent exists;
else, let S be a node (subdivision) from LIST.
Let xLP be the optimal solution to S
Let zLP = its objective value
e.g., S = {1}
44 3/7 1
e.g., S = {13}
- 13
CASE 1. zLP = - (the LP is infeasible)
Remove S from LIST (fathom it)
Return to SELECT
Branch and Bound Algorithm
INITIALIZE
SELECT:
If LIST = , then the Incumbent is optimal (if it exists),
and the problem is infeasible if no incumbent exists;
else, let S be a node from LIST.
Let xLP be the optimal solution to S
Let zLP = its objective value
CASE 2. - < zLP  zI.
That is, the LP is dominated by the incumbent.
Then remove S from LIST (fathom it)
Return to SELECT
43
43 14
e.g., the incumbent has value 43, and node
14 is selected. zLP = 43.
88
42
Branch and Bound Algorithm
INITIALIZE
SELECT:
If LIST = , then the Incumbent is optimal (if it exists),
and the problem is infeasible if no incumbent exists;
else, let S be a subdivision from LIST.
Let xLP be the optimal solution to S
Let zLP = its objective value
CASE 3. zI < zLP and xLP is integral.
That is, the LP solution is integral and
dominates the incumbent.
e.g., node 4 was selected,
and the solution to the LP
was integer-valued.
Then Incumbent := xLP;
zI := zLP
Remove S from LIST (fathomed by integrality)
Return to SELECT
42
4
Branch and Bound Algorithm
INITIALIZE
SELECT:
If LIST = , then the Incumbent is optimal (if it exists),
and the problem is infeasible if no incumbent exists;
else, let S be a subdivision from LIST.
Let xLP be the optimal solution to S
Let zLP = its objective value
e.g., select node 3.
I
LP
LP
CASE 4. z < z and x is not integral.
There is not enough information to fathom S
x1 = 1
3
Remove S from LIST
Add the children of S to LIST
Return to SELECT
x2 = 0
44 3/7
x2 = 1
6
List := List – 3 + {6,7}
7
Different Selection Rules are Possible
• Rule of Thumb 1: Don’t let LIST get too big (the solutions
must be stored). So, prefer nodes that are further down in
the tree.
• Rule of Thumb 2: Pick a node of LIST that is likely to lead
to an improved incumbent. Sometimes special heuristics
are used to come up with a good incumbent.
Branching
One does not have to have the B&B
tree be symmetric, and one does not
select subtrees by considering variables
in order.
X1
X2
=0
X2
X
X3 =
. .
. .
. .
0
=0
X3 =
. .
. .
. .
3= 0
X1
=1
X8 = 0
X8 = 1
X
X3=
1
X 3=
1
. .
. .
. .
=1
. .
. .
. .
0
X 3=
. .
. .
. .
. .
. .
. .
3
=0
1
X3
.
.
.
.
.
.
.
.
.
.
.
.
=1
Choosing how to
branch so as to
reduce running
time is largely
“art” and based
on experience.
Different Branching Rules are Possible
• Branching: determining children for a node. There are
many choices.
• Rule of thumb 1: if it appears clear that xj = 1 in an
optimal solution, it is often good to branch on xj = 0 vs xj =
1.
– The hope is that a subdivision with xj = 0 can be
pruned.
• Rule of thumb 2: branching on important variables is
worthwhile
Different Bounding Techniques are Possible
• We use the bound obtained by dropping the integrality
constraints (LP relaxation). There are other choices.
• Key tradeoff for bounds: time to obtain a bound vs quality
of the bound.
• If one can obtain a bound much quicker, sometimes we
would be willing to get a bound that is worse
• It usually is worthwhile to get a bound that is better, so
long as it doesn’t take too long.
What if the variables are general integer variables?
• One can choose children as follows:
– child 1: x1  3 (or xj  k)
– child 2 x1  4 (or xj  k+1)
• How would one choose the variable j and the value k
– A common choice would be to take a fractional value
from xLP. e.g., if x7 = 5.62, then we may branch on x7 
5 and x7  6.
– Other choices are also possible.
•
Branch and Bound is the standard way of solving IPs to optimality.
•
There is art to making it work well in practice.
•
Much of the art is built into state-of-the-art solvers such as CPLEX.
New exciting area
combining LP based branch-and-bound based
techniques with constraint programming
techniques - forward checking; arc-consistency
and other forms of consistency checking and
propagation!
A* - well-known AI algorithm that is a
generalization of branch-and-bound with
LP relaxations – notion of admissible
heuristic that overestimates
(underestimates) the objective function for
a maximization (minimization) problem
(analogously to what a relaxation does)
Used in e.g. MapQuest and Darpa Challenge
It was fun to teach INFO 372!
Hope you had fun too!
THE END
!!!
Download