Integer Programming - When all variables are requited to be

advertisement
Integer Programming
- When all variables are requited to be integers, then we have a pure integer programming
problem (IP).
- When some of the variables are requited to be integers, then we have a mixed integer
programming problem (MIP).
- An integer programming problem in which all the variables must equal 0 or 1 is called a 0-1
IP ot Binary IP.
The concept of LP relaxation of an IP problem plays a key role in the solution of IP’s.
Definition: The LP obtained by omitting all integer or 0 – 1 constraints on variables is called
the LP relaxation of the IP.
e.g.
IP
Max Z = 3x1 + 2x2
s.t.
x1 + x2 ≤ 6
→
x1, x2 ≥ 0
x1, x2 integer
Max Z = 3x1 + 2x2
s.t.
x1 + x2 ≤ 6
x1, x2 ≥ 0
LP relaxation
An IP may be viewed as the LP relaxation plus additional constraints (e.g. integer values).
Hence, the feasible region for any IP must be contained in the feasible region for the
corresponding LP relaxation. For any IP that is a max. problem implies that,
Optimal Z-value for LP relaxation ≥ Optimal Z-value for IP.
Many practical solutions can be formulated as IP’s (see section 9.2 on p.477 in text book).
Examples: Capital budgeting (choosing among investments), fixed charge problem (cost
related to activity, not on level of activity), credit card processing operations, facility
locations, etc. The following are some examples.
Formulating IPs (Examples are from Winston or based on Winston)
Capital Budgeting Example
Suppose
we wish to invest $14 000. We have identified four investment
opportunities. Investment 1 requires an investment of $5 000 and has a n e t present
value (NPV) of $16 000; investment 2 requires $7,000 and has a NPV of $22 000;
investment 3 requires $4 000 and has a N P V of $12 000; and investment 4 requires
$3 000 and has a N P V of $8 000. Into which investments should we place our money so
as to maximize our total NPV?
As in LP, we begin by defining a variable for each decision.
In this case, we use a 0-1 variable xj for each investment.
If xj is 1 then we will make investment j.
If it is 0, we will not make the investment. Then, the problem is,
Max Z = 16 x1 + 22x2 + 12x3 + 8x4
s.t
5x1 +7x2 + 4x3 + 3x4 < 14
xj = 0 or 1 (j = 1, 2, 3, 4)
Note: The optimal solution to above is x1 = 0, x2 = x3 = x4 = 1, Z = $42 000. Investment 1,
even though it yields a higher NPV per dollar, is not chosen because investing in 1 would
1
only allow $12 000 to be invested whereas the optimum choices allow all $14 000 to be
invested).
Capital Budgeting (Continued)
There are a number of additional constraints we might want to add. Logical
restrictions can be enforced using 0-1 variables. For instance, consider the following
requirements:
We can only invest in at most two investments – simply add the constraint,
x1 + x2 + x3 + x4 ≤ 2
If we invest in 2,must also invest in investment 1 – we add the constraint,
x2 < x1
or
x2 – x1 < 0
If x2 is 1, then x1 is also 1 as required; if x2 is 0, then there is no restriction for x1 (x1 is 0 or
1)
If investment 2 is made, can not invest in 4 – add the constraint,
x2 + x4 < 1
If x2 is 1, then x4 is 0 as required; if x2 is 0, then there is no restriction for x4 (x3 is 0 or 1)
Fixed Charge Problems
This example illustrates an important trick can be used to formulate many production and
location problems as IPs.
Gandhi Cloth Company is capable of manufacturing three types of clothing: shirts, shorts, and
pants. The manufacture of each type of clothing requires that Gandhi have the ap- propriate
type of machinery available. The machinery needed to manufacture each type of clothing
must be rented at the following rates: shirt machinery, $200 per week; shorts machinery, $150
per week; pants machinery, $100 per week. The manufacture of each type of clothing also
requires the amounts of cloth and labor shown in Table 2. Each week, 150 hours of labor and
160 sq yd of cloth are available. The variable unit cost and sell- ing price for each type of
clothing are shown in Table 3. Formulate an IP whose solution will maximize Gandhi’s
weekly profits.
As in LP formulations, we define a decision variable for each decision that Gandhi must
make. We define,
x1 = number of shirts produced each week
x2 = number of shorts produced each week
x3 = number of pants produced each week
Resource Requirements for Gandhi
Revenue and Cost Information for Gandhi
Clothing
Type
Shirt
Shorts
Pants
Clothing
Type
Shirt
Shorts
Pants
Labour
(Hours)
3
2
6
Cloth
(Sq Yards)
4
3
4
Sales
Price ($)
12
8
15
Variable
Cost ($)
6
4
8
Note that the cost of renting machinery depends only on the types of clothing produced, not
2
on the amount of each type of clothing. This enables us to express the cost of renting
machinery by using the following variables:
y1 = 1 if any shirts are manufactured, y1 = 0 otherwise
y2 = 1 if any shorts are manufactured, y2 = 0 otherwise
y3 = 1 if any pants are manufactured, y3 = 0 otherwise
In short, if xj > 0, then yj = 1, and if xj = 0, then yj = 0. Thus, Gandhi’s weekly profits
= (weekly sales revenue) - weekly variable costs) - (weekly costs of renting machinery).
Also, weekly cost of renting machinery = 200y1 + 150y2 + 100y3
(A)
To justify (A), note that it picks up the rental costs only for the machines needed to
manufacture those products that Gandhi is actually manufacturing. For example, suppose that
shirts and pants are manufactured. Then y1 = y3 = 1 and y2 = 0, and the total weekly
rental cost will be (200 + 100) = $300.
Because the cost of renting, say, shirt machinery does not depend on the number of shirts
produced, the cost of renting each type of machinery is called a fixed charge. A fixed charge
for an activity is a cost that is assessed whenever the activity is undertaken at a nonzero
level. The presence of fixed charges will make the formulation of the Gandhi problem much
more difficult.
Gandhi’s weekly profits will be,
Weekly profit = (12x1 + 8x2 + 15x3) - (6x1 + 4x2 + 8x3) - (200y1 + 150y2 + 100y3)
= 6x1 + 4x2 + 7x3 - 200y1 - 150y2 - 100y3
Thus, Gandhi wants to maximize
z = 6x1 + 4x2 + 7x3 - 200y1 - 150y2 - 100y3
Because its supply of labor and cloth is limited, Gandhi faces the following two constraints:
Constraint 1: At most, 150 hours of labor can be used each week.
Constraint 2: At most, 160 sq yd of cloth can be used each week. Then,
3x1 + 2x2 + 6x3 ≤ 150
4x1 + 3x2 + 4x3 ≤ 160
(Labor constraint)
(Cloth constraint)
(IP1)
Combining above,
Max
s.t.
s.t.
z = 6x1 + 4x2 + 7x3 - 200y1 - 150y2 - 100y3
3x1 + 2x2 + 6x3 ≤ 150
4x1 + 3x2 + 4x3 ≤ 160
x1, x2, x3 ≥ 0; x1, x2, x3 integer
3x1 y1, y2, y3 = 0 or 1
The optimal solution to this problem is found to be x1 = 30, x3 = 10, x2 = y1 = y2 = y3 =
0. This cannot be the optimal solution to Gandhi’s problem because it indicates that Gandhi
can manufacture shirts and pants without incurring the cost of renting the needed machinery.
The current formulation is incorrect because the variables y1, y2, and y3 are not present in
the constraints. This means that there is nothing to stop us from setting y1 = y2 = y3 = 0.
Setting yi = 0 is certainly less costly than setting yi = 1, so a minimum- cost solution to IP1
will always set yi = 0. Somehow we must modify (IP 1) so that whenever xi > 0, yi = 1 must
3
hold. The following trick will accomplish this goal. Let M1, M2, and M3 be three large
positive numbers, and add the following constraints to (IP 1):
x1 ≤ M1y1
(a)
x2 ≤ M2y2
(b)
x3 ≤ M3y3
(c)
Adding these to IP 1 will ensure that if xi > 0, then yi = 1. To illustrate, if x1 > 0, then y1
cannot be 0. For if y1 = 0, then the above (a) would imply x1 ≤ 0 or x1 = 0. Thus, if
x1 > 0, y1 = 1 must hold. If any shirts are produced (x1 > 0), (a) ensures that y1 = 1, and the
objective function will include the cost of the machinery needed to manufacture shirts. Note
that if y1 = 1, then (a) becomes x1 ≤ M1, which does not unnecessarily restrict the value of
x1. If M1 were not chosen large, however (say, M1 = 10), then (a) would unnecessarily restrict
the value of x1. In general, Mi should be set equal to the maximum value that xi can attain.
In the current problem, at most 40 shirts can be produced (if Gandhi produced more than 40
shirts, the company would run out of cloth), so we can safely choose M1 = 40. Also, M2 = 53
and M3 = 25.
If x1 = 0, (a) becomes 0 ≤ M1y1. This allows either y1 = 0 or y1 = 1. Because y1 = 0 is less
costly than y1 = 1, the optimal solution will choose y1 = 0 if x1 = 0. In summary, if (a)–(c)
are added to (IP 1), then xi > 0 will imply yi = 1, and xi = 0 will imply yi = 0.
The optimal solution to the Gandhi problem is z = $75, x3 = 25, y3 = 1. Thus, Gandhi should
produce 25 pants each week.
The Gandhi problem is an example of a fixed-charge problem. In a fixed-charge problem,
there is a cost associated with performing an activity at a nonzero level that does not depend
on the level of the activity. Thus, in the Gandhi problem, if we make any shirts at all (no
matter how many we make), we must pay the fixed charge of $200 to rent a shirt machine.
Problems in which a decision maker must choose where to locate facilities are often fixedcharge problems. The decision maker must choose where to locate various facilities (such as
plants, warehouses, or business offices), and a fixed charge is often associated with building
or operating a facility.
4
The Lockbox Example
Consider a national firm that receives checks from all over the United States. There is a
variable delay from when the check is postmarked (and hence the customer has met her
obligation) and when the check clears (the firm can use the money). It is in the firm's
interest to have the check clear as quickly as possible since then the firm can use the
money. To speed up this clearing, firms open offices (lockboxes) in different cities to
handle the checks.
For example, suppose we receive payments from four regions (West, Midwest, East, and
South). The average daily value from each region is as follows: $70,000 from the West,
$50,000 from the Midwest, $60,000 from the East, and $40,000 from the South. We are
considering opening lockboxes in L.A., Chicago, New York, and/or Atlanta. Operating a
lockbox costs $50,000 per year. The average days from mailing to clearing is given in the
table.
West
Midwest
East
South
LA
2
6
8
8
Chicago
6
2
5
5
NY
8
5
2
5
Atlanta
8
5
5
2
Which lockboxes should we open? Formulate an IP that we can use to minimize the sum of
costs due to lost interest and lockbox operations. Assume that each region must send all its
money to a single city and investment rate is 20%.
First we must calculate the losses due to lost interest for each possible assignment. For
example, if the West sends to New York, then on average there will be $560,000 (= 8 *
$70.000) in process on any given day. Assuming an investment rate of 20%, this
corresponds to a yearly loss of $112,000. We can calculate the losses for the other
possibilities in a similar fashion to get the following table.
West
Midwest
East
South
LA
28
60
96
64
Chicago
84
20
60
40
NY
112
50
24
40
Atlanta
112
50
60
16
Let yj be a 0-1 variable that is 1 if lockbox j is opened and 0 if it is not.
Let xij be 1 if region i sends to lockbox j; and 0 otherwise.
Our objective is to minimize our total yearly costs. That is,
Min z = 28 x11 + 84 x12 + 112 x13 + … + 50 y1 + 50 y2 + 50 y3 + 50 y4
One set of constraint is that each region must be assigned to one lockbox:
𝑛
∑ xij = 1
𝑗=1
A more difficult set of constraint is that a region can only be assigned to an open lockbox.
This can be written as
x1j + x2j + x3j + x4j < M yj
5
M is any number that should be at least 4 as there are four regions.
(Suppose we do not open LA lockbox; then y1 = 0, so all of x11, x21, x31, and x41 must
also be 0. If y1 = 1, then there is no restriction on the x values.)
For this problem we would have twenty variables (four y variables, sixteen x variables)
and eight constraints. This gives the following 0-1 linear program:
Min Z =
28 x11 + 84 x12 + 112 x13 + 112 x14 + 60 x21 + 20 x22 + 50 x23 + 50x24
+ 96 x31 + 60 x32 + 24 x33 + 60 x34 + 64 x41 + 40 x42 + 40 x43 + 16 x44
+ 50 y1 + 50 y2 + 50 y3 + 50 y4
s.t.
x11 + x12 + x13 + x14 = 1
x21 + x22 + x23 + x24 = 1
x31 + x32 + x33 + x34 = 1
x41 + x42 + x43 + x44 = 1
x11 + x21 + x31 + x41 < 4y1
x12 + x22 + x32 + x42 < 4y2
x13 + x23 + x33 + x43 < 4y3
x14 + x24 + x34 + x44 < 4y4
All xij and yj = 0 or 1
There are other formulations, however.
For instance, instead of four constraints of the form x1j + x2j + x3j + x4j < M yj ,
consider the sixteen constraints of the form:
xij < yj
i = 1, 2, 3, 4; j = 1, 2, 3, 4
These constraints also force a region to only use open lockboxes.
(Suppose yj is 0, so by using four corresponding constraints all of x1j, x2j, x3j, and x4j
must also be 0. If yj is 1, then there is no restriction on the x values.
Another point of view: If xij = 1, then yj = 1 as required. Also if x1j = x2j = x3j = x4j = 0,
then there is no restriction on the y values. The act of minimizing costs will result yj = 0.
6
Set-Covering Problems
The following example is typical of an important class of IPs known as set-covering problems.
Example: There are six cities (cities 1–6) in Kilroy County. The county must determine
where to build fire stations. The county wants to build the minimum number of fire stations
needed to ensure that at least one fire station is within 15 minutes (driving time) of each city.
The times (in minutes) required to drive between the cities in Kilroy County are shown in
Table below. Formulate an IP that will tell Kilroy how many fire stations should be built
and where they should be located.
For each city, Kilroy must determine whether to build a fire station there. We define the 0–1
variables x1, x2, x3, x4, x5, and x6 by
xi = 1 if a fire station is built in city 1, otherwise xi = 0.
Then the total number of fire stations that are built is given by ( x1 + x2 + x3 + x4 + x5 +
x6), and Kilroy’s objective function is to minimize
z = x1 + x2 + x3 + x4 + x5 + x6
What are Kilroy’s constraints? Kilroy must ensure that there is a fire station within 15
minutes of each city. Table below indicates which locations can reach the city in 15
minutes or less. To ensure that at least one fire station is within 15 minutes of city 1, we add
the constraint
x1 + x2 ≥ 1
(City 1 constraint)
This constraint ensures that x1 = x2 = 0 is impossible, so at least one fire station will be
built within 15 minutes of city 1. Similarly the constraint
x1 + x2 + x6 ≥ 1
(City 2 constraint)
ensures that at least one fire station will be located within 15 minutes of city 2. In a similar
fashion, we obtain constraints for cities 3–6. Combining these six constraints with the
objective function ( and the fact that each variable must equal 0 or 1), we obtain the following
0 – 1 IP:
Time Required to Travel between Cities in Kilroy County
To
From
City 1
City 2
City 3
City 4
City 5
City 6
City 1
0
10
20
30
30
20
City 2
10
0
25
35
20
10
City
320
25
0
15
30
20
7
City 4
30
35
15
0
15
25
City 5
30
20
30
15
0
14
City
620
10
20
25
14
0
Cities within 15 Minutes of
Given City
City Within 15 Minutes
1
2
3
4
5
6
1,
1,2 2,
3,
3,4 4,
4, 5,
2, 5,
6
5
6
6
Min z = x1 + x2 + x3 + x4 + x5 + x6
s.t.
x1
x1
+ x2
+ x2
x6
x3
x3
+ x4
+ x4
x4
x2
+ x5
+ x5
+ x5
+ x6
+ x6
≥1
≥1
≥1
≥1
≥1
≥1
(City 1 constraint)
(City 2 constraint)
(City 3 constraint)
(City 4 constraint)
(City 5 constraint)
(City 6 constraint)
xi = 0 or 1 ( i = 1, 2, 3, 4, 5, 6)
One optimal solution to this IP is z =2, x2 = x4 = 1, x1 = x3 = x5 = x6 = 0. Thus, Kilroy
County can build two fire stations: one in city 2 and one in city 4.
Dealing with Either – or Constraints
The following is a commonly occurring mathematical programming problem. We have two
constraints:
f(x1, x2,...... ,xn) ≤ 0
g(x1, x2,...... ,xn) ≤ 0
We want to ensure that at least one of the constraints is satisfied. To ensure that, we write
f(x1, x2,...... ,xn) ≤ My
(1)
g(x1, x2,...... ,xn) ≤ M(1 – y)
(2)
where y is a 0 – 1 variable and M is a number chosen large enough to ensure that
f(x1,x2,......,xn) ≤ M and g(x1, x2,...... ,xn) ≤ M are satisfied for all values of x1, x2, ..., xn that
satisfy other constraints in the problem. [y = 0, (1) is satisfied, possibly also (2). If y = 1, (2)
is satisfied and possibly (1) also]
If – The Constraints
We want to ensure that if a constraint f(x1, x2, ...,xn) > 0 is satisfied, then the constraint
g(x1,x2,.....xn) ≥ 0 must be satisfied while if f(x1, x2, .... xn) > 0 is not satisfied, then
g(x1,x2,......xn) ≥ 0 may or may not be satisfied. To ensure this, we include the following
constraints in the formulations:
- g(x1, x2, ......, xn) ≤ My
f(x1, x2, ......., xn) ≤ M(1 – y)
y = 0 or 1
(3)
(4)
8
(M is chosen large enough that f ≤ M and –g ≤ M for all values of x1, x2, ...., xn that satisfy the
other constraints). f > 0 can only be satisfied if y = 0; then (3) is satisfied. If f > 0 is not
satisfied and y = 1, (3) is satisfied, then g < 0 or g ≥ 0 are both possible.
Primary determinants of computational difficulty for an IP problem are (1) the number of
integer variables and (2) any special structure in the problem. In LP, the number of functional
constraints is much more important than the number of variables. In IP, number of functional
constraints are of secondary importance. For MIP problems, it is the number of integer
variables rather than the total number of variables that is important.
The Branch – and – Bound Method for Solving Pure IP Problems
In practice, most IP’s are solved by using this technique.
Note: If we solve the LP relaxation of a pure IP and obtain a solution in which all variables
are integers, then the optimal solution to the LP relaxation is also the optimal solution to the
IP.
Let us consider the example of pure IP:
Max Z = 8x1 + 5x2
s.t.
x1 + x2 ≤ 6
9x1 + 5x2 ≤ 45
x1, x2 ≥ 0;
x1, x2 integer.
The branch-and-bound method begins by solving LP relaxation of the IP ( i.e. no integer
constraint). If in the optimal solution variables are integers, then it is also optimal solution for
the IP. However, LP relaxation solution yields Z = 165/4, x1 = 15/4, x2 = 9/4.
We now know that optimal Z-value for IP ≤ 165/4 (= 41.25). Upper bound = 41 since we have
integer values in IP.
The next step is to partition the feasible region for the LP relaxation. We arbitrarily choose a
variable that is fractional in the optimal solution to the LP relaxation – say x1. Now, every
point in the feasible region for the IP must have either x1 ≤ 3 or x1 ≥ 4.
Subproblem 2 = Subproblem 1 (LP relaxation) + constraint x1 ≥ 4
Subproblem 3 = Subproblem 1 (LP relaxation) + constraint x1 ≤ 3
9
10
9
9x1 + 5x2 = 45
8
7
x1 + x2 = 6
6
5
4
3
2
F
Optimal LP solution to Subproblem 1
1
C
x1 = 3.75, x2 = 2.25
0
0
1
2
3
4
A5
6
7
We see that every point described by integer numbers in the feasible region are the feasible
points (intersections of dotted lines and intersections of the dotted lines and the axes). We also
see that every feasible point is included in the feasible region for subproblem 2 or
subproblem3. We say that subproblems 2 and 3 were created by branching on x1.
We now choose arbitrarily to solve subproblem 2.
Solution is the point C where Z = 41, x1 = 4, and x2 = 9/5
Solution step at this stage is as follows.
t=1
Subproblem 1
Z = 165/4
x1= 15/4
x2 = 9/4
x1 ≥ 4
x1 ≤ 3
Subproblem 2
Z = 41
x1= 4
x2 = 9/5
Subproblem 3
A display of all subproblems that have been created is called a tree. Each subproblem is
referred to as a node of the tree and each line connecting two nodes is called an arc.
Subproblem 2 did not yield all integer solution. So, we choose subproblem 2 to create two
new subproblems. In this case we branch on x2 (only fractional variable).
10
Subproblem 4 = Subproblem 1 + constraint x1 ≥ 4 and x2 ≥ 2 = subproblem 2 + constraint
x2 ≥ 2
Subproblem 5 = Subproblem 2 + constraint x2 ≤ 1
For feasible regions for subproblems 4 and 5 see figure. (B,A,I,H for subpr. 5).
C
H
B
x2 = 2
I
A
x2 = 1
4
5
Now, unsolved problems are: Subproblems 3, 4, and 5. We now choose one to solve. We
choose to solve the most recently created subproblem (LIFO, last in first out rule). We should
then solve subproblem 4 or 5. Arbitrarily we choose subproblem 4. However, we see that
subproblem 4 is infeasible and, therefore, can not yield optimal solution. We place an ‘X’ by
the subproblem 4 as any branch from it will not yield useful information. When further
branching on a subproblem can not yield any useful information, we say that the subproblem
(or node) is fathomed.
Now, unsolved problems are Subpr. 3 and Subpr. 5. With LIFO rule, we choose to solve
subpr. 5. Solution is point I where Z = 365/9, x1 = 40/9, and x2 = 1. No immediate useful
information (one variable is not an integer). We therefore choose to partition Subpr. 5 by
branching on x1.
Subpr. 6 – Subpr. 5 + Constraint x1 ≥ 5
Subpr. 7 – Subpr. 5 + Constraint x1 ≤4
We should now solve subpr. 6 or 7. Arbitrarily we choose subpr. 7. Solution point is H where
Z = 37, x1 = 4, and x2 =1.
Both variables have integer values, so this solution is feasible for the original IP. (Subpr. 7
can not yield any solution where Z > 37).
Further branching will not yield new information, hence subpr. 7 is fathomed (see tree). This,
at the same time, is a candidate solution since all variables are integer values. We keep the
candidate solution until a better one is found. This is now a lower bound for Z. Optimal
Z ≥ 37. We indicate Lower Bound by placing LB = 37 in the box corresponding to the next
solved subproblem.
Next to sove is Subpr. 6. The solution is point A where Z = 40, x1 = 5, and x2 = 0.
All decision variables are integers. Z = 40 is larger than Z = 37 of subpr. 7. We then place ‘X’
by subproblem 7, we update LB to 40 and make this solution candidate solution.
11
Subproblem 3 is the only unsolved problem. Solution is point F where Z = 39, x 1 = x2 = 3.
Subpr. 3 can not yield a Z-value exceeding the current lower bound 40, so it can not yield the
optimal solution. We place an ‘X’ next to it (it is fathomed). Optimal solution is,
x1 = 5, x2 = 0, and Z = 40.
t=1
t=2
Subpr. 1
Z = 165/4
x1= 15/4
x2 = 9/4
x1 ≥ 4
x1 ≤ 3
Subpr. 2
Z = 41
x1= 4
x2 = 9/5
Subpr. 3
Z = 39
x1 = 3
x2 = 3
LB = 40
t=7
x2 ≥ 2
x2 ≤ 1
Subpr. 4
Subpr. 5
t = 3 Infeasible
t = 4 Z = 365/9
X
x1 = 40/9
x2 = 1
x1 ≥ 5
t=6
X
x1 ≤ 4
Subpr. 6
Z = 40
x1 = 5
x2 = 0
LB = 37
Cand. Soln
t=5
Subpr.7
Z = 37
x1 = 4
x2 = 1
Cand. Soln
X
Optimal Solution
Key Aspects of Branch-and- Bound Method for Solving pure IP’s
Step 1 – If it is unnecessary to branch on a subproblem, then it is fathomed. The following
three situations result in a subproblem being fathomed:
(1) The optimal Z value for the subproblem does not exceed (in a max. problem) the current
LB.
(2) The subproblem is infeasible, and
(3) The subproblem yields an optimal solution in which all variables have integer values.
Step 2 – A subproblem may be eliminated from consideration in the following situations:
(1) The subproblem is infeasible, (2) The LB is at least as large as the Z-value for the
subproblem.
Branch-and-Bound Method for Solving Mixed Integer Programming Problems
These problems are solved with similar procedure to the IP method. For these problems, the
procedure is modified by branching only on variables that are required to be integers.
12
Branch-and-Bound Method for Solving Binary Integer Programming Problems
There are many problems which are binary IP problems. Often, they are of decision making
variety with “Yes”, “No” decisions (1, 0). The method used to solve these problems is similar
to the method described earlier.
Example.
Max. Z = 9x1 + 5x2 + 6x3 + 4x4
s.t.
6x1 + 3x2 + 5x3 + 2x4 ≤ 10
x3 + x4 ≤ 1
- x1 +
x3
≤0
-x2 +
x4 ≤ 0
and xj is binary for j = 1, 2, 3, 4
We first solve the LP relaxation of the problem. Usually we remove the set of constraints that
make the problem difficult to solve. For IP, it is the integer requirement.
The LP relaxation is as shown before with the xj is binary for j = 1, 2, 3, 4 replaced by
xj ≤ 1 and xj ≥ 0 for j = 1, 2, 3, 4
Solution yields (x1, x2, x3, x4) = (5/6, 1, 0, 1) with Z = 16 1/2
Therefore, Z ≤ 16 1/2 for all feasible solutions for the BIP problem. (Bound for the problem:
Z ≤ 16 since variables are integers).
Branching for BIP problems start by fixing one of the variables, say x 1, at x1 = 0 for one
subproblem and x1 = 1 for another subproblem.
Subproblem 1 :
For x1 = 0
Max. Z = 5x2 + 6x3 + 4x4
s.t.
3x2 + 5x3 + 2x4 ≤ 10
x3 + x4 ≤ 1
x3
≤0
-x2 +
x4 ≤ 0
and xj is binary for j = 2, 3, 4
Subproblem 2: x2 = 1
Max. Z = 9 + 5x2 + 6x3 + 4x4
3x2 + 5x3 + 2x4 ≤ 4
x3 + x4 ≤ 1
x3
≤1
-x2 +
x4 ≤ 0
and xj is binary for j = 2, 3, 4
s.t.
LP relaxation of subproblem 1: (x1, x2, x3, x4) = (0, 1, 0, 1) with Z = 9
(Solution obtained by removing integer requirement and adding constraint x j ≤ 1 and xj ≥ 0
for j = 1, 2, 3, 4.
LP relaxation of subproblem 2: (x1, x2, x3, x4) = (1, 4/5, 0, 4/5) with Z = 16 1/5
13
Solution to subproblem 1 is integer, we therefore fathome it with Z = 9 which now becomes
candidate solution or incumbent. The subproblem 1 has been fathomed by test 3, as indicated
by F(3) in diagram below.
x1
0
9 = Z* candidate solution or incumbent
(0, 1, 0, 1)
All
Bound
16
1
16
(1, 4/5, 0, 4/5)
To complete the solution, similar steps are taken. To summarize the steps to reach a solution:
1. Branching: Among the remaining (unfathomed) subproblems select the one that was
created most recently (break ties according to which has the larger bound). Branch from the
node for this subproblem to create two new subproblems by fixing the next variable at either 0
or 1.
2. Bounding: For each new subproblem, obtain its bound by applying the simplex method to
its LP relaxation and rounding down the value of Z for the resulting optimal solution.
3. Fathoming: For each new subproblem, apply the three fathoming tests summarized earlier,
and discard those subproblems that are fathomed by any of the tests.
Optimality test: Stop when there are no remaining subproblems. Then, the candidate solution
or the current incumbent is optimal solution. Otherwise, return to perform another iteration.
(If there is no candidate solution, the conclusion is that the problem has no feasible solutions).
To complete solution of problem, we branch on subproblem 2:
Subproblem 3:
x1 = 1,
x2 = 0
Subproblem 4:
x1 = 1,
x2 = 1
LP relaxation solutions:
Subproblem 3: (x1, x2, x3, x4) = (1, 0, 4/5, 0)
Z = 13 4/5
Subproblem 4: (x1, x2, x3, x4) = (1, 2, 0, 1/2)
Z = 16
Bound for subproblem 3:
Z ≤ 13
Bound for subproblem 4:
Z ≤ 16
Both of these subproblems fail fathoming tests.
We next branch on subproblem 3 or 4; subproblem 4 has higher bound, we choose
subproblem4.
14
Subproblem 5: x1 = 1, x2 = 1, x3 = 0
LP relaxation: (1, 1, 0, 1/2)
Subproblem 6: x1 = 1, x2 = 1, x3 = 1
LP relaxation: No feasible solution.
Bound for subproblem 5:
Z = 16
Z ≤ 16
Subproblem 5 can not be fathomed (fails test); Subproblem 6 is fathomed F(2).
Since, now, the resulting branching variable x4 is the last variable, fixing its value at either 0
or 1 actually creates a single solution rather than subproblems requiring full investigation.
These single solutions are:
x4 = 0: (x1, x2, x3, x4) = (1, 1, 0, 0) is feasible with Z = 14.
x4 = 1: (x1, x2, x3, x4) = (1, 1, 0, 1) is infeasible.
Fathoming tests: First solution passes test 3, second passes test 2. Furthermore, first solution
is better than the candidate or incumbent solution (14 > 9), so it becomes the new candidate
solution or incumbent with Z* = 14.
The only remaining subproblem is now the node subproblem 3 (x1 = 1, x2 = 0). Its bound is 13
which is less than 14. Therefore, subproblem 3 is also fathomed by test 1.
We now have the solution tree shown below. The candidate solution or the incumbent is,
(x1, x2, x3, x4) = (1, 1, 0, 0) is optimal with Z = 14.
x1
x2
x3
x4
Subpr 1
F(3)
9 = Z*
Subpr 3
F(1)
0
0
All
0
Subpr 5
16
13
Subpr 4
16
14 = Z*
(1, 1, 0, 0) = candidate soln.
0
1
Subpr 2
F(3)
16
1
1
F(2)
16
Subpr 6
1
F(2)
15
Cutting Plane Algorithm
An alternative method to branch-and-bound method is the Cutting Plane Algorithm.
Reconsidering the problem that was solved by branch-and-bound method:
Max Z = 8x1 + 5x2
s.t.
x1 + x2 ≤ 6
9x1 + 5x2 ≤ 45
x1, x2 ≥ 0;
x1, x2 integer.
Optimal tableau for the LP relaxation, i.e. without integer constraint, is:
Z
1
0
0
x1
0
0
1
x2
0
1
0
s1
1.25
2.25
-1.25
s2
0.75
-0.25
0.25
RHS
41.25
2.25
3.75
To apply cutting plane method, we begin by choosing any constraint in the LP relaxation’s
optimal tableau in which a basic variable is fractional. Arbitrarily, we choose 2nd constraint.
x1 – 1.25s1 + 0.25s2 = 3.75
(*)
We now define [x] to be the largest integer less than or equal to x, i.e. [3.75] = 3 and
[1.25] = - 2.
Any number can be written in the form
Then, 3.75 = 3 + 0.75 or
[x] + f
where 0 ≤ f < 1.
- 1.25 = - 2 + 0.75
Now, 2nd constraint becomes,
x1 – 2s1 + 0.75s1 + 0s2 + 0.25s2 = 3 + 0.75
or
x1 – 2s1 + 0s2 – 3 = 0.75 – 0.75s1 – 0.25s2
(**)
Cutting plane algorithm now suggests adding the following constraint to the LP relaxation’s
optimal tableau:
0.75 – 0.75s1 – 0.25s2 ≤ 0
This constraint is called c cut (Gomory cut).
Note: The logic for the above cut is that,
- 0.75s1 – 0.25s2 + 0.75 ≤ 0.75
since s1 and s2 are integers.
Also, in equation (**) LHS is integer, therefore, RHS is also integer.
Then,
0.75 – 0.75s1 – 0.25s2 ≤ 0
since integer less than 0.75 is zero.
This cut “cuts off” the current optimal solution to the LP relaxation, but not any feasible
solutions to the IP. If solution with this constraint added yields an integer solution, then we
have found the optimal solution to the original IP. If the new optimal solution (to LP
relaxation + cut) has some fractional-valued variables, then we generate another cut and
continue the process.
16
We now add the “cut” constraint to the LP relaxation’s optimal tableau after adding slack
variable s3: (i.e. - 0.75s1 – 0.25s2 + s3 = - 0.75)
Z
1
0
0
0
x1
0
0
1
0
x2
0
1
0
0
s1
1.25
2.25
-1.25
-0.75
s2
0.75
-0.25
0.25
-0.25
s3
0
0
0
1
RHS
41.25
2.25
3.75
-0.75
Dual simplex algorithm is most appropriate for this solution. s1 should enter the basis in the
third constraint (Ratios: 1.25/0.75 for s1, 0.75/0.25 for s2). The tableau becomes:
Z
1
0
0
0
x1
0
0
1
0
x2
0
1
0
0
s1
0
0
0
1
s2
0.33
-1
0.67
0.33
s3
1.67
3
-1.67
-1.33
RHS
40
0
5
1
Solution yields the optimal solution with Z = 40, x1 = 5, x2 = 0.
Note: The algorithm requires that all coefficients of variables in the constraints and all
RHS’s of constraints be integers. This is to ensure that if the original decision variables are
integers, then the slack and excess variables will also be integers. Thus, a constraint such
as
x1 + 0.5x2 ≤ 3.6 must be replaced by 10x1 + 5x2 ≤ 36.
Example.
s.t.
Max Z = 7x1 + 10x2
- x1 + 3x2 ≤ 6
7x1 + x2 ≤ 35
x1, x2 ≥ 0 and integer.
Given the slacks s1 and s2 for constraints 1 and 2, the optimum LP relaxation tableau is:
BV
Z
x2
x1
Z
1
0
0
x1
0
0
1
x2
0
1
0
Optimal solutin - Z = 66 1/2 ,
s1
63/22
7/22
-1/22
s2
31/22
1/22
3/22
RHS
66 1/2
3 1/2
4 1/2
x1 = 4 1/2, x2 = 3 1/2, s1 = s2 = 0
The cut is developed under the assumption that all the variables are integers (including s1 and
s2).
We can generate cut from any row above, row 0 or row 1 or row 2, and any one can be used
in the first iteration.
Row 0:
Z + 2s1 + s2 – 66 = -(19/22)s1 – (9/22)s2 + 1/2
Then, the cut is
-(19/22)s1 – (9/22)s2 + 1/2 ≤ 0
Row 2 (for x1):
Then, the cut is
x1 + (- 1 + 21/22)s1 + (0 + 3/22)s2 = (4 + 1/2)
- (21/22)s1 – (3/22)s2 + 1/2 ≤ 0
Row 1 (for x2):
Then, the cut is
x2 + (0 + 7/22)s1 + (0 + 1/22)s2 = (3 + 1/2)
- (7/22)s1 – (1/22)s2 + 1/2 ≤ 0
17
Note: A constraint equation can be used as a source row for generating a cut provided its
RHS is fractional. We also note that Z row , i.e. row 0, can be used as a source row because
Z happens to be integer in this example.
We select x2 row (row 1) and add,
- (7/22)s1 – (1/22)s2 + s3 = - 1/2
BV
Z
x2
x1
s3
Z
1
0
0
0
x1
0
0
1
0
x2
0
1
0
0
s1
63/22
7/22
-1/22
-7/22
s2
31/22
1/22
3/22
-1/22
cut I
s3
0
0
0
1
RHS
66 1/2
3 1/2
4 1/2
-1/2
Tableau is optimal but infeasible. We apply the dual simplex method. Solution becomes,
BV
Z
x2
x1
s1
Z
1
0
0
0
x1
0
0
1
0
x2
0
1
0
0
s1
0
0
0
1
s2
1
0
1/7
1/7
s3
9
1
-1/7
-22/7
RHS
62
3
4 4/7
1 4/7
Still non integer x1 and s1.
Arbitrarily select row 2 (x1 row) for the next cut.
x1 + (0 + 1/7)s2 + (-1 + 6/7)s3 = 4 + 4/7
The cut is,
- (1/7)s2 – (6/7)s3 + 4/7 ≤ 0
We add constraint,
BV
Z
x2
x1
s1
s4
Z
1
0
0
0
0
cut II
- (1/7)s2 – (6/7)s3 + s4 = - 4/7 s4 ≥ 0
x1
0
0
1
0
0
x2
0
1
0
0
0
s1
0
0
0
1
0
s2
1
0
1/7
1/7
-1/7
s3
9
1
-1/7
-22/7
-6/7
s4
0
0
0
0
1
RHS
62
3
4 4/7
1 4/7
- 4/7
s1
0
0
0
1
0
s2
0
0
0
0
1
s3
3
1
-1
-4
6
s4
7
0
1
1
-7
RHS
58
3
4
1
4
Applying dual simplex method,
BV
Z
x2
x1
s1
s2
Z
1
0
0
0
0
x1
0
0
1
0
0
x2
0
1
0
0
0
We have optimum solution with
x1 = 4, x2 = 3, and Z = 58.
18
Download