LN8-Constrained opti..

advertisement
Lecture Notes #8
CONSTRAINED OPTIMIZATION
1. Effects of a Constraint
The primary purpose of imposing a constraint is to recognize the limiting factors present in the optimization
problems. In constraint optimization the goal is to find the optimum values for the objective function 𝑦 = 𝑓(𝑥1 , 𝑥2 ) ,
that is find the value for 𝑥1 and 𝑥2 that yields the extremum value for 𝑦, subject to constraint 𝑔(𝑥1 , 𝑥2 ) = 𝑐.
1.1. Total-Differential Approach
First find the total-differential of 𝑦 = 𝑓(𝑥1 , 𝑥2 ) and set it equal to zero.
𝑑𝑦 =
𝜕𝑦
𝜕𝑦
𝑑𝑥1 +
𝑑𝑥
𝜕𝑥1
𝜕𝑥2 2
𝑑𝑦 = 𝑓1 𝑑𝑥1 + 𝑓2 𝑑𝑥2 = 0
Form which we obtain
𝑑𝑥2
𝑓1
=−
𝑑𝑥1
𝑓2
Now find the total differential of the constraint function,
𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 = 𝑑𝑐 = 0
(Note that since c is a constant, dc = 0)
which provides us
𝑑𝑥2
𝑔1
=−
𝑑𝑥1
𝑔2
Thus,
𝑓1 𝑔1
=
𝑓2 𝑔2
Rearranging this relation and using the constraint expression we get two equations which we can solve for the
optimum values of 𝑥1 and 𝑥2 .
𝑓1 𝑔2 − 𝑓2 𝑔1 = 0
𝑔(𝑥1 , 𝑥2 ) − 𝑐 = 0
Example
In the utility function 𝑈 = 𝑥1 𝑥2 + 2𝑥1 , find the values, quantities, or goods 𝑥1 and 𝑥2 such that utility 𝑈 is maximized,
subject to the budget constraint 𝑃1 𝑥1 + 𝑃2 𝑥2 = 𝐵. Let 𝑃1 = $4, 𝑃2 = $2, and 𝐵 = $60.
Maximize 𝑈 = 𝑥1 𝑥2 + 2𝑥1 subject to 4𝑥1 + 2𝑥2 = 60
First, the total differential of U:
LN 8—Constrained Optimization
Page 1 of 38
𝑑𝑈 = (𝑥2 + 2)𝑑𝑥1 + 𝑥1 𝑑𝑥2 = 0
𝑑𝑥2
𝑥2 + 2
=−
𝑑𝑥1
𝑥1
Next, the total differential of the budget constraint:
4𝑑𝑥1 + 2𝑑𝑥2 = 0
𝑑𝑥2
4
=−
𝑑𝑥1
2
Setting the write-hand-side quotients equal,
𝑥2 + 2
=2
𝑥1
Rearranging the above, and along with the budget constraint, we have
2𝑥1 − 𝑥2 = 2
4𝑥1 + 2𝑥2 = 60
which yields,
𝑥1 = 8, 𝑥2 = 14
Substituting these values in the utility function, we obtain
𝑈 = (8)(14) + 2(8) = 128
1.2.
The Lagrange-Multiplier Method of Optimization
In general, given an objective function
𝑦 = 𝑓(𝑥1 , 𝑥2 )
subject to the constraint
𝑔(𝑥1 , 𝑥2 ) = 𝑐
The objective function and the constraint are combined in a format called the 𝐿𝑎𝑔𝑟𝑎𝑛𝑔𝑖𝑎𝑛 function as
𝑍 = 𝑓(𝑥1 , 𝑥2 ) + 𝜆[𝑐 − 𝑔(𝑥1 , 𝑥2 )]
To find the stationary values of Z, the first order condition is
𝑍𝜆 = 𝑐 − 𝑔(𝑥1 , 𝑥2 ) = 0
𝑍1 = 𝑓1 − 𝜆𝑔1 = 0
𝑍2 = 𝑓2 − 𝜆𝑔2 = 0
The first order conditions yield three equations with three unknowns, which include the original unknowns 𝑥1 and 𝑥2
plus the multiplier 𝜆.
LN 8—Constrained Optimization
Page 2 of 38
Now, using the above example, we can incorporate the constraint in the objective function as follows:
𝑍 = (𝑥1 𝑥2 + 2𝑥1 ) + 𝜆(60 − 4𝑥1 − 2𝑥2 )
The term λ (𝑙𝑎𝑚𝑏𝑑𝑎) above is known as the Lagrange Multiplier. Taking the partial derivative of the
𝐿𝑎𝑔𝑟𝑎𝑛𝑔𝑖𝑎𝑛 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛 with respect to 𝑥1 , 𝑥2 , and λ and setting them equal to zero, we have
𝜕𝑧
= 60 − 4𝑥1 − 2𝑥2 = 0
𝜕𝜆
𝜕𝑧
= 𝑥2 + 2 − 4𝜆 = 0
𝜕𝑥1
𝜕𝑧
= 𝑥1 − 2𝜆 = 0
𝜕𝑥2
−4
[ 0
1
−2
1
0
0 𝑥1
−60
−4] [𝑥2 ] = [ −2]
−2 𝜆
0
→
𝑥1
8
[𝑥2 ] = [14]
𝜆
4
Note that using the Lagrange Multiplier “trick” the term 𝜆(60 − 4𝑥1 − 2𝑥2 ) in the Lagrangian function allows us to
determine the values of 𝑥1 and 𝑥2 such the budget constraint is exhausted (via 𝜕𝑍 ⁄𝜕𝜆 = 0) and Z reaches the
extremum (here maximum) via 𝜕𝑍 ⁄𝜕𝑥1 = 𝜕𝑍 ⁄𝑥2 = 0. In other word, finding the values for 𝑥1 and 𝑥2 such that when
the budget constraint is exhausted, 𝑍 = 𝑈 = 𝑥1 𝑥2 + 2𝑥1 . Thus, the same values of 𝑥1 and 𝑥2 give us extremum (her the
maximum) for 𝑈.
𝑍 = (𝑥1 𝑥2 + 2𝑥1 ) + 𝜆(60 − 4𝑥1 − 2𝑥2 )
𝑍 = 8 × 14 + 2 × 8 + 4(60 − 4 × 8 − 2 × 14) = 268 + 4(0) = 268
Example
Find the stationary values of
𝑦 = 𝑥1 𝑥2
subject to
𝑥1 + 𝑥2 = 6
The Lagrangian function is
𝑍 = 𝑥1 𝑥2 + 𝜆(6 − 𝑥1 − 𝑥2 )
𝑍𝜆 = 6 − 𝑥1 − 𝑥2 = 0
𝑍1 = 𝑥2 − 𝜆 = 0
𝑍2 = 𝑥1 − 𝜆 = 0
0
[−1
−1
1
0
1
1 𝜆
6
1] [𝑥1 ] = [0]
0 𝑥2
0
→
→
→
−𝜆 + 𝑥1 + 𝑥2 = 6
−𝜆 + 𝑥1 + 𝑥2 = 0
−𝜆 + 𝑥1 + 𝑥2 = 0
→
𝜆
3
[𝑥1 ] = [3]
𝑥2
3
LN 8—Constrained Optimization
Page 3 of 38
1.3. An Interpretation of the Lagrange Multiplier
In the Lagrangian function, 𝑍 = 𝑓(𝑥1 , 𝑥2 ) + 𝜆[𝑐 − 𝑔(𝑥1 , 𝑥2 )], λ shows the responsiveness of optimum value of Z, and
hence the objective function, to changes in the constraint parameter c. 1 A change in the constraint would cause a shift
in the constraint function, thus altering the optimal solution.
Now, using the chain rule find 𝑑𝑍 ⁄𝑑𝑐 from
𝑍 = 𝑓(𝑥1 , 𝑥2 ) + 𝜆[𝑐 − 𝑔(𝑥1 , 𝑥2 )]
𝑑𝑍
𝑑𝑥1
𝑑𝑥2
𝑑𝜆
𝑑𝑥1
𝑑𝑥2
= 𝑓1
+ 𝑓2
+ [𝑐 − 𝑔(𝑥1 , 𝑥2 )]
+ 𝜆 (1 − 𝑔1
− 𝑔2
)
𝑑𝑐
𝑑𝑐
𝑑𝑐
𝑑𝑐
𝑑𝑐
𝑑𝑐
𝑑𝑍
𝑑𝑥1
𝑑𝑥2
𝑑𝜆
= (𝑓1 − 𝜆𝑔1 )
+ (𝑓2 − 𝜆𝑔2 )
+ [𝑐 − 𝑔(𝑥1 , 𝑥2 )]
+𝜆
𝑑𝑐
𝑑𝑐
𝑑𝑐
𝑑𝑐
Given the FOC condition,
𝑐 − 𝑔(𝑥1 , 𝑥2 ) = 0
𝑓1 − 𝜆𝑔1 = 0
𝑓2 − 𝜆𝑔2 = 0
then
𝑑𝑍
𝑑
[𝑓(𝑥1 , 𝑥2 )] = 𝜆
=
𝑑𝑐 𝑑𝑐
Thus, if c is the only exogenous variable in the constraint, then λ represents the change in the optimal value of Z, hence
a change in the optimal value of the objective function, per unit change in the constraint.
1.4. n-Variable and 𝐌𝐮𝐥𝐭𝐢𝐜𝐨𝐧𝐬𝐭𝐫𝐚𝐢𝐧𝐭 Cases
Consider the multivariable objective function,
𝑦 = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
subject to the constraint
𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 𝑐
The Lagrangian function is
𝑍 = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) + 𝜆[𝑐 − 𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 )]
1
For values of 𝑥1 and 𝑥2 such that the constraint holds, 𝑔(𝑥1 , 𝑥2 ) − 𝑐 = 0. Therefore, the functions 𝑍 and 𝑓(𝑥1 , 𝑥2 ) have the same
values at the optimum values of 𝑥1 and 𝑥2 . Maximizing the objective function 𝑈 = 𝑥1 𝑥2 + 2𝑥1 subject to the constraint 4𝑥1 +
2𝑥2 = 60, by using the Lagrangian method,
𝑍 = (𝑥1 𝑥2 + 2𝑥1 ) + 𝜆(60 − 4𝑥1 − 2𝑥2 )
we found the optimum values, 𝑥1∗ = 8 and 𝑥2∗ = 14, which yields 60 − 4 × 8 − 2 × 14 = 0. Thus, 𝑍 = 𝑥1 𝑥2 + 2𝑥1 = 𝑈
In the context of utility maximization, the objective function is 𝑈 = 𝑓(𝑥1 , 𝑥2 ) and 𝑐 is the budget or income. Thus, the
responsiveness of 𝑈 to changes in 𝑐, 𝜆 = 𝑑𝑈 ⁄𝑑𝑐, represents the marginal utility of income. This is discussed further below.
LN 8—Constrained Optimization
Page 4 of 38
The FOC consists of 𝑛 + 1 simultaneous equations,
𝑍𝜆 = 𝑐 − 𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 0
𝑍1 = 𝑓1 − 𝜆𝑔1 = 0
𝑍2 = 𝑓2 − 𝜆𝑔2 = 0
⋮= 𝑓𝑛 − 𝜆𝑔𝑛 = 0
𝑍𝑛 = 𝑓𝑛 − 𝜆𝑔𝑛 = 0
We can also have an objective function which may be subject to more than one constraint.
𝑦 = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
subject to constraints,
𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 𝑐
ℎ(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 𝑑
The Lagrangian function is then,
𝑍 = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) + 𝜆[𝑐 − 𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 )] + µ[𝑑 − 𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 )]
The FOC in this situation is
𝑍𝜆 = 𝑐 − 𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 0
𝑍µ = 𝑑 − ℎ(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 0
(𝑖 = 1,2, … , 𝑛)
𝑍𝑖 = 𝑓𝑖 − 𝜆𝑔𝑖 = 0
2. Second-Order Condition for Constrained Optimization
2.1. Review of Unconstrained Second-Order Condition
Let’s start with the objective function 𝒚 = 𝒇(𝒙𝟏 , 𝒙𝟐 ). To explain how the addition of the constraint changes the SOC, it
is instructive to review the unconstrained SOC first. First find the first total differential of y.
𝑑𝑦 = 𝑓1 𝑑𝑥1 + 𝑓2 𝑑𝑥2
To get to the SOC, find the second order differential,
𝑑 2 𝑦 = 𝑑(𝑑𝑦) =
𝑑2𝑦 =
𝜕(𝑑𝑦)
𝜕(𝑑𝑦)
𝑑𝑥1 +
𝑑𝑥2
𝜕𝑥1
𝜕𝑥2
𝜕
𝜕
(𝑓1 𝑑𝑥1 + 𝑓2 𝑑𝑥2 )𝑑𝑥1 +
(𝑓 𝑑𝑥 + 𝑓2 𝑑𝑥2 )𝑑𝑥2
𝜕𝑥1
𝜕𝑥2 1 1
𝑑 2 𝑦 = (𝑓11 𝑑𝑥1 + 𝑓12 𝑑𝑥2 )𝑑𝑥1 + (𝑓21 𝑑𝑥1 + 𝑓22 𝑑𝑥2 )𝑑𝑥2
𝑑 2 𝑦 = 𝑓11 𝑑𝑥12 + 2𝑓12 𝑑𝑥1 𝑑𝑥2 + 𝑓22 𝑑𝑥22
This quadratic equation, with variables 𝑑𝑥1 and 𝑑𝑥2 , can be written in matrix format as follows.
LN 8—Constrained Optimization
Page 5 of 38
𝑑 2 𝑦 = [𝑑𝑥1
𝑓
𝑑𝑥2 ] [ 11
𝑓21
𝑓12 𝑑𝑥1
][
]
𝑓22 𝑑𝑥2
The determinant of the 2 × 2 matrix is the Hessian determinant
𝑓
𝑓
|𝐻| = | 11 12 |
𝑓21 𝑓22
𝑑 2 𝑦 is positive definite:
|𝑓11 | = 𝑓11 > 0
and
|
𝑓11
𝑓21
𝑓12
2
| = 𝑓11 𝑓22 − 𝑓12
> 0,
𝑓22
or
2
𝑓11 𝑓22 > 𝑓12
𝑑 2 𝑦 is negative definite:
|𝑓𝑥𝑥 | = 𝑓𝑥𝑥 < 0
and
|
𝑓11
𝑓21
𝑓12
2
| = 𝑓11 𝑓22 − 𝑓12
> 0,
𝑓22
or
2
𝑓11 𝑓22 > 𝑓12
2.2. Constrained Second-Order Differential
Note that to obtain the quadratic form in the unconstrained optimization the quantities 𝑑𝑥1 and 𝑑𝑥2 constitute two
arbitrary and independent changes. Now with the addition of the constraint 𝑔(𝑥1 , 𝑥2 ) = 𝑐 to the picture, 𝑑𝑥1 and 𝑑𝑥2
are bound to the constraint. If we take, say, 𝑑𝑥1 as an arbitrary change, 𝑑𝑥2 is dependent on 𝑑𝑥1 , as indicated by the
following
𝑔(𝑥1 , 𝑥2 ) = 𝑐
𝑑𝑔 = 𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 = 0
𝑑𝑥2 = −
(𝑑𝑐 = 0)
𝑔1
𝑑𝑥
𝑔2 1
Thus, in the presence of constraint, to find the second order differential 𝑑 2 𝑦, we must treat 𝑑𝑥2 as a variable (if we
choose to treat 𝑑𝑥1 as an arbitrary change) dependent on 𝑥1 and 𝑥2 . Taken these factors into account, let’s now attend
to 𝑑 2 𝑦. Again, we will start from the beginning,
𝑑𝑦 = 𝑓1 𝑑𝑥1 + 𝑓2 𝑑𝑥2
𝑑 2 𝑦 = 𝑑(𝑑𝑦) =
𝑑2𝑦 =
𝜕(𝑑𝑦)
𝜕(𝑑𝑦)
𝑑𝑥1 +
𝑑𝑥2
𝜕𝑥1
𝜕𝑥2
𝜕
𝜕
(𝑓1 𝑑𝑥1 + 𝑓2 𝑑𝑥2 )𝑑𝑥1 +
(𝑓 𝑑𝑥 + 𝑓2 𝑑𝑥2 )𝑑𝑥2
𝜕𝑥1
𝜕𝑥2 1 1
(𝐴)
From here on the complications from the presence of the constraint are introduced—where 𝑑𝑥2 is treated as a
dependent variable. Consider the first term of the right-hand-side of second order total differential (A).
𝜕
𝜕
𝜕
(𝑓1 𝑑𝑥1 + 𝑓2 𝑑𝑥2 )𝑑𝑥1 =
(𝑓1 𝑑𝑥1 )𝑑𝑥1 +
(𝑓 𝑑𝑥 )𝑑𝑥1
𝜕𝑥1
𝜕𝑥1
𝜕𝑥1 2 2
The first term on the right-hand-side of this expression is
𝜕
(𝑓 𝑑𝑥 )𝑑𝑥1 = 𝑓11 𝑑𝑥12
𝜕𝑥1 1 1
The second term, however, must be treated differently because 𝑑𝑥2 is no longer treated as an arbitrary change. It is
now a function of, dependent on, 𝑥1 and 𝑥2 due to the constraint.
LN 8—Constrained Optimization
Page 6 of 38
𝜕
𝜕𝑑𝑥2
(𝑓 𝑑𝑥 )𝑑𝑥1 = (𝑓21 𝑑𝑥2 + 𝑓2
) 𝑑𝑥1
𝜕𝑥1 2 2
𝜕𝑥1
Thus, the first term in (A) is written as
𝜕
𝜕𝑑𝑥2
(𝑓 𝑑𝑥 + 𝑓2 𝑑𝑥2 )𝑑𝑥1 = 𝑓11 𝑑𝑥12 + (𝑓21 𝑑𝑥2 + 𝑓2
) 𝑑𝑥1
𝜕𝑥1 1 1
𝜕𝑥1
In a similar fashion, the second term of (A) is written as
𝜕
𝜕𝑑𝑥2
(𝑓 𝑑𝑥 + 𝑓2 𝑑𝑥2 )𝑑𝑥2 = (𝑓12 𝑑𝑥1 + 𝑓2
) 𝑑𝑥2 + 𝑓22 𝑑𝑥22
𝜕𝑥2 1 1
𝜕𝑥2
Thus, the second differential can be written as,
𝜕𝑑𝑥2
𝜕𝑑𝑥2
𝑑 2 𝑦 = 𝑓11 𝑑𝑥12 + 2𝑓12 𝑑𝑥1 𝑑𝑥2 + 𝑓22 𝑑𝑥22 + +𝑓2 (
𝑑𝑥1 +
𝑑𝑥2 )
𝜕𝑥1
𝜕𝑥2
Note that the parenthetical expression in the second order differential can be simplified to
𝜕𝑑𝑥2
𝜕𝑑𝑥2
𝑑𝑥1 +
𝑑𝑥2 = 𝑑(𝑑𝑥2 ) = 𝑑 2 𝑥2
𝜕𝑥1
𝜕𝑥2
Thus,
𝑑 2 𝑦 = 𝑓11 𝑑𝑥12 + 2𝑓12 𝑑𝑥1 𝑑𝑥2 + 𝑓22 𝑑𝑥22 + 𝑓2 𝑑 2 𝑥2
The right-hand-side of the above expression is not a quadratic form. The presence of the constraint no longer
provides for a quadratic form as the second order differential of the objective function. However, we can use the
constraint function itself to transform 𝑑 2 𝑦 into a quadratic form.
Using
𝑑𝑔 = 𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 = 0
Going through the same procedure as for 𝑑 2 𝑦, we find that
𝑑 2 𝑔 = 𝑑(𝑑𝑔) =
𝑑2𝑔 =
𝜕(𝑑𝑔)
𝜕(𝑑𝑔)
𝑑𝑥1 +
𝑑𝑥2
𝜕𝑥1
𝜕𝑥2
𝜕
𝜕
(𝑔 𝑑𝑥 + 𝑔2 𝑑𝑥2 )𝑑𝑥1 +
(𝑔 𝑑𝑥 + 𝑔2 𝑑𝑥2 )𝑑𝑥2
𝜕𝑥1 1 1
𝜕𝑥2 1 1
Consider the first term on the RHS:
𝜕
𝜕
𝜕
(𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 )𝑑𝑥1 =
(𝑔1 𝑑𝑥1 )𝑑𝑥1 +
(𝑔 𝑑𝑥 )𝑑𝑥1
𝜕𝑥1
𝜕𝑥1
𝜕𝑥1 2 2
𝜕
𝜕𝑑𝑥2
(𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 )𝑑𝑥1 = 𝑔11 𝑑𝑥12 + (𝑔21 𝑑𝑥2 + 𝑔2
) 𝑑𝑥1
𝜕𝑥1
𝜕𝑥1
Now the second term:
LN 8—Constrained Optimization
Page 7 of 38
𝜕
𝜕𝑑𝑥2
(𝑔 𝑑𝑥 + 𝑔2 𝑑𝑥2 )𝑑𝑥2 = (𝑔12 𝑑𝑥1 + 𝑔2
) 𝑑𝑥2 + 𝑔22 𝑑𝑥22
𝜕𝑥2 1 1
𝜕𝑥2
Thus, the second differential 𝑑 2 𝑔 can be written as,
𝜕𝑑𝑥2
𝜕𝑑𝑥2
𝑑 2 𝑔 = 𝑔11 𝑑𝑥12 + 2𝑔12 𝑑𝑥1 𝑑𝑥2 + 𝑔22 𝑑𝑥22 + +𝑔2 (
𝑑𝑥1 +
𝑑𝑥2 )
𝜕𝑥1
𝜕𝑥2
Note that the parenthetical expression in the second order differential can be simplified to
𝜕𝑑𝑥2
𝜕𝑑𝑥2
𝑑𝑥1 +
𝑑𝑥2 = 𝑑(𝑑𝑥2 ) = 𝑑 2 𝑥2
𝜕𝑥1
𝜕𝑥2
Therefore,
𝑑 2 𝑔 = 𝑔11 𝑑𝑥12 + 2𝑔12 𝑑𝑥1 𝑑𝑥2 + 𝑔22 𝑑𝑥22 + 𝑔2 𝑑 2 𝑥2 = 0
Solving for 𝑑 2 𝑥2 ,
𝑑 2 𝑥2 = −
𝑔11 𝑑𝑥12 + 2𝑔12 𝑑𝑥1 𝑑𝑥2 + 𝑔22 𝑑𝑥22
𝑔2
and multiplying both sides by 𝑓2 ,
𝑓2 𝑑 2 𝑥2 = −𝑓2
𝑔11 𝑑𝑥12 + 2𝑔12 𝑑𝑥1 𝑑𝑥2 + 𝑔22 𝑑𝑥22
𝑔2
and then substituting in 𝑑 2 𝑦, we have,
𝑓2
𝑓2
𝑓2
𝑔11 𝑑𝑥12 − 2 𝑔12 𝑑𝑥1 𝑑𝑥2 − 𝑔22 𝑑𝑥22
𝑔2
𝑔2
𝑔2
𝑓2
𝑓2
𝑓2
2
2
𝑑 𝑦 = (𝑓11 − 𝑔11 ) 𝑑𝑥1 + 2 (𝑓12 − 𝑔12 ) 𝑑𝑥1 𝑑𝑥2 + (𝑓22 − 𝑔22 ) 𝑑𝑥22
𝑔2
𝑔2
𝑔2
𝑑 2 𝑦 = 𝑓11 𝑑𝑥12 + 2𝑓12 𝑑𝑥1 𝑑𝑥2 + 𝑓22 𝑑𝑥22 −
Note that from the FOC for the Lagrangian function we have,
𝑍2 = 𝑓2 − 𝜆𝑔2 = 0
𝑓2
=𝜆
𝑔2
Thus,
𝑑 2 𝑦 = (𝑓11 − 𝜆𝑔11 )𝑑𝑥12 + 2(𝑓12 − 𝜆𝑔12 )𝑑𝑥1 𝑑𝑥2 + (𝑓22 − 𝜆𝑔22 )𝑑𝑥22
Also finding the second derivatives from the FOC,
𝑍11 = 𝑓11 − 𝜆𝑔11
𝑍12 = 𝑓12 − 𝜆𝑔12
𝑍22 = 𝑓22 − 𝜆𝑔2
we finally have,
𝑑 2 𝑦 = 𝑍11 𝑑𝑥12 + 2𝑍12 𝑑𝑥1 𝑑𝑥2 + 𝑍22 𝑑𝑥22
LN 8—Constrained Optimization
Page 8 of 38
which now gives us the quadratic form.
2.3. The “Bordered” Hessian Determinant for SOC
The quadratic form above allows us to determine the SOC.
𝑑 2 𝑦 = 𝑍11 𝑑𝑥12 + 2𝑍12 𝑑𝑥1 𝑑𝑥2 + 𝑍22 𝑑𝑥22
For the optimum solutions of 𝑥1 and 𝑥1 to be a maximum, 𝑑 2 𝑦 < 0, and for a minimum 𝑑 2 𝑦 > 0.
To determine the negative or positive definiteness of the quadratic form, we form the “bordered” Hessian
determinant. This is explained next.
Using the first differential of the constraint function,
𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 = 0
solving for 𝑑𝑥2
𝑑𝑥2 = −
𝑔1
𝑑𝑥
𝑔2 1
and substituting in the quadratic form, we have
2
𝑔1
𝑔1
𝑑 2 𝑦 = 𝑍11 𝑑𝑥12 − 2𝑍12 𝑑𝑥1 ( 𝑑𝑥1 ) + 𝑍22 (− 𝑑𝑥1 )
𝑔2
𝑔2
2
𝑔
𝑔
1
1
𝑑 2 𝑦 = 𝑍11 𝑑𝑥12 − 2𝑍12 𝑑𝑥12 + 𝑍22 2 𝑑𝑥12
𝑔2
𝑔2
Factoring out 𝑑𝑥12 ⁄𝑔22 ,
𝑑 2 𝑦 = (𝑍11 𝑔22 − 2𝑍12 𝑔1 𝑔2 + 𝑍22 𝑔12 )
𝑑𝑥12
𝑔22
The SOC requires that the quadratic form in the parenthetical expression to be negative definite for a maximum and
positive definite for a minimum.
Take the three equations from the first order condition and rewrite them in the implicit function format, where 𝜆, 𝑥1 ,
and 𝑥2 are the endogenous variables and c is the exogenous variable.
𝜕𝑍
= 𝑐 − 𝑔(𝑥1 , 𝑥2 ) = 0
𝜕𝜆
𝜕𝑍
= 𝑓1 − 𝜆𝑔1 = 0
𝜕𝑥1
𝜕𝑍
= 𝑓2 − 𝜆𝑔2 = 0
𝜕𝑥2
→
𝐹 𝜆 (𝜆, 𝑥1 , 𝑥2 ; 𝑐) = 0
→
𝐹1 (𝜆, 𝑥1 , 𝑥2 ; 𝑐) = 0
→
𝐹 2 (𝜆, 𝑥1 , 𝑥2 ; 𝑐) = 0
To form the Jacobian determinant
LN 8—Constrained Optimization
Page 9 of 38
𝜕𝐹 𝜆
| 𝜕𝜆
𝜕𝐹1
|𝐽| =
𝜕𝜆
|𝜕𝐹 2
𝜕𝜆
𝜕𝐹 𝜆
𝜕𝑥1
𝜕𝐹1
𝜕𝑥1
𝜕𝐹 2
𝜕𝑥1
𝜕𝐹 𝜆
𝜕𝑥2 |
𝜕𝐹1
𝜕𝑥2
𝜕𝐹 2 |
𝜕𝑥2
0
|𝐽| = |−𝑔1
−𝑔2
−𝑔1
𝑓11 − 𝜆𝑔11
𝑓21 − 𝜆𝑔21
0
|𝐽| = |−𝑔1
−𝑔2
−𝑔1
𝑍11
𝑍21
−𝑔2
𝑓12 − 𝜆𝑔12 |
𝑓22 − 𝜆𝑔22
−𝑔2
𝑍12 |
𝑍22
|𝐽| = −𝑍11 𝑔22 + 2𝑍12 𝑔1 𝑔2 − 𝑍22 𝑔12 = −(𝑍11 𝑔22 − 2𝑍12 𝑔1 𝑔2 + 𝑍22 𝑔12 )
The Jacobian determinant is the negative of the quadratic form in the parenthetical expression on the RHS of 𝑑 2 𝑦
above, shown again here.
𝑑 2 𝑦 = (𝑍11 𝑔22 − 2𝑍12 𝑔1 𝑔2 + 𝑍22 𝑔12 )
𝑑𝑥12
𝑔22
The determinant expression of (𝑍11 𝑔22 − 2𝑍12 𝑔1 𝑔2 + 𝑍22 𝑔12 ) is called a bordered Hessian and is denoted by |𝐻|.
0
|𝐻| = |𝑔1
𝑔2
𝑔1
𝑍11
𝑍21
𝑔2
𝑍12 |
𝑍22
Thus,
positive definite
𝑑 2 𝑦 is {
} subject to
negative definite
𝑔1 𝑑𝑥1 + 𝑔2 𝑑𝑥2 = 0
iff
0
|𝐻| = |𝑔1
𝑔2
𝑔1
𝑍11
𝑍21
𝑔2
𝑍12 | {< 0}
>0
𝑍22
There is no mistake in typing above! For 𝑑 2 𝑦 to be positive definite, |𝐻| < 0. This is because the bordered Hessian is
formed by using the negative of the quadratic form.
Example
Use the Lagrangian-function method to find the optimum values 𝑥1 and 𝑥2 for the utility function
𝑈 = 𝑥1 𝑥2 + 2𝑥1
subject to the budget constraint 4𝑥1 + 2𝑥2 = 60. Develop the bordered Hessian determinant to determine if the
optimum values maximize the utility function.
𝑍 = 𝑥1 𝑥2 + 2𝑥1 + 𝜆(60 − 4𝑥1 − 2𝑥2 )
𝑍𝜆 = 60 − 4𝑥1 − 2𝑥2 = 0
𝑍1 = 𝑥2 + 2 − 4𝜆 = 0
𝑍2 = 𝑥1 − 2𝜆 = 0
0
[−4
−2
4
0
1
2 𝜆
60
1] [𝑥1 ] = [−2]
0 𝑥2
0
LN 8—Constrained Optimization
→
𝜆
4
[𝑥1 ] = [ 8]
𝑥2
14
Page 10 of 38
𝑔1 = 4
𝑍11 = 0
0
|𝐻| = |4
2
𝑔2 = 2
𝑍12 = 1 = 𝑍21
𝑍22 = 0
4 2
0 1| = 16 > 0
1 0
The optimum values maximize U.
Example
Use the Lagrangian-function method to find the optimum values 𝑥1 and 𝑥2 for the function 𝑦 = 4𝑥12 + 6𝑥22
subject to the constraint 2𝑥1 + 3𝑥2 = 10. Develop the bordered Hessian determinant to determine if the optimum
values yield a maximum or a minimum.
𝑍 = 4𝑥12 + 6𝑥22 + 𝜆(10 − 2𝑥1 − 3𝑥2 )
𝑍𝜆 = 10 − 2𝑥1 − 3𝑥2 = 0
𝑍1 = 8𝑥1 − 2𝜆 = 0
𝑍2 = 12𝑥2 − 3𝜆 = 0
0
[−2
−3
2
8
0
3 𝜆
10
0] [𝑥1 ] = [ 0]
12 𝑥2
0
𝑔1 = 2
𝑍11 = 8
0
|𝐻| = |𝑔1
𝑔2
→
𝑔2 = 3
𝑍12 = 0 = 𝑍21
𝑔1
𝑍11
𝑍21
𝑔2
0
𝑍12 | = |2
𝑍22
3
2
8
0
𝜆
8
[𝑥1 ] = [2]
𝑥2
2
𝑍22 = 12
3
0| = −120 < 0
12
The optimum values yield a minimum.
Example
Let the function
𝑈(𝑥1 𝑥2 ) = 𝑥1 𝑥2
represent a simple two-period utility function where 𝑥1 and 𝑥2 are consumption in period 1 and consumption in
period 2, respectively. The consumer’s budget constraint at the beginning of period 1 is
𝑥1 +
𝑥2
=𝐵
1+𝑟
The consumer is allowed to borrow or lend across the periods at the market interest rate of r. Note that the value of
expenditure in period 2 is discounted using r.
The Lagrangian function is then
𝑍 = 𝑥1 𝑥2 + 𝜆 (𝐵 − 𝑥1 −
𝑥2
)
1+𝑟
providing the following FOC:
LN 8—Constrained Optimization
Page 11 of 38
𝑍𝜆 = 𝐵 − 𝑥1 −
𝑥2
=0
1+𝑟
𝑍1 = 𝑥2 − 𝜆 = 0
𝑍2 = 𝑥1 −
𝜆
=0
1+𝑟
We can find the solutions for λ, 𝑥1 , and 𝑥2 through substitution. From the second equation,
𝑥2 = 𝜆
Substituting in the third equation,
𝑥1 =
𝑥2
1+𝑟
and then substituting for 𝑥1 in the budget constraint,
𝑥2
𝑥2
+
=𝐵
1+𝑟 1+𝑟
2𝑥2 = 𝐵(1 + 𝑟)
The solutions are then,
𝜆 = 𝑥2 =
𝐵(1 + 𝑟)
𝐵
, 𝑥1 =
2
2
Now form the bordered Hessian,
0
|𝐻| = |𝑔1
𝑔2
𝑔1
𝑍11
𝑍21
0
𝑔2
|
𝑍12 | = 1
|
1
𝑍22
1+𝑟
1
0
1
1
1+𝑟
1 || = (−1) (−
1
1
2
)+
=
>0
1+𝑟
1+𝑟 1+𝑟
0
Since |𝐻| > 0, then the SOC is satisfied for a maximum U.
2.4. n-Variable Constrained Optimization
Here the objective function contains more than two variables, subject to a constraint with the same number of
variables.
subject to
𝑦 = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 𝑐
The bordered Hessian is,
0
𝑔1
|𝐻| = || 𝑔2
⋮
𝑔𝑛
𝑔1
𝑍11
𝑍21
⋮
𝑍𝑛1
𝑔2
𝑍12
𝑍22
⋮
𝑍𝑛2
⋯ 𝑔𝑛
⋯ 𝑍1𝑛
⋯ 𝑍2𝑛 |
|
⋮
⋮
⋯ 𝑍𝑛𝑛
LN 8—Constrained Optimization
Page 12 of 38
The leading principal minors of |𝐻| are defined as
0
𝑔
|𝐻2 | ≡ | 1
𝑔2
𝑔1
𝑍11
𝑍21
𝑔2
𝑍12 |
𝑍22
0
𝑔1
|𝐻3 | ≡ |
𝑔2
𝑔3
𝑔1
𝑍11
𝑍21
𝑍31
𝑔2
𝑍12
𝑍22
𝑍32
𝑔3
𝑍13
|
𝑍23
𝑍33
⋯
|𝐻𝑛 | ≡ |𝐻|
The determinantal test for relative constrained extremum with
𝑍 = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) + 𝜆[𝑐 − 𝑔(𝑥1 , 𝑥2 , … , 𝑥𝑛 )]
is
Condition
First-order condition
Second-order condition
Maximum
𝑍1 = 𝑍2 = ⋯ = 𝑍𝑛 = 0
|𝐻2 | > 0; |𝐻3 | < 0; |𝐻4 | > 0; ⋯ ;
(−1)𝑛 |𝐻4 | > 0
Minimum
𝑍1 = 𝑍2 = ⋯ = 𝑍𝑛 = 0
|𝐻2 |, |𝐻3 |, ⋯ , |𝐻4 | < 0
3. Utility Maximization and Consumer Demand
Maximize
𝑈 = 𝑈(𝑥1 , 𝑥2 )
subject to the budget constraint
𝐵 = 𝑥1 𝑃1 + 𝑥2 𝑃2
3.1. First-Order Condition
Form the Lagrangian function
𝑍 = 𝑈(𝑥1 , 𝑥2 ) + 𝜆(𝐵 − 𝑥1 𝑃1 − 𝑥2 𝑃2 )
and obtain the set of simultaneous equations
𝑍𝜆 = 𝐵 − 𝑥1 𝑃1 − 𝑥2 𝑃2 = 0
𝑍1 = 𝑈1 − 𝜆𝑃1 = 0
𝑍2 = 𝑈2 − 𝜆𝑃2 = 0
From the last two equations we have the familiar utility maximization principle.
𝑈1 𝑈2
=
=𝜆
𝑃1 𝑃2
This implies that the limited budget be allocated between the two goods such that marginal utility per dollar of 𝑥1 be
equal to the marginal utility per dollar of 𝑥2 . Furthermore the ratio of marginal utility to price for the two goods be
equal to λ. It was shown that λ shows the sensitivity of the optimum value of the objective function with respect to
changes in the constraint parameter.
LN 8—Constrained Optimization
Page 13 of 38
𝜕𝑈
=𝜆
𝜕𝐵
Here we can interpret λ as the marginal utility of the budget money.
We can write the utility maximization condition 𝑈1 ⁄𝑃1 = 𝑈2 ⁄𝑃2 as
𝑈1 𝑃1
=
𝑈2 𝑃2
To interpret this last expression of the relationship between marginal utilities and prices, let’s consider the concept of
the indifference curve.
3.1.1. The Indifference Curve Analysis
The indifference curve represents the locus of various combinations of the quantities of 𝑥1 and 𝑥2 which yield the
same level of utility. Mathematically, this constant level of utility is expressed by setting the total differential of the
utility function 𝑈 = 𝑈(𝑥1 , 𝑥2 ) equal to zero.
𝑑𝑈 = 𝑈1 𝑑𝑥1 + 𝑈2 𝑑𝑥2 = 0
from which we obtain,
𝑑𝑥2
𝑈1
=−
𝑑𝑥1
𝑈2
To maintain the same level of utility any decrease (increase) in the quantity of one good must be compensated by an
increase (a decrease) in the quantity of the other good. These substitutions take place at the margin. Thus, the rate at
which the quantity of one good is substituted or exchanged for the quantity of the other, 𝑑𝑥2 ⁄𝑑𝑥1 , is called the
marginal rate of substitution. Along the indifference curve (Figure 1) the marginal rate of substitution, which also
represents the slope at any point on the indifference curve, is equal to the (negative of) the ratio of marginal utilities.
Figure 1
x₂
𝑑𝑥2
𝑈1
=−
𝑑𝑥1
𝑈2
x₂¹
A
B
x₂²
I₁
x₁¹
LN 8—Constrained Optimization
x₁²
x₁
Page 14 of 38
The indifference curve represents a given level of utility, to maintain which the consumer must engage in the
substitution activity. What if the consumer is able to have more of 𝑥2 without giving up any of 𝑥2 , or have more of
both? Clearly, a consumer is not confined to just one level of utility. More of either one or both goods would raise the
level of utility, moving the consumer to a higher indifference curve; a curve further away from the origin. Figure 2
shows the a segment of the consumer’s indifference map consisting of three indifference curves.
Figure 2
x₂
I₃
I₂
I₁
x₁
3.1.1.1.1.
How to Plot Indifference Curves
One way to plot indifference curves using an specific utility function is to use the Cobb-Douglas function format.
𝑢 = 𝑥1𝛼 𝑥21−𝛼
Rewrite the function, using a specified value for the utility level, 𝑢0 .
1
𝑢0 1−𝛼
𝑥2 = ( 𝛼 )
𝑥1
The indifference curves in Figure 2 are drawn for three different values of 𝑢0 and using 𝛼 = 0.50. By assigning
different values of 𝑥1 you can determine the values for 𝑥2 and then plot the indifference curve(s).
3.1.1.2.
Budget Constraint
Utility maximization makes sense only when the consumer encounters a budget constraint. Without the budget
constraint, there would be no optimal solutions for the quantity of the two goods. We can now superimpose the
budget constraint 𝐵 = 𝑥1 𝑃1 + 𝑥2 𝑃2 upon the indifference map. To draw the budget constraint, first we must rearrange
the terms as follows,
𝑥2 =
𝐵 𝑃1
− 𝑥
𝑃2 𝑃2 1
where 𝐵 ⁄𝑃2 is the vertical intercept, and − 𝑃1 ⁄𝑃2 = 𝑑𝑥2 ⁄𝑑𝑥1 is the slope of the budget line, as shown in Figure 3. The
same diagram shows the optimum combination of 𝑥1 and 𝑥2 at point E, where the budget line is tangent to the
indifference curve 𝐼2 , the point at which the slope of the indifference curve 𝐼2 is equal the slope of the budget line
LN 8—Constrained Optimization
Page 15 of 38
𝑑𝑥2
𝑈1
𝑃1
=−
=−
𝑑𝑥1
𝑈2
𝑃2
This is the same conclusion reached by the FOC of the constrained maximization above.
Figure 3
x₂
𝑑𝑥2
𝑈1
𝑃1
=−
=−
𝑑𝑥1
𝑈2
𝑃2
D
x₂*
E
𝑥2 =
𝐵 𝑃1
− 𝑥
𝑃2 𝑃2 1
I₃
F
x₁*
I₂
I₁
x₁
Example
Using the utility function 𝑢 = 𝑥1 𝑥2 + 2𝑥1 and the budget constraint 4𝑥1 + 2𝑥2 = 60 show the optimum quantities of 𝑥1
and 𝑥2 as the point of tangency of the indifference curve and the budget constraint.
To draw the budget line, first rewrite the budget constraints and utility function, respectively, as,
𝑥2 = 30 − 2𝑥1
𝑥2 =
𝑢 − 2𝑥1
𝑥1
The optimum quantities, as determined above, are 𝑥1∗ = 8 and 𝑥2∗ = 14, and 𝑢∗ = 128. We can draw the budget line
128−2𝑥1
𝑥2 = 30 − 2𝑥1 , and using 𝑢 = 128, we can plot the following indifference curve 𝑥2 =
together with the budget
line. The two curves will be tangent at the optimum quantities. (Figure 4).
LN 8—Constrained Optimization
𝑥1
Page 16 of 38
Figure 4
x₂
𝑥2 =
128 − 2𝑥1
𝑥1
14
E
𝑑𝑥2
= −2
𝑑𝑥1
x₂ = 30 − 2x₁
8
x₁
3.2. Second-Order Condition
For the SOC to hold—for the stationary values to be a maximum—the bordered Hessian |𝐻| must be positive.
0
|𝐻| = |𝑔1
𝑔2
𝑔1
𝑍11
𝑍21
𝑔2
0
𝑍12 | = |𝑃1
𝑍22
𝑃2
𝑃1
𝑈11
𝑈21
𝑃2
𝑈12 | = 2𝑃1 𝑃2 𝑈12 − 𝑃22 𝑈11 − 𝑃12 𝑈22 > 0
𝑈22
The presence of 𝑈11 , 𝑈22 , and 𝑈12 in the bordered Hessian determinant indicates that the condition |𝐻| > 0 requires
certain restrictions on the utility function and, hence, on the indifference curves. This requires that the second
derivative of the indifference curve be positive, that is, the indifference curve be strictly convex.
𝑑 2 𝑥2
>0
𝑑𝑥12
How do we obtain an expression for 𝑑 2 𝑥2 ⁄𝑑𝑥12 ? Let’s start with
𝑑𝑥2
𝑈1
=−
𝑑𝑥1
𝑈2
Differentiate the right-hand-side with respect to 𝑥1 .
𝑑 2 𝑥2
𝑑
𝑈1
1
𝑑𝑈1
𝑑𝑈2
2 = 𝑑𝑥 (− 𝑈 ) = − 2 (𝑈2 𝑑𝑥 − 𝑈1 𝑑𝑥 )
𝑑𝑥1
𝑈2
1
2
1
1
The terms
𝑑𝑈1
𝑑𝑥1
and
𝑑𝑈2
𝑑𝑥1
in the right-hand-side parenthesis require further adjustments. Along the indifference curve
the change in marginal utility is affected by the marginal rate of substitution 𝑑𝑥2 ⁄𝑑𝑥1 .
𝑑𝑈1
𝑑𝑥2
𝑃1
= 𝑈11 + 𝑈12
= 𝑈11 − 𝑈12
𝑑𝑥1
𝑑𝑥1
𝑃2
LN 8—Constrained Optimization
Page 17 of 38
𝑑𝑈2
𝑑𝑥2
𝑃1
= 𝑈21 + 𝑈22
= 𝑈21 − 𝑈22
𝑑𝑥1
𝑑𝑥1
𝑃2
Thus,
𝑑 2 𝑥2
1
𝑑𝑈1
𝑑𝑈2
1
𝑃1
𝑃1
= − 2 (𝑈2
− 𝑈1
) = − 2 [𝑈2 (𝑈11 − 𝑈12 ) − 𝑈1 (𝑈21 − 𝑈22 )]
𝑑𝑥1
𝑑𝑥1
𝑃2
𝑃2
𝑑𝑥12
𝑈2
𝑈2
𝑑 2 𝑥2
1
𝑃1
𝑃1
𝑃1
2 = − 2 [𝑈2 (𝑈11 − 𝑈12 𝑃 ) − 𝑈2 𝑃 (𝑈21 − 𝑈22 𝑃 )]
𝑑𝑥1
𝑈2
2
2
2
𝑑 2 𝑥2
1
(2𝑃1 𝑃2 𝑈12 − 𝑃22 𝑈11 − 𝑃12 𝑈22 )
2 =
𝑑𝑥1
𝑈2 𝑃22
We can finally write,
𝑑 2 𝑥2
|𝐻|
=
>0
𝑑𝑥12
𝑈2 𝑃22
When the second order condition |𝐻| > 0 is satisfied, then 𝑑 2 𝑥2 ⁄𝑑𝑥12 > 0, and the relevant indifference curve is
strictly convex at the point of tangency.
3.3. Comparative-Static Analysis
The term comparative-static analysis in economics simply means the comparison of two stationary conditions. How
do we move from an initial equilibrium to a new equilibrium when the system is disturbed by a change in an
exogenous variable?
In the consumer constrained utility maximization model the exogenous variables are the prices of the goods and
consumers initial endowment (the budget). In this model the consumer’s equilibrium condition, where the limited
budget is allocated among 𝑥1 and 𝑥2 such that utility is maximized. The equilibrium was obtained by using the
Lagrangian function
𝑍 = 𝑈(𝑥1 , 𝑥2 ) + 𝜆(𝐵 − 𝑥1 𝑃1 − 𝑥2 𝑃2 )
and determining the first order condition. The first order condition can be represented by the following implicit
functions, where λ, 𝑥1 , and 𝑥2 are the endogenous variables, and 𝑃1 , 𝑃2 , and B are the exogenous variables.
𝐹 𝜆 (𝜆, 𝑥1 , 𝑥2 ; 𝐵, 𝑃1 , 𝑃2 ) = 0
𝐹1 (𝜆, 𝑥1 , 𝑥2 ; 𝐵, 𝑃1 , 𝑃2 ) = 0
𝐹 2 (𝜆, 𝑥1 , 𝑥2 ; 𝐵, 𝑃1 , 𝑃2 ) = 0
→
→
→
𝐵 − 𝑥1 𝑃1 − 𝑥2 𝑃2 = 0
𝑈1 − 𝜆𝑃1 = 0
𝑈2 − 𝜆𝑃2 = 0
Using the implicit functions and simultaneous equations we studied previously, now we can perform the comparative
static analysis. To develop the framework to analyze the impact of changes in 𝑃1 , 𝑃2 , and B on the equilibrium 𝑥1 and
𝑥2 first take the total differentials of the FOC equations above.
LN 8—Constrained Optimization
Page 18 of 38
𝜕𝐹 𝜆
𝜕𝐹 𝜆
𝜕𝐹 𝜆
𝜕𝐹 𝜆
𝜕𝐹 𝜆
𝜕𝐹 𝜆
𝑑𝜆 +
𝑑𝑥1 +
𝑑𝑥2 = −
𝑑𝑃1 −
𝑑𝑃2 −
𝑑𝐵
𝜕𝜆
𝜕𝑥1
𝜕𝑥2
𝜕𝑃1
𝜕𝑃2
𝜕𝐵
𝜕𝐹1
𝜕𝐹1
𝜕𝐹1
𝜕𝐹1
𝜕𝐹1
𝜕𝐹1
𝑑𝜆 +
𝑑𝑥1 +
𝑑𝑥2 = −
𝑑𝑃1 −
𝑑𝑃2 −
𝑑𝐵
𝜕𝜆
𝜕𝑥1
𝜕𝑥2
𝜕𝑃1
𝜕𝑃2
𝜕𝐵
𝜕𝐹 2
𝜕𝐹 2
𝜕𝐹 2
𝜕𝐹 2
𝜕𝐹 2
𝜕𝐹 2
𝑑𝜆 +
𝑑𝑥1 +
𝑑𝑥2 = −
𝑑𝑃1 −
𝑑𝑃2 −
𝑑𝐵
𝜕𝜆
𝜕𝑥1
𝜕𝑥2
𝜕𝑃1
𝜕𝑃2
𝜕𝐵
Note that all the derivatives with respect to the endogenous variables are on the left-hand-side and those with respect
the exogenous variables appear on the right-hand-side. Applying that pattern to the derivatives of the FOC, we have
−𝑃1 𝑑𝜆 − 𝑃1 𝑑𝑥1 − 𝑃2 𝑑𝑥2 = 𝑥1 𝑑𝑃1 + 𝑥2 𝑑𝑃2 − 𝑑𝐵
−𝑃1 𝑑𝜆 + 𝑈11 𝑑𝑥1 − 𝑈12 𝑑𝑥2 = 𝜆 𝑑𝑃1
−𝑃2 𝑑𝜆 + 𝑈21 𝑑𝑥1 − 𝑈22 𝑑𝑥2 = 𝑥1 𝑑 + 𝜆 𝑑𝑃2
3.3.1.
Impact of a change in income (budget), with 𝒅𝑷𝟏 = 𝒅𝑷𝟐 = 𝟎
Dividing the terms of total differentials above by 𝜕𝐵 and rearranging in the matrix format, we have
𝜕𝐹 𝜆
𝜕𝜆
𝜕𝐹1
𝜕𝜆
𝜕𝐹 2
[ 𝜕𝜆
𝜕𝐹 𝜆
𝜕𝑥1
𝜕𝐹1
𝜕𝑥1
𝜕𝐹 2
𝜕𝑥1
𝜕𝐹 𝜆 𝜕𝜆
𝜕𝐹 𝜆
−
𝜕𝑥2 𝜕𝐵
𝜕𝐵
𝜕𝐹1 𝜕𝑥1
𝜕𝐹1
= −
𝜕𝑥2 𝜕𝐵
𝜕𝐵
𝜕𝐹 2
𝜕𝐹 2 𝜕𝑥2
−
𝜕𝑥2 ] [ 𝜕𝐵 ] [ 𝜕𝐵 ]
→
0
[−𝑃1
−𝑃2
−𝑃1
𝑈11
𝑈21
𝜕𝜆
𝜕𝐵
−𝑃2
−1
𝜕𝑥1
𝑈12 ]
= [ 0]
𝑈22 𝜕𝐵
0
𝜕𝑥2
[ 𝜕𝐵 ]
Now recall that
0
|𝐽| = |−𝑃1
−𝑃2
−𝑃1
𝑈11
𝑈21
−𝑃2
𝑈12 |
𝑈22
Solving for 𝜕𝑥1 ⁄𝜕𝐵 and 𝜕𝑥2 ⁄𝜕𝐵
𝜕𝑥1
1 0
= |−𝑃1
𝜕𝐵 |𝐽| −𝑃
2
−1
0
0
𝜕𝑥2
1 0
= |−𝑃1
|𝐽|
𝜕𝐵
−𝑃2
−𝑃1
𝑈11
𝑈21
−𝑃2
1 −𝑃
𝑈12 | = | 1
|𝐽| −𝑃2
𝑈22
𝑃2 𝑈12 − 𝑃1 𝑈22
𝑈12
|=
𝑈22
|𝐽|
−1
−1 −𝑃1
0 |=
|
|𝐽| −𝑃2
0
𝑃1 𝑈12 − 𝑃2 𝑈11
𝑈11
|=
𝑈21
|𝐽|
The signs of the above two comparative static derivatives cannot be determined without additional information about
the relative magnitude of terms in the numerator. However, a positive change would indicate that it is a normal good
(Figure 4-a), but a negative change in equilibrium quantity of either good due to an increase in budget would imply
the good is an inferior good and a (Figure 4-b).
LN 8—Constrained Optimization
Page 19 of 38
Figure 4
(a)
Normal Good
x₂
(b)
Inferior Good
x₂
x₂*
x₂*
E₂
E₂
x₂
I₂
E₁
x₂
I₂
E₁
I₁
x₁
x₁ x₁*
3.3.2.
I₁
x₁* x₁
x₁
Impact of a change in 𝑷𝟏 , with 𝒅𝑷𝟐 = 𝒅𝑩 = 𝟎
Dividing the terms of total differentials above by 𝜕𝑃1 and rearranging in the matrix format, we have
𝜕𝐹 𝜆
𝜕𝜆
𝜕𝐹1
𝜕𝜆
𝜕𝐹 2
[ 𝜕𝜆
𝜕𝐹 𝜆
𝜕𝑥1
𝜕𝐹1
𝜕𝑥1
𝜕𝐹 2
𝜕𝑥1
𝜕𝐹 𝜆 𝜕𝜆
𝜕𝐹 𝜆
−
𝜕𝑥2 𝜕𝑃1
𝜕𝑃1
𝜕𝐹1 𝜕𝑥1
𝜕𝐹1
= −
𝜕𝑥2 𝜕𝑃1
𝜕𝑃1
𝜕𝐹 2 𝜕𝑥2
𝜕𝐹 2
−
𝜕𝑥2 ] [𝜕𝑃1 ] [ 𝜕𝑃1 ]
𝜕𝑥1
1 0
= |−𝑃1
𝜕𝑃1 |𝐽| −𝑃
2
𝜕𝑥1 −𝑥1 −𝑃1
=
|
|𝐽| −𝑃2
𝜕𝑃1
𝑥1
𝜆
0
→
0
[−𝑃1
−𝑃2
−𝑃1
𝑈11
𝑈21
𝜕𝜆
𝜕𝑃1
𝑥1
−𝑃2 𝜕𝑥
1
𝑈12 ]
= [ 𝜆]
𝑈22 𝜕𝑃1
0
𝜕𝑥2
[𝜕𝑃1 ]
−𝑃2
𝑈12 |
𝑈22
−𝑃2 −𝑃1
𝑈12
|+
|
𝑈22
|𝐽| −𝑃2
𝜆
|
0
𝜕𝑥1 −𝑥1 (𝑃2 𝑈12 − 𝑃1 𝑈22 ) −𝜆𝑃22
=
+
|𝐽|
|𝐽|
𝜕𝑃1
For further elucidation of the impact of the change in 𝑃1 , let’s denote the terms on the right-hand side by 𝑇1 and 𝑇2 ,
respectively. Thus
𝜕𝑥1
= 𝑇1 + 𝑇2
𝜕𝑃1
First, let’s attend to 𝑇1 .
𝑇1 =
−𝑥1 (𝑃2 𝑈12 − 𝑃1 𝑈22 )
|𝐽|
LN 8—Constrained Optimization
Page 20 of 38
In the analysis of the impact of a change in the budget we saw that
𝜕𝑥1 𝑃2 𝑈12 − 𝑃1 𝑈22
=
|𝐽|
𝜕𝐵
Substituting this in 𝑇1 we have
𝜕𝑥1
𝑇1 = − (
)𝑥
𝜕𝐵 1
Furthermore, holding 𝑃2 unchanged in the budget constraint 𝐵 = 𝑥1 𝑃1 + 𝑥2 𝑃2 , then
𝜕𝐵
= 𝑥1
𝜕𝑃1
Substituting for 𝑥1 ,
𝜕𝑥1 𝜕𝐵
𝑇1 = − (
)
𝜕𝐵 𝜕𝑃1
Therefore, 𝑇1 shows the impact of a change in 𝑃1 on the equilibrium quantity of 𝑥1 via the budget constraint. This is
the income effect of the change in 𝑃1 .
What about 𝑇2 ? To explain, consider the first equation in the system of equations formed by the total differential of
the FOC terms.
−𝑃1 𝑑𝑥1 − 𝑃2 𝑑𝑥2 = 𝑥1 𝑑𝑃1 + 𝑥2 𝑑𝑃2 − 𝑑𝐵
Here the terms 𝑥2 𝑑𝑃2 and 𝑑𝐵 on the right-hand-side are both equal to zero because we are holding 𝑃2 and B constant
to observe the impact of a change in 𝑃1 . With respect to the remaining term on the right-hand-side, 𝑥1 𝑑𝑃1 represents
the effective loss in income due to the higher price. If we compensate the consumer for the effective income loss due
to the higher 𝑃1 such that the consumer remains at the same utility level (the same indifference) curve, then 𝑥1 𝑑𝑃1 = 0.
Our system of equations then becomes
−𝑃1 𝑑𝜆 − 𝑃1 𝑑𝑥1 − 𝑃2 𝑑𝑥2 = 0
−𝑃1 𝑑𝜆 + 𝑈11 𝑑𝑥1 − 𝑈12 𝑑𝑥2 = 𝜆 𝑑𝑃1
−𝑃2 𝑑𝜆 + 𝑈21 𝑑𝑥1 − 𝑈22 𝑑𝑥2 = 𝑥1 𝑑 + 𝜆 𝑑𝑃2
Again, dividing the terms of total differentials by 𝜕𝑃1 and rearranging in the matrix format, we have
0
[−𝑃1
−𝑃2
−𝑃1
𝑈11
𝑈21
𝜕𝜆
𝜕𝑃1
−𝑃2 𝜕𝑥
0
1
𝑈12 ]
= [ 𝜆]
𝑈22 𝜕𝑃1
0
𝜕𝑥2
[𝜕𝑃1 ]
Solving for the compensated version of 𝜕𝑥1 ⁄𝜕𝑃1 , we have
𝜕𝑥1
1 0
(
)
= |−𝑃1
𝜕𝑃1 𝑐𝑜𝑚𝑝𝑒𝑛𝑠𝑎𝑡𝑒𝑑 |𝐽| −𝑃
2
LN 8—Constrained Optimization
0
𝜆
0
−𝑃2
−𝜆𝑃22
𝑈12 | =
= 𝑇2
|𝐽|
𝑈22
Page 21 of 38
Since, due to compensation, the change in 𝑥1 is purely due the impact of the price, 𝑇2 then represents the substitution
effect of the price change. In short,
𝜕𝑥1
𝜕𝑥1 𝜕𝐵
𝜕𝑥1
= (
)
+(
)
𝜕𝑃1
𝜕𝐵 𝜕𝑃1
𝜕𝑃1 𝑐𝑜𝑚𝑝𝑒𝑛𝑠𝑎𝑡𝑒𝑑
𝜕𝑥1
= 𝑖𝑛𝑐𝑜𝑚𝑒 𝑒𝑓𝑓𝑒𝑐𝑡 + 𝑠𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑜𝑛 𝑒𝑓𝑓𝑒𝑐𝑡
𝜕𝑃1
Thus, the impact of an increase in 𝑃1 on the quantity of good 1 consumed consists of the income effect and the
substitution effect. This is the so-called Slutsky equation
Figure 5 shows the impact of a decrease in 𝑃1 on the equilibrium 𝑥1 . The initial equilibrium is established at point 𝐸1
on the indifference curve 𝐼1 , where the budget line is tangent to that indifference curve. A fall in the price 𝑃1 causes the
budget line to rotate rightward. The consumer now consumes more of good 1 (and less of good 2), moving to the
higher indifference curve 𝐼2 . The new equilibrium is established at 𝐸2 , with 𝑥1∗ amount of good 1 consumed. The two
components of the move from 𝐸1 to 𝐸2 is shown in the diagram, with 𝑇2 = (𝜕𝑥1 ⁄𝜕𝑃1 )𝑐𝑜𝑚𝑝𝑒𝑛𝑠𝑎𝑡𝑒𝑑 (the substitution
effect) shown as the move from 𝐸1 to 𝐸 𝑆 . Here the consumer is “compensated” for the price-induced gain in effective
income, to maintain the same utility level. On the horizontal axis the substitution effect is shown as the increase from
𝑥1 to 𝑥1𝑆 . The income effect is shown as the move from 𝐸 𝑆 to 𝐸2 , and on the horizontal axis as the increase from 𝑥1𝑆 to
𝑥1∗ .
Figure 5
x₂
x₂
E₁
x₂*
x₂S
E₂
ES
I₁
x₁
x₁S
I₂
x₁*
x₁
4. Homogeneous Functions
A function is said to be homogenous of degree 𝑟, if multiplying each of its independent variables by a constant 𝑐
changes the value of the function by the proportion 𝑐 𝑟 .
For 𝑟 = 0
For 𝑟 = 1
For 𝑟 = 2
For 𝑟 ⋮ 2
𝑓(𝑐𝑥1 , 𝑐𝑥2 , … , 𝑐𝑥𝑛 ) = 𝑐 0 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 ) = 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
𝑓(𝑐𝑥1 , 𝑐𝑥2 , … , 𝑐𝑥𝑛 ) = 𝑐𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
𝑓(𝑐𝑥1 , 𝑐𝑥2 , … , 𝑐𝑥𝑛 ) = 𝑐 2 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
𝑓(𝑐𝑥1 , 𝑐𝑥2 , … , 𝑐𝑥𝑛 ) ⋮ 𝑐 2 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
LN 8—Constrained Optimization
Homogeneous of degree zero
Homogeneous of degree one
Homogeneous of degree two
𝐻𝑜𝑚𝑜𝑔𝑒𝑛𝑒𝑜𝑢𝑠 ⋮ 𝑑𝑒𝑔𝑟𝑒𝑒 𝑡𝑤𝑜
Page 22 of 38
𝑓(𝑐𝑥1 , 𝑐𝑥2 , … , 𝑐𝑥𝑛 ) = 𝑐 𝑘 𝑓(𝑥1 , 𝑥2 , … , 𝑥𝑛 )
For 𝑟 = 𝑘
Homogeneous of degree 𝑘
Example
Consider
𝑓(𝑥1 , 𝑥2 , 𝑥3 ) =
𝑓(𝑐𝑥1 , 𝑐𝑥2 , 𝑐𝑥3 ) =
𝑥1 2𝑥3
+
𝑥2 3𝑥1
(𝑐𝑥1 ) 2(𝑐𝑥3 ) 𝑥1 2𝑥3
+
= +
= 𝑐 0 𝑓(𝑥1 , 𝑥2 , 𝑥3 )
(𝑐𝑥2 ) 3(𝑐𝑥1 ) 𝑥2 3𝑥1
Homogeneous of degree zero.
Example
Consider
𝑓(𝑥1 , 𝑥2 , 𝑥3 ) =
𝑓(𝑐𝑥1 , 𝑐𝑥2 , 𝑐𝑥3 ) =
Example
Consider
𝑥12 2𝑥32
+
𝑥2
𝑥1
(𝑐𝑥1 )2 2(𝑐𝑥3 )2 𝑐𝑥12 2𝑐𝑥32
𝑥12 2𝑥32
+
=
+
= 𝑐( +
) = 𝑐𝑓(𝑥1 , 𝑥2 , 𝑥3 )
(𝑐𝑥2 )
3(𝑐𝑥1 )
𝑥2
3𝑥1
𝑥2 3𝑥1
Homogeneous of degree one.
𝑓(𝑥1 , 𝑥2 , 𝑥3 ) = 2𝑥12 + 3𝑥2 𝑥3 − 𝑥32
𝑓(𝑐𝑥1 , 𝑐𝑥2 , 𝑐𝑥3 ) = 2(𝑐𝑥1 )2 + 3(𝑐𝑥2 )(𝑐𝑥3 ) − (𝑐𝑥3 )2 = 𝑐 2 (2𝑥12 + 3𝑥2 𝑥3 − 𝑥32 ) = 𝑐 2 𝑓(𝑥1 , 𝑥2 , 𝑥3 )
𝐻𝑂𝐷 2
4.1. Linearly Homogenous Production Functions
A production function is linearly homogenous (homogenous of degree one) if increasing all inputs c-fold would raise
the output also c-fold.
𝑄 = 𝑓(𝐾, 𝐿)
𝑓(𝑐𝐾, 𝑐𝐿) = 𝑐𝑓(𝐾, 𝐿) = 𝑐𝑄
This property is also known as the constant returns to scale
4.1.1. Properties of Linearly Homogenous Production Functions
Property I
If 𝑄 = 𝑓(𝐾, 𝐿) is homogenous of degree one, then the 𝐴𝑃𝑃𝐿 and 𝐴𝑃𝑃𝐾 each can be expressed as a function of 𝑘 ≡ 𝐾 ⁄𝐿
ratio alone.
Given that 𝑓(𝑐𝐾, 𝑐𝐿) = 𝑐𝑓(𝐾, 𝐿) = 𝑐𝑄, multiply the independent variables by 𝑐 = 1⁄𝐿. Thus,
𝑄
𝐾 𝐿
= 𝑓 ( , ) = 𝑓(𝑘, 1) = φ(𝑘)
𝐿
𝐿 𝐿
By definition, 𝐴𝑃𝑃𝐿 = 𝑄 ⁄𝐿 . Therefore,
𝐴𝑃𝑃𝐿 = φ(𝑘)
Also, 𝐴𝑃𝑃𝐾 = 𝑄 ⁄𝐾 . Multiply the right-hand-side by 𝐿⁄𝐿,
𝐴𝑃𝑃𝐾 =
𝑄𝐿
1 φ(𝑘)
= φ(𝑘) =
𝐿𝐾
𝑘
𝑘
LN 8—Constrained Optimization
Page 23 of 38
Since 𝐴𝑃𝑃𝐿 and 𝐴𝑃𝑃𝐾 are both function of the 𝐾 ⁄𝐿 ratio, as long as this ratio is maintained at any input level,
increasing the amount of inputs will leave both average products unchanged. This means that both average product
functions are homogenous of degree zero. If average products are constant at any input (and output) level, then
average cost functions will also remain constant at any output level. Thus, instead of the U-shaped average cost
curves, we have the horizontal line. This is exactly the meaning of constant returns to scale.
Property II
If 𝑄 = 𝑓(𝐾, 𝐿) is homogenous of degree one, then the 𝑀𝑃𝑃𝐿 and 𝑀𝑃𝑃𝐾 each can be expressed as a function of 𝑘 ≡ 𝐾 ⁄𝐿
ratio alone.
It was shown above that 𝑄 ⁄𝐿 = φ(𝑘). Thus,
𝑄 = 𝐿φ(𝑘)
𝑀𝑃𝑃𝑘 =
𝜕𝑄
𝜕
𝜕
𝐾
[𝐿φ(𝑘)] =
=
[𝐿φ ( )]
𝜕𝐾 𝜕𝐾
𝜕𝐾
𝐿
1
𝑀𝑃𝑃𝑘 = 𝐿 ( ) 𝜑 ′ (𝑘) = 𝜑 ′ (𝑘)
𝐿
And,
𝑀𝑃𝑃𝐿 =
𝜕𝑄
𝜕
[𝐿φ(𝑘)]
=
𝜕𝐿 𝜕𝐿
𝜕𝑄
𝜕φ(𝑘)
= φ(𝑘) + 𝐿
𝜕𝐿
𝜕𝐿
𝜕𝑄
𝑑φ(𝑘) 𝜕𝑘
𝑀𝑃𝑃𝐿 =
= φ(𝑘) + 𝐿
𝜕𝐿
𝑑𝑘 𝜕𝐿
𝑀𝑃𝑃𝐿 =
using the product rule
using the chain rule
Since 𝑘 = 𝐾 ⁄𝐿, then
𝜕𝑘
𝐾
=− 2
𝜕𝐿
𝐿
𝑀𝑃𝑃𝐿 =
𝜕𝑄
𝐾
= φ(𝑘) + 𝐿𝜑 ′ (𝑘) (− 2 )
𝜕𝐿
𝐿
𝑀𝑃𝑃𝐿 =
𝜕𝑄
= φ(𝑘) − 𝑘𝜑 ′ (𝑘)
𝜕𝐿
Property III (Euler’s Theorem)
If 𝑄 = 𝑓(𝐾, 𝐿) is homogenous of degree one, then
𝐾
𝜕𝑄
𝜕𝑄
+𝐿
≡𝑄
𝜕𝐾
𝜕𝐿
To prove this, using the results of Property II,
𝐾
𝜕𝑄
𝜕𝑄
+𝐿
= 𝐾𝜑 ′ (𝑘) + 𝐿[φ(𝑘) − 𝑘𝜑 ′ (𝑘)]
𝜕𝐾
𝜕𝐿
𝐾
𝜕𝑄
𝜕𝑄
+𝐿
= 𝐾𝜑 ′ (𝑘) + 𝐿φ(𝑘) − 𝐿𝑘𝜑 ′ (𝑘) = 𝐾𝜑 ′ (𝑘) + 𝐿φ(𝑘) − 𝐾𝜑 ′ (𝑘)
𝜕𝐾
𝜕𝐿
LN 8—Constrained Optimization
Page 24 of 38
𝐾
𝜕𝑄
𝜕𝑄
+𝐿
= 𝐿φ(𝑘) = 𝑄
𝜕𝐾
𝜕𝐿
Since this result is valid for any values of K and L, we can write this property as an identical equality. In economic
terms, this property of linearly homogenous production functions (with constant returns to scale) implies that the
total product is distributed among the factor inputs according to the marginal productivity of each input. That is, if
each factor input is paid the amount of its marginal product, the total product will be exactly exhausted by the
distributive shares for all factor inputs.
4.2. Cobb-Douglas Production Function
The CDPF is generally represented as
𝑄 = 𝐴𝐾 𝛼 𝐿1−𝛼
where 𝐴 > 0 and 0 < 𝛼 < 1.
A generalized version of the CDPF is
𝑄 = 𝐴𝐾 𝛼 𝐿𝛽
where 0 < 𝛽 < 1, but is not necessarily equal to 1 − 𝛼.
4.2.1. Features of 𝑸 = 𝑨𝑲𝜶 𝑳𝜷
4.2.1.1.
Homogenous of Degree (𝛂 + 𝜷)
𝐴(𝑐𝐾)𝛼 (𝑐𝐿)𝛽 = 𝑐 𝛼+𝛽 (𝐴𝐾 𝛼 𝐿𝛽 ) = 𝑐 𝛼+𝛽 𝑄
If 𝛼 + 𝛽 = 1, then 𝐴(𝑐𝐾)𝛼 (𝑐𝐿)𝛽 = 𝑐(𝐴𝐾 𝛼 𝐿𝛽 ) = 𝑐𝑄. The production function is HOD1, and there will be constant
returns to scale.
4.2.1.2.
Isoquants Have Negative Slopes
An isoquant, similar to the indifference curve, is the locus of various combinations of factor inputs that yield the same
output. Figure 6 shows three of such isoquants each showing a different output level.
LN 8—Constrained Optimization
Page 25 of 38
Figure 6
For any output level 𝑄0 , the production function can be written as
𝐴𝐾 𝛼 𝐿𝛽 = 𝑄0
𝐴𝐾 𝛼 𝐿𝛽 − 𝑄0 = 0
Taking the natural log,
ln 𝐴 + 𝛼 ln 𝐾 + 𝛽 ln 𝐿 − ln 𝑄0 = 0
This is an implicit function which defines 𝐾as a function of 𝐿.
𝐹(𝐾, 𝐿) = 0
Taking the total derivative of the implicit function, we have
𝜕𝐹
𝜕𝐹
𝑑𝐾 +
𝑑𝐿 = 0
𝜕𝐾
𝜕𝐿
𝛼
𝛽
𝑑𝐾 + 𝑑𝐿 = 0
𝐾
𝐿
which yields,
𝑑𝐾
𝛽𝐾
=−
<0
𝑑𝐿
𝛼𝐿
(𝛼, 𝛽, 𝐾, 𝐿 > 0)
4.2.1.3.
Isoquants are Strictly Convex
For the isoquant to be strictly convex,
𝑑2𝐾
>0
𝑑𝐿2
LN 8—Constrained Optimization
Page 26 of 38
𝑑2𝐾
𝑑
𝛽𝐾
𝛽 𝑑 𝐾
𝛽1
𝑑𝐾
=
(−
)=−
( ) = − 2 (𝐿
− 𝐾) > 0
𝑑𝐿2 𝑑𝐿
𝛼𝐿
𝛼 𝑑𝐿 𝐿
𝛼𝐿
𝑑𝐿
since (𝐿
𝑑𝐾
− 𝐾) < 0
𝑑𝐿
4.2.2. Properties of HOD1 Cobb-Douglas Production Function
Property I
If 𝑄 = 𝑓(𝐾, 𝐿) is homogenous of degree one, then the 𝐴𝑃𝑃𝐿 and 𝐴𝑃𝑃𝐾 each can be expressed as a function of 𝑘 ≡ 𝐾 ⁄𝐿
ratio alone.
We can write the CDPF as
𝑄 = 𝐴𝐾 𝛼 𝐿1−𝛼 = 𝐴𝐾 𝛼
𝐿
𝐾 𝛼
=
𝐴
(
) 𝐿 = 𝐿𝐴𝑘 𝛼
𝐿𝛼
𝐿
𝐴𝑃𝑃𝐿 =
𝑄
= 𝐴𝑘 𝛼
𝐿
𝐴𝑃𝑃𝐾 =
𝑄 𝑄𝐿
1
=
= 𝐴𝑘 𝛼 = 𝐴𝑘 𝛼−1
𝐾 𝐿𝐾
𝑘
Property II
If 𝑄 = 𝑓(𝐾, 𝐿) is homogenous of degree one, then the 𝑀𝑃𝑃𝐿 and 𝑀𝑃𝑃𝐾 each can be expressed as a function of 𝑘 ≡ 𝐾 ⁄𝐿
ratio alone.
𝑄 = 𝐴𝐾 𝛼 𝐿1−𝛼
𝑀𝑃𝑃𝐿 =
𝜕𝑄
𝐾 𝛼
= 𝐴𝐾 𝛼 (1 − 𝛼)𝐿−𝛼 = 𝐴(1 − 𝛼) ( ) = 𝐴(1 − 𝛼)𝑘 𝛼
𝜕𝐿
𝐿
𝑀𝑃𝑃𝐾 =
𝜕𝑄
𝐾 𝛼−1
= 𝐴𝛼𝐾 𝛼−1 𝐿−(𝛼−1) = 𝐴𝛼 ( )
= 𝐴𝛼𝑘 𝛼−1
𝜕𝐾
𝐿
Property III (Euler’s Theorem)
If 𝑄 = 𝑓(𝐾, 𝐿) is homogenous of degree one, then
𝐾
𝜕𝑄
𝜕𝑄
+𝐿
≡𝑄
𝜕𝐾
𝜕𝐿
𝜕𝑄
𝜕𝑄
+𝐿
= 𝐾𝐴𝛼𝑘 𝛼−1 + 𝐿𝐴(1 − 𝛼)𝑘 𝛼
𝜕𝐾
𝜕𝐿
𝜕𝑄
𝜕𝑄
𝐾𝛼
𝐾
+𝐿
= 𝐿𝐴𝑘 𝛼 (
+ 1 − 𝛼)
𝜕𝐾
𝜕𝐿
𝐿𝑘
𝜕𝑄
𝜕𝑄
𝐾
+𝐿
= 𝐿𝐴𝑘 𝛼 (𝛼 + 1 − 𝛼) = 𝐿𝐴𝑘 𝛼 = 𝑄
𝜕𝐾
𝜕𝐿
𝐾
4.2.3. Economic Meaning of α and 𝟏 − 𝜶 in CDPF
As shown above, per Euler’s Theorem, when the production function is homogenous of degree one the relative share
of each input from the total output is the marginal product of that input. This is, of course, true about the CobbDouglas production function. As it happens, α and 1 − 𝛼 are, respectively, the relative shares of capital (K) and labor
(L) in the CDPF.
LN 8—Constrained Optimization
Page 27 of 38
The relative share of capital is
𝐾(𝜕𝑄 ⁄𝜕𝐾 ) 𝐾(𝐴𝛼𝑘 𝛼−1 )
𝐾 1
𝑘
=
= 𝛼( ) = 𝛼 = 𝛼
𝑄
𝐿𝐴𝑘 𝛼
𝐿 𝑘
𝑘
The relative share of labor is
𝐿(𝜕𝑄 ⁄𝜕𝐿) 𝐿[𝐴(1 − 𝛼)𝑘 𝛼 ]
=
=1−𝛼
𝑄
𝐿𝐴𝑘 𝛼
Also, writing the relative share of K slightly differently, it can be seen that this relative share is also the elasticity of
output with respect to K
𝛼=
𝜕𝑄 ⁄𝜕𝐾 𝜕𝑄 𝐾
=
= 𝜀𝑄𝐾
𝑄 ⁄𝐾
𝜕𝐾 𝑄
Similarly
1−𝛼 =
𝜕𝑄 ⁄𝜕𝐿 𝜕𝑄 𝐿
=
= 𝜀𝑄𝐿
𝑄 ⁄𝐿
𝜕𝐿 𝑄
4.2.4. Economic Meaning of 𝑨 in 𝑸 = 𝑨𝑲𝜶 𝑳𝟏−𝜶
For given values of K and L, the magnitude of A will proportionately affect the level of output Q. Thus, A may be
considered as an efficiency parameter, that is, an indicator of the state of technology.
5. Least-Cost Combination of Inputs
Another example of constrained optimization is the problem of least-cost combination of inputs. Given a level of
output 𝑄, what is the least-cost combination of factor inputs to produce that output. Let 𝐾 and 𝐿 represent the factor
inputs. The total cost of hiring these inputs is
𝐶 = 𝑃𝐾 𝐾 + 𝑃𝐿 𝐿
Since the objective is to minimize the cost of production for a given level of output, the above cost equation is the
objective function to be minimized subject to a specified level of output. The production function is
𝑄 = 𝑄(𝐾, 𝐿)
The output constraint is then,
𝑄(𝐾, 𝐿) = 𝑄0
5.1. First-Order Condition
Forming the Lagrangian function,
𝑍 = 𝑃𝐾 𝐾 + 𝑃𝐿 𝐿 + µ[𝑄0 − 𝑄(𝐾, 𝐿)]
the FOC are
LN 8—Constrained Optimization
Page 28 of 38
𝜕𝑍
= 𝑄0 − 𝑄(𝐾, 𝐿) = 0
𝜕µ
𝜕𝑍
𝑍𝐾 =
= 𝑃𝐾 − µ𝑄𝐾 = 0
𝜕𝐾
𝑍µ =
𝑍𝐿 =
𝜕𝑍
= 𝑃𝐿 − µ𝑄𝐿 = 0
𝜕𝐿
From the last two equations we deduce,
𝑃𝐾
𝑃𝐿
=
=µ
𝑄𝐾 𝑄𝐿
This implies that the optimum combination of inputs requires that the outlay per unit of marginal product must be
equal for both inputs. The Lagrange multiplier µ here represents the marginal cost of production in the optimum
state.
The optimum condition can also be shown as
𝑃𝐾 𝑄𝐾
=
𝑃𝐿
𝑄𝐿
The first order condition provides that the ratio of marginal products be equal to the factor input price ratio. To
further explore this expression let’s consider the concepts of 𝑖𝑠𝑜𝑞𝑢𝑎𝑛𝑡𝑠 and 𝑖𝑠𝑜𝑐𝑜𝑠𝑡𝑠
5.2. Isoquants and 𝐈𝐬𝐨𝐜𝐨𝐬𝐭𝐬
A given level of output can be produced by using different combination of factor inputs. An isoquant curve, therefore,
represents the locus of various combination of inputs that yield that output. This was explained in section 4.2.1.2 and
shown in Figure 6 above. An isocost is the locus of various combination of inputs that cost the same. Figure 7 shows
an isoquant depicting the output level 𝑄0 . It also shows two isocosts, 𝐶0 and 𝐶1 .
Figure 7
K
D
K*
E
F
C₁
L*
LN 8—Constrained Optimization
Q₁
C₂
L
Page 29 of 38
Mathematically, the given or fixed level of output is expressed by setting the total differential of the production
function 𝑄 = 𝑄(𝐾, 𝐿) equal to zero.
𝑑𝑄 = 𝑄𝐾 𝑑𝐾 + 𝑄𝐿 𝑑𝐿 = 0
from which we obtain,
𝑑𝐾
𝑄𝐿
=−
𝑑𝐿
𝑄𝐾
To maintain the same level of output any decrease (increase) in the quantity of one input must be compensated by an
increase (a decrease) in the quantity of the other input. These substitutions take place at the margin. Thus, the rate at
which the quantity of one input is substituted or exchanged for the quantity of the other, 𝑑𝐾 ⁄𝑑𝐿, is called the
marginal rate of technical substitution (𝑀𝑅𝑇𝑆𝐿𝐾 ). Along the isoquant the marginal rate of technical substitution,
which also represents the slope at any point on the isoquant, is equal to the (negative of) the ratio of marginal product
of each input.
To draw the isocost line use the cost equation 𝐶 = 𝑃𝐾 𝐾 + 𝑃𝐿 𝐿, rearranging which provides
𝐾=
𝐶
𝑃𝐿
− 𝐿
𝑃𝐾 𝑃𝐾
To draw each isocost line, 𝐶 is treated as a parameter that causes the a shift of isocost. In Figure 6 two isocost lines
appear, one for 𝐶 = 𝐶0 and the higher one for 𝐶 = 𝐶1 . Taking the first derivative of the cost function, we have
𝑑𝐾
𝑃𝐿
=−
𝑑𝐿
𝑃𝐾
In Figure 6, to produce 𝑄0 , the optimal combination of the inputs is achieved at point 𝐸, where the slope of isocost line
is equal to the slope of the isoquant.
𝑃𝐿
𝑄𝐿
=
𝑃𝐾 𝑄𝐾
Stated differently, at E
𝑄𝐾 𝑄𝐿
=
𝑃𝐾
𝑃𝐿
That is, marginal product per dollar spent on each input is equal. Note that the input combination at points D and F on
the isoquant 𝑄0 are not optimal. At 𝐷, 𝑄𝐾 ⁄𝑃𝐾 < 𝑄𝐿 ⁄𝑃𝐿 . By substituting more L for K, cost of producing the same
output level would be reduced; the isocost line will shift left until it is tangent to the isoquant. The reverse would take
place if the initial position were point F, where 𝑄𝐾 ⁄𝑃𝐾 > 𝑄𝐿 ⁄𝑃𝐿 . More K will be substituted for L.
5.3. Second-Order Condition
In this situation the objective function 𝐶 = 𝑃𝐾 𝐾 + 𝑃𝐿 𝐿 is to be minimized subject to the output constraint 𝑄(𝐾, 𝐿) = 𝑄0 .
After forming the Lagrangian function and solving the system of the first-order equations,
𝑍𝜇 = 𝑄0 − 𝑄(𝐾, 𝐿) = 0
𝑍𝐾 = 𝑃𝐾 − µ𝑄𝐾 = 0
LN 8—Constrained Optimization
Page 30 of 38
𝑍𝐿 = 𝑃𝐿 − µ𝑄𝐿 = 0
Finding the second-order total differentials
𝑍𝜇𝜇 = 0
𝑍𝐾𝐾 = −µ𝑄𝐾𝐾
𝑍𝐿𝐾 = −µ𝑄𝐿𝐾
𝑍𝜇𝐾 = −𝑄𝐾
𝑍𝐾𝐿 = −µ𝑄𝐾𝐿
𝑍𝐿𝐿 = −µ𝑄𝐿𝐿
𝑍𝜇𝐿 = −𝑄𝐿
The bordered Hessian, as written below, must be negative for the stationary value of the cost to be a minimum.
0
|𝐻| = |𝑄𝐾
𝑄𝐿
𝑄𝐾
−µ𝑄𝐾𝐾
−µ𝑄𝐿𝐾
𝑄𝐿
−µ𝑄𝐾𝐿 | = µ(𝑄𝐾𝐾 𝑄𝐿2 − 2𝑄𝐾𝐿 𝑄𝐾 𝑄𝐿 + 𝑄𝐿𝐿 𝑄𝐾2 ) < 0
−µ𝑄𝐿𝐿
Since µ > 0, then (𝑄𝐾𝐾 𝑄𝐿2 − 2𝑄𝐾𝐿 𝑄𝐾 𝑄𝐿 + 𝑄𝐿𝐿 𝑄𝐾2 ) must be negative at the point of tangency of the isocost and the
isoquant (point E). The presence of the terms 𝑄𝐾𝐾 , 𝑄𝐿𝐿 , and 𝑄𝐾𝐿 requires certain restrictions on the curvature of the
isoquants. Keep in mind that the isoquant is shown as a functional relationship between the variables L and K. Thus,
the curvature of the isoquant is indicated by the second derivative of K with respect to L. The restriction required is
that the isoquant be strictly convex, for which the second derivative must be positive.
𝑑2𝐾
>0
𝑑𝐿2
The determination of second derivative is as follows.
𝑑𝐾
𝑄𝐿
=−
𝑑𝐿
𝑄𝐾
𝑑2𝐾
𝑑
𝑄𝐿
1
𝑑𝑄𝐿
𝑑𝑄𝐾
=
(− ) = − 2 (𝑄𝐾
− 𝑄𝐿
)
2
𝑑𝐿
𝑑𝐿
𝑄𝐾
𝑑𝐿
𝑑𝐿
𝑄𝐾
𝑑𝑄𝐿
𝑑𝐾
𝑄𝐿
= 𝑄𝐿𝐿 + 𝑄𝐿𝐾
= 𝑄𝐿𝐿 − 𝑄𝐿𝐾
𝑑𝐿
𝑑𝐿
𝑄𝐾
𝑑𝑄𝐾
𝑑𝐾
𝑄𝐿
= 𝑄𝐾𝐿 + 𝑄𝐾𝐾
= 𝑄𝐾𝐿 − 𝑄𝐾𝐾
𝑑𝐿
𝑑𝐿
𝑄𝐾
𝑑2𝐾
1
𝑄𝐿
𝑄𝐿
= − 2 [𝑄𝐾 (𝑄𝐿𝐿 − 𝑄𝐿𝐾 ) − 𝑄𝐿 (𝑄𝐾𝐿 − 𝑄𝐾𝐾 )]
2
𝑑𝐿
𝑄𝐾
𝑄𝐾
𝑄𝐾
𝑑2𝐾
1
𝑄𝐿2
=
−
(𝑄
𝑄
−
𝑄
𝑄
−
𝑄
𝑄
+
𝑄
)
𝐿𝐿
𝐾
𝐿𝐾
𝐿
𝐾𝐿
𝐿
𝐾𝐾
2
𝑑𝐿2
𝑄𝐾
𝑄𝐾
𝑑2𝐾
1
= − 3 (𝑄𝐿𝐿 𝑄𝐾2 − 2𝑄𝐾𝐿 𝑄𝐾 𝑄𝐿 + 𝑄𝐾𝐾 𝑄𝐿2 )
2
𝑑𝐿
𝑄𝐾
Since the terms in the parenthetical expression is the same as those in the bordered Hessian, therefore, if |𝐻| < 0,
then this parenthetical expression must also be negative, hence a positive second derivative. The isoquant is
therefore strictly convex at the point of tangency.
LN 8—Constrained Optimization
Page 31 of 38
5.4. Comparative-Statics of Least-Cost Combination Model
The comparative-static aspect of the current model involves the analysis of changes in the equilibrium (optimum)
combination of inputs when the output level changes, that is when the isoquant shifts, holding the input price ratio
(𝑃𝐾 ⁄𝑃𝐿 ) , the slope of the isocost, constant.
Figure 7 shows two successive increases in output shown by the shift in isoquant from 𝑄0 to 𝑄1 and then to 𝑄2 , with
each shift resulting in a new different point of tangency with the isocost. The locus of these points of tangency is
called the expansion path. Note that the expansion path in Figure 7 is linear. Along such expansion path the 𝐾/𝐿
ratio remains constant. The linear expansion path is a basic property of homogenous production functions.
Consider the generalized version of CDPF, 𝑄 = 𝐴𝐾 𝛼 𝐿𝛽 . Per FOC,
𝑃𝐾 𝑄𝐾 𝐴𝛼𝐾 𝛼−1 𝐿𝛽 𝛼𝐿
=
=
=
𝑃𝐿
𝑄𝐿 𝐴𝛽𝐾 𝛼 𝐿𝛽−1 𝛽𝐾
From which we derive,
𝐾 𝛼𝑃𝐿
=
𝐿 𝛽𝑃𝐾
Since all the terms on the right-hand-side (𝛼, 𝛽, 𝑃𝐾 , and 𝑃𝐿 ) remain unchanged as the isoquant shifts right, then 𝐾 ⁄𝐿
remains constant. In Figure 7,
𝐾0 𝐾1 𝐾2
=
=
𝐿0 𝐿1 𝐿2
Figure 7
K
Expansion
Path
K₂
Q₂
K₁
Q₁
K₀
Q₀
L₀
L₁
L₂
L
Note that any homogenous production function will generate a linear expansion path. However, a production function
HOD1, in addition to generating a linear expansion path, also has the property that if inputs are doubled (the scale of
production is doubled), output will also double, or if the inputs are tripled, output will also triple, ... Consider the
CDPF,
𝑄 = 𝐴𝐾 𝛼 𝐿1−𝛼
LN 8—Constrained Optimization
Page 32 of 38
𝑃𝐾 𝑄𝐾
𝐴𝛼𝐾 𝛼−1 𝐿1−𝛼
𝛼𝐿
=
=
=
𝑃𝐿
𝑄𝐿 𝐴(1 − 𝛼)𝐾 𝛼 𝐿−𝛼 (1 − 𝛼)𝐾
𝐴(2𝐾)𝛼 (2𝐿)1−𝛼 = 2𝐴𝐾 𝛼 𝐿1−𝛼 = 2𝑄
𝐾
𝛼𝑃𝐿
=
𝐿 (1 − 𝛼)𝑃𝐾
𝐴(𝑐𝐾)𝛼 (𝑐𝐿)1−𝛼 = 𝑐𝐴𝐾 𝛼 𝐿1−𝛼 = 𝑐𝑄
(Linear expansion path)
(Constant returns to scale)
In Figure 7, the production function represented by the three isoquants is HOD1. Note that
𝑂𝐿1 = 2𝑂𝐿0
𝑂𝐿2 = 3𝑂𝐿0
5.5.
𝑂𝐾1 = 2𝑂𝐾0
𝑂𝐾2 = 3𝑂𝐾0
𝑂𝐸1 = 2𝑂𝐸0
𝑂𝐸2 = 3𝑂𝐸2
Homothetic Functions
If the function 𝑄 = 𝑄(𝐾, 𝐿) is homogenous of degree r (r = 0, 1, 2,...), then this function can give rise to a homothetic
function in the form of a composite function,
𝐻 = 𝐻(𝑄) = 𝐻[𝑄(𝐾, 𝐿)]
[𝐻′(𝑄) ≠ 0]
The homothetic function is in general not homogeneous in the variables K, and L, but it does generate a linear
expansion path. The latter property is due to the fact that the H isoquants share the same slope as the Q isoquants.
The slope of the Q isoquant is,
𝑑𝐾
𝜕𝑄 ⁄𝜕𝐾
𝑄𝐾
=−
=−
𝑑𝐿
𝜕𝑄 ⁄𝜕𝐿
𝑄𝐿
The slope of the H isoquant is
𝑑𝐾
𝐻𝐾
=−
𝑑𝐿
𝐻𝐿
where,
𝐻𝐾 =
𝑑𝐻 𝜕𝑄
= 𝐻 ′ (Q)𝑄𝐾
𝑑𝑄 𝜕𝐾
𝐻𝐿 =
𝑑𝐻 𝜕𝑄
= 𝐻′ (Q)𝑄𝐿
𝑑𝑄 𝜕𝐿
Thus,
𝑑𝐾
𝐻′ (Q)𝑄𝐾
𝑄𝐾
=− ′
=−
𝑑𝐿
𝐻 (Q)𝑄𝐿
𝑄𝐿
Another important property of homothetic production functions is the elasticity of optimal input levels (input
combinations at the point of tangency) with respect to the output level is uniform for the inputs K and L. To show this,
consider the property of linear expansion paths (which applies to homothetic production functions) that the 𝐾 ⁄𝐿 ratio
remains constant for all output levels. Mathematically this is expressed as
𝜕 𝐾
( )=0
𝜕𝐻0 𝐿
LN 8—Constrained Optimization
Page 33 of 38
𝜕 𝐾
1
𝜕𝐾
𝜕𝐿
( ) = 2 (𝐿
−𝐾
)=0
𝜕𝐻0 𝐿
𝐿
𝜕𝐻0
𝜕𝐻0
which implies,
𝐿
𝜕𝐾
𝜕𝐿
=𝐾
𝜕𝐻0
𝜕𝐻0
Multiply both sides by 𝐻0 ⁄(𝐾𝐿)
𝐿
𝜕𝐾 𝐻0
𝜕𝐿 𝐻0
=𝐾
𝜕𝐻0 𝐾𝐿
𝜕𝐻0 𝐾𝐿
𝜕𝐾 𝐻0
𝜕𝐿 𝐻0
=
𝜕𝐻0 𝐾
𝜕𝐻0 𝐿
𝜀𝐾𝐻0 = 𝜀𝐿𝐻0
Example
Let 𝐻 = 𝑄2 , where 𝑄 = 𝐴𝐾 𝛼 𝐿𝛽 . First show that H is homothetic. Then show that the slope of H isoquant is equal to
the slope of the Q isoquant.
For H to be homothetic, Q must be homogenous (which is) and 𝐻′ (𝑄) > 0.
𝐻′ (𝑄) =
𝑑𝐻
= 2𝑄 > 0
𝑑𝑄
Next,
2
𝐻 = 𝑄2 = (𝐴𝐾 𝛼 𝐿𝛽 ) = 𝐴2 𝐾 2𝛼 𝐿2𝛽
𝑑𝐾
𝐻𝐾
𝐴2 2𝛼𝐾 2𝛼−1 𝐿2𝛽
𝛼𝐿
=−
=− 2
=−
𝑑𝐿
𝐻𝐿
𝛽𝐾
𝐴 2𝛽𝐾 2𝛼 𝐿2𝛽−1
𝑑𝐾
𝑄𝐾
𝐴𝛼𝐾 𝛼−1 𝐿𝛽
𝛼𝐿
=−
=−
=−
𝛼
𝛽−1
𝑑𝐿
𝑄𝐿
𝛽𝐾
𝐴𝛽𝐾 𝐿
Note that,
𝑄(𝑐𝐾, 𝑐𝐿) = 𝑐 𝛼+𝛽 𝐴𝐾 𝛼 𝐿𝛽 = 𝑐 𝛼+𝛽 𝑄(𝐾, 𝐿)
𝐻(𝑐𝐾, 𝑐𝐿) = 𝐴2 (𝑐𝐾)2𝛼 (𝑐𝐿)2𝛽 = 𝑐 2(𝛼+𝛽) 𝐻
[Homogenous of degree (𝛼 + 𝛽)]
[Homogenous of degree 2(𝛼 + 𝛽)]
Example
Let 𝐻 = 𝑒 𝑄 , where 𝑄 = 𝐴𝐾 𝛼 𝐿𝛽 . First show that H is homothetic. Then show that the slope of H isoquant is equal to
the slope of the Q isoquant.
For H to be homothetic, Q must be homogenous (which is) and 𝐻′ (𝑄) > 0.
𝐻 = 𝑒 𝐴𝐾
𝐻′ (𝑄) =
𝛼𝐿𝛽
𝑑𝐻
= 𝑒𝑄 > 0
𝑑𝑄
LN 8—Constrained Optimization
Page 34 of 38
𝑑𝐾
𝐻𝐾
=−
𝑑𝐿
𝐻𝐿
𝐻𝐾 =
𝜕𝐻
𝜕
𝛼 𝛽
𝛼 𝛽
=
(𝑒 𝐴𝐾 𝐿 ) = 𝐴𝛼𝐾 𝛼−1 𝐿𝛽 𝑒 𝐴𝐾 𝐿 = 𝐴𝛼𝐾 𝛼−1 𝐿𝛽 𝐻
𝜕𝐾 𝜕𝐾
𝐻𝐿 =
𝜕𝐻
𝜕
𝛼 𝛽
𝛼 𝛽
=
(𝑒 𝐴𝐾 𝐿 ) = 𝐴𝛽𝐾 𝛼 𝐿𝛽−1 𝑒 𝐴𝐾 𝐿 = 𝐴𝛽𝐾 𝛼 𝐿𝛽−1 𝐻
𝜕𝐿 𝜕𝐿
𝑑𝐾
𝐴𝛼𝐾 𝛼−1 𝐿𝛽 𝐻
𝛼𝐿
=−
=−
𝛼
𝛽−1
𝑑𝐿
𝛽𝐾
𝐴𝛽𝐾 𝐿 𝐻
Is 𝐻 = 𝑒 𝑄 homogenous?
𝐻(𝑐𝐾, 𝑐𝐿) = 𝑒 𝐴(𝑐𝐾)
𝐻(𝑐𝐾, 𝑐𝐿) = (𝑒 𝐴𝐾
𝐻(𝐾, 𝐿) = 𝑒 𝐴𝐾
𝛼𝐿𝛽
𝛼 (𝑐𝐿)𝛽
𝛼𝐿𝛽
)
= 𝑒 𝐴𝐾
𝑐 𝛼+𝛽
𝛼 𝐿 𝛽 𝑐 𝛼+𝛽
= [𝐻(𝐾, 𝐿)]𝑐
𝛼+𝛽
is not homogeneous. It does not satisfy the condition 𝐻(𝑐𝐾, 𝑐𝐿) = 𝑐 𝑟 𝐻(𝐾, 𝐿).
5.6. Elasticity of Substitution
Elasticity of substitution pertains to the impact of changes in the factor input price ratio 𝑃𝐿 ⁄𝑃𝐾 on the optimal
combination of inputs for a given level of output. Note that optimum combination of L, and K is achieved at the point
of tangency of the isocost with the isoquant. The factor price ratio is the slope of the isocost line,
𝐾=
𝐶
𝑃𝐿
− 𝐿
𝑃𝐾 𝑃𝐾
Any change in the price ratio will change the isocost line, hence optimum combination of K, and L, that is the input
ratio 𝐾 ⁄𝐿. For example, a rise in 𝑃𝐾 will cause a substitution of L for K, thus changing the 𝐾 ⁄𝐿 ratio. The sensitivity of
the factor input ratio to changes in the factor price ratio is measured by the elasticity of (input) substitution. EOS is
computed as the proportional change in 𝐾 ⁄𝐿 relative to a proportional change in 𝑃𝐿 ⁄𝑃𝐾 .
𝑑(𝐾 ⁄𝐿)
𝑑(𝐾 ⁄𝐿) 𝑃𝐿 ⁄𝑃𝐾
𝐾 ⁄𝐿
𝜎=
=
𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 𝐾 ⁄𝐿
𝑃𝐿 ⁄𝑃𝐾
Note that when the input ratio is fixed for all output level, then σ = 0. In this case, the inputs are said to be
complements. Any change in the price ratio, will leave the input combination unchanged. But for any positive EOS
coefficient , the larger σ, the more closely substitutable are the inputs.
5.6.1. Elasticity of Substitution for Homogenous Production Functions
Once again, consider the generalized CDPF,
𝑄 = 𝐴𝐾 𝛼 𝐿𝛽
We learned that for such functions at the point tangency
LN 8—Constrained Optimization
Page 35 of 38
𝐾 𝛼𝑃𝐿
=
𝐿 𝛽𝑃𝐾
Thus,
𝑑(𝐾 ⁄𝐿)
𝛼
=
𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 𝛽
And, by rearranging,
𝑃𝐿 ⁄𝑃𝐾 𝛽
=
𝐾 ⁄𝐿
𝛼
The EOS is then,
𝜎=
𝑑(𝐾 ⁄𝐿) 𝑃𝐿 ⁄𝑃𝐾 𝛼 𝛽
=
=1
𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 𝐾 ⁄𝐿
𝛽𝛼
This is true for a homogenous production function of any degree.
5.7.
Constant Elasticity of Substitution (CES) Production Functions
A CES production, as the name implies has the EOS coefficient which is a constant value other 1. The function is as
follows,
𝑄(𝐾, 𝐿) = 𝐴[𝛿𝐾 −𝜌 + (1 − 𝛿)𝐿−𝜌 ]−1⁄𝜌
In this function,
𝐴 > 0, represents the efficiency parameter, an indicator of the state of technology.
0 < 𝛿 < 1, is the distribution parameter and like α in the CDPF indicates the relative factor shares in the product.
−1 < 𝜌 <≠ 0, is the substitution parameter, representing the (constant) elasticity of substitution.
5.7.1.
CES Production Function is HOD1
This is easy to show,
𝑄(𝑐𝐾, 𝑐𝐿) = 𝐴[𝛿(𝑐𝐾)−𝜌 + (1 − 𝛿)(𝑐𝐿)−𝜌 ]−1⁄𝜌 = 𝐴(𝑐 −𝜌 [𝛿𝐾 −𝜌 + (1 − 𝛿)𝐿−𝜌 ])−1⁄𝜌
𝑄(𝑐𝐾, 𝑐𝐿) = 𝐴[𝛿(𝑐𝐾)−𝜌 + (1 − 𝛿)(𝑐𝐿)−𝜌 ]−1⁄𝜌 = (𝑐 −𝜌 )−1⁄𝜌 𝐴[𝛿𝐾 −𝜌 + (1 − 𝛿)𝐿−𝜌 ]−1⁄𝜌 = 𝑐𝑄(𝐾, 𝐿)
5.7.2. Isoquants of the CES Production Function are Negatively Sloped and Strictly Convex
Show that,
𝑑𝐾
𝑄𝐿
=−
<0
𝑑𝐿
𝑄𝐾
To facilitate the presentation of obtaining 𝑄𝐿 and 𝑄𝐾 , let [𝛿𝐾 −𝜌 + (1 − 𝛿)𝐿−𝜌 ] ≡ [𝐵]
LN 8—Constrained Optimization
Page 36 of 38
𝑄𝐿 =
𝜕𝑄
1
= 𝐴 (− ) [𝐵]−(1⁄𝜌)−1 (1 − 𝛿)(−𝜌)𝐿−𝜌−1
𝜕𝐿
𝜌
𝑄𝐿 =
𝜕𝑄
= (1 − 𝛿)𝐴[𝐵]−(1+𝜌)⁄𝜌 𝐿−(1+𝜌)
𝜕𝐿
𝑄𝐿 =
𝜕𝑄
𝐴1+𝜌
= (1 − 𝛿) 𝜌 [𝐵]−(1+𝜌)⁄𝜌 𝐿−(1+𝜌)
𝜕𝐿
𝐴
1
− (1+𝜌)
𝜕𝑄 (1 − 𝛿) 1+𝜌 −(1+𝜌)⁄𝜌 −(1+𝜌) (1 − 𝛿) 𝐴1+𝜌 [𝐵] 𝜌
𝑄𝐿 =
=
𝐴 [𝐵]
𝐿
=
𝜕𝐿
𝐴𝜌
𝐴𝜌
𝐿1+𝜌
1+𝜌
𝜕𝑄 (1 − 𝛿) (𝐴𝐵 −1⁄𝜌 )
𝑄𝐿 =
=
𝜕𝐿
𝐴𝜌
𝐿1+𝜌
𝑄𝐿 =
𝜕𝑄 (1 − 𝛿) 𝑄 1+𝜌
=
( )
>0
𝜕𝐿
𝐴𝜌
𝐿
In the same fashion,
𝑄𝐾 =
𝜕𝑄
𝛿 𝑄 1+𝜌
= 𝜌( )
>0
𝜕𝐾 𝐴 𝐾
Thus,
(1 − 𝛿) 𝐾 1+𝜌
𝑑𝐾
𝑄𝐿
=−
=−
( )
<0
𝑑𝐿
𝑄𝐾
𝛿
𝐿
Now, what is the EOS of the CES function?
𝜎=
𝑑(𝐾 ⁄𝐿) 𝑃𝐿 ⁄𝑃𝐾
𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 𝐾 ⁄𝐿
To find 𝐾 ⁄𝐿 as a function of 𝑃𝐿 ⁄𝑃𝐾 ,
𝑄𝐿
𝑃𝐿
=
𝑄𝐾 𝑃𝐾
(1 − 𝛿) 𝐾 1+𝜌 𝑃𝐿
( )
=
𝛿
𝐿
𝑃𝐾
𝐾
𝛿 1
=(
)
𝐿
1−𝛿
⁄(1+𝜌)
𝑃𝐿 1⁄(1+𝜌)
𝑃𝐿 1⁄(1+𝜌)
( )
≡ 𝐷( )
𝑃𝐾
𝑃𝐾
To facilitate presentation, 𝐷 ≡ (
𝛿
1−𝛿
)
1⁄(1+𝜌)
.
Thus,
𝑑(𝐾 ⁄𝐿)
𝐷
𝑃𝐿 1⁄(1+𝜌)−1
=
( )
𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 1 + 𝜌 𝑃𝐾
LN 8—Constrained Optimization
Page 37 of 38
and,
𝑃𝐿 ⁄𝑃𝐾
𝑃𝐿 ⁄𝑃𝐾
1 𝑃𝐿 1−1⁄(1+𝜌)
=
=
( )
𝐾 ⁄𝐿
𝐷(𝑃𝐿 ⁄𝑃𝐾 )1⁄(1+𝜌) 𝐷 𝑃𝐾
𝜎=
𝑑(𝐾 ⁄𝐿) 𝑃𝐿 ⁄𝑃𝐾
𝐷
𝑃𝐿 1⁄(1+𝜌)−1 1 𝑃𝐿 1−1⁄(1+𝜌)
=
( )
( )
𝑑(𝑃𝐿 ⁄𝑃𝐾 ) 𝐾 ⁄𝐿
1 + 𝜌 𝑃𝐾
𝐷 𝑃𝐾
𝜎=
1
1+𝜌
This shows that the coefficient of EOS is a constant whose magnitude depends on the value of the parameter 𝜌.
−1 < 𝜌 < 0
𝜎>1
𝜌 = 0 } ⇒ {𝜎 = 1
0<𝜌<∞
𝜎<1
Note that when 𝜌 = 0, then 𝜎 = 1. This indicates that CDPF, where 𝜎 = 1, is a special case of the CES function.
Actually, the CES production function tends to the CDPF as ρ → 0.
LN 8—Constrained Optimization
Page 38 of 38
Download