Uploaded by Yuenyee Lau

01 mathreview

advertisement
Math Review for MECON6021
Hongsong Zhang
School of Economics and Finance
University of Hong Kong
September 10, 2016
Abstract
This note quickly reviews the basic mathematics which will be used many times in MECON6021,
as well as in some other economic courses in the department. Our purpose is not to give you
a mathematics course to cover all math concepts in these fields. Instead, we want to review
only the terms frequently used in the course. We also only focus on the idea of these concepts and do not concentrate on rigorous detail and proofs of the theorem. Hopefully this note
is especially useful for you if: 1) you never took any math courses covering these fields before,
or 2) you took these courses long time ago and probably are professionals working in the industry.
Keywords: Set, Function, Calculus, Optimization
1
Outline of the Math Review
1. Set
(a) Set Operations
(b) Sequence and Limit
(c) Close/Open Sets
(d) Bounded Sets
(e) Compact Sets
(f) Convex Sets
2. Function
(a) Definition of a function
(b) Shift of a function and movement along the function
(c) Monotone Function
(d) Continuous Function
(e) Concave/Convex Function
3. Calculus
(a) Calculus (single and multivariate)
i. Derivative (Single Variable Function)
ii. Higher-Order Derivative
iii. Composite Function and Chain Rule
iv. Partial Derivative (Multi-Variable Function)
v. Total Differentiation
vi. Implicit Function Theorem
(b) Define Concave/Convex Function Using Calculus
i. Function of One Variable
2
ii. Function of Two Variables
4. Optimization
(a) Unconstrained Optimization
i. Notation
ii. Different Concepts of Maximum
iii. FOC and SOC
iv. Envelope Theorem
(b) Equality Constrained Optimization
i. Notation
ii. Lagrange Condition
iii. Envelope Theorem
1
Set
This section reviews some basic concepts about set and set operation. We will use it to define the
consumption set and production set. We will also use them in the optimizing decisions of firms and
consumers.
1.1
Set Operations
Definition 1 (Set) Set A is a collection of distinct objects. (could be empty)
A element in A: x ∈ A.
Elements in a set are not ordered. so the following two sets are the same: {1,2}={2,1}.
Operations of sets: [show numerical example and figure]
1. Subset: set A is contained in B.
• A ⊂ B: set A is contained in B, and A 6= B.
• A ⊆ B: set A is contained in B, and A and B may be equal.
3
2. Union: A ∪ B. All points in either A or B.
3. Intersection: A ∩ B. All points in both A and B.
4. Complement: Ā. All points not in A.
5. A \ B: all points that are in A, but not in B.
De Morgan’s Law:
1. (A ∪ B) = A ∩ B.
The complement of the union of two sets equals the interception of their complement.
2. (A ∩ B) = A ∪ B.
The complement of the interception of two sets equals the interception of their union.
1.2
Limit of a sequence
• A sequence: x1 , x2 , x3 , ..., xn , ...
Elements in a sequence is ordered. sequence 1,2,3,4 6= sequence 4,3,2,1.
— finite sequence: x1 , x2 , x3 , ..., xn .
— infinite sequence: x1 , x2 , x3 , ..., xn , ...
• Limit: if a function f (n) approaches x0 arbitrarily when n approaches n0 , we call “f(n) has
limit x0 when n approaches n0 ”. Notation: limn→n0 |f (n) − x0 | = 0.
“f (n) approaches x0 arbitrarily” means that the difference between f (n) and x0 is infinitely
small.
In math language: “f(n) has limit x0 when n approaches n0 ” if for any > 0, we can always
find a d > 0, such that for any points satisfying |n − n0 | < d, we always have |f (n) − x0 | < .
• Convergence sequence
convergent sequence: we call a sequence convergent sequence if xn approaches a point x0
as n goes to infinite. Notation: limn→inf |xn − x0 | = 0.
limit of a sequence: the above x0 is called the limit of the sequence. Denoted: limn→inf xn =
4
x0 .
inf
Example: sequence { n1 }inf
n=1 is convergent with limit 0. sequence {n}n=1 is not convergent.
1.3
Properties of Sets
Next, we study some basic properties of a set.
¨How to represent an open ball in mathematics? Br (x0 ) = {x ∈ X : |x − x0 | < r}.
• Open Set: A set X is called an open set if: for every point x in X, there is a open ball Br (x)
such that Br (x) is contained in X.
Note: A open set does not contain its own “bound”.
• Closed set: A set is a closed set if its complement is open.
Note: A closed set contains its own “bound”.
• Bounded Set: If a set X can be contained in a “finite size open ball”, then we call X a
“bounded set”.
• Compact Set: If a set is both bounded and closed, we call it a compact set.
• Convex Sets: A set X is convex if for arbitrary x, y ∈ X, tx + (1 − t)y ∈ X for any t ∈ [0, 1].
¨ Exercise: prove that the budget constraint B = {x ∈ X : px ≤ m} is convex.
Proof. for any two points x1 , x2 ∈ B, we have px1 ≤ m and px2 ≤ m. Then for any t ∈ [0, 1],
we must have tx1 + (1 − t)x2 ≤ m. This is equivalent to say that tx1 + (1 − t)x2 ∈ B. So the
budget constraint B is convex.
2
Function
A function F maps a set X to another set Y. Denote as F : X → Y . Usually in economics we
choose Y to be the space of real number, R.
Three elements of a function: a domain (X), a range (Y), and a mapping (F).
Example: y = 2x2 (mapping F ), x ∈ X = R(domain), y ∈ Y = R+ (range).
Key requirement: unique value. For any x ∈ X, there is only one y ∈ Y , such that y = F (x).
5
2.1
Monotone Function
Increasing functions and decreasing functions are both called monotone functions.
Increasing function: when x1 < x2 , then F (x1 ) ≤ F (x2 ).
Decreasing function: when x1 < x2 , then F (x1 ) ≥ F (x2 ).
Strictly monotone function: when the above inequality is strict.
Examples:
1. y = 2x is monotone on R. (figure)
2. y = x2 is NOT monotone on R. But it is monotone on R+ . (figure)
2.2
Continuous Function
Function F (x) is continuous at a point x0 : if you can always reach F (x0 ) when you approaches x0
from both left and right.
Notation: limx→x0 + F (x) = limx→x0 − F (x) = F (x0 ).
The function F (x) : X → Y is called a continuous function when it is continuous at all points in
X.
2.3
Concave/Convex Function
Concave function: a function (of one variable) is concave if F [tx + (1 − t)y] ≥ tF (x) + (1 − t)F (y),
for all x,y and t ∈ [0, 1].
Convex function: a function (of one variable) is convex if F [tx + (1 − t)y] ≤ tF (x) + (1 − t)F (y),
for all x,y and t ∈ [0, 1].
Strictly concave (convex) when the inequality is strict when x 6= y and t ∈ (0, 1).
Note that if F(x) is convex, then -F(x) must be concave. Vice versa.
Concave functions and convex functions are important to determine whether there is an interior
maximum/minimum.
6
3
Calculus
In this course, we are going to use calculus a lot. Economic analysis in some sense is based on
“marginal effect analysis”, which is usually captured by derivative of a particular function (e.g.
marginal utility, marginal cost, marginal revenue, etc.). You may find this short note about basics
in calculus useful throughout the course.
3.1
Definition
We start with the simple case, a single variable function y = F (x), where both x, y ∈ R.
Definition 2 (Function F(x) is differentiable at point x0 ) A function y = F (x) is differen(x0 )
tiable at point x0 if and only if limx→x0 F (x)−F
converges. We call F 0 (x0 ) the derivative of F(x)
x−x0
at point x0 :
dy
F (x) − F (x0 )
= F 0 (x0 ) = limx→x0
.
dx
x − x0
(1)
Example 1 Suppose F (x) = x2 . Calculate its derivative at point x0 = 2.
Use the definition of derivative in equation 1, we have
F (x) − F (2)
x−2
x2 − 22
= limx→2
x−2
(x + 2)(x − 2)
= limx→2
x−2
F 0 (x0 = 2) = limx→2
= limx→2 (x + 2)
= 4
Remarks:
1. If a function F (x) is differentiable at all points in its domain, we call F (x) a differentiable
function.
2. A function is differentiable intuitively means that it is “smooth” in some sense.
3. Not all functions are differentiable, however. Examples.
7
4. A function may be differentiable at some points, but not others. Examples.
3.2
Derivatives: Some Useful Formula
While it is useful to use the definition of derivative to verify the differentiability of a function at
a point, it is time-consuming to compute derivatives using the definition. For particular forms of
functions, it is easier to remember the formula to compute derivatives. In this subsection, I list
formula to compute the derivative for those functions widely used in this course. The following
table summarizes the formula to compute the derivatives for popular functions.
Function
Derivative
F(x)=c
F’(x)=0
F(x)=cx
F’(x)=c
F(x)=cx2
F’(x)=2cx
F(x)=cxn
F’(x)=ncxn−1
F(x)=exp(cx)
F 0 (x) = c exp(cx)
F(x)=ln(x)
F’(x)= x1
Example 2 For the example discussed above F (x) = x2 . Compute its derivative at point x0 = 2.
Use the derivative formula for polynomial functions in the above table, we have
F 0 (x0 = 2) = 2x|x=2
= 4
3.3
Chain Rule
Chain rule is used to compute derivatives of composite functions.
Consider the composite function: y = F (z), where z = G(x). Then the derivative:
dy dz
dy
=
= F 0 (z)G0 (x)
dx
dz dx
Exercise: y = 3z 2 , z = exp(2x). Compute the derivative
8
dy
dx .
3.4
Higher-Order Derivative
Definition 3 (Second Order Derivative) A function y = F (x) is twice differentiable at point
x0 . Denote F 00 (x) or
d2 y
dx2
as the second-order derivative of F(x) at x0 . Then,
F 00 (x) =
dy
d( dx
)
d2 y
=
.
2
dx
dx
(2)
Using the above definition, you can easily prove the following way to compute second-order derivatives.
Proposition 1 (Second Order Derivative) A function y = F (x) is twice differentiable at point
x0 , and F 0 (x) is the first-order derivative of F(x) at point x. Then the second-order derivative of
F(x) at x0 is
F 00 (x0 ) = limx→x0
F 0 (x) − F 0 (x0 )
.
x − x0
(3)
Remark 1 (Intuitive meaning of first-order and second-order derivatives) The first order derivative defines whether a function is increasing or decreasing. The second-order derivative
determines whether a function is concave or convex. We will discuss the latter in more detail.
3.5
Partial Derivative
A multivariate function: y = F (x, z). Both x and z are variables that affect y.
We ask the question: given z, how does the change of x affect y?
This question is characterized by the so-called partial derivative.
Definition 4 (Partial Derivative) A function y = F (x, z). The partial derivative of y w.r.t x
at x0 , denoted as
∂y
∂x ,
is defined as
∂F (x, z)
F (x, z) − F (x0 , z)
∂y
=
= limx→x0
∂x
∂x
x − x0
Remark 2 The definition of partial derivative looks complicated, right? Do not be scared by it.
In practice when computing the partial derivative, you just need to treat all other variables (z in
9
the above case) as ”constant”. Then you can use exactly the same rule of ordinary derivative to
compute partial derivative.
3.6
Total Differentiation
The economic question we ask here is: what are the factors causes the observed change of result?
This question can be answered by total differentiation.
Consider the function y = F (x, z). We observed that y changed by an amount
a
y. How does this
change links to the change of x and z?
Definition 5 (Total differentiation) For a multivariate function y = F (x, z), the total differentiation, dy, is defined as
dy =
3.7
∂F (x, z)
∂F (x, z)
dx +
dz
∂x
∂z
(4)
Derivative of Implicit Functions
Suppose we know a implicit function F(x,y)=0. Given that the equation F(x,y)=0 is always satisfied, how does a change of x affect y?
We can apply the concept of total differentiation to derive it.
Proposition 2 (Derivative of Implicit Functions) Given an implicit function F(x,y)=0, then
∂F (x,y)
dy
= − ∂F∂x
.
(x,y)
dx
(5)
∂y
Proof. For the implicit function 0=F(x,y), apply the total differentiation, we have
d0 =
∂F (x, y)
∂F (x, y)
dx +
dy
∂x
∂y
As d0=0, we can easily derive equation 5.
Exercise: F (x, y) = x3 y 2 + exp(xy 2 ) = 0. Compute the derivative
10
dy
dx
at point (x,y)=(1,2).
3.8
Increasing and Decreasing Functions
Proposition 3 If function F(x) is differentiable at point x, then
(1) F(x) is increasing at point x, iff F 0 (x) ≥ 0;
(2)F(x) is decreasing at point x, iff F 0 (x) ≤ 0.
In application, we use this condition more than the definition to check the monotonicity of a
function.
3.9
Concave and Convex Functions and Second order derivative
Proposition 4 If function F(x) is differentiable up to second order at point x, then
(1) F(x) is concave at point x, iff F 00 (x) ≤ 0;
(2)F(x) is convex at point x, iff F 00 (x) ≥ 0.
(3) F(x) is globally concave if F 00 (x) ≤ 0 for all x; F(x) is globally convex if F 00 (x) ≥ 0 for all x.
4
Optimization
Armed with the concepts of sets, functions, and calculus, we can now talk about optimization –
maximizing/minimizing a objective function. We will start with the simpler case without constraint
and continued with the more popular case with constraint (e.g. budget constraint in consumers’
optimization problem).
4.1
Unconstrained Optimization
Suppose we have a (single-variable) function F(x). For example, you can think about F(x) as profit
of a firm, utility of a consumer, or your own satisfaction about your life. You want to maximize
this objective function. Does it have a maximum (minimum)? How can we find it?
4.1.1
The Problem
The problem is written as
maxx∈X F (x)
11
(6)
There are three elements in this so-called maximization problem: an objective function (F(x)), a
area (X) on which you do the maximization, and the choice variable (x). Changing any of them
may change your maximization problem. We call the maximized value of the objective function
”maximum” or ”value function”, V.
V = maxx∈X F (x)
4.1.2
(7)
Different Concepts of Maximum
There are different concepts of maximum: local maximum, global maximum; unique maximum,
multiple maximum. I illustrate the idea in the following figure 1.
Figure 1: Different types of maximum
• Interior solution VS corner solution
4.1.3
FOC and SOC
The maximum/minimum point is highly related to the first-order and second-order derivatives of
the objective function. The conditions based on first- and second- order derivatives determines
whether there is a maximum or minimum. We formally call these conditions first-order conditions
(FOC) and second-order conditions (FOC). In particular, FOC usually determines whether it is a
(interior) optimal point (could be maximum, minimum, or neither). SOC determines whether the
optimal point is a maximum, minimum or sometimes neither.
12
First Order Condition
The first order condition for the above maximization problem as defined in equation 7 is
F 0 (x) = 0
(8)
The idea for FOC is that when you move around the point at which F’(x)=0, the change of the
objective function F(x) is zero.
By solving this equation F’(x)=0, we may get no solution, 1 solution, or many solutions. This
depends on the shape of the function F(x).
In the ideal case, suppose we get only one solution, denoted as x∗ . Then, as discussed above, x∗
may be a minimum, maximum, or neither.
Exercise: Consider the maximization problem maxx∈R F (x) = x2 − 4x + 4. Write out the first
order condition and find the optimal solution x∗ .
Second Order Condition
The second order condition provides a justification about whether x∗ is a minimum, maximum, or
neither. A sufficient condition to ensure that x∗ is a maximum point is that
F 00 (x∗ ) < 0
or
F 00 (x∗ ) = 0, F 0 (x∗ −) > 0, andF 0 (x∗ +) < 0
(9)
This is called the second order condition (FOC) for maximization.
Exercise:
1. In the exercise above, write out the second order condition and check whether the x∗ you get is
a maximum.
2. Consider the maximization problem: maxx∈R F (x) = x3 + 4x2 + 4x + 1. (1) Write out the first
order condition, solve for all x∗ . (2) Write out the second order condition, and check whether the
13
solution(s) you got in the first step is a maximum. Find the maximum point, if any.
4.1.4
Envelope Theorem
The maximization problem tries to choose a x (called choice variable) to maximize the objective
function F(x). Usually the objective function is also a function of other exogenous variables, e.g.
F(x,a). The variable a is given when you are choosing x to maximize the objective function. In
this case, the value function, V, is a function of the exogenous variable a.
V (a) = maxx∈X F (x, a)
We want to ask the question: how does a change of the exogenous variable a affect the value
function V(a)? Envelope theorem gives you the answer.
Proposition 5 (Envelope theorem) Suppose V (a) = maxx∈X F (x, a) and x∗ (a) is the maximizer, then
V 0 (a) =
∂F (x, a)
|x=x∗ (a)
∂a
(10)
Exercise (revised): Suppose a firm’s cost function, C(p, a) = ap2 − 4a2 p + 8a2 , depends on
its wage rate p and material price 1 > a > 0. The firm can choose p, but the material price
is exogenously given. (1) Write out the first order condition and solve out the solution to FOC,
denoted as p∗ . (2) Write out SOC, and check whether p∗ maximizes or minimizes the cost function.
(3) Denote V(a) as the maximized profit (if any), use the envelope them to find out the marginal
effect of material price on profit. (4) Find V(a) explicitly, compute
dV (a)
dA .
Is the result the same as
what you get in (3)?
4.2
4.2.1
Equality Constrained Optimization
The Problem
In many cases, the optimization/maximization problem is subject to some constraint. For example,
when you decide how much to purchase to maximize your “satisfaction”, you are constrained by
14
your budget constraint. That is, your maximization problem is in the following form:
maxx U (x)
s.t.px ≤ m
To simplify the analysis, we assume that you “always use up your money” to maximize your
satisfaction. So we have equality constraint. The equality-constrained maximization problem is
written as
V (p, m) = maxx U (x)
s.t.
px = m
This subsection reviews how to find the maximizer of this problem.
4.2.2
Lagrangian Function
We can utilize the Lagrangian condition to solve the above problem in 11.
First define the Lagrangian function as
L = U (x1 , x2 ) + λ(m − p1 x1 − p2 x2 )
Then the associated FOC becomes
∂L
=0
∂x1
∂L
=0
∂x2
∂L
=0
∂λ
15
(11)
Or equivalently (please verify this),
∂U
∂x1
∂U
∂x2
= λp1
(12)
= λp2
(13)
px = m
(14)
There are three equations and three unknowns, x1 , x2 , and λ, in this equation system. In principle
we can solve out the solution: x∗1 (p, m), x∗2 (p, m), and λ∗ (p, m).
We are not sure whether the solution maximizes the objective function. In principle we need the
second order condition to verify this. We temporarily do not talk about the second order condition
here, as it involves some linear algebra. We will talk about this point when necessary.
In practice, under some conditions we can make sure that the solution from FOC is the maximizer
of the problem. I write a very popularly used case in the following proposition.
Proposition 6 If function U (x1 , x2 ) is strictly concave in (x1 , x2 ) and p and m are positive, then
the solution to FOC maximizes the objective function.
In this course, we usually assume that the utility function and production function are concave.
So this proposition can be used to check whether the solution to FOC is enough to guarantee a
maximum.
4.2.3
Envelope Theorem
Similarly, we can consider the effect of a change of exogenous variables (p,m) on the objective
function, in the case of constrained optimization.
Proposition 7 Suppose the Lagrangian function for a maximization problem can be written as
L(x, a) and x∗ (a) and V (a) are the associated maximizer and value function. Then we have
∂L(x, a)
∂V (a)
=
|x=x∗ (a)
∂a
∂a
Example 3 (Envelope theorem in utility maximization)
16
(15)
In the maximization problem defined in equation 11, the envelope theorem says that
∂L(x, p, m)
∂V (p, m)
=
|x=x∗ (p,m) = −λ∗ (p, m)x∗ (p, m)
∂pi
∂pi
∂V (p, m)
∂L(x, p, m)
=
|x=x∗ (p,m) = λ∗ (p, m)
∂m
∂m
We will use these two equations many times in consumer theory.
17
(16)
(17)
Download