Uploaded by pjdup33

ia2017lm

advertisement
Contents
1
The Real Number System
1.1 Definition and Properties of Real Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2 Bounded Sets, Suprema and Infima . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.3 Natural Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1
6
10
2
Sequences
2.1 Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.2 Bounded and Monotonic Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
14
14
19
3
Limits and Continuity of Functions
3.1 Limits of Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2 Limits at Infinity and Infinite Limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.3 Limit Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.4 Continuity of Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.5 The Intermediate Value Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
24
24
26
29
32
35
4
Differentiation
4.1 Revision – Self Study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4.2 The Chain Rule and Inverse Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
39
39
42
5
Series
5.1 Definitions and Basic Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.2 Convergence Tests for Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.3 Power Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
45
45
48
52
6
Riemann Integration
6.1 Suprema and Infima (self study) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.2 Riemann Sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.3 Refinements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.4 The Riemann Integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.5 Properties of the Integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
54
54
55
57
59
62
7
Introduction to Metric Spaces
7.1 Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.2 Complete Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.3 Banach’s Fixed Point Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.4 Existence of solutions of differential equations . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.5 Power Series and the Exponential Function in 𝑀𝑛 (𝔽 ) . . . . . . . . . . . . . . . . . . . . . . . .
7.6 Solutions of Linear Differential Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
69
69
71
74
77
81
85
i
ii
Introductory Analysis 2017 Lecture Manual
MATH2022
Chapter 1
The Real Number System
1.1
Definition and Properties of Real Numbers
We will give an axiomatic definition of the real numbers. We will define the set of real numbers, denoted by ℝ, as
a set with two operations + and ⋅, called addition and multiplication, as well as an ordering <, which satisfy the
laws of addition (A), the laws of multiplication (M), the distributive law (D), the order laws (O) and the Dedekind
Completeness Axiom (C). These laws and axioms will be given and discussed below.
Additional Note. An alternative way to define the real numbers is to use Peano’s axiom to define the natural
numbers, and then construct the integers and rational numbers in turn. Then the axioms (A), (M), (D) and (O)
become properties of rational numbers, so that we call them laws.
Addition and multiplication are maps which assign to every two elements in π‘Ž, 𝑏 ∈ ℝ an element in ℝ which is
denoted by π‘Ž+𝑏 and π‘Ž⋅𝑏 (in general, written π‘Žπ‘), respectively. We require that these operations satisfy the following
axioms.
A. Axioms of addition
(A1)
Associative Law: π‘Ž + (𝑏 + 𝑐) = (π‘Ž + 𝑏) + 𝑐 for all π‘Ž, 𝑏, 𝑐 ∈ ℝ.
(A2)
Commutative Law: π‘Ž + 𝑏 = 𝑏 + π‘Ž for all π‘Ž, 𝑏 ∈ ℝ.
(A3)
Zero: There is a real number 0 such that π‘Ž + 0 = π‘Ž for all π‘Ž ∈ ℝ.
(A4)
Additive inverse: For each π‘Ž ∈ ℝ there is −π‘Ž ∈ ℝ such that π‘Ž + (−π‘Ž) = 0.
Notation. For π‘Ž, 𝑏 ∈ ℝ one writes π‘Ž − 𝑏 ∢= π‘Ž + (−𝑏).
Additional Notes. 1. Any set with the operation + satisfying (A1), (A3), (A4) is called a group. If also (A2) is
satisfied, the group is called an Abelian (or commutative) group.
2. You will encounter detailed discussions of (Abelian) groups in algebra courses.
3. The additive inverse of a number is also called the negation of a number.
Theorem 1.1 (Basic group properties).
(a) The number 0 is unique.
(b) For all π‘Ž ∈ ℝ, the number −π‘Ž is unique.
(c) For all π‘Ž, 𝑏 ∈ ℝ, the equation π‘Ž + π‘₯ = 𝑏 has a unique solution. This solution is π‘₯ = 𝑏 − π‘Ž.
(d) ∀π‘Ž ∈ ℝ, −(−π‘Ž) = π‘Ž.
(e) ∀π‘Ž, 𝑏 ∈ ℝ, −(π‘Ž + 𝑏) = −π‘Ž − 𝑏.
(f) −0 = 0.
Proof. (a) Let 0, 0′ ∈ ℝ such that π‘Ž + 0 = π‘Ž and π‘Ž + 0′ = π‘Ž for all π‘Ž ∈ ℝ. We must show that 0 = 0′ :
0 = 0 + 0′
= 0′ + 0
by (A2)
= 0′
1
2
Introductory Analysis 2017 Lecture Manual
(b) Let π‘Ž ∈ ℝ and π‘Ž′ , π‘Ž′′ ∈ ℝ such that π‘Ž + π‘Ž′ = 0 and π‘Ž + π‘Ž′′ = 0. We must show that π‘Ž′ = π‘Ž′′ :
π‘Ž′ = π‘Ž′ + 0
by (A3)
′
′′
= π‘Ž + (π‘Ž + π‘Ž )
= (π‘Ž′ + π‘Ž) + π‘Ž′′
by (A1)
′
by (A2)
′′
= (π‘Ž + π‘Ž ) + π‘Ž
′′
=0+π‘Ž
= π‘Ž′′ + 0
by (A2)
= π‘Ž′′
by (A3)
(c) First we show that π‘₯ = 𝑏 − π‘Ž is a solution. So let π‘₯ = 𝑏 − π‘Ž.
Then
π‘Ž + π‘₯ = π‘Ž + (𝑏 − π‘Ž) = π‘Ž + (𝑏 + (−π‘Ž))
= π‘Ž + ((−π‘Ž) + 𝑏)
by (A2)
= (π‘Ž + (−π‘Ž)) + 𝑏
by (A1)
=0+𝑏
by (A4)
=𝑏+0
by (A2)
=𝑏
by (A3)
To show that the solution is unique let π‘₯ ∈ ℝ such that π‘Ž + π‘₯ = 𝑏.
Then
π‘₯ = π‘₯ + 0 = π‘₯ + (π‘Ž + (−π‘Ž))
by (A3), (A4)
= (π‘₯ + π‘Ž) + (−π‘Ž)
by (A1)
= (π‘Ž + π‘₯) + (−π‘Ž)
by (A2)
=𝑏−π‘Ž
βˆ΅π‘Ž+π‘₯=𝑏
This shows that the solution is unique.
(d) Note that
−π‘Ž + (−(−π‘Ž)) = 0 by (A4).
On the other hand
−π‘Ž + π‘Ž = π‘Ž + (−π‘Ž) = 0 by (A2), (A4).
By part (b), it follows that
−(−π‘Ž) = π‘Ž.
Alternatively:
−(−π‘Ž) = −(−π‘Ž) + 0 = −(−π‘Ž) + (π‘Ž + (−π‘Ž))
by (A3), (A4)
= (π‘Ž + (−π‘Ž)) + (−(−π‘Ž))
by (A2)
= π‘Ž + ((−π‘Ž) + (−(−π‘Ž)))
by (A1)
=π‘Ž+0
=π‘Ž
by (A4)
by (A3)
(e)
(π‘Ž + 𝑏) + (−π‘Ž − 𝑏) = (π‘Ž + 𝑏) + ((−π‘Ž) + (−𝑏))
= ((𝑏 + π‘Ž) + (−π‘Ž)) + (−𝑏)
by (A1), (A2)
= (𝑏 + (π‘Ž + (−π‘Ž))) + (−𝑏)
by (A1)
= (𝑏 + 0) + (−𝑏)
by (A4)
= 𝑏 + (−𝑏)
by (A3)
=0
by (A4)
By part (b), −(π‘Ž + 𝑏) = −π‘Ž − 𝑏.
(f) 0 + 0 = 0 by (A3), so that −0 = 0 by (A4) and (b).
MATH2022
MATH2022
Introductory Analysis 2017 Lecture Manual
3
M. Axioms of multiplication
(M1)
Associative Law: π‘Ž(𝑏𝑐) = (π‘Žπ‘)𝑐 for all π‘Ž, 𝑏, 𝑐 ∈ ℝ.
(M2)
Commutative Law: π‘Žπ‘ = π‘π‘Ž for all π‘Ž, 𝑏 ∈ ℝ.
(M3)
One: There is a real number 1 such that 1 ≠ 0 and π‘Ž ⋅ 1 = π‘Ž for all π‘Ž ∈ ℝ.
(M4)
Multiplicative inverse: For each π‘Ž ∈ ℝ with π‘Ž ≠ 0 there is π‘Ž−1 ∈ ℝ such that π‘Žπ‘Ž−1 = 1.
D. The distributive law axiom
(D)
Distributive Law: π‘Ž(𝑏 + 𝑐) = π‘Žπ‘ + π‘Žπ‘ for all π‘Ž, 𝑏, 𝑐 ∈ ℝ.
Additional Notes. 1. Any set with the operations +, ⋅ satisfying the axioms (A1)–(A4), (M1)–(M4) and (D) is
called a field.
2. You will encounter detailed discussions of fields in algebra courses.
3. The multiplicative inverse of a number is also briefly called the inverse of a number.
4. The set of nonzero real numbers, ℝ ⧡ {0}, is an Abelian group with respect to multiplication.
Theorem 1.2 (Basic field properties: Distributive laws).
(a) ∀π‘Ž, 𝑏, 𝑐 ∈ ℝ, (π‘Ž + 𝑏)𝑐 = π‘Žπ‘ + 𝑏𝑐.
(b) ∀π‘Ž ∈ ℝ, π‘Ž ⋅ 0 = 0.
(c) ∀π‘Ž, 𝑏 ∈ ℝ, π‘Žπ‘ = 0 ⇔ π‘Ž = 0 or 𝑏 = 0.
(d) ∀π‘Ž, 𝑏 ∈ ℝ, (−π‘Ž)𝑏 = −(π‘Žπ‘).
(e) ∀π‘Ž ∈ ℝ, (−1)π‘Ž = −π‘Ž.
(f) ∀π‘Ž, 𝑏 ∈ ℝ, (−π‘Ž)(−𝑏) = π‘Žπ‘.
Proof. (a) By (M2) and (D),
(π‘Ž + 𝑏)𝑐 = 𝑐(π‘Ž + 𝑏) = π‘π‘Ž + 𝑐𝑏 = π‘Žπ‘ + 𝑏𝑐.
(b) By (A3) and (D),
π‘Ž ⋅ 0 = π‘Ž(0 + 0) = π‘Ž ⋅ 0 + π‘Ž ⋅ 0,
and by (A3), (A2), π‘Ž ⋅ 0 = 0 + π‘Ž ⋅ 0. Then Theorem 1.1,(c) gives π‘Ž ⋅ 0 = 0.
(c) If 𝑏 = 0, then π‘Žπ‘ = π‘Ž ⋅ 0 = 0 by (b).
If π‘Ž = 0, then π‘Žπ‘ = π‘π‘Ž = 𝑏 ⋅ 0 = 0 by (M2) and (b).
Now assume that π‘Žπ‘ = 0. If 𝑏 = 0, the property “π‘Ž = 0 or 𝑏 = 0” follows. So now assume 𝑏 ≠ 0. Then
π‘Ž = π‘Ž ⋅ 1 = π‘Ž(𝑏𝑏−1 ) = (π‘Žπ‘)𝑏−1 = 0 ⋅ 𝑏−1 = 0
by (M3), (M4), (M1), (M2) and (b).
(d) Using field laws, we get
π‘Žπ‘ + (−π‘Ž)𝑏 = (π‘Ž + (−π‘Ž))𝑏 = 0 ⋅ 𝑏 = 0,
and from Theorem 1.1(b), (−π‘Ž)𝑏 = −π‘Žπ‘.
(e) is a special case of (d).
(f) From (d) and other laws and rules (which one’s?) we find
(−π‘Ž)(−𝑏) = −[π‘Ž(−𝑏)] = −[(−𝑏)π‘Ž] = −[−π‘π‘Ž] = π‘π‘Ž = π‘Žπ‘.
Theorem 1.3 (Basic field properties: multiplication).
(a) The number 1 is unique.
(b) For all π‘Ž ∈ ℝ with π‘Ž ≠ 0, the number π‘Ž−1 is unique.
(c) For all π‘Ž, 𝑏 ∈ ℝ with π‘Ž ≠ 0, the equation π‘Žπ‘₯ = 𝑏 has a unique solution. This solution is π‘₯ = π‘Ž−1 𝑏.
(d) ∀π‘Ž ∈ ℝ ⧡ {0}, (π‘Ž−1 )−1 = π‘Ž.
(e) ∀π‘Ž, 𝑏 ∈ ℝ ⧡ {0}, (π‘Žπ‘)−1 = π‘Ž−1 𝑏−1 .
(f) ∀π‘Ž ∈ ℝ ⧡ {0}, (−π‘Ž)−1 = −π‘Ž−1 .
(g) 1−1 = 1.
4
Introductory Analysis 2017 Lecture Manual
MATH2022
Proof. See tutorials. The proofs are similar to those of Theorem 1.1.
𝑏
1
Notation. π‘Ž−1 𝑏 is also written as , In particular, π‘Ž−1 = .
π‘Ž
π‘Ž
Tutorial 1.1.1. 1. Prove Theorem 1.3.
Next we give the axioms for the set of positive real numbers. It is convenient to use the notation π‘Ž > 0 for positive
numbers π‘Ž.
O. The order axioms
(O1) Trichotomy: For each π‘Ž ∈ ℝ, exactly one of the following statements is true:
π‘Ž > 0 or π‘Ž = 0 or −π‘Ž > 0.
(O2)
If π‘Ž > 0 and 𝑏 > 0, then π‘Ž + 𝑏 > 0.
(O3)
If π‘Ž > 0 and 𝑏 > 0, then π‘Žπ‘ > 0.
The definition of positivity of real numbers gives rise to an order relation for real numbers:
Definition. Let π‘Ž, 𝑏 ∈ ℝ. Then π‘Ž is called larger than 𝑏, written π‘Ž > 𝑏, if π‘Ž − 𝑏 > 0.
Notes. 1. Since π‘Ž − 0 = π‘Ž, the notation π‘Ž > 0 is consistent.
2. It is convenient to introduce the following notations:
π‘Ž ≥ 𝑏 ⇔ π‘Ž > 𝑏 or π‘Ž = 𝑏,
π‘Ž < 𝑏 ⇔ 𝑏 > π‘Ž,
π‘Ž ≤ 𝑏 ⇔ π‘Ž < 𝑏 or π‘Ž = 𝑏.
3. We will define general powers later. Below we use the notation π‘Ž2 = π‘Ž ⋅ π‘Ž.
Theorem 1.4 (Basic order properties). Let π‘Ž, 𝑏, 𝑐, 𝑑 ∈ ℝ. Then
(a) π‘Ž < 0 ⇔ −π‘Ž > 0.
(b) π‘Ž < 𝑏 and 𝑏 < 𝑐 ⇒ π‘Ž < 𝑐.
(c) π‘Ž < 𝑏 ⇒ π‘Ž + 𝑐 < 𝑏 + 𝑐.
(d) π‘Ž < 𝑏 and 𝑐 < 𝑑 ⇒ π‘Ž + 𝑐 < 𝑏 + 𝑑.
(e) π‘Ž < 𝑏 and 𝑐 > 0 ⇒ π‘π‘Ž < 𝑐𝑏.
(f) 0 ≤ π‘Ž < 𝑏 and 0 ≤ 𝑐 < 𝑑 ⇒ π‘Žπ‘ < 𝑏𝑑.
(g) π‘Ž < 𝑏 and 𝑐 < 0 ⇒ π‘π‘Ž > 𝑐𝑏.
(h) π‘Ž ≠ 0 ⇒ π‘Ž2 > 0.
(i) π‘Ž > 0 ⇒ π‘Ž−1 > 0 and π‘Ž < 0 ⇒ π‘Ž−1 < 0.
(j) 0 < π‘Ž < 𝑏 ⇒ 𝑏−1 < π‘Ž−1 .
(k) 1 > 0.
Proof. (a)
π‘Ž<0⇔0>π‘Ž
by definition of <
⇔0−π‘Ž>0
by definition of 0 > π‘Ž
⇔ −π‘Ž > 0
∡ 0 − π‘Ž = −π‘Ž + 0 = −π‘Ž
(b)
π‘Ž < 𝑏 and 𝑏 < 𝑐 ⇒ 𝑏 − π‘Ž > 0 and 𝑐 − 𝑏 > 0
by definition
⇒ (𝑏 − π‘Ž) + (𝑐 − 𝑏) > 0
by (O2)
⇒𝑐−π‘Ž>0
⇒π‘Ž<𝑐
by (A1)-(A4)
by definition
MATH2022
Introductory Analysis 2017 Lecture Manual
5
(h) Since π‘Ž ≠ 0, either π‘Ž > 0 or π‘Ž < 0.
If π‘Ž > 0, then π‘Ž2 = π‘Žπ‘Ž > 0 by (O3).
If π‘Ž < 0, then −π‘Ž > 0 by (a) and π‘Ž2 = (−π‘Ž)(−π‘Ž) by Theorem 1.2, (f).
Hence π‘Ž2 = (−π‘Ž)(−π‘Ž) > 0 by (O3).
(i) Since π‘Ž ≠ 0, π‘Ž−1 exists with π‘Žπ‘Ž−1 = 1 by (M4). Then π‘Ž−1 ≠ 0 by Theorem 1.2, (c). Hence (π‘Ž−1 )2 > 0 by (h).
Thus, if π‘Ž > 0,
π‘Ž−1 = π‘Ž(π‘Ž−1 )2 > 0 by (O3).
Similarly, use (g) if π‘Ž < 0.
(j) By (i), π‘Ž−1 > 0 and 𝑏−1 > 0. Hence π‘Ž−1 𝑏−1 > 0 by (O3). Then
𝑏−1 = π‘Ž(π‘Ž−1 𝑏−1 ) < 𝑏(π‘Ž−1 𝑏−1 )
−1
−1
= (𝑏𝑏 )π‘Ž
by (e)
−1
=π‘Ž .
(k) 1 = 1 ⋅ 1 = 12 > 0 by (h). (c)–(g): See tutorials.
Note. There is still one axiom missing, the axiom of Dedekind completeness. However, we will postpone the
formulation of this axiom to the next section since we need some further definitions.
Tutorial 1.1.2. 1. Prove Theorem 1.4, (c)–(g).
2. The absolute value function. Define the following function on ℝ:
|π‘₯| =
{
π‘₯
if π‘₯ ≥ 0,
−π‘₯
if π‘₯ < 0.
Prove the following statements for π‘₯, 𝑦 ∈ ℝ:
(a) |π‘₯| ≥ 0,
(b) |π‘₯𝑦| = |π‘₯| |𝑦|,
(c) |𝑦| < π‘₯ ⇔ −π‘₯ < 𝑦 < π‘₯,
(d) |π‘₯ + 𝑦| ≤ |π‘₯| + |𝑦|.
3. Let π‘₯, 𝑦, 𝑧 ∈ ℝ. Which of the following statements are true and which are false?
(a) π‘₯ ≤ 𝑦 ⇒ π‘₯𝑧 ≤ 𝑦𝑧,
(b) 0 < π‘₯ ≤ 𝑦 ⇒ 1𝑦 ≤ π‘₯1 ,
(c) π‘₯ < 𝑦 < 0 ⇒ 1𝑦 < π‘₯1 ,
(d) π‘₯2 < 1 ⇒ π‘₯ < 1,
(e) π‘₯2 < 1 ⇒ −1 < π‘₯ < 1,
(f) π‘₯2 > 1 ⇒ π‘₯ > 1.
4. In each of the following questions fill in the
π‘Ž
(a) π‘Ž ≥ 3 ⇒ π‘Ž−2
,
7
7
3
(b) π‘Ž ≥ 1 ⇒ π‘Ž+1
3
,
π‘Ž
(c) π‘Ž > 1 ⇒ π‘Ž9
10
,
π‘Ž−1
(d) π‘Ž > 1 ⇒ π‘Ž12
1
,
π‘Ž
(e) π‘Ž ≥ 2 ⇒ π‘Ž21−1
1
,
π‘Ž
(f) π‘Ž > 3 ⇒ −3
π‘Ž
−2
.
π‘Ž−1
with < or >.
5. Let π‘₯ ≥ 0 and 𝑦 ≥ 0. Show that π‘₯ < 𝑦 ⇔ π‘₯2 < 𝑦2 .
6
1.2
Introductory Analysis 2017 Lecture Manual
MATH2022
Bounded Sets, Suprema and Infima
Definition 1.1. Let 𝑆 be a nonempty subset of ℝ. Then
1. If there is a number 𝐴 ∈ ℝ such that π‘₯ ≤ 𝐴 for all π‘₯ ∈ 𝑆, then 𝐴 is said to be an upper bound of 𝑆, and 𝑆 is
said to be bounded above.
2. If there is a number π‘Ž ∈ ℝ such that π‘₯ ≥ π‘Ž for all π‘₯ ∈ 𝑆, then π‘Ž is said to be a lower bound of 𝑆, and 𝑆 is said
to be bounded below.
3. If 𝑆 is bounded above and bounded below, then 𝑆 is called bounded.
Note. Recall the definition of intervals. [Note for lecturers: We do not need to write this down. Students have seen
this in first year. Just refer students to the notes in case they have forgotten the definitions.] Let π‘Ž, 𝑏. Then
(π‘Ž, 𝑏) = {π‘₯ ∈ ℝ ∢ π‘Ž < π‘₯ < 𝑏},
[π‘Ž, 𝑏] = {π‘₯ ∈ ℝ ∢ π‘Ž ≤ π‘₯ ≤ 𝑏},
(π‘Ž, 𝑏] = {π‘₯ ∈ ℝ ∢ π‘Ž < π‘₯ ≤ 𝑏},
[π‘Ž, 𝑏) = {π‘₯ ∈ ℝ ∢ π‘Ž ≤ π‘₯ < 𝑏},
(−∞, 𝑏) = {π‘₯ ∈ ℝ ∢ π‘₯ < 𝑏},
(−∞, 𝑏] = {π‘₯ ∈ ℝ ∢ π‘₯ ≤ 𝑏},
(π‘Ž, ∞) = {π‘₯ ∈ ℝ ∢ π‘Ž < π‘₯},
[π‘Ž, ∞) = {π‘₯ ∈ ℝ ∢ π‘Ž ≤ π‘₯}.
Example 1.2.1. Let 𝑆 = (−∞, 1). Does 𝑆 have upper and lower bounds?
Solution. Since π‘₯ < 1 for all π‘₯ ∈ 𝑆, 1 is an upper bound of 𝑆, and 𝑆 is therefore bounded above. But also 2 and
7.3 are upper bounds of 𝑆, for example.
Assume that 𝑆 has a lower bound π‘š. Then π‘š ≤ 0 since 0 ∈ 𝑆, and π‘š − 1 < π‘š ≤ 0 < 1 shows that π‘š − 1 ∈ 𝑆. So
π‘š ≤ π‘š − 1 since π‘š is a lower bound of 𝑆. But this is false, so that π‘š cannot be a lower bound of 𝑆. Therefore 𝑆
has no lower bound, and 𝑆 is therefore not bounded below.
Definition 1.2. Let 𝑆 be a nonempty subset of ℝ.
1. If 𝑆 has an upper bound 𝑀 which is an element of 𝑆, then 𝑀 is called the greatest element or maximum of
𝑆, and we write 𝑀 = max 𝑆.
2. If 𝑆 has a lower bound π‘š which is an element of 𝑆, then π‘š is called the least element or minimum of 𝑆, and
we write π‘š = min 𝑆.
From the definition, we immediately obtain
Proposition 1.5. Let 𝑆 be a nonempty subset of ℝ. Then
1. 𝑀 = max 𝑆 ⇔ 𝑀 ∈ 𝑆 and π‘₯ ≤ 𝑀 for all π‘₯ ∈ 𝑆,
2. π‘š = min 𝑆 ⇔ π‘š ∈ 𝑆 and π‘₯ ≥ π‘š for all π‘₯ ∈ 𝑆.
We have already implicitly used in our notation that maximum and minimum are unique if they exist. The next
result states this formally.
Proposition 1.6. Let 𝑆 be a nonempty subset of ℝ. If maximum or minimum of 𝑆 exist, then they are unique.
Proof. Assume that 𝑆 has maxima 𝑀1 and 𝑀2 . We must show 𝑀1 = 𝑀2 . Since 𝑀1 ∈ 𝑆 and 𝑀2 is an upper
bound of 𝑆, we have 𝑀1 ≤ 𝑀2 . Since 𝑀2 ∈ 𝑆 and 𝑀1 is an upper bound of 𝑆, we have 𝑀2 ≤ 𝑀1 . Hence
𝑀1 = 𝑀2 .
A similar proof holds for the minimum.
Example 1.2.2. Let π‘Ž, 𝑏 ∈ ℝ, π‘Ž < 𝑏, and 𝑆 = [π‘Ž, 𝑏). Then min 𝑆 = π‘Ž, but 𝑆 has no maximum.
Solution. Clearly, π‘Ž ∈ 𝑆 and π‘Ž ≤ π‘₯ for all π‘₯ ∈ 𝑆, so that π‘Ž = min 𝑆.
𝑀 +𝑏
Assume 𝑆 has a maximum 𝑀. Since 𝑀 ∈ 𝑆, π‘Ž ≤ 𝑀 < 𝑏. Put 𝑐 =
. Then
2
π‘Ž≤𝑀 =
(1 + 1)𝑀
𝑀 +𝑀
𝑀 +𝑏 𝑏+𝑏
2𝑀
=
=
<
<
= 𝑏.
2
2
2
2
2
Therefore
π‘Ž ≤ 𝑀 < 𝑐 < 𝑏,
which shows 𝑐 ∈ 𝑆 and 𝑀 < 𝑐, contradicting that 𝑀 = max 𝑆 is an upper bound of 𝑆.
MATH2022
Introductory Analysis 2017 Lecture Manual
7
Definition 1.3. Let 𝑆 be a nonempty subset of ℝ.
A real number 𝑀 is said to be the supremum or least upper bound of 𝑆, if
(a) 𝑀 is an upper bound of 𝑆, and
(b) if 𝐿 is any upper bound of 𝑆, then 𝑀 ≤ 𝐿.
The supremum of 𝑆 is denoted by sup 𝑆.
Definition 1.4. Let 𝑆 be a nonempty subset of ℝ.
A real number π‘š is said to be the infimum or greatest lower bound of 𝑆, if
(a) π‘š is a lower bound of 𝑆, and
(b) if 𝑙 is any lower bound of 𝑆, then π‘š ≥ 𝑙.
The infimum of 𝑆 is denoted by inf 𝑆.
Note. Let 𝑆 be a nonempty subset of ℝ.
1. By definition, if 𝑆 has a supremum, then sup 𝑆 is the minimum of the (nonempty) set of the upper bounds of
𝑆. Hence sup 𝑆 is unique by Proposition 1.6.
2. Similarly, inf 𝑆 is unique if it exists.
Proposition 1.7. Let 𝑆 be a nonempty subset of ℝ.
1. If max 𝑆 exists, then sup 𝑆 exists, and sup 𝑆 = max 𝑆.
2. If min 𝑆 exists, then inf 𝑆 exists, and inf 𝑆 = min 𝑆.
Proof. 1. max 𝑆 is an upper bound of 𝑆 by definition. Since max 𝑆 ∈ 𝑆, max 𝑆 ≤ 𝐿 for any upper bound 𝐿 of
𝑆. Hence max 𝑆 = sup 𝑆.
The proof of 2. is similar.
Example 1.2.3. Let π‘Ž < 𝑏 and put 𝑆 = [π‘Ž, 𝑏). Find inf 𝑆 and sup 𝑆 if they exist.
Solution. By Proposition 1.7 and Example 1.2.2, inf 𝑆 = min 𝑆 = π‘Ž exists.
From π‘₯ < 𝑏 for all π‘₯ ∈ 𝑆 we have that 𝑏 is an upper bound of 𝑆. If there were an an upper bound 𝐿 of 𝑆 with
𝐿 < 𝑏, it would follow as in the solution to Example 1.2.2 that there is 𝑐 ∈ 𝑆 with 𝐿 < 𝑐, which contradicts the
fact that 𝐿 is an upper bound of 𝑆.
Hence sup 𝑆 = 𝑏.
Note. It is crucial for Analysis that sufficiently may subsets have a supremum and/or infimum. For example, with
the currently formulated axioms, we cannot decide if the sets
𝑆1 = {π‘₯ ∈ ℝ ∢ π‘₯ > 0 and π‘₯2 < 2} or 𝑆2 = {π‘₯ ∈ ℝ ∢ π‘₯ > 0 and π‘₯2 ≤ 2}
have suprema.
√
√
Note that the naïve approach putting 𝑆2 = (0, 2] is invalid as we do not know if 2, i. e., a real number π‘₯ > 0
with π‘₯2 = 2, exists.
Thus we have the last axiom of the real number system.
C. The Dedekind completeness axiom
(C)
Every nonempty subset of ℝ which is bounded above has a supremum.
Additional Note. It can be shown that the real number system is uniquely determined by the axioms (A1)-(A4),
(M1)-(M4), (D), (O1)-(O3), (C).
Theorem 1.8 (Positive square root). Let π‘Ž ≥ 0. Then there is a unique π‘₯ ≥ 0 such that π‘₯2 = π‘Ž. We write
√
1
π‘₯ = π‘Ž = π‘Ž2 .
√
Proof. If π‘Ž = 0, we have 02 = 0 and π‘₯2 > 0 if π‘₯ > 0, so that 0 = 0 is the unique number π‘₯ ≥ 0 such that π‘₯2 = 0.
Now let π‘Ž > 0. Note that π‘₯ ≥ 0 and π‘₯2 = π‘Ž > 0 gives π‘₯ > 0. For the uniqueness proof let π‘₯1 > 0 and π‘₯2 > 0 such
that π‘₯21 = π‘Ž and π‘₯22 = π‘Ž. Then
0 = π‘Ž − π‘Ž = π‘₯21 − π‘₯22 = (π‘₯1 + π‘₯2 )(π‘₯1 − π‘₯2 ).
From π‘₯1 + π‘₯2 > 0 it follows that π‘₯1 − π‘₯2 = 0, i. e., the uniqueness π‘₯1 = π‘₯2 .
8
Introductory Analysis 2017 Lecture Manual
MATH2022
For the existence of the square root define
π‘†π‘Ž = {π‘₯ ∈ ℝ ∢ 0 < π‘₯, π‘₯2 < π‘Ž}.
First we are going to show that π‘†π‘Ž ≠ ∅ and that π‘†π‘Ž is bounded.
If π‘Ž < 1, then π‘Ž2 < π‘Ž ⋅ 1 = π‘Ž, so that π‘Ž ∈ π‘†π‘Ž .
If π‘Ž ≥ 1, then ( 12 )2 < 12 = 1 ≤ π‘Ž, so that 21 ∈ π‘†π‘Ž .
For π‘₯ > π‘Ž + 1 we have
π‘₯2 > (π‘Ž + 1)2 = π‘Ž2 + 2π‘Ž + 1 > 2π‘Ž > π‘Ž,
so that any π‘₯ > π‘Ž + 1 does not belong to π‘†π‘Ž . Thus π‘₯ ≤ π‘Ž + 1 for all π‘₯ ∈ π‘†π‘Ž . We have shown that π‘†π‘Ž is bounded
above with upper bound π‘Ž + 1.
By the Dedekind completeness axiom there exists π‘€π‘Ž = sup π‘†π‘Ž . Note that π‘€π‘Ž > 0 since π‘†π‘Ž has positive elements.
To complete the proof we will show that π‘€π‘Ž2 = π‘Ž.
By proof by contradiction, assume that π‘€π‘Ž2 ≠ π‘Ž.
Case I: π‘€π‘Ž2 < π‘Ž. Put
}
{
π‘Ž − π‘€π‘Ž2
, π‘€π‘Ž .
πœ€ = min
4π‘€π‘Ž
Then πœ€ > 0 and
(π‘€π‘Ž + πœ€)2 − π‘Ž = π‘€π‘Ž2 + 2π‘€π‘Ž πœ€ + πœ€2 − π‘Ž
= π‘€π‘Ž2 − π‘Ž + (2π‘€π‘Ž + πœ€)πœ€
≤ π‘€π‘Ž2 − π‘Ž + 3π‘€π‘Ž πœ€
≤ π‘€π‘Ž2 − π‘Ž + 3π‘€π‘Ž
π‘Ž − π‘€π‘Ž2
4π‘€π‘Ž
1
= (π‘€π‘Ž2 − π‘Ž) < 0.
4
Thus
(π‘€π‘Ž + πœ€)2 < π‘Ž,
giving π‘€π‘Ž + πœ€ ∈ π‘†π‘Ž , contradicting the fact that π‘€π‘Ž is an upper bound of π‘†π‘Ž .
Case II: π‘€π‘Ž2 > π‘Ž. Put
𝑀2 − π‘Ž
πœ€= π‘Ž
.
2π‘€π‘Ž
Then 0 < πœ€ < 12 π‘€π‘Ž and
(π‘€π‘Ž − πœ€)2 − π‘Ž = π‘€π‘Ž2 − 2π‘€π‘Ž πœ€ + πœ€2 − π‘Ž
> π‘€π‘Ž2 − π‘Ž − 2π‘€π‘Ž πœ€
= π‘€π‘Ž2 − π‘Ž − 2π‘€π‘Ž
π‘€π‘Ž2 − π‘Ž
2π‘€π‘Ž
= 0.
Hence for all π‘₯ ≥ π‘€π‘Ž − πœ€ > 12 π‘€π‘Ž > 0,
π‘₯2 ≥ (π‘€π‘Ž − πœ€)2 > π‘Ž
so that any π‘₯ ≥ π‘€π‘Ž − πœ€ does not belong to π‘†π‘Ž . Hence π‘€π‘Ž − πœ€ is an upper bound of π‘†π‘Ž , contradicting the fact that
π‘€π‘Ž is the least upper bound of π‘†π‘Ž .
So π‘€π‘Ž2 ≠ π‘Ž is impossible, and π‘€π‘Ž2 = π‘Ž follows.
The completeness axiom can be thought of as ensuring that there are no ’gaps’ on the real line.
MATH2022
Introductory Analysis 2017 Lecture Manual
9
Theorem 1.9 (Characterizations of the supremum). Let 𝑆 be a nonempty subset of ℝ. Let 𝑀 ∈ ℝ. The following
are equivalent:
(a) 𝑀 = sup 𝑆;
(b) 𝑀 is an upper bound of 𝑆, and for each πœ€ > 0, there is 𝑠 ∈ 𝑆 such that 𝑀 − πœ€ < 𝑠;
(c) 𝑀 is an upper bound of 𝑆, and for each π‘₯ < 𝑀, there exists 𝑠 ∈ 𝑆 such that π‘₯ < 𝑠.
Proof. (a) ⇒ (b): Since (a) holds, 𝑀 is the least upper bound and thus an upper bound.
Let πœ€ > 0. Then 𝑀 − πœ€ is not an upper bound of 𝑆 since 𝑀 is the least upper bound of 𝑆. Hence π‘₯ ≤ 𝑀 − πœ€ does
not hold for all π‘₯ ∈ 𝑆, and therefore there must be 𝑠 ∈ 𝑆 such that 𝑀 − πœ€ < 𝑠.
(b) ⇒ (c): Let π‘₯ < 𝑀. Then πœ€ = 𝑀 − π‘₯ > 0, and by assumption there is 𝑠 ∈ 𝑆 with 𝑀 − πœ€ < 𝑠. Then
π‘₯ = 𝑀 − πœ€ < 𝑠.
(c) ⇒ (a): By assumption, 𝑀 is an upper bound of 𝑆. Suppose that 𝑀 is not the least upper bound. Then there is
an upper bound 𝐿 of 𝑆 with 𝐿 < 𝑀. By assumption (c), there is 𝑠 ∈ 𝑆 such that 𝐿 < 𝑠, contradicting that 𝐿 is an
upper bound of 𝑆. So 𝑀 must be indeed the least upper bound of 𝑆.
There is an apparent asymmetry in the Dedekind completion. However, there is a version for infima, which is
obtained by reflection. To this end, if 𝑆 is a (nonempty) subset of ℝ set
−𝑆 = {−π‘₯ ∢ π‘₯ ∈ 𝑆}.
Note that because of −(−π‘₯) = π‘₯ this can also can be written as
−𝑆 = {π‘₯ ∈ ℝ ∢ −π‘₯ ∈ 𝑆}.
Since π‘₯ ≤ 𝑦 ⇔ −π‘₯ ≥ −𝑦 it is easy to see that the following properties hold:
Proposition 1.10. Let 𝑆 be be nonempty subset of ℝ. Then
(a) −𝑆 is bounded below if and only if 𝑆 is bounded above, inf (−𝑆) = − sup 𝑆, and if max 𝑆 exists, then min(−𝑆)
exists and min(−𝑆) = − max 𝑆.
(b) −𝑆 is bounded above if and only if 𝑆 is bounded below, sup(−𝑆) = − inf 𝑆, and if min 𝑆 exists, then max(−𝑆)
exists and max(−𝑆) = − min 𝑆.
(c) −𝑆 is bounded if and only if 𝑆 is bounded.
Thus the Dedekind completeness axiom immediately gives
Theorem 1.11. Every nonempty subset of ℝ which is bounded below has an infimum.
Theorem 1.12 (Characterizations of the infimum). Let 𝑆 be a nonempty subset of ℝ. Let π‘š ∈ ℝ. The following
are equivalent:
(a) π‘š = inf 𝑆;
(b) π‘š is a lower bound of 𝑆, and for each πœ€ > 0, there is 𝑠 ∈ 𝑆 such that 𝑠 < π‘š + πœ€;
(c) π‘š is a lower bound of 𝑆, and for each π‘₯ > π‘š, there exists 𝑠 ∈ 𝑆 such that 𝑠 < π‘₯.
Theorem 1.13 (Dedekind cut). Let 𝐴 and 𝐡 be nonempty subsets of ℝ such that
(i) 𝐴 ∩ 𝐡 = ∅,
(ii) 𝐴 ∪ 𝐡 = ℝ,
(iii) ∀ π‘Ž ∈ 𝐴 ∀ 𝑏 ∈ 𝐡, π‘Ž ≤ 𝑏.
Then there is 𝑐 ∈ ℝ such that π‘Ž ≤ 𝑐 ≤ 𝑏 for all π‘Ž ∈ 𝐴 and 𝑏 ∈ 𝐡.
Proof. 𝐴 is nonempty and bounded above (any 𝑏 ∈ 𝐡 is an upper bound of 𝐴), so 𝑐 = sup 𝐴 exists by the Dedekind
completeness axiom, and 𝑐 ≥ π‘Ž for all π‘Ž ∈ 𝐴 by definition of upper bound. (iii) says that each 𝑏 ∈ 𝐡 is an upper
bound of 𝐴, and hence 𝑐 ≤ 𝑏 since 𝑐 is the least upper bound of 𝐴.
Note. The above theorem says that Dedekind completeness implies the Dedekind cut property. Conversely, if the
ordered field axioms and the Dedekind cut property are satisfied, then the Dedekind completeness axiom holds
(see tutorials).
10
Introductory Analysis 2017 Lecture Manual
MATH2022
Tutorial 1.2.1. 1. In each of the following cases, state if the given set is bounded above or not. If a set is bounded
above, give two different upper bounds for the set, give the supremum of the set and state if the set has a maximum
or not.
(a) (−3, 2)
(b) (1, ∞)
(c) [10, 11]
(d) {5, 4}
(e) {10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0, −1, −2, −5}
(f) (−∞, 2]
(g) {π‘₯ ∈ ℝ ∢ π‘₯2 < 3}
(h) {π‘₯ ∈ ℝ ∢ π‘₯2 ≤ 3}
2. For each of the sets in Q. 1, state if the given set is bounded below or not. If a set is bounded below, give two
different lower bounds for the set, give the infimum of the set and state if the set has a minumum or not.
3. Let 𝑆 be a nonempty subset of ℝ.
(a) If π‘Ž is the greatest element of 𝑆, then what is sup 𝑆?
(b) If sup 𝑆 = π‘Ž, then what are the upper bounds of 𝑆? .
(c) If sup 𝑆 = π‘Ž, does 𝑆 have a maximum?
(d) If sup 𝑆 = π‘Ž and π‘Ž ∈ 𝑆, does 𝑆 have a maximum?
4. Prove Proposition 1.10.
5. Prove Theorem 1.11.
6. Prove Theorem 1.12.
7. Let 𝑆 and 𝑇 be non-empty subsets of ℝ which are bounded above. Use Theorem 1.9 to prove that sup(𝑆 + 𝑇 ) =
sup 𝑆 + sup 𝑇 .
Step 1: Let 𝐾 = sup 𝑆 and 𝑀 = sup 𝑇 . Show that 𝐾 + 𝑀 is an upper bound of 𝑆 + 𝑇 .
Step 2: Let πœ€ > 0. Then 2πœ€ > 0 as well. Since sup 𝑆 = 𝐾, by Theorem 1.9 there is 𝑠 ∈ 𝑆 such that 𝐾 − 2πœ€ < 𝑠. Also,
since sup 𝑇 = 𝑀, there is 𝑑 ∈ 𝑇 such that 𝑀 − 2πœ€ < 𝑑. Show that there exists 𝑒 ∈ 𝑆 + 𝑇 such that 𝐾 + 𝑀 − πœ€ < 𝑒.
Step 3: Conclude, using Theorem 1.9, that sup(𝑆 + 𝑇 ) = 𝐾 + 𝑀 = sup 𝑆 + sup 𝑇 .
8. Some simple questions. Decide which of the following statements are true and which are false.
(b) 3 ∈ (0, 3)
(c) 17 ∈ [0, 17]
(a) 12 ∈ {0, 1}
(d) 17 ∈ (−3, 18)
(e) 17 ∈ [16, 18]
(f) 2 ∈ {1, 3, 5, 7}
(g) 2.5 ∈ {π‘₯ ∈ ℝ ∢ π‘₯2 ≥ 4}
(h) −1 ∈ {π‘₯ ∈ ℝ ∢ 2π‘₯ + 7 < 5}
9∗ . Assume that the Dedekind cut property, Theorem 1.13 as well as the ordered field axioms are satisfied. Show
that the Dedekind completeness holds.
1.3
Natural Numbers
Naïvely, the natural numbers are 0, 1, 2, 3, … , where the dots stand for the numbers which are obtained by adding
one to the previous number. However, our axioms of the real number system do not tell us that this is possible.
Hence we need another axiom to characterize the natural numbers.
P. Peano’s axioms of the natural numbers
There is a unique subset β„• of ℝ, called the set of natural numbers, satisfying the following properties:
(P1)
0 ∈ β„•,
(P2)
π‘Ž ∈ β„• ⇒ π‘Ž + 1 ∈ β„•,
(P3)
∀ π‘Ž ∈ β„•, π‘Ž + 1 ≠ 0,
(P4)
If 𝑆 ⊂ β„• such that 0 ∈ 𝑆 and π‘Ž + 1 ∈ 𝑆 for all π‘Ž ∈ 𝑆, then 𝑆 = β„•.
Additional Notes. 1. Here we define natural numbers to start at 0. This is common if one uses the axiomatic
definition of the natural numbers. You may start the natural numbers with 1, if you are consistent with this.
2. If one defines the natural numbers without reference to real number, there is no arithmetic on β„• needed. Then
the number π‘Ž + 1 is called the successor of π‘Ž, and axioms (P2) and (P3) can be phrased as follows: Every natural
number has a successor, and 0 is not the successor of a natural number.
3. The natural numbers are used to “count”. In everyday life, counting refers to concrete objects (‘There are 220
students in the class’, ‘I have got no pen with me’, ‘I ate three rolls this morning’). However, they can also be
defined as cardinalities of abstract (finite) sets. Here the cardinality #𝑆 of a (finite) set 𝑆 is the number of elements
in the set. The empty set ∅ has no elements, so #∅ = 0 serves as the definition of the number 0. To get the next
MATH2022
Introductory Analysis 2017 Lecture Manual
11
number, consider the set which consists of the empty set. It has exactly one element, namely the empty set. So
#{∅} = 1. Next #{∅, {∅}} = 2, and so on.
Recall that the integers β„€ and the rational numbers β„š are defined by
β„€ = {π‘₯ ∈ ℝ ∢ π‘₯ ∈ β„• or − π‘₯ ∈ β„•},
{
}
𝑝
β„š = π‘₯ ∈ ℝ ∢ π‘₯ = , 𝑝, π‘ž ∈ β„€, π‘ž ≠ 0 .
π‘ž
As a consequence of (P4) one has
The Principle of Mathematical Induction
Suppose we have a statement 𝐴(𝑛) associated with each 𝑛 ∈ β„•. If
(i) the statement 𝐴(0) is true, and
(ii) whenever the statement 𝐴(𝑛) is true, also the statement 𝐴(𝑛 + 1) is true,
then the statement 𝐴(𝑛) is true for all 𝑛 ∈ β„•.
Note. The statement 𝐴(0) is called the induction base. The induction base does not need to be taken at 0, one can
take any integer which is suitable for the statements to be proved.
Theorem 1.14. 1. All nonzero natural numbers 𝑛 satisfy 𝑛 ≥ 1.
2. ∀ 𝑛 ∈ β„• ⧡ {0} ∃ π‘š ∈ β„•, 𝑛 = π‘š + 1.
3. ∀ π‘š, 𝑛 ∈ β„•, π‘š + 𝑛 ∈ β„•.
4. for all π‘š, 𝑛 ∈ β„• with π‘š ≥ 𝑛 there is π‘˜ ∈ β„• such that π‘š = 𝑛 + π‘˜.
Proof. A selection of these proofs may be presented in class.
1. For each 𝑛 ∈ β„• our statement 𝐴(𝑛) is: if 𝑛 ≠ 0, then 𝑛 ≥ 1.
We have to show that this statement is true for all 𝑛 ∈ β„•.
Induction base: 𝐴(0) is true since the assumption 0 ≠ 0 does not hold.
Induction step: Now let 𝑛 ∈ β„• and assume that 𝐴(𝑛) holds. We must show that 𝐴(𝑛 + 1) holds.
If 𝑛 = 0 we prove 𝐴(1) directly: 0 + 1 = 1 ≥ 1.
Now let 𝑛 ≠ 0. Then 𝑛 ≥ 1 since 𝐴(𝑛) is true. Since 1 > 0 by Theorem 1.4, (k), we therefore have 𝑛+1 ≥ 1+1 ≥ 1.
Hence, by the principle of mathematical induction, 𝐴(𝑛) is true for all 𝑛 ∈ β„•.
2. Let 𝑆 = {0} ∪ {𝑛 ∈ β„• ∢ ∃ π‘š ∈ β„•, 𝑛 = π‘š + 1}.
Then 𝑆 ⊂ β„•, 0 ∈ 𝑆, and if 𝑛 ∈ 𝑆 then 𝑛 + 1 ∈ 𝑆 since 𝑛 ∈ β„•.
Hence 𝑆 = β„• by (P4).
3. For 𝑛 ∈ β„• let 𝐴(𝑛) be the statement:
For all π‘š ∈ β„•, π‘š + 𝑛 ∈ β„•.
𝐴(0) is trivially true.
Now assume that 𝐴(𝑛) is true. Then, for π‘š ∈ β„•,
π‘š + (𝑛 + 1) = (π‘š + 1) + 𝑛 ∈ β„•
by induction hypothesis. Hence 𝐴(𝑛 + 1) holds.
By the principle of mathematical induction, 𝐴(𝑛) holds for all 𝑛 ∈ β„•. The proof is complete.
4. For 𝑛 ∈ β„• let 𝐴(𝑛) be the statement:
For all π‘š ∈ β„• with π‘š ≥ 𝑛 there exists π‘˜ ∈ β„• such that π‘š = 𝑛 + π‘˜.
𝐴(0) is true with π‘˜ = π‘š for all π‘š.
Now assume 𝐴(𝑛) is true and let π‘š ∈ β„• with π‘š ≥ 𝑛 + 1. By induction hypothesis, since π‘š ≥ 𝑛 + 1 ≥ 𝑛 there is
π‘˜ ∈ β„• such that π‘š = 𝑛 + π‘˜.
If π‘˜ = 0, then π‘š = 𝑛 < 𝑛 + 1, which is impossible.
Hence π‘˜ ≠ 0, and so π‘˜ = 𝑙 + 1 for some 𝑙 ∈ β„• by part 2. It follows that
π‘š = 𝑛 + 𝑙 + 1 = (𝑛 + 1) + 𝑙,
and therefore 𝐴(𝑛 + 1) is true.
By the principle of mathematical induction, 𝐴(𝑛) hold for all 𝑛 ∈ β„•. The proof is complete.
Finally, we state and prove two important principles which are consequences of the Dedekind completeness and
Peano’s axioms.
12
Introductory Analysis 2017 Lecture Manual
MATH2022
Theorem 1.15 (Well-ordering principle). Every nonempty subset of β„• has a smallest element.
Proof. Let 𝑆 ⊂ β„•, 𝑆 ≠ ∅. Assume 𝑆 has no minimum. Let
𝑇 = {𝑛 ∈ β„• ∢ ∀ π‘š ∈ 𝑆, 𝑛 < π‘š}.
Note that 𝑛 ∈ 𝑇 and π‘š ∈ 𝑆 gives 𝑛 < π‘š, so that 𝑛 ≠ π‘š. Hence 𝑇 ∩ 𝑆 = ∅.
Assume that 0 ∈ 𝑆. Since 0 ≤ 𝑛 for all 𝑛 ∈ β„• by Theorem 1.14, 1, and since 𝑆 ⊂ β„•, we have 0 ≤ π‘š for all π‘š ∈ 𝑆.
But 𝑆 has no minimum, and therefore 0 ∉ 𝑆. Thus 0 < 1 ≤ π‘š for all π‘š ∈ 𝑆.
Hence 0 ∈ 𝑇 .
Now let 𝑛 ∈ 𝑇 . We want to show that 𝑛 + 1 ∈ 𝑇 .
By definition of 𝑇 , we have 𝑛 < π‘š for all π‘š ∈ 𝑆. By Theorem 1.14, 4, For each such π‘š there is π‘˜ ∈ β„• such that
π‘š = 𝑛 + π‘˜. Since 𝑛 < π‘š, it follows that π‘˜ ≠ 0, and hence π‘˜ ≥ 1 by Theorem 1.14, 1. Thus
π‘š=𝑛+π‘˜≥𝑛+1
for all π‘š ∈ 𝑆. Since we assume that 𝑆 has no minimum, it follows that 𝑛 + 1 ∉ 𝑆 and hence 𝑛 + 1 ≠ π‘š for all
π‘š ∈ 𝑆. Altogether 𝑛 + 1 < π‘š for all π‘š ∈ 𝑆. This shows that 𝑛 + 1 ∈ 𝑇 .
By (P4), 𝑇 = β„•. Hence
𝑆 = β„• ∩ 𝑆 = 𝑇 ∩ 𝑆 = ∅,
a contradiction.
Note. One can show that if 𝑆 ⊂ β„€, 𝑆 ≠ ∅, and 𝑆 is bounded below, then 𝑆 has a minimum.
Theorem 1.16 (The Archimedean principle). For each π‘₯ ∈ ℝ there is 𝑛 ∈ β„• such that 𝑛 > π‘₯.
Proof. Assume the Archimedian principle is false. Then there is π‘₯ ∈ ℝ such that 𝑛 ≤ π‘₯ for all 𝑛 ∈ β„•. That means,
β„• is bounded above and therefore has a supremum 𝑀.
By Theorem 1.9 there is 𝑛 ∈ β„• such that 𝑀 − 1 < 𝑛. Then
𝑛 + 1 > (𝑀 − 1) + 1 = 𝑀
and 𝑛 + 1 ∈ β„• contradict the fact that 𝑀 is an upper bound of β„•. Hence the Archimedian principle must be
true.
Definition 1.5. A subset 𝑆 of ℝ is said to be dense in ℝ if for all π‘₯, 𝑦 ∈ ℝ with π‘₯ < 𝑦 there is 𝑠 ∈ 𝑆 such that
π‘₯ < 𝑠 < 𝑦.
Real numbers which are not rational numbers are called irrational numbers.
√
Note. 1. 2 is irrational (see tutorials).
2. β„š is an ordered field, i. e., β„š satisfies all axioms of ℝ except the Dedekind completeness axiom (see tutorials).
3. If π‘Ž is rational and 𝑏 is irrational, then π‘Ž + 𝑏 is irrational (see tutorials).
Theorem 1.17. The set of rational numbers as well as the set of irrational numbers are dense in ℝ.
Proof. Let π‘₯, 𝑦 ∈ ℝ, π‘₯ < 𝑦. By the Archimedian principle, there is a natural number 𝑛0 such that
𝑛0 >
2
> 0.
𝑦−π‘₯
Let
𝑆 = {𝑛 ∈ β„€ ∢ 𝑛 > 𝑛0 π‘₯}.
By the Archimedean principle, 𝑆 ≠ ∅. Also, 𝑆 is bounded below. So 𝑆 has a minimum π‘š. Then π‘š − 1 ∉ 𝑆, so
that π‘š − 1 ≤ 𝑛0 π‘₯.
√
From 1 < 2 < 4 it follows that 1 < 2 < 2. Then
√
𝑛0 π‘₯
π‘š − 1 + 2 π‘š + 1 𝑛0 π‘₯ + 2
𝑦−π‘₯
π‘š
2
π‘₯=
<
<
<
≤
=π‘₯+
<π‘₯+2
= 𝑦.
𝑛0
𝑛0
𝑛0
𝑛0
𝑛0
𝑛0
2
MATH2022
Introductory Analysis 2017 Lecture Manual
Then
π‘š
𝑒=
∈ β„š,
𝑛0
π‘š−1+
𝑣=
𝑛0
√
2
13
∈β„β§΅β„š
and, by above,
π‘₯ < 𝑒 < 𝑣 < 𝑦,
so that we have have a rational and an irrational number between each pair of real numbers π‘₯, 𝑦 with π‘₯ < 𝑦. Hence
both β„š and ℝ ⧡ β„š are dense in ℝ.
√
√
Example 1.3.1. Let 𝑆 = {π‘₯ ∈ β„š ∢ π‘₯ < 2} and 𝑇 = {π‘₯ ∈ β„š ∢ π‘₯ ≤ 2}. Then 𝑆 = 𝑇 , and 𝑆 and 𝑇 have no
supremum in β„š.
Example 1.3.2 (Bernoulli’s inequality). For all π‘₯ ∈ ℝ with π‘₯ ≥ −1 and all 𝑛 ∈ β„•, (1 + π‘₯)𝑛 ≥ 1 + 𝑛π‘₯.
Solution. For 𝑛 = 0, (1 + π‘₯)𝑛 = (1 + π‘₯)0 = 1 = 1 + 0 ⋅ π‘₯ = 1 + 𝑛π‘₯, so that the inequality holds for 𝑛 = 0.
Let 𝑛 ∈ β„• and assume the Bernoulli inequality holds for this 𝑛. Then
(1 + π‘₯)𝑛+1 = (1 + π‘₯)(1 + π‘₯)𝑛 ≥ (1 + π‘₯)(1 + 𝑛π‘₯) = 1 + π‘₯ + 𝑛π‘₯ + 𝑛π‘₯2 = 1 + (𝑛 + 1)π‘₯ + 𝑛π‘₯2 ≥ 1 + (𝑛 + 1)π‘₯.
The first inequality follows from 1 + π‘₯ ≥ 0 and the induction hypothesis, the last from 𝑛π‘₯2 ≥ 0. By the principle
of induction, the Bernoulli inequality holds for all 𝑛 ∈ β„•.
Note. We will make use of more rules of real numbers without proof. See the tutorials for the power rules, for
example.
Tutorial 1.3.1. 1. Show that if 𝑆 ⊂ β„€, 𝑆 ≠ ∅, and 𝑆 is bounded below, then 𝑆 has a minimum.
√
√
2. Show that 2 is irrational. Hint. Assume that 2 = π‘žπ‘ with positive integers 𝑝 and π‘ž. You may assume that 𝑝
and π‘ž do not have common factors, i. e., there are no positive integers π‘˜, 𝑝1 , π‘ž1 with π‘˜ ≠ 1 such that 𝑝 = 𝑝1 π‘˜ and
π‘ž = π‘ž1 π‘˜.
3. Show that the rational numbers satisfy the axioms (A1)–(A4), (M1)–(M4), (D), and (O1)–(O3).
4. Let π‘Ž, 𝑏 ∈ β„š with 𝑏 ≠ 0 and π‘Ÿ ∈ ℝ ⧡ β„š. Show that π‘Ž + π‘π‘Ÿ ∈ ℝ ⧡ β„š.
5. For π‘₯ ∈ ℝ and 𝑛 ∈ β„•, π‘₯𝑛 is defined inductively by
(i) π‘₯0 = 1,
(ii) π‘₯𝑛+1 = π‘₯π‘₯𝑛 for 𝑛 ∈ β„•.
Show that (a) π‘₯𝑛 π‘₯π‘š = π‘₯𝑛+π‘š , (b) (π‘₯𝑛 )π‘š = π‘₯π‘›π‘š , (c) π‘₯𝑛 𝑦𝑛 = (π‘₯𝑦)𝑛 .
Chapter 2
Sequences
Students have seen sequences and learnt rules for limits and how to apply these rules to find limits. In this chapter,
students will learn the proper definitions of limits and the proofs of the rules for limits.
2.1
Sequences
Definition 2.1. A (real) sequence is an ordered list of infinitely many real numbers
π‘Ž1 , π‘Ž2 , π‘Ž3 , π‘Ž4 , … .
We usually denote a sequence by (π‘Žπ‘› ) = π‘Ž1 , π‘Ž2 , π‘Ž3 , … .
The number π‘Žπ‘› is called the 𝑛-th term of the sequence. The subscript 𝑛 is called the index.
Note. The index of the sequence does not have to start at 1, and therefore one also writes (π‘Žπ‘› )∞
𝑛=𝑛0 , where 𝑛0 can
∞
∞
∞
be any integer, e. g., (2𝑛 − 3)𝑛=1 , (2𝑛 − 3)𝑛=0 , (2𝑛 − 3)𝑛=−5 .
Recall the following note and sketch from First Year Calculus. [Lecturers: do not write this down in class.]
Note. (i) A sequence can be identified with a function on the (positive) integers:
π‘Žπ‘› = 𝑓 (𝑛).
(ii) A sequence can be plotted as points of the real axis or as the graph of the function, see (i).
Below is a plot for the sequence
(
𝑛
𝑛+1
)∞
.
𝑛=1
π‘Ž1
π‘Ž2
1
2
0
π‘Ž3
1
π‘Žπ‘›
1
𝑛
1 2 3 4 5 6 7 8
14
MATH2022
Introductory Analysis 2017 Lecture Manual
15
In first year calculus students have seen the following informal definition. [Lecturers: You do not need to write
this down. In any case, emphasize that this notion has to be formalized.]
Informal Definition. A sequence {π‘Žπ‘› } is said to converge if there is a number 𝐿 such that π‘Žπ‘› is as close to 𝐿 as we
like for all sufficiently large 𝑛.
We have to formalize “as close to” and “for all sufficiently large”: “as close to” means that the distance does not
exceed any given (small) positive number. The distance between two numbers π‘₯ and 𝑦 is 𝑦 − π‘₯ if π‘₯ < 𝑦 and π‘₯ − 𝑦
if π‘₯ > 𝑦. Hence the distance between π‘₯ and 𝑦 is |𝑦 − π‘₯|.
Therefore we have the following formal definition:
Definition 2.2. Let (π‘Žπ‘› ) be a sequence and 𝐿 ∈ ℝ.
1. The statement ‘π‘Žπ‘› tends to 𝐿 as 𝑛 tends to infinity’, which we write as ’π‘Žπ‘› → 𝐿 as 𝑛 → ∞’, is defined by:
∀ πœ€ > 0 ∃ 𝐾 ∈ ℝ ∀𝑛 ∈ β„•, 𝑛 ≥ 𝐾, |π‘Žπ‘› − 𝐿| < πœ€.
2. If π‘Žπ‘› → 𝐿 as 𝑛 → ∞, we say that (π‘Žπ‘› ) converges to 𝐿, and we also write lim π‘Žπ‘› = 𝐿.
𝑛→∞
3. The sequence (π‘Žπ‘› ) is said to be convergent if it converges to some real number. Otherwise, the sequence (π‘Žπ‘› ) is
said to be divergent.
Note. 1. It is useful to note that |π‘Žπ‘› − 𝐿| < πœ€ is equivalent to 𝐿 − πœ€ < π‘Žπ‘› < 𝐿 + πœ€.
2. The number 𝐾 depends on πœ€. We may write πΎπœ€ to emphasize the dependence on πœ€.
3. If one wants to prove convergence of a sequence from first principles, then one has to ‘guess’ a limit 𝐿 and then
prove that it is indeed the limit.
4. The ‘first few terms’ do not matter for convergence and the limit. That is, the sequence (π‘Žπ‘› )∞
𝑛=𝑛0 converges if
converges,
and
their
limits
coincide.
and only if the sequence (π‘Žπ‘› )∞
𝑛=π‘š0
5. By the Archimedean principle, there are only finitely many natural numbers such that 𝑛 < 𝐾, and one may also
assume 𝐾 ∈ β„•, without loss of generality.
)
(
𝑛
converges and find its limit.
Example 2.1.1. Prove that the sequence (π‘Žπ‘› ) = 𝑛+1
Solution. We first have to guess the limit. From the rules used in first year, we guess that the limit should be 𝐿 = 1.
Now let πœ€ > 0. We must find πΎπœ€ such that
𝑛 ≥ πΎπœ€ ⇒ |π‘Žπ‘› − 1| < πœ€.
For this we first simplify:
| 𝑛
|
1
|π‘Žπ‘› − 1| = ||
− 1|| =
.
𝑛
+
1
𝑛
+
1
|
|
1
< πœ€, which can be written as 𝑛 + 1 > 1πœ€ or 𝑛 > 1πœ€ − 1. Hence if we take
𝑛+1
𝑛
πΎπœ€ = 1πœ€ − 1, then |π‘Žπ‘› − 1| < πœ€ for all 𝑛 > πΎπœ€ . Since πœ€ > 0 was arbitrary, we have shown that 𝑛+1
→ 1 as 𝑛 → ∞.
Hence |π‘Žπ‘› − 1| < πœ€ provided that
Lemma 2.1. If 𝐿, 𝑀 ∈ ℝ such that |𝐿 − 𝑀| < πœ€ for all πœ€ > 0, then 𝐿 = 𝑀.
Proof. Assume 𝐿 ≠ 𝑀. Then either 𝐿 < 𝑀 or 𝐿 > 𝑀. But
𝐿 < 𝑀 ⇒ |𝐿 − 𝑀| = 𝑀 − 𝐿 > 0,
𝑀 < 𝐿 ⇒ |𝐿 − 𝑀| = 𝐿 − 𝑀 > 0,
so that |𝐿 − 𝑀| > 0. Then
0<
|𝐿 − 𝑀|
< |𝐿 − 𝑀|,
2
which contradicts |𝐿 − 𝑀| < πœ€ for πœ€ = |𝐿−𝑀|
. Hence the assumption 𝐿 ≠ 𝑀 must be false, and 𝐿 = 𝑀
2
follows.
Theorem 2.2. If the sequence (π‘Žπ‘› ) converges, then its limit is unique.
16
Introductory Analysis 2017 Lecture Manual
MATH2022
Proof. Assume that (π‘Žπ‘› ) converges to 𝐿 and 𝑀. Let πœ€ > 0. Then there are numbers π‘˜πΏ and π‘˜π‘€ such that
(i) |π‘Žπ‘› − 𝐿| < 2πœ€ if 𝑛 ≥ π‘˜πΏ ,
(ii) |π‘Žπ‘› − 𝑀| < 2πœ€ if 𝑛 ≥ π‘˜π‘€ .
Put 𝐾 = max{π‘˜πΏ , π‘˜π‘€ }. For positive integers 𝑛 ≥ 𝐾 we have 𝑛 ≥ π‘˜πΏ and 𝑛 ≥ π‘˜π‘€ and therefore
|𝐿 − 𝑀| = |(𝐿 − π‘Žπ‘› ) + (π‘Žπ‘› − 𝑀)| ≤ |π‘Žπ‘› − 𝐿| + |π‘Žπ‘› − 𝑀| <
πœ€ πœ€
+ = πœ€.
2 2
By Lemma 2.1, 𝐿 = 𝑀.
We are now going to prove the rules you have learnt in first year.
Theorem 2.3 (Limit Laws). Let 𝑐 ∈ ℝ and suppose that lim π‘Žπ‘› = 𝐿 and lim 𝑏𝑛 = 𝑀 both exist. Then
𝑛→∞
𝑛→∞
(a) lim 𝑐 = 𝑐.
𝑛→∞
(b) lim (π‘Žπ‘› + 𝑏𝑛 ) = lim π‘Žπ‘› + lim 𝑏𝑛 = 𝐿 + 𝑀.
𝑛→∞
𝑛→∞
𝑛→∞
(c) lim (π‘π‘Žπ‘› ) = 𝑐 lim π‘Žπ‘› = 𝑐𝐿.
𝑛→∞
(𝑛→∞ ) (
)
(d) lim (π‘Žπ‘› 𝑏𝑛 ) = lim π‘Žπ‘›
lim 𝑏𝑛 = 𝐿𝑀.
𝑛→∞
𝑛→∞
𝑛→∞
lim π‘Žπ‘›
π‘Žπ‘›
𝑛→∞
𝐿
=
=
.
𝑛→∞ 𝑏𝑛
lim 𝑏𝑛
𝑀
𝑛→∞
π‘Ž
(f) If 𝐿 ≠ 0 and 𝑀 = 0, then lim 𝑛 does not exist.
𝑛→∞ 𝑏𝑛
(
)π‘˜
(g) If π‘˜ ∈ β„€+ , lim π‘Žπ‘˜π‘› = lim π‘Žπ‘› = πΏπ‘˜ .
𝑛→∞
𝑛→∞
(e) If 𝑀 ≠ 0, lim
√
√
√
π‘˜
(h) If π‘˜ ∈ β„€+ , lim π‘˜ π‘Žπ‘› = π‘˜ lim π‘Žπ‘› = 𝐿. If π‘˜ is even, we assume π‘Žπ‘› ≥ 0 and 𝐿 ≥ 0.
𝑛→∞
𝑛→∞
(i) If lim |π‘Žπ‘› | = 0, then lim π‘Žπ‘› = 0.
𝑛→∞
𝑛→∞
Proof. (a) For all πœ€ > 0 and all indices 𝑛, with π‘Žπ‘› = 𝑐, |π‘Žπ‘› − 𝑐| = |𝑐 − 𝑐| = 0 < πœ€.
(b) Let πœ€ > 0. Then there are numbers π‘˜πΏ and π‘˜π‘€ such that
(i) |π‘Žπ‘› − 𝐿| < 2πœ€ if 𝑛 ≥ π‘˜πΏ ,
(ii) |𝑏𝑛 − 𝑀| < 2πœ€ if 𝑛 ≥ π‘˜π‘€ .
Put 𝐾 = max{π‘˜πΏ , π‘˜π‘€ }. For positive integers 𝑛 ≥ 𝐾 we have 𝑛 ≥ π‘˜πΏ and 𝑛 ≥ π‘˜π‘€ and therefore
|(π‘Žπ‘› + 𝑏𝑛 ) − (𝐿 + 𝑀)| = |(π‘Žπ‘› − 𝐿) + (𝑏𝑛 − 𝑀)| ≤ |π‘Žπ‘› − 𝐿| + |𝑏𝑛 − 𝑀| <
πœ€ πœ€
+ = πœ€.
2 2
Hence (π‘Žπ‘› + 𝑏𝑛 ) converges to 𝐿 + 𝑀.
(c) Let πœ€ > 0. Then there is a number 𝐾 such that
πœ€
|π‘Žπ‘› − 𝐿| < 1+|𝑐|
if 𝑛 ≥ 𝐾.
For positive integers 𝑛 ≥ 𝐾 we have
|π‘π‘Žπ‘› − 𝑐𝐿| = |𝑐(π‘Žπ‘› − 𝐿)| = |𝑐| |π‘Žπ‘› − 𝐿| < |𝑐|
πœ€
< πœ€.
1 + |𝑐|
(d) We consider 2 cases: one special case, to which then the general case is reduced.
Case I: 𝐿 = 𝑀 = 0. Let πœ€ > 0 and put πœ€′ = min{1, πœ€}. Then 0 < πœ€′ ≤ πœ€, and there are numbers π‘˜πΏ and π‘˜π‘€ such
that
(i) |π‘Žπ‘› − 0| < πœ€′ if 𝑛 ≥ π‘˜πΏ ,
(ii) |𝑏𝑛 − 0| < πœ€′ if 𝑛 ≥ π‘˜π‘€ .
Put 𝐾 = max{π‘˜πΏ , π‘˜π‘€ }. For positive integers 𝑛 ≥ 𝐾 we have 𝑛 ≥ π‘˜πΏ and 𝑛 ≥ π‘˜π‘€ and therefore
2
|π‘Žπ‘› 𝑏𝑛 | = |π‘Žπ‘› | |𝑏𝑛 | < πœ€′ ≤ πœ€′ ≤ πœ€.
Case II: 𝐿 and 𝑀 are arbitrary. Then
π‘Žπ‘› 𝑏𝑛 = (π‘Žπ‘› − 𝐿)(𝑏𝑛 − 𝑀) + 𝐿(𝑏𝑛 − 𝑀) + π‘Žπ‘› 𝑀.
MATH2022
17
Introductory Analysis 2017 Lecture Manual
By (a), (b), (c), (π‘Žπ‘› − 𝐿) → 0 and (𝑏𝑛 − 𝑀) → 0 as 𝑛 → ∞, and by (b), (c), and Case I, it follows that (π‘Žπ‘› 𝑏𝑛 )
converges with
lim π‘Žπ‘› 𝑏𝑛 = 0 + 𝐿 ⋅ 0 + 𝐿𝑀 = 𝐿𝑀.
𝑛→∞
(e) First consider π‘Žπ‘› = 1. Then
|1
|
| − 1 | = |𝑀 − 𝑏𝑛 | .
|𝑏
|
|𝑏𝑛 𝑀|
| 𝑛 𝑀|
We must assure that 𝑏𝑛 stays away from 0, and hence we estimate
|𝑀| = |𝑏𝑛 + 𝑀 − 𝑏𝑛 | ≤ |𝑏𝑛 | + |𝑏𝑛 − 𝑀|,
which gives
|𝑏𝑛 | ≥ |𝑀| − |𝑏𝑛 − 𝑀|.
Since 𝑏𝑛 → 𝑀 ≠ 0, there is π‘˜1 such that |𝑏𝑛 − 𝑀| < |𝑀|
for 𝑛 ≥ π‘˜1 , and hence
2
|𝑀| |𝑀|
=
.
2
2
|𝑏𝑛 | ≥ |𝑀| −
If we therefore let πœ€ > 0 and choose 𝐾 such that
|𝑏𝑛 − 𝑀| < min
{
|𝑀| 𝑀 2
,
πœ€
2
2
}
for 𝑛 ≥ 𝐾, it follows that
2
|1
|
| − 1 | = |𝑀 − 𝑏𝑛 | ≤ |𝑀 − 𝑏𝑛 | 2 < |𝑀| πœ€ 2 = πœ€.
|𝑏
|
|𝑏𝑛 | |𝑀|
2
|𝑀|2
|𝑀|2
| 𝑛 𝑀|
The general case now follows with (d):
π‘Žπ‘›
1
1
= lim π‘Žπ‘› ⋅ lim
=𝐿⋅
.
𝑛→∞ 𝑏𝑛
𝑛→∞
𝑛→∞ 𝑏𝑛
𝑀
lim
π‘Ž
(f) Assume that 𝑃 = lim 𝑏𝑛 exists. Then, by (d),
𝑛→∞ 𝑛
(
𝐿 = lim π‘Žπ‘› = lim
𝑛→∞
𝑛→∞
π‘Žπ‘›
𝑏
𝑏𝑛 𝑛
)
π‘Žπ‘›
⋅ lim 𝑏 = 𝑃 ⋅ 𝑀 = 𝑃 ⋅ 0 = 0,
𝑛→∞ 𝑏𝑛 𝑛→∞ 𝑛
= lim
which contradicts 𝐿 ≠ 0.
(g) Follows from (d) by induction.
(h) We will only prove the case π‘˜ = 2 as we have not yet established the existence of the π‘˜-th root.
√
√
√
If 𝐿 = 0, let πœ€ > 0 and choose 𝐾 such that π‘Žπ‘› < πœ€2 for 𝑛 ≥ 𝐾. Then π‘Žπ‘› < πœ€ for these 𝑛, and lim π‘Žπ‘› = 0 = 𝐿.
𝑛→∞
If 𝐿 > 0, then
|
|
√
| π‘Ž −𝐿 |
√
|π‘Ž − 𝐿|
1
| π‘Žπ‘› − 𝐿| = || √ 𝑛 √ || = √ 𝑛 √ ≤ √ |π‘Žπ‘› − 𝐿|,
| π‘Žπ‘› + 𝐿 |
π‘Žπ‘› + 𝐿
𝐿
|
|
√
√
√
and choosing 𝐾 such that |π‘Žπ‘› − 𝐿| < πΏπœ€ for 𝑛 ≥ 𝐾, it follows that | π‘Žπ‘› − 𝐿| < πœ€ for 𝑛 ≥ 𝐾.
(i) is trivial.
Theorem 2.4 (Sandwich Theorem). If π‘Žπ‘› ≤ 𝑏𝑛 ≤ 𝑐𝑛 and lim π‘Žπ‘› = lim 𝑐𝑛 = 𝐿, then lim 𝑏𝑛 = 𝐿.
𝑛→∞
𝑛→∞
𝑛→∞
Proof. Let πœ€ > 0 and choose π‘˜1 and π‘˜2 such that |π‘Žπ‘› − 𝐿| < πœ€ if 𝑛 ≥ π‘˜1 and |𝑐𝑛 − 𝐿| < πœ€ if 𝑛 ≥ π‘˜2 . In particular,
for 𝑛 ≥ 𝐾 = max{π‘˜1 , π‘˜2 },
𝐿 − πœ€ < π‘Žπ‘› < 𝐿 + πœ€, 𝐿 − πœ€ < 𝑐𝑛 < 𝐿 + πœ€
gives
𝐿 − πœ€ < π‘Žπ‘› ≤ 𝑏𝑛 ≤ 𝑐𝑛 < 𝐿 + πœ€.
Hence |𝑏𝑛 − 𝐿| < πœ€ if 𝑛 ≥ 𝐾.
18
Introductory Analysis 2017 Lecture Manual
MATH2022
Theorem 2.5. Let π‘₯ ∈ ℝ. Then π‘₯𝑛 converges if and only if −1 < π‘₯ ≤ 1.
Proof. For π‘₯ = 1, 1𝑛 = 1, so that lim 1𝑛 = 1 by Theorem 2.3 (a).
𝑛→∞
Let 0 < π‘₯ < 1 and let πœ€ > 0. Then 0 < 1 < π‘₯1 and therefore
𝑦 ∢=
1
− 1 > 0.
π‘₯
Then
( )𝑛
1
1
=
= (1 + 𝑦)𝑛 ≥ 1 + 𝑛𝑦
𝑛
π‘₯
π‘₯
1−πœ€
. Then, for 𝑛 > 𝐾,
by Bernoulli’s inequality. Put 𝐾 =
π‘¦πœ€
0 < π‘₯𝑛 ≤
1
1
<
= πœ€.
1 + 𝑛𝑦 1 + 1−πœ€
πœ€
Hence π‘₯𝑛 → 0 as 𝑛 → ∞. Now let π‘₯ > 1. If π‘₯𝑛 would converge to some 𝐿 as 𝑛 → ∞, then
1 = lim 1𝑛 = lim
𝑛→∞
( )𝑛
1
⋅ lim π‘₯𝑛 = 0 ⋅ 𝐿 = 0,
𝑛→∞
π‘₯
which is impossible.
The remaining cases are left as an exercise.
Theorem 2.6. If π‘Ÿ > 0, then
lim
1
𝑛→∞ π‘›π‘Ÿ
= 0.
Proof. We are only going to prove the case that π‘Ÿ is a positive integer.
Let π‘Ÿ = 1 and choose πœ€ > 0. Let 𝐾 = 2πœ€ . Then, for 𝑛 ≥ 𝐾,
0<
1
1
πœ€
≤
= < πœ€,
𝑛 𝐾
2
| |
whence | 1𝑛 | < πœ€.
| |
Now assume the statement holds for an integer π‘Ÿ > 0. Then
lim
1
𝑛→∞ π‘›π‘Ÿ+1
= lim
1
𝑛→∞ 𝑛
⋅ lim
1
𝑛→∞ π‘›π‘Ÿ
= 0 ⋅ 0 = 0.
So the result follows for all positive integers by induction.
(
)
𝑛
Tutorial 2.1.1. 1. (a) Prove, using the definition of convergence, that the sequence 𝑛+1
does not converge to 2.
𝑛
(b) Prove, using the definition of convergence, that the sequence ((−1) ) does not converge to any 𝐿.
2. (a) Prove that if lim π‘Žπ‘› = 𝐿, then lim |π‘Žπ‘› | = |𝐿|. (Hint: use (and prove) the inequality ||π‘₯| − |𝑦|| ≤ |π‘₯ − 𝑦|.)
𝑛→∞
𝑛→∞
(b) Give an example to show that the converse to part (a) is not true.
3. Contractive maps. Suppose that for some 𝑐 ∈ ℝ with 0 < 𝑐 < 1, we have |π‘Žπ‘›+1 − 𝐿| ≤ 𝑐|π‘Žπ‘› − 𝐿| for all 𝑛 ∈ β„•.
(a) Use induction to prove that |π‘Žπ‘› − 𝐿| ≤ 𝑐 𝑛 |π‘Ž0 − 𝐿| for all 𝑛 ∈ β„•.
(b) Use the Sandwich Theorem and the fact that lim 𝑐 𝑛 = 0 to prove that lim π‘Žπ‘› = 𝐿.
𝑛→∞
𝑛→∞
√
4. Recursive algorithm for finding π‘Ž. Let π‘Ž > 1 and define
(
)
1
π‘Ž
π‘Ž0 = π‘Ž and π‘Žπ‘› =
π‘Žπ‘›−1 +
for 𝑛 ≥ 1.
2
π‘Žπ‘›−1
√
√
(a) Prove that 0 < π‘Žπ‘› − π‘Ž = 2π‘Ž1 (π‘Žπ‘›−1 − π‘Ž)2 for 𝑛 ≥ 1.
𝑛−1
√
√
(b) Use (a) to prove that 0 ≤ π‘Žπ‘› − π‘Ž ≤ 21 (π‘Žπ‘›−1 − π‘Ž) for 𝑛 ≥ 1.
√
(c) Deduce that lim π‘Žπ‘› = π‘Ž.
𝑛→∞
√
(d) Apply four steps of the recursive algorithm with π‘Ž = 3 to approximate 3.
MATH2022
2.2
Introductory Analysis 2017 Lecture Manual
19
Bounded and Monotonic Sequences
Definition 2.3. 1. A sequence (π‘Žπ‘› ) is said to be bounded above if there is a number 𝑀 ∈ ℝ such that π‘Žπ‘› ≤ 𝑀
for all indices 𝑛.
2. A sequence (π‘Žπ‘› ) is said to be bounded below if there is a number π‘š ∈ ℝ such that π‘Žπ‘› ≥ π‘š for all indices 𝑛.
3. A sequence (π‘Žπ‘› ) is said to be bounded if it is a bounded above and bounded below.
Note. A sequence (π‘Žπ‘› )∞
𝑛=𝑛0 is bounded (above, below) if and only if the set {π‘Žπ‘› ∢ 𝑛 ∈ β„€, 𝑛 ≥ 𝑛0 } is bounded
(above, below).
Definition 2.4. A sequence {π‘Žπ‘› }∞
is called
𝑛=1
increasing if π‘Žπ‘› ≤ π‘Žπ‘›+1 for all indices 𝑛,
strictly increasing if π‘Žπ‘› < π‘Žπ‘›+1 for all indices 𝑛,
decreasing if π‘Žπ‘› ≥ π‘Žπ‘›+1 for all indices 𝑛,
strictly decreasing if π‘Žπ‘› > π‘Žπ‘›+1 for all indices 𝑛,
monotonic if it is either increasing or decreasing,
strictly monotonic if it is either strictly increasing or strictly decreasing.
Theorem 2.7. Every convergent sequence is bounded.
Proof. Here we use the fact, which is easy to prove (by induction), that a set of the form {π‘Žπ‘› ∢ 𝑛 ∈ β„€, 𝑛0 ≤ 𝑛 ≤ 𝑛1 }
is bounded (and indeed has minimum and maximum). We also use that the union of two bounded sets is bounded.
Since (π‘Žπ‘› ) converges, there are 𝐿 and π‘˜ such that |π‘Žπ‘› − 𝐿| < 1 for all 𝑛 ≥ π‘˜. Hence {π‘Žπ‘› ∢ 𝑛 < π‘˜} and {π‘Žπ‘› ∢ 𝑛 ≥
π‘˜} ⊂ (𝐿 − 1, 𝐿 + 1) are bounded. So the sequence is bounded by the above note.
Although each unbounded sequence diverges, there are some divergent sequences which have a limiting behaviour
which we want to exploit. Recall that we use the notation ‘𝑛 → ∞’ in the definition of the limit. This extends to
‘π‘Žπ‘› → ∞’, and we have therefore the
Definition 2.5. 1. We say that π‘Žπ‘› tends to infinity as 𝑛 tends to infinity and write π‘Žπ‘› → ∞ as 𝑛 → ∞ if for every
𝐴 ∈ ℝ there is 𝐾 ∈ ℝ such that π‘Žπ‘› > 𝐴 for all 𝑛 ≥ 𝐾.
2. We say that π‘Žπ‘› tends to minus infinity as 𝑛 tends to infinity and write π‘Žπ‘› → −∞ as 𝑛 → ∞ if for every 𝐴 ∈ ℝ
there is 𝐾 ∈ ℝ such that π‘Žπ‘› < 𝐴 for all 𝑛 ≥ 𝐾.
Notes. 1. For the definition of π‘Žπ‘› → ∞(−∞) as 𝑛 → ∞, we may restrict 𝐴 to 𝐴 > 𝐴0 (𝐴 < 𝐴0 ) for suitable
numbers 𝐴0 .
2. For π‘Žπ‘› → ∞ as 𝑛 → ∞ we also write lim π‘Žπ‘› = ∞ and say that (π‘Žπ‘› ) diverges to ∞.
𝑛→∞
3. For π‘Žπ‘› → −∞ as 𝑛 → ∞ we also write lim π‘Žπ‘› = −∞ and say that (π‘Žπ‘› ) diverges to −∞.
𝑛→∞
Example 2.2.1. 1. Let π‘Žπ‘› = (−1)𝑛 . Then (π‘Žπ‘› ) is bounded but not convergent.
2. Let π‘Žπ‘› = 𝑛. Then (π‘Žπ‘› ) is unbounded (by the Archimedean principle). Hence (π‘Žπ‘› ) diverges.
Example 2.2.2. Prove that 𝑛2 − 𝑛3 + 10 → −∞ as 𝑛 → ∞.
Solution. Let 𝐴 < 0 and put 𝐾 = −𝐴 + 11. Then 𝐾 > 11, and for 𝑛 ≥ 𝐾 we have
𝑛2 − 𝑛3 + 10 = −𝑛2 (𝑛 − 1) + 10 ≤ −10𝑛2 + 10 ≤ −10𝐾 2 + 10
= −10(𝐾 − 1)(𝐾 + 1) ≤ −100(𝐾 + 1) ≤ −𝐾 = 𝐴 − 11 ≤ 𝐴.
Theorem 2.8. 1. Let (π‘Žπ‘› ) be a sequence with π‘Žπ‘› > 0 for all indices 𝑛. Then
1
lim π‘Ž = ∞ ⇔ lim
= 0.
𝑛→∞ 𝑛
𝑛→∞ π‘Žπ‘›
2. Let (π‘Žπ‘› ) be a sequence with π‘Žπ‘› < 0 for all indices 𝑛. Then
1
lim π‘Ž = −∞ ⇔ lim
= 0.
𝑛→∞ 𝑛
𝑛→∞ π‘Žπ‘›
20
Introductory Analysis 2017 Lecture Manual
Proof. 1. Assume that lim π‘Žπ‘› = ∞. To show that lim
0 < 𝐴 < π‘Žπ‘› for all 𝑛 ≥ 𝐾. Then
0<
for these 𝑛, which shows that lim
1
𝑛→∞ π‘Žπ‘›
Conversely, assume that lim
1
𝑛→∞ π‘Žπ‘›
Hence there is 𝐾 ∈ ℝ such that
The proof of part 2 is similar.
1
𝑛→∞ π‘Žπ‘›
𝑛→∞
= 0 let πœ€ > 0. Put 𝐴 =
MATH2022
1
. Then there is 𝐾 such that
πœ€
1
1
<
=πœ€
π‘Žπ‘›
𝐴
= 0.
= 0. Let 𝐴 ∈ ℝ and put 𝐴0 = max{𝐴, 1}. Then 𝐴0 > 0 and put πœ€ =
|1|
1
1
= || || < πœ€ for all 𝑛 ≥ 𝐾. This give π‘Žπ‘› > = 𝐴0 ≥ 𝐴 for all 𝑛 ≥ 𝐾.
π‘Žπ‘› | π‘Žπ‘› |
πœ€
1
> 0.
𝐴0
Apart from this rule, there are more rules for infinite limits. Some of these rules are listed below. [Note to lecturers:
do not write this down, just refer the students to the notes.]
Theorem 2.9. Consider π‘˜ ∈ ℝ and sequences with the following properties as 𝑛 → ∞:
π‘Žπ‘› → ∞, 𝑏𝑛 → ∞, 𝑐𝑛 → 𝑐 ∈ ℝ, 𝑑𝑛 → −∞.
Then, as 𝑛 → ∞,
⎧∞
if k>0,
βŽͺ
(a) π‘˜π‘Žπ‘› → ⎨ −∞ if k<0,
βŽͺ0
if k=0.
⎩
(b) π‘Žπ‘› + 𝑏𝑛 → ∞,
(c) π‘Žπ‘› + 𝑐𝑛 → ∞,
(d) −𝑑𝑛 → ∞,
{
∞
if c>0,
(e) π‘Žπ‘› 𝑐𝑛 →
−∞ if c<0.
(f) π‘Žπ‘› 𝑏𝑛 → ∞.
Proof. See tutorials.
It is convenient to extend the notions of supremum and infimum to unbounded set.
Definition 2.6. Let 𝐴 ⊂ ℝ be nonempty. If 𝐴 is not bounded above, we write sup 𝐴 = ∞, and if 𝐴 is not bounded
below, we write inf 𝐴 = −∞.
We know that not every sequence converges, and it may be hard to decide if a sequence converges. However, the
situation is different with monotonic sequences:
Theorem 2.10. 1. Let (π‘Žπ‘› ) be an increasing sequence. If (π‘Žπ‘› ) is bounded, then (π‘Žπ‘› ) converges, and
lim π‘Žπ‘› = sup{π‘Žπ‘› ∢ 𝑛 ∈ β„•}. If (π‘Žπ‘› ) is not bounded, then (π‘Žπ‘› ) diverges to ∞.
𝑛→∞
2. Let (π‘Žπ‘› ) be a decreasing sequence. If (π‘Žπ‘› ) is bounded, then (π‘Žπ‘› ) converges, and
lim π‘Žπ‘› = inf {π‘Žπ‘› ∢ 𝑛 ∈ β„•}. If (π‘Žπ‘› ) is not bounded, then (π‘Žπ‘› ) diverges to −∞.
𝑛→∞
Proof. 1. Assume (π‘Žπ‘› ) is bounded. Put 𝐿 = sup{π‘Žπ‘› ∢ 𝑛 ∈ β„•} and let πœ€ > 0.
We must show that there is 𝐾 ∈ β„• such that 𝑛 ≥ 𝐾 gives 𝐿 − πœ€ < π‘Žπ‘› < 𝐿 + πœ€.
Indeed by the definition of the supremum we have π‘Žπ‘› ≤ 𝐿 < 𝐿 + πœ€ for all 𝑛 and by Theorem 1.9 there is 𝐾 such
that 𝐿 − πœ€ < π‘ŽπΎ . Then we have for all 𝑛 ≥ 𝐾 that
𝐿 − πœ€ < π‘ŽπΎ ≤ π‘Žπ‘› .
Hence 𝐿 − πœ€ < π‘Žπ‘› < 𝐿 + πœ€ for all 𝑛 ≥ 𝐾, which proves π‘Žπ‘› → 𝐿 as 𝑛 → ∞.
Now assume that (π‘Žπ‘› ) is not bounded. Since π‘Ž0 ≤ π‘Žπ‘› for all 𝑛, (π‘Žπ‘› ) is bounded below. Hence (π‘Žπ‘› ) is not bounded
above. Let 𝐴 ∈ ℝ. Then there is an index 𝐾 such that π‘ŽπΎ > 𝐴, and thus π‘Žπ‘› ≥ π‘ŽπΎ > 𝐴 for all 𝑛 ≥ 𝐾.
MATH2022
Introductory Analysis 2017 Lecture Manual
21
(
)𝑛
(
)𝑛+1
Example 2.2.3. Let π‘Žπ‘› = 1 + 1𝑛 and 𝑏𝑛 = 1 + 1𝑛
. Then π‘Žπ‘› < 𝑏𝑛 for all 𝑛 ∈ β„•, (π‘Žπ‘› ) is an increasing
sequence, (𝑏𝑛 ) is a decreasing sequence, both sequences converge to the same number, and the limit is denoted by
𝑒, Euler’s number.
Proof. The inequality π‘Žπ‘› < 𝑏𝑛 is obvious from
(
)
1
𝑏𝑛 = π‘Žπ‘› 1 +
.
𝑛
Using Bernoulli’s inequality, see Example 1.3.2, we calculate
(
)
𝑛 + 2 𝑛+1 (
)𝑛+1
π‘Žπ‘›+1
𝑛(𝑛 + 2)
𝑛+1
𝑛+1
= (
=
)
π‘Žπ‘›
𝑛
𝑛+1 𝑛
(𝑛 + 1)2
𝑛
)𝑛+1
(
𝑛+1
1
= 1−
𝑛
(𝑛 + 1)2
)
(
𝑛+1
𝑛+1
𝑛 𝑛+1
≥ 1−
=
=1
𝑛
𝑛+1 𝑛
(𝑛 + 1)2
and
(
)
𝑛 + 1 𝑛+1 (
)𝑛+2
𝑏𝑛
(𝑛 + 1)2
𝑛
=
=( 𝑛 )
𝑏𝑛+1
𝑛(𝑛 + 2)
𝑛+1
𝑛 + 2 𝑛+2
𝑛+1
)𝑛+2
(
1
𝑛
= 1+
𝑛(𝑛 + 2)
𝑛+1
(
)
𝑛+2
𝑛+1 𝑛
𝑛
≥ 1+
=
= 1.
𝑛(𝑛 + 2) 𝑛 + 1
𝑛 𝑛+1
Hence the sequence (π‘Žπ‘› ) is increasing and bounded (by 𝑏1 ). By Theorem 2.10, (π‘Žπ‘› ) has a limit 𝛼. Similarly, (𝑏𝑛 )
has a limit 𝛽. Finally
𝛽 = lim 𝑏𝑛 = lim
𝑛→∞
((
𝑛→∞
1+
) )
(
)
1
1
π‘Žπ‘› = lim 1 +
lim π‘Ž = 𝛼.
𝑛→∞
𝑛
𝑛 𝑛→∞ 𝑛
Definition 2.7. 1. With every sequence (π‘Žπ‘› ) which is bounded above, we can associate the sequence of numbers
sup{π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛}.
Since {π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛} ⊃ {π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛 + 1}, this sequence is decreasing, and we denote its limit by
lim sup π‘Žπ‘› = lim sup{π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛}.
𝑛→∞
𝑛→∞
If (π‘Žπ‘› ) is not bounded above, we put lim sup π‘Žπ‘› = ∞.
𝑛→∞
2. Similarly, if the sequence (π‘Žπ‘› ) is bounded below, the sequence of numbers
inf {π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛}
is increasing, and we denote its limit by
lim inf π‘Žπ‘› = lim inf {π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛}.
𝑛→∞
𝑛→∞
If (π‘Žπ‘› ) is not bounded below, we put lim inf π‘Žπ‘› = −∞.
𝑛→∞
22
Introductory Analysis 2017 Lecture Manual
MATH2022
Note. 1. Since inf 𝑆 ≤ sup 𝑆 for every nonempty set 𝑆, it follows that lim inf π‘Žπ‘› ≤ lim sup π‘Žπ‘› , where we write
𝑛→∞
𝑛→∞
−∞ < π‘₯ and π‘₯ < ∞ for each π‘₯ ∈ ℝ.
2. The sequence (π‘Žπ‘› ) is bounded if and only if both lim inf π‘Žπ‘› and lim sup π‘Žπ‘› are real numbers (also called finite).
𝑛→∞
𝑛→∞
We have now the following characterization of the limit of a sequence and its existence:
Theorem 2.11. 1. The sequence (π‘Žπ‘› ) converges if and only if lim inf π‘Žπ‘› and lim sup π‘Žπ‘› are finite and equal, and
𝑛→∞
𝑛→∞
then
lim inf π‘Žπ‘› = lim π‘Žπ‘› = lim sup π‘Žπ‘› .
𝑛→∞
𝑛→∞
𝑛→∞
2. lim π‘Žπ‘› = ∞ ⇔ lim inf π‘Žπ‘› = ∞ and lim sup π‘Žπ‘› = ∞.
𝑛→∞
𝑛→∞
𝑛→∞
3. lim π‘Žπ‘› = −∞ ⇔ lim inf π‘Žπ‘› = −∞ and lim sup π‘Žπ‘› = −∞.
𝑛→∞
𝑛→∞
𝑛→∞
Proof. 1. For a sequence (π‘Žπ‘› ), denote 𝑏𝑛 = inf {π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛} and 𝑐𝑛 = sup{π‘Žπ‘˜ ∢ π‘˜ ≥ 𝑛}. Note that 𝑏𝑛 ≤ π‘Žπ‘› ≤ 𝑐𝑛 .
1. Assume that (π‘Žπ‘› ) converges to 𝐿. Let πœ€ > 0. Then there is 𝐾 ∈ β„• such that for all 𝑛 ≥ 𝐾, 𝐿 − 3πœ€ < π‘Žπ‘› < 𝐿 + 3πœ€ .
Hence
πœ€
πœ€
𝐿 − ≤ 𝑏𝑛 ≤ π‘Žπ‘› ≤ 𝑐𝑛 ≤ 𝐿 + .
3
3
Since (𝑏𝑛 ) is increasing and (𝑐𝑛 ) is decreasing, we have
𝐿−
πœ€
πœ€
≤ lim inf π‘Žπ‘› ≤ lim sup π‘Žπ‘› ≤ 𝐿 + .
𝑛→∞
3
3
𝑛→∞
Hence
0 ≤ lim sup π‘Žπ‘› − lim inf π‘Žπ‘› ≤
𝑛→∞
𝑛→∞
2πœ€
< πœ€.
3
From Lemma 2.1 we obtain that lim inf π‘Žπ‘› = lim sup π‘Žπ‘› .
𝑛→∞
𝑛→∞
Conversely, if lim inf π‘Žπ‘› = lim sup π‘Žπ‘› , then it follows from 𝑏𝑛 ≤ π‘Žπ‘› ≤ 𝑐𝑛 and the Sandwich Theorem that (π‘Žπ‘› )
𝑛→∞
𝑛→∞
converges and that
lim inf π‘Žπ‘› = lim π‘Žπ‘› = lim sup π‘Žπ‘› .
𝑛→∞
𝑛→∞
𝑛→∞
2., 3. Exercise.
Another important concept is that of a Cauchy sequence:
Definition 2.8 (Cauchy sequence). A seqence (π‘Žπ‘› ) is called a Cauchy sequence if for all πœ€ > 0 there is 𝐾 ∈ ℝ
such that for all 𝑛, π‘š ∈ β„• with 𝑛, π‘š ≥ 𝐾, |π‘Žπ‘› − π‘Žπ‘š | < πœ€.
Theorem 2.12. A sequence (π‘Žπ‘› ) converges if and only if it is a Cauchy sequence.
Proof. Let (π‘Žπ‘› ) be a convergent sequence with limit 𝐿. Let πœ€ > 0 and let 𝐾 such that |π‘Žπ‘› − 𝐿| < 2πœ€ for 𝑛 ≥ 𝐾.
Then it follows for 𝑛, π‘š ≥ 𝐾 that
|π‘Žπ‘› − π‘Žπ‘š | = |(π‘Žπ‘› − 𝐿) − (π‘Žπ‘š − 𝐿)| ≤ |π‘Žπ‘› − 𝐿| + |π‘Žπ‘š − 𝐿| <
πœ€ πœ€
+ = πœ€.
2 2
Conversely, assume that (π‘Žπ‘› ) is a Cauchy sequence. Let πœ€ > 0 and choose 𝐾 such that |π‘Žπ‘› − π‘Žπ‘š | < 3πœ€ for all
π‘š, 𝑛 ≥ 𝐾. Then
πœ€
πœ€
< π‘Žπ‘› < π‘Žπ‘š +
3
3
πœ€
πœ€
⇒ π‘Žπ‘š − ≤ lim inf π‘Žπ‘› ≤ lim sup π‘Žπ‘› ≤ π‘Žπ‘š +
𝑛→∞
3
3
𝑛→∞
(
) (
)
πœ€
πœ€
2πœ€
⇒ 0 ≤ lim sup π‘Žπ‘› − lim inf π‘Žπ‘› ≤ π‘Žπ‘š +
− π‘Žπ‘š −
=
< πœ€.
𝑛→∞
3
3
3
𝑛→∞
π‘Žπ‘š −
By Lemma 2.1, lim inf π‘Žπ‘› = lim sup π‘Žπ‘› , and an application of Theorem 2.11, part 1, completes the proof.
𝑛→∞
𝑛→∞
MATH2022
23
Introductory Analysis 2017 Lecture Manual
Tutorial 2.2.1. 1. Prove Theorem 2.9.
2. Use suitable rules or first principles to find
(b) lim (𝑛 − 1𝑛 )
(a) lim (𝑛2 + 2𝑛 − 10)
𝑛→∞
𝑛→∞
𝑛3 − 3𝑛2
𝑛→∞ 𝑛 + 1
(c) lim
3. Prove that if lim |π‘Žπ‘› | = ∞, then (π‘Žπ‘› ) diverges.
𝑛→∞
4. Prove that if 𝑝 ∈ β„•, 𝑝 > 0, then 𝑛𝑝 → ∞ as 𝑛 → ∞.
5. Define a sequence as follows:
π‘Ž0 = 0, π‘Ž1 =
1
1
, π‘Ž
= (1 + π‘Žπ‘› + π‘Ž3𝑛−1 ) for 𝑛 ≥ 2.
2 𝑛+1 3
(a) Use induction to show that 0 ≤ π‘Žπ‘› < 23 for all 𝑛 ∈ β„•.
(b) Use induction to show that π‘Žπ‘› ≤ π‘Žπ‘›+1 for all 𝑛 ∈ β„•.
(c) Explain why we may conclude that (π‘Žπ‘› ) converges.
(d) Using the fact that lim π‘Žπ‘› = lim π‘Žπ‘›−1 = lim π‘Žπ‘›+1 , find lim π‘Žπ‘› .
𝑛→∞
𝑛→∞
𝑛→∞
𝑛→∞
6. Let lim π‘Žπ‘› = ∞, lim 𝑏𝑛 = ∞, lim 𝑐𝑛 = 0. Show, by giving examples, that no general conclusion can be made
𝑛→∞
𝑛→∞
𝑛→∞
about the behaviour of the following sequences:
π‘Ž
π‘Ž
(a) π‘Žπ‘› − 𝑏𝑛 , (b) π‘Žπ‘› 𝑐𝑛 , (c) 𝑛 , (d) 𝑛 .
𝑏𝑛
𝑐𝑛
7. Let (π‘Žπ‘› ) and (𝑏𝑛 ) be sequences such that π‘Žπ‘› ≤ 𝑏𝑛 for all 𝑛 ∈ β„•. Show that lim inf π‘Žπ‘› ≤ lim inf 𝑏𝑛 and lim sup π‘Žπ‘› ≤
𝑛→∞
lim sup 𝑏𝑛 .
𝑛→∞
(
)
π‘₯ 𝑛
8. (a) Show that exp(π‘₯) = lim 1 +
exists for all π‘₯ ∈ ℝ and that exp(1) = 𝑒.
𝑛→∞
𝑛
Hint. Adapt the proof of Example 2.2.3.
(b) Use Bernoulli’s inequality to prove that
(
lim
𝑛→∞
1 + π‘₯+𝑦
𝑛
1 + π‘₯+𝑦
+ π‘₯𝑦
𝑛
𝑛2
)𝑛
=1
for all π‘₯, 𝑦 ∈ ℝ.
(c) Use (b) to show that exp(π‘₯ + 𝑦) = exp(π‘₯) exp(𝑦) for all π‘₯, 𝑦 ∈ ℝ.
(d) Show that exp(π‘₯) ≥ 1 + π‘₯ for all π‘₯ > 0.
(e) Show that exp(π‘₯) > 0 for all π‘₯ ∈ ℝ.
(f) Show that exp is strictly increasing.
(g) Show that exp(𝑛) = 𝑒𝑛 for all 𝑛 ∈ β„€.
𝑛→∞
𝑛→∞
Chapter 3
Limits and Continuity of Functions
The single most important concept in all of analysis is that of a limit. Every single notion of analysis is encapsulated
in one sense or another to that of a limit. In the previous chapter, you learnt about limits of sequences. Here this
notion is extended to limits of functions, which leads to the notion of continuity. You have learnt an intuitive
version in Calculus I. Here you will learn a precise definition and you will learn how to prove the results you have
learnt in Calculus I.
In this course all functions will have domains and ranges which are subsets of ℝ unless otherwise stated. Such
functions are called real functions.
3.1
Limits of Functions
Definition 3.1. Let π‘Ž ∈ ℝ. An interval of the form (𝑐, 𝑑) with 𝑐 < π‘Ž < 𝑑 is called a neighbourhood of π‘Ž, and the
set (𝑐, 𝑑) ⧡ {π‘Ž} is called a deleted neighbourhood of π‘Ž.
In the definition of the limit of sequences you have seen formal definitions for ‘𝑛 tends to ∞’ and ‘π‘Žπ‘› tends to 𝐿’,
and the latter one has an obvious generalization to ‘π‘₯ tends to π‘Ž’ and ‘𝑓 (π‘₯) tends to 𝐿’, which you will encounter
in the next definition.
Definition 3.2 (Limit of a function). Let 𝑓 be a real function, π‘Ž, 𝐿 ∈ ℝ and assume that the domain of 𝑓 contains
a deleted neighbourhood of π‘Ž, that is, 𝑓 (π‘₯) is defined for all π‘₯ in a deleted neighbourhood of 𝑓 .
Then ‘𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž’ is defined to mean:
∀ πœ€ > 0 ∃ 𝛿 > 0 ∀ π‘₯ ∈ (π‘Ž − 𝛿, π‘Ž + 𝛿) ⧡ {π‘Ž}, 𝑓 (π‘₯) ∈ (𝐿 − πœ€, 𝐿 + πœ€),
i. e., ∀ πœ€ > 0 ∃ 𝛿 > 0 (0 < |π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€).
If 𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž, then we write lim 𝑓 (π‘₯) = 𝐿.
π‘₯→π‘Ž
Note. 1. 𝐿 in the definition above is also called the limit of the function at π‘Ž. For the definition of the limit of a
function at π‘Ž, the number 𝑓 (π‘Ž) is never used, and indeed, the function need not be defined at π‘Ž.
2. Observe that the number 𝛿 will depend on πœ€, and also on π‘Ž, if we vary π‘Ž.
24
MATH2022
Introductory Analysis 2017 Lecture Manual
25
Exercise. Use the notation in the above definition to label the following graph:
Example 3.1.1. Prove, from first principles, that π‘₯2 → 4 as π‘₯ → 2.
Solution. Let πœ€ > 0. We must show that there is 𝛿 > 0 such that 0 < |π‘₯ − 2| < 𝛿 implies |π‘₯2 − 4| < πœ€. Since we
want to use |π‘₯ − 2| < 𝛿 to get |π‘₯2 − 4| < πœ€, we try to factor out |π‘₯ − 2| from |π‘₯2 − 4|. Thus
|π‘₯2 − 4| = |π‘₯ − 2| |π‘₯ + 2| ≤ |π‘₯ − 2| (|π‘₯ − 2| + 4).
It is often convenient to make an initial restriction on 𝛿, like 𝛿 ≤ 1. Then we can continue the above estimate to
obtain from |π‘₯ − π‘Ž| < 𝛿 that
|π‘₯2 − 4| ≤ |π‘₯ − 2| (1 + 4) = 5|π‘₯ − 2|.
If we now put
}
{
πœ€
,
𝛿 = min 1,
5
then we can conclude (note that we already used 𝛿 ≤ 1) for 0 < |π‘₯ − 2| < 𝛿 that
πœ€
|π‘₯2 − 4| ≤ 5|π‘₯ − 2| < 5 = πœ€.
5
To justify the notation lim 𝑓 (π‘₯) = 𝐿 we have to show
π‘₯→π‘Ž
Theorem 3.1. If 𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž, then 𝐿 is unique.
Proof. Let 𝑓 (π‘₯) → 𝐿 and 𝑓 (π‘₯) → 𝑀 as π‘₯ → π‘Ž. We must show that 𝐿 = 𝑀.
So let πœ€ > 0. Then there are 𝛿1 > 0 and 𝛿2 > 0 such that
πœ€
πœ€
0 < |π‘₯ − π‘Ž| < 𝛿1 ⇒ |𝑓 (π‘₯) − 𝐿| <
and 0 < |π‘₯ − π‘Ž| < 𝛿2 ⇒ |𝑓 (π‘₯) − 𝑀| < .
2
2
For 𝛿 = min{𝛿1 , 𝛿2 } and 0 < |π‘₯ − π‘Ž| < 𝛿 (we indeed only need one such π‘₯ here) it follows that
πœ€ πœ€
|𝐿 − 𝑀| = |(𝐿 − 𝑓 (π‘₯)) − (𝑓 (π‘₯) − 𝑀)| ≤ |𝑓 (π‘₯) − 𝐿| + |𝑓 (π‘₯) − 𝑀| < + = πœ€.
2 2
Hence 𝐿 = 𝑀 by Lemma 2.1.
Definition 3.3 (One sided limits). 1. Let 𝑓 be a real function, π‘Ž, 𝐿 ∈ ℝ and assume that the domain of 𝑓 contains
an interval (π‘Ž, 𝑑) with 𝑑 > π‘Ž, that is, 𝑓 (π‘₯) is defined for all π‘₯ in (π‘Ž, 𝑑).
Then ‘𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž+ ’ is defined to mean:
∀ πœ€ > 0 ∃ 𝛿 > 0 (π‘Ž < π‘₯ < π‘Ž + 𝛿 ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€).
If 𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž+ , then we write lim+ 𝑓 (π‘₯) = 𝐿.
π‘₯→π‘Ž
2. Let 𝑓 be a real function, π‘Ž, 𝐿 ∈ ℝ and assume that the domain of 𝑓 contains an interval (𝑐, π‘Ž) with 𝑐 < π‘Ž, that
is, 𝑓 (π‘₯) is defined for all π‘₯ in (𝑐, π‘Ž).
Then ‘𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž− ’ is defined to mean:
∀ πœ€ > 0 ∃ 𝛿 > 0 (π‘Ž − 𝛿 < π‘₯ < π‘Ž ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€).
If 𝑓 (π‘₯) → 𝐿 as π‘₯ → π‘Ž− , then we write lim− 𝑓 (π‘₯) = 𝐿.
π‘₯→π‘Ž
26
Introductory Analysis 2017 Lecture Manual
MATH2022
Theorem 3.2. If 𝑓 (π‘₯) is defined in a deleted neighbourhood of π‘Ž, then
lim 𝑓 (π‘₯) = 𝐿 ⇔ lim− 𝑓 (π‘₯) = 𝐿 = lim+ 𝑓 (π‘₯).
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
Proof. ⇒: Assume that lim 𝑓 (π‘₯) = 𝐿 and let πœ€ > 0. Then
π‘₯→π‘Ž
∃ 𝛿 > 0 (π‘₯ ∈ (π‘Ž − 𝛿, π‘Ž + 𝛿), π‘₯ ≠ π‘Ž ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€)
gives
∃ 𝛿 > 0 (π‘₯ ∈ (π‘Ž − 𝛿, π‘Ž) ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€) and
∃ 𝛿 > 0 (π‘₯ ∈ (π‘Ž, π‘Ž + 𝛿) ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€).
Hence lim− 𝑓 (π‘₯) = 𝐿 and lim+ 𝑓 (π‘₯) = 𝐿.
π‘₯→π‘Ž
π‘₯→π‘Ž
⇐: Assume that lim− 𝑓 (π‘₯) = 𝐿 = lim+ 𝑓 (π‘₯) and let πœ€ > 0. Then there are 𝛿− > 0 and 𝛿+ > 0 such that
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯ ∈ (π‘Ž − 𝛿− , π‘Ž) ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€ and
π‘₯ ∈ (π‘Ž, π‘Ž + 𝛿+ ) ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€.
Let 𝛿 = min{𝛿− , 𝛿+ }. Then
π‘₯ ∈ (π‘Ž − 𝛿, π‘Ž + 𝛿), π‘₯ ≠ π‘Ž ⇒ π‘₯ ∈ (π‘Ž − 𝛿− , π‘Ž) or π‘₯ ∈ (π‘Ž, π‘Ž + 𝛿+ )
⇒ |𝑓 (π‘₯) − 𝐿| < πœ€.
This proves that lim 𝑓 (π‘₯) = 𝐿.
π‘₯→π‘Ž
π‘₯
for π‘₯ ∈ ℝ ⧡ {0}. Then 𝑓 (π‘₯) = 1 if π‘₯ > 0 and 𝑓 (π‘₯) = −1 if π‘₯ < 0. It is easy
|π‘₯|
to prove from the definition that lim+ 𝑓 (π‘₯) = 1 and lim− 𝑓 (π‘₯) = −1. Now it follows from Theorem 3.2 that the
Example 3.1.2. Let 𝑓 (π‘₯) =
π‘₯→0
π‘₯→0
function 𝑓 does not have a limit as π‘₯ tends to π‘Ž.
Tutorial 3.1.1. 1. Prove from the definitions that
√
1
2
1
(a) 2 + π‘₯ + π‘₯2 → 8 as π‘₯ → 2, (b)
→ − as π‘₯ → , (c) lim− 1 − π‘₯ = 0.
π‘₯→1
π‘₯−2
3
2
2. By negating the definition of limit of a function show that the statement 𝑓 (π‘₯) →
ΜΈ 𝐿 as π‘₯ → π‘Ž is equivalent to the
following:
∃ πœ€ > 0 ∀ 𝛿 > 0 ∃ π‘₯ with 0 < |π‘₯ − π‘Ž| < 𝛿 such that |𝑓 (π‘₯) − 𝐿| ≥ πœ€.
1
3. For π‘₯ ∈ ℝ ⧡ {0} let 𝑓 (π‘₯) = sin . Prove that 𝑓 does not tend to any limit as π‘₯ → 0.
π‘₯
4. Let 𝑓 (π‘₯) = π‘₯ − ⌊π‘₯⌋. For each integer 𝑛, find lim− 𝑓 (π‘₯) and lim+ 𝑓 (π‘₯) if they exist.
π‘₯→𝑛
3.2
π‘₯→𝑛
Limits at Infinity and Infinite Limits
In π‘₯ → π‘Ž the inequalities 0 < |π‘₯ − π‘Ž| < 𝛿 occur. For π‘₯ → ∞, this has to be replaced by π‘₯ > 𝐾. Hence we have the
following
Definition 3.4. 1. Let 𝑓 be a real function defined on a set containing an interval of the form (𝑐, ∞). Then
‘𝑓 (π‘₯) → 𝐿 as π‘₯ → ∞’ is defined to mean:
∀ πœ€ > 0 ∃ 𝐾(> 0) (π‘₯ > 𝐾 ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€).
If 𝑓 (π‘₯) → 𝐿 as π‘₯ → ∞, then we write lim 𝑓 (π‘₯) = 𝐿.
π‘₯→∞
2. Let 𝑓 be a real function defined on a set containing an interval of the form (−∞, 𝑐).
Then ‘𝑓 (π‘₯) → 𝐿 as π‘₯ → −∞’ is defined to mean:
∀ πœ€ > 0 ∃ 𝐾(< 0) (π‘₯ < 𝐾 ⇒ |𝑓 (π‘₯) − 𝐿| < πœ€).
If 𝑓 (π‘₯) → 𝐿 as π‘₯ → −∞, then we write lim 𝑓 (π‘₯) = 𝐿.
π‘₯→−∞
MATH2022
27
Introductory Analysis 2017 Lecture Manual
Example 3.2.1. Let 𝑓 (π‘₯) =
1
. Find lim 𝑓 (π‘₯) and lim 𝑓 (π‘₯).
π‘₯→∞
π‘₯→−∞
π‘₯
Solution. Guess that both limits are 0. Let πœ€ > 0 and put 𝐾 =
1
. Then π‘₯ > 𝐾 gives
πœ€
|1| 1
| | = < 1 = πœ€.
|π‘₯| π‘₯ 𝐾
| |
So lim 𝑓 (π‘₯) = 0.
π‘₯→∞
1
Now let πœ€ > 0 and put 𝐾 = − . Then π‘₯ < 𝐾 gives
πœ€
|1|
| | = − 1 < 1 = πœ€.
|π‘₯|
π‘₯ −𝐾
| |
So lim 𝑓 (π‘₯) = 0.
π‘₯→−∞
For infinite limits we have similar definitions:
Definition 3.5. Let 𝑓 be a real function whose domain includes a deleted neighbourhood of the number π‘Ž.
1. ‘𝑓 (π‘₯) → ∞ as π‘₯ → π‘Ž’ is defined to mean:
∀ 𝐾(> 0) ∃ 𝛿 > 0 (0 < |π‘₯ − π‘Ž| < 𝛿 ⇒ 𝑓 (π‘₯) > 𝐾).
If 𝑓 (π‘₯) → ∞ as π‘₯ → π‘Ž, then we write lim 𝑓 (π‘₯) = ∞.
π‘₯→π‘Ž
2. ‘𝑓 (π‘₯) → −∞ as π‘₯ → π‘Ž’ is defined to mean:
∀ 𝐾(< 0) ∃ 𝛿 > 0 (0 < |π‘₯ − π‘Ž| < 𝛿 ⇒ 𝑓 (π‘₯) < 𝐾).
If 𝑓 (π‘₯) → −∞ as π‘₯ → π‘Ž, then we write lim 𝑓 (π‘₯) = −∞.
π‘₯→π‘Ž
Example 3.2.2. Prove that
2
→ ∞ as π‘₯ → 0.
π‘₯2
Solution. The following sketch illustrates the result.
40
35
30
25
20
15
10
5
−2
−1
0
1
2
We may choose 𝐾 to be sufficiently large. So choose 𝐾 > 1 and let 𝛿 = 𝐾1 . Then, for 0 < |π‘₯| < 𝛿, and because of
𝛿 < 1 we have
2
1 2
2
2
=
≥
> = 2𝐾 > 𝐾.
|π‘₯| |π‘₯| |π‘₯| 𝛿
π‘₯2
Definition 3.6. Let 𝑓 be a real function defined on a set containing an interval of the form (𝑐, ∞).
Then ‘𝑓 (π‘₯) → ∞ as π‘₯ → ∞’ is defined to mean:
∀ 𝐴(> 0) ∃ 𝐾(> 0) (π‘₯ > 𝐾 ⇒ 𝑓 (π‘₯) > 𝐴).
If 𝑓 (π‘₯) → ∞ as π‘₯ → ∞, then we write lim 𝑓 (π‘₯) = ∞.
π‘₯→∞
Similar definition hold with all other combinations of limits involving ∞ and −∞, including one-sided limits.
28
Introductory Analysis 2017 Lecture Manual
MATH2022
π‘₯
sin π‘₯
Example 3.2.3. Prove that 𝑓 (π‘₯) =
+ 10
→ ∞ as π‘₯ → ∞. A section of the function’s graph is plotted
10
π‘₯
below.
8
6
4
2
20
40
60
80
Additional notes. 1. You have learnt different notations for limits depending on whether real numbers, ∞ or
−∞ are considered. However, using the notion of (deleted) neighbourhood, one can give just one definition which
applies to each case. To define (deleted) neighbourhoods of ∞ and −∞, we first observe that ∞ and −∞ are just
symbols for our purposes, and thus never belong to sets of numbers which we consider. Therefore, no distinction
is made here between neighbourhoods and deleted neighbourhoods of ∞ and −∞, respectively.
A (deleted) neighbourhood of ∞ is any set of the form (𝑐, ∞) with 𝑐 ∈ ℝ.
A (deleted) neighbourhood of −∞ is any set of the form (−∞, 𝑑) with 𝑑 ∈ ℝ.
Now let π‘Ž, 𝐿 ∈ ℝ ∪ {∞, −∞} and 𝑓 be a function defined in a deleted neighbourhood of π‘Ž. Then 𝑓 (π‘₯) → 𝐿 as
π‘₯ → π‘Ž if and only if for each neighbourhood π‘ˆ of 𝐿 there is a deleted neighbourhood 𝑉 of π‘Ž such that π‘₯ ∈ 𝑉
implies 𝑓 (π‘₯) ∈ π‘ˆ .
If we define one-sided deleted neighbourhoods of π‘Ž ∈ ℝ to be the sets (𝑐, π‘Ž) and (π‘Ž, 𝑑), respectively, then the above
formulation also covers one-sided limits.
2. What we call neighbourhood here should actually be called basic neighbourhood. In general topology, a neighborhood is any subset (of a given set) which contains a basic neighbourhood. But since we do not need general
neighbourhoods, we conveniently dropped the word ’basic’.
1
. Find lim− 𝑓 (π‘₯), lim+ 𝑓 (π‘₯), lim 𝑓 (π‘₯).
π‘₯→0
π‘₯→0
π‘₯→0
π‘₯
1
1
1
Solution. Let 𝐾 > 0 and put 𝛿 = . Then, for 0 < π‘₯ < 𝛿, > = 𝐾. Thus lim+ 𝑓 (π‘₯) = ∞.
π‘₯→0
𝐾
π‘₯ 𝛿
1
1
1
Now let 𝐾 < 0 and put 𝛿 = − > 0. Then, for −𝛿 < π‘₯ < 0, <
= 𝐾. Thus lim− 𝑓 (π‘₯) = −∞.
π‘₯→0
𝐾
π‘₯ −𝛿
Since lim− 𝑓 (π‘₯) ≠ lim+ 𝑓 (π‘₯), lim 𝑓 (π‘₯) does not exist.
Example 3.2.4. Let 𝑓 (π‘₯) =
π‘₯→0
π‘₯→0
π‘₯→0
Tutorial 3.2.1. 1. Prove from the definitions that
1 − 3π‘₯
3
1
= − , (b)
→ ∞ as π‘₯ → 1+ ,
(a) lim
π‘₯→−∞ 2π‘₯ − 1
2
π‘₯−1
(c)
1
→ −∞ as π‘₯ → 1− .
π‘₯−1
MATH2022
3.3
29
Introductory Analysis 2017 Lecture Manual
Limit Laws
Rather than calculating limits from the definition, in general one will use limit laws. In this section we state and
prove some of these laws.
Theorem 3.3 (Limit Laws). Let π‘Ž, 𝑐 ∈ ℝ and suppose that the real functions 𝑓 and 𝑔 are defined in a deleted
neighbourhood of π‘Ž and that lim 𝑓 (π‘₯) = 𝐿 ∈ ℝ and lim 𝑔(π‘₯) = 𝑀 ∈ ℝ both exist. Then
π‘₯→π‘Ž
π‘₯→π‘Ž
(a) lim 𝑐 = 𝑐.
π‘₯→π‘Ž
(b) lim [𝑓 (π‘₯) + 𝑔(π‘₯)] = lim 𝑓 (π‘₯) + lim 𝑔(π‘₯) = 𝐿 + 𝑀.
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
(c) lim [𝑓 (π‘₯) − 𝑔(π‘₯)] = lim 𝑓 (π‘₯) − lim 𝑔(π‘₯) = 𝐿 − 𝑀.
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
(d) lim [𝑐𝑓 (π‘₯)] = 𝑐 lim 𝑓 (π‘₯) = 𝑐𝐿.
π‘₯→π‘Ž
π‘₯→π‘Ž
[
][
]
(e) lim [𝑓 (π‘₯)𝑔(π‘₯)] = lim 𝑓 (π‘₯) lim 𝑔(π‘₯) = 𝐿𝑀.
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
lim 𝑓 (π‘₯)
𝑓 (π‘₯) π‘₯→π‘Ž
𝐿
=
=
.
π‘₯→π‘Ž 𝑔(π‘₯)
lim 𝑔(π‘₯)
𝑀
(f) If 𝑀 ≠ 0, lim
π‘₯→π‘Ž
𝑓 (π‘₯)
does not exist.
π‘₯→π‘Ž 𝑔(π‘₯)
[
]𝑛
(h) If 𝑛 ∈ β„•, lim [𝑓 (π‘₯)𝑛 ] = lim 𝑓 (π‘₯) = 𝐿𝑛 .
(g) If 𝐿 ≠ 0 and 𝑀 = 0, lim
π‘₯→π‘Ž
π‘₯→π‘Ž
(i) lim π‘₯ = π‘Ž.
π‘₯→π‘Ž
(j) If 𝑛 ∈ β„•, lim π‘₯𝑛 = π‘Žπ‘› .
π‘₯→π‘Ž
√
√
(k) If 𝑛 ∈ β„•, lim 𝑛 π‘₯ = 𝑛 π‘Ž. If 𝑛 is even, we assume that π‘Ž > 0.
π‘₯→π‘Ž
√
√
√
𝑛
(l) If 𝑛 ∈ β„•, lim 𝑛 𝑓 (π‘₯) = 𝑛 lim 𝑓 (π‘₯) = 𝐿.
π‘₯→π‘Ž
π‘₯→π‘Ž
(m) If lim |𝑓 (π‘₯)| = 0, then lim 𝑓 (π‘₯) = 0.
π‘₯→π‘Ž
π‘₯→π‘Ž
Proof. The proofs are similar to those in Theorem 2.3 and we will only prove (b), (e), (f).
(b) Let πœ€ > 0. Then there are numbers 𝛿1 > 0 and 𝛿2 > 0 such that
(i) |𝑓 (π‘₯) − 𝐿| < 2πœ€ if 0 < |π‘₯ − π‘Ž| < 𝛿1 ,
(ii) |𝑔(π‘₯) − 𝑀| < 2πœ€ if 0 < |π‘₯ − π‘Ž| < 𝛿2 .
Put 𝛿 = min{𝛿1 , 𝛿2 }. For 0 < |π‘₯ − π‘Ž| < 𝛿 we have 0 < |π‘₯ − π‘Ž| < 𝛿1 and 0 < |π‘₯ − π‘Ž| < 𝛿2 and therefore
|(𝑓 (π‘₯) + 𝑔(π‘₯)) − (𝐿 + 𝑀)| = |(𝑓 (π‘₯) − 𝐿) + (𝑔(π‘₯) − 𝑀)| ≤ |𝑓 (π‘₯) − 𝐿| + |𝑔(π‘₯) − 𝑀| <
πœ€ πœ€
+ = πœ€.
2 2
Hence 𝑓 (π‘₯) + 𝑔(π‘₯) converges to 𝐿 + 𝑀 as π‘₯ → π‘Ž.
(e) We consider 2 cases: one special case, to which then the general case is reduced.
Case I: 𝐿 = 𝑀 = 0. Let πœ€ > 0 and put πœ€′ = min{1, πœ€}. Then 0 < πœ€′ ≤ πœ€, and there are numbers 𝛿1 and 𝛿2 such
that
(i) |𝑓 (π‘₯)| < πœ€′ if 0 < |π‘₯ − π‘Ž| < 𝛿1 ,
(ii) |𝑔(π‘₯)| < πœ€′ if 0 < |π‘₯ − π‘Ž| < 𝛿2 .
Put 𝛿 = min{𝛿1 , 𝛿2 }. For 0 < |π‘₯ − π‘Ž| < 𝛿 we have
2
|𝑓 (π‘₯)𝑔(π‘₯)| = |𝑓 (π‘₯)| |𝑔(π‘₯)| < πœ€′ ≤ πœ€′ ≤ πœ€.
Case II: 𝐿 and 𝑀 are arbitrary. Then
𝑓 (π‘₯)𝑔(π‘₯) = (𝑓 (π‘₯) − 𝐿)(𝑔(π‘₯) − 𝑀) + 𝐿(𝑔(π‘₯) − 𝑀) + 𝑓 (π‘₯)𝑀.
30
Introductory Analysis 2017 Lecture Manual
MATH2022
By (a), (b), (c), (𝑓 (π‘₯) − 𝑀) → 0 and (𝑔(π‘₯) − 𝑀) → 0 as π‘₯ → π‘Ž, and by (b), (c), (d), and Case I, it follows that
lim 𝑓 (π‘₯)𝑔(π‘₯) = 0 + 𝐿 ⋅ 0 + 𝐿𝑀 = 𝐿𝑀.
π‘₯→π‘Ž
(f) First consider 𝑓 = 1. Since 𝑀 ≠ 0 and 𝑔(π‘₯) → 𝑀 as π‘₯ → π‘Ž, there is 𝛿0 > 0 such that |𝑔(π‘₯) − 𝑀| <
0 < |π‘₯ − π‘Ž| < 𝛿0 . Then, for 0 < |π‘₯ − π‘Ž| < 𝛿0 ,
|𝑀|
for
2
|𝑀| |𝑀|
=
2
2
| 1
| |𝑀 − 𝑔(π‘₯)| 2|𝑔(π‘₯) − 𝑀|
1
|=
⇒ ||
−
≤
.
|
|𝑔(π‘₯)𝑀|
|𝑀|2
| 𝑔(π‘₯) 𝑀 |
|𝑔(π‘₯)| ≥ |𝑀| − |𝑔(π‘₯) − 𝑀| ≥ |𝑀| −
Now let πœ€ > 0 and 𝛿1 > 0 such that 0 < |π‘₯ − π‘Ž| < 𝛿1 implies |𝑔(π‘₯) − 𝑀| ≤
follows for 0 < |π‘₯ − π‘Ž| < 𝛿 that
|𝑀|2
πœ€. Put 𝛿 = min{𝛿0 , 𝛿1 }. It
2
| 1
|𝑀|2
1 || 2|𝑔(π‘₯) − 𝑀|
1
|
<2
= πœ€.
πœ€
| 𝑔(π‘₯) − 𝑀 | ≤
2
2
2
|𝑀|
|𝑀|
|
|
The general case now follows with (d):
𝑓 (π‘₯)
1
1
= lim 𝑓 (π‘₯) ⋅ lim
=𝐿⋅
.
π‘₯→π‘Ž 𝑔(π‘₯)
π‘₯→π‘Ž
π‘₯→π‘Ž 𝑔(π‘₯)
𝑀
lim
Recall that a polynomial function is of the form
𝑓 (π‘₯) = 𝑏𝑛 π‘₯𝑛 + 𝑏𝑛−1 π‘₯𝑛−1 + β‹― + 𝑏2 π‘₯2 + 𝑏1 π‘₯ + 𝑏0
with 𝑏𝑖 ∈ ℝ for 𝑖 = 1, 2, … , 𝑛 and 𝑛 any non-negative integer. A rational function is of the form 𝑓 (π‘₯) =
𝑝(π‘₯)
with
π‘ž(π‘₯)
𝑝(π‘₯) and π‘ž(π‘₯) polynomials. We then have the following as a consequence of Theorem 2.3, (b),(d),(i),(j).
Corollary 3.4. If 𝑓 is a polynomial or a rational function and π‘Ž is in the domain of 𝑓 , then lim 𝑓 (π‘₯) = 𝑓 (π‘Ž).
π‘₯→π‘Ž
Corollary 3.5. All the limit rules in Theorem 3.3 remain true if π‘₯ → π‘Ž is replaced by any of the following: π‘₯ → π‘Ž+ ,
π‘₯ → π‘Ž− , π‘₯ → ∞, π‘₯ → −∞.
Proof. For π‘₯ → π‘Ž+ and π‘₯ → π‘Ž− one just has to replace 0 < |π‘₯ − π‘Ž| < 𝛿 with 0 < π‘₯ − π‘Ž < 𝛿 and −𝛿 < −(π‘₯ − π‘Ž) < 0,
respectively, in the proof of each of the statements. For π‘₯ → ∞ and π‘₯ → −∞, the proofs are very similar to those
for sequences.
Similar rules hold if the functions have infinite limits. We state some of the results for π‘₯ → π‘Ž, observing that there
are obvious extensions as in Corollary 3.5.
Theorem 3.6. Assume that lim 𝑓 (π‘₯) = ∞, lim 𝑔(π‘₯) = ∞ and lim β„Ž(π‘₯) = 𝑐 ∈ ℝ. Then
π‘₯→π‘Ž
(a) 𝑓 (π‘₯) + 𝑔(π‘₯) → ∞ as π‘₯ → π‘Ž,
(b) 𝑓 (π‘₯) + β„Ž(π‘₯) → ∞ as π‘₯ → π‘Ž,
(c) 𝑓 (π‘₯)𝑔(π‘₯) → ∞
{ as π‘₯ → π‘Ž,
∞
if c>0
(d) 𝑓 (π‘₯)β„Ž(π‘₯) →
−∞ if c<0
π‘₯→π‘Ž
π‘₯→π‘Ž
as π‘₯ → π‘Ž.
Proof. We prove (c) and leave the other parts as exercises.
Let 𝐴 > 1 (we can take any 𝐴 ∈ ℝ, but the restriction 𝐴 > 1 is convenient here). Then there are 𝛿1 > 0 and 𝛿2 > 0
such that
(i) 𝑓 (π‘₯) > 𝐴 if 0 < |π‘₯ − π‘Ž| < 𝛿1 ,
(ii) 𝑔(π‘₯) > 𝐴 if 0 < |π‘₯ − π‘Ž| < 𝛿2 .
MATH2022
Introductory Analysis 2017 Lecture Manual
31
With 𝛿 = min{𝛿1 , 𝛿2 } it follows for 0 < |π‘₯ − π‘Ž| < 𝛿 that
𝑓 (π‘₯)𝑔(π‘₯) > 𝐴 ⋅ 𝐴 > 𝐴.
Theorem 3.7 (Sandwich Theorem). Let π‘Ž ∈ ℝ ∪ {∞, −∞} and assume that 𝑓 , 𝑔 and β„Ž are real functions defined
in a deleted neighbourhood of π‘Ž. If 𝑓 (π‘₯) ≤ 𝑔(π‘₯) ≤ β„Ž(π‘₯) for π‘₯ in a deleted neighbourhood of π‘Ž and
lim 𝑓 (π‘₯) = 𝐿 = lim β„Ž(π‘₯), then lim 𝑔(π‘₯) = 𝐿.
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
Proof. Note that 𝐿 ∈ ℝ ∪ {∞, −∞}. We will prove this theorem in the case π‘Ž ∈ ℝ and 𝐿 ∈ ℝ. The other cases
are left as an exercise.
Let πœ€ > 0. Then there are 𝛿1 and 𝛿2 such that
(i) |𝑓 (π‘₯) − 𝐿| < πœ€ if 0 < |π‘₯ − π‘Ž| < 𝛿1 ,
(ii) |β„Ž(π‘₯) − 𝐿| < πœ€ if 0 < |π‘₯ − π‘Ž| < 𝛿2 .
Put 𝛿 = min{𝛿1 , 𝛿2 }. Then, for 0 < |π‘₯ − π‘Ž| < 𝛿,
𝐿 − πœ€ < 𝑓 (π‘₯) < 𝐿 + πœ€,
𝐿 − πœ€ < β„Ž(π‘₯) < 𝐿 + πœ€
gives
𝐿 − πœ€ < 𝑓 (π‘₯) ≤ 𝑔(π‘₯) ≤ β„Ž(π‘₯) < 𝐿 + πœ€.
Hence |𝑔(π‘₯) − 𝐿| < πœ€ if 0 < |π‘₯ − π‘Ž| < πœ€.
Theorem 3.8. Let 𝑓 be defined on an interval (π‘Ž, 𝑏), where π‘Ž = −∞ and 𝑏 = ∞ are allowed. With the convenient
notation π‘Ž+ = −∞ if π‘Ž = −∞ and 𝑏− = ∞ if 𝑏 = ∞, we obtain
(a) if 𝑓 is increasing, then
lim− 𝑓 (π‘₯) = sup{𝑓 (π‘₯) ∢ π‘₯ ∈ (π‘Ž, 𝑏)} and lim+ 𝑓 (π‘₯) = inf {𝑓 (π‘₯) ∢ π‘₯ ∈ (π‘Ž, 𝑏)};
π‘₯→𝑏
π‘₯→π‘Ž
(b) if 𝑓 is decreasing, then
lim− 𝑓 (π‘₯) = inf {𝑓 (π‘₯) ∢ π‘₯ ∈ (π‘Ž, 𝑏)} and lim+ 𝑓 (π‘₯) = sup{𝑓 (π‘₯) ∢ π‘₯ ∈ (π‘Ž, 𝑏)}.
π‘₯→𝑏
π‘₯→π‘Ž
Proof. Since all four cases have similar proofs, we only prove (a) in the case 𝑏 ∈ ℝ.
Let 𝐿 = sup{𝑓 (π‘₯) ∢ π‘₯ ∈ (π‘Ž, 𝑏)}.
Case I: 𝐿 ∈ ℝ.
Let πœ€ > 0. By Theorem 1.9 there is 𝑐 ∈ (π‘Ž, 𝑏) such that 𝐿 − πœ€ < 𝑓 (𝑐). Put 𝛿 = 𝑏 − 𝑐 > 0. Now let 𝑏 − 𝛿 < π‘₯ < 𝑏,
i. e., π‘₯ ∈ (𝑐, 𝑏). Then 𝑐 < π‘₯ gives 𝑓 (𝑐) ≤ 𝑓 (π‘₯) since 𝑓 is increasing and 𝑓 (π‘₯) ≤ 𝐿 for all π‘₯ ∈ (𝑐, 𝑏) ⊂ (π‘Ž, 𝑏) by
definition of the supremum, so that
𝐿 − πœ€ < 𝑓 (𝑐) ≤ 𝑓 (π‘₯) ≤ 𝐿 < 𝐿 + πœ€
for these π‘₯. By definition, this mean 𝑓 (π‘₯) → 𝐿 as π‘₯ → 𝑏− .
Case II: 𝐿 = ∞. In this case, {𝑓 (π‘₯) ∢ π‘₯ ∈ (π‘Ž, 𝑏)} is not bounded above. Therefore, for each 𝐴 ∈ ℝ there is
𝑐 ∈ (π‘Ž, 𝑏) such that 𝑓 (𝑐) > 𝐴. Since 𝑓 is increasing, it follows for all π‘₯ ∈ (𝑐, 𝑏) that 𝐴 < 𝑓 (𝑐) ≤ 𝑓 (π‘₯). Therefore
𝑓 (π‘₯) → ∞ as π‘₯ → 𝑏− .
Tutorial 3.3.1. 1. Let 𝑛 be a positive integer. Prove that
(a) lim π‘₯𝑛 = ∞,
π‘₯→∞
{
∞
if n is even,
𝑛
(b) lim π‘₯ =
π‘₯→−∞
−∞ if n is odd,
(c) lim+ π‘₯−𝑛 = ∞,
π‘₯→0
{
∞
if n is even,
−𝑛
(d) lim− π‘₯ =
π‘₯→0
−∞ if n is odd.
2. (a) Let 𝑓 , 𝑔 be defined in a deleted neighbourhood of π‘Ž and assume that 𝑓 (π‘₯) < 𝑔(π‘₯) for all π‘₯ in a deleted
neighbourhood of π‘Ž. Show that if lim 𝑓 (π‘₯) = 𝐿 and lim 𝑔(π‘₯) = 𝑀 exist, then 𝐿 ≤ 𝑀.
π‘₯→π‘Ž
π‘₯→π‘Ž
(b) Give examples for 𝐿 < 𝑀 and for 𝐿 = 𝑀 in (a).
(c) Formulate and prove the result corresponding to (a) for one-sided limits.
32
Introductory Analysis 2017 Lecture Manual
MATH2022
3. Using rules for limits, determine the behaviour of 𝑓 (π‘₯) as π‘₯ tends to the given limit:
4π‘₯
(a) 𝑓 (π‘₯) =
as π‘₯ → 3− ,
3−π‘₯
(π‘₯ − 4)(π‘₯ − 1)
(b) 𝑓 (π‘₯) =
as π‘₯ → 2+ ,
π‘₯−2
2π‘₯ + 1
(c) 𝑓 (π‘₯) =
as π‘₯ → 0+ .
π‘₯2 − π‘₯
3.4
Continuity of Functions
Definition 3.7 (Continuity of a function at a point). Let 𝑓 be a real function, π‘Ž ∈ ℝ and assume that the domain
of 𝑓 contains a neighbourhood of π‘Ž, that is, 𝑓 (π‘₯) is defined for all π‘₯ in a neighbourhood of π‘Ž.
We say that 𝑓 is continuous at π‘Ž if
∀ πœ€ > 0 ∃ 𝛿 > 0 ∀ π‘₯ ∈ (π‘Ž − 𝛿, π‘Ž + 𝛿), 𝑓 (π‘₯) ∈ (𝑓 (π‘Ž) − πœ€, 𝑓 (π‘Ž) + πœ€),
i. e., ∀ πœ€ > 0 ∃ 𝛿 > 0 (|π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ€).
Note that the sketch on page 25 also illustrates the definition of continuity.
We realize that the definition of the existence of a limit of a function at a point and continuity at that point are very
similar, but that there are subtle (and important) differences:
For limits, 𝑓 does not need to be defined at π‘Ž, and even if 𝑓 (π‘Ž) exists, this value is not used at all when finding the
limit of the function 𝑓 at π‘Ž.
We conclude
𝑓 is continuous at π‘Ž ⇔ ∀ πœ€ > 0 ∃ 𝛿 > 0 (|π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ€) (by definition)
⇔ ∀ πœ€ > 0 ∃ 𝛿 > 0 (0 < |π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ€)
(∡ π‘₯ = π‘Ž ⇒ 𝑓 (π‘₯) = 𝑓 (π‘Ž))
⇔ lim 𝑓 (π‘₯) = 𝑓 (π‘Ž).
π‘₯→π‘Ž
Hence we have shown
Theorem 3.9. 𝑓 is continuous at π‘Ž if and only if the following three conditions are satisfied:
1. 𝑓 (π‘Ž) is defined, i. e., π‘Ž is in the domain of 𝑓 ,
2. lim 𝑓 (π‘₯) exists, i.e., lim− 𝑓 (π‘₯) = lim+ 𝑓 (π‘₯), and
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
3. 𝑓 (π‘Ž) = lim 𝑓 (π‘₯).
π‘₯→π‘Ž
Example 3.4.1. 1. Let
{
𝑓 (π‘₯) =
π‘₯2
if π‘₯ ≠ 2,
2
if π‘₯ = 2.
Then lim 𝑓 (π‘₯) = 4 exists, but this limit is different from 𝑓 (2) = 2. Hence 𝑓 is not continuous at 2.
π‘₯→2
sin π‘₯
for π‘₯ ≠ 0 while 𝑓 is not defined at π‘₯ = 0. Then lim 𝑓 (π‘₯) = 1 exists, but 𝑓 is not defined at 0.
π‘₯→0
π‘₯
Hence 𝑓 is not continuous at 0.
3. Let
{
sin π‘₯
if π‘₯ ≠ 0,
𝑓 (π‘₯) =
π‘₯
1
if π‘₯ = 0.
2. Let 𝑓 (π‘₯) =
Then lim 𝑓 (π‘₯) = 1 exists, and 𝑓 (0) = 1. Hence 𝑓 is continuous at 0.
π‘₯→0
MATH2022
Introductory Analysis 2017 Lecture Manual
33
Theorem 3.10. If 𝑓 and 𝑔 are continuous at π‘Ž and if 𝑐 ∈ ℝ, then
1. the sum 𝑓 + 𝑔,
2. the difference 𝑓 − 𝑔,
3. the product 𝑓 𝑔,
𝑓
4. the quotient if 𝑔(π‘Ž) ≠ 0 and
𝑔
5. the scalar multiple 𝑐𝑓
are functions that are also continuous at π‘Ž.
Proof. The statements follow immediate from the limit laws, Theorem 3.3, and Theorem 3.9.
For example, for 3. we have
lim 𝑓 (π‘₯) = 𝑓 (π‘Ž) and lim 𝑔(π‘₯) = 𝑔(π‘Ž),
π‘₯→π‘Ž
π‘₯→π‘Ž
and then Theorem 3.3 gives
lim (𝑓 𝑔)(π‘₯) = lim 𝑓 (π‘₯)𝑔(π‘₯) = lim 𝑓 (π‘₯) lim 𝑔(π‘₯) = 𝑓 (π‘Ž)𝑔(π‘Ž) = (𝑓 𝑔)(π‘Ž).
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
Then Theorem 3.9 says that 𝑓 𝑔 is continuous at π‘Ž.
Recall that the composite 𝑔◦𝑓 of two functions 𝑓 and 𝑔 is defined by (𝑔◦𝑓 )(π‘₯) = 𝑔(𝑓 (π‘₯)).
Theorem 3.11. If 𝑓 is continuous at π‘Ž and 𝑔 is continuous at 𝑓 (π‘Ž), then 𝑔◦𝑓 is continuous at π‘Ž.
Proof. Let πœ€ > 0. Since 𝑔 is continuous at 𝑓 (π‘Ž), there is πœ‚ > 0 such that
|𝑦 − 𝑓 (π‘Ž)| < πœ‚ ⇒ |𝑔(𝑦) − 𝑔(𝑓 (π‘Ž))| < πœ€.
(1)
Since 𝑓 is continuous at π‘Ž, there is 𝛿 > 0 such that
|π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ‚.
Putting 𝑦 = 𝑓 (π‘₯) in (1) it follows from (1) and (2) that
|π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ‚ ⇒ |𝑔(𝑓 (π‘₯)) − 𝑔(𝑓 (π‘Ž))| < πœ€
that is,
|π‘₯ − π‘Ž| < 𝛿 ⇒ |(𝑔◦𝑓 )(π‘₯) − (𝑔◦𝑓 )(π‘Ž)| < πœ€.
Hence 𝑔◦𝑓 is continuous at π‘Ž.
Definition 3.8. 1. A function 𝑓 is continuous from the right at π‘Ž if lim+ 𝑓 (π‘₯) = 𝑓 (π‘Ž).
π‘₯→π‘Ž
2. A function 𝑓 is continuous from the left at π‘Ž if lim− 𝑓 (π‘₯) = 𝑓 (π‘Ž).
π‘₯→π‘Ž
Example 3.4.2. 1. Let
⎧ |π‘₯| + π‘₯
βŽͺ
𝑓 (π‘₯) = ⎨ 2π‘₯
βŽͺ0
⎩
if π‘₯ ≠ 0,
if π‘₯ = 0.
Determine the right and left continuity of 𝑓 at π‘₯ = 0.
Solution. 𝑓 (0) = 0 whilst
lim 𝑓 (π‘₯) = lim−
|π‘₯| + π‘₯
−π‘₯ + π‘₯
= lim−
= lim− 0 = 0
π‘₯→0
π‘₯→0
2π‘₯
2π‘₯
lim 𝑓 (π‘₯) = lim+
|π‘₯| + π‘₯
π‘₯+π‘₯
= lim+
= lim− 1 = 1.
π‘₯→0
π‘₯→0
2π‘₯
2π‘₯
π‘₯→0−
π‘₯→0
and
π‘₯→0+
π‘₯→0
(2)
34
Introductory Analysis 2017 Lecture Manual
MATH2022
Since 𝑓 (0) = 0 = lim− 𝑓 (π‘₯), 𝑓 is continuous from the left at π‘₯ = 0. Since 𝑓 (0) = 0 ≠ 1 = lim+ 𝑓 (π‘₯), 𝑓 is not
π‘₯→0
π‘₯→0
continuous from the right at π‘₯ = 0.
Note. 1. It is easy to show that 𝑓 is continuous at π‘Ž if and only if 𝑓 is continuous from the right and continuous
from the left at π‘Ž.
2. If π‘Ž ∈ dom(𝑓 ) and if there is πœ€ > 0 such that dom(𝑓 ) ∩ (π‘Ž − πœ€, π‘Ž + πœ€) = (π‘Ž − πœ€, π‘Ž], then we say that 𝑓 is continuous
at π‘Ž if lim− = 𝑓 (π‘Ž).
π‘₯→π‘Ž
3. If π‘Ž ∈ dom(𝑓 ) and if there is πœ€ > 0 such that dom(𝑓 ) ∩ (π‘Ž − πœ€, π‘Ž + πœ€) = [π‘Ž, π‘Ž + πœ€), then we say that 𝑓 is continuous
at π‘Ž if lim+ = 𝑓 (π‘Ž).
π‘₯→π‘Ž
4. The convention in 2 and 3 is consistent with what you will learn in General Topology about continuity. Just
note that the condition |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ€ has to be checked for all π‘₯ ∈ dom(𝑓 ) which satisfy |π‘₯ − π‘Ž| < 𝛿.
Lemma 3.12. If 𝑓 (π‘₯) → 𝑏 as π‘₯ → π‘Ž (π‘Ž+ , π‘Ž− ) and 𝑔 is continuous at 𝑏, then 𝑔(𝑓 (π‘₯)) → 𝑔(𝑏) as π‘₯ → π‘Ž (π‘Ž+ , π‘Ž− ),
which can be written, e. g., as
(
)
lim 𝑔(𝑓 (π‘₯)) = 𝑔 lim 𝑓 (π‘₯) .
π‘₯→π‘Ž
Proof. The function
{
𝑓̃(π‘₯) =
π‘₯→π‘Ž
𝑓 (π‘₯)
if π‘₯ ∈ dom(𝑓 ), π‘₯ ≠ π‘Ž,
𝑏
if π‘₯ = π‘Ž,
is continuous (from the right, from the left) at π‘Ž. Hence the result follows from Theorem 3.11.
Definition 3.9. A function is continuous on a set 𝑋 ⊂ ℝ if 𝑓 is continuous at each π‘₯ ∈ 𝑋. Here continuity is
understood in the sense of the above note with 𝑋 = dom(𝑓 ).
√
Example 3.4.3. Show that 𝑓 (π‘₯) = π‘₯2 − 4 is continuous.
Solution. The domain of 𝑓 is {π‘₯ ∈ ℝ ∢ |π‘₯| ≥ 2} = (−∞, −2] ∪ [2, ∞). By Theorem 3.10, the function π‘₯ ξ‚Ά→ π‘₯2 − 4
is continuous on ℝ, and by Theorem 3.3 (k), the square root is continuous at each positive number. So also the
composite function 𝑓 is continuous at π‘Ž. Also, the proof of Theorem 3.3 (k) can be easily adapted to show that the
square root is continuous from the right at 0. Then it easily follows that 𝑓 is continuous (from the right) at 2 and
continuous (from the left) at −2.
Theorem 3.13. The following functions are continuous on their domains.
1. Polynomials 𝑝(π‘₯) = π‘Žπ‘› π‘₯𝑛 + π‘Žπ‘›−1 π‘₯𝑛−1 + β‹― + π‘Ž2 π‘₯2 + π‘Ž1 π‘₯ + π‘Ž0 , π‘Žπ‘– ∈ ℝ, 𝑛 ∈ β„•.
2. Rational functions
𝑝(π‘₯)
, 𝑝 and π‘ž ≠ 0 polynomials.
π‘ž(π‘₯)
3. Sums, differences, products and quotients of continuous functions.
4. Root functions.
5. The trigonometric functions sin π‘₯, cos π‘₯, tan π‘₯, cosec π‘₯, sec π‘₯ and cot π‘₯.
6. The exponential function exp(π‘₯).
7. The absolute value function |π‘₯|.
Proof. 1, 2 and 3 easily follow from previous theorems on limits and continuity, as does 7. However, 7 can be
easily proved dircetly: For each πœ€ > 0 let 𝛿 = πœ€. Then, for |π‘₯ − π‘Ž| < 𝛿 we have
| |π‘₯| − |π‘Ž| | ≤ |π‘₯ − π‘Ž| < 𝛿 = πœ€.
The continuity of sin and cos follows from the sum of angles formulae and from the limits proved in Calculus I
(the proofs used the Sandwich Theorem, which now has been proved). The continuity of the other trigonometric
functions then follows from part 3.
Finally, the continuity of exp is a tutorial problem.
MATH2022
35
Introductory Analysis 2017 Lecture Manual
Theorem 3.14. Let π‘Ž ∈ ℝ and let 𝑓 be a real function which is defined in a neighbourhood of π‘Ž. Then 𝑓 is
continuous at π‘Ž if and only if for each sequence (π‘₯𝑛 ) in dom(𝑓 ) with lim π‘₯𝑛 = π‘Ž the sequence 𝑓 (π‘₯𝑛 ) satisfies
𝑛→∞
lim 𝑓 (π‘₯𝑛 ) = 𝑓 (π‘Ž).
𝑛→∞
Proof. ⇒∢ Let (π‘₯𝑛 ) be a sequence in dom(𝑓 ) with lim π‘₯𝑛 = π‘Ž. We must show that lim 𝑓 (π‘₯𝑛 ) = 𝑓 (π‘Ž). Hence let
𝑛→∞
𝑛→∞
πœ€ > 0. Since 𝑓 is continuous at π‘Ž, there is 𝛿 > 0 such that
|π‘₯ − π‘Ž| < 𝛿 ⇒ |𝑓 (π‘₯) − 𝑓 (π‘Ž)| < πœ€.
(1)
Since lim π‘₯𝑛 = π‘Ž, there is 𝐾 ∈ ℝ such that for 𝑛 > 𝐾, |π‘₯𝑛 − π‘Ž| < 𝛿. But then, by (1), |𝑓 (π‘₯𝑛 ) − 𝑓 (π‘Ž)| < πœ€ for
𝑛→∞
𝑛 > 𝐾.
⇐∢ (indirect proof) Assume that 𝑓 is not continuous at π‘Ž. Then
∃ πœ€ > 0 ∀ 𝛿 > 0 ∃ π‘₯ ∈ dom(𝑓 ), |π‘₯ − π‘Ž| < 𝛿 and |𝑓 (π‘₯) − 𝑓 (π‘Ž)| ≥ πœ€.
In particular, for 𝛿 = 1𝑛 , 𝑛 = 1, 2, … we find π‘₯𝑛 ∈ dom(𝑓 ) such that |π‘₯𝑛 − π‘Ž| <
then lim π‘₯𝑛 = π‘Ž, whereas (𝑓 (π‘₯𝑛 )) does not converge to 𝑓 (π‘Ž).
1
and |𝑓 (π‘₯𝑛 ) − 𝑓 (π‘Ž)| ≥ πœ€. But
𝑛
𝑛→∞
Tutorial 3.4.1. 1. Consider the function
{
𝑓 (π‘₯) =
⌊π‘₯⌋
π‘₯
−1
if π‘₯ ≠ 0,
if π‘₯ = 0.
Investigate continuity from the left and the right at π‘₯ = 0, π‘₯ = πœ‹ and π‘₯ = 1.
{
0 if π‘₯ ≠ 0,
2. Let 𝑓 (π‘₯) = π‘₯ sin π‘₯1 for π‘₯ ≠ 0 and 𝑔(π‘₯) =
1 if π‘₯ = 0.
Show that 𝑓 (π‘₯) → 0 as π‘₯ → 0 and that 𝑔(π‘₯) → 0 as π‘₯ → 0, but that 𝑔(𝑓 (π‘₯)) does not have a limit as π‘₯ → 0.
Explain this behaviour.
3. Find the values of π‘Ž and 𝑏 which make the function
⎧π‘₯ − 1
βŽͺ
𝑓 (π‘₯) = ⎨ π‘Žπ‘₯2 + 𝑐
βŽͺπ‘₯ + 1
⎩
if π‘₯ ≤ −2,
if −2 < π‘₯ < 1,
if π‘₯ ≥ 1,
continuous at π‘₯ = −2 and π‘₯ = 1.
4. Prove that if lim− 𝑓 (π‘₯) exists, then lim+ 𝑓 (−π‘₯) = lim− 𝑓 (π‘₯).
π‘₯→0
π‘₯→0
π‘₯→0
5. Prove that exp is continuous. You may use the following steps.
(a) The inequality exp(π‘₯) ≥ 1 + π‘₯ is true for all π‘₯ ∈ ℝ.
(b) lim− exp(π‘₯) = 1.
π‘₯→0
(c) lim+ exp(π‘₯) = 1. Hint. Use tutorial problem 4.
π‘₯→0
3.5
The Intermediate Value Theorem
The following important theorem on continuous functions tells us that the graph of a continuous function cannot
jump from one side of a horizontal line 𝑦 = π‘˜ to the other without intersecting the line at least once.
36
Introductory Analysis 2017 Lecture Manual
MATH2022
𝑓 (𝑏)
𝑦=π‘˜
𝑓 (π‘Ž)
π‘Ž
𝑐
𝑏
Theorem 3.15 (Intermediate Value Theorem (IVT)). Suppose that 𝑓 is continuous on the closed interval [π‘Ž, 𝑏]
with 𝑓 (π‘Ž) ≠ 𝑓 (𝑏). Then for any number π‘˜ between 𝑓 (π‘Ž) and 𝑓 (𝑏) there exists a number 𝑐 in the open interval (π‘Ž, 𝑏)
such that 𝑓 (𝑐) = π‘˜.
Proof. Let
𝑔(π‘₯) = 𝑓 (π‘₯) − π‘˜,
(π‘₯ ∈ [π‘Ž, 𝑏]).
Then 𝑔 is continuous, and 𝑔(π‘Ž) and 𝑔(𝑏) have opposite signs: 𝑔(π‘Ž)𝑔(𝑏) < 0.
Let [π‘Ž0 , 𝑏0 ] = [π‘Ž, 𝑏] and use bisection to define intervals [π‘Žπ‘› , 𝑏𝑛 ] as follows: If [π‘Žπ‘› , 𝑏𝑛 ] with 𝑔(π‘Žπ‘› )𝑔(𝑏𝑛 ) < 0 has
been found, let 𝑑 be the midpoint of the interval [π‘Žπ‘› , 𝑏𝑛 ]. If 𝑔(𝑑) = 0, the result follows with 𝑐 = 𝑑. If 𝑔(𝑑)
has the same sign as 𝑔(𝑏𝑛 ), then 𝑔(π‘Žπ‘› ) and 𝑔(𝑑) have opposite signs, and putting π‘Žπ‘›+1 = π‘Žπ‘› , 𝑏𝑛+1 = 𝑑, we have
𝑔(π‘Žπ‘›+1 )𝑔(𝑏𝑛+1 ) < 0. Otherwise, if 𝑔(𝑑) has the opposite sign to 𝑔(𝑏𝑛 ), we put π‘Žπ‘›+1 = 𝑑, 𝑏𝑛+1 = 𝑏𝑛 and get again
𝑔(π‘Žπ‘›+1 )𝑔(𝑏𝑛+1 ) < 0.
If this procedure does not stop, we obtain an increasing sequence (π‘Žπ‘› ) and a decreasing sequence (𝑏𝑛 ), both of
which converge by Theorem 2.10. We observe that
1
𝑏𝑛 = π‘Žπ‘› + (𝑏𝑛−1 − π‘Žπ‘›−1 ) = π‘Žπ‘› + 2−𝑛 (𝑏 − π‘Ž).
2
Then
𝑐 ∢= lim 𝑏𝑛 = lim π‘Žπ‘› + lim 2−𝑛 (𝑏 − π‘Ž) = lim π‘Žπ‘› .
𝑛→∞
𝑛→∞
𝑛→∞
𝑛→∞
Since π‘Ž ≤ 𝑐 ≤ 𝑏 and 𝑔 is continuous at 𝑐, it follows in view of Theorem 3.14 that
𝑔(𝑐)2 = lim 𝑔(π‘Žπ‘› ) lim 𝑔(𝑏𝑛 ) = lim 𝑔(π‘Žπ‘› )𝑔(𝑏𝑛 ) ≤ 0.
𝑛→∞
𝑛→∞
𝑛→∞
Therefore 𝑔(𝑐) = 0, which gives 𝑓 (𝑐) = π‘˜.
Since 𝑓 (π‘Ž) ≠ π‘˜ and 𝑓 (𝑏) ≠ π‘˜, it follows that 𝑐 ≠ π‘Ž and 𝑐 ≠ 𝑏, so that π‘Ž < 𝑐 < 𝑏.
Note. You have seen the definition of interval in first year and you will recall that that the definition required several
cases, depending on whether the endpoints belong to the interval or not and whether the interval is bounded (above,
below), namely (π‘Ž, 𝑏), [π‘Ž, 𝑏), (π‘Ž, 𝑏], [π‘Ž, 𝑏], (−∞, 𝑏), (−∞, 𝑏], (π‘Ž, ∞), [π‘Ž, ∞), (−∞, ∞) where π‘Ž, 𝑏 ∈ ℝ and π‘Ž < 𝑏.
However, intervals can be characterized by one common property. For this we need the following notion: A subset
𝑆 of ℝ is called a singleton if the set 𝑆 has exactly one element.
Definition 3.10. 1. A set 𝑆 ⊂ ℝ is called an interval if
(i) 𝑆 ≠ ∅,
(ii) 𝑆 is not a singleton,
(iii) if π‘₯, 𝑦 ∈ 𝑆, π‘₯ < 𝑦, then each 𝑧 ∈ ℝ with π‘₯ < 𝑧 < 𝑦 satisfies 𝑧 ∈ 𝑆.
2. An interval of the form [π‘Ž, 𝑏] with π‘Ž < 𝑏 is called a closed bounded interval.
Note. A subset 𝑆 of ℝ is an interval if and only if it contains at least two elements and if all real numbers between
any two elements in 𝑆 also belong to 𝑆.
Definition 3.11. For a function 𝑓 ∢ 𝑋 → π‘Œ and 𝐴 ⊂ 𝑋, the set
𝑓 (𝐴) = {𝑦 ∈ π‘Œ ∢ ∃ π‘₯ ∈ 𝐴 ∩ dom(𝑓 ), 𝑓 (π‘₯) = 𝑦} = {𝑓 (π‘₯) ∢ π‘₯ ∈ 𝐴 ∩ dom(𝑓 )}
is called the image of 𝐴 under 𝑓 .
MATH2022
Introductory Analysis 2017 Lecture Manual
37
Corollary 3.16. Let 𝐼 be an interval and let 𝑓 be a continuous real function on 𝐼. Then 𝑓 (𝐼) is either an interval
or a singleton.
Proof. Since 𝐼 ≠ ∅, there is π‘₯ ∈ 𝐼 and so 𝑓 (π‘₯) ∈ 𝑓 (𝐼). Hence 𝑓 (𝐼) ≠ ∅. Hence we have to show that if 𝑓 (𝐼) is
not a singleton, then it is an interval, that is, we must show that for any π‘₯, 𝑦 ∈ 𝐼 with 𝑓 (π‘₯) < 𝑓 (𝑦) and any π‘˜ ∈ ℝ
with 𝑓 (π‘₯) < π‘˜ < 𝑓 (𝑦) there is 𝑐 ∈ 𝐼 such that 𝑓 (𝑐) = π‘˜.
Indeed, from 𝑓 (π‘₯) ≠ 𝑓 (𝑦), we have π‘₯ ≠ 𝑦. If π‘₯ < 𝑦, then 𝑓 is continuous on [π‘₯, 𝑦], and by the intermediate value
theorem, there is 𝑐 ∈ (π‘₯, 𝑦) with 𝑓 (𝑐) = π‘˜. A similar argument holds for π‘₯ > 𝑦.
Example 3.5.1. Let 𝑓 (π‘₯) = π‘₯2 . Then 𝑓 ((−1, 2)) = [0, 4). Notice that 𝐼 = (−1, 2) is an open interval, while 𝑓 (𝐼)
is not.
Theorem 3.17. Let 𝑓 be a real function which is continuous on [π‘Ž, 𝑏], where π‘Ž < 𝑏. Then 𝑓 is bounded on [π‘Ž, 𝑏],
i. e., 𝑓 ([π‘Ž, 𝑏]) is bounded.
Proof. Assume that 𝑓 ([π‘Ž, 𝑏]) is unbounded. Let 𝑑 be the midpoint of the interval [π‘Ž, 𝑏]. Then at least one of the sets
𝑓 ([π‘Ž, 𝑑]), 𝑓 ([𝑑, 𝑏]) would be unbounded, because otherwise 𝑓 ([π‘Ž, 𝑏]) = 𝑓 ([π‘Ž, 𝑑]) ∪ 𝑓 ([𝑑, 𝑏]) would be bounded.
By induction, we find subintervals [π‘Žπ‘› , 𝑏𝑛 ] of [π‘Ž, 𝑏] such that (π‘Žπ‘› ) is increasing, (𝑏𝑛 ) is decreasing, 𝑓 ([π‘Žπ‘› , 𝑏𝑛 ]) is
unbounded, and
1
𝑏𝑛 = π‘Žπ‘› + (𝑏𝑛−1 − π‘Žπ‘›−1 ) = 2−𝑛 (𝑏 − π‘Ž),
2
see the proof of Theorem 3.15, and we infer that both sequences converge with
𝑐 ∢= lim 𝑏𝑛 = lim π‘Žπ‘› ∈ [π‘Ž, 𝑏].
𝑛→∞
𝑛→∞
Since 𝑓 is continuous at 𝑐, there is 𝛿 > 0 such that
𝑓 ((𝑐 − 𝛿, 𝑐 + 𝛿) ∩ [π‘Ž, 𝑏]) ⊂ (𝑓 (𝑐) − 1, 𝑓 (𝑐) + 1).
Since 𝑐 = lim 𝑏𝑛 , there is 𝐾 ∈ β„• such that |𝑏𝑛 −𝑐| < 𝛿 for 𝑛 > 𝐾. Similarly, there is 𝑀 ∈ β„• such that |π‘Žπ‘› −𝑐| < 𝛿
𝑛→∞
for 𝑛 > 𝑀. Let 𝑛 > max{𝐾, 𝑀}. Then
𝑐 − 𝛿 < π‘Žπ‘› ≤ 𝑐 ≤ 𝑏𝑛 < 𝑐 + 𝛿,
and
𝑓 ([π‘Žπ‘› , 𝑏𝑛 ]) ⊂ 𝑓 ((𝑐 − 𝛿, 𝑐 + 𝛿) ∩ [π‘Ž, 𝑏]) ⊂ (𝑓 (𝑐) − 1, 𝑓 (𝑐) + 1)
would give the contradiction that 𝑓 ([π‘Žπ‘› , 𝑏𝑛 ]) would have to be bounded as well as unbounded.
Theorem 3.18. A continuous function on a closed bounded interval achieves its supremum and infimum.
Proof. Let [π‘Ž, 𝑏] be a closed bounded interval and 𝑓 be a continuous function on [π‘Ž, 𝑏]. We must show that there
are π‘₯1 , π‘₯2 ∈ [π‘Ž, 𝑏] such that 𝑓 (π‘₯1 ) = inf 𝑓 ([π‘Ž, 𝑏]) and 𝑓 (π‘₯2 ) = sup 𝑓 ([π‘Ž, 𝑏]).
We are going to show the latter; the proof of the first statement is similar.
By Theorem 3.17, 𝑆 = 𝑓 ([π‘Ž, 𝑏]) is bounded. Let 𝑀 = sup 𝑆. By proof of contradiction, assume that 𝑓 (π‘₯) ≠ 𝑀
for all π‘₯ ∈ [π‘Ž, 𝑏]. Define
1
𝑔(π‘₯) =
, π‘₯ ∈ [π‘Ž, 𝑏].
𝑀 − 𝑓 (π‘₯)
By Theorem 3.13, 𝑔 is continuous on [π‘Ž, 𝑏], and by Theorem 3.17, 𝑔([π‘Ž, 𝑏]) is bounded. So there is 𝐾 ∈ ℝ such
that 0 < 𝑔(π‘₯) ≤ 𝐾 for all π‘₯ ∈ [π‘Ž, 𝑏]. Hence for all π‘₯ ∈ [π‘Ž, 𝑏]:
1
1
1
≤
= 𝑀 − 𝑓 (π‘₯) ⇒ 𝑓 (π‘₯) ≤ 𝑀 − ,
𝐾
𝑔(π‘₯)
𝐾
1
so that the number 𝑀 −
< 𝑀 would be an upper bound of 𝑓 ([π‘Ž, 𝑏]). This contradicts the fact that 𝑀 =
𝐾
sup 𝑓 ([π‘Ž, 𝑏]).
Corollary 3.19. If 𝑓 is continuous on [π‘Ž, 𝑏], π‘Ž < 𝑏, then either 𝑓 ([π‘Ž, 𝑏]) is a singleton or 𝑓 ([π‘Ž, 𝑏]) = [𝑐, 𝑑] with
𝑐 < 𝑑.
38
Introductory Analysis 2017 Lecture Manual
MATH2022
Proof. From Corollary 3.16 and Theorems 3.17 and 3.18 it follows that 𝑓 ([π‘Ž, 𝑏]) is either a singleton or a bounded
interval which contains both its infimum and supremum. But such an interval is of the form [𝑐, 𝑑] with 𝑐 < 𝑑.
Theorem 3.20. Let 𝐼 be an interval and 𝑓 ∢ 𝐼 → ℝ be a stricly monotonic continuous function. Then 𝑓 (𝐼) is an
interval, and the inverse function 𝑓 −1 ∢ 𝑓 (𝐼) → ℝ is continuous.
Proof. By Corollary 3.16, 𝑓 (𝐼) is an interval. Assume that 𝑓 is strictly increasing. Then also 𝑓 −1 is strictly
increasing. Let 𝑏 ∈ 𝑓 (𝐼), i. e., 𝑏 = 𝑓 (π‘Ž) for some π‘Ž ∈ 𝐼. If π‘Ž is not the left endpoint of 𝐼, then 𝑏 is not the left
endpoint of 𝑓 (𝐼), and for 𝑦 ∈ 𝑓 (𝐼) with 𝑦 < 𝑏 = 𝑓 (π‘Ž) we have 𝑓 −1 (𝑦) < 𝑓 −1 (𝑓 (π‘Ž)) = π‘Ž. Therefore, 𝑓 −1 is
bounded above and increasing on 𝑓 (𝐼) ∩ (−∞, 𝑏), and thus
𝛼 = lim− 𝑓 −1 (𝑦)
𝑦→𝑏
exists and 𝛼 ≤ 𝑓 −1 (𝑏) = π‘Ž, see Theorem 3.8. Let π‘₯0 ∈ 𝐼 ∩ (−∞, π‘Ž). Then 𝑓 (π‘₯0 ) < 𝑓 (π‘Ž) = 𝑏 and hence
π‘₯0 = 𝑓 −1 (𝑓 (π‘₯0 )) ≤ 𝛼 ≤ π‘Ž.
Since π‘₯0 and π‘Ž belong to the interval 𝐼, also 𝛼 ∈ 𝐼. Since 𝑓 is continuous, it follows from Lemma 3.12 that
(
)
−1
𝑓 (𝛼) = 𝑓 lim− 𝑓 (𝑦) = lim− 𝑓 (𝑓 −1 (𝑦)) = lim− 𝑦 = 𝑏
𝑦→𝑏
𝑦→𝑏
𝑦→𝑏
which gives
lim 𝑓 −1 (𝑦) = 𝛼 = 𝑓 −1 (𝑏).
𝑦→𝑏−
Therefore 𝑓 −1 is continuous from the left. Similarly, one can show that 𝑓 −1 is continuous from the right. Therefore
𝑓 −1 is continuous. The case 𝑓 strictly decreasing is similar.
Tutorial 3.5.1. 1. Prove that a subset 𝑆 of ℝ is an interval if and only if 𝑆 has the form (π‘Ž, 𝑏), [π‘Ž, 𝑏), (π‘Ž, 𝑏], [π‘Ž, 𝑏],
(−∞, 𝑏), (−∞, 𝑏], (π‘Ž, ∞), [π‘Ž, ∞) or (−∞, ∞) where π‘Ž, 𝑏 ∈ ℝ and π‘Ž < 𝑏.
Hint. Consider inf 𝑆 and sup 𝑆.
2. Let 𝑓 be a real function and let ∅ ≠ 𝐴 ⊂ 𝐡 ⊂ dom(𝑓 ) such that for each π‘₯ ∈ 𝐴 there is πœ€ > 0 such that
(π‘₯ − πœ€, π‘₯] ⊂ 𝐴 or [π‘₯, π‘₯ + πœ€) ⊂ 𝐴 or (π‘₯ − πœ€, π‘₯ + πœ€) ⊂ 𝐴, and the same property for 𝐡. Show that if 𝑓 is continuous
on 𝐡, then 𝑓 is also continuous on 𝐴.
3. A fixed point theorem. Let π‘Ž < 𝑏 and let 𝑓 be a continuous function on [π‘Ž, 𝑏] such that 𝑓 ([π‘Ž, 𝑏]) ⊂ [π‘Ž, 𝑏].
Show that there is π‘₯ ∈ [π‘Ž, 𝑏] such that 𝑓 (π‘₯) = π‘₯.
4. Let 𝐼 be an interval and 𝑓 be a continuous function on 𝐼 such that 𝑓 (𝐼) is unbounded. What can you say about
𝑓 (𝐼)? Find examples which illustrate your answer.
5. Let 𝑓 ∢ [0, 1] → ℝ be a continuous function which only assumes rational values. Show that 𝑓 is constant.
6. Find a continuous function 𝑓 ∢ [−1, 1] → ℝ which is one-to-one when restricted to rational numbers in [−1, 1]
but which is not one-to-one on the whole interval [−1, 1].
Chapter 4
Differentiation
4.1
Revision – Self Study
[Note to lecturers: None of these results have to be presented formally. But, time permitting, some of these definitions and results may be discussed in class; foremost, of course, the definition of the derivative itself.]
In this section we recall definitions and results which have been stated and proved in first year calculus. These
results must be known but will not be examined directly.
In this section let 𝐴 ⊂ ℝ such that for each π‘Ž ∈ 𝐴 there is πœ€ > 0 such that (π‘Ž − πœ€, π‘Ž] ⊂ 𝐴, [π‘Ž, π‘Ž + πœ€) ⊂ 𝐴 or
(π‘Ž − πœ€, π‘Ž + πœ€) ⊂ 𝐴.
Definition 4.1. Let 𝑓 ∢ 𝐴 → ℝ and π‘Ž ∈ 𝐴.
1. 𝑓 is called differentiable at π‘Ž if the limit
𝑓 ′ (π‘₯) = lim
π‘₯→π‘Ž
𝑓 (π‘₯) − 𝑓 (π‘Ž)
π‘₯−π‘Ž
exists. The number 𝑓 ′ (π‘₯) is called the derivative of 𝑓 .
2. 𝑓 is called differentiable on 𝐴 if 𝑓 is differentiable at each π‘Ž ∈ 𝐴.
Note. We also use the notations
𝑑𝑓
𝑑
or
𝑓 for 𝑓 ′ .
𝑑π‘₯
𝑑π‘₯
Theorem 4.1. Let 𝑓 ∢ 𝐴 → ℝ and π‘Ž ∈ 𝐴. If 𝑓 is differentiable at π‘Ž, then 𝑓 is continuous at π‘Ž.
Proof. From
𝑓 (π‘₯) = 𝑓 (π‘Ž) +
𝑓 (π‘₯) − 𝑓 (π‘Ž)
(π‘₯ − π‘Ž)
π‘₯−π‘Ž
it follows that
lim 𝑓 (π‘₯) = 𝑓 (π‘Ž) + lim
π‘₯→π‘Ž
π‘₯→π‘Ž
𝑓 (π‘₯) − 𝑓 (π‘Ž)
lim (π‘₯ − π‘Ž) = 𝑓 (π‘Ž) + [𝑓 ′ (π‘Ž)](0) = 𝑓 (π‘Ž).
π‘₯→π‘Ž
π‘₯−π‘Ž
Theorem 4.2 (Rules for the derivative). Let 𝑓 , 𝑔 ∢ 𝐴 → ℝ, π‘Ž ∈ 𝐴, 𝑓 and 𝑔 differentiable at π‘Ž, and 𝑐 ∈ ℝ. Then
1. Linearity of the derivative: 𝑓 + 𝑔 and 𝑐𝑓 are differentiable at π‘Ž, and
(𝑓 + 𝑔)′ (π‘Ž) = 𝑓 ′ (π‘Ž) + 𝑔 ′ (π‘Ž) and (𝑐𝑓 )′ (π‘Ž) = 𝑐𝑓 ′ (π‘Ž).
2. Product Rule: 𝑓 𝑔 is differentiable at π‘Ž, and
(𝑓 𝑔)′ (π‘Ž) = 𝑓 ′ (π‘Ž)𝑔(π‘Ž) + 𝑓 (π‘Ž)𝑔 ′ (π‘Ž).
3. Quotient Rule: If
𝑓
𝑓
is defined at π‘Ž, i. e., 𝑔(π‘Ž) ≠ 0, then is differentiable at π‘Ž, and
𝑔
𝑔
( )′
𝑓
𝑓 ′ (π‘Ž)𝑔(π‘Ž) − 𝑓 (π‘Ž)𝑔 ′ (π‘Ž)
(π‘Ž) =
.
𝑔
[𝑔(π‘Ž)]2
39
40
Introductory Analysis 2017 Lecture Manual
MATH2022
Proof. 1. From first principles and rules of limits, we have
(𝑓 + 𝑔)(π‘₯) − (𝑓 + 𝑔)(π‘Ž)
π‘₯−π‘Ž
[𝑓 (π‘₯) + 𝑔(π‘₯)] − [𝑓 (π‘Ž) + 𝑔(π‘Ž)]
= lim
π‘₯→π‘Ž
π‘₯−π‘Ž
𝑔(π‘₯) − 𝑔(π‘Ž)
𝑓 (π‘₯) − 𝑓 (π‘Ž)
+ lim
= lim
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯−π‘Ž
π‘₯−π‘Ž
= 𝑓 ′ (π‘Ž) + 𝑔 ′ (π‘Ž)
(𝑓 + 𝑔)′ (π‘Ž) = lim
π‘₯→π‘Ž
and
(𝑐𝑓 )′ (π‘Ž) = lim
π‘₯→π‘Ž
(𝑐𝑓 )(π‘₯) − (𝑐𝑓 )(π‘Ž)
𝑐𝑓 (π‘₯) − 𝑐𝑓 (π‘Ž)
𝑓 (π‘₯) − 𝑓 (π‘Ž)
= lim
= 𝑐 lim
= 𝑐𝑓 ′ (π‘Ž).
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯−π‘Ž
π‘₯−π‘Ž
π‘₯−π‘Ž
2. Observing Theorem 4.1 we have
(𝑓 𝑔)(π‘₯) − (𝑓 𝑔)(π‘Ž)
π‘₯−π‘Ž
𝑓 (π‘₯)𝑔(π‘₯) − 𝑓 (π‘Ž)𝑔(π‘₯) + 𝑓 (π‘Ž)𝑔(π‘₯) − 𝑓 (π‘Ž)𝑔(π‘Ž)
= lim
π‘₯→π‘Ž
π‘₯−π‘Ž
[𝑓 (π‘₯) − 𝑓 (π‘Ž)]𝑔(π‘₯)
𝑓 (π‘Ž)[𝑔(π‘₯) − 𝑔(π‘Ž)]
+ lim
= lim
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯−π‘Ž
π‘₯−π‘Ž
𝑔(π‘₯) − 𝑔(π‘Ž)
𝑓 (π‘₯) − 𝑓 (π‘Ž)
lim 𝑔(π‘₯) + 𝑓 (π‘Ž) lim
= lim
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯−π‘Ž
π‘₯−π‘Ž
= 𝑓 ′ (π‘Ž)𝑔(π‘Ž) + 𝑓 (π‘Ž)𝑔 ′ (π‘Ž).
(𝑓 𝑔)′ (π‘Ž) = lim
π‘₯→π‘Ž
3. We first consider the case 𝑓 = 1:
( )
( )
1
1
(π‘₯) −
(π‘Ž)
( )′
𝑔
𝑔
1
(π‘Ž) = lim
π‘₯→π‘Ž
𝑔
π‘₯−π‘Ž
1
1
−
𝑔(π‘₯) 𝑔(π‘Ž)
= lim
π‘₯→π‘Ž
π‘₯−π‘Ž
𝑔(π‘Ž) − 𝑔(π‘₯)
= lim
π‘₯→π‘Ž 𝑔(π‘₯)𝑔(π‘Ž)(π‘₯ − π‘Ž)
𝑔(π‘Ž) − 𝑔(π‘₯)
1
= lim
π‘₯→π‘Ž
π‘₯ − π‘Ž 𝑔(π‘Ž) lim 𝑔(π‘₯)
π‘₯→π‘Ž
𝑔 ′ (π‘Ž)
=−
.
[𝑔(π‘Ž)]2
To prove the statement for general 𝑓 , we use the product rule:
( )′
( )′
( )
( )′
(
)
𝑓
𝑓 ′ (π‘Ž)
𝑔 ′ (π‘Ž)
1
1
1
′
(π‘Ž) = 𝑓
(π‘Ž) = 𝑓 (π‘Ž)
(π‘Ž) + 𝑓 (π‘Ž)
(π‘Ž) =
+ 𝑓 (π‘Ž) −
𝑔
𝑔
𝑔
𝑔
𝑔(π‘Ž)
[𝑔(π‘Ž)]2
′
′
𝑓 (π‘Ž)𝑔(π‘Ž) − 𝑓 (π‘Ž)𝑔 (π‘Ž)
=
.
[𝑔(π‘Ž)]2
𝑑𝑐
Example 4.1.1. 1. If 𝑐 ∈ ℝ, then 𝑑π‘₯
= 0.
𝑑
2. For 𝑛 ∈ β„•∗ we have
π‘₯𝑛 = 𝑛π‘₯𝑛−1 .
𝑑π‘₯
Proof. 1. follows immediately from the definition.
2. We present an alternate proof to that given in first year: proof by induction. Again, for 𝑛 = 1 the statement is
straightforward:
𝑦−π‘₯
𝑑
π‘₯ = lim
= 1 = (1)π‘₯1−1 .
𝑦→π‘₯ 𝑦 − π‘₯
𝑑π‘₯
MATH2022
Introductory Analysis 2017 Lecture Manual
41
Now assume the statement is true for 𝑛 ∈ β„•∗ . Then, making use of the Product Rule,
(
)
𝑑 𝑛+1
𝑑
𝑑 𝑛
𝑑
π‘₯
=
π‘₯ (π‘₯) + π‘₯𝑛 π‘₯ = 𝑛π‘₯𝑛−1 (π‘₯) + π‘₯𝑛 (1) = (𝑛 + 1)π‘₯𝑛 = (𝑛 + 1)π‘₯(𝑛+1)−1 .
((π‘₯𝑛 )(π‘₯)) =
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
𝑑π‘₯
𝑑 𝑛
π‘₯ = 𝑛π‘₯𝑛−1 for negative integers 𝑛 with the aid of the
𝑑π‘₯
quotient rule. Also, the derivatives of the trigonometric functions sin, cos, tan have been derived in first year
calculus and should be known.
As in first year calculus, we now obtain the derivative
The proof of the differentiability of 𝑒π‘₯ in first year calculus was incomplete; it will be given in the next section.
Also, the differentiability of the inverse trigonometric functions, ln π‘₯ and π‘₯π‘Ÿ for π‘Ÿ ∈ ℝ ⧡ β„• will be postponed to the
next section.
Theorem 4.3 (Fermat’s Theorem). Let 𝐼 be an interval, 𝑓 ∢ 𝐼 → ℝ, and let 𝑐 be in the interior of 𝐼. If 𝑓 has a
local maximum or local minimum at 𝑐 and 𝑓 is differentiable at 𝑐, then 𝑓 ′ (𝑐) = 0.
Proof. Since 𝑐 is an interior point of 𝐼, there is πœ€0 > 0 such that (𝑐 − πœ€0 , 𝑐 + πœ€0 ) ⊂ 𝐼. Assume that 𝑓 has a local
maximum at 𝑐. Then there is πœ€ ∈ (0, πœ€0 ) such that 𝑓 (π‘₯) ≤ 𝑓 (𝑐) for all π‘₯ ∈ (𝑐 − πœ€, 𝑐 + πœ€). Therefore
{
𝑓 (π‘₯) − 𝑓 (𝑐) ≥ 0 for π‘₯ ∈ (𝑐 − πœ€, 𝑐),
π‘₯−𝑐
≤ 0 for π‘₯ ∈ (𝑐, 𝑐 + πœ€).
Hence, since 𝑓 is differentiable at 𝑐,
𝑓 ′ (𝑐) = lim−
π‘₯→𝑐
𝑓 (π‘₯) − 𝑓 (𝑐)
≥ lim− 0 = 0
π‘₯→𝑐
π‘₯−𝑐
𝑓 ′ (𝑐) = lim+
𝑓 (π‘₯) − 𝑓 (𝑐)
≤ lim+ 0 = 0.
π‘₯→𝑐
π‘₯−𝑐
and
π‘₯→𝑐
Therefore 0 ≤ 𝑓 ′ (𝑐) ≤ 0, which proves 𝑓 ′ (𝑐) = 0.
The case of a local minimum at 𝑐 can be dealt with in a similar way, or we may use the fact that then −𝑓 has a
local maximum at 𝑐 and therefore −𝑓 ′ (𝑐) = 0 by the first part of the proof.
Theorem 4.4 (Rolle’s Theorem). Let π‘Ž < 𝑏 be real numbers and 𝑓 ∢ [π‘Ž, 𝑏] → ℝ be a function having the following
properties:
1. 𝑓 is continuous on the closed interval [π‘Ž, 𝑏],
2. 𝑓 is differentiable on the open interval (π‘Ž, 𝑏),
3. 𝑓 (π‘Ž) = 𝑓 (𝑏).
Then there is 𝑐 ∈ (π‘Ž, 𝑏) such that 𝑓 ′ (𝑐) = 0.
Proof. If 𝑓 is constant, then 𝑓 ′ = 0, and the statement is true for any 𝑐 ∈ (π‘Ž, 𝑏). If 𝑓 is not constant, then there is
π‘₯0 ∈ [π‘Ž, 𝑏] such that 𝑓 (π‘₯0 ) ≠ 𝑓 (π‘Ž). We may assume 𝑓 (π‘₯0 ) > 𝑓 (π‘Ž). Otherwise, 𝑓 (π‘₯0 ) < 𝑓 (π‘Ž), and we can consider
−𝑓 . Since 𝑓 is continuous on [π‘Ž, 𝑏] by property 1, 𝑓 has a maximum on [π‘Ž, 𝑏], see Corollary 3.19. That is, there
is some 𝑐 ∈ [π‘Ž, 𝑏] such that 𝑓 (𝑐) ≥ 𝑓 (π‘₯) for all π‘₯ ∈ [π‘Ž, 𝑏]. In particular, 𝑓 (𝑐) ≥ 𝑓 (π‘₯0 ) > 𝑓 (π‘Ž). Since 𝑓 (π‘Ž) = 𝑓 (𝑏),
we have 𝑐 ≠ π‘Ž and 𝑐 ≠ 𝑏, and therefore 𝑐 ∈ (π‘Ž, 𝑏). Hence 𝑓 ′ (𝑐) = 0 by Fermat’s Theorem.
Theorem 4.5 (First Mean Value Theorem). Let π‘Ž < 𝑏 be real numbers and 𝑓 ∢ [π‘Ž, 𝑏] → ℝ be a continuous
function which is differentiable on (π‘Ž, 𝑏).
Then there is 𝑐 ∈ (π‘Ž, 𝑏) such that
𝑓 (𝑏) − 𝑓 (π‘Ž)
= 𝑓 ′ (𝑐).
𝑏−π‘Ž
Proof. The function
𝑓 (𝑏) − 𝑓 (π‘Ž)
(π‘₯ − π‘Ž)
𝑏−π‘Ž
is continuous on [π‘Ž, 𝑏], differentiable on (π‘Ž, 𝑏), and 𝑔(π‘Ž) = 𝑓 (π‘Ž) = 𝑔(𝑏). Hence 𝑔 satisfies the assumptions of
Rolle’s theorem. Therefore, there is 𝑐 ∈ (π‘Ž, 𝑏) such that 𝑔 ′ (𝑐) = 0. But
𝑔(π‘₯) = 𝑓 (π‘₯) −
𝑔 ′ (π‘₯) = 𝑓 ′ (π‘₯) −
𝑓 (𝑏) − 𝑓 (π‘Ž)
,
𝑏−π‘Ž
42
Introductory Analysis 2017 Lecture Manual
MATH2022
and substituting 𝑐 completes the proof.
Tutorial 4.1.1. Let π‘Ž < 𝑏 be real numbers, 𝑓 , 𝑔 ∢ [π‘Ž, 𝑏] → ℝ be continuous and differentiable on (π‘Ž, 𝑏).
1. Show that 𝑔 is injective on [π‘Ž, 𝑏] if 𝑔 ′ (π‘₯) ≠ 0 for all π‘₯ ∈ (π‘Ž, 𝑏).
2. Prove the Second Mean Value Theorem: If 𝑔 ′ (π‘₯) ≠ 0 for all 𝑐 ∈ (π‘Ž, 𝑏), then there is 𝑐 ∈ (π‘Ž, 𝑏) such that
𝑓 (𝑏) − 𝑓 (π‘Ž) 𝑓 ′ (𝑐)
= ′ .
𝑔(𝑏) − 𝑔(π‘Ž)
𝑔 (𝑐)
Hint. Consider the function
β„Ž(π‘₯) = 𝑓 (π‘₯) −
𝑓 (𝑏) − 𝑓 (π‘Ž)
(𝑔(π‘₯) − (𝑔(π‘Ž)).
𝑔(𝑏) − 𝑔(π‘Ž)
3. Show that the statement of the Second Mean Value Theorem remains correct if one replaces the condition
that 𝑔 ′ (π‘₯) ≠ 0 for all π‘₯ ∈ (π‘Ž, 𝑏) with the weaker condition that 𝑔(π‘Ž) ≠ 𝑔(𝑏) and that there is no π‘₯ ∈ (π‘Ž, 𝑏) with
𝑔 ′ (π‘₯) = 𝑓 ′ (π‘₯) = 0.
4. Prove the following one-sided version of l’Hôpital’s Rule: If 𝑓 (π‘Ž) = 𝑔(π‘Ž) = 0, 𝑔 ′ (π‘₯) ≠ 0 for π‘₯ near π‘Ž and
𝑓 ′ (π‘₯)
exists, then
lim+ ′
π‘₯→π‘Ž 𝑔 (π‘₯)
𝑓 (π‘₯)
𝑓 ′ (π‘₯)
lim+
= lim+ ′ .
π‘₯→π‘Ž 𝑔(π‘₯)
π‘₯→π‘Ž 𝑔 (π‘₯)
*Tutorial 4.1.2. Let −∞ ≤ π‘Ž < 𝑏 ≤ ∞ and let 𝑓 , 𝑔 ∢ (π‘Ž, 𝑏) → ℝ be differentiable on (π‘Ž, 𝑏) such that 𝑔 ′ (π‘₯) ≠ 0 for
all π‘₯ ∈ (π‘Ž, 𝑏). Prove the following one-sided versions of l’Hôpital’s Rule.
𝑓 ′ (π‘₯)
1. If lim+ 𝑓 (π‘₯) = lim+ 𝑔(π‘₯) = 0 and lim+ ′
exists as a proper or improper limit, then
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž 𝑔 (π‘₯)
lim+
π‘₯→π‘Ž
2. If lim+ 𝑓 (π‘₯) = lim+ 𝑔(π‘₯) = ∞ and lim+
π‘₯→π‘Ž
π‘₯→π‘Ž
π‘₯→π‘Ž
𝑓 ′ (π‘₯)
exists as a proper or improper limit, then
𝑔 ′ (π‘₯)
lim+
π‘₯→π‘Ž
4.2
𝑓 (π‘₯)
𝑓 ′ (π‘₯)
= lim+ ′ .
𝑔(π‘₯) π‘₯→π‘Ž 𝑔 (π‘₯)
𝑓 (π‘₯)
𝑓 ′ (π‘₯)
= lim+ ′ .
𝑔(π‘₯) π‘₯→π‘Ž 𝑔 (π‘₯)
The Chain Rule and Inverse Functions
Proposition 4.6. Let 𝐼 be an interval, 𝑓 ∢ 𝐼 → ℝ and π‘Ž ∈ 𝐼. The following are equivalent:
(i) 𝑓 is differentiable at π‘Ž.
(ii) There is 𝛾 ∈ ℝ such that the function
{
𝑓 (π‘₯) − 𝑓 (π‘Ž)
if π‘₯ ∈ 𝐼 ⧡ {π‘Ž},
𝑓 ∗ (π‘₯) =
π‘₯−π‘Ž
𝛾
if π‘₯ = π‘Ž,
is continuous at π‘Ž.
(iii) There is a function 𝑓̃ ∢ 𝐼 → ℝ such that 𝑓̃ is continuous at π‘Ž and
𝑓 (π‘₯) = 𝑓 (π‘Ž) + 𝑓̃(π‘₯)(π‘₯ − π‘Ž),
π‘₯ ∈ 𝐼.
(iv) There are 𝛾 ∈ ℝ and a function πœ€ ∢ 𝐼 → ℝ such that πœ€(π‘Ž) = 0, πœ€ is continuous at π‘Ž, and
𝑓 (π‘₯) = 𝑓 (π‘Ž) + 𝛾(π‘₯ − π‘Ž) + πœ€(π‘₯)(π‘₯ − π‘Ž),
π‘₯ ∈ 𝐼.
If (ii) or (iv) hold, then the number 𝛾 is unique and 𝛾 = 𝑓 ′ (π‘Ž). If (iii) holds, then 𝑓̃(π‘Ž) = 𝑓 ′ (π‘Ž).
Proof. (i)⇒(ii): If 𝑓 is differentiable at π‘Ž, then
lim 𝑓 ∗ (π‘₯) = 𝑓 ′ (π‘₯),
π‘₯→π‘Ž
MATH2022
Introductory Analysis 2017 Lecture Manual
43
so that 𝑓 ∗ is continuous at π‘Ž for 𝛾 = 𝑓 ′ (π‘Ž).
(ii)⇒(iii): We simply put 𝑓̃ = 𝑓 ∗ . In particular, 𝑓̃(π‘Ž) = 𝛾.
(iii)⇒(iv): We have
𝑓 (π‘₯) = 𝑓 (π‘Ž) + 𝑓̃(π‘Ž)(π‘₯ − π‘Ž) + (𝑓̃(π‘₯) − 𝑓̃(π‘Ž))(π‘₯ − π‘Ž).
Then (iv) holds with πœ€(π‘₯) = 𝑓̃(π‘₯) − 𝑓̃(π‘Ž) since πœ€(π‘Ž) = 0 and the continuity of 𝑓̃ at π‘Ž implies the continuity of πœ€ at
π‘Ž.
(iv)⇒(i): For π‘₯ ∈ 𝐼 ⧡ {π‘Ž} we have
𝑓 (π‘₯) − 𝑓 (π‘Ž)
= 𝛾 + πœ€(π‘₯) → 𝛾 as π‘₯ → π‘Ž.
π‘₯−π‘Ž
Hence 𝑓 is differentiable at π‘Ž, and 𝛾 = 𝑓 ′ (π‘Ž).
Theorem 4.7 (Chain Rule). Let 𝐼 and 𝐽 be intervals, 𝑔 ∢ 𝐽 → ℝ and 𝑓 ∢ 𝐼 → ℝ with 𝑓 (𝐼) ⊂ 𝐽 , and let π‘Ž ∈ 𝐼.
Assume that 𝑓 is differentiable at π‘Ž and that 𝑔 is differentiable at 𝑓 (π‘Ž). Then 𝑔◦𝑓 is differentiable at π‘Ž and
(𝑔◦𝑓 )′ (π‘Ž) = 𝑔 ′ (𝑓 (π‘Ž))𝑓 ′ (π‘Ž).
Proof. Using the property (iii) of Proposition 4.6 we obtain a function 𝑓̃ on 𝐼 which is continuous at π‘Ž with
𝑓 (π‘₯) = 𝑓 (π‘Ž) + 𝑓̃(π‘₯)(π‘₯ − π‘Ž)
and a function 𝑔̃ on 𝐽 which is continuous at 𝑓 (π‘Ž) with
𝑔(𝑦) = 𝑔(𝑓 (π‘Ž)) + 𝑔(𝑦)(𝑦
Μƒ
− 𝑓 (π‘Ž)).
Then we can write
(𝑔◦𝑓 )(π‘₯) = 𝑔(𝑓 (π‘₯)) = 𝑔(𝑓 (π‘Ž)) + 𝑔(𝑓
Μƒ (π‘₯))(𝑓 (π‘₯) − 𝑓 (π‘Ž))
Μƒ
= (𝑔◦𝑓 )(π‘Ž) + 𝑔(𝑓
Μƒ (π‘₯))𝑓 (π‘₯)(π‘₯ − π‘Ž).
Since 𝑓 and 𝑓̃ are continuous at π‘Ž and since 𝑔 is continuous at 𝑓 (π‘Ž), (𝑔◦𝑓
Μƒ )𝑓̃ is continuous at π‘Ž, see Theorem 3.11,
and by Proposition 4.6, (𝑔◦𝑓 )′ (π‘Ž) = 𝑔(𝑓
Μƒ (π‘Ž))𝑓̃(π‘Ž) = 𝑔 ′ (𝑓 (π‘Ž))𝑓 ′ (π‘Ž).
Theorem 4.8. Let 𝐼 be an interval, let 𝑓 ∢ 𝐼 → ℝ be continuous and strictly increasing or decreasing, and
𝑏 ∈ 𝑓 (𝐼). Assume that 𝑓 is differentiable at π‘Ž = 𝑓 −1 (𝑏) with 𝑓 ′ (π‘Ž) ≠ 0. Then 𝑓 −1 is differentiable at 𝑏 = 𝑓 (π‘Ž)
and
1
1
(𝑓 −1 )′ (𝑏) =
or, equivalently, (𝑓 −1 )′ (𝑓 (π‘Ž)) = ′ .
′
−1
𝑓 (π‘Ž)
𝑓 (𝑓 (𝑏))
Proof. By Proposition 4.6 there is a function 𝑓̃ on 𝐼 which is continuous at π‘Ž such that
𝑓 (π‘₯) − 𝑓 (π‘Ž) = 𝑓̃(π‘₯)(π‘₯ − π‘Ž),
(∗)
and 𝑓̃(π‘Ž) = 𝑓 ′ (π‘Ž). By Theorem 3.20, 𝑓 −1 is continuous, and therefore 𝑓̃◦𝑓 −1 is continuous at 𝑓 (π‘Ž). Since
𝑓 ′ (π‘Ž) ≠ 0, 𝑓̃(π‘Ž) ≠ 0, and since 𝑓 is injective, (∗) gives that 𝑓̃(π‘₯) ≠ 0 for π‘₯ ∈ 𝐼 ⧡ {π‘Ž}. Therefore the function
1
is well-defined on 𝑓 (𝐼) and continuous at 𝑓 (π‘Ž). Setting π‘₯ = 𝑓 −1 (𝑦) for 𝑦 ∈ 𝑓 (𝐼) we get
Μƒ
𝑓 ◦𝑓 −1
𝑦 − 𝑓 (π‘Ž) = 𝑓̃(𝑓 −1 (𝑦))(𝑓 −1 (𝑦) − π‘Ž),
or, equivalently,
𝑓 −1 (𝑦) − 𝑓 −1 (𝑓 (π‘Ž)) =
1
(𝑦 − 𝑓 (π‘Ž)).
𝑓̃(𝑓 −1 (𝑦))
By Proposition 4.6, 𝑓 −1 is differentiable at 𝑓 (π‘Ž), and
(𝑓 −1 )′ (𝑓 (π‘Ž)) =
1
1
= ′ .
−1
Μƒ
𝑓
(π‘Ž)
𝑓 (𝑓 (𝑓 (π‘Ž)))
This result now allows us to find the derivatives of arcsin and arctan.
44
Introductory Analysis 2017 Lecture Manual
MATH2022
Example 4.2.1. Show that arcsin ∢ [−1, 1] → [− πœ‹2 , πœ‹2 ] is continuous on [−1, 1] and differentiable on (−1, 1), and
𝑑
find
arcsin.
𝑑π‘₯
Solution. sin ∢ [− πœ‹2 , πœ‹2 ] → [−1, 1] is strictly increasing and continuous. By Theorem 3.20, arcsin is continuous
on [−1, 1]. Since
(
)
πœ‹ πœ‹
𝑑
sin π‘₯ = cos π‘₯ ≠ 0 for π‘₯ ∈ − ,
,
𝑑π‘₯
2 2
that is, if sin π‘₯ ∈ (−1, 1), it follows by Theorem 4.8 that arcsin is differentiable on (−1, 1) and
𝑑
1
1
1
arcsin π‘₯ =
=√
=√
𝑑π‘₯
cos(arcsin π‘₯)
1 − π‘₯2
1 − sin2 (arcsin π‘₯)
since cos 𝑑 > 0 for 𝑑 ∈ (− πœ‹2 , πœ‹2 ).
Finally, in this section we shall show that 𝑒π‘₯ = exp(π‘₯) is differentiable and find its derivative. This will then allow
us to find the derivative of ln π‘₯ as an inverse function.
Theorem 4.9 (Derivative of 𝑒π‘₯ ).
1
1. 𝑒π‘₯ ≥ 1 + π‘₯ for π‘₯ ∈ ℝ and 𝑒π‘₯ ≤
for π‘₯ < 1.
1−π‘₯
𝑑 π‘₯
2. 𝑒π‘₯ is differentiable and
𝑒 = 𝑒π‘₯ .
𝑑π‘₯
Proof. 1. Let π‘₯ ∈ ℝ and 𝑛 ∈ β„• with 𝑛 > |π‘₯|. Then π‘₯𝑛 > −1, and with the aid of Bernoulli’s inequality we calculate
(
)𝑛
π‘₯
≥1+𝑛 =1+π‘₯
𝑛
(
)
π‘₯ 𝑛
π‘₯
⇒ 𝑒 = lim 1 +
≥ 1 + π‘₯.
𝑛→∞
𝑛
1+
π‘₯
𝑛
Thus we have shown that 𝑒π‘₯ ≥ 1 + π‘₯ for all π‘₯ ∈ ℝ. Replacing π‘₯ with −π‘₯ gives 𝑒−π‘₯ ≥ 1 − π‘₯. Taking inverses for
1
1
π‘₯ < 1, i. e., 1 − π‘₯ > 0, leads to 𝑒π‘₯ = −π‘₯ ≤
for π‘₯ < 1.
𝑒
1−π‘₯
2. The above estimates give for β„Ž < 1 that
⎧ 1 ≤ π‘’β„Ž − 1 ≤ 1
βŽͺ
β„Ž
1
β„Ž
1−β„Ž
−1=
⇒⎨
β„Ž ≤ π‘’β„Ž − 1 ≤
β„Ž−1
1−β„Ž
1−β„Ž
𝑒
1
βŽͺ
⎩1−β„Ž ≤ β„Ž ≤1
if h>0,
if h<0.
Application of the Sandwich Theorem leads to
π‘’β„Ž − 1
= 1.
β„Ž→0
β„Ž
lim
Therefore
𝑒π‘₯+β„Ž − 𝑒π‘₯
𝑒π‘₯ π‘’β„Ž − 𝑒π‘₯
π‘’β„Ž − 1
𝑑 π‘₯
𝑒 = lim
= lim
= 𝑒π‘₯ lim
= 𝑒π‘₯ .
β„Ž→0
β„Ž→0
β„Ž→0
𝑑π‘₯
β„Ž
β„Ž
β„Ž
Tutorial 4.2.1. 1. Find the derivatives of
(a) arctan π‘₯, (b) arccos π‘₯, (c) ln π‘₯ and ln |π‘₯|,
(d) ln |𝑓 (π‘₯)|, where 𝑓 ∢ 𝐼 → ℝ ⧡ {0} is differentiable on 𝐼.
1
𝑝
2. Let π‘₯ > 0 and let π‘Ÿ = with 𝑝 ∈ β„€ and π‘ž ∈ β„•∗ . Define π‘₯π‘Ÿ = (π‘₯𝑝 ) π‘ž .
π‘ž
Show that π‘₯π‘Ÿ = exp(π‘Ÿ ln π‘₯).
3. In view of tutorial problem 2 above, we define π‘₯π‘Ÿ = exp(π‘Ÿ ln π‘₯) for all π‘₯ > 0 and π‘Ÿ ∈ ℝ.
(a) Show that lim exp(π‘₯) = 0.
π‘₯→−∞
(b) Show that lim
1
π‘₯→∞ π‘₯π‘Ÿ
= 0 for all π‘Ÿ > 0.
Chapter 5
Series
5.1
Definitions and Basic Examples
Given a sequence {π‘Žπ‘› }∞
of real numbers, the symbol
𝑛=1
∞
∑
π‘Žπ‘›
𝑛=1
is called a series (of real numbers).
Definition 5.1. Let
1. The number 𝑠𝑛 =
2. The series
∞
∑
∞
∑
π‘Žπ‘› be a series.
𝑛=1
𝑛
∑
π‘Žπ‘– is called the 𝑛-th partial sum of the series.
𝑖=1
π‘Žπ‘› is said to converge if the sequence (𝑠𝑛 ) converges.
𝑛=1
In this case, the number 𝑠 = lim 𝑠𝑛 is called the sum of the series and we write
𝑛→∞
∞
∑
π‘Žπ‘› = 𝑠
𝑛=1
A series which does not converge is said to diverge or be divergent.
Note. 1. Observe that
∞
∑
π‘Žπ‘› denotes a series (convergent or divergent) as well as its sum if it converges.
𝑛=1
2. A series does not have to start at 𝑛 = 1. The change of notation is obvious for other starting indices.
Example 5.1.1 (See Calculus I). [Note to lecturers: Students have seen this in Calculus I, so one does not need to
write down much.] Consider the geometric series
∞
∑
π‘Žπ‘Ÿπ‘› ,
π‘Ž ≠ 0, π‘Ÿ ∈ ℝ.
𝑛=0
Recall the partial sums for π‘Ÿ ≠ 1:
𝑠𝑛 = π‘Ž + π‘Žπ‘Ÿ + π‘Žπ‘Ÿ2 + β‹― + π‘Žπ‘Ÿπ‘› ,
π‘Ÿπ‘ π‘› =
π‘Žπ‘Ÿ + π‘Žπ‘Ÿ2 + β‹― + π‘Žπ‘Ÿπ‘› + π‘Žπ‘Ÿπ‘›+1 .
Subtracting these equations gives
𝑠𝑛 − π‘Ÿπ‘ π‘› = π‘Ž − π‘Žπ‘Ÿπ‘›+1 ,
45
46
Introductory Analysis 2017 Lecture Manual
MATH2022
and so
π‘Ž(1 − π‘Ÿπ‘›+1 )
.
1−π‘Ÿ
By Theorem 2.5, (𝑠𝑛 ) and thus the geometric series converges if |π‘Ÿ| < 1, with
𝑠𝑛 =
∞
∑
π‘Žπ‘Ÿπ‘› = lim 𝑠𝑛 =
𝑛→∞
𝑛=0
π‘Ž
,
1−π‘Ÿ
and diverges if |π‘Ÿ| > 1 or π‘Ÿ = −1. Finally, for π‘Ÿ = 1, 𝑠𝑛 = π‘Ž(𝑛 + 1), and thus 𝑠𝑛 → ∞ as 𝑛 → ∞.
Thus we have shown
Theorem 5.1. The geometric series
∞
∑
π‘Žπ‘Ÿπ‘› ,
π‘Ž ≠ 0,
π‘Žπ‘Ÿπ‘› =
π‘Ž
.
1−π‘Ÿ
𝑛=0
is convergent if |π‘Ÿ| < 1, with sum
∞
∑
𝑛=0
If |π‘Ÿ| ≥ 1, the geometric series diverges.
Example 5.1.2. Is the series
∞
∑
22𝑛 31−𝑛 convergent or divergent?
𝑛=1
Solution. The 𝑛-th term of the series is obviously of the form π‘Žπ‘Ÿπ‘› :
π‘Žπ‘› = 22𝑛 31−𝑛 =
So π‘Ÿ =
( )𝑛
4
3 ⋅ 4𝑛
.
=
3
3𝑛
3
4
> 1, and the series diverges.
3
Example 5.1.3. Write the number 5.4417 = 5.4417417417 … as a ratio of integers.
Solution. Write
5.4417 = 5.4 +
417 417
417
+ 7 +
+ ….
4
10
10
1010
417
−3 . Hence the series converges,
Starting with the second term we have a geometric series with π‘Ž = 10
−4 and π‘Ÿ = 10
and
5.4417 = 5.4 +
=
417
10−4
1 − 10−3
= 5.4 +
417
539877 179959
54
+
=
=
.
10 9990
9990
3330
The following theorem has been proved in Calculus I.
Theorem 5.2. If the series
∞
∑
𝑛=1
417
104 − 10
π‘Žπ‘› converges, then lim π‘Žπ‘› = 0.
𝑛→∞
Proof. Let
𝑠𝑛 = π‘Ž1 + π‘Ž2 + … π‘Žπ‘› .
Then
π‘Žπ‘› = 𝑠𝑛 − 𝑠𝑛−1 .
MATH2022
Since
∞
∑
47
Introductory Analysis 2017 Lecture Manual
π‘Žπ‘› converges,
𝑛=1
lim 𝑠 = 𝑠
𝑛→∞ 𝑛
exists. Since also 𝑛 − 1 → ∞ as 𝑛 → ∞,
lim 𝑠
= 𝑠.
𝑛→∞ 𝑛−1
Hence
lim π‘Ž = lim (𝑠𝑛 − 𝑠𝑛−1 ) = lim 𝑠𝑛 − lim 𝑠𝑛−1 = 𝑠 − 𝑠 = 0.
𝑛→∞ 𝑛
𝑛→∞
𝑛→∞
𝑛→∞
The contrapositive statement to Theorem 5.2 is very useful:
Theorem 5.3 (Test for Divergence). If lim π‘Žπ‘› does not exist or if lim π‘Žπ‘› ≠ 0, then the series
𝑛→∞
𝑛→∞
∞
∑
π‘Žπ‘› diverges.
𝑛=1
Note. From lim π‘Žπ‘› = 0 nothing can be concluded about the convergence of the series
𝑛→∞
∞
∑
π‘Žπ‘› .
𝑛=1
From Theorem 2.3 we immediately infer
Theorem 5.4 (Sum Laws). Let 𝑐 ∈ ℝ and suppose that
∞
∑
(π‘Žπ‘› + 𝑏𝑛 ) and
𝑛=1
1.
2.
3.
∞
∑
∞
∑
π‘Žπ‘› and
𝑛=1
∞
∑
𝑏𝑛 both converge. Then also
𝑛=1
(π‘Žπ‘› − 𝑏𝑛 ) converge, and
𝑛=1
∞
∑
(π‘π‘Žπ‘› ) = 𝑐
∞
∑
π‘Žπ‘› ,
𝑛=1
∞
∑
∞
∑
𝑛=1
∞
∑
𝑛=1
∞
∑
𝑛=1
∞
∑
𝑛=1
𝑛=1
𝑛=1
∞
∑
(π‘Žπ‘› + 𝑏𝑛 ) =
(π‘Žπ‘› − 𝑏𝑛 ) =
π‘Žπ‘› +
π‘Žπ‘› −
𝑏𝑛 ,
𝑏𝑛 ,
𝑛=1
Example 5.1.4. Find the sum of the series
∞
∑
𝑛=1
(
)
1
5
+ 𝑛 .
𝑛(𝑛 + 1) 5
Solution. Recall from Calculus I that
π‘š (
)
∑
1
1
1
1
=
−
=1−
.
𝑛(𝑛 + 1) 𝑛=1 𝑛 𝑛 + 1
π‘š+1
𝑛=1
π‘š
∑
Hence
∞
∑
1
= 1.
𝑛(𝑛 + 1)
𝑛=1
And
∞
∑
1
5𝑛
𝑛=1
is a geometric series with π‘Ž = 51 and π‘Ÿ = 15 . Hence
∞
∑
1
5𝑛
𝑛=1
Therefore
∞
∑
𝑛=1
(
5
1
+
𝑛(𝑛 + 1) 5𝑛
)
=5
=
1 1
1
= .
51− 1
4
5
∞
∑
∞
∑
1
1
1 21
+
=5+ =
.
𝑛
𝑛(𝑛
+
1)
5
4
4
𝑛=1
𝑛=1
Note. For convergence it does not matter at which index the series starts. But it matters for the sum.
∞
∑
(π‘π‘Žπ‘› ),
𝑛=1
48
Introductory Analysis 2017 Lecture Manual
Theorem 5.5. The series
∞
∑
MATH2022
π‘Žπ‘› converges if and only if for each πœ€ > 0 there is 𝐾 ∈ β„• such that for all π‘š ≥ π‘˜ ≥ 𝐾,
𝑛=1
π‘š
|∑
|
|
|
| π‘Žπ‘› | < πœ€.
|
|
|𝑛=π‘˜ |
Proof. Let π‘ π‘˜ =
π‘˜
∑
π‘Žπ‘› . Then
𝑛=1
∞
∑
π‘Žπ‘› converges ⇔ (𝑠𝑛 )∞
converges
𝑛=1
(by definition)
𝑛=1
⇔ ∀ πœ€ > 0 ∃𝐾 ∈ β„• ∀π‘š ≥ π‘˜ ≥ 𝐾, |π‘ π‘š − π‘ π‘˜−1 | < πœ€
π‘š
|∑
|
|
|
⇔ ∀ πœ€ > 0 ∃𝐾 ∈ β„• ∀π‘š ≥ π‘˜ ≥ 𝐾, | π‘Žπ‘› | < πœ€.
|
|
|𝑛=π‘˜ |
(by Theorem 2.12)
Tutorial 5.1.1. 1. Test each of the following series for convergence or divergence:
∞
∞
∞ (
)
( )
∑
∑
∑
1
𝑛2 − 1
1 𝑛
,
(b)
𝑛 sin
,
(c)
.
(a)
1+
2
𝑛
𝑛
𝑛=1
𝑛=1 𝑛 − 50𝑛
𝑛=1
2. Which of the following is valid? Justify your conclusions.
∞
∑
(a) If π‘Žπ‘› → 0 as 𝑛 → ∞, then
π‘Žπ‘› is convergent.
(b) If π‘Žπ‘› ΜΈ→ 0 as 𝑛 → ∞, then
∞
∑
(c) If
𝑛=1
∞
∑
π‘Žπ‘› is divergent.
𝑛=1
π‘Žπ‘› is divergent, then π‘Žπ‘› ΜΈ→ 0 as 𝑛 → ∞.
𝑛=1
5.2
Convergence Tests for Series
Lemma 5.6. Let π‘Žπ‘› ≥ 0 for all 𝑛 ∈ β„•∗ and let π‘ π‘˜ =
π‘˜
∑
𝑛=1
π‘Žπ‘› . Then
∞
∑
π‘Žπ‘› converges if and only if (𝑠𝑛 ) is bounded.
𝑛=1
Proof. Since 𝑠𝑛+1 = 𝑠𝑛 + π‘Žπ‘›+1 ≥ 𝑠𝑛 , it follows that (𝑠𝑛 ) is an increasing sequence. By Theorem 2.10, this sequence
and hence the series converges if and only if the sequence (𝑠𝑛 ) is bounded.
Theorem 5.7 (Comparison Test). Let
∞
∑
π‘Žπ‘› and
𝑛=1
for all 𝑛 ∈ β„•∗ .
∞
∞
∑
∑
(i) If
𝑏𝑛 converges, then also
π‘Žπ‘› converges.
𝑛=1
∞
∑
(ii) If
𝑏𝑛 be series with nonnegative terms and assume that π‘Žπ‘› ≤ 𝑏𝑛
𝑛=1
𝑛=1
π‘Žπ‘› diverges, then also
𝑛=1
Proof. Let π‘ π‘˜ =
∞
∑
∞
∑
∞
∑
𝑏𝑛 diverges.
𝑛=1
π‘˜
∑
𝑛=1
π‘Žπ‘› and π‘‘π‘˜ =
π‘˜
∑
𝑏𝑛 . Then both (π‘ π‘˜ ) and (π‘‘π‘˜ ) are increasing sequences with π‘ π‘˜ ≤ π‘‘π‘˜ . Hence, if
𝑛=1
𝑏𝑛 converges, then (π‘‘π‘˜ ) is bounded, say π‘‘π‘˜ ≤ 𝑀 for all π‘˜ ∈ β„•, and π‘ π‘˜ ≤ π‘‘π‘˜ ≤ 𝑀 for all π‘˜ ∈ β„•. Hence (π‘ π‘˜ ) is a
𝑛=1
bounded sequence and thus converges by Lemma 5.6.
(ii) is the contrapositive of (i).
MATH2022
49
Introductory Analysis 2017 Lecture Manual
Example 5.2.1. Test the series
∞
∑
sin2 𝑛 + 10
𝑛 + 2𝑛
𝑛=1
for convergence.
Solution. Putting
sin2 𝑛 + 10
𝑛 + 2𝑛
π‘Žπ‘› =
it follows that 0 < π‘Žπ‘› < 11
( )𝑛
1
2
=∢ 𝑏𝑛 . By Theorem 5.1, the series with general term 𝑏𝑛 converges. Hence
∞
∑
sin2 𝑛 + 10
converges by the comparison test.
𝑛 + 2𝑛
𝑛=1
Definition 5.2. 1. A series
converges.
2. A series
∞
∑
π‘Žπ‘› is called absolutely convergent if the series of its absolute values
|π‘Žπ‘› |
𝑛=1
𝑛=1
∞
∑
∞
∑
π‘Žπ‘› is called conditionally convergent if it is convergent but not absolutely convergent.
𝑛=1
Theorem 5.8. Every absolutely convergent series is convergent.
Proof. Let
∞
∑
π‘Žπ‘› be absolutely convergent. Let πœ€ > 0. Then, by Theorem 5.5, there is 𝐾 ∈ β„• such that for all
𝑛=1
π‘š
∑
π‘š ≥ π‘˜ ≥ 𝐾,
|π‘Žπ‘› | < πœ€. Because of
𝑛=π‘˜
π‘š
π‘š
|∑
| ∑
|
|
|π‘Žπ‘› |
| π‘Žπ‘› | ≤
|
|
|𝑛=π‘˜ | 𝑛=π‘˜
π‘š
∞
|∑
|
∑
|
|
it follows that | π‘Žπ‘› | < πœ€ for these π‘˜, π‘š, and therefore
π‘Žπ‘› converges by Theorem 5.5.
|
|
|𝑛=π‘˜ |
𝑛=1
∞
∞
∑
∑
Alternative proof. By assumption and Theorem 5.4, both
|π‘Žπ‘› | and
2|π‘Žπ‘› | converge. From
𝑛=1
𝑛=1
0 ≤ π‘Žπ‘› + |π‘Žπ‘› | ≤ 2|π‘Žπ‘› |
and the comparison test, it follows that also
∞
∑
(π‘Žπ‘› + |π‘Žπ‘› |) converges. Again by Theorem 5.4 it follows that
𝑛=1
∞
∑
π‘Žπ‘› =
𝑛=1
∞
∑
((π‘Žπ‘› + |π‘Žπ‘› |) − |π‘Žπ‘› |)
𝑛=1
converges.
Recall from Calculus I that there are convergent series which are not absolutely convergent.
Definition 5.3. An alternating series is a series of the form
∞
∑
𝑛=1
𝑛
(−1) 𝑏𝑛
or
∞
∑
(−1)𝑛−1 𝑏𝑛
with 𝑏𝑛 ≥ 0.
𝑛=1
∞
∞
∑
∑
𝑛
Theorem 5.9 (Alternating series test). If the alternating series
(−1) 𝑏𝑛 or
(−1)𝑛−1 𝑏𝑛 satisfies
𝑛=1
(i) 𝑏𝑛 ≥ 𝑏𝑛+1 for all 𝑛,
(ii) lim 𝑏𝑛 = 0,
𝑛→∞
then the series converges.
𝑛=1
50
Introductory Analysis 2017 Lecture Manual
MATH2022
Proof. For π‘˜ ∈ β„•∗ and π‘š ∈ β„• we have
(−1)π‘˜
π‘˜+2π‘š
∑
(−1)𝑛 𝑏𝑛 = (π‘π‘˜ − π‘π‘˜+1 ) + (π‘π‘˜+2 − π‘π‘˜+3 ) + β‹― + (π‘π‘˜+2π‘š−2 − π‘π‘˜+2π‘š−1 ) + π‘π‘˜+2π‘š
𝑛=π‘˜
= π‘π‘˜ − (π‘π‘˜+1 − π‘π‘˜+2 ) − β‹― − (π‘π‘˜+2π‘š−1 − π‘π‘˜+2π‘š ).
Hence
0 ≤ π‘π‘˜+2π‘š ≤ (−1)π‘˜
π‘˜+2π‘š
∑
(−1)𝑛 𝑏𝑛 ≤ π‘π‘˜ .
𝑛=π‘˜
Similarly,
0 ≤ (−1)
π‘˜
π‘˜+2π‘š+1
∑
(−1)𝑛 𝑏𝑛 ≤ π‘π‘˜ − π‘π‘˜+2π‘š+1 ≤ π‘π‘˜ .
𝑛=π‘˜
Now let πœ€ > 0. Since π‘π‘˜ → 0 as π‘˜ → ∞, there is 𝐾 ∈ β„• such that 𝑏𝐾 < πœ€. Hence for all 𝑙 ≥ π‘˜ ≥ 𝐾:
|∑
|
| 𝑙
|
| (−1)𝑛 𝑏𝑛 | ≤ π‘π‘˜ ≤ 𝑏𝐾 < πœ€.
|
|
|𝑛=π‘˜
|
|
|
Hence the alternating series converges.
Note. lim 𝑏𝑛 = 0 is necessary by Theorem 5.3 since lim (−1)𝑛 𝑏𝑛 = 0 ⇔ lim 𝑏𝑛 = 0 ⇔ lim (−1)𝑛−1 𝑏𝑛 = 0.
𝑛→∞
Example 5.2.2. The alternating series
𝑛→∞
∞
∑
(−1)𝑛−1
𝑛=1
𝑛→∞
𝑛→∞
3𝑛
does not converge since
4𝑛 − 1
3𝑛
lim
𝑛→∞ 4𝑛 − 1
3
≠0
4
=
and thus (ii) is not satisfied, which is necessary for convergence. See the note following the statement of the
Alternating Series Test.
Worked Example 5.2.3. Find whether the series
∞
∑
(−1)𝑛−1
𝑛=1
𝑛2
𝑛3 + 1
is convergent.
Solution. Clearly, we have an alternating series with
lim 𝑏𝑛 = lim
𝑛→∞
𝑛2
𝑛→∞ 𝑛3 + 1
= 0.
To show that 𝑏𝑛 ≥ 𝑏𝑛+1 we have at least the following 2 choices:
1. Show the inequality directly by cross-multiplication and simplification.
π‘₯2
2. Show that 𝑓 (π‘₯) =
has a negative derivative for sufficiently large π‘₯.
π‘₯3 + 1
Alternatively, write
1
𝑛2 + 1𝑛
𝑛2
1
1
=
− 𝑛 = −
.
𝑛3 + 1
𝑛3 + 1 𝑛3 + 1 𝑛 𝑛(𝑛3 + 1)
Then
∞
∞
∞
∑
∑
(−1)𝑛−1 ∑ (−1)𝑛
𝑛2
(−1)𝑛−1
+
=
3
𝑛
𝑛3 + 1 𝑛=1
𝑛=1
𝑛=1 𝑛(𝑛 + 1)
is the sum of two alternating series which converge by the alternating series test; hence the series converges.
∞
∑
| π‘Žπ‘›+1 |
| = 𝐿 < 1, then the series
Theorem 5.10 (Ratio Test). (i) If lim sup ||
π‘Žπ‘› converges absolutely.
|
𝑛→∞ | π‘Žπ‘› |
𝑛=1
∞
∑
| π‘Žπ‘›+1 |
| = 𝑙 > 1, then the series
(ii) If lim inf ||
π‘Žπ‘› diverges.
𝑛→∞ | π‘Žπ‘› ||
𝑛=1
MATH2022
51
Introductory Analysis 2017 Lecture Manual
Proof. Note that the ratio test assumes π‘Žπ‘› ≠ 0 for all 𝑛 ∈ β„•.
(i) Let πœ€ > 0 such that 𝐿 + πœ€ < 1.
| π‘Žπ‘›+1 |
| < 𝐿 + πœ€ for all 𝑛 ≥ 𝐾. Hence for π‘š > 𝐾:
Then there is 𝐾 ∈ β„• such that ||
|
| π‘Žπ‘› |
|
|
|
|π‘Ž
|π‘Žπ‘š | = |π‘ŽπΎ | | 𝐾+1 | ⋅ β‹― ⋅ | π‘Žπ‘š | < |π‘ŽπΎ |(𝐿 + πœ€)π‘š−𝐾 .
|
|π‘Ž
| | | || π‘Ž |
| π‘š−1 |
| 𝐾 |
Since
∞
∑
(*)
|π‘ŽπΎ |(𝐿 + πœ€)π‘š−𝐾 is a convergent geometric series, it follows from (*) and the Comparison Test, Theorem
π‘š=𝐾
∞
∑
5.7, that
π‘Žπ‘š converges absolutely. Hence also
π‘š=𝐾
(ii) Let 𝑙′ ∈ (1, 𝑙).
∞
∑
π‘Žπ‘› converges absolutely.
𝑛=1
| π‘Žπ‘›+1 |
| > 𝑙′ for all 𝑛 ≥ 𝐾. Hence for π‘š > 𝐾:
Then there is 𝐾 ∈ β„• such that ||
|
| π‘Žπ‘› |
|
|
|
|π‘Ž
|π‘Žπ‘š | = |π‘ŽπΎ | | 𝐾+1 | ⋅ β‹― ⋅ | π‘Žπ‘š | > |π‘ŽπΎ |,
| | | || π‘Ž |
|π‘Ž
|
| π‘š−1 |
| 𝐾 |
so that π‘Žπ‘› ΜΈ→ 0 as 𝑛 → ∞. Hence
∞
∑
π‘Žπ‘› diverges by the Test for Divergence, Theorem 5.3.
𝑛=1
∞
∑
√
Theorem 5.11 (Root Test). (i) If lim sup 𝑛 |π‘Žπ‘› | = 𝐿 < 1, then the series
π‘Žπ‘› converges absolutely.
𝑛→∞
(ii) If lim sup
√
𝑛
𝑛→∞
|π‘Žπ‘› | = 𝐿 > 1, then the series
𝑛=1
∞
∑
π‘Žπ‘› diverges.
𝑛=1
Proof. (i) Let πœ€ > 0 such that 𝐿 + πœ€ < 1.
∞
∑
√
𝑛
𝑛
(𝐿 + πœ€)𝑛 is
Then there is 𝐾 ∈ β„• such that |π‘Žπ‘› | < 𝐿 + πœ€ for all 𝑛 ≥ 𝐾. Hence |π‘Žπ‘› | < (𝐿 + πœ€) for 𝑛 ≥ 𝐾: Since
a convergent geometric series, it follows from the comparison test, Theorem 5.7, that
Hence also
∞
∑
∞
∑
𝑛=𝐾
π‘Žπ‘› converges absolutely.
𝑛=𝐾
π‘Žπ‘› converges absolutely.
𝑛=1
(ii) Let 𝐿′ ∈ (1, 𝐿).
√
Then for each 𝐾 ∈ β„• there is π‘š ≥ 𝐾 such that π‘š |π‘Žπ‘š | > 𝐿′ . Hence |π‘Žπ‘š | > (𝐿′ )π‘š > 1 for this π‘š, and we conclude
π‘Žπ‘› ΜΈ→ 0 as 𝑛 → ∞.
Indeed, if π‘Žπ‘› → 0 as 𝑛 → ∞, for πœ€ = 1 there would be 𝐾 ∈ β„• such that |π‘Žπ‘› | < 1 for all 𝑛 > 𝐾.
∞
∑
π‘Žπ‘› diverges by the Test for Divergence, Theorem 5.3.
Hence
𝑛=1
{( )𝑛
( )𝑛 }
( )
1
3
1
Tutorial 5.2.1. 1. Test
+5
𝑛 sin
for convergence.
2
4
𝑛
𝑛=1
∞
∑
2. Prove that the sequence (π‘Žπ‘› )∞
converges if and only if
𝑛=1
(i) (π‘Ž2𝑛 )∞
converges,
𝑛=1
(ii) (π‘Ž2𝑛−1 )∞
converges,
𝑛=1
(iii) (π‘Žπ‘› − π‘Žπ‘›−1 ) → 0 as 𝑛 → ∞.
3. Use Tutorial 2 to prove the Alternating Series Test.
Hint. Show that (𝑠2𝑛 )∞
and (𝑠2𝑛−1 )∞
are monotonic sequences.
𝑛=1
𝑛=1
52
Introductory Analysis 2017 Lecture Manual
MATH2022
4. Use the alternating series test, ratio test or root test to test for convergence:
)2𝑛
∞ (
∞
∑
∑
(−1)𝑛 𝑛
𝑛!2𝑛
(a)
,
(b)
,
2𝑛 + 1
(2𝑛)!
𝑛=1
𝑛=1
(
∞
∞
(
) )
∑
∑
1 𝑛
𝑛𝑛
𝑛
(−1) 𝑒 − 1 +
(c)
,
, (d)
2
𝑛
𝑛=1
𝑛=1 (𝑛!)
(e)
∞ 𝑛
∑
2
𝑛=1
5.3
𝑛
.
Power Series
Definition 5.4. Let π‘Ž and 𝑐𝑛 , 𝑛 ∈ β„•, be real numbers. A series
∞
∑
𝑐𝑛 (π‘₯ − π‘Ž)𝑛
𝑛=0
is called a power series in (π‘₯ − π‘Ž) or a power series centred at π‘Ž or a power series about π‘Ž.
Note that (π‘₯ − π‘Ž)0 = 1. For π‘₯ = π‘Ž, all terms from the second onwards are 0, so the series converges to 𝑐0 for π‘₯ = π‘Ž.
Each power series defines a function whose domain is those π‘₯ ∈ ℝ for which the series converges.
Notation. With each power series
∞
∑
𝑐𝑛 (π‘₯ − π‘Ž)𝑛 we associate a number 𝑅 or ∞, called the radius of convergence
𝑛=0
of the series, which is defined as follows:
√
(i) 𝑅 = 0 if lim sup 𝑛 |𝑐𝑛 | = ∞,
𝑛→∞
(ii) 𝑅 =
1
if 0 < lim sup
√
𝑛→∞
lim sup 𝑛 |𝑐𝑛 |
√
𝑛
|𝑐𝑛 | ∈ ℝ,
𝑛→∞
√
(iii) 𝑅 = ∞ if lim sup 𝑛 |𝑐𝑛 | = 0.
𝑛→∞
Theorem 5.12. There are three alternatives for the domain of a power series
∞
∑
𝑐𝑛 (π‘₯ − π‘Ž)𝑛 :
𝑛=0
(i) If 𝑅 = 0, then the series converges only for π‘₯ = π‘Ž.
(ii) If 𝑅 = ∞, then the series converges absolutely for all π‘₯ ∈ ℝ.
(iii) If 0 < 𝑅 ∈ ℝ, then the series converges absolutely if |π‘₯ − π‘Ž| < 𝑅 and diverges if |π‘₯ − π‘Ž| > 𝑅.
Note that (iii) says that the series converges for π‘₯ in the interval (π‘Ž − 𝑅, π‘Ž + 𝑅) and diverges outside [π‘Ž − 𝑅, π‘Ž + 𝑅].
For π‘₯ = π‘Ž − 𝑅 and π‘₯ = π‘Ž + 𝑅 anything can happen, see Calculus I. In any case, the domain of the series is an
interval, called the interval of convergence.
√
Proof. (i): We note that if lim sup 𝑛 |𝑐𝑛 | = ∞, then for π‘₯ ≠ π‘Ž also
𝑛→∞
√
√
lim sup 𝑛 |𝑐𝑛 (π‘₯ − π‘Ž)𝑛 | = lim sup 𝑛 |𝑐𝑛 | |π‘₯ − π‘Ž| = ∞.
𝑛→∞
𝑛→∞
In view of the root test,
√ this shows that the power series diverges if π‘₯ ≠ π‘Ž.
(ii), (iii): If lim sup 𝑛 |𝑐𝑛 | ∈ ℝ, then
𝑛→∞
√
√
lim sup 𝑛 |𝑐𝑛 (π‘₯ − π‘Ž)𝑛 | = |π‘₯ − π‘Ž| lim sup 𝑛 |𝑐𝑛 |.
𝑛→∞
𝑛→∞
Hence the power series converges for all π‘₯ ∈ ℝ if lim sup
𝑛→∞
√
𝑛
|𝑐𝑛 | = 0, proving (ii).
MATH2022
Introductory Analysis 2017 Lecture Manual
53
√
If finally 0 < lim sup 𝑛 |𝑐𝑛 | ∈ ℝ, then, by the Root Test, the series converges if
𝑛→∞
√
1
,
|π‘₯ − π‘Ž| lim sup 𝑛 |𝑐𝑛 | < 1, i. e., |π‘₯ − π‘Ž| <
√
𝑛→∞
lim sup 𝑛 |𝑐𝑛 |
𝑛→∞
and the series diverges if
1
|π‘₯ − π‘Ž| >
.
√
lim sup 𝑛 |𝑐𝑛 |
𝑛→∞
√
Tutorial 5.3.1. 1. (a) Let 𝑐 > 1 and put 𝑐𝑛 = 𝑛 𝑐 − 1.
(i) Show that 𝑐𝑛 ≥ 0.
(ii) Show that lim sup 𝑐𝑛 ≤ 0. Hint. Use Bernoulli’s inequality.
𝑛→∞
√
(iii) Conclude that lim 𝑛 𝑐 = 1.
𝑛→∞
√
(b) Use (a) to show that lim 𝑛 𝑐 = 1 for all 𝑐 > 0.
𝑛→∞
2. Consider
∞
∑
π‘Žπ‘› with π‘Žπ‘› ≠ 0 for all 𝑛 ∈ β„•. Show that
𝑛=1
√
√
| π‘Žπ‘›+1 |
|π‘Ž |
| ≤ lim inf 𝑛 |π‘Žπ‘› | ≤ lim sup 𝑛 |π‘Žπ‘› | ≤ lim sup | 𝑛+1 | .
lim inf ||
|
|
|
𝑛→∞ | π‘Žπ‘› |
𝑛→∞
𝑛→∞
𝑛→∞ | π‘Žπ‘› |
| π‘Žπ‘›+1 |
| exists or is ∞?
What can you say if lim ||
𝑛→∞ | π‘Žπ‘› ||
∞
∑
3. Consider the power series
π‘Žπ‘› (π‘₯ − π‘Ž)𝑛 with π‘Žπ‘› ≠ 0 for all 𝑛 ∈ β„•. Using tutorial problem 2 above or otherwise,
𝑛=1
| π‘Ž |
|
|
prove that if 𝑅 = lim | 𝑛 | exists or is ∞, then 𝑅 is the radius of convergence of the power series.
𝑛→∞ | π‘Žπ‘›+1 |
|
|
√
4. Prove that lim 𝑛 𝑛 = 1.
𝑛→∞
5. Find the radius and interval of convergence for each of the following power series:
∞
∞
∞
∑
∑
∑
(2π‘₯)𝑛
π‘₯𝑛
(a)
,
(b)
,
(c)
𝑛𝑛 π‘₯𝑛 ,
𝑛
𝑛𝑛
𝑛=1
𝑛=0
𝑛=0
(d)
(g)
∞
∑
(π‘₯ − 1)𝑛
√ ,
𝑛 𝑛
𝑛=1 3
∞ (
∑
𝑛=0
𝑛
𝑛+1
)
𝑛2
(e)
∞
∑
(−2π‘₯)𝑛
𝑛=1
π‘₯𝑛 ,
(h)
∞
∑
𝑛3
,
(π‘₯ + 3)𝑛
,
𝑛3
𝑛=1
(f)
∞
∑
(−1)𝑛 π‘₯𝑛 ,
𝑛=0
(i)
∞
∑
(𝑛π‘₯)𝑛
𝑛=0
(2𝑛)!
.
Chapter 6
Riemann Integration
6.1
Suprema and Infima (self study)
Recall that if a nonempty set 𝐴 ⊂ ℝ is bounded above, then the supremum of 𝐴 is the least upper bound of 𝐴,
and if 𝐴 is bounded below, then the infimum of 𝐴 is the greatest lower bound of 𝐴. Hence for all π‘₯ ∈ 𝐴 we
have inf 𝐴 ≤ π‘₯ ≤ sup 𝐴 and for each πœ€ > 0 there exist 𝑝, π‘ž ∈ 𝐴 such that 𝑝 < πœ€ + inf 𝐴 and π‘ž > −πœ€ + sup 𝐴, see
Theorems 1.9 and 1.12.
Lemma 6.1. Let 𝐴 and 𝐡 be non-empty bounded sets of real numbers, let 𝑐 ∈ ℝ, and define
𝑐𝐴 ∢= {𝑐π‘₯ ∣ π‘₯ ∈ 𝐴},
𝑐 + 𝐴 ∢= {𝑐 + π‘₯ ∣ π‘₯ ∈ 𝐴}
and
𝐴 + 𝐡 ∢= {π‘₯ + 𝑦 ∣ π‘₯ ∈ 𝐴, 𝑦 ∈ 𝐡}.
Then 𝑐𝐴, 𝑐 + 𝐴 and 𝐴 + 𝐡 are bounded, and
(a) sup(𝑐𝐴) = 𝑐 sup 𝐴, inf (𝑐𝐴) = 𝑐 inf 𝐴, if 𝑐 ≥ 0,
(b) sup(𝑐𝐴) = 𝑐 inf 𝐴, inf (𝑐𝐴) = 𝑐 sup 𝐴, if 𝑐 ≤ 0,
(c) sup(𝑐 + 𝐴) = 𝑐 + sup 𝐴, inf (𝑐 + 𝐴) = 𝑐 + inf 𝐴,
(d) sup(𝐴 + 𝐡) = sup 𝐴 + sup 𝐡, inf (𝐴 + 𝐡) = inf 𝐴 + inf 𝐡.
Proof. (a) Let 𝑀 = sup 𝐴. Then π‘₯ ≤ 𝑀 for all π‘₯ ∈ 𝐴 and therefore 𝑐π‘₯ ≤ 𝑐𝑀 for all 𝑐π‘₯ ∈ 𝑐𝐴. Hence 𝑐𝐴 is
bounded above with bound 𝑐𝑀. This shows that 𝐿 = sup(𝑐𝐴) satisfies 𝐿 ≤ 𝑐𝑀.
If 𝑐 = 0 then 𝐿 = sup{0} = 0 = 0 ⋅ 𝑀 = 𝑐𝑀, and if 𝑐 > 0, the the above inequality gives
𝑐𝑀 = 𝑐 sup(𝑐 −1 (𝑐𝐴)) ≤ sup(𝑐𝐴) = 𝐿.
Hence 𝐿 ≤ 𝑐𝑀 and 𝑐𝑀 ≤ 𝐿, which proves (a) for supremum.
The proof for the infimum is similar, or we can use Proposition 1.10, with 𝐴 = −(−𝐴).
(b) Since 𝑐𝐴 = −((−𝑐)𝐴) and −𝑐 ≥ 0, this follows from (a) and Proposition 1.10.
(d) This is Tutorial 1.2.1, part 7.
(c) is a special case of (d) if one observes 𝑐 + 𝐴 = {𝑐} + 𝐴 and inf {𝑐} = sup{𝑐} = 𝑐.
Lemma 6.2. If ∅ ≠ 𝐴 ⊂ 𝐡 ⊂ ℝ, then
sup 𝐴 ≤ sup 𝐡,
inf 𝐴 ≥ inf 𝐡.
Proof. Assume 𝐡 is bounded. Let π‘Ž ∈ 𝐴. Then π‘Ž ∈ 𝐡, and hence inf 𝐡 ≤ π‘Ž ≤ sup 𝐡. It follows that 𝐴 is bounded,
that inf 𝐡 is a lower bound of 𝐴 and that sup 𝐴 is an upper bound of 𝐴. Hence inf 𝐴 ≥ inf 𝐡 and sup 𝐴 ≤ sup 𝐡.
The cases that inf 𝐡 = −∞ or sup 𝐡 = ∞ are similar.
Lemma 6.3. Let 𝐴 and 𝐡 be nonempty subsets of ℝ such that π‘₯ ≤ 𝑦 for all π‘₯ ∈ 𝐴 and 𝑦 ∈ 𝐡. Then 𝐴 is bounded
above, 𝐡 is bounded below, and
sup 𝐴 ≤ inf 𝐡.
Proof. Let 𝑏 ∈ 𝐡. Then π‘Ž ≤ 𝑏 for all π‘Ž ∈ 𝐴. So 𝑏 is an upper bound of 𝐴. Hence 𝐴 is bounded above and
sup 𝐴 ≤ 𝑏 since sup 𝐴 is the least upper bound. Then sup 𝐴 ≤ 𝑏 for all 𝑏 ∈ 𝐡, and therefore sup 𝐴 is a lower bound
for 𝐡. Thus sup 𝐴 ≤ inf 𝐡 follows.
54
MATH2022
6.2
55
Introductory Analysis 2017 Lecture Manual
Riemann Sums
The aim of this part of your course is to formalize the Riemann integral and consequently prove the major theorems
concerning its properties.
Definition 6.1. A partition, 𝑃 , of the interval [π‘Ž, 𝑏] is a finite set of points π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯𝑁 = 𝑏 which for
brevity will be denoted
𝑃 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯𝑁 = 𝑏}.
The set of all partitions of the interval [π‘Ž, 𝑏] will be denoted by (π‘Ž, 𝑏).
Next we consider the upper and lower Riemann sums with respect to a partition.
Definition 6.2. Let 𝑓 be a bounded function on [π‘Ž, 𝑏], (i.e. 𝑓 ∢ [π‘Ž, 𝑏] → ℝ and there exists a constant 𝐾 such that
|𝑓 (π‘₯)| ≤ 𝐾 for all π‘₯ ∈ [π‘Ž, 𝑏]) and let 𝑃 = {π‘Ž = π‘₯0 < β‹― < π‘₯𝑁 = 𝑏} be a partition of the closed finite interval
[π‘Ž, 𝑏].
(a) We define the upper sum of 𝑓 over [π‘Ž, 𝑏] with respect to the partition 𝑃 by
π‘ˆ (𝑓 , 𝑃 ) ∢=
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}.
𝑗=1
(b) We define the lower sum of 𝑓 over [π‘Ž, 𝑏] with respect to the partition 𝑃 by
𝐿(𝑓 , 𝑃 ) ∢=
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) inf {𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}.
𝑗=1
𝑓
𝑦
π‘₯0
π‘₯2
π‘₯3
π‘₯𝑛
π‘₯
=
=
π‘Ž
π‘₯1
𝑏
Observe that if 𝑓 (π‘₯) ≥ 0 on [π‘Ž, 𝑏], then both 𝐿(𝑓 , 𝑃 ) and π‘ˆ (𝑓 , 𝑃 ) are approximations of the area under the curve
𝑦 = 𝑓 (π‘₯) for π‘₯ ∈ [π‘Ž, 𝑏]. Here 𝐿(𝑓 , 𝑃 ) under approximates the area, while π‘ˆ (𝑓 , 𝑃 ) over approximates the area.
Worked Example 6.2.1. Find π‘ˆ (𝑓 , 𝑃 ) and 𝐿(𝑓 , 𝑃 ) for 𝑓 (π‘₯) = π‘₯2 − 1 and 𝑃 = {0 < 1βˆ•4 < 1βˆ•2 < 1}.
Solution.
56
Introductory Analysis 2017 Lecture Manual
0
1
4
MATH2022
Upper Sum:
1
2
𝑓
1
π‘₯
sup 𝑓 (π‘₯) =
π‘₯∈[0, 14 ]
sup 𝑓 (π‘₯) =
π‘₯∈[ 14 , 12 ]
1
−15
−1=
16
16
1
−3
−1=
4
4
sup 𝑓 (π‘₯) = 1 − 1 = 0
π‘₯∈[ 12 ,1]
⇒ π‘ˆ (𝑓 , 𝑃 ) =
(
(
)
(
)
)
1
27
−15 1
−3 1 1
+0 1−
=−
−0 +
−
16 4
4 2 4
2
64
Lower Sum:
−15
−3
1
1
−1=
,
inf 𝑓 (π‘₯) = − 1 =
1
16
16
4
4
π‘₯∈[ 2 ,1]
(
(
)
(
)
)
1
−3
1
55
−15 1 1
⇒ 𝐿(𝑓 , 𝑃 ) = −1
+
1−
=−
−0 +
−
4
16 2 4
4
2
64
inf 𝑓 (π‘₯) = 0 − 1 = −1,
π‘₯∈[0, 14 ]
inf
π‘₯∈[ 14 , 12 ]
1
Remark. From 1st year:
∫
(π‘₯2 − 1) 𝑑π‘₯ =
0
𝑓 (π‘₯) =
]1
[ 3
2
π‘₯
− π‘₯ = − so we observe that
3
3
0
1
55
27
−
= 𝐿(π‘₯2 − 1, 𝑃 ) ≤ (π‘₯2 − 1) 𝑑π‘₯ ≤ π‘ˆ (𝑓 , 𝑃 ) = − .
∫
64
64
0
The following lemma gives some useful bounds.
Lemma 6.4. Let 𝑓 be a bounded function on the closed interval [π‘Ž, 𝑏] and let 𝑃 be a partition of [π‘Ž, 𝑏]. Then
(𝑏 − π‘Ž) inf {𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘Ž, 𝑏]} ≤ 𝐿(𝑓 , 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃 ) ≤ (𝑏 − π‘Ž) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘Ž, 𝑏]}.
Proof. From the definition of supremum and infimum:
inf
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯) ≤
sup
𝑓 (π‘₯).
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
Also, since [π‘₯𝑗−1 , π‘₯𝑗 ] ⊂ [π‘Ž, 𝑏], it follows from Lemma 6.2 that
sup
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯) ≤ sup 𝑓 (π‘₯),
π‘₯∈[π‘Ž,𝑏]
inf 𝑓 (π‘₯) ≤
π‘₯∈[π‘Ž,𝑏]
inf
𝑓 (π‘₯).
sup
𝑓 (π‘₯) ≤ sup 𝑓 (π‘₯).
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
Thus
inf 𝑓 (π‘₯) ≤
π‘₯∈[π‘Ž,𝑏]
inf
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯) ≤
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
π‘₯∈[π‘Ž,𝑏]
Multiplying by (π‘₯𝑗 − π‘₯𝑗−1 ) (> 0) and summing up over 𝑗 gives
𝑛
∑
(π‘₯𝑗 − π‘₯𝑗−1 )inf 𝑓 (π‘₯) ≤
π‘₯∈[π‘Ž,𝑏]
(π‘₯𝑗 − π‘₯𝑗−1 ) inf 𝑓 (π‘₯) ≤
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
(π‘₯𝑗 − π‘₯𝑗−1 ) sup 𝑓 (π‘₯) ≤
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
(π‘₯𝑗 − π‘₯𝑗−1 )sup 𝑓 (π‘₯)
π‘₯∈[π‘Ž,𝑏]
=
𝑗=1
=
𝑗=1
𝑛
∑
=
𝑗=1
𝑛
∑
=
𝑗=1
𝑛
∑
(𝑏 − π‘Ž) inf 𝑓 (π‘₯)
𝐿(𝑓 , 𝑃 )
π‘ˆ (𝑓 , 𝑃 )
(𝑏 − π‘Ž) sup 𝑓 (π‘₯)
π‘₯∈[π‘Ž,𝑏]
π‘₯∈[π‘Ž,𝑏]
MATH2022
Introductory Analysis 2017 Lecture Manual
57
Worked Example 6.2.2. Let 𝑓 (π‘₯) = π‘₯2 and 𝑃 = {1 < 4βˆ•3 < 5βˆ•3 < 2}. With π‘Ž = 1 and 𝑏 = 2 we have
inf 𝑓 = 1,
[1,2]
sup 𝑓 = 4,
𝑏 − π‘Ž = 1.
[1,2]
Thus we conclude
1 ≤ 𝐿(𝑓 , 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃 ) ≤ 4.
Note to lecturers: This calculation may be skipped. In fact, since 𝑓 is increasing on [1, 2], infimum and supremum
on each subdivision interval are taken at the left and right endpoint, respectively. Therefore
1 16 1 25 1
77
⋅
+ ⋅
+ ⋅4=
3 9
3 9
3
27
1 16 1 25 50
1
+ ⋅
=
,
𝐿(𝑓 , 𝑃 ) = ⋅ 1 + ⋅
3
3 9
3 9
27
π‘ˆ (𝑓 , 𝑃 ) =
i. e.
≤
≤
4
=
77
27
=
≤
=
50
27
=
1
(𝑏 − π‘Ž) inf 𝑓
𝐿(𝑓 , 𝑃 )
π‘ˆ (𝑓 , 𝑃 )
(𝑏 − π‘Ž) sup 𝑓
[1,2]
[1,2]
Tutorial 6.2.1. 1. Let 𝑓 (π‘₯) = π‘₯ − 21 and 𝑃 = {0 < 1βˆ•2 < 1 < 3βˆ•2 < 2}. Find π‘ˆ (𝑓 , 𝑃 ) and 𝐿(𝑓 , 𝑃 ) and compare
2
them with ∫0 𝑓 (π‘₯) 𝑑π‘₯.
2. Let 𝑓 (π‘₯) = π‘₯ and 𝑃𝑁 = {−1 < −1 + 𝑁1 < −1 + 𝑁2 < β‹― < −1 + 3𝑁
= 2}, 𝑁 ∈ β„•∗ .
𝑁
]}
{
[
]}
{
[
𝑗−1
𝑗
𝑗
and
inf
𝑓
(π‘₯)
∣
π‘₯
∈
−1
+
.
,
−1
+
,
−1
+
(i) Find sup 𝑓 (π‘₯) ∣ π‘₯ ∈ −1 + 𝑗−1
𝑁
𝑁
𝑁
𝑁
(ii) Hence compute π‘ˆ (𝑓 , 𝑃𝑁 ) and 𝐿(𝑓 , 𝑃𝑁 ).
(iii) Find lim π‘ˆ (𝑓 , 𝑃𝑁 ) and lim 𝐿(𝑓 , 𝑃𝑁 ).
𝑁→∞
𝑁→∞
2
(iv) Compare these limits with ∫−1 π‘₯ 𝑑π‘₯.
⎧1
βŽͺ
3. Verify Lemma 6.4 for the function sgn(π‘₯) ∢= ⎨ 0
βŽͺ −1
⎩
𝑃 = {−2 < −1 < 0 < 1 < 2 < 3}.
6.3
if π‘₯ > 0
if π‘₯ = 0
on the interval [−2, 3] with partition
if π‘₯ < 0
Refinements
Definition 6.3. Let 𝑃 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯π‘š = 𝑏} and 𝑄 = {π‘Ž = 𝑦0 < 𝑦1 < β‹― < 𝑦𝑛 = 𝑏} be partitions of
[π‘Ž, 𝑏]. Then 𝑄 is said to be a refinement of 𝑃 if 𝑃 ⊂ 𝑄 i.e. {π‘₯0 , π‘₯1 , … , π‘₯π‘š } ⊂ {𝑦0 , 𝑦1 , … , 𝑦𝑛 }.
𝑄 is called a refinement because 𝑄 chops [π‘Ž, 𝑏] into “smaller” pieces than 𝑃 did. Or you can think of 𝑃 as first
cutting [π‘Ž, 𝑏] into pieces and then 𝑄 additionally chops some of those pieces into smaller pieces.
Worked Example 6.3.1. 1. 𝑃 = {0 < 1βˆ•2 < 1} and 𝑄 = {0 < 1βˆ•3 < 2βˆ•3 < 1} are both partitions of [0, 1] but
𝑄 is not a refinement of 𝑃 as 12 belongs to the partition 𝑃 but not to 𝑄.
2. 𝑃 = {0 < 1βˆ•2 < 1} and 𝑄 = {0 < 1βˆ•4 < 2βˆ•4 < 3βˆ•4 < 1} are both partitions of [0, 1] and 𝑄 is a refinement of
𝑃.
𝑛
3. Let 𝑃𝑛 = {0 < 21𝑛 < 22𝑛 < β‹― < 22𝑛 }. Then 𝑃𝑛 is a partition of [0, 1] for each 𝑛 ∈ β„• and 𝑃𝑛 ⊂ 𝑃𝑛+1 since
𝑗
= 22𝑗
𝑛+1 .
2𝑛
58
Introductory Analysis 2017 Lecture Manual
MATH2022
Lemma 6.5. Let 𝑓 be a bounded function on [π‘Ž, 𝑏] and let 𝑃 and 𝑅 be partitions of [π‘Ž, 𝑏] with 𝑅 a refinement of
𝑃 . Then
𝐿(𝑓 , 𝑃 ) ≤ 𝐿(𝑓 , 𝑅) ≤ π‘ˆ (𝑓 , 𝑅) ≤ π‘ˆ (𝑓 , 𝑃 ).
Proof. Since 𝑅 can be formed by adding a finite number of partition points to 𝑃 , proceeding inductively it suffices
to consider the case where 𝑅 is formed by introducing one extra point to 𝑃 :
Suppose 𝑃 = {π‘₯0 < π‘₯1 < β‹― < π‘₯𝑁 } and 𝑅 = {π‘₯0 < π‘₯1 < β‹― < π‘₯π‘˜−1 < 𝑦 < π‘₯π‘˜ < β‹― < π‘₯𝑁 }. We have
π‘ˆ (𝑓 , 𝑃 ) =
=
𝑛
∑
𝑗=1
𝑛
∑
(π‘₯𝑗 − π‘₯𝑗−1 )
(π‘₯𝑗 − π‘₯𝑗−1 )
𝑗=1
𝑗≠π‘˜
sup
𝑓 (π‘₯)
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
sup
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯) + (π‘₯π‘˜ − π‘₯π‘˜−1 )
sup
𝑓 (π‘₯)
π‘₯∈[π‘₯π‘˜−1 ,π‘₯π‘˜ ]
But 𝑄 = {π‘₯π‘˜−1 < 𝑦 < π‘₯π‘˜ } is a partition of [π‘₯π‘˜−1 , π‘₯π‘˜ ], and Lemma 6.4 gives
(𝑦 − π‘₯π‘˜−1 )
sup
π‘₯∈[π‘₯π‘˜−1 ,𝑦]
𝑓 (π‘₯) + (π‘₯π‘˜ − 𝑦) sup 𝑓 (π‘₯) ≤ (π‘₯π‘˜ − π‘₯π‘˜−1 )
π‘₯∈[𝑦,π‘₯π‘˜ ]
sup
𝑓 (π‘₯).
π‘₯∈[π‘₯π‘˜−1 ,π‘₯π‘˜ ]
Hence
π‘ˆ (𝑓 , 𝑃 ) ≥
𝑛
∑
(π‘₯𝑗 − π‘₯𝑗−1 )
𝑗=1
𝑗≠π‘˜
sup
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯) + (𝑦 − π‘₯π‘˜−1 )
sup
π‘₯∈[π‘₯π‘˜−1 ,𝑦]
𝑓 (π‘₯) + (π‘₯π‘˜ − 𝑦) sup 𝑓 (π‘₯)
π‘₯∈[𝑦,π‘₯π‘˜ ]
= π‘ˆ (𝑓 , 𝑅).
The inequality 𝐿(𝑓 , 𝑃 ) ≤ 𝐿(𝑓 , 𝑅) follows in a similar way, and 𝐿(𝑓 , 𝑅) ≤ π‘ˆ (𝑓 , 𝑅) has been shown in Lemma
6.4.
Definition 6.4. If 𝑃 and 𝑄 are partitions of [π‘Ž, 𝑏], then 𝑃 ∪ 𝑄 (with elements listed in increasing order and without
repetitions) is called the common refinement of 𝑃 and 𝑄.
Worked Example 6.3.2. Let 𝑃 = {−1 < 0 < 1} and 𝑄 = {−1 < −1βˆ•3 < 1βˆ•3 < 1}. Then the common
refinement of 𝑃 and 𝑄 is 𝑃 ∪ 𝑄 = {−1 < −1βˆ•3 < 0 < 1βˆ•3 < 1}.
Lemma 6.6. If 𝑃 and 𝑄 are partitions of [π‘Ž, 𝑏] and 𝑓 is a bounded function on [π‘Ž, 𝑏], then
𝐿(𝑓 , 𝑄) ≤ π‘ˆ (𝑓 , 𝑃 ).
Proof. Since
𝑃 ⊂ 𝑃 ∪ 𝑄 and
𝑄 ⊂ 𝑃 ∪ 𝑄,
Lemma 6.5 gives
𝐿(𝑓 , 𝑄) ≤ 𝐿(𝑓 , 𝑃 ∪ 𝑄),
π‘ˆ (𝑓 , 𝑃 ∪ 𝑄) ≤ π‘ˆ (𝑓 , 𝑃 ) and
𝐿(𝑓 , 𝑃 ∪ 𝑄) ≤ π‘ˆ (𝑓 , 𝑃 ∪ 𝑄).
Hence
𝐿(𝑓 , 𝑄) ≤ 𝐿(𝑓 , 𝑃 ∪ 𝑄) ≤ π‘ˆ (𝑓 , 𝑃 ∪ 𝑄) ≤ π‘ˆ (𝑓 , 𝑃 ).
Tutorial 6.3.1. 1. Let 𝑃𝑛 = {0 < 1𝑛 < 2𝑛 < β‹― < 𝑛𝑛 }. Then 𝑃𝑛 is a partition of [0, 1] for each 𝑛 ∈ β„•. Show that
𝑃𝑛+1 is not a refinement of 𝑃𝑛 for 𝑛 ≥ 2.
2. Let 𝑓 (π‘₯) = π‘₯3 + π‘₯2 , 𝑃 = {−1 < 0 < 1}, 𝑄 = {−1 < −1βˆ•2 < 0 < 1}. Compute 𝐿(𝑓 , 𝑃 ), 𝐿(𝑓 , 𝑄), π‘ˆ (𝑓 , 𝑃 ),
π‘ˆ (𝑓 , 𝑄) and compare their values.
MATH2022
6.4
Introductory Analysis 2017 Lecture Manual
59
The Riemann Integral
Definition 6.5. Let 𝑓 be a bounded function on the interval [π‘Ž, 𝑏].
(a) We define the lower integral of 𝑓 over [π‘Ž, 𝑏] as
𝑏
𝑓 (𝑑) 𝑑𝑑 ∢= sup{𝐿(𝑓 , 𝑃 ) ∣ 𝑃 a partition of [π‘Ž, 𝑏]}.
∫
π‘Ž
(b) We define the upper integral of 𝑓 over [π‘Ž, 𝑏] as
𝑏
∫
𝑓 (𝑑) 𝑑𝑑 ∢= inf {π‘ˆ (𝑓 , 𝑃 ) ∣ 𝑃 a partition of [π‘Ž, 𝑏]}.
π‘Ž
(c) We say that 𝑓 is Riemann integrable on [π‘Ž, 𝑏] if
𝑏
𝑏
𝑓 (𝑑) 𝑑𝑑 =
∫
π‘Ž
𝑓 (𝑑) 𝑑𝑑,
∫
π‘Ž
𝑏
in which case the common value is denoted by
∫
𝑓 (𝑑) 𝑑𝑑 and is called the Riemann integral of 𝑓 over [π‘Ž, 𝑏].
π‘Ž
Note. By the properties of suprema and infima, see Lemmas 6.6 and 6.3,
𝑏
𝑏
𝑓 (𝑑) 𝑑𝑑 ≤
∫
π‘Ž
𝑓 (𝑑) 𝑑𝑑.
∫
π‘Ž
Hence 𝑓 is Riemann integrable if and only if
𝑏
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 ≤
∫
𝑓 (𝑑) 𝑑𝑑.
π‘Ž
Theorem 6.7 (Integrability Condition). A bounded function 𝑓 on the interval [π‘Ž, 𝑏] is Riemann integrable if and
only if for each πœ€ > 0 there exists a partition 𝑃 of [π‘Ž, 𝑏] such that
π‘ˆ (𝑓 , 𝑃 ) − 𝐿(𝑓 , 𝑃 ) < πœ€.
Proof. ⇒: Let 𝑓 be Riemann integrable on [π‘Ž, 𝑏] and let πœ€ > 0. Then there is a partition 𝑃 of [π‘Ž, 𝑏] such that
𝑏
∫
𝑏
𝑓 (𝑑) 𝑑𝑑 ≤ π‘ˆ (𝑓 , 𝑃 ) <
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑 +
πœ€
2
π‘Ž
and there is a partition 𝑄 of [π‘Ž, 𝑏] such that
𝑏
𝑏
πœ€
𝑓 (𝑑) 𝑑𝑑 − < 𝐿(𝑓 , 𝑄) ≤ 𝑓 (𝑑) 𝑑𝑑.
∫
∫
2
π‘Ž
π‘Ž
60
Introductory Analysis 2017 Lecture Manual
MATH2022
Let 𝑅 = 𝑃 ∪ 𝑄. Then, in view of Lemma 6.5,
𝑏
𝑏
πœ€
πœ€
𝑓 (𝑑) 𝑑𝑑 − = 𝑓 (𝑑) 𝑑𝑑 − < 𝐿(𝑓 , 𝑄) ≤ 𝐿(𝑓 , 𝑅)
∫
2 ∫
2
π‘Ž
π‘Ž
𝑏
𝑏
πœ€
πœ€
𝑓 (𝑑) 𝑑𝑑 + .
≤ π‘ˆ (𝑓 , 𝑅) ≤ π‘ˆ (𝑓 , 𝑃 ) < 𝑓 (𝑑) 𝑑𝑑 + =
∫
2 ∫
2
π‘Ž
π‘Ž
In particular,
𝑏
π‘ˆ (𝑓 , 𝑅) <
πœ€
𝑓 (𝑑) 𝑑𝑑 + ,
2
∫
π‘Ž
𝑏
−𝐿(𝑓 , 𝑅) < −
πœ€
𝑓 (𝑑) 𝑑𝑑 + .
2
∫
π‘Ž
Summing up gives
π‘ˆ (𝑓 , 𝑅) − 𝐿(𝑓 , 𝑅) < πœ€.
⇐: Suppose that there are πœ€ > 0 and a partition π‘ƒπœ€ of [π‘Ž, 𝑏] such that
π‘ˆ (𝑓 , π‘ƒπœ€ ) − 𝐿(𝑓 , π‘ƒπœ€ ) < πœ€.
Then from
𝑏
𝑓 (𝑑) 𝑑𝑑 ≥ 𝐿(𝑓 , π‘ƒπœ€ ),
∫
π‘Ž
𝑏
∫
𝑓 (𝑑) 𝑑𝑑 ≤ π‘ˆ (𝑓 , π‘ƒπœ€ )
π‘Ž
we get
𝑏
∫
𝑏
𝑓 (𝑑) 𝑑𝑑 −
π‘Ž
𝑓 (𝑑) 𝑑𝑑 ≤ π‘ˆ (𝑓 , π‘ƒπœ€ ) − 𝐿(𝑓 , π‘ƒπœ€ ) < πœ€.
∫
π‘Ž
Hence
𝑏
∀πœ€ > 0
0≤
∫
𝑏
𝑓 (𝑑) 𝑑𝑑 −
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑 ≤ πœ€,
π‘Ž
giving
𝑏
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 =
∫
𝑓 (𝑑) 𝑑𝑑.
π‘Ž
Theorem 6.8. If 𝑓 is an increasing function on the interval [π‘Ž, 𝑏], then 𝑓 is Riemann integrable on [π‘Ž, 𝑏].
Proof. By assumption, 𝑓 (π‘Ž) ≤ 𝑓 (π‘₯) ≤ 𝑓 (𝑏) for all π‘₯ ∈ [π‘Ž, 𝑏]. Hence 𝑓 is bounded.
Let πœ€ > 0 and choose 𝑁 ∈ β„• such that
𝑓 (𝑏) − 𝑓 (π‘Ž)
(𝑏 − π‘Ž)
< πœ€.
𝑁
MATH2022
61
Introductory Analysis 2017 Lecture Manual
Consider the partition
𝑃 =
{
}
2(𝑏 − π‘Ž)
(𝑁 − 1)(𝑏 − π‘Ž)
𝑁(𝑏 − π‘Ž)
𝑏−π‘Ž
π‘Ž<π‘Ž+
<π‘Ž+
<β‹―<π‘Ž+
<π‘Ž+
=𝑏 .
𝑁
𝑁
𝑁
𝑁
Observing that over each subinterval of [π‘Ž, 𝑏] the increasing function 𝑓 takes its minimum at the left endpoint and
its maximum at the right endpoint of the interval, it follows that
{
]}
(
)
[
𝑁
|
∑
𝑗(𝑏 − π‘Ž)
𝑗(𝑏 − π‘Ž)
(𝑗 − 1)(𝑏 − π‘Ž)
𝑏−π‘Ž
|
π‘ˆ (𝑓 , 𝑃 ) =
sup 𝑓 (π‘₯) | π‘₯ ∈ π‘Ž +
,π‘Ž +
=
𝑓 π‘Ž+
,
|
𝑁
𝑁
𝑁
𝑁
𝑁
|
𝑗=1
𝑗=1
{
]}
(
)
[
𝑁
𝑁
|
∑
∑
𝑗(𝑏 − π‘Ž)
(𝑗 − 1)(𝑏 − π‘Ž)
(𝑗 − 1)(𝑏 − π‘Ž)
𝑏−π‘Ž
𝑏−π‘Ž
|
𝐿(𝑓 , 𝑃 ) =
inf 𝑓 (π‘₯) | π‘₯ ∈ π‘Ž +
,π‘Ž +
=
𝑓 π‘Ž+
.
|
𝑁
𝑁
𝑁
𝑁
𝑁
|
𝑗=1
𝑗=1
𝑁
∑
𝑏−π‘Ž
Hence
π‘ˆ (𝑓 , 𝑃 ) − 𝐿(𝑓 , 𝑝) =
𝑁
∑
𝑏−π‘Ž
𝑁
𝑗=1
=
(
π‘Ž+
𝑓
𝑗(𝑏 − π‘Ž)
𝑁
)
−
𝑁−1
∑
𝑏−π‘Ž
𝑓
𝑁
𝑗=0
(
π‘Ž+
𝑗(𝑏 − π‘Ž)
𝑁
)
𝑏−π‘Ž
(𝑓 (𝑏) − 𝑓 (π‘Ž)) < πœ€.
𝑁
Hence 𝑓 is integrable by Theorem 6.7.
1
Worked Example 6.4.1. 1. Let 𝑓 (π‘₯) = π‘₯. Then 𝑓 is Riemann integrable on [0, 1] and ∫0 𝑓 (𝑑) 𝑑𝑑 =
this let 𝑃𝑛 = {0 < 1𝑛 < 2𝑛 < β‹― < 𝑛𝑛 }. Then
𝐿(𝑓 , 𝑃𝑛 ) =
𝑛
∑
1 𝑗−1
𝑗=1
𝑛
𝑛
and
π‘ˆ (𝑓 , 𝑃𝑛 ) =
𝑛
∑
1 𝑗
𝑗=1
𝑛 𝑛
=
=
1
. To see
2
(
)
1 (𝑛 − 1)𝑛 1
1
=
1−
2
2
𝑛
𝑛2
(
)
1 𝑛(𝑛 + 1) 1
1
=
1+
.
2
2
𝑛
𝑛2
1
, which can be made arbitrarily small but taking 𝑛 sufficiently large. Thus 𝑓 is
𝑛
Riemann integrable. Also the integral is sandwiched between 𝐿(𝑓 , 𝑃𝑛 ) and π‘ˆ (𝑓 , 𝑃𝑛 ), both of which converge to
1
1
1
, giving ∫0 𝑓 (𝑑) 𝑑𝑑 = .
2
2
{
0 if π‘₯ ∉ β„š
. Then 𝑓 is not Riemann integrable on [0, 1], since for each partition 𝑃 of [0, 1],
2. Let 𝑓 (π‘₯) =
1 if π‘₯ ∈ β„š
π‘ˆ (𝑓 , 𝑃 ) = 1, while 𝐿(𝑓 , 𝑃 ) = 0.
Thus π‘ˆ (𝑓 , 𝑃𝑛 ) − 𝐿(𝑓 , 𝑃𝑛 ) =
2
2
and 𝐿(sgn, 𝑃𝑛 ) = − .
𝑛
𝑛
4
4
Thus π‘ˆ (sgn, 𝑃𝑛 ) − 𝐿(sgn, 𝑃𝑛 ) = → 0 as 𝑛 → ∞. For each πœ€ > 0 there exists 𝑛 ∈ β„• such that < πœ€. Hence sgn
𝑛
𝑛
1
is Riemann integrable on [−1, 1]. Finally, lim π‘ˆ (sgn, 𝑃𝑛 ) = lim 𝐿(sgn, 𝑃𝑛 ) = 0 gives ∫−1 sgn(π‘₯) 𝑑π‘₯ = 0.
3. Let 𝑓 (π‘₯) = sgn(π‘₯) on [−1, 1]. Let 𝑃𝑛 ∢= {−1 < − 1𝑛 < 1𝑛 < 1} then π‘ˆ (sgn, 𝑃𝑛 ) =
𝑛→∞
𝑛→∞
Tutorial 6.4.1. Let 𝑓 ∢ [π‘Ž, 𝑏] → ℝ be bounded. For 𝑛 ∈ β„•∗ let 𝑃𝑛 ∈ (π‘Ž, 𝑏) such that lim 𝐿(𝑓 , 𝑃𝑛 ) =
𝑛→∞
𝑏
lim π‘ˆ (𝑓 , 𝑃𝑛 ). Show that 𝑓 is Riemann integrable on [π‘Ž, 𝑏] and that ∫π‘Ž 𝑓 (π‘₯) 𝑑π‘₯ = lim 𝐿(𝑓 , 𝑃𝑛 ) = lim π‘ˆ (𝑓 , 𝑃𝑛 ).
𝑛→∞
𝑛→∞
1
Tutorial 6.4.2. Show that 𝑓 (π‘₯) = π‘₯2 is Riemann integrable on [0, 1] and that ∫0 𝑓 (𝑑) 𝑑𝑑 =
𝑛→∞
1
.
3
62
Introductory Analysis 2017 Lecture Manual
6.5
MATH2022
Properties of the Integral
Theorem 6.9 (Homogeneity). If 𝑓 is a bounded Riemann integrable function on the interval [π‘Ž, 𝑏] and if 𝑐 ∈ ℝ,
then 𝑐𝑓 is Riemann integrable and
𝑏
𝑏
𝑐𝑓 (𝑑) 𝑑𝑑 = 𝑐
∫
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑.
π‘Ž
Proof. Note that whenever 𝑓 is bounded, then also 𝑐𝑓 is bounded.
Case I: 𝑐 ≥ 0.
Let 𝑃 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯𝑁 = 𝑏} be a partition of [π‘Ž, 𝑏]. Then, by Lemma 6.1,
π‘ˆ (𝑐𝑓 , 𝑃 ) =
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{𝑐𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
𝑗=1
=
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 )𝑐 sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
𝑗=1
=𝑐
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
𝑗=1
= π‘π‘ˆ (𝑓 , 𝑃 ).
Thus
𝑏
𝑐𝑓 (𝑑) 𝑑𝑑 = inf {π‘ˆ (𝑐𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)}
∫
π‘Ž
= inf {π‘π‘ˆ (𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)}
= 𝑐 inf {π‘ˆ (𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)}
𝑏
=𝑐
𝑓 (𝑑) 𝑑𝑑.
∫
π‘Ž
Interchanging sup and inf we get 𝐿(𝑐𝑓 , 𝑃 ) = 𝑐𝐿(𝑓 , 𝑃 ) and then
𝑏
∫
𝑏
𝑐𝑓 (𝑑) 𝑑𝑑 = 𝑐
π‘Ž
𝑏
But
∫
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑.
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 =
∫
𝑓 (𝑑) 𝑑𝑑 now gives
π‘Ž
𝑏
∫
𝑏
𝑐𝑓 (𝑑) 𝑑𝑑 = 𝑐
π‘Ž
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 = 𝑐
∫
∫
π‘Ž
𝑓 (𝑑) 𝑑𝑑 = 𝑐
π‘Ž
𝑏
𝑐𝑓 (𝑑) 𝑑𝑑 = 𝑐
∫
π‘Ž
𝑏
Hence 𝑐𝑓 is Riemann integrable and
𝑏
∫
π‘Ž
𝑓 (𝑑) 𝑑𝑑.
𝑏
𝑓 (𝑑) 𝑑𝑑 =
∫
π‘Ž
𝑐𝑓 (𝑑) 𝑑𝑑.
MATH2022
63
Introductory Analysis 2017 Lecture Manual
Case II: 𝑐 = −1. Let 𝑃 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯𝑁 = 𝑏} be a partition of [π‘Ž, 𝑏]. Then, using the results of Section
6.1,
π‘ˆ (−𝑓 , 𝑃 ) =
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{−𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
𝑗=1
=−
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) inf {𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
𝑗=1
= −𝐿(𝑓 , 𝑃 ).
Thus
𝑏
∫
(−𝑓 (𝑑)) 𝑑𝑑 = inf {π‘ˆ (−𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)}
π‘Ž
= inf {−𝐿(𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)}
= − sup{𝐿(𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)}
𝑏
= − 𝑓 (𝑑) 𝑑𝑑.
∫
π‘Ž
Since so far we have only used that 𝑓 and −𝑓 are bounded, we can replace 𝑓 by −𝑓 and obtain
𝑏
∫
𝑏
𝑓 (𝑑) 𝑑𝑑 =
∫
π‘Ž
𝑏
(−(−𝑓 (𝑑))) 𝑑𝑑 = − (−𝑓 (𝑑)) 𝑑𝑑,
∫
π‘Ž
π‘Ž
and therefore
𝑏
∫
𝑏
(−𝑓 (𝑑)) 𝑑𝑑 = − 𝑓 (𝑑) 𝑑𝑑.
∫
π‘Ž
π‘Ž
Thus
𝑏
∫
π‘Ž
𝑏
𝑏
𝑏
𝑏
(−𝑓 (𝑑)) 𝑑𝑑 = − 𝑓 (𝑑) 𝑑𝑑 = − 𝑓 (𝑑) 𝑑𝑑 = − 𝑓 (𝑑) 𝑑𝑑 = (−𝑓 (𝑑)) 𝑑𝑑.
∫
∫
∫
∫
π‘Ž
π‘Ž
π‘Ž
This shows that −𝑓 is Riemann integrable and that
𝑏
∫
𝑏
(−𝑓 (𝑑)) 𝑑𝑑 = −
π‘Ž
𝑓 (𝑑) 𝑑𝑑.
∫
π‘Ž
Case III: 𝑐 < 0.
By Case I, (−𝑐)𝑓 is Riemann integrable, and
𝑏
∫
𝑏
(−𝑐)𝑓 (𝑑) 𝑑𝑑 = (−𝑐)
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑.
π‘Ž
Then, by Case II, −(−𝑐)𝑓 is Riemann integrable, and
𝑏
∫
π‘Ž
𝑏
(−(−𝑐)𝑓 (𝑑)) 𝑑𝑑 = −
∫
π‘Ž
(−𝑐)𝑓 (𝑑) 𝑑𝑑.
π‘Ž
64
Introductory Analysis 2017 Lecture Manual
MATH2022
Thus 𝑐𝑓 is Riemann integrable, and
𝑏
∫
𝑏
𝑐𝑓 (𝑑) 𝑑𝑑 = −
∫
π‘Ž
𝑏
(−𝑐)𝑓 (𝑑) 𝑑𝑑 = −(−𝑐)
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 = 𝑐
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑.
π‘Ž
Lemma 6.10. If 𝑓 and 𝑔 are bounded functions on [π‘Ž, 𝑏] and 𝑃 is a partition of [π‘Ž, 𝑏], then
π‘ˆ (𝑓 + 𝑔, 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃 ) + π‘ˆ (𝑔, 𝑃 ),
𝐿(𝑓 + 𝑔, 𝑃 ) ≥ 𝐿(𝑓 , 𝑃 ) + 𝐿(𝑔, 𝑃 ).
Proof. Let 𝑃 = {π‘₯0 < π‘₯1 < β‹― < π‘₯𝑁 } be a partion of [π‘Ž, 𝑏]. For each 𝑗 ∈ {1, … , 𝑁} and 𝑑 ∈ [π‘₯𝑗−1 , π‘₯𝑗 ] we have
𝑓 (𝑑) + 𝑔(𝑑) ≤ sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]} + sup{𝑔(π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]},
whence
sup{𝑓 (π‘₯) + 𝑔(π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]} ≤ sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]} + sup{𝑔(π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}.
Multiplying with (π‘₯𝑗 − π‘₯𝑗−1 ) and summing up over 𝑗 gives
π‘ˆ (𝑓 + 𝑔, 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃 ) + π‘ˆ (𝑔, 𝑃 ).
𝐿(𝑓 +𝑔, 𝑃 ) ≥ 𝐿(𝑓 , 𝑃 )+𝐿(𝑔, 𝑃 ) can be shown similarly, or we may use the estimate for upper sums and 𝐿(𝑓 , 𝑃 ) =
−π‘ˆ (−𝑓 , 𝑃 ).
Theorem 6.11 (Additivity). If 𝑓 and 𝑔 are bounded Riemann integrable functions on the interval [π‘Ž, 𝑏], then 𝑓 + 𝑔
is Riemann integrable and
𝑏
∫
𝑏
(𝑓 (𝑑) + 𝑔(𝑑)) 𝑑𝑑 =
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 +
∫
π‘Ž
𝑔(𝑑) 𝑑𝑑.
∫
π‘Ž
Proof. Let πœ€ > 0 and choose 𝑃1 , 𝑃2 ∈ (π‘Ž, 𝑏) such that
𝑏
∫
πœ€
𝑓 (𝑑) 𝑑𝑑 ≥ π‘ˆ (𝑓 , 𝑃1 ) − ,
2
π‘Ž
𝑏
∫
πœ€
𝑔(𝑑) 𝑑𝑑 ≥ π‘ˆ (𝑔, 𝑃2 ) − .
2
π‘Ž
Let 𝑃 = 𝑃1 ∪ 𝑃2 . By Lemmas 6.5 and 6.10,
π‘ˆ (𝑓 + 𝑔, 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃 ) + π‘ˆ (𝑔, 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃1 ) + π‘ˆ (𝑔, 𝑃2 ).
Thus
𝑏
∫
(𝑓 (𝑑) + 𝑔(𝑑)) 𝑑𝑑 ≤ π‘ˆ (𝑓 + 𝑔, 𝑃 ) ≤ π‘ˆ (𝑓 , 𝑃1 ) + π‘ˆ (𝑔, 𝑃2 )
π‘Ž
𝑏
≤
∫
𝑏
𝑓 (𝑑) 𝑑𝑑 +
π‘Ž
∫
𝑔(𝑑) 𝑑𝑑 + πœ€.
π‘Ž
Since πœ€ > 0 was arbitrary, it follows that
𝑏
∫
π‘Ž
𝑏
(𝑓 (𝑑) + 𝑔(𝑑)) 𝑑𝑑 ≤
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 +
∫
π‘Ž
𝑔(𝑑) 𝑑𝑑.
MATH2022
65
Introductory Analysis 2017 Lecture Manual
Similarly,
𝑏
𝑏
(𝑓 (𝑑) + 𝑔(𝑑)) 𝑑𝑑 ≥
∫
𝑓 (𝑑) 𝑑𝑑 +
∫
π‘Ž
𝑏
𝑔(𝑑) 𝑑𝑑.
∫
π‘Ž
π‘Ž
But since 𝑓 and 𝑔 are integrable over [π‘Ž, 𝑏]
𝑏
𝑏
𝑓 (𝑑) 𝑑𝑑 +
∫
π‘Ž
𝑏
𝑔(𝑑) 𝑑𝑑 =
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 +
∫
𝑔(𝑑) 𝑑𝑑 =
∫
π‘Ž
𝑏
𝑓 (𝑑) 𝑑𝑑 +
∫
π‘Ž
𝑏
π‘Ž
∫
𝑔(𝑑) 𝑑𝑑,
π‘Ž
and therefore
𝑏
∫
𝑏
(𝑓 (𝑑) + 𝑔(𝑑)) 𝑑𝑑 ≤
π‘Ž
∫
𝑏
𝑓 (𝑑) 𝑑𝑑 +
∫
π‘Ž
𝑏
𝑔(𝑑) 𝑑𝑑 ≤
(𝑓 (𝑑) + 𝑔(𝑑)) 𝑑𝑑,
∫
π‘Ž
π‘Ž
𝑏
𝑏
which shows that 𝑓 + 𝑔 is integrable, and
𝑏
(𝑓 (𝑑) 𝑑𝑑 + 𝑔(𝑑)) 𝑑𝑑 =
∫
π‘Ž
∫
𝑓 (𝑑) 𝑑𝑑 +
π‘Ž
∫
𝑔(𝑑) 𝑑𝑑.
π‘Ž
Theorem 6.12. If 𝑓 is a bounded Riemann integrable function on the interval [π‘Ž, 𝑏] and on the interval [𝑏, 𝑐], then
𝑓 is Riemann integrable on [π‘Ž, 𝑐] and
𝑐
∫
𝑏
𝑓 (π‘₯) 𝑑π‘₯ =
π‘Ž
∫
𝑐
𝑓 (π‘₯) 𝑑π‘₯ +
π‘Ž
∫
𝑓 (π‘₯) 𝑑π‘₯.
𝑏
Proof. Let 𝑃 be a partition of [π‘Ž, 𝑏] and 𝑄 be a partition of [𝑏, 𝑐]. Writing
𝑃 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯π‘š = 𝑏},
𝑄 = {𝑏 = π‘₯π‘š < π‘₯π‘š+1 < β‹― < π‘₯𝑛 = 𝑐},
we have that 𝑅 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯𝑛 = 𝑐} is a partition of [π‘Ž, 𝑐] and
π‘ˆ (𝑓 , 𝑅) =
=
𝑛
∑
𝑗=1
π‘š
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]} +
𝑗=1
𝑛
∑
(π‘₯𝑗 − π‘₯𝑗−1 ) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]}
𝑗=π‘š+1
= π‘ˆ (𝑓 , 𝑃 ) + π‘ˆ (𝑓 , 𝑄).
Hence, by Lemmas 6.1 and 6.2,
𝑐
∫
𝑓 (π‘₯) 𝑑π‘₯ = inf {π‘ˆ (𝑓 , 𝑆) ∣ 𝑆 ∈ (π‘Ž, 𝑐)}
π‘Ž
≤ inf {π‘ˆ (𝑓 , 𝑅) ∣ 𝑅 ∈ (π‘Ž, 𝑐), 𝑏 ∈ 𝑅}
= inf {π‘ˆ (𝑓 , 𝑃 ) + π‘ˆ (𝑓 , 𝑄) ∣ 𝑃 ∈ (π‘Ž, 𝑏), 𝑄 ∈ (𝑏, 𝑐)}
= inf {π‘ˆ (𝑓 , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)} + inf {π‘ˆ (𝑓 , 𝑄) ∣ 𝑄 ∈ (𝑏, 𝑐)}
𝑏
=
∫
π‘Ž
𝑐
𝑓 (π‘₯) 𝑑π‘₯ +
∫
𝑏
𝑓 (π‘₯) 𝑑π‘₯.
66
Introductory Analysis 2017 Lecture Manual
MATH2022
Similarly
𝑐
∫
𝑓 (π‘₯) 𝑑π‘₯ = sup{𝐿(𝑓 , 𝑆) ∣ 𝑆 ∈ (π‘Ž, 𝑐)}
π‘Ž
≥ sup{𝐿(𝑓 , 𝑃 ) + 𝐿(𝑓 , 𝑄) ∣ 𝑃 ∈ (π‘Ž, 𝑏), 𝑄 ∈ (𝑏, 𝑐)}
𝑏
=
𝑐
𝑓 (π‘₯) 𝑑π‘₯ +
∫
π‘Ž
𝑓 (π‘₯) 𝑑π‘₯.
∫
𝑏
Since 𝑓 is integrable over [π‘Ž, 𝑏] and over [𝑏, 𝑐], it follows from above that
𝑐
∫
𝑏
𝑓 (π‘₯) 𝑑π‘₯ ≤
π‘Ž
∫
𝑐
𝑓 (π‘₯) 𝑑π‘₯ +
∫
π‘Ž
𝑐
𝑓 (π‘₯) 𝑑π‘₯ ≤
∫
𝑓 (π‘₯) 𝑑π‘₯.
π‘Ž
𝑏
Therefore 𝑓 is integrable over [π‘Ž, 𝑐] and
𝑐
∫
𝑏
𝑓 (π‘₯) 𝑑π‘₯ =
π‘Ž
∫
𝑐
𝑓 (π‘₯) 𝑑π‘₯ +
π‘Ž
∫
𝑓 (π‘₯) 𝑑π‘₯.
𝑏
Lemma 6.13. Let 𝑓 be a continuous function on [π‘Ž, 𝑏] and let πœ€ > 0. Then there is a partition  = {π‘₯0 < π‘₯1 <
β‹― < π‘₯𝑁 } of [π‘Ž, 𝑏] such that |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| ≤ πœ€ for all 𝑗 ∈ {1, … , 𝑁} and all π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ].
Proof. Let πœ€ > 0 and put π‘₯0 = π‘Ž. For 𝑗 ∈ β„•∗ we define numbers π‘₯𝑗 recursively as follows: Let
𝐸𝑗 = {π‘₯ ∈ [π‘₯𝑗−1 , 𝑏] ∣ |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| ≥ πœ€}.
Note that 𝐸𝑗 = ∅ if π‘₯𝑗−1 = 𝑏.
If 𝐸𝑗 = ∅, let π‘₯𝑗 = 𝑏.
If 𝐸𝑗 ≠ ∅, let π‘₯𝑗 = inf 𝐸𝑗 .
Claim 1. If π‘₯𝑗−1 < 𝑏, then π‘₯𝑗−1 < π‘₯𝑗 .
If 𝐸𝑗 = ∅, then π‘₯𝑗−1 < 𝑏 = π‘₯𝑗 . Now let 𝐸𝑗 ≠ ∅. Since 𝑓 is continuous at π‘₯𝑗−1 , there is 𝛿 > 0 such that
|𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| < πœ€ whenever π‘₯ ∈ [π‘Ž, 𝑏] with |π‘₯ − π‘₯𝑗−1 | < 𝛿. Hence π‘₯ ∈ 𝐸𝑗 implies π‘₯ ≥ π‘₯𝑗−1 and |π‘₯ − π‘₯𝑗−1 | ≥ 𝛿,
and therefore π‘₯ ≥ π‘₯𝑗−1 + 𝛿. Thus π‘₯𝑗−1 + 𝛿 is a lower bound of 𝐸𝑗 and therefore
π‘₯𝑗 = inf 𝐸𝑗 ≥ π‘₯𝑗−1 + 𝛿 > π‘₯𝑗−1 .
Claim 2. If π‘₯𝑗−1 < 𝑏, then |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| < πœ€ for all π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ), |𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 )| ≤ πœ€, and |𝑓 (π‘₯𝑗 ) −
𝑓 (π‘₯𝑗−1 )| = πœ€ if 𝐸𝑗 ≠ ∅.
Indeed, if π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ), then π‘₯ ∈ [π‘₯𝑗−1 , 𝑏] and π‘₯ < inf 𝐸𝑗 , and so π‘₯ ∉ 𝐸𝑗 . By definition of 𝐸𝑗 it follows that
|𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| < πœ€, and, since π‘₯𝑗−1 < π‘₯𝑗 by Claim 1, the continuity of 𝑓 and the absolute value function give
|𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 )| = lim− |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| ≤ πœ€.
π‘₯→π‘₯𝑗
If 𝐸𝑗 ≠ ∅, then there is a sequence (π‘Žπ‘˜ ) in 𝐸𝑗 such that
lim π‘Žπ‘˜ = π‘₯𝑗 .
π‘˜→∞
Observing |𝑓 (π‘Žπ‘˜ ) − 𝑓 (π‘₯𝑗−1 )| ≥ πœ€, the same continuity argument as above gives
|𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 )| = lim |𝑓 (π‘Žπ‘˜ ) − 𝑓 (π‘₯𝑗−1 )| ≥ πœ€.
π‘˜→∞
Claim 3. There is an index 𝑛 such that π‘₯𝑛 = 𝑏.
MATH2022
67
Introductory Analysis 2017 Lecture Manual
Assume that Claim 3 is false. Then, by Claim 1, (π‘₯𝑗 ) is an increasing sequence, bounded by 𝑏, which therefore has
a limit, say π‘₯, see Theorem 6.11, and π‘₯ ∈ [π‘Ž, 𝑏]. Then the continuity of 𝑓 at π‘₯ gives
|𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 )| ≤ |𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯)| + |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| → 0 as 𝑗 → ∞.
Hence there is 𝑗 such that |𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 )| < πœ€, contradicting Claim 2 since π‘₯𝑗 ≠ 𝑏 implies 𝐸𝑗 ≠ ∅.
Putting 𝑁 = min{𝑗 ∈ β„• ∣ π‘₯𝑗 = 𝑏} completes the proof.
Theorem 6.14. If 𝑓 is a continuous function on [π‘Ž, 𝑏], then 𝑓 is Riemann integrable on [π‘Ž, 𝑏].
Proof. From Theorem 3.17, we know that a continuous function on a closed interval [π‘Ž, 𝑏] is bounded. Let πœ€ > 0
and let 𝑃 = {π‘₯0 < β‹― < π‘₯𝑁 } be a partition of [π‘Ž, 𝑏] according to Lemma 6.13. Then, for 𝑗 ∈ {1, … , 𝑁}, with
Lemma 6.1,
sup
𝑓 (π‘₯) −
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
inf
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯) =
sup
[𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )] −
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
=
sup
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
≤2
[𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )] +
sup
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
inf
[𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )]
sup
[𝑓 (π‘₯𝑗−1 ) − 𝑓 (π‘₯)]
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
|𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )|
≤ 2πœ€
in view of Lemma 6.13, Claim 2. Hence
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 )
π‘ˆ (𝑓 , 𝑃 ) − 𝐿(𝑓 , 𝑃 ) =
𝑗=1
≤
[
]
sup
𝑓 (π‘₯) −
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
inf
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 (π‘₯)]
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 )2πœ€ = 2(𝑏 − π‘Ž)πœ€.
𝑗=1
By Theorem 6.7, 𝑓 is Riemann integrable.
Theorem 6.15 (Fundamental Theorem of Calculus). Let 𝑓 be a differentiable function on [π‘Ž, 𝑏] such that 𝑓 ′ is
continuous on [π‘Ž, 𝑏]. Then 𝑓 ′ is Riemann integrable on [π‘Ž, 𝑏] and
𝑏
∫
𝑓 ′ (𝑑) 𝑑𝑑 = 𝑓 (𝑏) − 𝑓 (π‘Ž).
π‘Ž
Proof. Since 𝑓 ′ is continuous, 𝑓 ′ is Riemann integrable by the previous theorem. Let 𝑃 = {π‘₯
0 < β‹― < π‘₯𝑁 } be a
partition of [π‘Ž, 𝑏]. By the First Mean Value Theorem, Theorem 4.5, for each 𝑗 ∈ {1, … , 𝑁} there is 𝑐𝑗 ∈ (π‘₯𝑗−1 , π‘₯𝑗 )
such that
𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 ) = (π‘₯𝑗 − π‘₯𝑗−1 )𝑓 ′ (𝑐𝑗 ).
Then
𝐿(𝑓 ′ , 𝑃 ) =
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 )
𝑗=1
≤
inf
π‘₯∈[π‘₯𝑗−1 ,π‘₯𝑗 ]
𝑓 ′ (π‘₯)
𝑁
∑
(π‘₯𝑗 − π‘₯𝑗−1 )𝑓 ′ (𝑐𝑗 )
𝑗=1
=
𝑁
∑
𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 ) = 𝑓 (𝑏) − 𝑓 (π‘Ž).
𝑗=1
Hence
𝑏
∫
π‘Ž
𝑓 ′ (π‘₯) 𝑑π‘₯ = sup{𝐿(𝑓 ′ , 𝑃 ) ∣ 𝑃 ∈ (π‘Ž, 𝑏)} ≤ 𝑓 (𝑏) − 𝑓 (π‘Ž).
68
Introductory Analysis 2017 Lecture Manual
MATH2022
Similarly,
𝑏
𝑓 ′ (π‘₯) 𝑑π‘₯ ≥ 𝑓 (𝑏) − 𝑓 (π‘Ž).
∫
π‘Ž
Since 𝑓 ′ is Riemann integrable, the upper and lower Riemann integrals are equal, and
𝑏
∫
𝑓 ′ (π‘₯) 𝑑π‘₯ = 𝑓 (𝑏) − 𝑓 (π‘Ž)
π‘Ž
follows.
Tutorial 6.5.1. Let 𝑓 be a bounded Riemann integrable function on [π‘Ž, 𝑏] and let 𝑃 be a partition of [π‘Ž, 𝑏]. Then
prove that
𝑏
(𝑏 − π‘Ž) inf {𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘Ž, 𝑏]} ≤ 𝐿(𝑓 , 𝑃 ) ≤
∫
𝑓 (𝑑) 𝑑𝑑 ≤ π‘ˆ (𝑓 , 𝑃 ) ≤ (𝑏 − π‘Ž) sup{𝑓 (π‘₯) ∣ π‘₯ ∈ [π‘Ž, 𝑏]}.
π‘Ž
*Tutorial 6.5.2. Let π‘Ž < 𝑏, let 𝑓 ∢ [π‘Ž, 𝑏] → ℝ and let 𝑃 = {π‘Ž = π‘₯0 < π‘₯1 < β‹― < π‘₯𝑛 = 𝑏} ∈ (π‘Ž, 𝑏). Define
𝑉 (𝑓 , 𝑃 ) ∢=
𝑛
∑
|𝑓 (π‘₯𝑗 ) − 𝑓 (π‘₯𝑗−1 )|
𝑗=1
and
π‘‰π‘Žπ‘ 𝑓 ∢= sup{𝑉 (𝑓 , 𝑃 ) ∢ 𝑃 ∈ (π‘Ž, 𝑏)}.
One sets π‘‰π‘Žπ‘Ž 𝑓 = 0. The function 𝑓 is said to be of bounded variation on [π‘Ž, 𝑏] if π‘‰π‘Žπ‘ 𝑓 < ∞.
1. Let π‘Ž ≤ π‘₯ < 𝑦 ≤ 𝑏. Show that π‘‰π‘Žπ‘₯ 𝑓 + |𝑓 (𝑦) − 𝑓 (π‘₯)| ≤ π‘‰π‘Žπ‘¦ 𝑓 .
2. Assume that 𝑓 is of bounded variation on [π‘Ž, 𝑏] and for π‘₯ ∈ [π‘Ž, 𝑏] define
𝑓1 (π‘₯) = π‘‰π‘Žπ‘₯ 𝑓 ,
𝑓2 (π‘₯) = π‘‰π‘Žπ‘₯ 𝑓 − 𝑓 (π‘₯).
Show that 𝑓1 and 𝑓2 are increasing on [π‘Ž, 𝑏] and that 𝑓 = 𝑓1 − 𝑓2 .
3. Show that 𝑓 is of bounded variation on [π‘Ž, 𝑏] if and only if 𝑓 is the difference of two increasing functions on
[π‘Ž, 𝑏].
4. Show that if 𝑓 is of bounded variation on [π‘Ž, 𝑏], then 𝑓 is Riemann integrable over [π‘Ž, 𝑏].
5. Give an example of a continuous function on [π‘Ž, 𝑏] which is not of bounded variation.
Chapter 7
Introduction to Metric Spaces
7.1
Metric Spaces
Definition 7.1. Let 𝑋 be a set and 𝑑 ∢ 𝑋 × π‘‹ → ℝ. 𝑑 is called a metric on 𝑋 if for all π‘₯, 𝑦, 𝑧 ∈ 𝑋 the following
properties are satisfied:
(i) 𝑑(π‘₯, 𝑦) = 0 ⇔ π‘₯ = 𝑦,
(ii) 𝑑(π‘₯, 𝑦) = 𝑑(𝑦, π‘₯)
(symmetry),
(iii) 𝑑(π‘₯, 𝑧) ≤ 𝑑(π‘₯, 𝑦) + 𝑑(𝑦, 𝑧)
(triangle inequality).
If 𝑑 is a metric on 𝑋, then (𝑋, 𝑑) or shortly 𝑋 is called a metric space.
Remark 7.1.1. If (𝑋, 𝑑) is a metric space, then 𝑑(π‘₯, 𝑦) ≥ 0 for all π‘₯, 𝑦 ∈ 𝑋.
Proof. From
0 = 𝑑(π‘₯, π‘₯) ≤ 𝑑(π‘₯, 𝑦) + 𝑑(𝑦, π‘₯) = 𝑑(π‘₯, 𝑦) + 𝑑(π‘₯, 𝑦) = 2𝑑(π‘₯, 𝑦)
(i)
(ii)
(iii)
it follows that 𝑑(π‘₯, 𝑦) ≥ 0.
In the following, 𝑛 ∈ β„•∗ = β„• ⧡ {0} unless stated otherwise.
Worked Example 7.1.1. 1. For π‘₯, 𝑦 ∈ ℝ let 𝑑(π‘₯, 𝑦) = |π‘₯ − 𝑦|. Then (ℝ, 𝑑) is a metric space.
2. For π‘₯, 𝑦 ∈ β„‚ let 𝑑(π‘₯, 𝑦) = |π‘₯ − 𝑦|. Then (β„‚, 𝑑) is a metric space.
( 𝑛
)1
2
∑
3. For π‘₯ = (π‘₯𝑗 )𝑛𝑗=1 , 𝑦 = (𝑦𝑗 )𝑛𝑗=1 ∈ ℂ𝑛 let 𝑑(π‘₯, 𝑦) =
|π‘₯𝑗 − 𝑦𝑗 |2 . Then (ℂ𝑛 , 𝑑) is a metric space.
𝑗=1
Solution. Parts 1 and 2 are clear if one observes the triangle inequality for real and complex numbers.
For the triangle inequality in 3, we use the notation
β€–π‘₯β€– = 𝑑(π‘₯, 0).
Then β€–π‘₯ + 𝑦‖ ≤ β€–π‘₯β€– + ‖𝑦‖ can be shown as was done for ℝ𝑛 in Multivariable Calculus. But rather than repeating
that proof we use it to prove the result for ℂ𝑛 . To this end, put π‘₯
Μ‚ = (|π‘₯𝑗 |)𝑛𝑗=1 ∈ ℝ𝑛 . Obviously, β€–Μ‚
π‘₯β€– = β€–π‘₯β€– and
β€–π‘₯ + 𝑦‖ ≤ β€–Μ‚
π‘₯ + 𝑦̂‖ by part 2. Then, from Multivarible Calculus results applicable to π‘₯
Μ‚ and 𝑦̂, it follows that
β€–π‘₯ + 𝑦‖ ≤ β€–Μ‚
π‘₯ + 𝑦̂‖ ≤ β€–Μ‚
π‘₯β€– + β€–Μ‚
𝑦‖ = β€–π‘₯β€– + ‖𝑦‖.
Then, for π‘₯, 𝑦, 𝑧 ∈ ℂ𝑛 ,
𝑑(π‘₯, 𝑧) = β€–π‘₯ − 𝑧‖ = β€–(π‘₯ − 𝑦) + (𝑦 − 𝑧)β€– ≤ β€–π‘₯ − 𝑦‖ + ‖𝑦 − 𝑧‖
= 𝑑(π‘₯, 𝑦) + 𝑑(𝑦, 𝑧).
In the following, let 𝔽 be either ℝ or β„‚.
69
70
Introductory Analysis 2017 Lecture Manual
MATH2022
Worked Example 7.1.2. Let π‘Œ be a nonempty set and let 𝐡(π‘Œ ) = 𝐡(π‘Œ , 𝔽 𝑛 ) be the set of all bounded functions
from π‘Œ to 𝔽 𝑛 . Here a function 𝑓 from π‘Œ to 𝔽 𝑛 is called bounded if there is a real number 𝑀 such that ‖𝑓 (π‘₯)β€– =
( 𝑛
)1
2
∑
|𝑓𝑗 (π‘₯)|2
≤ 𝑀 for all π‘₯ ∈ π‘Œ , where 𝑓𝑗 (π‘₯) is the 𝑗-th component of 𝑓 (π‘₯). For 𝑓 , 𝑔 ∈ 𝐡(π‘Œ ) let
𝑗=1
𝑑(𝑓 , 𝑔) = sup ‖𝑓 (π‘₯) − 𝑔(π‘₯)β€–.
π‘₯∈π‘Œ
Then (𝐡(π‘Œ ), 𝑑) is a metric space.
Solution. We have for 𝑓 , 𝑔 ∈ 𝐡(π‘Œ ) that there are 𝑀, 𝑁 ∈ ℝ such that ‖𝑓 (π‘₯)β€– ≤ 𝑀 and ‖𝑔(π‘₯)β€– ≤ 𝑁 for all
π‘₯ ∈ π‘Œ . Denoting the metric on ℂ𝑛 from Example 7.1.1 by 𝑑𝑛 , it follows for π‘₯ ∈ π‘Œ that
‖𝑓 (π‘₯) − 𝑔(π‘₯)β€– = 𝑑𝑛 (𝑓 (π‘₯), 𝑔(π‘₯)) ≤ 𝑑𝑛 (𝑓 (π‘₯), 0) + 𝑑𝑛 (𝑔(π‘₯), 0) = ‖𝑓 (π‘₯)β€– + ‖𝑔(π‘₯)β€– ≤ 𝑀 + 𝑁,
so that 𝑑(𝑓 , 𝑔) ∈ ℝ.
Clearly, 𝑑(𝑓 , 𝑓 ) = 0 for all 𝑓 ∈ 𝐡(π‘Œ ). If, on the other hand, 𝑓 ≠ 𝑔, then there is π‘₯0 ∈ π‘Œ such that 𝑓 (π‘₯0 ) ≠ 𝑔(π‘₯0 ).
Hence
𝑑(𝑓 , 𝑔) = sup ‖𝑓 (π‘₯) − 𝑔(π‘₯)β€– ≥ ‖𝑓 (π‘₯0 ) − 𝑔(π‘₯0 )β€– > 0.
π‘₯∈π‘Œ
From ‖𝑓 (π‘₯) − 𝑔(π‘₯)β€– = ‖𝑔(π‘₯) − 𝑓 (π‘₯)β€– it easily follows that 𝑑(𝑓 , 𝑔) = 𝑑(𝑔, 𝑓 ).
Finally, for 𝑓 , 𝑔, β„Ž ∈ 𝐡(π‘Œ ),
𝑑(𝑓 , β„Ž) = sup ‖𝑓 (π‘₯) − β„Ž(π‘₯)β€–
π‘₯∈π‘Œ
≤ sup (‖𝑓 (π‘₯) − 𝑔(π‘₯)β€– + ‖𝑔(π‘₯) − β„Ž(π‘₯)β€–)
π‘₯∈π‘Œ
≤ sup (‖𝑓 (π‘₯) − 𝑔(π‘₯)β€– + ‖𝑔(𝑦) − β„Ž(𝑦)β€–)
π‘₯,𝑦∈π‘Œ
= sup ‖𝑓 (π‘₯) − 𝑔(π‘₯)β€– + sup ‖𝑔(𝑦) − β„Ž(𝑦)β€–
π‘₯∈π‘Œ
𝑦∈π‘Œ
= 𝑑(𝑓 , 𝑔) + 𝑑(𝑔, β„Ž).
Proposition 7.1. Let 𝑋 = (𝑋, 𝑑) be a metric space and let π‘Œ ⊂ 𝑋 be nonempty. For π‘₯, 𝑦 ∈ π‘Œ let π‘‘π‘Œ (π‘₯, 𝑦) =
𝑑(π‘₯, 𝑦). Then π‘Œ = (π‘Œ , π‘‘π‘Œ ) = (π‘Œ , 𝑑) is a metric space.
Note. When it is clear what the metric is, it is convenient to write briefly 𝑋 for a metric space, and π‘Œ ⊂ 𝑋 is called
a (metric) subspace of 𝑋.
Worked Example 7.1.3. 1. ℝ𝑛 is a subspace of ℂ𝑛 .
2. If π‘Œ is a nonempty set and 𝐴 ⊂ 𝔽 𝑛 , then 𝐡(π‘Œ , 𝐴) is a subspace of 𝐡(π‘Œ , 𝔽 𝑛 ).
Definition 7.2. A sequence (π‘₯π‘˜ ) in a metric space (𝑋, 𝑑) is called
(i) convergent if there is π‘₯ ∈ 𝑋 such that
lim 𝑑(π‘₯π‘˜ , π‘₯) = 0.
π‘˜→∞
We then write π‘₯ = lim π‘₯π‘˜ and say that (π‘₯π‘˜ ) converges to π‘₯.
π‘˜→∞
(ii) a Cauchy sequence if for all πœ€ > 0 there is 𝑁 ∈ β„• such that for all π‘˜, π‘š ≥ 𝑁: 𝑑(π‘₯π‘˜ , π‘₯π‘š ) < πœ€.
Theorem 7.2. Let (𝑋, 𝑑) be a metric space and let (π‘₯π‘˜ ) be a convergent sequence in 𝑋. Then
1. lim π‘₯π‘˜ is unique.
π‘˜→∞
2. (π‘₯π‘˜ ) is a Cauchy sequence.
Proof. 1. Let π‘₯, 𝑦 ∈ 𝑋 such that lim 𝑑(π‘₯π‘˜ , π‘₯) = 0 and lim 𝑑(π‘₯π‘˜ , 𝑦) = 0. Then
π‘˜→∞
π‘˜→∞
𝑑(π‘₯, 𝑦) ≤ 𝑑(π‘₯, π‘₯π‘˜ ) + 𝑑(π‘₯π‘˜ , 𝑦)
= 𝑑(π‘₯π‘˜ , π‘₯) + 𝑑(π‘₯π‘˜ , 𝑦)
MATH2022
Introductory Analysis 2017 Lecture Manual
71
gives
0 ≤ 𝑑(π‘₯, 𝑦) ≤ lim 𝑑(π‘₯π‘˜ , π‘₯) + lim 𝑑(π‘₯π‘˜ , 𝑦) = 0.
π‘˜→∞
π‘˜→∞
Hence 𝑑(π‘₯, 𝑦) = 0, giving π‘₯ = 𝑦.
πœ€
2. Let πœ€ > 0. Since there is π‘₯ ∈ 𝑋 such that lim 𝑑(π‘₯π‘˜ , π‘₯) = 0, there is 𝑁 ∈ β„• such that 𝑑(π‘₯π‘˜ , π‘₯) < for π‘˜ ≥ 𝑁.
π‘˜→∞
2
Then, for π‘˜, π‘š ≥ 𝑁,
𝑑(π‘₯π‘˜ , π‘₯π‘š ) ≤ 𝑑(π‘₯π‘˜ , π‘₯) + 𝑑(π‘₯π‘š , π‘₯) < πœ€.
Tutorial 7.1.1. 1. Let 𝑋 be a set and let 𝑓 ∢ 𝑋 → ℝ be injective.
a) Show that 𝑑(π‘₯, 𝑦) = |𝑓 (π‘₯) − 𝑓 (𝑦)| defines a metric on 𝑋.
b) Show that a sequence (π‘₯π‘˜ ) in (𝑋, 𝑑) converges to some π‘₯ ∈ 𝑋 if and only if the sequence (𝑓 (π‘₯π‘˜ )) converges to
some 𝑦 ∈ range 𝑓 , and that then 𝑦 = 𝑓 (π‘₯).
2. Let ℝ = ℝ ∪ {−∞, ∞}, define 𝑓 ∢ ℝ → ℝ by
⎧ arctan π‘₯ if π‘₯ ∈ ℝ,
βŽͺ
βŽͺ πœ‹
if π‘₯ = −∞ ,
𝑓 (π‘₯) = ⎨ −
2
βŽͺπœ‹
if π‘₯ = ∞,
βŽͺ
⎩2
and let 𝑑 be as in part 1. Show that a sequence (π‘₯π‘˜ ) in ℝ converges in (ℝ, 𝑑) if and only if either (π‘₯π‘˜ ) converges in
ℝ in the usual sense or π‘₯π‘˜ → −∞ or π‘₯π‘˜ → ∞.
3. Let 𝑋 be a nonempty set. For π‘₯, 𝑦 ∈ 𝑋 define
{
𝑑(π‘₯, 𝑦) =
0
if π‘₯ = 𝑦,
1
if π‘₯ ≠ 𝑦.
a) Show that (𝑋, 𝑑) is a metric space. (𝑋, 𝑑) is called a discrete metric space.
b) Show that a sequence (π‘₯π‘˜ ) converges in (𝑋, 𝑑) if and only if there is 𝑁 ∈ β„• such that π‘₯π‘˜ = π‘₯𝑁 for all π‘˜ > 𝑁.
4. Let π‘₯, 𝑦 ∈ ℂ𝑛 . Prove directly that |(π‘₯, 𝑦)| ≤ β€–π‘₯‖‖𝑦‖ and β€–π‘₯ + 𝑦‖ ≤ β€–π‘₯β€– + ‖𝑦‖.
7.2
Complete Metric Spaces
Definition 7.3. A metric space (𝑋, 𝑑) is called complete if every Cauchy sequence in (𝑋, 𝑑) converges.
Theorem 7.3. ℂ𝑛 is complete.
Proof. Let (π‘§π‘˜ ) be a Cauchy sequence in ℂ𝑛 . Write π‘§π‘˜ = (π‘₯π‘˜,𝑗 + π‘–π‘¦π‘˜,𝑗 )𝑛𝑗=1 with π‘₯π‘˜,𝑗 , π‘¦π‘˜,𝑗 ∈ ℝ. Then
|π‘₯π‘˜,𝑗 − π‘₯𝑙,𝑗 | ≤ 𝑑(π‘§π‘˜ , 𝑧𝑙 ) → 0 as π‘˜, 𝑙 → ∞.
Thus (π‘₯π‘˜,𝑗 )𝑛𝑗=1 is a Cauchy sequence in ℝ for each 𝑗 = 1, … , 𝑛. By Theorem 2.12, the sequence converges, so that
there is π‘Žπ‘— ∈ ℝ such that
lim π‘₯π‘˜,𝑗 = π‘Žπ‘— .
π‘˜→∞
Similarly, there is 𝑏𝑗 ∈ ℝ such that
lim π‘¦π‘˜,𝑗 = 𝑏𝑗 .
π‘˜→∞
Put
𝑧 = (π‘Žπ‘— + 𝑖𝑏𝑗 )𝑛𝑗=1 .
Let πœ€ > 0. Then there is 𝐾 ∈ ℝ such that for all π‘˜ ≥ 𝐾 and all 𝑗 = 1, … , 𝑛,
πœ€
|π‘₯π‘˜,𝑗 − π‘Žπ‘— | < √ ,
2𝑛
πœ€
|π‘¦π‘˜,𝑗 − 𝑏𝑗 | < √ .
2𝑛
72
Introductory Analysis 2017 Lecture Manual
MATH2022
Hence, for π‘˜ ≥ 𝐾,
( 𝑛
)1
2
∑(
)
𝑑(π‘§π‘˜ , 𝑧) =
|π‘₯π‘˜,𝑗 − π‘Žπ‘— |2 + |π‘¦π‘˜,𝑗 − 𝑏𝑗 |2
𝑗=1
<
( 𝑛 (
∑ πœ€2
𝑗=1
πœ€2
+
2𝑛 2𝑛
)) 12
= πœ€.
Theorem 7.4. Let π‘Œ be a nonempty set. Then 𝐡(π‘Œ ) = 𝐡(π‘Œ , 𝔽 𝑛 ) is complete.
Proof. Let (π‘“π‘˜ ) be a Cauchy sequence in 𝐡(π‘Œ ). For each π‘₯ ∈ π‘Œ ,
β€–π‘“π‘˜ (π‘₯) − π‘“π‘š (π‘₯)β€– ≤ 𝑑(π‘“π‘˜ , π‘“π‘š ) → 0 as π‘˜, π‘š → 0.
Hence (π‘“π‘˜ (π‘₯))∞
is a Cauchy sequence in 𝔽 𝑛 . Since 𝔽 𝑛 is complete, there is 𝑓 (π‘₯) ∈ 𝔽 𝑛 such that π‘“π‘˜ (π‘₯) → 𝑓 (π‘₯) as
π‘˜=1
𝑛 → ∞.
First we will show that 𝑓 is bounded. Indeed, for each πœ€ > 0 there is π‘›πœ€ ∈ β„• such that for all π‘˜, π‘š ≥ π‘›πœ€ ,
𝑑(π‘“π‘˜ , π‘“π‘š ) < πœ€. Also, for all π‘₯ ∈ π‘Œ , there is π‘š ≥ 𝑛1 such that
β€–π‘“π‘š (π‘₯) − 𝑓 (π‘₯)β€– < 1.
Hence
‖𝑓 (π‘₯)β€– ≤ ‖𝑓 (π‘₯) − π‘“π‘š (π‘₯)β€– + β€–π‘“π‘š (π‘₯) − 𝑓𝑛1 (π‘₯)β€– + ‖𝑓𝑛1 (π‘₯)β€–
< ‖𝑓𝑛1 (π‘₯)β€– + 2.
Hence
sup ‖𝑓 (π‘₯)β€– ≤ sup ‖𝑓𝑛1 (π‘₯)β€– + 2 < ∞.
π‘₯∈π‘Œ
π‘₯∈π‘Œ
Therefore 𝑓 is bounded.
Finally, for π‘₯ ∈ π‘Œ choose π‘š ≥ 𝑛 πœ€ such that
3
β€–π‘“π‘š (π‘₯) − 𝑓 (π‘₯)β€– <
πœ€
.
4
Then, for π‘˜ ≥ 𝑛 πœ€ ,
4
β€–π‘“π‘˜ (π‘₯) − 𝑓 (π‘₯)β€– ≤ β€–π‘“π‘˜ (π‘₯) − π‘“π‘š (π‘₯)β€– + β€–π‘“π‘š (π‘₯) − 𝑓 (π‘₯)β€–
πœ€
< β€–π‘“π‘˜ (π‘₯) − π‘“π‘š (π‘₯)β€– +
4
πœ€
≤ 𝑑(π‘“π‘˜ , π‘“π‘š ) +
4
1
< πœ€.
2
Hence, for π‘˜ ≥ 𝑛 πœ€ ,
4
𝑑(π‘“π‘˜ , 𝑓 ) = sup β€–π‘“π‘˜ (π‘₯) − 𝑓 (π‘₯)β€–
π‘₯∈π‘Œ
1
πœ€
2
< πœ€.
≤
This proves that (π‘“π‘˜ ) converges in 𝐡(π‘Œ ). Since (π‘“π‘˜ ) was an arbitrary Cauchy sequence in 𝐡(π‘Œ ), it follows that
𝐡(π‘Œ ) is complete.
MATH2022
73
Introductory Analysis 2017 Lecture Manual
Definition 7.4. 1. A subset 𝐴 of a metric space (𝑋, 𝑑) is called open if for each π‘₯ ∈ 𝐴 there is πœ€ > 0 such that
𝐡(π‘₯, πœ€) ⊂ 𝐴, where 𝐡(π‘₯, πœ€) = {𝑦 ∈ 𝑋 ∢ 𝑑(𝑦, π‘₯) < πœ€} is the πœ€-ball with centre π‘₯.
2. A subset 𝐴 of a metric space (𝑋, 𝑑) is called closed if 𝑋 ⧡ 𝐴 is open.
Theorem 7.5. A subset 𝐴 of a metric space (𝑋, 𝑑) is closed if and only if for each sequence (π‘₯π‘˜ ) in 𝐴 which
converges to some π‘₯ ∈ 𝑋 it follows that π‘₯ ∈ 𝐴.
Proof. Let 𝐴 be closed and let (π‘₯π‘˜ ) be a sequence in 𝐴 which converges to some π‘₯ ∈ 𝑋. We must show π‘₯ ∈ 𝐴.
Assume that π‘₯ ∉ 𝐴. Since π‘₯ ∈ 𝑋 ⧡ 𝐴 and 𝑋 ⧡ 𝐴 is open, there is πœ€ > 0 such that 𝐡(π‘₯, πœ€) ⊂ 𝑋 ⧡ 𝐴. Since π‘₯π‘˜ ∈ 𝐴
for all π‘˜ ∈ β„•, it follows that 𝑑(π‘₯π‘˜ , π‘₯) ≥ πœ€, which contradicts π‘₯π‘˜ → π‘₯.
Conversely, if 𝐴 ⊂ 𝑋 is not closed, then there is π‘₯ ∈ 𝑋 ⧡ 𝐴 such that 𝐡(π‘₯, πœ€) ⊄ 𝑋 ⧡ 𝐴 for all πœ€ > 0. Therefore, for
positive integers π‘˜, there is π‘₯π‘˜ ∈ 𝐴 ∩ 𝐡(π‘₯, π‘˜1 ). Thus, for each πœ€ > 0 and π‘˜ ≥ 1πœ€ we have 𝑑(π‘₯π‘˜ , π‘₯) < π‘˜1 ≤ πœ€. Thus
we have found a sequence (π‘₯π‘˜ ) in 𝐴 which converges to some π‘₯ ∈ 𝑋 which does not belong to 𝐴.
Theorem 7.6. Let (𝑋, 𝑑) be a complete metric space and let 𝐴 ⊂ 𝑋. Then 𝐴 is complete if and only if 𝐴 is closed.
Proof. Assume that 𝐴 is complete and let (π‘₯π‘˜ ) be a sequence in 𝐴 which converges to some π‘₯ ∈ 𝑋. We must show
that π‘₯ ∈ 𝐴. But since (π‘₯π‘˜ ) is a convergent sequence in 𝑋, it is a Cauchy sequence in 𝑋 by Theorem 7.2, 2, and
therefore also in 𝐴. Since 𝐴 is complete, (π‘₯π‘˜ ) converges to some π‘Ž ∈ 𝐴, and therefore also in 𝑋. Since limits are
unique by Theorem 7.2, 1, it follows that π‘₯ = π‘Ž ∈ 𝐴. Hence, by Theorem 7.5, 𝐴 is closed.
Conversely, assume that 𝐴 is closed. Let (π‘₯π‘˜ ) be a Cauchy sequence in 𝐴. Then (π‘₯π‘˜ ) is a Cauchy sequence in 𝑋,
and since 𝑋 is complete, there is π‘₯ ∈ 𝑋 such that π‘₯π‘˜ → π‘₯. Since 𝐴 is closed, it follows that π‘₯ ∈ 𝐴. Hence each
Cauchy sequence in 𝐴 converges in 𝐴. It follows that 𝐴 is complete.
Definition 7.5. Let 𝐴 ⊂ ℝ and 𝑓 ∢ 𝐴 → 𝔽 𝑛 . Then 𝑓 is uniformly continuous if for each πœ€ > 0 there is 𝛿 > 0
such that for all π‘₯, 𝑦 ∈ 𝐴 (|π‘₯ − 𝑦| < 𝛿 ⇒ ‖𝑓 (π‘₯) − 𝑓 (𝑦)β€– < πœ€).
Remark 7.2.1. It is straightforward to show that a function 𝑓 ∢ [π‘Ž, 𝑏] → β„‚ is (uniformly) continuous if and only if
its real and imaginary part are (uniformly) continuous and that a function 𝑓 ∢ [π‘Ž, 𝑏] → 𝔽 𝑛 is (uniformly) continuous
if and only if all its component are (uniformly) continuous.
Theorem 7.7. Let π‘Ž, 𝑏 ∈ ℝ, π‘Ž < 𝑏, and let 𝑓 ∢ [π‘Ž, 𝑏] → 𝔽 𝑛 be continuous. Then 𝑓 is uniformly continuous.
Proof. Because of Remark 7.2.1 we only have to consider 𝑓 ∢ [π‘Ž, 𝑏] → ℝ. Let πœ€ > 0 and choose π‘Ž = π‘₯0 < π‘₯1 <
β‹― < π‘₯𝑁 = 𝑏 according to Lemma 6.13 with respect to 4πœ€ . Put
𝛿 = min{|π‘₯𝑗 − π‘₯𝑗−1 | ∢ 𝑗 = 1, … , 𝑁}.
Now let π‘₯, 𝑦 ∈ [π‘Ž, 𝑏] such that |π‘₯ − 𝑦| < 𝛿. We want to show |𝑓 (π‘₯) − 𝑓 (𝑦)| < πœ€. This is trivial if π‘₯ = 𝑦. Thus let
π‘₯ ≠ 𝑦, and we may assume π‘₯ < 𝑦.
There is 𝑗 ∈ {1, … , 𝑁} such that π‘₯ ∈ [π‘₯𝑗−1 , π‘₯𝑗 ]. Hence
π‘₯𝑗−1 ≤ π‘₯ < 𝑦 < π‘₯ + 𝛿 ≤ π‘₯𝑗 + 𝛿.
Case I: 𝑦 ≤ π‘₯𝑗 . Then 𝑦 ∈ [π‘₯𝑗−1 , π‘₯𝑗 ], and Lemma 6.13 gives
|𝑓 (π‘₯) − 𝑓 (𝑦)| ≤ |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| + |𝑓 (π‘₯𝑗−1 ) − 𝑓 (𝑦)| ≤
πœ€ πœ€
+ < πœ€.
4 4
Case II: 𝑦 > π‘₯𝑗 . Then 𝑗 < 𝑁, and π‘₯𝑗 < 𝑦 < π‘₯𝑗 + 𝛿 ≤ π‘₯𝑗+1 . Again by Lemma 6.13,
|𝑓 (π‘₯) − 𝑓 (𝑦)| ≤ |𝑓 (π‘₯) − 𝑓 (π‘₯𝑗−1 )| + |𝑓 (π‘₯𝑗−1 ) − 𝑓 (π‘₯𝑗 )| + |𝑓 (π‘₯𝑗 ) − 𝑓 (𝑦)| ≤
πœ€ πœ€ πœ€
+ + < πœ€.
4 4 4
For an interval 𝐼 we denote by 𝐢(𝐼, 𝔽 𝑛 ) the set of continuous functions from 𝐼 to 𝔽 𝑛 .
Theorem 7.8. Let π‘Ž, 𝑏 ∈ ℝ and 𝑛 ∈ ℕ⧡{0}. Then 𝐢([π‘Ž, 𝑏], 𝔽 𝑛 ) is a closed subspaces of 𝐡([π‘Ž, 𝑏], 𝔽 𝑛 ). In particular,
𝐢([π‘Ž, 𝑏], 𝔽 𝑛 ) is complete.
Proof. By Remark 7.2.1, it suffices to consider 𝐢([π‘Ž, 𝑏], ℝ). By Theorem 3.17, each continuous function on [π‘Ž, 𝑏]
is bounded, so that 𝐢([π‘Ž, 𝑏], ℝ) ⊂ 𝐡([π‘Ž, 𝑏], ℝ).
74
Introductory Analysis 2017 Lecture Manual
MATH2022
To show that 𝐢([π‘Ž, 𝑏], ℝ) is closed in 𝐡([π‘Ž, 𝑏], ℝ) let (π‘“π‘˜ ) be a sequence in 𝐢([π‘Ž, 𝑏], ℝ) which converges to some
𝑓 in 𝐡([π‘Ž, 𝑏], ℝ). We must show that 𝑓 is continuous.
πœ€
Let πœ€ > 0. Then there is π‘˜ ∈ β„• such that 𝑑(π‘“π‘˜ , 𝑓 ) < . Since π‘“π‘˜ is uniformly continuous, there is 𝛿 > 0 such that
3
πœ€
|π‘“π‘˜ (π‘₯) − π‘“π‘˜ (𝑦)| < for π‘₯, 𝑦 ∈ [π‘Ž, 𝑏], |π‘₯ − 𝑦| < 𝛿.
3
Then, for π‘₯, 𝑦 ∈ [π‘Ž, 𝑏] and |π‘₯ − 𝑦| < 𝛿,
|𝑓 (π‘₯) − 𝑓 (𝑦)| ≤ |𝑓 (π‘₯) − π‘“π‘˜ (π‘₯)| + |π‘“π‘˜ (π‘₯) − π‘“π‘˜ (𝑦)| + |π‘“π‘˜ (𝑦) − 𝑓 (𝑦)|
≤ 𝑑(𝑓 , π‘“π‘˜ ) + |π‘“π‘˜ (π‘₯) − π‘“π‘˜ (𝑦)| + 𝑑(π‘“π‘˜ , 𝑓 )
πœ€ πœ€ πœ€
< + +
3 3 3
= πœ€.
Thus 𝑓 is uniformly continuous.
By Theorem 7.6, 𝐢([π‘Ž, 𝑏], 𝔽 𝑛 ) is complete.
Tutorial 7.2.1. 1. Let (𝑋, 𝑑) be the metric space from Tutorial 7.1.1, 1. Show that (𝑋, 𝑑) is complete if and only
if range 𝑓 is closed in ℝ.
2. Show that the metric space (ℝ, 𝑑) from Tutorial 7.1.1, 2, is complete.
3. Show that ℝ is not complete with respect to the metric given in Tutorial 7.1.1, 2.
7.3
Banach’s Fixed Point Theorem
In this section, 𝑋 = (𝑋, 𝑑𝑋 ), π‘Œ = (π‘Œ , π‘‘π‘Œ ) and 𝑍 = (𝑍, 𝑑𝑍 ) are metric spaces.
Definition 7.6. Let 𝑇 ∢ 𝑋 → π‘Œ . 𝑇 is called bounded if there is a number 𝑀 ≥ 0 such that
π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦) ≤ 𝑀𝑑𝑋 (π‘₯, 𝑦) for all π‘₯, 𝑦 ∈ 𝑋.
If 𝑇 is bounded, then
‖𝑇 β€– = inf {𝑀 ≥ 0 ∢ ∀ π‘₯, 𝑦 ∈ 𝑋, π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦) ≤ 𝑀𝑑𝑋 (π‘₯, 𝑦)}
is called the norm of 𝑇 .
If 𝑇 is not bounded, we may write for convenience ‖𝑇 β€– = ∞.
Since it is clear that π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦) ≤ (‖𝑇 β€– + πœ€)𝑑𝑋 (π‘₯, 𝑦) for all πœ€ > 0, it follows that
∀ π‘₯, 𝑦 ∈ 𝑋, π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦) ≤ ‖𝑇 ‖𝑑𝑋 (π‘₯, 𝑦).
Proposition 7.9. Let 𝑇 ∢ 𝑋 → π‘Œ . Then
π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦)
.
π‘₯,𝑦∈𝑋 𝑑𝑋 (π‘₯, 𝑦)
‖𝑇 β€– = sup
π‘₯≠𝑦
Proof. Let
{
𝐴=
}
π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦)
∢ π‘₯, 𝑦 ∈ 𝑋, π‘₯ ≠ 𝑦 , 𝛼 = sup 𝐴.
𝑑𝑋 (π‘₯, 𝑦)
If 𝑇 is bounded, then ‖𝑇 β€– is an upper bound of 𝐴 and thus 𝛼 ≤ ‖𝑇 β€–.
𝑑 (𝑇 π‘₯, 𝑇 𝑦)
On the other hand, 𝛼 ≥ π‘Œ
for all π‘₯, 𝑦 ∈ 𝑋 with π‘₯ ≠ 𝑦, and thus π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦) ≤ 𝛼𝑑𝑋 (π‘₯, 𝑦) for all π‘₯, 𝑦 ∈ 𝑋.
𝑑𝑋 (π‘₯, 𝑦)
Hence 𝛼 ≥ ‖𝑇 β€–.
The case that 𝑇 is not bounded is left as an exercise.
Notation. 1. If 𝑇 ∢ 𝑋 → π‘Œ and 𝑆 ∢ π‘Œ → 𝑍, then we write 𝑆𝑇 = 𝑆◦𝑇 .
2. If 𝑇 ∢ 𝑋 → 𝑋, we define
𝑇 0 by 𝑇 0 π‘₯ = π‘₯, π‘₯ ∈ 𝑋,
𝑇 π‘˜+1 = 𝑇 𝑇 π‘˜ inductively for π‘˜ ∈ β„•.
MATH2022
Introductory Analysis 2017 Lecture Manual
75
Proposition 7.10. If 𝑇 ∢ 𝑋 → π‘Œ and 𝑆 ∢ π‘Œ → 𝑍 are bounded, then 𝑆𝑇 is bounded, and ‖𝑆𝑇 β€– ≤ ‖𝑆‖ ‖𝑇 β€–.
Proof. For all π‘₯, 𝑦 ∈ 𝑋 we have
𝑑𝑍 ((𝑆𝑇 )π‘₯, (𝑆𝑇 )𝑦) = 𝑑𝑍 (𝑆(𝑇 π‘₯), 𝑆(𝑇 𝑦)) ≤ β€–π‘†β€–π‘‘π‘Œ (𝑇 π‘₯, 𝑇 𝑦) ≤ ‖𝑆‖ ‖𝑇 ‖𝑑𝑋 (π‘₯, 𝑦).
Hence, by definition, 𝑆𝑇 is bounded and ‖𝑆𝑇 β€– ≤ ‖𝑆‖ ‖𝑇 β€–.
Definition 7.7. Let 𝑇 ∢ 𝑋 → 𝑋 be bounded.
1. 𝑇 is called a contraction if ‖𝑇 β€– < 1.
2. 𝑇 is called a generalized contraction if there is π‘š ∈ β„• such that ‖𝑇 π‘š β€– < 1.
Remark 7.3.1. Let 𝑇 ∢ 𝑋 → 𝑋 be bounded. Then 𝑇 is a generalized contraction if and only if
∞
∑
‖𝑇 𝑗 β€– < ∞.
𝑗=0
Proof. If the sum converges, then ‖𝑇 𝑗 β€– → 0 as 𝑗 → ∞, and therefore there is π‘š ∈ β„• such that ‖𝑇 π‘š β€– < 1.
Conversely, if ‖𝑇 π‘š β€– < 1, then we group the sum into blocks of π‘š terms, writing every natural number as a multiple
of π‘š plus its remainder, use
‖𝑇 π‘˜π‘š+𝑙 β€– = ‖𝑇 π‘˜π‘š 𝑇 𝑙 β€– ≤ β€–(𝑇 π‘š )π‘˜ β€– ‖𝑇 𝑙 β€– ≤ ‖𝑇 π‘š β€–π‘˜ ‖𝑇 𝑙 β€–
and get
∞
∑
‖𝑇 β€– =
𝑗
𝑗=0
∞ π‘š−1
∑
∑
‖𝑇 π‘˜π‘š+𝑙 β€–
π‘˜=0 𝑙=0
≤
∞ π‘š−1
∑
∑
‖𝑇 π‘š β€–π‘˜ ‖𝑇 𝑙 β€–
π‘˜=0 𝑙=0
=
(∞
∑
π‘˜=0
=
‖𝑇 π‘š β€–π‘˜
) (π‘š−1
∑
)
‖𝑇 𝑙 β€–
𝑙=0
π‘š−1
∑
1
‖𝑇 𝑙 β€– < ∞.
1 − ‖𝑇 π‘š β€– 𝑙=0
Theorem 7.11 (Banach’s Fixed Point Theorem). Let (𝑋, 𝑑) be a complete metric space and let 𝑇 ∢ 𝑋 → 𝑋 be a
generalized contraction. Then there is a unique π‘Ž ∈ 𝑋 such that 𝑇 π‘Ž = π‘Ž.
Proof. Since 𝑇 is a generalized contraction, there is π‘š ∈ β„• such that ‖𝑇 π‘š β€– < 1.
For uniqueness, let π‘Ž, 𝑏 ∈ 𝑋 such that 𝑇 π‘Ž = π‘Ž and 𝑇 𝑏 = 𝑏. By induction
𝑇 π‘˜ π‘Ž = 𝑇 𝑇 π‘˜−1 π‘Ž = 𝑇 π‘Ž = π‘Ž and 𝑇 π‘˜ 𝑏 = 𝑏,
so that
𝑑(π‘Ž, 𝑏) = 𝑑(𝑇 π‘˜ π‘Ž, 𝑇 π‘˜ 𝑏) ≤ ‖𝑇 π‘˜ ‖𝑑(π‘Ž, 𝑏),
and 𝑑(π‘Ž, 𝑏) ≠ 0 would give the contradiction 1 ≤ ‖𝑇 π‘š β€–. Hence 𝑑(π‘Ž, 𝑏) = 0, i. e., π‘Ž = 𝑏.
For existence let π‘₯0 ∈ 𝑋 and consider the sequence (𝑇 π‘˜ π‘₯0 ). Then, for 𝑗, π‘˜ ∈ β„•, π‘˜ > 𝑗,
𝑑(𝑇 π‘˜ π‘₯0 , 𝑇 𝑗 π‘₯0 ) ≤ 𝑑(𝑇 π‘˜ π‘₯0 , 𝑇 π‘˜−1 π‘₯0 ) + 𝑑(𝑇 π‘˜−1 π‘₯0 , 𝑇 π‘˜−2 π‘₯0 ) + β‹― + 𝑑(𝑇 𝑗+1 π‘₯0 , 𝑇 𝑗 π‘₯0 )
≤ ‖𝑇 π‘˜−1 β€– 𝑑(𝑇 π‘₯0 , π‘₯0 ) + ‖𝑇 π‘˜−2 β€– 𝑑(𝑇 π‘₯0 , π‘₯0 ) + β‹― + ‖𝑇 𝑗 β€– 𝑑(𝑇 π‘₯0 , π‘₯0 )
=
π‘˜−1
∑
𝑙=𝑗
‖𝑇 𝑙 β€– 𝑑(𝑇 π‘₯0 , π‘₯0 )
76
Introductory Analysis 2017 Lecture Manual
Since
∞
∑
MATH2022
‖𝑇 𝑙 β€– < ∞,
𝑙=0
it follows that (𝑇 π‘˜ π‘₯0 ) is a Cauchy sequence. Since 𝑋 is complete, this Cauchy sequence has a limit, say π‘Ž. Finally,
to show that 𝑇 π‘Ž = π‘Ž, let πœ€ > 0. Then there is 𝐾 such that for 𝑗 ≥ 𝐾,
𝑑(𝑇 𝑗 π‘₯0 , π‘Ž) <
πœ€
.
1 + ‖𝑇 β€–
For π‘˜ ≥ 𝐾 + 1 we get
𝑑(𝑇 π‘Ž, π‘Ž) ≤ 𝑑(𝑇 π‘Ž, 𝑇 π‘˜ π‘₯0 ) + 𝑑(𝑇 π‘˜ π‘₯0 , π‘Ž)
≤ ‖𝑇 ‖𝑑(π‘Ž, 𝑇 π‘˜−1 π‘₯0 ) + 𝑑(𝑇 π‘˜ π‘₯0 , π‘Ž)
πœ€
πœ€
+
< ‖𝑇 β€–
1 + ‖𝑇 β€– 1 + ‖𝑇 β€–
= πœ€.
Hence 𝑑(𝑇 π‘Ž, π‘Ž) = 0, i. e., 𝑇 π‘Ž = π‘Ž.
Tutorial 7.3.1. Let (𝑋, 𝑑) be a complete metric space and let 𝑇 be a generalized contraction on 𝑋. Let π‘š ∈ β„•∗
such that ‖𝑇 π‘š β€– < 1 and let π‘₯ be the fixed point of 𝑇 . Prove the following a priori estimate: For any π‘₯0 ∈ 𝑋,
𝑑(𝑇 π‘₯0 , π‘₯0 ) ∑ 𝑗
‖𝑇 β€–.
1 − ‖𝑇 π‘š β€– 𝑗=0
π‘š−1
𝑑(π‘₯, π‘₯0 ) ≤
Hint. Use that 𝑇 π‘š π‘₯ = π‘₯ and use a "telescoping" sum of distances involving the points 𝑇 π‘š π‘₯, 𝑇 π‘š π‘₯0 , 𝑇 π‘š−1 π‘₯0 , … ,
𝑇 π‘₯0 .
Tutorial 7.3.2. Newton Raphson method of finding zeros
Let 𝑓 ∢ [π‘Ž, 𝑏] → ℝ be twice continuously differentiable and assume there are 𝑐, 𝑑 ∈ [π‘Ž, 𝑏], 𝑐 < 𝑑, such that 𝑓 (𝑐)
and 𝑓 (𝑑) have opposite signs. By the Intermediate Value Theorem, 𝑓 has a zero 𝑦 in (𝑐, 𝑑). Also assume that
𝑓 ′ (π‘₯) ≠ 0 for all π‘₯ ∈ [𝑐, 𝑑]. Since 𝑓 ′ is continuous, 𝑓 ′ is either positive or negative by the Internmediate Value
Theorem, so that 𝑓 is strictly monotonic by the Mean Value Theorem. Hence the zero is unique. To find this zero
one can use the Newton Raphson method:
1. Let
𝑓 (π‘₯)
𝑇 π‘₯ = π‘₯ − ′ , π‘₯ ∈ [𝑐, 𝑑].
𝑓 (π‘₯)
Show that for all π‘₯1 , π‘₯2 ∈ [𝑐, 𝑑] with π‘₯1 ≠ π‘₯2 there is πœ‰ between π‘₯1 and π‘₯2 such that
𝑇 π‘₯1 − 𝑇 π‘₯2 =
𝑓 (πœ‰)𝑓 ′′ (πœ‰)
(π‘₯1 − π‘₯2 ) .
(𝑓 ′ (πœ‰))2
2. Show that ‖𝑇 β€– < 1 if 𝑐 and 𝑑 are sufficiently close to the zero 𝑦 of 𝑓 .
3. Show that one can choose 𝑐 and 𝑑 in such a way that 𝑇 ([𝑐, 𝑑]) ⊂ [𝑐, 𝑑].
Hint. You may show that 𝑇 ([𝑐, 𝑑]) ⊂ [𝑐, 𝑑] if 𝑑 − 𝑦 = 𝑦 − 𝑐, i. e., 𝑦 is the midpoint of the interval [𝑐, 𝑑]. However,
this is not very practical since you do not know 𝑦 (otherwise, the whole procedure would be pointless). The easiest
way out of this is to choose suitable values (by trial and error), 𝑐1 < 𝑑1 in [𝑐, 𝑑] such that 𝑓 (𝑐1 ) and 𝑓 (𝑐2 ) have
opposite sign, 𝑇 𝑐1 , 𝑇 𝑐2 ∈ [𝑐1 , 𝑐2 ], and 𝑓 𝑓 ′′ has no zeros on [𝑐1 , 𝑑1 ] ⧡ {𝑦}.
∗ 4. Assume that statement 2 holds and define
𝑐1 =
𝑐 + ‖𝑇 ‖𝑑
,
1 + ‖𝑇 β€–
𝑑1 =
𝑑 + ‖𝑇 ‖𝑐
.
1 + ‖𝑇 β€–
Show that 𝑐 < 𝑐1 < 𝑦 < 𝑑1 < 𝑑 if 𝑓 (𝑐1 ) and 𝑓 (𝑑1 ) have opposite signs. Then show that 𝑇 ([𝑐, 𝑑]) ⊂ [𝑐, 𝑑].
Hence show that for any π‘₯ ∈ [𝑐, 𝑑], 𝑇 π‘˜ π‘₯ → 𝑦 as π‘˜ → ∞.
MATH2022
Introductory Analysis 2017 Lecture Manual
77
√
√
√
Tutorial 7.3.3. Let 𝑐 > 0. Show that 𝑐 + 𝑐 + 𝑐 + … converges by using the following proof.
[
)
[
)
√
(a) Let 𝑇𝑐 π‘₯ = 𝑐 + π‘₯, π‘₯ ∈ [0, ∞). Show that 𝑇𝑐 π‘₯ > π‘₯ for π‘₯ ∈ [0, 1] and that 𝑇𝑐 ∢ 14 , ∞ → 14 , ∞ is a
contraction.
Show that if π‘₯π‘˜ ≤ 1 for all π‘˜ ∈ β„•, then (π‘₯π‘˜ ) is (strictly) increasing. (Actually, this never happens.)
(b) Let π‘₯0 = 𝑐, π‘₯π‘˜+1 = 𝑇𝑐 π‘₯π‘˜ (π‘˜ ∈ β„•).
(c) Show that (π‘₯π‘˜ ) converges as π‘˜ → ∞.
(d) Find lim π‘₯π‘˜ .
π‘˜→∞
7.4
Existence of solutions of differential equations
Let 𝐼 ⊂ ℝ be an interval, let π‘ˆ ⊂ 𝔽 𝑛 and let 𝑓 ∢ 𝐼 × π‘ˆ → 𝔽 𝑛 be continuous. Then
𝑦′ = 𝑓 (π‘₯, 𝑦)
(7.4.1)
is called a first-order system of differential equations, and any continuously differentiable function 𝑦 ∢ 𝐼 → π‘ˆ
satisfying (7.4.1) is called a solution of (7.4.1).
Let π‘Ž ∈ 𝐼 and 𝑏 ∈ 𝔽 𝑛 . Then a solution of (7.4.1) satisfying
𝑦(π‘Ž) = 𝑏
(7.4.2)
is called a solution of the initial value problem (IVP) (7.4.1), (7.4.2).
From the Fundamental Theorem of Calculus, Theorem 6.15, we obtain
Lemma 7.12. 𝑦 is a a solution of the IVP (7.4.1), (7.4.2) if and only if 𝑦 ∈ 𝐢(𝐼, 𝔽 𝑛 ), 𝑦(𝐼) ⊂ π‘ˆ , and
π‘₯
𝑦(π‘₯) = 𝑏 +
∫
𝑓 (𝑑, 𝑦(𝑑)) 𝑑𝑑
(π‘₯ ∈ 𝐼).
(7.4.3)
π‘Ž
Proof. We only have to note that we can apply the Fundamental Theorem of Calculus componentwise and to real
and imaginary parts separately. Also, if 𝑦 ∈ 𝐢(𝐼, 𝔽 𝑛 ) satisfies (7.4.3), then 𝑓 is differentiable.
We are now going to show that under certain conditions, (7.4.3) and thus the IVP (7.4.1), (7.4.2) has a unique
solution. We write 𝐢 1 (𝐼, 𝔽 𝑛 ) for the set of all continuously differentiable functions from 𝐼 to 𝔽 𝑛 .
78
Introductory Analysis 2017 Lecture Manual
MATH2022
Theorem 7.13. We make the following assumptions:
(i) Let 𝐼 be a closed bounded interval, π‘Ž ∈ 𝐼, 𝐴 = max |π‘₯ − π‘Ž|, 𝑏 ∈ 𝔽 𝑛 , 0 < 𝐡 ≤ ∞,
π‘₯∈𝐼
π‘ˆ = {𝑦 ∈ 𝔽 𝑛 ∢ ‖𝑦 − 𝑏‖ ≤ 𝐡}, 𝑅 = 𝐼 × π‘ˆ .
(ii) Let 𝑓 ∢ 𝑅 → 𝔽 𝑛 be continuous and let 𝑀 = sup ‖𝑓 (π‘₯, 𝑦)β€–.
(π‘₯,𝑦)∈𝑅
(iii) ∃ 𝐿 > 0 ∀ (π‘₯, 𝑦1 ), (π‘₯, 𝑦2 ) ∈ 𝑅 ‖𝑓 (π‘₯, 𝑦1 ) − 𝑓 (π‘₯, 𝑦2 )β€– ≤ 𝐿‖𝑦1 − 𝑦2 β€–.
(This is called a Lipschitz condition.)
(iv) 𝐴𝑀 ≤ 𝐡.
Then there is exactly one 𝑦 ∈ 𝐢 1 (𝐼, 𝔽 𝑛 ) such that
𝑦(π‘Ž) = 𝑏,
(π‘₯, 𝑦(π‘₯)) ∈ 𝑅 (π‘₯ ∈ 𝐼),
𝑦′ (π‘₯) = 𝑓 (π‘₯, 𝑦(π‘₯))
(π‘₯ ∈ 𝐼).
Proof. Since π‘ˆ is a closed subset of 𝔽 𝑛 , 𝐢(𝐼, π‘ˆ ) is a complete metric space. For 𝑦 ∈ 𝐢(𝐼, π‘ˆ ) define
π‘₯
(𝑇 𝑦)(π‘₯) = 𝑏 +
𝑓 (𝑑, 𝑦(𝑑)) 𝑑𝑑
∫
(π‘₯ ∈ 𝐼).
(7.4.4)
π‘Ž
By Lemma 7.12, the theorem is proved if we show that 𝑇 maps 𝐢(𝐼, π‘ˆ ) into 𝐢(𝐼, π‘ˆ ) and that 𝑇 has exactly one
fixed point.
Thus let 𝑦 ∈ 𝐢(𝐼, π‘ˆ ) and π‘₯ ∈ 𝐼. Clearly, 𝑇 𝑦 ∈ 𝐢(𝐼, 𝔽 𝑛 ) and
β€–
β€– π‘₯
β€–
β€–
β€–
β€–
β€–(𝑇 𝑦)(π‘₯) − 𝑏‖ = β€– 𝑓 (𝑑, 𝑦(𝑑)) 𝑑𝑑‖
β€–
β€–∫
β€–
β€–π‘Ž
β€–
β€–
| π‘₯
|
|
|
|
|
≤ | ‖𝑓 (𝑑, 𝑦(𝑑))β€– 𝑑𝑑|
|∫
|
|π‘Ž
|
|
|
| π‘₯
|
|
|
|
|
≤ | 𝑀 𝑑𝑑|
|∫
|
|π‘Ž
|
|
|
= |π‘₯ − π‘Ž|𝑀
≤ 𝐴𝑀
≤ 𝐡.
Thus we have shown that 𝑇 𝑦 ∈ 𝐢(𝐼, π‘ˆ ).
Next we are going to show that 𝑇 is a generalized contraction by proving the following estimate: For all 𝑛 ∈ β„•,
𝑦1 , 𝑦2 ∈ 𝐢(𝐼, π‘ˆ ) and π‘₯ ∈ 𝐼,
β€– π‘˜
β€– πΏπ‘˜ |π‘₯ − π‘Ž|π‘˜
𝑑(𝑦1 , 𝑦2 ).
β€–(𝑇 𝑦1 )(π‘₯) − (𝑇 π‘˜ 𝑦2 )(π‘₯)β€– ≤
β€–
β€–
π‘˜!
The induction base π‘˜ = 0 is trivial since
‖𝑦1 (π‘₯) − 𝑦2 (π‘₯)β€– ≤ sup ‖𝑦1 (𝑑) − 𝑦2 (𝑑)β€– = 𝑑(𝑦1 , 𝑦2 ).
𝑑∈𝐼
(7.4.5)
MATH2022
Introductory Analysis 2017 Lecture Manual
79
For the induction step assume that (7.4.5) is true for π‘˜. Then
β€– π‘₯
β€–
β€–
] β€–
β€– π‘˜+1
β€– β€– [
β€–
π‘˜+1
π‘˜
π‘˜
𝑦1 )(π‘₯) − (𝑇
𝑦2 )(π‘₯)β€– = β€–
𝑓 (𝑑, (𝑇 𝑦1 )(𝑑)) − 𝑓 (𝑑, (𝑇 𝑦2 )(𝑑)) 𝑑𝑑‖
β€–(𝑇
β€–
β€– β€–∫
β€–
β€–π‘Ž
β€–
β€–
β€–
π‘₯
|
|
|
|
| β€–
β€– |
π‘˜
π‘˜
≤ | ‖𝑓 (𝑑, (𝑇 𝑦1 )(𝑑)) − 𝑓 (𝑑, (𝑇 𝑦2 )(𝑑))β€– 𝑑𝑑|
|∫ β€–
β€– |
|π‘Ž
|
|
|
| π‘₯
|
|
|
|
β€–
β€– |
≤ | 𝐿 β€–(𝑇 π‘˜ 𝑦1 )(𝑑) − (𝑇 π‘˜ 𝑦2 )(𝑑)β€– 𝑑𝑑| (by assumption (iii))
|∫
β€–
β€– |
|π‘Ž
|
|
|
|
| π‘₯
|
πΏπ‘˜ |π‘₯ − π‘Ž|π‘˜ ||
|
≤| 𝐿
𝑑𝑑| 𝑑(𝑦1 , 𝑦2 ) (by induction hypothesis)
|
|∫
π‘˜!
|π‘Ž
|
|
|
π‘˜+1
π‘˜+1
𝐿 |π‘₯ − π‘Ž|
𝑑(𝑦1 , 𝑦2 ).
=
(π‘˜ + 1)!
Taking the maximum over all π‘₯ ∈ 𝐼 in (7.4.5) gives
𝑑(𝑇 π‘˜ 𝑦1 , 𝑇 π‘˜ 𝑦2 ) ≤
and therefore
‖𝑇 π‘˜ β€– ≤
which leads to
∞
∑
‖𝑇 π‘˜ β€– ≤
πΏπ‘˜ π΄π‘˜
,
π‘˜!
∞
∑
(𝐿𝐴)π‘˜
π‘˜=0
π‘˜=0
πΏπ‘˜ π΄π‘˜
𝑑(𝑦1 , 𝑦2 ),
π‘˜!
π‘˜!
= 𝑒𝐿𝐴 < ∞.
Hence 𝑇 is a generalized contraction on 𝐢(𝐼, π‘ˆ ). An application of Banach’s Fixed Point Theorem completes the
proof.
Worked Example 7.4.1. Consider the initial value problem 𝑦′ = 1 + 𝑦2 , 𝑦(0) = 0.
a) Find the largest 𝑐 > 0 such that the assumptions of Theorem 7.12 are satisfied for 𝐼 = [0, 𝑐].
b) Find a maximal interval on which a solution exists.
Solution. a) We have 𝑓 (π‘₯, 𝑦) = 1 + 𝑦2 . Hence
𝑓 (π‘₯, 𝑦1 ) − 𝑓 (π‘₯, 𝑦2 ) = 𝑦21 − 𝑦22 = (𝑦1 + 𝑦2 )(𝑦1 − 𝑦2 ),
so that
|𝑓 (π‘₯, 𝑦1 ) − 𝑓 (π‘₯, 𝑦2 )| = |𝑦1 + 𝑦2 | |𝑦1 − 𝑦2 |.
Hence the Lipschitz condition is satisfied if and only if
sup
|𝑦1 |,|𝑦2 |≤𝐡
|𝑦1 + 𝑦2 | < ∞,
which is satisfied if and only if 𝐡 ∈ ℝ, i. e., 𝐡 < ∞.
We have 𝐴 = 𝑐 and
𝑀 = sup |1 + 𝑦2 | = 1 + 𝐡 2 .
|𝑦|≤𝐡
We must satisfy 𝐴𝑀 ≤ 𝐡, i. e.,
𝑐(1 + 𝐡 2 ) ≤ 𝐡
But
or 𝑐 ≤
𝐡
.
1 + 𝐡2
(1 + 𝐡 2 ) − (1 − 𝐡)2
(1 − 𝐡)2
2𝐡
=
=1−
1 + 𝐡2
1 + 𝐡2
1 + 𝐡2
80
Introductory Analysis 2017 Lecture Manual
MATH2022
1
.
2
)
(
πœ‹ πœ‹
.
b) The solution of the IVP is 𝑦(π‘₯) = tan π‘₯. So the maximal interval where a solution exists is − ,
2 2
Let 𝑀𝑛 (𝔽 ) be the set of 𝑛 × π‘› matrices with values in 𝔽 . Then the norm ‖𝑀‖ of 𝑀 ∈ 𝑀𝑛 (𝔽 ) is the norm of
𝑀 ∢ 𝔽 𝑛 → 𝔽 𝑛 . It is easy to see that
has a maximum of 1 for 𝐡 = 1, so that the optimal 𝑐 is 𝑐 =
𝑛
max |π‘šπ‘–π‘— | ≤ ‖𝑀‖ ≤
𝑖,𝑗=1
𝑛
∑
|π‘šπ‘–π‘— |,
where 𝑀 = (π‘šπ‘–π‘— )𝑛𝑖,𝑗=1 .
𝑖,𝑗=1
Corollary 7.14. Let 𝐼 be an interval, π‘Ž ∈ 𝐼, 𝑏 ∈ 𝔽 𝑛 , and consider the linear system of differential equations
𝑦′ = 𝐹 (π‘₯)𝑦 + 𝑔(π‘₯),
where 𝐹 ∢ 𝐼 → 𝑀𝑛 (𝔽 ) and 𝑔 ∢ 𝐼 → 𝔽 𝑛 are continuous.
Then the initial value problem
𝑦′ = 𝐹 (π‘₯)𝑦 + 𝑔(π‘₯),
has a unique solution 𝑦 ∈ 𝐢 1 (𝐼, 𝔽 𝑛 ).
Proof. First assume that 𝐼 = [𝑐, 𝑑]. Then, for 𝑦1 , 𝑦2 ∈ 𝔽 𝑛 , π‘₯ ∈ 𝐼,
β€–(𝐹 (π‘₯)𝑦1 + 𝑔(π‘₯)) − (𝐹 (π‘₯)𝑦2 − 𝑔(π‘₯))β€– ≤ ‖𝐹 (π‘₯)β€– ‖𝑦1 − 𝑦2 β€–
(
)
≤ max ‖𝐹 (π‘₯)β€– ‖𝑦1 − 𝑦2 β€–.
π‘₯∈𝐼
Hence the Lipschitz condition is satisfied with 𝐡 = ∞. Now apply Theorem 7.13.
If 𝐼 is arbitrary, we can choose a decreasing sequence (π‘π‘˜ ) and an increasing sequence (π‘‘π‘˜ ) such that 𝑐1 ≤ π‘Ž ≤ 𝑑1
and
∞
⋃
𝐼=
[π‘π‘˜ , π‘‘π‘˜ ].
π‘˜=1
[If, e. g., the right endpoint 𝑑 of 𝐼 belongs to 𝐼, choose π‘‘π‘˜ = 𝑑.]
By the first part of the proof, the IVP has a unique solution π‘¦π‘˜ on [π‘π‘˜ , π‘‘π‘˜ ]. Therefore, π‘¦π‘˜+π‘š (π‘₯) = π‘¦π‘˜ (π‘₯) for all π‘š > 0
and π‘₯ ∈ [π‘π‘˜ , π‘‘π‘˜ ], so that 𝑦(π‘₯) = π‘¦π‘˜ (π‘₯) for π‘₯ ∈ [π‘π‘˜ , π‘‘π‘˜ ] is well-defined and solves the IVP. Because this solution is
unique on [π‘π‘˜ , π‘‘π‘˜ ] for each π‘˜, 𝑦 is unique.
For πœ‚ (𝑛) = 𝑓 (π‘₯, πœ‚, πœ‚ ′ , … , πœ‚ (𝑛−1) ) define
βŽ› πœ‚ ⎞
⎜ πœ‚′ ⎟
𝑦 = ⎜ . ⎟,
⎜ .. ⎟
⎜
⎟
⎝ πœ‚ (𝑛−1) ⎠
βŽ› 𝑦2 ⎞
⎜ .. ⎟
𝐹 (π‘₯, 𝑦) = ⎜ . ⎟.
⎜ 𝑦
⎟
𝑛
⎜
⎟
⎝ 𝑓 (π‘₯, 𝑦) ⎠
Then
πœ‚ (𝑛) = 𝑓 (π‘₯, πœ‚, πœ‚ ′ , … , πœ‚ (𝑛−1) ) ⇔ 𝑦′ = 𝐹 (π‘₯, 𝑦).
Let 𝑏0 , … , 𝑏𝑛−1 ∈ 𝔽 . Then
where
πœ‚ (𝑗) (π‘Ž) = 𝑏𝑗 (𝑗 = 0, … , 𝑛 − 1) ⇔ 𝑦(π‘Ž) = 𝑏,
βŽ› 𝑏0 ⎞
⎜
⎟
𝑏 = ⎜ ... ⎟.
⎜
⎟
⎝ 𝑏𝑛−1 ⎠
Clearly,
‖𝐹 (π‘₯, 𝑦) − 𝐹 (π‘₯, 𝑧)β€– ≤ (𝑛 − 1)‖𝑦 − 𝑧‖ + |𝑓 (π‘₯, 𝑦) − 𝑓 (π‘₯, 𝑧)|
and
|𝑓 (π‘₯, 𝑦) − 𝑓 (π‘₯, 𝑧)| ≤ ‖𝐹 (π‘₯, 𝑦) − 𝐹 (π‘₯, 𝑧)β€–,
so that 𝐹 satisfies a Lipschitz condition if and only if 𝑓 satisfies a Lipschitz condition.
MATH2022
Introductory Analysis 2017 Lecture Manual
81
Corollary 7.15. Let 𝐼 be a bounded closed interval, π‘Ž ∈ 𝐼, 𝑏0 , … , 𝑏𝑛−1 ∈ 𝔽 , 0 < 𝐡 ≤ ∞.
Let π‘ˆ = {𝑦 ∈ 𝔽 𝑛 ∢ ‖𝑦 − 𝑏‖ ≤ 𝐡}, 𝑅 = 𝐼 × π‘ˆ . Let 𝑓 ∢ 𝑅 → 𝔽 be continuous and assume there is 𝐿 > 0 such that
|𝑓 (π‘₯, 𝑦) − 𝑓 (π‘₯, 𝑧)| ≤ 𝐿‖𝑦 − 𝑧‖ for all π‘₯ ∈ 𝐼 and 𝑦, 𝑧 ∈ π‘ˆ . Then there is 𝐴 > 0 such that the initial value problem
πœ‚ (𝑛) = 𝑓 (π‘₯, πœ‚, πœ‚ ′ , … , πœ‚ (𝑛−1) ),
πœ‚ (𝑗) (π‘Ž) = 𝑏𝑗 (𝑗 = 0, … , 𝑛 − 1)
has a unique solution on 𝐼 ∩ [π‘Ž − 𝐴, π‘Ž + 𝐴].
Note. According to Theorem 7.13 one uses 𝐴𝑀 ≤ 𝐡 where 𝑀 is the supremum of 𝐹 on 𝑅.
Finally, a combination of Corollaries 7.14 and 7.15 gives
Corollary 7.16. Let 𝐼 be an interval, π‘Ž ∈ 𝐼, 𝑏0 , … , 𝑏𝑛−1 ∈ 𝔽 , and consider the linear differential equation
πœ‚ (𝑛) = 𝑓𝑛−1 (π‘₯)πœ‚ (𝑛−1) + 𝑓𝑛−2 (π‘₯)𝑦(𝑛−2) + … 𝑓1 (π‘₯)𝑦′ + 𝑓0 (π‘₯)𝑦 + 𝑔(π‘₯),
with initial conditions
𝑦(𝑗) (π‘Ž) = 𝑏𝑗 ,
(𝑗 = 0, … , 𝑛 − 1),
where 𝑓𝑗 ∢ 𝐼 → 𝔽 (𝑗 = 0, … , 𝑛 − 1) and 𝑔 ∢ 𝐼 → 𝔽 are continuous.
Then the initial value problem has a unique solution πœ‚ ∈ 𝐢 𝑛 (𝐼, 𝔽 ).
Tutorial 7.4.1. 1. Let 𝑛 ∈ β„•∗ and 𝑀 ∈ 𝑀𝑛 (𝔽 ). Prove that
𝑛
max |π‘šπ‘–π‘— | ≤ ‖𝑀‖ ≤
𝑖,𝑗=1
𝑛
∑
|π‘šπ‘–π‘— |,
where 𝑀 = (π‘šπ‘–π‘— )𝑛𝑖,𝑗=1 .
𝑖,𝑗=1
2. Let 𝑀1 , 𝑀2 ∈ 𝑀𝑛 (𝔽 ).
a) Prove that ‖𝑀1 + 𝑀2 β€– ≤ ‖𝑀1 β€– + ‖𝑀2 β€–.
b) Let 𝑑(𝑀1 , 𝑀2 ) = ‖𝑀1 − 𝑀2 β€–. Prove that 𝑑 is a metric on 𝑀𝑛 (𝔽 ).
3. Prove that 𝑀𝑛 (𝔽 ) is complete.
Power Series and the Exponential Function in 𝑀𝑛 (𝔽 )
7.5
In the following let 𝐡 be either 𝔽 𝑛 or 𝑀𝑛 (𝔽 ). We know that 𝐡 is complete, see Theorem 7.3 and Tutorial 7.4.1, 3.
Definition 7.8. Let (π‘Žπ‘— ) be a sequence in 𝐡. Then
∞
∑
π‘Žπ‘— is called a series in 𝐡.
𝑗=0
The series is said to converge in 𝐡 if the sequence of its partial sums
π‘˜
∑
π‘Žπ‘— converges in 𝐡 as π‘˜ → ∞.
𝑗=0
The series is said to converge absolutely if
∞
∑
β€–π‘Žπ‘— β€– < ∞.
𝑗=0
Since 𝐡 is complete, it follows as in Theorems 5.5 and 5.8:
Theorem 7.17. Let
∞
∑
π‘Žπ‘— be a series in 𝐡.
𝑗=0
1. The series
∞
∑
π‘Žπ‘— converges in 𝐡 if and only if for all πœ€ > 0 there is 𝐾 ∈ ℝ such that for all π‘˜ ≥ π‘š ≥ 𝐾,
𝑗=0
β€–∑
β€–
β€– π‘˜
β€–
β€–
π‘Žπ‘— β€–
β€–
β€– < ∞.
‖𝑗=π‘š β€–
β€–
β€–
2. If
∞
∑
𝑗=0
π‘Žπ‘— converges absolutely, then it converges in 𝐡.
82
Introductory Analysis 2017 Lecture Manual
Theorem 7.18. Let
∞
∑
MATH2022
√
π‘Žπ‘— be a series in 𝐡, assume that 𝜌 = lim sup 𝑗 β€–π‘Žπ‘— β€– < ∞ and put 𝑅 = 𝜌1 . Let π‘Ž, π‘₯ ∈ ℝ.
𝑗→∞
𝑗=0
Then
1. The series
∞
∑
π‘Žπ‘— (π‘₯ − π‘Ž)𝑗
𝑗=0
converges absolutely if |π‘₯ − π‘Ž| < 𝑅, and it diverges if |π‘₯ − π‘Ž| > 𝑅 in case 𝑅 < ∞.
∞
∑
π‘Žπ‘— (π‘₯ − π‘Ž)𝑗 is differentiable for π‘₯ ∈ 𝐼, and
2. Let 𝐼 = ℝ if 𝑅 = ∞ and 𝐼 = (π‘Ž − 𝑅, π‘Ž + 𝑅) if 𝑅 ∈ ℝ. The series
𝑗=0
the derivative is represented by the absolutely convergent series
∞
∑
π‘—π‘Žπ‘— (π‘₯ − π‘Ž)𝑗−1
𝑗=1
Proof. The proof of part 1 is similar to that of Theorem ??.
For the proof of part 2 define for π‘₯ ∈ 𝐼:
𝑓 (π‘₯) =
𝑔(π‘₯) =
∞
∑
𝑗=0
∞
∑
π‘Žπ‘— (π‘₯ − π‘Ž)𝑗 ,
π‘—π‘Žπ‘— (π‘₯ − π‘Ž)𝑗−1 .
𝑗=1
It is straightforward to show that
√
√
√
√
√
lim sup 𝑗 β€–(𝑗 + 1)π‘Žπ‘—+1 β€– = lim sup 𝑗 β€–π‘—π‘Žπ‘— β€– = lim 𝑗 ‖𝑗‖ lim sup 𝑗 β€–π‘Žπ‘— β€– = lim sup 𝑗 β€–π‘Žπ‘— β€– = 𝜌,
𝑗→∞
𝑗→∞
𝑗→∞
𝑗→∞
𝑗→∞
so that 𝑔 is an absolutely convergent series.
Now let π‘₯ ∈ 𝐼 and choose β„Ž0 > 0 such that [π‘₯ − β„Ž0 , π‘₯ + β„Ž0 ] ⊂ 𝐼. Then π‘Ÿ = max{|π‘₯ − β„Ž0 − π‘Ž|, |π‘₯ + β„Ž0 − π‘Ž|} < 𝑅.
Let β„Ž ∈ ℝ with 0 < |β„Ž| < β„Ž0 . Since 𝑓 (π‘₯), 𝑓 (π‘₯ + β„Ž) and 𝑔(π‘₯) are represented by (absolutely) convergent series,
we have
𝑓 (π‘₯ + β„Ž) − 𝑓 (π‘₯) − 𝑔(π‘₯)β„Ž = lim
π‘˜→∞
= lim
π‘˜→∞
π‘˜
∑
[
]
π‘Žπ‘— (π‘₯ + β„Ž − π‘Ž)𝑗 − (π‘₯ − π‘Ž)𝑗 − 𝑗(π‘₯ − π‘Ž)𝑗−1 β„Ž
𝑗=1
π‘˜
∑
[
]
π‘Žπ‘— (π‘₯ + β„Ž − π‘Ž)𝑗 − (π‘₯ − π‘Ž)𝑗 − 𝑗(π‘₯ − π‘Ž)𝑗−1 β„Ž .
𝑗=2
By the Mean Value Theorem, for each 𝑗 ≥ 2 there is β„Žπ‘— ∈ ℝ with 0 < |β„Žπ‘— | < |β„Ž| such that
(π‘₯ + β„Ž − π‘Ž)𝑗 − (π‘₯ − π‘Ž)𝑗 = 𝑗(π‘₯ + β„Žπ‘— − π‘Ž)𝑗−1 β„Ž.
Another application of the MVT gives Μ‚
β„Žπ‘— ∈ ℝ with 0 < |Μ‚
β„Žπ‘— | < |β„Žπ‘— | such that
[
]
(π‘₯ + β„Ž − π‘Ž)𝑗 − (π‘₯ − π‘Ž)𝑗 − 𝑗(π‘₯ − π‘Ž)𝑗−1 β„Ž = π‘—β„Ž (π‘₯ + β„Žπ‘— − π‘Ž)𝑗−1 − (π‘₯ − π‘Ž)𝑗−1
= 𝑗(𝑗 − 1)β„Ž(π‘₯ + Μ‚
β„Žπ‘— − π‘Ž)𝑗−2 β„Žπ‘— ,
so that
β€–
β€–∑
π‘˜
β€– π‘˜
[
]β€– ∑
β€– π‘Ž (π‘₯ + β„Ž − π‘Ž)𝑗 − (π‘₯ − π‘Ž)𝑗 − 𝑗(π‘₯ − π‘Ž)𝑗−1 β„Ž β€– ≤
𝑗(𝑗 − 1)|β„Ž| |β„Žπ‘— | |π‘₯ + Μ‚
β„Žπ‘— − π‘Ž|𝑗−2 β€–π‘Žπ‘— β€–
𝑗
β€–
β€–
β€– 𝑗=2
β€– 𝑗=2
β€–
β€–
π‘˜
∑
≤ β„Ž2
𝑗(𝑗 − 1)π‘Ÿπ‘—−2 β€–π‘Žπ‘— β€–.
𝑗=2
MATH2022
Introductory Analysis 2017 Lecture Manual
83
With a reasoning as above, we have
√
lim sup 𝑗 𝑗(𝑗 − 1)π‘Ÿπ‘—−2 β€–π‘Žπ‘— β€– = π‘ŸπœŒ < 1.
𝑗→∞
By the Root Test,
𝑐=
∞
∑
𝑗(𝑗 − 1)π‘Ÿπ‘—−2 β€–π‘Žπ‘— β€– < ∞,
𝑗=2
and therefore
‖𝑓 (π‘₯ + β„Ž) − 𝑓 (π‘₯) − 𝑔(π‘₯)β„Žβ€– ≤ β„Ž2 𝑐,
which proves that 𝑓 is differentiable at π‘₯ with
𝑓 ′ (π‘₯) = 𝑔(π‘₯).
Theorem 7.19. Let 𝐴 ∈ 𝑀𝑛 (𝔽 ). Then:
1. The matrix exponential
exp(𝐴) =
∞
∑
1
𝑗!
𝑗=0
𝐴𝑗
converges absolutely in 𝑀𝑛 (𝔽 ).
2. exp(𝐴) is invertible and (exp(𝐴))−1 = exp(−𝐴).
3. If 𝐴, 𝐢 ∈ 𝑀𝑛 (𝔽 ) commute, then
exp(𝐴 + 𝐢) = exp(𝐴) exp(𝐢).
4. π‘₯ ξ‚Ά→ exp(𝐴π‘₯) is differentiable on ℝ with
𝑑
exp(𝐴π‘₯) = 𝐴 exp(𝐴π‘₯) = exp(𝐴π‘₯)𝐴.
𝑑π‘₯
Proof. 1. We calculate
√
√
√
β€–1 β€–
𝑗 β€– ≤ lim sup 𝑗 1 ‖𝐴‖𝑗 = ‖𝐴‖ lim sup 𝑗 1
lim sup 𝑗 β€–
𝐴
β€– 𝑗! β€–
𝑗!
𝑗!
𝑗→∞
𝑗→∞
𝑗→∞
β€–
β€–
𝑗!
1
= ‖𝐴‖ lim
= ‖𝐴‖ lim
= 0,
𝑗→∞ (𝑗 + 1)!
𝑗→∞ 𝑗 + 1
and exp(𝐴π‘₯) converges for all π‘₯ ∈ ℝ by Theorem 7.18, 1.
4. The differentiability follows from Theorem 7.18, 2, with
∞
∑
𝑗 𝑗 𝑗−1 ∑ 1 𝑗+1 𝑗
𝑑
exp(𝐴π‘₯) =
𝐴π‘₯
=
𝐴 π‘₯
𝑑π‘₯
𝑗!
𝑗!
𝑗=1
𝑗=0
=𝐴
Similarly,
𝑑
exp(𝐴π‘₯) =
𝑑π‘₯
∞
∑
1
𝑗!
𝑗=0
(∞
∑ 1
𝑗!
𝑗=0
𝐴𝑗 π‘₯𝑗 = 𝐴 exp(𝐴).
)
𝐴𝑗 π‘₯𝑗
𝐴 = exp(𝐴)𝐴.
2 and 3. A direct proof is quite cumbersome. Instead, we consider the matrix function
𝐹 (π‘₯) = exp((𝐴 + 𝐢)π‘₯) exp(−𝐢π‘₯) exp(−𝐴π‘₯)
and observe that exp(𝐴 + 𝐢), exp(𝐴) and exp(𝐢) commute with 𝐴 and 𝐢 since 𝐴 + 𝐢, 𝐴 and 𝐢 commute with 𝐴
and 𝐢. The product rule for derivatives and commutativity lead to
𝐹 ′ (π‘₯) = exp((𝐴 + 𝐢)π‘₯)(𝐴 + 𝐢) exp(−𝐢π‘₯) exp(−𝐴π‘₯)
− exp((𝐴 + 𝐢)π‘₯)𝐢 exp(−𝐢π‘₯) exp(−𝐴π‘₯) − exp((𝐴 + 𝐢)π‘₯) exp(−𝐢π‘₯)𝐴 exp(−𝐴π‘₯) = 0.
84
Introductory Analysis 2017 Lecture Manual
Since 𝐹 (0) = (exp(0))3 = 𝐼𝑛3 = 𝐼𝑛 , where 𝐼𝑛 is the 𝑛 × π‘› identity matrix, and since
of Corollary 7.16 gives 𝐹 (π‘₯) = 𝐼𝑛 for all π‘₯ ∈ ℝ.
Putting 𝐢 = 0 and π‘₯ = 1, we get
MATH2022
𝑑
𝐼 = 0, the uniqueness part
𝑑π‘₯ 𝑛
exp(𝐴) exp(−𝐴) = 𝐼𝑛 = exp(−𝐴) exp(𝐴),
which proves 2. Finally, with 2,
exp(𝐴 + 𝐢) = 𝐹 (1) exp(𝐴) exp(𝐢) = exp(𝐴) exp(𝐢).
How can one find exp(𝐴π‘₯)? Recall from Linear Algebra that πœ’(πœ†) = det(𝐴 − πœ†πΌπ‘› ) is the characteristic function
of 𝐴. Note that πœ’ is a polynomial of degree 𝑛 with leading coefficient (−πœ†)𝑛 . In Complex Analysis you will learn
about the Fundamental Theorem of Algebra which says for this case that there are 𝑛 complex numbers πœ†1 , … , πœ†π‘›
such that
𝑛
∏
πœ’(πœ†) =
(πœ†π‘— − πœ†).
𝑗=1
Jordan’s canonical form, see Linear Algebra, says that the πœ†π‘— can be sorted into π‘˜ groups of equal number πœ‡1 , … , πœ‡π‘˜ ,
i. e.,
π‘˜
∏
πœ’(πœ†) =
(πœ‡π‘— − πœ†)𝑝𝑗 ,
𝑗=1
such there is an invertible 𝑛 × π‘› matrix 𝑇 so that
𝐴=𝑇
( π‘˜
⨁
)
𝐷𝑗
𝑇 −1 ,
𝑗=1
where
𝐷𝑗 = πœ‡π‘— 𝐼𝑝𝑗 + 𝐽𝑝𝑗
and 𝐽𝜈 is the 𝜈 × πœˆ matrix which has entries 1 just above the diagonal and 0 elsewhere. If we write, for 𝑙 ∈ β„•,
π½πœˆπ‘™ = (π‘šπ‘Ÿπ‘  )πœˆπ‘Ÿ,𝑠=1 , then π‘šπ‘Ÿπ‘  = 1 if 𝑠 − π‘Ÿ = 𝑙 and π‘šπ‘Ÿπ‘  = 0 otherwise. In particular, π½πœˆπ‘™ = 0 if 𝑙 ≥ 𝜈.
Then
)π‘š
)
( π‘˜
( π‘˜
⨁
⨁
π‘š
−1
π‘š
𝐴 =𝑇
𝐷𝑗
𝑇 =𝑇
𝐷𝑗 𝑇 −1 ,
𝑗=1
which gives
exp(𝐴π‘₯) = 𝑇
𝑗=1
( π‘˜
⨁
)
exp(𝐷𝑗 π‘₯) 𝑇 −1 .
𝑗=1
It remains to find exp(𝐷𝑗 π‘₯). For convenience, we drop the index and observe
𝑗 ( )
∑
𝑗 𝑗−𝑙 𝑙
(πœ‡πΌπœˆ + 𝐽𝜈 ) =
πœ‡ 𝐽𝜈 .
𝑙
𝑙=0
𝑗
Using
( )
𝑗!
1
1 𝑗
=
=
,
𝑗! 𝑙
𝑗!𝑙!(𝑗 − 𝑙)! 𝑙!(𝑗 − 𝑙)!
MATH2022
Introductory Analysis 2017 Lecture Manual
85
it follows that
exp(𝐷π‘₯) =
∞
∑
1
𝑗!
𝑗=0
(πœ‡πΌπœˆ + 𝐽𝜈 )𝑗 π‘₯𝑗
∞
𝑗 ( )
∑
1 ∑ 𝑗 𝑗−𝑙 𝑙 𝑗
=
πœ‡ 𝐽𝜈 π‘₯
𝑗! 𝑙=0 𝑙
𝑗=0
( )
𝑗 𝑗−𝑙 𝑙 𝑗
=
πœ‡ 𝐽𝜈 π‘₯
𝑗! 𝑙
𝑙=0 𝑗=𝑙
∞
𝜈−1 ∑
∑
1
=
=
=
7.6
𝜈−1
∞
∑
1∑
1
πœ‡π‘—−𝑙 π½πœˆπ‘™ π‘₯𝑗
𝑙!
(𝑗
−
𝑙)!
𝑗=𝑙
𝑙=0
𝜈−1
∞
∑
1∑ 1
𝑙! π‘˜=0 π‘˜!
𝑙=0
𝜈−1 𝑙
∑
π‘₯
𝑙!
𝑙=0
(πœ‡π‘₯)π‘˜ π½πœˆπ‘™ π‘₯𝑙
exp(πœ‡π‘₯)π½πœˆπ‘™ .
Solutions of Linear Differential Equations
In this section let 𝐼 be an interval, 𝑛 ∈ β„•∗ , and 𝐹 ∈ 𝑀𝑛 (𝐢(𝐼)).
Definition 7.9. A matrix function π‘Œ ∈ 𝑀𝑛 (𝐢 1 (𝐼)) is called a fundamental matrix of 𝑦′ = 𝐹 𝑦 if 𝑦′ (π‘₯) = 𝐹 (π‘₯)𝑦(π‘₯)
and if π‘Œ (π‘₯) is invertible for all π‘₯ ∈ 𝐼.
Proposition 7.20. 1. 𝑦′ = 𝐹 𝑦 has a fundamental matrix.
2. Let π‘Œ be a fundamental matrix of 𝑦′ = 𝐹 𝑦. Then 𝑍 ∈ 𝑀𝑛 (𝐢 1 (𝐼)) is a fundamental matrix of 𝑦′ = 𝐹 𝑦 if and
only if there is 𝐢 ∈ 𝑀𝑛 (𝔽 ) such that 𝑍 = π‘Œ 𝐢.
2
Proof. 1. Let 𝑐 ∈ 𝐼. Identifying 𝑀𝑛 (𝔽 ) with 𝔽 𝑛 , it follows from Corollary 7.14 that there is (a unique) π‘Œ ∈
𝑀𝑛 (𝐢 1 (𝐼)) such that π‘Œ ′ = 𝐹 π‘Œ and π‘Œ (𝑐) = 𝐼𝑛 , and there is 𝑍 ∈ 𝑀𝑛 (𝐢 1 (𝐼)) such that 𝑍 ′ = −𝐹 T 𝑍 and 𝑍(𝑐) = 𝐼𝑛 .
Then
( T )′
( )′
𝑍 π‘Œ = − 𝑍 T π‘Œ + 𝑍 Tπ‘Œ ′
= −𝑍 T 𝐹 T π‘Œ + 𝑍 T 𝐹 T π‘Œ = 0
shows that 𝑍 T π‘Œ is constant. So we have
𝑍(π‘₯)T π‘Œ (π‘₯) = 𝑍(𝑐)T π‘Œ (𝑐) = 𝐼𝑛 ,
which implies that π‘Œ (π‘₯) is invertible for all π‘₯ ∈ 𝐼.
2. If 𝑍 = π‘Œ 𝐢, then 𝑍(π‘₯) = π‘Œ (π‘₯)𝐢 is the product of invertible matrices and hence invertible for all π‘₯ ∈ 𝐼, and
𝑍 ′ = (π‘Œ 𝐢)′ = π‘Œ ′ 𝐢 = 𝐹 π‘Œ 𝐢 = 𝐹 𝑍
shows that 𝑍 is a fundamental matrix of 𝑦′ = 𝐹 𝑦.
Conversely, if 𝑍 is a fundamental matrix of 𝑦′ = 𝐹 𝑦, then 𝑍(π‘₯) is invertible for all π‘₯ ∈ 𝐼, and
(π‘Œ −1 𝑍)′ = −π‘Œ −1 π‘Œ ′ π‘Œ −1 𝑍 + π‘Œ −1 𝑍 ′
= −π‘Œ −1 𝐹 π‘Œ π‘Œ −1 𝑍 + π‘Œ −1 𝐹 𝑍 = 0,
which implies that π‘Œ −1 𝑍 is constant. So there is 𝐢 ∈ 𝑀𝑛 (𝔽 ) such that π‘Œ (π‘₯)−1 𝑍(π‘₯) = 𝐢 for all π‘₯ ∈ 𝐼. In
particular, 𝐢 is invertible since π‘Œ (π‘₯)−1 and 𝑍(π‘₯) are invertible. Finally, multiplying by π‘Œ (π‘₯) from the right, we
get 𝑍(π‘₯) = π‘Œ (π‘₯)𝐢.
86
Introductory Analysis 2017 Lecture Manual
MATH2022
Theorem 7.21. Assume that 𝐹 (π‘₯) = 𝐹 (𝑦) for all π‘₯, 𝑦 ∈ 𝐼. Let 𝑐 ∈ 𝐼 and define
βŽ› π‘₯
⎞
π‘Œ (π‘₯) = exp ⎜ 𝐹 (𝑑) π‘‘π‘‘βŽŸ .
⎜∫
⎟
βŽπ‘
⎠
Then π‘Œ is a fundamental matrix of 𝑦′ = 𝐹 𝑦.
Proof. π‘Œ (𝑐) = exp(0) = 𝐼𝑛 is invertible. Since 𝐹 (π‘₯) commutes with 𝐹 (𝑑) for all π‘₯, 𝑑 ∈ 𝐼, it follows that
𝑗
𝑗−1
βŽ‘βŽ› π‘₯
⎞⎀
βŽ› π‘₯
⎞
𝑑 ⎒⎜
⎟
βŽ₯
𝐹 (𝑑) 𝑑𝑑
= 𝑗𝐹 (π‘₯) ⎜ 𝐹 (𝑑) π‘‘π‘‘βŽŸ
⎟βŽ₯
⎜∫
⎟
𝑑π‘₯ ⎒⎜∫
⎣⎝ 𝑐
⎠⎦
βŽπ‘
⎠
,
so that
𝑗
∞
βŽ› π‘₯
⎞
𝑑 ∑ 1 ⎜
𝐹 (𝑑) π‘‘π‘‘βŽŸ
π‘Œ (π‘₯) =
⎟
𝑑π‘₯ 𝑗=0 𝑗! ⎜∫
βŽπ‘
⎠
′
𝑗
∞
βŽ‘βŽ› π‘₯
⎞⎀
∑
1 𝑑 ⎒⎜
=
𝐹 (𝑑) π‘‘π‘‘βŽŸ βŽ₯
⎟βŽ₯
𝑗! 𝑑π‘₯ ⎒⎜∫
𝑗=0
⎣⎝ 𝑐
⎠⎦
⎞
βŽ› π‘₯
=
𝐹 (π‘₯) ⎜ 𝐹 (𝑑) π‘‘π‘‘βŽŸ
⎟
⎜∫
𝑗!
𝑗=1
⎠
βŽπ‘
∞
∑
𝑗
𝑗−1
⎞
βŽ› π‘₯
= 𝐹 (π‘₯) exp ⎜ 𝐹 (𝑑) π‘‘π‘‘βŽŸ
⎟
⎜∫
⎠
βŽπ‘
= 𝐹 (π‘₯)π‘Œ (π‘₯).
Theorem 7.22. Let π‘Œ be a fundamental matrix of 𝑦′ = 𝐹 𝑦, 𝑓 ∈ 𝐢(𝐼, 𝔽 𝑛 ), 𝑐 ∈ 𝐼 and 𝑑 ∈ 𝔽 𝑛 . Then the unique
solution of the initial value problem 𝑦′ = 𝐹 𝑦 + 𝑓 , 𝑦(𝑐) = 𝑑 is
π‘₯
𝑦(π‘₯) = π‘Œ (π‘₯)π‘Œ
−1
(𝑐)𝑑 + π‘Œ (π‘₯)
∫
π‘Œ −1 (𝑑)𝑓 (𝑑) 𝑑𝑑.
𝑐
Proof. We could simply substitute 𝑦 into the differential equation. However, we give a constructive proof, the
so-called method of variation of the constants. It is easy to see that any solution of the homogeneous equation is
𝑦(π‘₯) = π‘Œ (π‘₯)𝑧 with 𝑧 ∈ 𝔽 𝑛 . (we are not going to prove this separately because it is a special case of this theorem
and therefore follows anyway). Now replace the constant vector 𝑧 by a vector function on 𝐼, i. e., 𝑦(π‘₯) = π‘Œ (π‘₯)𝑧(π‘₯),
where 𝑧 is the solution of the initial value problem. Then
𝑦′ (π‘₯) = π‘Œ ′ (π‘₯)𝑧(π‘₯) + π‘Œ (π‘₯)𝑧′ (π‘₯)
= 𝐹 (π‘₯)π‘Œ (π‘₯)𝑧(π‘₯) + π‘Œ (π‘₯)𝑧′ (π‘₯),
so that
𝑧′ (π‘₯) = π‘Œ −1 (π‘₯)π‘Œ (π‘₯)𝑧′ (π‘₯)
= π‘Œ −1 (π‘₯)[𝑦′ (π‘₯) − 𝐹 (π‘₯)𝑦(π‘₯)]
= π‘Œ −1 (π‘₯)𝑓 (π‘₯).
Since
𝑧(𝑐) = π‘Œ −1 (𝑐)𝑦(𝑐) = π‘Œ −1 (𝑐)𝑑,
MATH2022
Introductory Analysis 2017 Lecture Manual
it follows that
87
π‘₯
𝑧(π‘₯) =
∫
π‘Œ −1 (𝑑)𝑓 (𝑑) 𝑑𝑑 + π‘Œ −1 (𝑐)𝑑.
𝑐
Multiplication by π‘Œ (π‘₯) completes the proof.
Note. 1. If 𝑛 > 1 and 𝐹 is not constant, then, in general, there is no closed form formula for the fundamental
matrix.
2. With the procedure outlined before Corollary 7.16, the results of this section also apply to linear 𝑛-th order
differential equations. In particular, the elements of the first row of a fundamental system are called a fundamental
system, say 𝑦1 , … , 𝑦𝑛 of the 𝑛-th order differential equation, and any solution of the homogeneous 𝑛-th order
equation is a linear combination of the fundamental system.
Conversely, if 𝑦1 , … , 𝑦𝑛 is a fundamental system of the 𝑛-th order differential equation, then
(
)𝑛
π‘Œ = 𝑦(𝑖−1)
𝑗
𝑖,𝑗=1
is a fundamental matrix of the corresponding first order system.
3. In principle, if we consider an 𝑛-th order differential equation with constant coefficients
𝑦(𝑛) =
𝑛−1
∑
π‘Žπ‘— 𝑦(𝑗) ,
𝑗=0
one can use Jordan’s canonical form to find a fundamental system. However, this is very cumbersome. One can
proceed as follows: Consider the characteristic polynomial
πœ‡π‘› −
𝑛−1
∑
π‘Žπ‘— πœ‡ 𝑗
𝑗=0
and find its distinct (complex) zeros, say πœ‡1 , … , πœ‡π‘˜ , with multiplicities 𝑝1 , … , π‘π‘˜ . Then the 𝑛 functions
π‘₯𝑙 π‘’πœ‡π‘— π‘₯ ,
𝑗 = 1 … , π‘˜, 𝑙 = 0, … , 𝑝𝑗 − 1
form a fundamental system of the 𝑛-th order linear differential equations.
Download