Legal Notice
This book is copyright 2018 with all rights reserved. It is illegal to copy, distribute, or
create derivative works from this book in whole or in part or to contribute to the
copying, distribution, or creating of derivative works of this book.
For information on bulk purchases and licensing agreements, please email
support@SATPrepGet800.com
Acknowledgements
Thanks to Daniel Dimijian, Scott Jeffreys, Dan Seabold, C.R. Sincock, Pete Terlecky,
Zoran Sunik, and David Wayne for their helpful input during the creation of this book.
Pure Mathematics
for Beginners
A Rigorous Introduction to Logic, Set Theory,
Abstract Algebra, Number Theory, Real Analysis,
Topology, Complex Analysis, and Linear Algebra
--------
Dr. Steve Warner
© 2018, All Rights Reserved
Table of Contents
Introduction
For students
For instructors
Lesson 1 – Logic: Statements and Truth
Statements with Words
Statements with Symbols
Truth Tables
Problem Set 1
Lesson 2 – Set Theory: Sets and Subsets
Describing Sets
Subsets
Unions and Intersections
Problem Set 2
Lesson 3 – Abstract Algebra: Semigroups, Monoids, and Groups
Binary Operations and Closure
Semigroups and Associativity
Monoids and Identity
Groups and Inverses
Problem Set 3
Lesson 4 – Number Theory: The Ring of Integers
Rings and Distributivity
Divisibility
Induction
Problem Set 4
Lesson 5 – Real Analysis: The Complete Ordered Field of Reals
Fields
Ordered Rings and Fields
Why Isn’t ℚ enough?
Completeness
Problem Set 5
Lesson 6 – Topology: The Topology of ℝ
Intervals of Real Numbers
Operations on Sets
Open and Closed Sets
Problem Set 6
Lesson 7 – Complex Analysis: The Field of Complex Numbers
A Limitation of the Reals
The Complex Field
Absolute Value and Distance
Basic Topology of ℂ
Problem Set 7
Lesson 8 – Linear Algebra: Vector Spaces
Vector Spaces Over Fields
Subspaces
Bases
Problem Set 8
Lesson 9 – Logic: Logical Arguments
Statements and Substatements
Logical Equivalence
Validity in Sentential Logic
Problem Set 9
Lesson 10 – Set Theory: Relations and Functions
Relations
Equivalence Relations and Partitions
Orderings
Functions
Equinumerosity
Problem Set 10
Lesson 11 – Abstract Algebra: Structures and Homomorphisms
Structures and Substructures
Homomorphisms
Images and Kernels
Normal Subgroups and Ring Ideals
Problem Set 11
Lesson 12 – Number Theory: Primes, GCD, and LCM
Prime Numbers
The Division Algorithm
GCD and LCM
Problem Set 12
Lesson 13 – Real Analysis: Limits and Continuity
Strips and Rectangles
Limits and Continuity
Equivalent Definitions of Limits and Continuity
Basic Examples
Limit and Continuity Theorems
Limits Involving Infinity
One-sided Limits
Problem Set 13
Lesson 14 – Topology: Spaces and Homeomorphisms
Topological Spaces
Bases
Types of Topological Spaces
Continuous Functions and Homeomorphisms
Problem Set 14
Lesson 15 – Complex Analysis: Complex Valued Functions
The Unit Circle
Exponential Form of a Complex Number
Functions of a Complex Variable
Limits and Continuity
The Riemann Sphere
Problem Set 15
Lesson 16 – Linear Algebra: Linear Transformations
Linear Transformations
Matrices
The Matrix of a Linear Transformation
Images and Kernels
Eigenvalues and Eigenvectors
Problem Set 16
Index
About the Author
Books by Dr. Steve Warner
INTRODUCTION
PURE MATHEMATICS
This book was written to provide a basic but rigorous introduction to pure mathematics, while exposing
students to a wide range of mathematical topics in logic, set theory, abstract algebra, number theory,
real analysis, topology, complex analysis, and linear algebra.
For students: There are no prerequisites for this book. The content is completely self-contained.
Students with a bit of mathematical knowledge may have an easier time getting through some of the
material, but no such knowledge is necessary to read this book.
More important than mathematical knowledge is “mathematical maturity.” Although there is no single
agreed upon definition of mathematical maturity, one reasonable way to define it is as “one’s ability
to analyze, understand, and communicate mathematics.” A student with a higher level of mathematical
maturity will be able to move through this book more quickly than a student with a lower level of
mathematical maturity.
Whether your level of mathematical maturity is low or high, if you are just starting out in pure
mathematics, then you’re in the right place. If you read this book the “right way,” then your level of
mathematical maturity will continually be increasing. This increased level of mathematical maturity will
not only help you to succeed in advanced math courses, but it will improve your general problem
solving and reasoning skills. This will make it easier to improve your performance in college, in your
professional life, and on standardized tests such as the SAT, ACT, GRE, and GMAT.
So, what is the “right way” to read this book? Simply reading each lesson from end to end without any
further thought and analysis is not the best way to read the book. You will need to put in some effort
to have the best chance of absorbing and retaining the material. When a new theorem is presented,
don’t just jump right to the proof and read it. Think about what the theorem is saying. Try to describe
it in your own words. Do you believe that it is true? If you do believe it, can you give a convincing
argument that it is true? If you do not believe that it is true, try to come up with an example that shows
it is false, and then figure out why your example does not contradict the theorem. Pick up a pen or
pencil. Draw some pictures, come up with your own examples, and try to write your own proof.
You may find that this book goes into more detail than other math books when explaining examples,
discussing concepts, and proving theorems. This was done so that any student can read this book, and
not just students who are naturally gifted in mathematics. So, it is up to you as the student to try to
answer questions before they are answered for you. When a new definition is given, try to think of your
own examples before looking at those presented in the book. And when the book provides an example,
do not just accept that it satisfies the given definition. Convince yourself. Prove it.
Each lesson is followed by a Problem Set. The problems in each Problem Set have been organized into
five levels, Level 1 problems being considered the easiest, and Level 5 problems being considered the
most difficult. If you want to get just a small taste of pure mathematics, then you can work on the
easier problems. If you want to achieve a deeper understanding of the material, take some time to
struggle with the harder problems.
For instructors: This book can be used for a wide range of courses. Although the lessons can be taught
in the order presented, they do not need to be. The lessons cycle twice among eight subject areas:
logic, set theory, abstract algebra, number theory, real analysis, topology, complex analysis, and linear
algebra.
Lessons 1 through 8 give only the most basic material in each of these subjects. Therefore, an instructor
who wants to give a brief glimpse into a wide variety of topics might want to cover just the first eight
lessons in their course.
Lessons 9 through 16 cover material in each subject area that the author believes is fundamental to a
deep understanding of that particular subject.
For a first course in higher mathematics, a high-quality curriculum can be created by choosing among
the 16 lessons contained in this book.
As an example, an introductory course focusing on logic, set theory, and real analysis might cover
Lessons 1, 2, 5, 9, 10, and 13. Lessons 1 and 9 cover basic sentential logic and proof theory, Lessons 2
and 10 cover basic set theory including relations, functions, and equinumerosity, and Lessons 5 and 13
cover basic real analysis up through a rigorous treatment of limits and continuity. The first three lessons
are quite basic, while the latter three lessons are at an intermediate level. Instructors that do not like
the idea of leaving a topic and then coming back to it later can cover the lessons in the following order
without issue: 1, 9, 2, 10, 5, and 13.
As another example, a course focusing on algebraic structures might cover Lessons 2, 3, 4, 5, 10, and
11. As mentioned in the previous paragraph, Lessons 2 and 10 cover basic set theory. In addition,
Lessons 3, 4, 5, and 11 cover semigroups, monoids, groups, rings, and fields. Lesson 4, in addition to a
preliminary discussion on rings, also covers divisibility and the principle of mathematical induction.
Similarly, Lesson 5, in addition to a preliminary discussion on fields, provides a development of the
complete ordered field of real numbers. These topics can be included or omitted, as desired. Instructors
that would also like to incorporate vector spaces can include part or all of Lesson 8.
The author strongly recommends covering Lesson 2 in any introductory pure math course. This lesson
fixes some basic set theoretical notation that is used throughout the book and includes some important
exposition to help students develop strong proof writing skills as quickly as possible.
The author welcomes all feedback from instructors. Any suggestions will be considered for future
editions of the book. The author would also love to hear about the various courses that are created
using these lessons. Feel free to email Dr. Steve Warner with any feedback at
steve@SATPrepGet800.com
LESSON 1 – LOGIC
STATEMENTS AND TRUTH
Statements with Words
A statement (or proposition) is a sentence that can be true or false, but not both simultaneously.
Example 1.1: “Mary is awake” is a statement because at any given time either Mary is awake or Mary
is not awake (also known as Mary being asleep), and Mary cannot be both awake and asleep at the
same time.
Example 1.2: The sentence “Wake up!” is not a statement because it cannot be true or false.
An atomic statement expresses a single idea. The statement “Mary is awake” that we discussed above
is an example of an atomic statement. Let’s look at a few more examples.
Example 1.3: The following sentences are atomic statements:
1. 17 is a prime number.
2. George Washington was the first president of the United States.
3. 5 > 6.
4. David is left-handed.
Sentences 1 and 2 above are true, and sentence 3 is false. We can’t say for certain whether sentence 4
is true or false without knowing who David is. However, it is either true or false. It follows that each of
the four sentences above is an atomic statement.
We use logical connectives to form compound statements. The most commonly used logical
connectives are “and,” “or,” “if…then,” “if and only if,” and “not.”
Example 1.4: The following sentences are compound statements:
1. 17 is a prime number and 0 = 1.
2. Michael is holding a pen or water is a liquid.
3. If Joanna has a cat, then fish have lungs.
4. Albert Einstein is alive today if and only if 5 + 7 = 12.
5. 16 is not a perfect square.
Sentence 1 above uses the logical connective “and.” Since the statement “0 = 1” is false, it follows that
sentence 1 is false. It does not matter that the statement “17 is a prime number” is true. In fact, “T
and F” is always F.
Sentence 2 uses the logical connective “or.” Since the statement “water is a liquid” is true, it follows
that sentence 2 is true. It does not even matter whether Michael is holding a pen. In fact, “T or T” is
always true and “F or T” is always T.
It’s worth pausing for a moment to note that in the English language the word “or” has two possible
meanings. There is an “inclusive or” and an “exclusive or.” The “inclusive or” is true when both
statements are true, whereas the “exclusive or” is false when both statements are true. In
mathematics, by default, we always use the “inclusive or” unless we are told to do otherwise. To some
extent, this is an arbitrary choice that mathematicians have agreed upon. However, it can be argued
that it is the better choice since it is used more often and it is easier to work with. Note that we were
assuming use of the “inclusive or” in the last paragraph when we said, “In fact, “T or T” is always true.”
See Problem 4 below for more on the “exclusive or.”
Sentence 3 uses the logical connective “if…then.” The statement “fish have lungs” is false. We need to
know whether Joanna has a cat in order to figure out the truth value of sentence 3. If Joanna does have
a cat, then sentence 3 is false (“if T, then F” is always F). If Joanna does not have a cat, then sentence
3 is true (“if F, then F” is always T).
Sentence 4 uses the logical connective “if and only if.” Since the two atomic statements have different
truth values, it follows that sentence 4 is false. In fact, “F if and only if T” is always F.
Sentence 5 uses the logical connective “not.” Since the statement “16 is a perfect square” is true, it
follows that sentence 5 is false. In fact, “not T” is always F.
Notes: (1) The logical connectives “and,” “or,” “if…then,” and “if and only if,” are called binary
connectives because they join two statements (the prefix “bi” means “two”).
(2) The logical connective “not” is called a unary connective because it is applied to just a single
statement (“unary” means “acting on a single element”).
Example 1.5: The following sentences are not statements:
1. Are you happy?
2. Go away!
3. x − 5 = 7
4. This sentence is false.
5. This sentence is true.
Sentence 1 above is a question and sentence 2 is a command. Sentence 3 has an unknown variable – it
can be turned into a statement by assigning a value to the variable. Sentences 4 and 5 are
self-referential (they refer to themselves). They can be neither true nor false. Sentence 4 is called the
Liar’s paradox and sentence 5 is called a vacuous affirmation.
Statements with Symbols
We will use letters such as p, q, r, and s to denote atomic statements. We sometimes call these letters
propositional variables, and we will generally assign a truth value of T (for true) or F (for false) to each
propositional variable. Formally, we define a truth assignment of a list of propositional variables to be
a choice of T or F for each propositional variable in the list.
We use the symbols ∧, ∨, →, ↔, and ¬ for the most common logical connectives. The truth value of a
compound statement is determined by the truth values of its atomic parts together with the following
rules for the connectives.
• p ∧ q is called the conjunction of p and q. It is pronounced “p and q.” p ∧ q is true when both
p and q are true, and it is false otherwise.
• p ∨ q is called the disjunction of p and q. It is pronounced “p or q.” p ∨ q is true when p or q
(or both) are true, and it is false when p and q are both false.
• p → q is called a conditional or implication. It is pronounced “if p, then q” or “p implies q.”
p → q is true when p is false or q is true (or both), and it is false when p is true and q is false.
• p ↔ q is called a biconditional. It is pronounced “p if and only if q.” p ↔ q is true when p and
q have the same truth value (both true or both false), and it is false when p and q have opposite
truth values (one true and the other false).
• ¬p is called the negation of p. It is pronounced “not p.” ¬p is true when p is false, and it is false
when p is true (p and ¬p have opposite truth values).
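The rules for the five connectives translate directly into code. The following is a minimal Python sketch, not part of the original text, that models T and F as the booleans True and False. The function names conj, disj, cond, bicond, and neg are our own illustrative choices.

```python
# The five connectives as functions on booleans (True plays the role of T,
# False plays the role of F). All names here are illustrative assumptions.

def conj(p: bool, q: bool) -> bool:
    """p ∧ q: true exactly when both p and q are true."""
    return p and q

def disj(p: bool, q: bool) -> bool:
    """p ∨ q: true when p or q (or both) are true."""
    return p or q

def cond(p: bool, q: bool) -> bool:
    """p → q: false only when p is true and q is false."""
    return (not p) or q

def bicond(p: bool, q: bool) -> bool:
    """p ↔ q: true exactly when p and q have the same truth value."""
    return p == q

def neg(p: bool) -> bool:
    """¬p: the opposite truth value of p."""
    return not p

print(cond(True, False))   # False
print(cond(False, False))  # True
```

Writing the conditional as (not p) or q mirrors the rule that p → q is true when p is false or q is true (or both).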
Example 1.6: Let p represent the statement “Fish can swim,” and let q represent the statement
“7 < 3.” Note that p is true and q is false.
1. p ∧ q represents “Fish can swim and 7 < 3.” Since q is false, it follows that p ∧ q is false.
2. p ∨ q represents “Fish can swim or 7 < 3.” Since p is true, it follows that p ∨ q is true.
3. p → q represents “If fish can swim, then 7 < 3.” Since p is true and q is false, p → q is false.
4. p ↔ q represents “Fish can swim if and only if 7 < 3.” Since p is true and q is false, p ↔ q is
false.
5. ¬q represents the statement “7 is not less than 3.” This is equivalent to “7 is greater than or
equal to 3,” or equivalently, “7 ≥ 3.” Since q is false, ¬q is true.
6. ¬p ∨ q represents the statement “Fish cannot swim or 7 < 3.” Since ¬p and q are both false,
¬p ∨ q is false. Note that ¬p ∨ q always means (¬p) ∨ q. In general, without parentheses
present, we always apply negation before any of the other connectives.
7. ¬(p ∨ q) represents the statement “It is not the case that either fish can swim or 7 < 3.” This
can also be stated as “Neither can fish swim nor is 7 less than 3.” Since p ∨ q is true (see 2
above), ¬(p ∨ q) is false.
8. ¬p ∧ ¬q represents the statement “Fish cannot swim and 7 is not less than 3.” This statement
can also be stated as “Neither can fish swim nor is 7 less than 3.” Since this is the same
statement as in 7 above, it should follow that ¬p ∧ ¬q is equivalent to ¬(p ∨ q). After
completing this lesson, you will be able to verify this. For now, let’s observe that since ¬p is
false, it follows that ¬p ∧ ¬q is false. This agrees with the truth value we got in 7. (Note: The
equivalence of ¬p ∧ ¬q with ¬(p ∨ q) is one of De Morgan’s laws. These laws will be explored
further in Lesson 9. See also Problem 3 below.)
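Items 7 and 8 of Example 1.6 can also be checked mechanically. The short Python sketch below is an illustration rather than part of the original text. It assumes p stands for “Fish can swim” (true) and q for “7 < 3” (false), and the name de_morgan_holds is our own.

```python
from itertools import product

p, q = True, False   # "Fish can swim" is true; "7 < 3" is false

print(not (p or q))          # item 7, ¬(p ∨ q): False
print((not p) and (not q))   # item 8, ¬p ∧ ¬q: False

# Compare the two statements under every truth assignment of two variables.
de_morgan_holds = all(
    (not (a or b)) == ((not a) and (not b))
    for a, b in product([True, False], repeat=2)
)
print(de_morgan_holds)       # True
```

The final check runs through all four truth assignments, so it confirms agreement in every case and not only for the particular p and q of the example.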
Truth Tables
A truth table can be used to display the possible truth values of a compound statement. We start by
labelling the columns of the table with the propositional variables that appear in the statement,
followed by the statement itself. We then use the rows to run through every possible combination of
truth values for the propositional variables followed by the resulting truth values for the compound
statement. Let’s look at the truth tables for the five most common logical connectives.

p   q   p ∧ q
T   T     T
T   F     F
F   T     F
F   F     F

p   q   p ∨ q
T   T     T
T   F     T
F   T     T
F   F     F

p   q   p → q
T   T     T
T   F     F
F   T     T
F   F     T

p   q   p ↔ q
T   T     T
T   F     F
F   T     F
F   F     T

p   ¬p
T   F
F   T

We can use these five truth tables to compute the truth values of compound statements involving the
five basic logical connectives.
Note: For statements involving just 1 propositional variable (such as ¬p), the truth table requires 2
rows, 1 for each truth assignment of p (T or F).
For statements involving 2 propositional variables (such as p ∧ q), the truth table requires 2 ⋅ 2 = 4 (or
2² = 4) rows, as there are 4 possible combinations for truth assignments of p and q (TT, TF, FT, FF).
In general, for a statement involving n propositional variables, the truth table will require 2ⁿ rows. For
example, if we want to build an entire truth table for ¬p ∨ (¬q → r), we will need 2³ = 2 ⋅ 2 ⋅ 2 = 8
rows in the truth table. We will create the truth table for this statement in Example 1.8 below (see the
third solution).
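To see the row count concretely, here is a small Python sketch (our own illustration, with the hypothetical helper name truth_assignments) that lists every truth assignment for a given number of propositional variables using the standard library function itertools.product.

```python
from itertools import product

def truth_assignments(n: int) -> list[tuple[bool, ...]]:
    """All truth assignments for n propositional variables, listing T before F."""
    return list(product([True, False], repeat=n))

for n in (1, 2, 3):
    print(n, len(truth_assignments(n)))   # 1 2, then 2 4, then 3 8

# For two variables the order is TT, TF, FT, FF.
print(truth_assignments(2))
```

Each additional variable doubles the number of assignments, which is where the count of 2 multiplied by itself n times comes from.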
Example 1.7: If p is true and q is false, then we can compute the truth value of p ∧ q by looking at the
second row of the truth table for the conjunction.

p   q   p ∧ q
T   T     T
T   F     F    ←
F   T     F
F   F     F

We see from the highlighted row (marked with ←) that p ∧ q ≡ T ∧ F ≡ F.
Note: Here the symbol ≡ can be read “is logically equivalent to.” So, we see that if p is true and q is
false, then p ∧ q is logically equivalent to F, or more simply, p ∧ q is false.
Example 1.8: Let p, q, and r be propositional variables with p and q true, and r false. Let’s compute
the truth value of ¬p ∨ (¬q → r).
Solution: We have ¬p ∨ (¬q → r) ≡ ¬T ∨ (¬T → F) ≡ F ∨ (F → F) ≡ F ∨ T ≡ T.
Notes: (1) For the first equivalence, we simply replaced the propositional variables by their given truth
values. We replaced p and q by T, and we replaced r by F.
(2) For the second equivalence, we used the first row of the truth table for the
negation (reproduced here for your convenience).

p   ¬p
T   F    ←
F   T

We see from the highlighted row (marked with ←) that ¬T ≡ F. We applied this result twice.
(3) For the third equivalence, we used the fourth row of the truth table for the conditional.

p   q   p → q
T   T     T
T   F     F
F   T     T
F   F     T    ←

We see from the highlighted row (marked with ←) that F → F ≡ T.
(4) For the last equivalence, we used the third row of the truth table for the disjunction.

p   q   p ∨ q
T   T     T
T   F     T
F   T     T    ←
F   F     F

We see from the highlighted row (marked with ←) that F ∨ T ≡ T.
(5) We can save a little time by immediately replacing the negation of a propositional variable by its
truth value (which will be the opposite truth value of the propositional variable). For example, since p
has truth value T, we can replace ¬p by F. The faster solution would look like this:
¬p ∨ (¬q → r) ≡ F ∨ (F → F) ≡ F ∨ T ≡ T.
Quicker solution: Since q has truth value T, it follows that ¬q has truth value F. So, ¬q → r has truth
value T. Finally, ¬p ∨ (¬q → r) must then have truth value T.
Notes: (1) Symbolically, we can write the following:
¬p ∨ (¬q → r) ≡ ¬p ∨ (¬T → r) ≡ ¬p ∨ (F → r) ≡ ¬p ∨ T ≡ T
(2) We can display this reasoning visually as follows:

¬p ∨ (¬q → r)
   |  || |
   |  |T |
   |  F  |
   |     T
   T

The vertical lines have just been included to make sure you see which connective each truth value is
written below.
We began by placing a T under the propositional variable q to indicate that q is true. Since ¬T ≡ F, we
then place an F under the negation symbol. Next, since F → r ≡ T regardless of the truth value of r,
we place a T under the conditional symbol. Finally, since ¬p ∨ T ≡ T regardless of the truth value of
p, we place a T under the disjunction symbol. This last T, on the bottom line, is the final answer.
(3) Knowing that q has truth value T is enough to determine the truth value of ¬p ∨ (¬q → r), as we
saw in Note 1 above. It’s okay if you didn’t notice that right away. This kind of reasoning takes a bit of
practice and experience.
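The observation that the truth value of q alone settles the matter can be tested by brute force. The following Python sketch is our own illustration. The function name statement is hypothetical, and the conditional x → y is encoded as (not x) or y.

```python
from itertools import product

def statement(p: bool, q: bool, r: bool) -> bool:
    """Truth value of ¬p ∨ (¬q → r)."""
    not_q_implies_r = (not (not q)) or r   # ¬q → r, written as ¬(¬q) ∨ r
    return (not p) or not_q_implies_r

# The assignment from Example 1.8: p and q true, r false.
print(statement(True, True, False))   # True

# Fix q at True and try every combination of truth values for p and r.
always_true_when_q = all(
    statement(p, True, r) for p, r in product([True, False], repeat=2)
)
print(always_true_when_q)             # True
```

The second check confirms that once q is true, the values of p and r no longer matter.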
Truth table solution: An alternative solution is to build the whole truth table of ¬p ∨ (¬q → r) one
column at a time. Since there are 3 propositional variables (p, q, and r), we will need 2³ = 8 rows to
get all the possible truth values. We then create a column for each compound statement that appears
within the given statement starting with the statements of smallest length and working our way up to
the given statement. We will need columns for p, q, r (the atomic statements), ¬p, ¬q, ¬q → r, and
finally, the statement itself, ¬p ∨ (¬q → r). Below is the final truth table with the relevant row
highlighted (marked with ←). The final answer is the last entry in that row.

p   q   r   ¬p   ¬q   ¬q → r   ¬p ∨ (¬q → r)
T   T   T   F    F      T            T
T   T   F   F    F      T            T          ←
T   F   T   F    T      T            T
T   F   F   F    T      F            F
F   T   T   T    F      T            T
F   T   F   T    F      T            T
F   F   T   T    T      T            T
F   F   F   T    T      F            T

Notes: (1) We fill out the first three columns of the truth table by listing all possible combinations of
truth assignments for the propositional variables p, q, and r. Notice how down the first column we
have 4 T’s followed by 4 F’s, down the second column we alternate sequences of 2 T’s with 2 F’s, and
down the third column we alternate T’s with F’s one at a time. This is a nice systematic way to make
sure we get all possible combinations of truth assignments.
If you’re having trouble seeing the pattern of T’s and F’s, here is another way to think about it: In the
first column, the first half of the rows have a T and the remainder have an F. This gives 4 T’s followed
by 4 F’s.
For the second column, we take half the number of consecutive T’s in the first column (half of 4 is 2)
and then we alternate between 2 T’s and 2 F’s until we fill out the column.
For the third column, we take half the number of consecutive T’s in the second column (half of 2 is 1)
and then we alternate between 1 T and 1 F until we fill out the column.
(2) Since the connective ¬ has the effect of taking the opposite truth value, we generate the entries in
the fourth column by taking the opposite of each truth value in the first column. Similarly, we generate
the entries in the fifth column by taking the opposite of each truth value in the second column.
(3) For the sixth column, we apply the connective → to the fifth and third columns, respectively, and
finally, for the last column, we apply the connective ∨ to the fourth and sixth columns, respectively.
(4) The original question is asking us to compute the truth value of ¬p ∨ (¬q → r) when p and q are
true, and r is false. In terms of the truth table, we are being asked for the entry in the second row and
last (seventh) column. Therefore, the answer is T.
(5) This is certainly not the most efficient way to answer the given question. However, building truth
tables is not too difficult, and it’s a foolproof way to determine truth values of compound statements.
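The column-by-column construction described in these notes can be sketched in Python. This is an illustration under our own naming (implies, tf, and rows are hypothetical helpers), not a method taken from the text.

```python
from itertools import product

def implies(x: bool, y: bool) -> bool:
    """x → y: false only when x is true and y is false."""
    return (not x) or y

def tf(value: bool) -> str:
    """Render a boolean as the letter T or F."""
    return "T" if value else "F"

rows = []
for p, q, r in product([True, False], repeat=3):   # TTT, TTF, TFT, ..., FFF
    not_p, not_q = not p, not q                    # fourth and fifth columns
    inner = implies(not_q, r)                      # sixth column: ¬q → r
    whole = not_p or inner                         # seventh column: ¬p ∨ (¬q → r)
    rows.append((p, q, r, not_p, not_q, inner, whole))

print("p q r ¬p ¬q ¬q→r ¬p∨(¬q→r)")
for row in rows:
    print(" ".join(tf(v) for v in row))
```

The second entry of rows corresponds to p and q true and r false, and its last component is the answer to Example 1.8.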
Problem Set 1
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Determine whether each of the following sentences is an atomic statement, a compound
statement, or not a statement at all:
(i) I am not going to work today.
(ii) What is the meaning of life?
(iii) Don’t go away mad.
(iv) I watched the television show Parks and Recreation.
(v) If pigs have wings, then they can fly.
(vi) 3 < −5 or 38 > 37.
(vii) This sentence has five words.
(viii) I cannot swim, but I can run fast.
2. What is the negation of each of the following statements?
(i) The banana is my favorite fruit.
(ii) 7 > −3.
(iii) You are not alone.
(iv) The function f is differentiable everywhere.
LEVEL 2
3. Let p represent the statement “9 is a perfect square,” let q represent the statement “Orange is a
primary color,” and let r represent the statement “A frog is a reptile.” Rewrite each of the
following symbolic statements in words, and state the truth value of each statement:
(i)
๐∧๐
(ii)
¬๐
(iii)
๐→๐
(iv)
๐↔๐
(v)
¬๐ ∧ ๐
(vi)
¬(๐ ∧ ๐)
(vii)
¬๐ ∨ ¬๐
(viii)
(๐ ∧ ๐) → ๐
4. Consider the compound sentence “You can have a cookie or ice cream.” In English this would
most likely mean that you can have one or the other but not both. The word “or” used here is
generally called an “exclusive or” because it excludes the possibility of both. The disjunction is
an “inclusive or.” Using the symbol ⊕ for exclusive or, draw the truth table for this connective.
LEVEL 3
5. Let p, q, and r represent true statements. Compute the truth value of each of the following
compound statements:
(i)
(๐ ∨ ๐) ∨ ๐
(ii)
(๐ ∨ ๐) ∧ ¬๐
(iii)
¬๐ → (๐ ∨ ๐)
(iv)
¬(๐ ↔ ¬๐) ∧ ๐
(v)
¬[๐ ∧ (¬๐ → ๐)]
(vi)
¬[(¬๐ ∨ ¬๐) ↔ ¬๐]
(vii)
๐ → (๐ → ¬๐)
(viii) ¬[¬๐ → (๐ → ¬๐)]
6. Using only the logical connectives ¬, ∧, and ∨, produce a statement using the propositional
variables p and q that has the same truth values as p ⊕ q (this is the “exclusive or” defined in
Problem 4 above).
LEVEL 4
7. Let p represent a true statement. Decide if this is enough information to determine the truth value
of each of the following statements. If so, state that truth value.
(i)
๐∨๐
(ii)
๐→๐
(iii)
¬๐ → ¬(๐ ∨ ¬๐)
(iv)
¬(¬๐ ∧ ๐) ↔ ๐
(v)
(๐ ↔ ๐) ↔ ¬๐
(vi)
¬[(¬๐ ∧ ¬๐) ↔ ¬๐]
(vii)
[(๐ ∧ ¬๐) → ๐] ∧ (๐ ∨ ¬๐)
(viii) ๐ → [¬๐ → (¬๐ → ¬๐)]
8. Assume that the given compound statement is true. Determine the truth value of each
propositional variable.
(i)
๐∧๐
(ii)
¬(๐ → ๐)
(iii)
๐ ↔ [¬(๐ ∧ ๐)]
(iv)
[๐ ∧ (๐ ∨ ๐)] ∧ ¬๐
LEVEL 5
9. Show that [p ∧ (q ∨ r)] ↔ [(p ∧ q) ∨ (p ∧ r)] is always true.
10. Show that [[(p ∧ q) → r] → s] → [(p → r) → s] is always true.
LESSON 2 – SET THEORY
SETS AND SUBSETS
Describing Sets
A set is simply a collection of “objects.” These objects can be numbers, letters, colors, animals, funny
quotes, or just about anything else you can imagine. We will usually refer to the objects in a set as the
members or elements of the set.
If a set consists of a small number of elements, we can describe the set simply by listing the elements
in the set in curly braces, separating elements by commas.
Example 2.1:
1. {apple, banana} is the set consisting of two elements: apple and banana.
2. {anteater, elephant, egg, trapezoid} is the set consisting of four elements: anteater, elephant,
egg, and trapezoid.
3. {2, 4, 6, 8, 10} is the set consisting of five elements: 2, 4, 6, 8, and 10. The elements in this set
happen to be numbers.
A set is determined by its elements, and not the order in which the elements are presented. For
example, the set {4, 2, 8, 6, 10} is the same as the set {2, 4, 6, 8, 10}.
Also, the set {2, 2, 4, 6, 8, 10, 10, 10} is the same as the set {2, 4, 6, 8, 10}. If we are describing a set by
listing its elements, the most natural way to do this is to list each element just once.
We will usually name sets using capital letters such as A, B, and C. For example, we might write
A = {1, 2, 3}. So, A is the set consisting of the elements 1, 2, and 3.
Example 2.2: Consider the sets A = {a, b}, B = {b, a}, C = {a, b, a}. Then A, B, and C all represent the
same set. We can write A = B = C.
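Python’s built-in set type follows the same convention, which makes it a convenient way to experiment. In the sketch below (our own illustration, reusing the numeric sets from the discussion above), the names A, B, and C are just labels for the three listings.

```python
A = {2, 4, 6, 8, 10}
B = {4, 2, 8, 6, 10}               # same elements, different order
C = {2, 2, 4, 6, 8, 10, 10, 10}    # same elements, some listed more than once

print(A == B)    # True
print(A == C)    # True
print(len(C))    # 5
```

Reordering or repeating elements in the listing changes nothing, because equality of sets depends only on which elements belong to them.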
We use the symbol ∈ for the membership relation (we will define the term “relation” more carefully in
Lesson 10). So, x ∈ A means “x is an element of A,” whereas x ∉ A means “x is not an element of A.”
Example 2.3: Let A = {a, b, 3, □, ⊕}. Then a ∈ A, b ∈ A, 3 ∈ A, □ ∈ A, and ⊕ ∈ A.
If a set consists of many elements, we can use ellipses (…) to help describe the set. For example, the
set consisting of the natural numbers between 17 and 5326, inclusive, can be written
{17, 18, 19, … ,5325, 5326} (“inclusive” means that we include 17 and 5326). The ellipses between 19
and 5325 are there to indicate that there are elements in the set that we are not explicitly mentioning.
Ellipses can also be used to help describe infinite sets. The set of natural numbers can be written
ℕ = {0, 1, 2, 3, … }, and the set of integers can be written ℤ = {… , −4, −3, −2, −1, 0, 1, 2, 3, 4, … }.
Example 2.4: The odd natural numbers can be written 𝕆 = {1, 3, 5, … }. The even integers can be
written 2ℤ = {… , −6, −4, −2, 0, 2, 4, 6, … }. The primes can be written ℙ = {2, 3, 5, 7, 11, 13, 17, … }.
A set can also be described by a certain property P that all its elements have in common. In this case,
we can use the set-builder notation {x | P(x)} to describe the set. The expression {x | P(x)} can be read
“the set of all x such that the property P(x) is true.” Note that the symbol “|” is read as “such that.”
Example 2.5: Let’s look at a few different ways that we can describe the set {2, 4, 6, 8, 10}. We have
already seen that reordering and/or repeating elements does not change the set. For example,
{2, 2, 6, 4, 10, 8} describes the same set. Here are a few more descriptions using set-builder notation:
• {n | n is an even positive integer less than or equal to 10}
• {n ∈ ℤ | n is even, 0 < n ≤ 10}
• {2k | k = 1, 2, 3, 4, 5}
The first expression in the bulleted list can be read “the set of n such that n is an even positive integer
less than or equal to 10.” The second expression can be read “the set of integers n such that n is even
and n is between 0 and 10, including 10, but excluding 0.” Note that the abbreviation “n ∈ ℤ” can be
read “n is in the set of integers,” or more succinctly, “n is an integer.” The third expression can be read
“the set of 2k such that k is 1, 2, 3, 4, or 5.”
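Set-builder notation has a close analogue in Python’s set comprehensions. The sketch below is our own illustration of the three descriptions. Since a program cannot range over all of ℤ, the second description searches a finite window of integers (−20 through 20), which is an assumption made only so that the code terminates.

```python
# {n | n is an even positive integer less than or equal to 10}
first = {n for n in range(1, 11) if n % 2 == 0}

# {n ∈ ℤ | n is even, 0 < n ≤ 10}, with range(-20, 21) standing in for ℤ
second = {n for n in range(-20, 21) if n % 2 == 0 and 0 < n <= 10}

# {2k | k = 1, 2, 3, 4, 5}
third = {2 * k for k in (1, 2, 3, 4, 5)}

print(first == second == third == {2, 4, 6, 8, 10})   # True
```

In each comprehension, the part after “for” plays the role of the text to the right of the “such that” bar.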
If A is a finite set, we define the cardinality of A, written |A|, to be the number of elements of A. For
example, |{a, b}| = 2. In Lesson 10, we will extend the notion of cardinality to also include infinite sets.
Example 2.6: Let A = {anteater, egg, trapezoid}, B = {2, 3, 3}, and C = {17, 18, 19, … , 5325, 5326}.
Then |A| = 3, |B| = 2, and |C| = 5310.
Notes: (1) The set A has the three elements “anteater,” “egg,” and “trapezoid.”
(2) The set B has just two elements: 2 and 3. Remember that {2, 3, 3} = {2, 3}.
(3) The number of consecutive integers from m to n, inclusive, is n − m + 1. For set C, we have
m = 17 and n = 5326. Therefore, |C| = 5326 − 17 + 1 = 5310.
(4) I call the formula “n − m + 1” the fence-post formula. If you construct a 3-foot fence by placing a
fence-post every foot, then the fence will consist of 4 fence-posts (3 − 0 + 1 = 4).
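The counts in Example 2.6 are easy to confirm by machine. In this Python sketch (our own illustration), count_consecutive is a hypothetical helper that implements the fence-post formula, and len plays the role of cardinality for finite sets.

```python
def count_consecutive(m: int, n: int) -> int:
    """Number of integers from m to n, inclusive (assumes m <= n)."""
    return n - m + 1

# range excludes its upper bound, so we stop at 5327 to include 5326.
C = set(range(17, 5327))

print(len(C))                        # 5310
print(count_consecutive(17, 5326))   # 5310
print(len({2, 3, 3}))                # 2
print(count_consecutive(0, 3))       # 4 fence-posts for a 3-foot fence
```

Forgetting the “+ 1” is a common off-by-one slip, which is why it helps to compare the formula against a direct count.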
The empty set is the unique set with no elements. We use the symbol ∅ to denote the empty set (some
authors use the symbol { } instead).
Subsets
For two sets A and B, we say that A is a subset of B, written A ⊆ B, if every element of A is an element
of B. That is, A ⊆ B if, for every x, x ∈ A implies x ∈ B. Symbolically, we can write ∀x(x ∈ A → x ∈ B).
Notes: (1) The symbol ∀ is called a universal quantifier, and it is pronounced “For all.”
(2) The logical expression ∀x(x ∈ A → x ∈ B) can be translated into English as “For all x, if x is an
element of A, then x is an element of B.”
(3) To show that a set A is a subset of a set B, we need to show that the expression ∀x(x ∈ A → x ∈ B)
is true. If the set A is finite and the elements are listed, we can just check that each element of A is also
an element of B. However, if the set A is described by a property, say A = {x | P(x)}, we may need to
craft an argument more carefully. We can begin by taking an arbitrary but specific element a from A
and then arguing that this element a is in B.
What could we possibly mean by an arbitrary but specific element? Aren’t the words “arbitrary” and
“specific” antonyms? Well, by arbitrary, we mean that we don’t know which element we are choosing;
it’s just some element a that satisfies the property P. So, we are just assuming that P(a) is true.
However, once we choose this element a, we use this same a for the rest of the argument, and that is
what we mean by it being specific.
(4) To the right we see a physical representation of A ⊆ B. This figure is called a Venn diagram.
These types of diagrams are very useful to help visualize relationships among sets. Notice that set A
lies completely inside set B. We assume that all the elements of A and B lie in some universal set U.
As an example, let’s let U be the set of all species of animals. If we let A be the set of species of cats
and we let B be the set of species of mammals, then we have A ⊆ B ⊆ U, and we see that the Venn
diagram to the right gives a visual representation of this situation. (Note that every cat is a mammal
and every mammal is an animal.)
[Figure: Venn diagram of A ⊆ B]
Let’s try to prove our first theorem using the definition of a subset together with Note 3 above about
arbitrary but specific elements.
Theorem 2.1: Every set A is a subset of itself.
Before writing the proof, let’s think about our strategy. We want to prove A ⊆ A. In other words, we
want to show ∀x(x ∈ A → x ∈ A). So, we will take an arbitrary but specific a ∈ A and then argue that
a ∈ A. But that’s pretty obvious, isn’t it? In this case, the property describing the set is precisely the
conclusion we are looking for. Here are the details.
Proof of Theorem 2.1: Let A be a set and let a ∈ A. Then a ∈ A. So, a ∈ A → a ∈ A is true. Since a was
an arbitrary element of A, ∀x(x ∈ A → x ∈ A) is true. Therefore, A ⊆ A.
□
Notes: (1) The proof begins with the opening statement “Let A be a set and let a ∈ A.” In general, the
opening statement states what is given in the problem and/or fixes any arbitrary but specific objects
that we will need.
(2) The proof ends with the closing statement “Therefore, A ⊆ A.” In general, the closing statement
states the result.
(3) Everything between the opening statement and the closing statement is known as the argument.
(4) We place the symbol □ at the end of the proof to indicate that the proof is complete.
(5) Consider the logical statement p → p. This statement is always true (T → T ≡ T and F → F ≡ T).
p → p is an example of a tautology. A tautology is a statement that is true for every possible truth
assignment of the propositional variables (see Problems 9 and 10 from Lesson 1 for more examples).
(6) If we let p represent the statement a ∈ A, by Note 5, we see that a ∈ A → a ∈ A is always true.
Alternate proof of Theorem 2.1: Let A be a set and let a ∈ A. Since p → p is a tautology, we have that
a ∈ A → a ∈ A is true. Since a was arbitrary, ∀x(x ∈ A → x ∈ A) is true. Therefore, A ⊆ A.
□
Let’s prove another basic but important theorem.
Theorem 2.2: The empty set is a subset of every set.
Analysis: This time we want to prove ∅ ⊆ A. In other words, we want to show ∀x(x ∈ ∅ → x ∈ A).
Since x ∈ ∅ is always false (the empty set has no elements), x ∈ ∅ → x ∈ A is always true.
In general, if p is a false statement, then we say that p → q is vacuously true.
Proof of Theorem 2.2: Let A be a set. The statement x ∈ ∅ → x ∈ A is vacuously true for any x, and
so, ∀x(x ∈ ∅ → x ∈ A) is true. Therefore, ∅ ⊆ A.
□
Note: The opening statement is “Let A be a set,” the closing statement is “Therefore, ∅ ⊆ A,” and the
argument is everything in between.
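For finite sets, the definition of a subset translates directly into a check over elements. The sketch below is an illustration in Python, not part of the original text; the function name `is_subset` is hypothetical. It tests the statements of Theorems 2.1 and 2.2 on one example, which is evidence rather than a proof:

```python
def is_subset(A, B):
    """Return True when every element of A is an element of B."""
    return all(x in B for x in A)

A = {1, 2, 3}
empty = set()

print(is_subset(A, A))       # Theorem 2.1 on one example
print(is_subset(empty, A))   # Theorem 2.2: there is no x to check
print(is_subset(A, empty))
```

Python's `all` returns `True` when it is given no items at all, which matches the idea of a vacuously true statement: with no element x in ∅, no instance of the implication can fail.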
Example 2.7: Let ๐ถ = {๐, ๐, ๐}, ๐ท = {๐, ๐}, ๐ธ = {๐, ๐}, ๐น = {๐, ๐}, and ๐บ = ∅. Then ๐ท ⊆ ๐ถ and ๐ธ ⊆ ๐ถ.
Also, since the empty set is a subset of every set, we have ๐บ ⊆ ๐ถ, ๐บ ⊆ ๐ท, ๐บ ⊆ ๐ธ, ๐บ ⊆ ๐น, and ๐บ ⊆ ๐บ.
Every set is a subset of itself, and so, ๐ถ ⊆ ๐ถ, ๐ท ⊆ ๐ท, ๐ธ ⊆ ๐ธ, and ๐น ⊆ ๐น.
Note: Below are possible Venn diagrams for this problem. The diagram on the left shows the
relationship between the sets ๐ถ, ๐ท, ๐ธ, and ๐น. Notice how ๐ท and ๐ธ are both subsets of ๐ถ, whereas ๐น is
not a subset of ๐ถ. Also, notice how ๐ท and ๐ธ overlap, ๐ธ and ๐น overlap, but there is no overlap between
๐ท and ๐น (they have no elements in common). The diagram on the right shows the proper placement of
the elements. Here, I chose the universal set to be ๐ = {๐, ๐, ๐, ๐, ๐, ๐, ๐}. This choice for the universal
set is somewhat arbitrary. Any set containing {๐, ๐, ๐, ๐} would do.
Example 2.8: The set A = {a, b} has 2 elements and 4 subsets. The subsets of A are ∅, {a}, {b}, and
{a, b}.
The set B = {a, b, c} has 3 elements and 8 subsets. The subsets of B are ∅, {a}, {b}, {c}, {a, b}, {a, c},
{b, c}, and {a, b, c}.
Let’s draw a tree diagram for the subsets of each of the sets A and B.
Left tree (subsets of A), one level per line:
{a, b}
{a}   {b}
∅
Right tree (subsets of B), one level per line:
{a, b, c}
{a, b}   {a, c}   {b, c}
{a}   {b}   {c}
∅
The tree diagram on the left is for the subsets of the set A = {a, b}. We start by writing the set
A = {a, b} at the top. On the next line we write the subsets of cardinality 1 ({a} and {b}). On the line
below that we write the subsets of cardinality 0 (just ∅). We draw a line segment between any two sets
when the smaller (lower) set is a subset of the larger (higher) set. So, we see that ∅ ⊆ {a}, ∅ ⊆ {b},
{a} ⊆ {a, b}, and {b} ⊆ {a, b}. There is actually one more subset relationship, namely ∅ ⊆ {a, b} (and
of course each set displayed is a subset of itself). We didn’t draw a line segment from ∅ to {a, b} to
avoid unnecessary clutter. Instead, we can simply trace the path from ∅ to {a} to {a, b} (or from ∅ to
{b} to {a, b}). We are using a property called transitivity here (see Theorem 2.3 below).
The tree diagram on the right is for the subsets of B = {a, b, c}. Observe that from top to bottom we
write the subsets of B of size 3, then 2, then 1, and then 0. We then draw the appropriate line
segments, just as we did for A = {a, b}.
How many subsets does a set of cardinality n have? Let’s start by looking at some examples.
Example 2.9: A set with 0 elements must be ∅, and this set has exactly 1 subset (the only subset of the
empty set is the empty set itself).
A set with 1 element has 2 subsets, namely ∅ and the set itself.
In the last example, we saw that a set with 2 elements has 4 subsets, and we also saw that a set with
3 elements has 8 subsets.
Do you see the pattern yet? 1 = 2⁰, 2 = 2¹, 4 = 2², 8 = 2³. So, we see that a set with 0 elements has
2⁰ subsets, a set with 1 element has 2¹ subsets, a set with 2 elements has 2² subsets, and a set with 3
elements has 2³ subsets. A reasonable guess would be that a set with n elements has 2ⁿ subsets. You
will be asked to prove this result later (Problem 12 in Lesson 4). We can also say that if |A| = n, then
|𝒫(A)| = 2ⁿ, where 𝒫(A) (pronounced the power set of A) is the set of all subsets of A. In set-builder
notation, we write 𝒫(A) = {B | B ⊆ A}.
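The pattern above can be explored for small sets by listing every subset. The following Python sketch is illustrative only (the function name `power_set` is an assumption, and subsets are stored as `frozenset` objects because Python sets can only contain hashable values). It supports the guess for a few values of n but does not prove it:

```python
from itertools import combinations

def power_set(A):
    """Return the set of all subsets of the finite set A, as frozensets."""
    elements = list(A)
    return {
        frozenset(chosen)
        for size in range(len(elements) + 1)
        for chosen in combinations(elements, size)
    }

for n in range(5):
    A = set(range(n))   # a set with n elements
    print(n, len(power_set(A)), 2 ** n)
```

Grouping the subsets by `size` corresponds to the levels of the tree diagrams in Example 2.8.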
Let’s get back to the transitivity mentioned above in our discussion of tree diagrams.
Theorem 2.3: Let A, B, and C be sets such that A ⊆ B and B ⊆ C. Then A ⊆ C.
Proof: Suppose that A, B, and C are sets with A ⊆ B and B ⊆ C, and let a ∈ A. Since A ⊆ B and a ∈ A,
it follows that a ∈ B. Since B ⊆ C and a ∈ B, it follows that a ∈ C. Since a was an arbitrary element
of A, we have shown that every element of A is an element of C. That is, ∀x(x ∈ A → x ∈ C) is true.
Therefore, A ⊆ C.
□
Note: To the right we have a Venn diagram illustrating Theorem 2.3.
[Figure: Venn diagram of A ⊆ B ⊆ C]
Theorem 2.3 tells us that the relation ⊆ is transitive. Since ⊆ is transitive, we can write things like
A ⊆ B ⊆ C ⊆ D, and without explicitly saying it, we know that A ⊆ C, A ⊆ D, and B ⊆ D.
Example 2.10: The membership relation ∈ is an example of a relation that is not transitive. For
example, let A = {0}, B = {0, 1, {0}}, and C = {x, y, {0, 1, {0}}}. Observe that A ∈ B and B ∈ C, but
A ∉ C.
{0} ∈ {0, 1, {0}} ∈ {x, y, {0, 1, {0}}}
Notes: (1) The set A has just 1 element, namely 0.
(2) The set B has 3 elements, namely 0, 1, and {0}. But wait! A = {0}. So, A ∈ B. The set A is circled
twice in the above image.
(3) The set C also has 3 elements, namely, x, y, and {0, 1, {0}}. But wait! B = {0, 1, {0}}. So, B ∈ C. The
set B has a rectangle around it twice in the above image.
(4) Since A ≠ x, A ≠ y, and A ≠ {0, 1, {0}}, we see that A ∉ C.
(5) Is it clear that {0} ∉ C? {0} is in a set that’s in C (namely, B), but {0} is not itself in C.
(6) Here is a more basic example showing that ∈ is not transitive: ∅ ∈ {∅} ∈ {{∅}}, but ∅ ∉ {{∅}}.
The only element of {{∅}} is {∅}.
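The failure of transitivity for ∈ in Example 2.10 can be reproduced with nested sets in Python. This is a sketch under assumptions: nested sets must be `frozenset` objects, and the elements x and y are represented by the strings "x" and "y" purely for illustration.

```python
# Python sets can only contain hashable objects, so nested sets are frozensets.
A = frozenset({0})
B = frozenset({0, 1, A})
C = frozenset({"x", "y", B})   # "x" and "y" are stand-ins for the elements x and y

print(A in B, B in C, A in C)

# The more basic example from Note 6.
e = frozenset()          # the empty set
s1 = frozenset({e})      # the set whose only element is the empty set
s2 = frozenset({s1})     # the set whose only element is s1
print(e in s1, s1 in s2, e in s2)
```

In both cases the first two membership tests succeed and the third fails, since membership looks only one level deep.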
Unions and Intersections
The union of the sets A and B, written A ∪ B, is the set of elements that are in A or B (or both).
A ∪ B = {x | x ∈ A or x ∈ B}
The intersection of A and B, written A ∩ B, is the set of elements that are simultaneously in A and B.
A ∩ B = {x | x ∈ A and x ∈ B}
The following Venn diagrams for the union and intersection of two sets can be useful for visualizing
these operations.
[Figures: Venn diagrams of A ∪ B and A ∩ B]
Example 2.11:
1. Let A = {0, 1, 2, 3, 4} and B = {3, 4, 5, 6}. Then A ∪ B = {0, 1, 2, 3, 4, 5, 6} and A ∩ B = {3, 4}.
See the figure below for a visual representation of A, B, A ∪ B and A ∩ B.
2. Recall that the set of natural numbers is ℕ = {0, 1, 2, 3, … } and the set of integers is
ℤ = {… , −4, −3, −2, −1, 0, 1, 2, 3, 4, … }. Observe that in this case, we have ℕ ⊆ ℤ. Also,
ℕ ∪ ℤ = ℤ and ℕ ∩ ℤ = ℕ.
In fact, whenever A and B are sets and B ⊆ A, then A ∪ B = A and A ∩ B = B. We will prove
the first of these two facts in Theorem 2.5. You will be asked to prove the second of these facts
in Problem 13 below.
3. Let 𝔼 = {0, 2, 4, 6, … } be the set of even natural numbers and let 𝕆 = {1, 3, 5, 7, … } be the set
of odd natural numbers. Then 𝔼 ∪ 𝕆 = {0, 1, 2, 3, 4, 5, 6, 7, … } = ℕ and 𝔼 ∩ 𝕆 = ∅. In general,
we say that sets A and B are disjoint or mutually exclusive if A ∩ B = ∅. Below is a Venn
diagram for disjoint sets.
[Figure: Venn diagram of disjoint sets, A ∩ B = ∅]
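Parts 1 and 3 of Example 2.11 can be reproduced with Python's built-in set operators. This is a minimal sketch; the names `evens` and `odds` are illustrative, and cutting the natural numbers off below 20 is an assumption made so that the sets are finite:

```python
A = {0, 1, 2, 3, 4}
B = {3, 4, 5, 6}
print(A | B)   # union
print(A & B)   # intersection

# Finite stand-ins for the even and odd natural numbers (assumption: cut off below 20).
evens = {n for n in range(20) if n % 2 == 0}
odds = {n for n in range(20) if n % 2 == 1}
print((evens | odds) == set(range(20)))
print(evens.isdisjoint(odds))
```

The operator `|` corresponds to the word “or” in the definition of a union, and `&` corresponds to “and” in the definition of an intersection.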
Let’s prove some theorems involving unions of sets. You will be asked to prove the analogous results
for intersections of sets in Problems 11 and 13 below.
Theorem 2.4: If A and B are sets, then A ⊆ A ∪ B.
Before going through the proof, look once more at the Venn diagram above for A ∪ B and convince
yourself that this theorem should be true.
Proof of Theorem 2.4: Suppose that A and B are sets and let x ∈ A. Then x ∈ A or x ∈ B. Therefore,
x ∈ A ∪ B. Since x was an arbitrary element of A, we have shown that every element of A is an element
of A ∪ B. That is, ∀x(x ∈ A → x ∈ A ∪ B) is true. Therefore, A ⊆ A ∪ B.
□
Note: Recall from Lesson 1 that if p is a true statement, then p ∨ q (p or q) is true no matter what the
truth value of q is. In the second sentence of the proof above, we are using this fact with p being the
statement x ∈ A and q being the statement x ∈ B.
We will use this same reasoning in the second paragraph of the next proof as well.
Theorem 2.5: B ⊆ A if and only if A ∪ B = A.
Before going through the proof, it’s a good idea to draw a Venn diagram for B ⊆ A and convince
yourself that this theorem should be true.
Technical note: Let X and Y be sets. The Axiom of Extensionality says that X and Y are the same set if
and only if X and Y have precisely the same elements. In symbols, we have
X = Y if and only if ∀x(x ∈ X ↔ x ∈ Y).
It is easy to verify that p ↔ q is logically equivalent to (p → q) ∧ (q → p). To see this, we check that
all possible truth assignments for p and q lead to the same truth value for the two statements. For
example, if p and q are both true, then
p ↔ q ≡ T ↔ T ≡ T
and
(p → q) ∧ (q → p) ≡ (T → T) ∧ (T → T) ≡ T ∧ T ≡ T.
The reader should check the other three truth assignments for p and q, or draw the entire truth table
for both statements.
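The four truth assignments can also be enumerated by a short program. The sketch below is an illustration in Python; the helper names `implies` and `iff` are hypothetical, and `implies` encodes the truth table of → from Lesson 1 (false only when the hypothesis is true and the conclusion is false):

```python
from itertools import product

def implies(p, q):
    """Truth value of p -> q."""
    return (not p) or q

def iff(p, q):
    """Truth value of p <-> q."""
    return p == q

# One row per truth assignment: p, q, p <-> q, (p -> q) and (q -> p).
for p, q in product([True, False], repeat=2):
    left = iff(p, q)
    right = implies(p, q) and implies(q, p)
    print(p, q, left, right)
```

The last two columns agree in every row, which is exactly what logical equivalence of the two statements means.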
Letting p be the statement x ∈ X, letting q be the statement x ∈ Y, and replacing p ↔ q by the logically
equivalent statement (p → q) ∧ (q → p) gives us
X = Y if and only if ∀x((x ∈ X → x ∈ Y) ∧ (x ∈ Y → x ∈ X)).
It is also true that ∀x(p(x) ∧ q(x)) is logically equivalent to ∀x(p(x)) ∧ ∀x(q(x)). And so, we have
X = Y if and only if ∀x(x ∈ X → x ∈ Y) and ∀x(x ∈ Y → x ∈ X).
In other words, to show that X = Y, we can instead show that X ⊆ Y and Y ⊆ X.
Proof of Theorem 2.5: Suppose that B ⊆ A and let x ∈ A ∪ B. Then x ∈ A or x ∈ B. If x ∈ A, then
x ∈ A (trivially). If x ∈ B, then since B ⊆ A, it follows that x ∈ A. Since x was an arbitrary element of
A ∪ B, we have shown that every element of A ∪ B is an element of A. That is, ∀x(x ∈ A ∪ B → x ∈ A)
is true. Therefore, A ∪ B ⊆ A. By Theorem 2.4, A ⊆ A ∪ B. Since A ∪ B ⊆ A and A ⊆ A ∪ B, it follows
that A ∪ B = A.
Now, suppose that A ∪ B = A and let x ∈ B. Since x ∈ B, it follows that x ∈ A or x ∈ B. Therefore,
x ∈ A ∪ B. Since A ∪ B = A, we have x ∈ A. Since x was an arbitrary element of B, we have shown
that every element of B is an element of A. That is, ∀x(x ∈ B → x ∈ A). Therefore, B ⊆ A.
□
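As a sanity check on Theorems 2.4 and 2.5, both statements can be tested on every pair of subsets of a small set. The Python sketch below is illustrative only: the generator `subsets` and the four-element set `U` are assumptions, and an exhaustive check over one small universal set does not replace the proofs above.

```python
from itertools import combinations

def subsets(universe):
    """Yield every subset of a finite collection as a frozenset."""
    items = list(universe)
    for size in range(len(items) + 1):
        for chosen in combinations(items, size):
            yield frozenset(chosen)

U = {0, 1, 2, 3}                 # hypothetical small universal set
all_subsets = list(subsets(U))   # 16 subsets, so 256 pairs (A, B)

# Theorem 2.4: A is a subset of A union B.
theorem_2_4 = all(A <= (A | B) for A in all_subsets for B in all_subsets)

# Theorem 2.5: B is a subset of A if and only if A union B equals A.
theorem_2_5 = all((B <= A) == ((A | B) == A) for A in all_subsets for B in all_subsets)

print(theorem_2_4, theorem_2_5)
```

Comparing `B <= A` with `(A | B) == A` as Boolean values checks both directions of the “if and only if” at once.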
Problem Set 2
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Determine whether each of the following statements is true or false:
(i)
2 ∈ {2}
(ii)
5∈∅
(iii)
∅ ∈ {1, 2}
(iv)
๐ ∈ {๐, {๐}}
(v)
∅ ⊆ {1, 2}
(vi)
{Δ} ⊆ {๐ฟ, Δ}
(vii)
{๐, ๐, ๐} ⊆ {๐, ๐, ๐}
(viii)
{1, ๐, {2, ๐}} ⊆ {1, ๐, 2, ๐}
2. Determine the cardinality of each of the following sets:
(i)
{๐, ๐, ๐, ๐, ๐, ๐}
(ii)
{1, 2, 3, 2, 1}
(iii)
{1, 2, … , 53}
(iv)
{5, 6, 7, … , 2076, 2077}
3. Let ๐ด = {๐, ๐, Δ, ๐ฟ} and ๐ต = {๐, ๐, ๐ฟ, ๐พ}. Determine each of the following:
(i)
๐ด∪๐ต
(ii)
๐ด∩๐ต
LEVEL 2
4. Determine whether each of the following statements is true or false:
(i) ∅ ∈ ∅
(ii) ∅ ∈ {∅}
(iii) {∅} ∈ ∅
(iv) {∅} ∈ {∅}
(v) ∅ ⊆ ∅
(vi) ∅ ⊆ {∅}
(vii) {∅} ⊆ ∅
(viii) {∅} ⊆ {∅}
5. Determine the cardinality of each of the following sets:
(i) {∅, {1, 2, 3}}
(ii) {{{∅, {∅}}}}
(iii) {{1, 2}, ∅, {∅}, {∅, {∅, 1, 2}}}
(iv) {∅, {∅}, {{∅}}, {∅, {∅}, {{∅}}}}
6. Let ๐ = {∅, {∅}} and ๐ = {{∅}, {∅, {∅}}}. Determine each of the following:
(i)
๐∪๐
(ii)
๐∩๐
LEVEL 3
7. How many subsets does {๐, ๐, ๐, ๐} have? Draw a tree diagram for the subsets of {๐, ๐, ๐, ๐}.
8. A set A is transitive if ∀x(x ∈ A → x ⊆ A) (in words, every element of A is also a subset of A).
Determine if each of the following sets is transitive:
(i) ∅
(ii) {∅}
(iii) {{∅}}
(iv) {∅, {∅}, {{∅}}}
LEVEL 4
9. A relation R is reflexive if ∀x(xRx) and symmetric if ∀x∀y(xRy → yRx). Show that ⊆ is
reflexive, but ∈ is not. Then decide if each of ⊆ and ∈ is symmetric.
10. Let A, B, C, D, and E be sets such that A ⊆ B, B ⊆ C, C ⊆ D, and D ⊆ E. Prove that A ⊆ E.
11. Let A and B be sets. Prove that A ∩ B ⊆ A.
LEVEL 5
12. Let P(x) be the property x ∉ x. Prove that {x | P(x)} cannot be a set.
13. Prove that B ⊆ A if and only if A ∩ B = B.
14. Let ๐ด = {๐, ๐, ๐, ๐}, ๐ต = {๐ | ๐ ⊆ ๐ด ∧ ๐ ∉ ๐}, and ๐ถ = {๐ | ๐ ⊆ ๐ด ∧ ๐ ∈ ๐}. Show that there is
a natural one-to-one correspondence between the elements of ๐ต and the elements of ๐ถ. Then
generalize this result to a set with ๐ + 1 elements for ๐ > 0.
LESSON 3 – ABSTRACT ALGEBRA
SEMIGROUPS, MONOIDS, AND GROUPS
Binary Operations and Closure
A binary operation on a set is a rule that combines two elements of the set to produce another element
of the set.
Example 3.1: Let S = {0, 1}. Multiplication on S is a binary operation, whereas addition on S is not a
binary operation (here we are thinking of multiplication and addition in the “usual” sense, meaning the
way we would think of them in elementary school or middle school).
To see that multiplication is a binary operation on S, observe that 0 ⋅ 0 = 0, 0 ⋅ 1 = 0, 1 ⋅ 0 = 0, and
1 ⋅ 1 = 1. Each of the four computations produces 0 or 1, both of which are in the set S.
To see that addition is not a binary operation on S, just note that 1 + 1 = 2, and 2 ∉ S.
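For a finite set, the computations in Example 3.1 amount to checking every pair of elements. A minimal Python sketch follows; the function name `is_binary_operation` is hypothetical and introduced only for this illustration:

```python
from itertools import product
import operator

def is_binary_operation(op, S):
    """Return True when op(a, b) lands in S for every pair (a, b) from S."""
    return all(op(a, b) in S for a, b in product(S, repeat=2))

S = {0, 1}
print(is_binary_operation(operator.mul, S))   # all four products are 0 or 1
print(is_binary_operation(operator.add, S))   # fails because 1 + 1 = 2
```

Here `product(S, repeat=2)` lists the four ordered pairs (0, 0), (0, 1), (1, 0), and (1, 1) that the example computes by hand.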
Let’s get a bit more technical and write down the formal definition of a binary operation. The
terminology and notation used in this definition will be clarified in the notes below and formalized
more rigorously later in Lesson 10.
Formally, a binary operation ⋆ on a set S is a function ⋆ : S × S → S. So, if a, b ∈ S, then we have
⋆(a, b) ∈ S. For easier readability, we will usually write ⋆(a, b) as a ⋆ b.
Notes: (1) If A and B are sets, then A × B is called the Cartesian product of A and B. It consists of the
ordered pairs (a, b), where a ∈ A and b ∈ B. A function f: A × B → C takes each such pair (a, b) to
an element f(a, b) ∈ C.
As an example, let A = {dog, fish}, B = {cat, snake}, C = {0, 2, 4, 6, 8}, and define f: A × B → C by
f(a, b) = the total number of legs that animals a and b have. Then we have f(dog, cat) = 8,
f(dog, snake) = 4, f(fish, cat) = 4, f(fish, snake) = 0.
We will look at ordered pairs, Cartesian products, and functions in more detail in Lesson 10.
(2) For a binary operation, all three sets A, B, and C in the expression f: A × B → C are the same.
As we saw in Example 3.1 above, if we let S = {0, 1}, and we let ⋆ be multiplication, then ⋆ is a binary
operation on S. Using function notation, we have ⋆(0, 0) = 0, ⋆(0, 1) = 0, ⋆(1, 0) = 0, and
⋆(1, 1) = 1.
As stated in the formal definition of a binary operation above, we will usually write the computations
as 0 ⋆ 0 = 0, 0 ⋆ 1 = 0, 1 ⋆ 0 = 0, and 1 ⋆ 1 = 1.
We can use symbols other than ⋆ for binary operations. For example, if the operation is multiplication,
we would usually use a dot (⋅) for the operation as we did in Example 3.1 above. Similarly, for addition
we would usually use +, for subtraction we would usually use −, and so on.
Recall: ℕ = {0, 1, 2, 3, … } is the set of natural numbers and ℤ = {… , −4, −3, −2, −1, 0, 1, 2, 3, 4, … } is
the set of integers.
If A is a set of numbers, we let A⁺ be the subset of A consisting of just the positive numbers from A.
For example, ℤ⁺ = {1, 2, 3, 4, … }, and in fact, ℕ⁺ = ℤ⁺.
Example 3.2:
1. The operation of addition on the set of natural numbers is a binary operation because whenever
we add two natural numbers we get another natural number. Here, the set S is ℕ and the
operation ⋆ is +. Observe that if a ∈ ℕ and b ∈ ℕ, then a + b ∈ ℕ. For example, if a = 1 and
b = 2 (both elements of ℕ), then a + b = 1 + 2 = 3, and 3 ∈ ℕ.
2. The operation of multiplication on the set of positive integers is a binary operation because
whenever we multiply two positive integers we get another positive integer. Here, the set S is
ℤ⁺ and the operation ⋆ is ⋅. Observe that if a ∈ ℤ⁺ and b ∈ ℤ⁺, then a ⋅ b ∈ ℤ⁺. For example, if
a = 3 and b = 5 (both elements of ℤ⁺), then a ⋅ b = 3 ⋅ 5 = 15, and 15 ∈ ℤ⁺.
3. Let S = ℤ and define ⋆ by a ⋆ b = min{a, b}, where min{a, b} is the smaller of a and b. Then ⋆
is a binary operation on ℤ. For example, if a = −5 and b = 3 (both elements of ℤ), then
a ⋆ b = −5, and −5 ∈ ℤ.
4. Subtraction on the set of natural numbers is not a binary operation. To see this, we just need
to provide a single counterexample. (A counterexample is an example that is used to prove that
a statement is false.) If we let a = 1 and b = 2 (both elements of ℕ), then we see that
a − b = 1 − 2 is not an element of ℕ.
5. Let S = {u, v, w} and define ⋆ using the following table:
⋆ | u  v  w
--+---------
u | v  w  w
v | w  u  u
w | u  v  v
The table given above is called a multiplication table. For a, b ∈ S, we evaluate a ⋆ b by taking
the entry in the row given by a and the column given by b. For example, v ⋆ w = u.
⋆ is a binary operation on S because the only possible “outputs” are u, v, and w.
Some authors refer to a binary operation ⋆ on a set S even when the binary operation is not defined
on all pairs of elements a, b ∈ S. We will always refer to these “false operations” as partial binary
operations.
We say that the set S is closed under the partial binary operation ⋆ if whenever a, b ∈ S, we have
a ⋆ b ∈ S.
In Example 3.2, part 4 above, we saw that subtraction is a partial binary operation on ℕ that is not a
binary operation. In other words, ℕ is not closed under subtraction.
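A failure of closure is established by a single counterexample, and a program can search for one. The Python sketch below is an illustration under assumptions: the names `closure_counterexample` and `is_natural` are hypothetical, and only a small initial segment of ℕ is searched. Finding no counterexample in the sample does not prove closure on the whole infinite set.

```python
from itertools import product

def closure_counterexample(op, sample, contains):
    """Return a pair (a, b) from sample with op(a, b) outside the set, or None."""
    for a, b in product(sample, repeat=2):
        if not contains(op(a, b)):
            return (a, b)
    return None

def is_natural(n):
    """Membership test for the natural numbers 0, 1, 2, ..."""
    return isinstance(n, int) and n >= 0

sample = range(5)   # assumption: a small initial segment of the natural numbers

print(closure_counterexample(lambda a, b: a - b, sample, is_natural))
print(closure_counterexample(lambda a, b: a + b, sample, is_natural))
```

For subtraction the search stops at the first pair whose difference is negative; for addition it returns `None`, which is consistent with (but weaker than) the fact that ℕ is closed under addition.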
Semigroups and Associativity
Let ⋆ be a binary operation on a set S. We say that ⋆ is associative in S if for all x, y, z in S, we have
(x ⋆ y) ⋆ z = x ⋆ (y ⋆ z)
A semigroup is a pair (S, ⋆), where S is a set and ⋆ is an associative binary operation on S.
Example 3.3:
1. (ℕ, +), (ℤ, +), (ℕ, ⋅), and (ℤ, ⋅) are all semigroups. In other words, the operations of addition
and multiplication are both associative in ℕ and ℤ.
2. Let S = ℤ and define ⋆ by a ⋆ b = min{a, b}, where min{a, b} is the smaller of a and b. Let’s
check that ⋆ is associative in ℤ. Let a, b, and c be elements of ℤ. There are actually 6 cases to
consider (see Note 1 below). Let’s go through one of these cases in detail. If we assume that
a ≤ b ≤ c, then we have
(a ⋆ b) ⋆ c = min{a, b} ⋆ c = a ⋆ c = min{a, c} = a.
a ⋆ (b ⋆ c) = a ⋆ min{b, c} = a ⋆ b = min{a, b} = a.
Since both (a ⋆ b) ⋆ c = a and a ⋆ (b ⋆ c) = a, we have (a ⋆ b) ⋆ c = a ⋆ (b ⋆ c). After
checking the other 5 cases, we can say the following: Since a, b, and c were arbitrary elements
from ℤ, we have shown that ⋆ is associative in ℤ. It follows that (ℤ, ⋆) is a semigroup.
3. Subtraction is not associative in ℤ. To see this, we just need to provide a single counterexample.
If we let a = 1, b = 2, and c = 3, then (a − b) − c = (1 − 2) − 3 = −1 − 3 = −4 and
a − (b − c) = 1 − (2 − 3) = 1 − (−1) = 1 + 1 = 2. Since −4 ≠ 2, subtraction is not
associative in ℤ. It follows that (ℤ, −) is not a semigroup.
Note that (ℕ, −) is also not a semigroup, but for a different reason. Subtraction is not even a
binary operation on ℕ (see part 4 in Example 3.2).
4. Let S = {u, v, w} and define ⋆ using the following table (this is the same table from part 5 in
Example 3.2):
⋆ | u  v  w
--+---------
u | v  w  w
v | w  u  u
w | u  v  v
Notice that (u ⋆ v) ⋆ w = w ⋆ w = v and u ⋆ (v ⋆ w) = u ⋆ u = v.
So, (u ⋆ v) ⋆ w = u ⋆ (v ⋆ w). However, this single computation does not show that ⋆ is
associative in S. In fact, we have the following counterexample: (u ⋆ w) ⋆ v = w ⋆ v = v and
u ⋆ (w ⋆ v) = u ⋆ v = w. Thus, (u ⋆ w) ⋆ v ≠ u ⋆ (w ⋆ v).
So, ⋆ is not associative in S, and therefore, (S, ⋆) is not a semigroup.
5. Let 2ℤ = {… , −6, −4, −2, 0, 2, 4, 6, … } be the set of even integers. When we multiply two even
integers together, we get another even integer (we will prove this in Lesson 4). It follows that
multiplication is a binary operation on 2ℤ. Since multiplication is associative in ℤ and 2ℤ ⊆ ℤ,
it follows that multiplication is associative in 2ℤ (see Note 2 below). So, (2ℤ, ⋅) is a semigroup.
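For a finite operation given by a table, associativity can be tested by brute force over all triples. The following Python sketch is illustrative: the dictionary `TABLE` is a transcription of the table from part 4 (read as row ⋆ column), and the names `star` and `associativity_counterexamples` are hypothetical.

```python
from itertools import product

S = ["u", "v", "w"]

# TABLE[a][b] is the entry in row a and column b, that is, a * b in the table.
TABLE = {
    "u": {"u": "v", "v": "w", "w": "w"},
    "v": {"u": "w", "v": "u", "w": "u"},
    "w": {"u": "u", "v": "v", "w": "v"},
}

def star(a, b):
    """Evaluate the table operation on the pair (a, b)."""
    return TABLE[a][b]

def associativity_counterexamples(op, elements):
    """List every triple (x, y, z) where (x op y) op z differs from x op (y op z)."""
    return [
        (x, y, z)
        for x, y, z in product(elements, repeat=3)
        if op(op(x, y), z) != op(x, op(y, z))
    ]

failures = associativity_counterexamples(star, S)
print(("u", "v", "w") in failures)   # the triple that happened to work
print(("u", "w", "v") in failures)   # the counterexample from the text
```

A single triple in `failures` is enough to show that the operation is not associative, whereas associativity itself requires all 27 triples to pass.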
Notes: (1) In part 2 above, we must prove the result for each of the following 6 cases:
a ≤ b ≤ c
a ≤ c ≤ b
b ≤ a ≤ c
b ≤ c ≤ a
c ≤ a ≤ b
c ≤ b ≤ a
The same basic argument can be used for all these cases. For example, we saw in the solution above
that for the first case we get
(a ⋆ b) ⋆ c = min{a, b} ⋆ c = a ⋆ c = min{a, c} = a.
a ⋆ (b ⋆ c) = a ⋆ min{b, c} = a ⋆ b = min{a, b} = a.
Let’s also do the last case c ≤ b ≤ a:
(a ⋆ b) ⋆ c = min{a, b} ⋆ c = b ⋆ c = min{b, c} = c.
a ⋆ (b ⋆ c) = a ⋆ min{b, c} = a ⋆ c = min{a, c} = c.
The reader should verify the other 4 cases to complete the proof.
(2) Associativity is closed downwards. By this, we mean that if ⋆ is associative in a set A, and B ⊆ A
(B is a subset of A), then ⋆ is associative in B.
The reason for this is that the definition of associativity involves only a universal statement, that is, a
statement that describes a property that is true for all elements without mentioning the existence of
any new elements. A universal statement begins with the quantifier ∀ (“For all” or “Every”) and never
includes the quantifier ∃ (“There exists” or “There is”).
As a simple example, if every object in set A is a fruit, and B ⊆ A, then every object in B is a fruit. The
universal statement we are referring to might be ∀x(P(x)), where P(x) is the property “x is a fruit.”
In the case of associativity, the universal statement is ∀x∀y∀z((x ⋆ y) ⋆ z = x ⋆ (y ⋆ z)).
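The case analysis in Note 1 and the “closed downwards” remark in Note 2 can both be illustrated on a finite sample. The Python sketch below is an assumption-laden illustration (the names `star`, `is_associative_on`, `window`, and `evens` are hypothetical, and a seven-element window of integers stands in for ℤ), so it offers supporting evidence rather than a proof:

```python
from itertools import product

def star(a, b):
    """The operation a * b = min{a, b} from part 2 of Example 3.3."""
    return min(a, b)

def is_associative_on(op, elements):
    """Check (x op y) op z == x op (y op z) for every triple drawn from elements."""
    return all(
        op(op(x, y), z) == op(x, op(y, z))
        for x, y, z in product(elements, repeat=3)
    )

window = list(range(-3, 4))                 # assumption: a small sample of integers
evens = [n for n in window if n % 2 == 0]   # a subset of the sample

print(is_associative_on(star, window))
print(is_associative_on(star, evens))       # every triple from evens is also a triple from window
```

The second check can never fail once the first one passes, because it only re-examines some of the same triples; that is the content of Note 2.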
Let ⋆ be a binary operation on a set S. We say that ⋆ is commutative (or Abelian) in S if for all x, y in
S, we have x ⋆ y = y ⋆ x.
Example 3.4:
1. (ℕ, +), (ℤ, +), (ℕ, ⋅), and (ℤ, ⋅) are all commutative semigroups. In other words, the
operations of addition and multiplication are both commutative in ℕ and ℤ (in addition to being
associative).
2. The semigroup (ℤ, ⋆), where ⋆ is defined by a ⋆ b = min{a, b} is a commutative semigroup.
Let’s check that ⋆ is commutative in ℤ. Let a and b be elements of ℤ. This time there are just 2
cases to consider (a ≤ b and b ≤ a). Let’s do the first case in detail, and assume that a ≤ b.
We then have a ⋆ b = min{a, b} = a and b ⋆ a = min{b, a} = a. So, a ⋆ b = b ⋆ a. After
verifying the other case (which you should do), we can say that ⋆ is commutative in ℤ.
3. Define the binary operation ⋆ on ℕ by a ⋆ b = a. Then (ℕ, ⋆) is a semigroup that is not
commutative. For associativity, we have (a ⋆ b) ⋆ c = a ⋆ c = a and a ⋆ (b ⋆ c) = a ⋆ b = a.
Let’s use a counterexample to show that ⋆ is not commutative. Well, 2 ⋆ 5 = 2 and 5 ⋆ 2 = 5.
Note: In part 3 above, the computation a ⋆ (b ⋆ c) can actually be done in 1 step instead of 2. The way
we did it above was to first compute b ⋆ c = b, and then to replace b ⋆ c with b to get
a ⋆ (b ⋆ c) = a ⋆ b = a. However, the definition of ⋆ says that a ⋆ (anything) = a. In this case, the
“anything” is b ⋆ c. So, we have a ⋆ (b ⋆ c) = a just by appealing to the definition of ⋆.
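Part 3 of Example 3.4 can be tried out directly. In the Python sketch below, the function name `left` and the six-element sample of natural numbers are assumptions made for illustration:

```python
from itertools import product

def left(a, b):
    """The operation a * b = a from part 3 of Example 3.4."""
    return a

sample = range(6)   # assumption: a finite sample of natural numbers

associative = all(
    left(left(a, b), c) == left(a, left(b, c))
    for a, b, c in product(sample, repeat=3)
)
commutative = all(left(a, b) == left(b, a) for a, b in product(sample, repeat=2))

print(associative, commutative)
print(left(2, 5), left(5, 2))   # the counterexample to commutativity
```

The operation passes the associativity check on the sample but fails commutativity, with the pair (2, 5) among the failures.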
Monoids and Identity
Let (S, ⋆) be a semigroup. An element e of S is called an identity with respect to the binary operation
⋆ if for all a ∈ S, we have e ⋆ a = a ⋆ e = a.
A monoid is a semigroup with an identity.
Example 3.5:
1. (ℕ, +) and (ℤ, +) are commutative monoids with identity 0 (when we add 0 to any integer a,
we get a). (ℕ, ⋅) and (ℤ, ⋅) are commutative monoids with identity 1 (when we multiply any
integer a by 1, we get a).
2. The commutative semigroup (ℤ, ⋆), where ⋆ is defined by a ⋆ b = min{a, b} is not a monoid.
To see this, let a ∈ ℤ. Then a + 1 ∈ ℤ and a ⋆ (a + 1) = a ≠ a + 1. This shows that a is not an
identity. Since a was an arbitrary element of ℤ, we showed that there is no identity. It follows
that (ℤ, ⋆) is not a monoid.
3. The noncommutative semigroup (ℕ, ⋆), where a ⋆ b = a is also not a monoid. Use the same
argument given in 2 above with ℤ replaced by ℕ.
4. (2ℤ, ⋅) is another example of a semigroup that is not a monoid. The identity element of (ℤ, ⋅)
is 1, and this element is missing from (2ℤ, ⋅).
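On a finite set, an identity can be found (or ruled out) by trying each element in turn. The Python sketch below is illustrative; the function name `find_identity` is hypothetical, and the finite sets used are assumptions chosen for the demonstration.

```python
def find_identity(op, elements):
    """Return an identity element for op on the finite collection, or None."""
    elements = list(elements)
    for e in elements:
        if all(op(e, a) == a and op(a, e) == a for a in elements):
            return e
    return None

print(find_identity(lambda a, b: a * b, [0, 1]))   # multiplication on {0, 1}
print(find_identity(lambda a, b: a, [0, 1, 2]))    # the operation a * b = a
print(find_identity(min, range(-5, 6)))            # min on a finite window of integers
```

The last line is a caution about sampling: on a finite window, min does have an identity, namely the largest element of the window. The argument in part 2 of Example 3.5 works in ℤ precisely because a + 1 is always available, so no integer is largest.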
Groups and Inverses
Let (M, ⋆) be a monoid with identity e. An element a of M is called invertible if there is an element
b ∈ M such that a ⋆ b = b ⋆ a = e.
A group is a monoid in which every element is invertible.
Groups appear so often in mathematics that it’s worth taking the time to explicitly spell out the full
definition of a group.
A group is a pair (G, ⋆) consisting of a set G together with a binary operation ⋆ satisfying:
(1) (Associativity) For all x, y, z ∈ G, (x ⋆ y) ⋆ z = x ⋆ (y ⋆ z).
(2) (Identity) There exists an element e ∈ G such that for all x ∈ G, e ⋆ x = x ⋆ e = x.
(3) (Inverse) For each x ∈ G, there is y ∈ G such that x ⋆ y = y ⋆ x = e.
Notes: (1) If y ∈ G is an inverse of x ∈ G, we will usually write y = x⁻¹.
(2) Recall that the definition of a binary operation already implies closure. However, many books on
groups will mention this property explicitly:
(Closure) For all x, y ∈ G, x ⋆ y ∈ G.
(3) A group is commutative or Abelian if for all x, y ∈ G, x ⋆ y = y ⋆ x.
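For a finite set, the group axioms listed above can be checked one after another. The Python sketch below is an illustration only: the function name `is_group` is hypothetical, and the two test sets, {0, 1} and {1, −1} under ordinary multiplication, are examples chosen here rather than taken from the text.

```python
from itertools import product

def is_group(elements, op):
    """Check closure, associativity, identity, and inverses on a finite set."""
    elements = list(elements)
    # Closure: op must be a binary operation on the set.
    if not all(op(a, b) in elements for a, b in product(elements, repeat=2)):
        return False
    # (1) Associativity.
    if not all(op(op(a, b), c) == op(a, op(b, c)) for a, b, c in product(elements, repeat=3)):
        return False
    # (2) Identity.
    identities = [e for e in elements if all(op(e, a) == a and op(a, e) == a for a in elements)]
    if not identities:
        return False
    e = identities[0]
    # (3) Inverse: every element needs some b with a op b = b op a = e.
    return all(any(op(a, b) == e and op(b, a) == e for b in elements) for a in elements)

def multiply(a, b):
    return a * b

print(is_group([0, 1], multiply))    # a monoid, but 0 has no inverse
print(is_group([1, -1], multiply))   # hypothetical two-element example
```

The order of the checks follows the chain semigroup, monoid, group: each stage adds one requirement to the previous one.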
Example 3.6:
1. (ℤ, +) is a commutative group with identity 0. The inverse of any integer a is the integer −a.
2. (ℕ, +) is a commutative monoid that is not a group. For example, the natural number 1 has no
inverse in ℕ. In other words, the equation x + 1 = 0 has no solution in ℕ.
3. (ℤ, ⋅) is a commutative monoid that is not a group. For example, the integer 2 has no inverse
in ℤ. In other words, the equation 2x = 1 has no solution in ℤ.
4. A rational number is a number of the form a/b, where a and b are integers and b ≠ 0.
We identify rational numbers a/b and c/d whenever ad = bc. For example, 1/2 and 3/6 represent the
same rational number because 1 ⋅ 6 = 6 and 2 ⋅ 3 = 6.
We denote the set of rational numbers by ℚ. So, we have ℚ = {a/b | a, b ∈ ℤ, b ≠ 0}. In words,
ℚ is “the set of quotients a over b such that a and b are integers and b is not zero.”
We identify the rational number a/1 with the integer a. In this way, we have ℤ ⊆ ℚ.
We add two rational numbers using the rule a/b + c/d = (a⋅d + b⋅c)/(b⋅d).
Note that 0 = 0/1 is an identity for (ℚ, +) because
a/b + 0/1 = (a⋅1 + b⋅0)/(b⋅1) = a/b and 0/1 + a/b = (0⋅b + 1⋅a)/(1⋅b) = a/b.
You will be asked to show in Problem 11 below that (ℚ, +) is a commutative group.
5. We multiply two rational numbers using the rule a/b ⋅ c/d = (a⋅c)/(b⋅d).
Note that 1 = 1/1 is an identity for (ℚ, ⋅) because
a/b ⋅ 1/1 = (a⋅1)/(b⋅1) = a/b and 1/1 ⋅ a/b = (1⋅a)/(1⋅b) = a/b.
Now, 0 ⋅ a/b = 0/1 ⋅ a/b = (0⋅a)/(1⋅b) = 0/b = 0. In particular, when we multiply 0 by any rational number, we
can never get 1. So, 0 is a rational number with no multiplicative inverse. It follows that (ℚ, ⋅)
is not a group.
However, 0 is the only rational number without a multiplicative inverse. In fact, you will be
asked to show in Problem 9 below that (ℚ∗, ⋅) is a commutative group, where ℚ∗ is the set of
rational numbers with 0 removed.
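The rules for adding and multiplying rational numbers can be compared against exact rational arithmetic. The sketch below uses Python's standard `fractions.Fraction` type; the helper names `add_rule` and `mul_rule` are hypothetical and simply transcribe the two rules stated in parts 4 and 5.

```python
from fractions import Fraction

def add_rule(a, b, c, d):
    """a/b + c/d computed as (a*d + b*c)/(b*d)."""
    return Fraction(a * d + b * c, b * d)

def mul_rule(a, b, c, d):
    """a/b * c/d computed as (a*c)/(b*d)."""
    return Fraction(a * c, b * d)

print(Fraction(1, 2) == Fraction(3, 6))   # identified because 1*6 == 2*3
print(add_rule(1, 2, 1, 3) == Fraction(1, 2) + Fraction(1, 3))
print(add_rule(3, 4, 0, 1) == Fraction(3, 4))   # 0/1 is an additive identity
print(mul_rule(3, 4, 1, 1) == Fraction(3, 4))   # 1/1 is a multiplicative identity
print(mul_rule(0, 1, 3, 4) == 0)                # 0 times anything is 0, never 1
```

`Fraction` stores every value in lowest terms, which is one concrete way of implementing the identification of a/b with c/d whenever ad = bc.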
Note: When multiplying two numbers, we sometimes drop the dot (⋅) for easier readability. So, we may
write x ⋅ y as xy. We may also use parentheses instead of the dot. For example, we might write
a/b ⋅ c/d as (a/b)(c/d), whereas we would probably write (a⋅c)/(b⋅d) as (ac)/(bd). We may even use
this simplified notation for arbitrary group operations. So, we could write a ⋆ b as ab. However, we will
avoid doing this if it would lead to confusion. For example, we will not write a + b as ab.
Problem Set 3
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. For each of the following multiplication tables defined on the set S = {a, b}, determine if each
of the following is true or false:
(i) ⋆ defines a binary operation on S.
(ii) ⋆ is commutative in S.
(iii) a is an identity with respect to ⋆.
(iv) b is an identity with respect to ⋆.
[Tables I, II, III, and IV: four multiplication tables for ⋆ on S = {a, b}]
2. Show that there are exactly two monoids on the set S = {e, a}, where e is the identity. Which of
these monoids are groups? Which of these monoids are commutative?
LEVEL 2
3. Let G = {e, a, b} and let (G, ⋆) be a group with identity element e. Draw a multiplication table
for (G, ⋆).
4. Prove that in any monoid (M, ⋆), the identity element is unique.
LEVEL 3
5. Assume that a group (G, ⋆) of order 4 exists with G = {e, a, b, c}, where e is the identity,
a² = b and b² = e. Construct the table for the operation of such a group.
6. Prove that in any group (G, ⋆), each element has a unique inverse.
LEVEL 4
7. Let (G, ⋆) be a group with a, b ∈ G, and let a⁻¹ and b⁻¹ be the inverses of a and b, respectively.
Prove
(i) (a ⋆ b)⁻¹ = b⁻¹ ⋆ a⁻¹.
(ii) the inverse of a⁻¹ is a.
8. Let (G, ⋆) be a group such that a² = e for all a ∈ G. Prove that (G, ⋆) is commutative.
9. Prove that (ℚ*, ⋅) is a commutative group.
LEVEL 5
10. Prove that there are exactly two groups of order 4, up to renaming the elements.
11. Show that (ℚ, +) is a commutative group.
12. Let S = {a, b}, where a ≠ b. How many binary operations are there on S? How many semigroups
are there of the form (S, ⋆), up to renaming the elements?
37
LESSON 4 – NUMBER THEORY
THE RING OF INTEGERS
Rings and Distributivity
Before giving the general definition of a ring, let’s look at an important example.
Example 4.1: Recall that ℤ = {…, –4, –3, –2, –1, 0, 1, 2, 3, 4, …} is the set of integers. Let’s go over
some of the properties of addition and multiplication on this set.
1. ℤ is closed under addition. In other words, whenever we add two integers, we get another
integer. For example, 2 and 3 are integers, and we have 2 + 3 = 5, which is also an integer. As
another example, –8 and 6 are integers, and so is –8 + 6 = –2.
2. Addition is commutative in ℤ. In other words, when we add two integers, it does not matter
which one comes first. For example, 2 + 3 = 5 and 3 + 2 = 5. So, we see that 2 + 3 = 3 + 2.
As another example, –8 + 6 = –2 and 6 + (–8) = –2. So, we see that –8 + 6 = 6 + (–8).
3. Addition is associative in ℤ. In other words, when we add three integers, it doesn’t matter if we
begin by adding the first two or the last two integers. For example, (2 + 3) + 4 = 5 + 4 = 9
and 2 + (3 + 4) = 2 + 7 = 9. So, (2 + 3) + 4 = 2 + (3 + 4). As another example, we have
(–8 + 6) + (–5) = –2 + (–5) = –7 and –8 + (6 + (–5)) = –8 + 1 = –7. So, we see that
(–8 + 6) + (–5) = –8 + (6 + (–5)).
4. ℤ has an identity for addition, namely 0. Whenever we add 0 to another integer, the result is
that same integer. For example, we have 0 + 3 = 3 and 3 + 0 = 3. As another example,
0 + (–5) = –5 and (–5) + 0 = –5.
5. Every integer has an additive inverse. This is an integer that we add to the original integer to
get 0 (the additive identity). For example, the additive inverse of 5 is –5 because we have
5 + (–5) = 0 and –5 + 5 = 0. Notice that the same two equations also show that the inverse
of –5 is 5. We can say that 5 and –5 are additive inverses of each other.
We can summarize the five properties above by saying that (ℤ, +) is a commutative group.
6. ℤ is closed under multiplication. In other words, whenever we multiply two integers, we get
another integer. For example, 2 and 3 are integers, and we have 2 ⋅ 3 = 6, which is also an
integer. As another example, –3 and –4 are integers, and so is (–3)(–4) = 12.
7. Multiplication is commutative in ℤ. In other words, when we multiply two integers, it does not
matter which one comes first. For example, 2 ⋅ 3 = 6 and 3 ⋅ 2 = 6. So, 2 ⋅ 3 = 3 ⋅ 2. As another
example, –8 ⋅ 6 = –48 and 6(–8) = –48. So, we see that –8 ⋅ 6 = 6(–8).
8. Multiplication is associative in ℤ. In other words, when we multiply three integers, it doesn’t
matter if we begin by multiplying the first two or the last two integers. For example,
(2 ⋅ 3) ⋅ 4 = 6 ⋅ 4 = 24 and 2 ⋅ (3 ⋅ 4) = 2 ⋅ 12 = 24. So, (2 ⋅ 3) ⋅ 4 = 2 ⋅ (3 ⋅ 4). As another
example, (–5 ⋅ 2) ⋅ (–6) = –10 ⋅ (–6) = 60 and –5 ⋅ (2 ⋅ (–6)) = –5 ⋅ (–12) = 60. So, we
see that (–5 ⋅ 2) ⋅ (–6) = –5 ⋅ (2 ⋅ (–6)).
9. ℤ has an identity for multiplication, namely 1. Whenever we multiply 1 by another integer, the
result is that same integer. For example, we have 1 ⋅ 3 = 3 and 3 ⋅ 1 = 3. As another example,
1 ⋅ (–5) = –5 and (–5) ⋅ 1 = –5.
We can summarize the four properties above by saying that (ℤ, ⋅) is a commutative monoid.
10. Multiplication is distributive over addition in ℤ. This means that whenever a, b, and c are
integers, we have a ⋅ (b + c) = a ⋅ b + a ⋅ c. For example, 4 ⋅ (2 + 1) = 4 ⋅ 3 = 12 and
4 ⋅ 2 + 4 ⋅ 1 = 8 + 4 = 12. So, 4 ⋅ (2 + 1) = 4 ⋅ 2 + 4 ⋅ 1. As another example, we have
–2 ⋅ ((–1) + 3) = –2(2) = –4 and –2 ⋅ (–1) + (–2) ⋅ 3 = 2 − 6 = –4. Therefore, we see
that –2 ⋅ ((–1) + 3) = –2 ⋅ (–1) + (–2) ⋅ 3.
Notes: (1) Since the properties listed in 1 through 10 above are satisfied, we say that (ℤ, +, ⋅) is a ring.
We will give the formal definition of a ring below.
(2) Observe that a ring consists of (i) a set (in this case ℤ), and (ii) two binary operations on the set
called addition and multiplication.
(3) (ℤ, +) is a commutative group and (ℤ, ⋅) is a commutative monoid. The distributive property is the
only property mentioned that requires both addition and multiplication.
(4) We see that ℤ is missing one nice property: the inverse property for multiplication. For example, 2
has no multiplicative inverse in ℤ. There is no integer n such that 2 ⋅ n = 1. So, the linear equation
2n − 1 = 0 has no solution in ℤ.
(5) If we replace ℤ by the set of natural numbers ℕ = {0, 1, 2, …}, then all the properties mentioned
above are satisfied except property 5, the inverse property for addition. For example, 1 has no
additive inverse in ℕ. There is no natural number n such that n + 1 = 0.
(6) ℤ actually satisfies two distributive properties. Left distributivity says that whenever a, b, and c
are integers, we have a ⋅ (b + c) = a ⋅ b + a ⋅ c. Right distributivity says that whenever a, b, and c
are integers, we have (b + c) ⋅ a = b ⋅ a + c ⋅ a. Since multiplication is commutative in ℤ, left
distributivity and right distributivity are equivalent.
(7) Let’s show that left distributivity together with commutativity of multiplication in ℤ implies right
distributivity in ℤ. If we assume that we have left distributivity and commutativity of multiplication,
then for integers a, b, and c, we have (b + c) ⋅ a = a(b + c) = a ⋅ b + a ⋅ c = b ⋅ a + c ⋅ a.
We are now ready to give the more general definition of a ring.
A ring is a triple (R, +, ⋅), where R is a set and + and ⋅ are binary operations on R satisfying
(1) (R, +) is a commutative group.
(2) (R, ⋅) is a monoid.
(3) Multiplication is distributive over addition in R. That is, for all x, y, z ∈ R, we have
x ⋅ (y + z) = x ⋅ y + x ⋅ z
and
(y + z) ⋅ x = y ⋅ x + z ⋅ x.
Recall: The symbol ∈ is used for membership in a set. Specifically, the statement a ∈ S can be read as
“a is a member of the set S,” or more simply as “a is in S.” For example, 2 ∈ ℕ means “2 is in the set
of natural numbers,” or more simply, “2 is a natural number.”
We will always refer to the operation + as addition and the operation ⋅ as multiplication. We will also
adjust our notation accordingly. For example, we will refer to the identity for + as 0, and the additive
inverse of an element x ∈ R as –x. Also, we will refer to the identity for ⋅ as 1, and the multiplicative
inverse of an element x ∈ R (if it exists) as x⁻¹ or 1/x.
Notes: (1) Recall from Lesson 3 that (R, +) a commutative group means the following:
• (Closure) For all x, y ∈ R, x + y ∈ R.
• (Associativity) For all x, y, z ∈ R, (x + y) + z = x + (y + z).
• (Commutativity) For all x, y ∈ R, x + y = y + x.
• (Identity) There exists an element 0 ∈ R such that for all x ∈ R, 0 + x = x + 0 = x.
• (Inverse) For each x ∈ R, there is –x ∈ R such that x + (–x) = (–x) + x = 0.
(2) Recall from Lesson 3 that (R, ⋅) a monoid means the following:
• (Closure) For all x, y ∈ R, x ⋅ y ∈ R.
• (Associativity) For all x, y, z ∈ R, (x ⋅ y) ⋅ z = x ⋅ (y ⋅ z).
• (Identity) There exists an element 1 ∈ R such that for all x ∈ R, 1 ⋅ x = x ⋅ 1 = x.
(3) Although commutativity of multiplication is not required for the definition of a ring, our most
important example (the ring of integers) satisfies this condition. When multiplication is commutative
in R, we call the ring a commutative ring. In this case we have the following additional property:
• (Commutativity) For all x, y ∈ R, x ⋅ y = y ⋅ x.
(4) Observe that we have two distributive properties in the definition for a ring. The first property is
called left distributivity and the second is called right distributivity.
(5) In a commutative ring, left distributivity implies right distributivity and vice versa. In this case, the
distributive property simplifies to
• (Distributivity) For all x, y, z ∈ R, x ⋅ (y + z) = x ⋅ y + x ⋅ z.
(6) Some authors leave out the multiplicative identity property in the definition of a ring. Those
authors call a ring that does have a multiplicative identity a unital ring or a ring with identity. Since
we are mostly concerned with the ring of integers, we will adopt the convention that a ring has a
multiplicative identity. If we do not wish to assume that R has a multiplicative identity, then we will
call the structure “almost a ring” or rng (note the missing “i”).
(7) The properties that define a ring are called the ring axioms. In general, an axiom is a statement that
is assumed to be true. So, the ring axioms are the statements that are given to be true in all rings. There
are many other statements that are true in rings. However, any additional statements need to be
proved using the axioms.
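When the underlying set is finite, the ring axioms can be checked one by one by brute force. The following Python sketch does this under some assumptions of ours: the function name is_ring is illustrative, the operations are passed in as ordinary Python functions, and the sample structure (arithmetic modulo 3 on the set {0, 1, 2}) is chosen only for illustration and is not an example from this lesson. A check like this is not possible for an infinite ring such as ℤ, where the axioms need to be verified by proof.

```python
from itertools import product

def is_ring(R, add, mul):
    """Brute-force check of the ring axioms on a finite set R.

    add and mul are Python functions of two arguments. All names here
    are illustrative.
    """
    R = list(R)
    # Closure: both operations must be binary operations on R.
    if any(add(x, y) not in R or mul(x, y) not in R for x, y in product(R, R)):
        return False
    # (R, +) is a commutative group.
    zeros = [e for e in R if all(add(e, x) == x == add(x, e) for x in R)]
    if not zeros:
        return False
    zero = zeros[0]
    if any(add(x, y) != add(y, x) for x, y in product(R, R)):
        return False
    if any(add(add(x, y), z) != add(x, add(y, z)) for x, y, z in product(R, R, R)):
        return False
    if any(all(add(x, y) != zero for y in R) for x in R):
        return False
    # (R, .) is a monoid.
    if any(mul(mul(x, y), z) != mul(x, mul(y, z)) for x, y, z in product(R, R, R)):
        return False
    if not any(all(mul(e, x) == x == mul(x, e) for x in R) for e in R):
        return False
    # Left and right distributivity.
    return all(
        mul(x, add(y, z)) == add(mul(x, y), mul(x, z))
        and mul(add(y, z), x) == add(mul(y, x), mul(z, x))
        for x, y, z in product(R, R, R)
    )

# Arithmetic modulo 3 on {0, 1, 2}: an example chosen for illustration.
mod3 = is_ring(range(3), lambda x, y: (x + y) % 3, lambda x, y: (x * y) % 3)

# {0, 1, 2} with ordinary addition is not closed (1 + 2 = 3), so the check fails.
plain = is_ring(range(3), lambda x, y: x + y, lambda x, y: x * y)
```

Because addition is checked to be commutative before inverses are tested, it is enough for the sketch to look for a one-sided additive inverse.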
Example 4.2:
1. (ℤ, +, ⋅) is a commutative ring with additive identity 0 and multiplicative identity 1. The
additive inverse of an integer a is the integer –a. This is the ring we will be focusing most of our
attention on. See Example 4.1 for more details.
2. (ℕ, +, ⋅) is not a ring because (ℕ, +) is not a group. The only group property that fails is the
additive inverse property. For example, the natural number 1 has no additive inverse. That is,
n + 1 = 0 has no solution in ℕ. Note that (ℕ, ⋅) is a commutative monoid and the distributive
property holds in ℕ. Therefore, (ℕ, +, ⋅) misses being a commutative ring by just that one
property. (ℕ, +, ⋅) is an example of a structure called a semiring.
3. Recall from Example 3.6 (4 and 5) that the set of rational numbers is ℚ = {a/b | a, b ∈ ℤ, b ≠ 0}
and we define addition and multiplication on ℚ by a/b + c/d = (ad + bc)/(bd) and a/b ⋅ c/d = (ac)/(bd).
(ℚ, +, ⋅) is a commutative ring with additive identity 0 = 0/1 and multiplicative identity 1 = 1/1.
The additive inverse of a rational number a/b is the rational number (–a)/b.
ℚ has one additional property not required in the definition of a ring. Every nonzero element
of ℚ has a multiplicative inverse. The inverse of the nonzero rational number a/b is the rational
number b/a. This is easy to verify: a/b ⋅ b/a = (ab)/(ba) = (ab)/(ab) = 1/1 = 1 and
b/a ⋅ a/b = (ba)/(ab) = (ab)/(ab) = 1/1 = 1. So, (ℚ*, ⋅) is a commutative group, where ℚ* is the
set of nonzero rational numbers.
If we replace the condition “(R, ⋅) is a monoid” in the definition of a ring (condition 2) with the
condition “(R*, ⋅) is a commutative group,” we get a structure called a field. By the remarks in
the last paragraph, we see that (ℚ, +, ⋅) is a field.
Technical note: The definition of semiring has one additional property: 0 ⋅ x = x ⋅ 0 = 0. Without the
additive inverse property this new property does not follow from the others, and so, it must be listed
explicitly.
Divisibility
An integer n is called even if there is another integer k such that n = 2k.
Example 4.3:
1. 6 is even because 6 = 2 ⋅ 3.
2. –14 is even because –14 = 2 ⋅ (–7).
3. We can write 1 = 2 ⋅ 1/2, but this does not show that 1 is even (and as we all know, it is not). In
the definition of even, it is very important that k is an integer. The problem here is that 1/2 is not
an integer, and so, it cannot be used as a value for k in the definition of even.
We define the sum of integers a and b to be a + b. We define the product of a and b to be a ⋅ b.
Theorem 4.1: The sum of two even integers is even.
Strategy: Before writing the proof, let’s think about our strategy. We need to start with two arbitrary
but specific even integers. Let’s call them m and n. Notice that we need to give them different names
because there is no reason that they need to have the same value.
When we try to add m and n, we get m + n. Hmmm…I see no reason yet why the expression m + n
should represent an even integer.
The problem is that we haven’t yet used the definition of even. If we invoke the definition, we get
integers j and k such that m = 2j and n = 2k.
Now, when we add m and n, we get m + n = 2j + 2k.
Is it clear that 2j + 2k represents an even integer? Nope…not yet. To be even, our final expression
needs to have the form 2b, where b is an integer.
Here is where we use the fact that (ℤ, +, ⋅) is a ring. Specifically, we use the distributive property to
rewrite 2j + 2k as 2(j + k).
It looks like we’ve done it. We just need to verify one more thing: is j + k an integer? Once again, we
can use the fact that (ℤ, +, ⋅) is a ring to verify this. Specifically, we use the fact that + is a binary
operation on ℤ.
I think we’re now ready to write the proof.
Proof of Theorem 4.1: Let m and n be even integers. Then there are integers j and k such that m = 2j
and n = 2k. So, m + n = 2j + 2k = 2(j + k) because multiplication is distributive over addition in ℤ.
Since ℤ is closed under addition, j + k ∈ ℤ. Therefore, m + n is even.
□
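The proof of Theorem 4.1 is constructive: from integers witnessing that the two summands are even, it produces an integer witnessing that the sum is even. Here is a small Python sketch of that idea. The function names even_witness and sum_witness are ours and purely illustrative, and the final check covers only a small sample of even integers, so it is a sanity check rather than a replacement for the proof.

```python
def even_witness(n):
    """Return an integer k with n == 2*k, or None if no such integer exists."""
    k = n // 2
    return k if 2 * k == n else None

def sum_witness(m, n):
    """Follow the proof: if m = 2j and n = 2k, then m + n = 2(j + k)."""
    j, k = even_witness(m), even_witness(n)
    if j is None or k is None:
        raise ValueError("both arguments must be even")
    return j + k

evens = range(-10, 11, 2)
theorem_holds = all(m + n == 2 * sum_witness(m, n) for m in evens for n in evens)
```

Note that even_witness(1) returns None: the only candidate is 1/2, which is not an integer.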
The property of being even is a special case of the more general notion of divisibility.
An integer n is divisible by an integer k, written k|n, if there is another integer b such that n = kb. We
also say that k is a factor of n, k is a divisor of n, k divides n, or n is a multiple of k.
Example 4.4:
1. Note that being divisible by 2 is the same as being even.
2. 18 is divisible by 3 because 18 = 3 ⋅ 6.
3. – 56 is divisible by 7 because – 56 = 7 ⋅ (– 8).
Theorem 4.2: The product of two integers that are each divisible by k is also divisible by k.
Proof: Let m and n be integers that are divisible by k. Then there are integers b and c such that
m = kb and n = kc. So, m ⋅ n = (k ⋅ b) ⋅ (k ⋅ c) = k ⋅ (b ⋅ (k ⋅ c)) because multiplication is
associative in ℤ. Since ℤ is closed under multiplication, b ⋅ (k ⋅ c) ∈ ℤ. Thus, m ⋅ n is divisible by k. □
Notes: (1) If you’re confused about how associativity was used here, it might help to make the
substitution u = (k ⋅ c). Then we have (k ⋅ b) ⋅ (k ⋅ c) = (k ⋅ b) ⋅ u = k ⋅ (b ⋅ u) = k(b ⋅ (k ⋅ c)).
(2) Although it may seem tempting to simplify k ⋅ (b ⋅ (k ⋅ c)) further, it is unnecessary. The definition
of divisibility by k requires us only to generate an expression of the form k times some integer, and
that’s what we have done.
(3) If the generality of the proof confuses you, try replacing k by a specific integer. For example, if we
let k = 2, we have m = 2b, n = 2c, and therefore m ⋅ n = (2b) ⋅ (2c) = 2(b ⋅ (2c)). Is it clear that
this final expression is even (divisible by 2)?
(4) It’s worth noting that the product m ⋅ n is actually divisible by k². Indeed, we have
m ⋅ n = k ⋅ (b ⋅ (k ⋅ c)) = k ⋅ ((b ⋅ k) ⋅ c) = k ⋅ ((k ⋅ b) ⋅ c) = k ⋅ (k ⋅ (b ⋅ c)) = k²(b ⋅ c).
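The definition of divisibility and the conclusion of Theorem 4.2 can be spot-checked with a few lines of Python. This is a sketch under our own assumptions: the function name divides is illustrative, the divisor 7 is an arbitrary choice, and the case k = 0 is handled separately because Python's remainder operator is undefined for a zero divisor. The check ranges over a small sample of multiples and does not replace the proof.

```python
def divides(k, n):
    """Return True if n = k*b for some integer b (written k | n in the text)."""
    if k == 0:
        return n == 0  # 0 * b is always 0, so 0 divides only 0
    return n % k == 0

multiples_of_7 = [7 * b for b in range(-5, 6)]

# Theorem 4.2 with k = 7: the product of two multiples of 7 is a multiple of 7.
product_divisible = all(
    divides(7, m * n) for m in multiples_of_7 for n in multiples_of_7
)

# The stronger observation: the product is in fact divisible by 7 squared.
square_divides = all(
    divides(7**2, m * n) for m in multiples_of_7 for n in multiples_of_7
)
```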
Induction
The Well Ordering Principle says that every nonempty subset of natural numbers has a least element.
For example, the least element of ℕ itself is 0.
Theorem 4.3 (The Principle of Mathematical Induction): Let S be a set of natural numbers such that
(i) 0 ∈ S and (ii) for all k ∈ ℕ, k ∈ S → k + 1 ∈ S. Then S = ℕ.
Notes: (1) The Principle of Mathematical Induction works like a chain reaction. We know that 0 ∈ S
(this is condition (i)). Substituting 0 in for k in the expression “k ∈ S → k + 1 ∈ S” (condition (ii)) gives
us 0 ∈ S → 1 ∈ S. So, we have that 0 is in the set S, and “if 0 is in the set S, then 1 is in the set S.” So,
1 ∈ S must also be true.
(2) In terms of Lesson 1 on Sentential Logic, if we let p be the statement 0 ∈ S and q the statement
1 ∈ S, then we are given that p ∧ (p → q) is true. Observe that the only way that this statement can
be true is if q is also true. Indeed, we must have both p ≡ T and p → q ≡ T. If q were false, then we
would have p → q ≡ T → F ≡ F. So, we must have q ≡ T.
(3) Now that we showed 1 ∈ S is true (from Note 1 above), we can substitute 1 for k in the expression
“k ∈ S → k + 1 ∈ S” (condition (ii)) to get 1 ∈ S → 2 ∈ S. So, we have 1 ∈ S ∧ (1 ∈ S → 2 ∈ S) is
true. So, 2 ∈ S must also be true.
(4) In general, we get the following chain reaction:
0 ∈ S → 1 ∈ S → 2 ∈ S → 3 ∈ S → ⋯
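The chain reaction described in the notes above can be imitated by a short loop. The Python sketch below is only an illustration under our own assumptions: the function name chain_reaction and its parameters step and limit are hypothetical, and a computer can follow the chain only up to a finite limit, whereas the theorem concerns all of ℕ.

```python
def chain_reaction(step, limit):
    """Start with 0 in S and apply condition (ii) repeatedly.

    step(k) should return True when 'k in S implies k + 1 in S' is known to hold.
    Returns the set of natural numbers below limit shown to be in S.
    """
    S = {0}                      # condition (i)
    k = 0
    while k + 1 < limit and step(k):
        S.add(k + 1)             # from k in S and (k in S -> k + 1 in S)
        k += 1
    return S

# If the implication holds for every k, every number below the limit is reached.
full = chain_reaction(lambda k: True, 10)

# If the implication fails at k = 4, the chain stops and 5 is never reached.
broken = chain_reaction(lambda k: k != 4, 10)
```

The second call shows why condition (ii) is required for all k: a single broken link stops the chain from reaching anything beyond it.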
I hope that the “argument” presented in Notes 1 through 4 above convinces you that the Principle of
Mathematical Induction should be true. Now let’s give a proof using the Well Ordering Principle. Proofs
involving the Well Ordering Principle are generally done by contradiction.
Proof of Theorem 4.3: Let S be a set of natural numbers such that 0 ∈ S (condition (i)), and such that
whenever k ∈ S, k + 1 ∈ S (condition (ii)). Assume toward contradiction that S ≠ ℕ. Let
A = {k ∈ ℕ | k ∉ S} (so, A is the set of natural numbers not in S). Since S ≠ ℕ, A is nonempty. So, by
the Well Ordering Principle, A has a least element, let’s call it a. We have a ≠ 0 because 0 ∈ S and
a ∉ S. So, a − 1 ∈ ℕ. Since a is the least element of A and a − 1 < a, we have a − 1 ∉ A, and so,
a − 1 ∈ S. Letting k = a − 1 in condition (ii), we get k + 1 ∈ S, that is, (a − 1) + 1 = a ∈ S.
But a ∈ A, which means that a ∉ S. This is a contradiction, and so, S = ℕ.
□
Note: The proof given here is a proof by contradiction. A proof by contradiction works as follows:
1. We assume the negation of what we are trying to prove.
2. We use a logically valid argument to derive a statement which is false.
3. Since the argument was logically valid, the only possible error is our original assumption.
Therefore, the negation of our original assumption must be true.
In this problem we are trying to prove that S = ℕ. The negation of this statement is that S ≠ ℕ, and so
that is what we assume.
We then define a set A which contains elements of ℕ that are not in S. In reality, this set is empty
(because the conclusion of the theorem is S = ℕ). However, our (wrong!) assumption that S ≠ ℕ tells
us that this set A actually has something in it. Saying that A has something in it is an example of a false
statement that was derived from a logically valid argument. This false statement occurred not because
of an error in our logic, but because we started with an incorrect assumption (S ≠ ℕ).
The Well Ordering Principle then allows us to pick out the least element of this set A. Note that we can
do this because A is a subset of ℕ. This wouldn’t work if we knew only that A was a subset of ℤ, as ℤ
does not satisfy the Well Ordering Principle (for example, ℤ itself has no least element).
Again, although the argument that A has a least element is logically valid, A does not actually have any
elements at all. We are working from the (wrong!) assumption that S ≠ ℕ.
Once we have our hands on this least element a, we can get our contradiction. What can this least
element a be? Well a was chosen to not be in S, so a cannot be 0 (because 0 is in S). Also, we know
that a − 1 ∈ S (because a is the least element not in S). But condition (ii) then forces a to be in S
(because a = (a − 1) + 1).
So, we wind up with a ∈ S, contradicting the fact that a is the least element not in S.
The Principle of Mathematical Induction is often written in the following way:
(⋆) Let P(n) be a statement and suppose that (i) P(0) is true and (ii) for all k ∈ ℕ, P(k) → P(k + 1).
Then P(n) is true for all n ∈ ℕ.
In Problem 9 below, you will be asked to show that statement (⋆) is equivalent to Theorem 4.3.
There are essentially two steps involved in a proof by mathematical induction. The first step is to prove
that P(0) is true (this is called the base case), and the second step is to assume that P(k) is true, and
use this to show that P(k + 1) is true (this is called the inductive step). While doing the inductive step,
the statement “P(k) is true” is often referred to as the inductive hypothesis.
Subtraction in ℤ: For x, y ∈ ℤ, we define the difference x − y to be equal to the sum x + (–y). For
example, n² − n = n² + (–n) (where n² is defined to be the product n ⋅ n).
Example 4.5: Let’s use the Principle of Mathematical Induction to prove that for all natural numbers n,
n² − n is even.
Base Case (k = 0): 0² − 0 = 0 = 2 ⋅ 0. So, 0² − 0 is even.
Inductive Step: Let k ∈ ℕ and assume that k² − k is even. Then k² − k = 2b for some integer b. Now,
(k + 1)² − (k + 1) = (k + 1)[(k + 1) − 1] = (k + 1)[k + (1 − 1)] = (k + 1)(k + 0)
= (k + 1) ⋅ k = k² + k = (k² − k) + 2k = 2b + 2k = 2(b + k).
Here we used the fact that (ℤ, +, ⋅) is a ring. Since ℤ is closed under addition, b + k ∈ ℤ. Therefore,
(k + 1)² − (k + 1) is even.
By the Principle of Mathematical Induction, n² − n is even for all n ∈ ℕ.
□
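The inductive step in Example 4.5 does more than show that a witness exists: it tells us how to build one. If k² − k = 2b, then (k + 1)² − (k + 1) = 2(b + k). The Python sketch below follows that recipe, starting from the base case. The function name witness is ours and illustrative, and the final check over the first 200 natural numbers is a sanity check, not a proof.

```python
def witness(n):
    """Return b with n**2 - n == 2*b, built as in the inductive step."""
    b = 0                        # base case: 0**2 - 0 = 2 * 0
    for k in range(n):
        # (k + 1)**2 - (k + 1) = (k**2 - k) + 2k = 2b + 2k = 2(b + k)
        b = b + k
    return b

checks = all(n**2 - n == 2 * witness(n) for n in range(200))
```

For example, witness(5) returns 0 + 1 + 2 + 3 + 4 = 10, and indeed 5² − 5 = 20 = 2 ⋅ 10.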
Notes: (1) Instead of listing every property that we used at each step, we simply stated that all the
computations we made were allowed because (ℤ, +, ⋅) is a ring. We will discuss the property we used
at each step in the notes below.
(2) We first used left distributivity to rewrite (k + 1)² − (k + 1) as (k + 1)[(k + 1) − 1]. If you have
trouble seeing this, try working backwards, and making the substitutions x = (k + 1), y = (k + 1),
and z = –1. We then have
(k + 1)[(k + 1) − 1] = (k + 1)[(k + 1) + (–1)] = x(y + z) = xy + xz
= (k + 1)(k + 1) + (k + 1)(–1) = (k + 1)² + (–1)(k + 1) = (k + 1)² − (k + 1).
Notice how we also used commutativity of multiplication for the second to last equality.
(3) For the second algebraic step, we used associativity of addition to write
(k + 1) − 1 = (k + 1) + (–1) = k + (1 + (–1)) = k + (1 − 1).
(4) For the third algebraic step, we used the inverse property for addition to write
1 − 1 = 1 + (–1) = 0.
(5) For the fourth algebraic step, we used the additive identity property to write k + 0 = k.
(6) For the fifth algebraic step, we used right distributivity and the multiplicative identity property to
write (k + 1) ⋅ k = k ⋅ k + 1 ⋅ k = k² + k.
(7) For the sixth algebraic step, we used what I call the “Standard Advanced Calculus Trick.” I
sometimes abbreviate this as SACT. The trick is simple. If you need something to appear, just put it in.
Then correct it by performing the opposite of what you just did.
In this case, in order to use the inductive hypothesis, we need k² − k to appear, but unfortunately, we
have k² + k instead. Using SACT, I do the following:
• I simply put in what I need (and exactly where I need it): k² − k + k.
• Now, I undo the damage by performing the reverse operation: k² − k + k + k.
• Finally, I leave the part I need as is, and simplify the rest: (k² − k) + 2k.
(8) For the seventh step, we simply replaced k² − k by 2b. We established that these two quantities
were equal in the second sentence of the inductive step.
(9) For the last step, we used left distributivity to write 2b + 2k as 2(b + k).
Sometimes a statement involving the natural numbers may be false for 0, but true from some natural
number on. In this case, we can still use induction. We just need to adjust the base case.
Example 4.6: Let’s use the Principle of Mathematical Induction to prove that n² > 2n + 1 for all natural
numbers n ≥ 3.
Base Case (k = 3): 3² = 9 and 2 ⋅ 3 + 1 = 6 + 1 = 7. So, 3² > 2 ⋅ 3 + 1.
Inductive Step: Let k ∈ ℕ with k ≥ 3 and assume that k² > 2k + 1. Then we have
(k + 1)² = (k + 1)(k + 1) = (k + 1)k + (k + 1)(1) = k² + k + k + 1 > (2k + 1) + k + k + 1
= 2k + 2 + k + k = 2(k + 1) + k + k ≥ 2(k + 1) + 1 (because k + k ≥ 3 + 3 = 6 ≥ 1).
By the Principle of Mathematical Induction, n² > 2n + 1 for all n ∈ ℕ with n ≥ 3.
□
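Before or after writing a proof like the one in Example 4.6, it can be reassuring to test the statement numerically. The following Python sketch checks the base case, spot-checks the inequality on a finite range, confirms that the inequality really does fail below 3, and tests each link of the chain used in the inductive step. The function name holds and the upper bound 1000 are our own choices, and none of this replaces the induction argument.

```python
def holds(n):
    """The statement from Example 4.6 for a single natural number n."""
    return n**2 > 2 * n + 1

base_case = holds(3)

# Spot check of the inequality for 3 <= n < 1000 (not a proof).
range_ok = all(holds(n) for n in range(3, 1000))

# The statement fails for 0, 1, and 2, which is why the base case is n = 3.
small_fail = [n for n in range(3) if not holds(n)]

# Each link of the chain in the inductive step, checked for sample k >= 3.
chain_ok = all(
    (k + 1)**2 == k**2 + k + k + 1 > (2*k + 1) + k + k + 1
    == 2*(k + 1) + k + k >= 2*(k + 1) + 1
    for k in range(3, 1000)
)
```

Python evaluates a chained comparison such as a == b > c as (a == b) and (b > c), so chain_ok mirrors the mixed sequence of =, >, and ≥ that appears in the proof.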
Notes: (1) If we have a sequence of equations and inequalities of the form =, ≥, and > (with at least
one inequality symbol appearing), beginning with a and ending with b, then the final result is a > b if
> appears at least once and a ≥ b otherwise.
For example, if a = j = h = m > n = p = q ≥ b, then a > b. The sequence that appears in the
solution above has this form.
(k + 1)² = (k + 1)(k + 1) = (k + 1)k + (k + 1)(1) = k² + k + k + 1 > (2k + 1) + k + k + 1
= 2k + 2 + k + k = 2(k + 1) + k + k ≥ 2(k + 1) + 1
(2) By definition, x² = x ⋅ x. We used this in the first equality in the inductive step to write (k + 1)² as
(k + 1)(k + 1).
(3) For the second equality in the inductive step, we used left distributivity to write (k + 1)(k + 1) as
(k + 1)k + (k + 1)(1). If you have trouble seeing this, you can make a substitution like we did in Note
2 following Example 4.5.
(4) For the third equality in the inductive step, we used right distributivity to write (k + 1)k as
k ⋅ k + 1 ⋅ k = k² + k. We also used the multiplicative identity property to write (k + 1)(1) = k + 1.
(5) Associativity of addition is being used when we write the expression k² + k + k + 1. Notice the
lack of parentheses. Technically speaking, we should have written (k² + k) + (k + 1) and then taken
another step to rewrite this as k² + (k + (k + 1)). However, since we have associativity, we can
simply drop all those parentheses.
(6) The inequality “k² + k + k + 1 > (2k + 1) + k + k + 1” was attained by using the inductive
hypothesis “k² > 2k + 1.”
(7) The dedicated reader should verify that the remaining equalities in the proof are valid by
determining which ring properties were used at each step.
Example 4.7: Let’s use the Principle of Mathematical Induction to prove that for every natural number
n, there is a natural number j such that n = 2j or n = 2j + 1.
Base Case (k = 0): 0 = 2 ⋅ 0
Inductive Step: Suppose that k ∈ ℕ and there is j ∈ ℕ such that k = 2j or k = 2j + 1. If k = 2j, then
k + 1 = 2j + 1. If k = 2j + 1, then k + 1 = (2j + 1) + 1 = 2j + (1 + 1) = 2j + 2 = 2(j + 1). Here
we used the fact that (ℕ, +, ⋅) is a semiring (more specifically, we used associativity of addition in ℕ
and distributivity of multiplication over addition in ℕ). Since ℕ is closed under addition, j + 1 ∈ ℕ.
By the Principle of Mathematical Induction, for every natural number n, there is a natural number j
such that n = 2j or n = 2j + 1.
□
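The induction in Example 4.7 can be read as a procedure: start from 0 = 2 ⋅ 0 and update the representation each time we add 1. The Python sketch below does exactly that. The function name halve and the pair (j, r) are our own illustrative choices, where r records which of the two forms 2j or 2j + 1 is currently in use.

```python
def halve(n):
    """Return (j, r) with n == 2*j + r and r in {0, 1}, built by induction on n."""
    j, r = 0, 0                  # base case: 0 = 2 * 0
    for _ in range(n):
        if r == 0:
            r = 1                # k = 2j      gives k + 1 = 2j + 1
        else:
            j, r = j + 1, 0      # k = 2j + 1  gives k + 1 = 2(j + 1)
    return j, r
```

For example, halve(7) returns (3, 1), reflecting 7 = 2 ⋅ 3 + 1. The loop takes n steps, which is slow compared with integer division but matches the structure of the proof.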
Notes: (1) We can now prove the analogous result for the integers: “For every integer n, there is an
integer j such that n = 2j or n = 2j + 1.”
We already proved the result for n ≥ 0. If n < 0, then –n > 0, and so there is a natural number j such
that –n = 2j or –n = 2j + 1. If –n = 2j, then n = 2(–j) (and since j ∈ ℕ, –j ∈ ℤ). If –n = 2j + 1,
then n = –(2j + 1) = –2j − 1 = –2j − 1 − 1 + 1 (SACT) = –2j − 2 + 1 = 2(–j − 1) + 1. Here we
used the fact that (ℤ, +, ⋅) is a ring. Since ℤ is closed under addition, –j − 1 = –j + (–1) ∈ ℤ.
(2) If there is an integer j such that n = 2j, we say that n is even. If there is an integer j such that
n = 2j + 1, we say that n is odd.
(3) An integer n cannot be both even and odd. Indeed, if n = 2j and n = 2k + 1, then 2j = 2k + 1.
So, we have
2(j − k) = 2j − 2k = (2k + 1) − 2k = 2k + (1 − 2k) = 2k + (–2k + 1)
= (2k − 2k) + 1 = 0 + 1 = 1.
So, 2(j − k) = 1. But 2 does not have a multiplicative inverse in ℤ, and so, this is a contradiction.
Theorem 4.4: The product of two odd integers is odd.
Proof: Let m and n be odd integers. Then there are integers j and k such that m = 2j + 1 and
n = 2k + 1. So,
m ⋅ n = (2j + 1) ⋅ (2k + 1) = (2j + 1)(2k) + (2j + 1)(1) = (2k)(2j + 1) + (2j + 1)
= ((2k)(2j) + 2k) + (2j + 1) = (2(k(2j)) + 2k) + (2j + 1) = 2(k(2j) + k) + (2j + 1)
= (2(k(2j) + k) + 2j) + 1 = 2((k(2j) + k) + j) + 1.
Here we used the fact that (ℤ, +, ⋅) is a ring. (Which properties did we use?) Since ℤ is closed under
addition and multiplication, we have (k(2j) + k) + j ∈ ℤ. Therefore, mn is odd.
□
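The last expression in the proof of Theorem 4.4 gives an explicit witness that the product is odd. As a sanity check, the Python sketch below computes that witness and compares it against direct multiplication on a small sample. We write the odd integers as m = 2j + 1 and n = 2k + 1. The function name odd_product_witness and the sample range are our own choices.

```python
def odd_product_witness(j, k):
    """For m = 2j + 1 and n = 2k + 1, return the integer t with m*n = 2t + 1.

    The formula (k(2j) + k) + j is the one reached at the end of the proof.
    """
    return (k * (2 * j) + k) + j

sample = range(-6, 7)
theorem_holds = all(
    (2*j + 1) * (2*k + 1) == 2 * odd_product_witness(j, k) + 1
    for j in sample for k in sample
)
```

For instance, with j = 1 and k = 2 we have m = 3 and n = 5, the witness is 7, and 3 ⋅ 5 = 15 = 2 ⋅ 7 + 1.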
Problem Set 4
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. The addition and multiplication tables below are defined on the set S = {0, 1}. Show that
(S, +, ⋅) does not define a ring.
+ | 0 1
0 | 0 1
1 | 1 0

⋅ | 0 1
0 | 1 0
1 | 0 1
2. Let S = {0, 1} and define addition (+) and multiplication (⋅) so that (S, +, ⋅) is a ring. Assume
that 0 is the additive identity in S and 1 is the multiplicative identity in S. Draw the tables for
addition and multiplication and verify that with these tables, (S, +, ⋅) is a ring.
LEVEL 2
3. Use the Principle of Mathematical Induction to prove the following:
(i) 2^n > n for all natural numbers n ≥ 1.
(ii) 0 + 1 + 2 + ⋯ + n = n(n + 1)/2 for all natural numbers.
(iii) n! > 2^n for all natural numbers n ≥ 4 (where n! = 1 ⋅ 2 ⋯ n for all natural numbers
n ≥ 1).
(iv) 2^n ≥ n² for all natural numbers n ≥ 4.
4. Show that the sum of three integers that are divisible by 5 is divisible by 5.
LEVEL 3
5. Prove that if a, b, c ∈ ℤ with a|b and b|c, then a|c.
6. Prove that n³ − n is divisible by 3 for all natural numbers n.
LEVEL 4
7. Prove that if a, b, c, d, e ∈ ℤ with a|b and a|c, then a|(db + ec).
8. Prove that 3^n − 1 is even for all natural numbers n.
9. Show that Theorem 4.3 (the Principle of Mathematical Induction) is equivalent to the following
statement:
(⋆) Let P(n) be a statement and suppose that (i) P(0) is true and (ii) for all k ∈ ℕ,
P(k) → P(k + 1). Then P(n) is true for all n ∈ ℕ.
LEVEL 5
10. The Principle of Strong Induction is the following statement:
(⋆⋆) Let P(n) be a statement and suppose that (i) P(0) is true and (ii) for all k ∈ ℕ,
∀j ≤ k (P(j)) → P(k + 1). Then P(n) is true for all n ∈ ℕ.
Use the Principle of Mathematical Induction to prove the Principle of Strong Induction.
11. Show that (ℚ, +, ⋅) is a field.
12. Use the Principle of Mathematical Induction to prove that for every n ∈ ℕ, if S is a set with
|S| = n, then S has 2^n subsets. (Hint: Use Problem 14 from Lesson 2.)
LESSON 5 – REAL ANALYSIS
THE COMPLETE ORDERED FIELD OF REALS
Fields
Let’s review the number systems we have discussed so far.
The set ℕ = {0, 1, 2, 3, …} is the set of natural numbers and the structure (ℕ, +, ⋅) is a semiring.
The set ℤ = {…, –3, –2, –1, 0, 1, 2, 3, …} is the set of integers and the structure (ℤ, +, ⋅) is a ring.
The set ℚ = {a/b | a ∈ ℤ, b ∈ ℤ*} is the set of rational numbers and the structure (ℚ, +, ⋅) is a field.
And now let’s formally introduce the notion of a field (and we will review the definitions of ring and
semiring in the notes below).
A field is a triple (F, +, ⋅), where F is a set and + and ⋅ are binary operations on F satisfying
(1) (F, +) is a commutative group.
(2) (F*, ⋅) is a commutative group.
(3) ⋅ is distributive over + in F. That is, for all x, y, z ∈ F, we have
x ⋅ (y + z) = x ⋅ y + x ⋅ z
and
(y + z) ⋅ x = y ⋅ x + z ⋅ x.
(4) 0 ≠ 1.
We will refer to the operation + as addition, the operation ⋅ as multiplication, the additive identity as
0, the multiplicative identity as 1, the additive inverse of an element x ∈ F as –x, and the multiplicative
inverse of an element x ∈ F as x⁻¹. We will often abbreviate x ⋅ y as xy.
Notes: (1) Recall from Lesson 3 that (F, +) a commutative group means the following:
• (Closure) For all x, y ∈ F, x + y ∈ F.
• (Associativity) For all x, y, z ∈ F, (x + y) + z = x + (y + z).
• (Commutativity) For all x, y ∈ F, x + y = y + x.
• (Identity) There exists an element 0 ∈ F such that for all x ∈ F, 0 + x = x + 0 = x.
• (Inverse) For each x ∈ F, there is –x ∈ F such that x + (–x) = (–x) + x = 0.
(2) Similarly, (F*, ⋅) a commutative group means the following:
• (Closure) For all x, y ∈ F*, xy ∈ F*.
• (Associativity) For all x, y, z ∈ F*, (xy)z = x(yz).
• (Commutativity) For all x, y ∈ F*, xy = yx.
• (Identity) There exists an element 1 ∈ F* such that for all x ∈ F*, 1x = x ⋅ 1 = x.
• (Inverse) For each x ∈ F*, there is x⁻¹ ∈ F* such that xx⁻¹ = x⁻¹x = 1.
(3) Recall that $F^*$ is the set of nonzero elements of $F$. We can write $F^* = \{x \in F \mid x \neq 0\}$ (pronounced “the set of $x$ in $F$ such that $x$ is not equal to 0”) or $F^* = F \setminus \{0\}$ (pronounced “$F$ with 0 removed”).
(4) The properties that define a field are called the field axioms. These are the statements that are
given to be true in all fields. There are many other statements that are true in fields. However, any
additional statements need to be proved using the axioms.
(5) If we replace the condition that “$(F^*, \cdot)$ is a commutative group” by “$(F, \cdot)$ is a monoid,” then the resulting structure is called a ring. The most well-known example of a ring is $\mathbb{Z}$, the ring of integers. See Lesson 4 for details about $\mathbb{Z}$ and rings in general.
We also do not require 0 and 1 to be distinct in the definition of a ring. If $0 = 1$, we get the zero ring, which consists of just one element, namely 0 (Why?). The operations of addition and multiplication are defined by $0 + 0 = 0$ and $0 \cdot 0 = 0$. The reader may want to verify that the zero ring is in fact a ring.
The main difference between a ring and a field is that in a ring, there can be nonzero elements that do not have multiplicative inverses. For example, in $\mathbb{Z}$, 2 has no multiplicative inverse. So, the equation $2x = 1$ has no solution.
(6) If we also replace “$(F, +)$ is a commutative group” by “$(F, +)$ is a commutative monoid,” then the resulting structure is a semiring. The most well-known example of a semiring is $\mathbb{N}$, the semiring of natural numbers.
The main difference between a semiring and a ring is that in a semiring, there can be elements that do not have additive inverses. For example, in $\mathbb{N}$, 1 has no additive inverse. Thus, the equation $x + 1 = 0$ has no solution.
Technical note: For a semiring, we include one additional axiom: For all $x \in F$, $0 \cdot x = x \cdot 0 = 0$.
(7) Every field is a commutative ring. Although this is not too hard to show (you will be asked to show this in Problem 6 below), it is worth observing that this is not completely obvious. For example, if $(F, +, \cdot)$ is a ring, then since $(F, \cdot)$ is a monoid with identity 1, it follows that $1 \cdot 0 = 0 \cdot 1 = 0$. However, in the definition of a field given above, this property of 0 is not given as an axiom. We are given that $(F^*, \cdot)$ is a commutative group, and so, it follows that 1 is an identity for $F^*$. But $0 \notin F^*$, and so, $1 \cdot 0 = 0 \cdot 1 = 0$ needs to be proved.
Similarly, in the definition of a field given above, 0 is excluded from associativity and commutativity. These need to be checked.
(8) You were asked to verify that $(\mathbb{Q}, +, \cdot)$ is a field in Problems 9 and 11 from Lesson 3 and Problem 11 from Lesson 4.
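For a finite set, the field axioms listed above can be checked by brute force. The following is a minimal sketch, not part of the original text: the helper names `is_field` and `mod_arithmetic` are hypothetical, and the integers modulo $n$ are used only as convenient test structures that the lesson itself has not introduced.

```python
from itertools import product

def is_field(elements, add, mul, zero, one):
    """Brute-force check of the field axioms on a finite set.

    `add` and `mul` are two-argument functions; `zero` and `one` are the
    claimed additive and multiplicative identities.
    """
    F = list(elements)
    nonzero = [x for x in F if x != zero]   # this plays the role of F*

    def commutative_group(S, op, e):
        closed = all(op(x, y) in S for x, y in product(S, repeat=2))
        assoc = all(op(op(x, y), z) == op(x, op(y, z))
                    for x, y, z in product(S, repeat=3))
        comm = all(op(x, y) == op(y, x) for x, y in product(S, repeat=2))
        ident = e in S and all(op(e, x) == x == op(x, e) for x in S)
        inv = all(any(op(x, y) == e for y in S) for x in S)
        return closed and assoc and comm and ident and inv

    distributive = all(
        mul(x, add(y, z)) == add(mul(x, y), mul(x, z))
        and mul(add(y, z), x) == add(mul(y, x), mul(z, x))
        for x, y, z in product(F, repeat=3)
    )
    return (commutative_group(F, add, zero)          # axiom (1)
            and commutative_group(nonzero, mul, one)  # axiom (2)
            and distributive                          # axiom (3)
            and zero != one)                          # axiom (4)

def mod_arithmetic(n):
    """Return (elements, add, mul) for the integers modulo n (an assumed example)."""
    return (range(n),
            lambda x, y: (x + y) % n,
            lambda x, y: (x * y) % n)

print(is_field(*mod_arithmetic(5), 0, 1))  # True
print(is_field(*mod_arithmetic(4), 0, 1))  # False: 2 * 2 = 0, so F* is not closed
```

The second call fails axiom (2) only; the integers modulo 4 still form a ring, which matches the distinction drawn in Note (5).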
Subtraction and Division: If $a, b \in F$, we define $a - b = a + (-b)$ and for $b \neq 0$, $\frac{a}{b} = ab^{-1}$.
Ordered Rings and Fields
We say that a ring $(R, +, \cdot)$ is ordered if there is a nonempty subset $P$ of $R$, called the set of positive elements of $R$, satisfying the following three properties:
(1) If $a, b \in P$, then $a + b \in P$.
(2) If $a, b \in P$, then $ab \in P$.
(3) If $a \in R$, then exactly one of the following holds: $a \in P$, $a = 0$, or $-a \in P$.
Note: If $a \in P$, we say that $a$ is positive and if $-a \in P$, we say that $a$ is negative.
Also, we define $R^+ = P$ and $R^- = \{a \in R \mid -a \in P\}$.
Example 5.1: Let $R = \mathbb{Z}$ and let $P_{\mathbb{Z}} = \{1, 2, 3, \ldots\}$. It’s easy to see that properties (1), (2), and (3) are satisfied. It follows that $(\mathbb{Z}, +, \cdot)$ is an ordered ring.
Theorem 5.1: $(\mathbb{Q}, +, \cdot)$ is an ordered field.
Note: The proof of this result is a bit technical, but I am including it for completeness. The student just
starting out in pure mathematics can feel free to just accept this result and skip the proof.
Recall: (1) Rational numbers have the form $\frac{a}{b}$, where $a$ and $b$ are integers and $b \neq 0$.
(2) Two rational numbers $\frac{a}{b}$ and $\frac{c}{d}$ are equal if and only if $ad = bc$.
(3) For rational numbers $\frac{a}{b}$ and $\frac{c}{d}$, we define addition and multiplication by
$\frac{a}{b} + \frac{c}{d} = \frac{ad + bc}{bd}$ and $\frac{a}{b} \cdot \frac{c}{d} = \frac{ac}{bd}$.
(4) The additive inverse of $\frac{a}{b}$ is $-\frac{a}{b} = \frac{-a}{b}$.
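The four rules recalled above translate directly into integer arithmetic on pairs. The sketch below is illustrative only: it represents $\frac{a}{b}$ as the tuple `(a, b)` with `b != 0`, and the function names `rat_equal`, `rat_add`, `rat_mul`, and `rat_neg` are hypothetical.

```python
def rat_equal(p, q):
    """a/b equals c/d if and only if ad = bc."""
    (a, b), (c, d) = p, q
    return a * d == b * c

def rat_add(p, q):
    """a/b + c/d = (ad + bc)/(bd)."""
    (a, b), (c, d) = p, q
    return (a * d + b * c, b * d)

def rat_mul(p, q):
    """(a/b)(c/d) = (ac)/(bd)."""
    (a, b), (c, d) = p, q
    return (a * c, b * d)

def rat_neg(p):
    """-(a/b) = (-a)/b."""
    a, b = p
    return (-a, b)

half, third = (1, 2), (1, 3)
print(rat_add(half, third))                             # (5, 6)
print(rat_equal((6, 12), (2, 4)))                       # True
print(rat_equal(rat_add(half, rat_neg(half)), (0, 1)))  # True
```

Note that `rat_add(half, rat_neg(half))` returns `(0, 4)` rather than `(0, 1)`; the two pairs are different representations of the same rational number, which is why equality is tested with `rat_equal` instead of `==`.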
Analysis: Before writing out the proof in detail, let’s think about how we would go about it. First of all, we already know from Problem 11 in Lesson 4 that $(\mathbb{Q}, +, \cdot)$ is a field. So, we need only show that it is ordered. To do this, we need to come up with a set $P$ of positive elements from $\mathbb{Q}$. The natural choice would be to take the set of quotients whose numerator (number on the top) and denominator (number on the bottom) are both positive integers. In other words, we will let $P_{\mathbb{Q}}$ be the set of all the rational numbers of the form $\frac{m}{n}$, where $m$ and $n$ are both elements of $P_{\mathbb{Z}}$ (as defined in Example 5.1 above).
Since $\frac{-m}{-n} = \frac{m}{n}$ (because $(-m)n = (-n)m$), we must automatically be including all quotients whose numerator and denominator are both negative integers as well.
With this definition of $P_{\mathbb{Q}}$, it is straightforward to verify properties (1) and (2) of an ordered field.
To verify property (3), we need to check three things.
(i) For any rational number $a$, $a$ is positive, zero, or negative ($a \in P_{\mathbb{Q}}$, $a = 0$, or $-a \in P_{\mathbb{Q}}$). We will show this by assuming $a \notin P_{\mathbb{Q}}$ and $a \neq 0$, and then proving that we must have $-a \in P_{\mathbb{Q}}$.
(ii) For any rational number $a$, $a$ cannot be both positive and negative. We will show this by assuming $a \in P_{\mathbb{Q}}$ and $-a \in P_{\mathbb{Q}}$, and then deriving a contradiction.
(iii) A positive or negative rational number is not zero, and a rational number that is zero is not positive or negative. This is straightforward to check.
Let’s write out the details.
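Proof of Theorem 5.1: By Problem 11 from Lesson 4, $(\mathbb{Q}, +, \cdot)$ is a field.
Let $F = \mathbb{Q}$ and let $P_{\mathbb{Q}} = \{x \in \mathbb{Q} \mid x = \frac{m}{n}$ with $m, n \in P_{\mathbb{Z}}\}$. Let $a, b \in P_{\mathbb{Q}}$. Then there are $m, n, p, q \in P_{\mathbb{Z}}$ with $a = \frac{m}{n}$ and $b = \frac{p}{q}$. We have $a + b = \frac{m}{n} + \frac{p}{q} = \frac{mq + np}{nq}$. Since $P_{\mathbb{Z}}$ satisfies (2) above, we have $mq, np, nq \in P_{\mathbb{Z}}$. Since $P_{\mathbb{Z}}$ satisfies (1) above, we have $mq + np \in P_{\mathbb{Z}}$. Therefore, $a + b \in P_{\mathbb{Q}}$ and (1) holds. Also, we have $ab = \frac{m}{n} \cdot \frac{p}{q} = \frac{mp}{nq}$. Since $P_{\mathbb{Z}}$ satisfies (2) above, we have $mp, nq \in P_{\mathbb{Z}}$, and therefore, $ab \in P_{\mathbb{Q}}$ and (2) holds.
Now, suppose $a \notin P_{\mathbb{Q}}$ and $a \neq 0$. Since $a \in \mathbb{Q}$, there are $m \in \mathbb{Z}$ and $n \in \mathbb{Z}^*$ such that $a = \frac{m}{n}$. But $a \neq 0$, and so, we must have $m \in \mathbb{Z}^*$. Since $a \notin P_{\mathbb{Q}}$, either $m \notin P_{\mathbb{Z}}$ or $n \notin P_{\mathbb{Z}}$ (or both). If both $m \notin P_{\mathbb{Z}}$ and $n \notin P_{\mathbb{Z}}$, then we have $a = \frac{m}{n} = \frac{-m}{-n}$ (because $m(-n) = n(-m)$). Then $-m, -n \in P_{\mathbb{Z}}$, and so, $a \in P_{\mathbb{Q}}$, contrary to our assumption that $a \notin P_{\mathbb{Q}}$. If $m \notin P_{\mathbb{Z}}$ and $n \in P_{\mathbb{Z}}$, then $-m \in P_{\mathbb{Z}}$, and therefore, $-a = \frac{-m}{n} \in P_{\mathbb{Q}}$. If $m \in P_{\mathbb{Z}}$ and $n \notin P_{\mathbb{Z}}$, then $-n \in P_{\mathbb{Z}}$, and therefore, $-a = \frac{-m}{n} = \frac{m}{-n} \in P_{\mathbb{Q}}$. So, at least one of $a \in P_{\mathbb{Q}}$, $a = 0$, or $-a \in P_{\mathbb{Q}}$ holds.
If $a \in P_{\mathbb{Q}}$ and $-a \in P_{\mathbb{Q}}$, then $a = \frac{m}{n}$ and $-a = \frac{p}{q}$ with $m, n, p, q \in P_{\mathbb{Z}}$. We can also write $-a$ as $-a = \frac{-m}{n}$. So, $\frac{-m}{n} = \frac{p}{q}$, and thus, $(-m)q = np$. Since $n, p \in P_{\mathbb{Z}}$, we have $np \in P_{\mathbb{Z}}$. Since $(-m)q = np$, we must have $(-m)q \in P_{\mathbb{Z}}$. But $-m \notin P_{\mathbb{Z}}$, and so, $-(-m) \in P_{\mathbb{Z}}$. Since we also have $q \in P_{\mathbb{Z}}$, we must have $-((-m)q) = (-(-m))q \in P_{\mathbb{Z}}$. But then by (3) for $P_{\mathbb{Z}}$, $(-m)q \notin P_{\mathbb{Z}}$. This contradiction shows that we cannot have both $a \in P_{\mathbb{Q}}$ and $-a \in P_{\mathbb{Q}}$.
If $a \in P_{\mathbb{Q}}$, then $a = \frac{m}{n}$ with $m, n \in P_{\mathbb{Z}}$. So, $m \neq 0$, and therefore, $a \neq 0$. If $-a \in P_{\mathbb{Q}}$, then $-a = \frac{m}{n}$ with $m, n \in P_{\mathbb{Z}}$. If $a = 0$, then $-a = 0$, and so, $m = 0$. But $m \in P_{\mathbb{Z}}$, and so, $m \neq 0$. Thus, $a \neq 0$.
If $a = 0$, then we have $0 = \frac{0}{1} \notin P_{\mathbb{Q}}$ and $-0 = \frac{-0}{1} = \frac{0}{1} \notin P_{\mathbb{Q}}$.
It follows that $(\mathbb{Q}, +, \cdot)$ is an ordered field.
□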
If $(R, +, \cdot)$ is an ordered ring and $P$ is the set of positive elements from the ring, we will write $a > 0$ instead of $a \in P$ and $a < 0$ instead of $-a \in P$. If $a - b > 0$, we will write $a > b$ or $b < a$.
We write $a \geq 0$ if $a \in P$ or $a = 0$, we write $a \leq 0$ if $-a \in P$ or $a = 0$, and we write $a \geq b$ or $b \leq a$ if $a - b \geq 0$.
We may use the notation $(R, \leq)$ for an ordered ring, where $\leq$ is the relation defined in the last paragraph. Note that $+$ and $\cdot$ aren’t explicitly mentioned, but of course they are still part of the ring.
In the future, we may just use the name of the set for the whole structure when there is no danger of confusion. For example, we may refer to the ring $R$ or the ordered field $F$ instead of the ring $(R, +, \cdot)$ or the ordered field $(F, \leq)$.
Fields are particularly nice to work with because all the arithmetic and algebra we’ve learned through the years can be used in fields. For example, in the field of rational numbers, we can solve the equation $2x = 1$. The multiplicative inverse property allows us to do this. Indeed, the multiplicative inverse of 2 is $\frac{1}{2}$, and therefore, $x = \frac{1}{2}$ is a solution to the given equation. Compare this to the ring of integers. If we restrict ourselves to the integers, then the equation $2x = 1$ has no solution.
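The contrast between the field of rationals and the ring of integers can be seen with exact arithmetic. This is a small illustration using Python's standard `fractions.Fraction` type as a stand-in for $\mathbb{Q}$; the finite search over integers is only a spot check, not a proof.

```python
from fractions import Fraction

# In the field of rationals, 2 has a multiplicative inverse.
two = Fraction(2)
x = 1 / two
print(x, two * x == 1)   # 1/2 True

# In the ring of integers, no x satisfies 2x = 1, because 2x is always even.
# A finite search cannot prove this, but it illustrates the claim.
print(any(2 * n == 1 for n in range(-1000, 1001)))   # False
```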
Working with ordered fields is very nice as well. In the problem set below, you will be asked to derive
some additional properties of fields and ordered fields that follow from the axioms. We will prove a
few of these properties now as examples.
Theorem 5.2: Let $(F, \leq)$ be an ordered field. Then for all $x \in F^*$, $x \cdot x > 0$.
Proof: There are two cases to consider: (i) If $x > 0$, then $x \cdot x > 0$ by property (2) of an ordered field.
(ii) If $x < 0$, then $-x > 0$, and so, $(-x)(-x) > 0$, again by property (2) of an ordered field. Now, using Problem 3 (parts (vi) and (vii)) in the problem set below, together with commutativity and associativity of multiplication, and the multiplicative identity property, we have
$(-x)(-x) = (-1x)(-1x) = (-1)(-1)x \cdot x = 1(x \cdot x) = x \cdot x$.
So, again we have $x \cdot x > 0$.
□
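Theorem 5.3: Every ordered field $(F, \leq)$ contains a copy of the natural numbers. Specifically, $F$ contains a subset $\overline{\mathbb{N}} = \{\overline{n} \mid n \in \mathbb{N}\}$ such that for all $n, m \in \mathbb{N}$, we have $\overline{n + m} = \overline{n} + \overline{m}$, $\overline{n \cdot m} = \overline{n} \cdot \overline{m}$, and $n < m \leftrightarrow \overline{n} < \overline{m}$.
Proof: Let $(F, \leq)$ be an ordered field. By the definition of a field, $0, 1 \in F$ and $0 \neq 1$.
We let $\overline{0} = 0$ and $\overline{n} = 1 + 1 + \cdots + 1$, where 1 appears $n$ times. Let $\overline{\mathbb{N}} = \{\overline{n} \mid n \in \mathbb{N}\}$. Then $\overline{\mathbb{N}} \subseteq F$.
We first prove by induction on $m$ that for all $n, m \in \mathbb{N}$, $\overline{n + m} = \overline{n} + \overline{m}$.
Base case ($m = 0$): $\overline{n + 0} = \overline{n} = \overline{n} + 0 = \overline{n} + \overline{0}$.
Inductive step: Suppose that $\overline{n + k} = \overline{n} + \overline{k}$. Then we have
$\overline{n + (k + 1)} = \overline{(n + k) + 1} = \overline{n + k} + 1 = (\overline{n} + \overline{k}) + 1 = \overline{n} + (\overline{k} + 1) = \overline{n} + \overline{k + 1}$.
By the Principle of Mathematical Induction, for all natural numbers $m$, $\overline{n + m} = \overline{n} + \overline{m}$.
Similarly, we prove by induction on $m$ that for all $n, m \in \mathbb{N}$, $\overline{n \cdot m} = \overline{n} \cdot \overline{m}$.
Base case ($m = 0$): $\overline{n \cdot 0} = \overline{0} = \overline{n} \cdot \overline{0}$.
Inductive step: Suppose that $\overline{n \cdot k} = \overline{n} \cdot \overline{k}$. Then we have
$\overline{n \cdot (k + 1)} = \overline{nk + n} = \overline{nk} + \overline{n} = \overline{n} \cdot \overline{k} + \overline{n} = \overline{n}(\overline{k} + 1) = \overline{n}(\overline{k} + \overline{1}) = \overline{n} \cdot \overline{k + 1}$.
By the Principle of Mathematical Induction, for all natural numbers $m$, $\overline{n \cdot m} = \overline{n} \cdot \overline{m}$.
We now wish to prove that for all $n, m \in \mathbb{N}$, $n < m \leftrightarrow \overline{n} < \overline{m}$.
We first note that for all $n \in \mathbb{N}$, $\overline{n + 1} > \overline{n}$ because $\overline{n + 1} - \overline{n} = \overline{n} + 1 - \overline{n} = 1 = 1 \cdot 1 > 0$ by Theorem 5.2.
We now prove by induction on $n$ that for all $n \in \mathbb{N}$ with $n > 0$, we have $\overline{n} > 0$.
Base case ($n = 1$): $\overline{1} = 1 = 1 \cdot 1 > 0$ by Theorem 5.2.
Inductive step: Assume that $\overline{k} > 0$. Then $\overline{k + 1} = \overline{k} + \overline{1} = \overline{k} + 1 > 0$. Here we have used Order Property 1 together with $\overline{k} > 0$ and $1 > 0$.
By the Principle of Mathematical Induction, for all natural numbers $n$ with $n > 0$, we have $\overline{n} > 0$.
Conversely, if $\overline{n} > 0$, then $n \neq 0$ (because $\overline{0} = 0$). Since $\overline{n}$ is defined only for $n \geq 0$, we have $n > 0$.
So, we have shown that for $n \in \mathbb{N}$, $n > 0$ if and only if $\overline{n} > 0$.
Next, note that if $n < m$, then $\overline{m} = \overline{(m - n) + n} = \overline{m - n} + \overline{n}$. It follows that $\overline{m - n} = \overline{m} - \overline{n}$.
Finally, we have $n < m \leftrightarrow m - n > 0 \leftrightarrow \overline{m - n} > 0 \leftrightarrow \overline{m} - \overline{n} > 0 \leftrightarrow \overline{m} > \overline{n} \leftrightarrow \overline{n} < \overline{m}$.
□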
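The copy of the natural numbers built in this proof can be spot-checked inside a concrete ordered field. In the sketch below, Python's `fractions.Fraction` stands in for the field, and the hypothetical function `bar` builds the element $1 + 1 + \cdots + 1$ by repeated addition, exactly as in the definition. Checking finitely many cases is an illustration, not a substitute for the induction.

```python
from fractions import Fraction

def bar(n, one=Fraction(1), zero=Fraction(0)):
    """Return 1 + 1 + ... + 1 (n summands), computed inside the field."""
    total = zero
    for _ in range(n):
        total = total + one
    return total

N = range(8)
# Addition, multiplication, and order are all preserved on this sample.
print(all(bar(n + m) == bar(n) + bar(m) for n in N for m in N))    # True
print(all(bar(n * m) == bar(n) * bar(m) for n in N for m in N))    # True
print(all((n < m) == (bar(n) < bar(m)) for n in N for m in N))     # True
```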
Notes: (1) The function that sends $n \in \mathbb{N}$ to $\overline{n} \in \overline{\mathbb{N}}$ is called an isomorphism. It has the following properties: (i) $\overline{n + m} = \overline{n} + \overline{m}$, (ii) $\overline{n \cdot m} = \overline{n} \cdot \overline{m}$, and (iii) $n < m$ if and only if $\overline{n} < \overline{m}$. The function gives a one-to-one correspondence between the elements of $\mathbb{N}$ and the elements of $\overline{\mathbb{N}}$.
So, when we say that every ordered field contains a “copy” of the natural numbers, we mean that there is a subset $\overline{\mathbb{N}}$ of the field so that $(\overline{\mathbb{N}}, \leq)$ is isomorphic to $(\mathbb{N}, \leq)$ (note that addition and multiplication are preserved as well, even though they’re not explicitly mentioned in the notation).
(2) We will formally introduce isomorphisms in Lesson 11.
Theorem 5.4: Let $(F, \leq)$ be an ordered field and let $x \in F$ with $x > 0$. Then $\frac{1}{x} > 0$.
Proof: Since $x \neq 0$, $\frac{1}{x} = x^{-1}$ exists and is nonzero.
Assume toward contradiction that $\frac{1}{x} < 0$. Then $-\frac{1}{x} > 0$. Using Problem 3 (part (vi)) from the problem set below, together with commutativity and associativity of multiplication, the multiplicative inverse property, and the multiplicative identity property, $x(-\frac{1}{x}) = x(-1)x^{-1} = -1xx^{-1} = -1 \cdot 1 = -1$.
Since $x > 0$ and $-\frac{1}{x} > 0$, we have $-1 = x(-\frac{1}{x}) > 0$. So, $1 \not> 0$. But by Theorem 5.2, $1 = 1 \cdot 1 > 0$. This is a contradiction. Therefore, $\frac{1}{x} > 0$.
□
Why Isn’t $\mathbb{Q}$ Enough?
At first glance, it would appear that the ordered field of rational numbers would be sufficient to solve all “real world” problems. However, a long time ago, a group of people called the Pythagoreans showed that this was not the case. The problem was first discovered when applying the now well-known Pythagorean Theorem.
Theorem 5.5 (Pythagorean Theorem): In a right triangle with legs of lengths $a$ and $b$, and a hypotenuse of length $c$, $c^2 = a^2 + b^2$.
The picture to the right shows a right triangle. The vertical and horizontal segments (labeled $a$ and $b$, respectively) are called the legs of the right triangle, and the side opposite the right angle (labeled $c$) is called the hypotenuse of the right triangle.
There are many ways to prove the Pythagorean Theorem. Here, we will provide a simple geometric argument. For the proof we will want to recall that the area of a square with side length $s$ is $A = s^2$, and the area of a triangle with base $b$ and height $h$ is $A = \frac{1}{2}bh$. Notice that in our right triangle drawn here, the base is labeled $b$ (how convenient), and the height is labeled $a$. So, the area of this right triangle is $A = \frac{1}{2}ba = \frac{1}{2}ab$.
Proof of Theorem 5.5: We draw 2 squares, each of side length $a + b$, by rearranging 4 copies of the given triangle in 2 different ways:
[Figure: two squares of side length $a + b$, each containing 4 copies of the triangle]
We can get the area of each of these squares by adding the areas of all the figures that comprise each square.
The square on the left consists of 4 copies of the given right triangle, a square of side length $a$ and a square of side length $b$. It follows that the area of this square is $4 \cdot \frac{1}{2}ab + a^2 + b^2 = 2ab + a^2 + b^2$.
The square on the right consists of 4 copies of the given right triangle, and a square of side length $c$. It follows that the area of this square is $4 \cdot \frac{1}{2}ab + c^2 = 2ab + c^2$.
Since the areas of both squares of side length $a + b$ are equal (both areas are equal to $(a + b)^2$), $2ab + a^2 + b^2 = 2ab + c^2$. Cancelling $2ab$ from each side of this equation yields $a^2 + b^2 = c^2$.
□
Question: In a right triangle where both legs have length 1, what is the length of the hypotenuse?
Let’s try to answer this question. If we let $c$ be the length of the hypotenuse of the triangle, then by the Pythagorean Theorem, we have $c^2 = 1^2 + 1^2 = 1 + 1 = 2$. Since $c^2 = c \cdot c$, we need to find a number with the property that when you multiply that number by itself you get 2. The Pythagoreans showed that if we use only numbers in $\mathbb{Q}$, then no such number exists.
Theorem 5.6: There does not exist a rational number $a$ such that $a^2 = 2$.
Analysis: We will prove this Theorem by assuming that there is a rational number $a$ such that $a^2 = 2$, and arguing until we reach a contradiction. A first attempt at a proof would be to let $a = \frac{m}{n} \in \mathbb{Q}$ satisfy $(\frac{m}{n})^2 = 2$. It follows that $m^2 = 2n^2$ (because $\frac{m^2}{n^2} = \frac{m \cdot m}{n \cdot n} = \frac{m}{n} \cdot \frac{m}{n} = (\frac{m}{n})^2$ and $2 = \frac{2}{1}$, so $\frac{m^2}{n^2} = \frac{2}{1}$, and therefore $m^2 = 2n^2$), showing that $m^2$ is even. We will then use this information to show that both $m$ and $n$ are even (at this point, you may want to try to use the two statements “$m^2 = 2n^2$” and “$m^2$ is even” to prove this yourself).
Now, in our first attempt, the fact that $m$ and $n$ both turned out to be even did not produce a contradiction. However, we can modify the beginning of the argument to make this happen.
Remember that every rational number has infinitely many representations. For example, $\frac{6}{12}$ is the same rational number as $\frac{2}{4}$ (because $6 \cdot 4 = 12 \cdot 2$). Notice that in both representations, the numerator (number on the top) and the denominator (number on the bottom) are even. However, they are both equivalent to $\frac{1}{2}$, which has the property that the numerator is not even.
In Problem 9 below, you will be asked to show that every rational number can be written in the form $\frac{m}{n}$, where at least one of $m$ or $n$ is not even. We can now adjust our argument to get the desired contradiction.
Proof of Theorem 5.6: Assume, toward contradiction, that there is a rational number $a$ such that $a^2 = 2$. Since $a$ is a rational number, there are $m \in \mathbb{Z}$ and $n \in \mathbb{Z}^*$, not both even, so that $a = \frac{m}{n}$.
So, we have $\frac{m^2}{n^2} = \frac{m \cdot m}{n \cdot n} = \frac{m}{n} \cdot \frac{m}{n} = a \cdot a = a^2 = 2 = \frac{2}{1}$. Thus, $m^2 \cdot 1 = n^2 \cdot 2$. So, $m^2 = 2n^2$. Therefore, $m^2$ is even. If $m$ were odd, then by Theorem 4.4 (from Lesson 4), $m^2 = m \cdot m$ would be odd. So, $m$ is even.
Since $m$ is even, there is $k \in \mathbb{Z}$ such that $m = 2k$. Replacing $m$ by $2k$ in the equation $m^2 = 2n^2$ gives us $2n^2 = m^2 = (2k)^2 = (2k)(2k) = 2(k(2k))$. So, $n^2 = k(2k) = (k \cdot 2)k = (2k)k = 2(k \cdot k)$. So, we see that $n^2$ is even, and again by Theorem 4.4, $n$ is even.
So, we have $m$ even and $n$ even, contrary to our original assumption that $m$ and $n$ are not both even. Therefore, there is no rational number $a$ such that $a^2 = 2$.
□
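The key equation in the proof is $m^2 = 2n^2$. A brute-force search over small positive integers finds no solutions, which is consistent with the theorem. This is only a sketch for intuition: the function name `square_is_two` and the bound 300 are arbitrary choices, and a finite search proves nothing about all integers.

```python
def square_is_two(limit):
    """Return all pairs (m, n) with 1 <= m, n <= limit and m*m == 2*n*n."""
    return [(m, n)
            for n in range(1, limit + 1)
            for m in range(1, limit + 1)
            if m * m == 2 * n * n]

print(square_is_two(300))   # []
```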
So, the big question is, “Is there an ordered field $F$ with $F$ containing $\mathbb{Q}$ and $a \in F$ such that $a^2 = 2$?”
Spoiler Alert! There is! We call it $\mathbb{R}$, the ordered field of real numbers.
Completeness
Let $(F, \leq)$ be an ordered field and let $S$ be a nonempty subset of $F$. We say that $S$ is bounded above if there is $M \in F$ such that for all $s \in S$, $s \leq M$. Each such number $M$ is called an upper bound of $S$.
In words, an upper bound of a set $S$ is simply an element from the field that is at least as big as every element in $S$.
Similarly, we say that $S$ is bounded below if there is $K \in F$ such that for all $s \in S$, $K \leq s$. Each such number $K$ is called a lower bound of $S$.
In words, a lower bound of a set $S$ is simply an element from the field that is no bigger than any element in $S$.
We will say that $S$ is bounded if it is both bounded above and bounded below. Otherwise $S$ is unbounded.
A least upper bound of a set $S$ is an upper bound that is smaller than any other upper bound of $S$, and a greatest lower bound of $S$ is a lower bound that is larger than any other lower bound of $S$.
Example 5.2: Let $(F, \leq)$ be an ordered field with $\mathbb{Q} \subseteq F$.
Note: The only two examples of $F$ that we are interested in right now are $\mathbb{Q}$ (the set of rational numbers) and $\mathbb{R}$ (the set of real numbers). Although we haven’t finished defining the real numbers, you probably have some intuition as to what they look like. After all, this is the number system you have used throughout high school. As you look at the set in each example below, think about what it looks like as a subset of $\mathbb{Q}$ and as a subset of $\mathbb{R}$.
1. $S = \{1, 2, 3, 4, 5\}$ is bounded.
5 is an upper bound of $S$, as is any number larger than 5. The number 5 is special in the sense that there are no upper bounds smaller than it. So, 5 is the least upper bound of $S$.
Similarly, 1 is a lower bound of $S$, as is any number smaller than 1. The number 1 is the greatest lower bound of $S$ because there are no lower bounds larger than it.
Notice that the least upper bound and greatest lower bound of $S$ are inside the set $S$ itself. This will always happen when the set $S$ is finite.
2. $T = \{x \in F \mid -2 < x \leq 2\}$ is also bounded. Any number greater than or equal to 2 is an upper bound of $T$, and any number less than or equal to $-2$ is a lower bound of $T$.
2 is the least upper bound of $T$ and $-2$ is the greatest lower bound of $T$.
Note that the least upper bound of $T$ is in $T$, whereas the greatest lower bound of $T$ is not in $T$.
3. $U = \{x \in F \mid x < -3\}$ is bounded above by any number greater than or equal to $-3$, and $-3$ is the least upper bound of $U$. The set $U$ is not bounded below, and therefore, $U$ is unbounded.
4. $V = \{x \in F \mid x^2 < 2\}$ is bounded above by 2. To see this, note that if $x > 2$, then $x^2 > 4 \geq 2$, and therefore, $x \notin V$. Any number greater than 2 is also an upper bound.
Is 2 the least upper bound of $V$? It’s not! For example, $\frac{3}{2}$ is also an upper bound. Indeed, if $x > \frac{3}{2}$, then $x^2 > \frac{9}{4} \geq 2$ (the reader should verify that for all $a, b \in \mathbb{R}^+$, $a > b \rightarrow a^2 > b^2$).
Does $V$ have a least upper bound? A moment’s thought might lead you to suspect that a least upper bound $M$ would satisfy $M^2 = 2$. And it turns out that you are right! (Proving this, however, is quite difficult). Clearly, this least upper bound $M$ is not in the set $V$. The big question is “Does $M$ exist at all?”
Well, if $F = \mathbb{Q}$, then by Theorem 5.6, $M$ does not exist in $F$. In this case, $V$ is an example of a set which is bounded above in $\mathbb{Q}$, but has no least upper bound in $\mathbb{Q}$.
So, if we want an ordered field $F$ containing $\mathbb{Q}$ where $M$ does exist, we can insist that $F$ has the property that any set which is bounded above in $F$ has a least upper bound in $F$. It turns out that there is exactly one such ordered field (up to renaming the elements) and we call it the ordered field of real numbers, $\mathbb{R}$.
Many authors use the term supremum for “least upper bound” and infimum for “greatest lower bound,” and they may write $\sup A$ and $\inf A$ for the supremum and infimum of a set $A$, respectively (if they exist).
In the examples above, we stated the least upper bound and greatest lower bound of the sets $S$, $T$, $U$, and $V$ without proof. Intuitively, it seems reasonable that those numbers are correct. Let’s do one of the examples carefully.
Theorem 5.7: Let $U = \{x \in F \mid x < -3\}$. Then $\sup U = -3$.
Analysis: We need to show that $-3$ is an upper bound of $U$, and that any number less than $-3$ is not an upper bound of $U$. That $-3$ is an upper bound of $U$ follows immediately from the definition of $U$.
The harder part of the argument is showing that a number less than $-3$ is not an upper bound of $U$. However, conceptually it’s not hard to see that this is true. If $a < -3$, we simply need to find some number $x$ between $a$ and $-3$. Here is a picture of the situation.
[Figure: number line showing $x$ between $a$ and $-3$]
Notice that $a$ can be very close to $-3$ and we don’t know exactly what $a$ is; we know only that it’s less than $-3$. So, we need to be careful how we choose $x$. The most natural choice for $x$ would be to go midway between $a$ and $-3$. In other words, we can take the average of $a$ and $-3$. So, we will let $x = \frac{1}{2}(a + (-3))$. Then we just need to verify that $a < x$ and that $x \in U$ (that is, $x < -3$).
Proof of Theorem 5.7: If $x \in U$, then $x < -3$ by definition, and so, $-3$ is an upper bound of $U$.
Suppose that $a < -3$ (or equivalently, $-a - 3 > 0$). We want to show that $a$ is not an upper bound of $U$. To do this, we let $x = \frac{1}{2}(a - 3) = 2^{-1}(a + (-3))$. $x \in F$ because $F$ is closed under addition and multiplication, and the multiplicative inverse property holds in $F^*$. We will show that $a < x < -3$.
$x - a = \frac{1}{2}(a - 3) - a = \frac{1}{2}(a - 3) - \frac{1}{2}(2a) = \frac{1}{2}(a - 3 - 2a) = \frac{1}{2}(a - 2a - 3) = \frac{1}{2}(-a - 3)$.
Since $\frac{1}{2} > 0$ (by Theorem 5.4) and $-a - 3 > 0$, it follows that $x - a > 0$, and therefore, $x > a$.
$-3 - x = -3 - \frac{1}{2}(a - 3) = \frac{1}{2}(-6) - \frac{1}{2}a + \frac{1}{2} \cdot 3 = \frac{1}{2}(-6 - a + 3) = \frac{1}{2}(-a - 3)$.
Again, since $\frac{1}{2} > 0$ and $-a - 3 > 0$, it follows that $-3 - x > 0$, and therefore, $x < -3$. Thus, $x \in U$.
So, we found an element $x \in U$ (because $x < -3$) with $a < x$. This shows that $a$ is not an upper bound of $U$. It follows that $-3 = \sup U$.
□
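The midpoint construction in this proof is easy to test on sample values. The sketch below works in $\mathbb{Q}$ using Python's `fractions.Fraction`; the function name `witness` and the three sample values of $a$ are illustrative assumptions, not part of the original argument.

```python
from fractions import Fraction

def witness(a):
    """Given a < -3, return x = (1/2)(a + (-3)), the midpoint of a and -3."""
    return Fraction(1, 2) * (a + Fraction(-3))

# Even when a is very close to -3, the midpoint lies strictly between a and -3,
# so x is an element of U that is larger than a.
for a in [Fraction(-4), Fraction(-31, 10), Fraction(-3000001, 1000000)]:
    x = witness(a)
    print(a, x, a < x < -3)
```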
An ordered field $(F, \leq)$ has the Completeness Property if every nonempty subset of $F$ that is bounded above in $F$ has a least upper bound in $F$. In this case, we say that $(F, \leq)$ is a complete ordered field.
Theorem 5.8: There is exactly one complete ordered field (up to renaming the elements).
The proof of Theorem 5.8 is quite long and requires some machinery that we haven’t yet developed. We will therefore accept it as true for the purpose of this book, and we let $\mathbb{R}$ be the unique complete ordered field guaranteed to exist by the theorem.
We will finish this section by proving two useful theorems about the complete ordered field $\mathbb{R}$.
Theorem 5.9 (The Archimedean Property of $\mathbb{R}$): For every $x \in \mathbb{R}$, there is $n \in \mathbb{N}$ such that $n > x$.
In other words, the Archimedean Property says that the set of natural numbers is unbounded in the reals. In particular, the set of natural numbers is not bounded from above in the set of real numbers.
We will prove this theorem by contradiction using the Completeness Property of the reals. If we (wrongly) assume that the set of natural numbers is bounded from above, then the Completeness Property of the reals gives us a least upper bound $x$. Since $x$ is a least upper bound, $x - 1$ is not an upper bound. Do you see the problem yet? If $x - 1 < n \in \mathbb{N}$, then $x < n + 1$. But then $x$ is not an upper bound for the set of natural numbers, contrary to our assumption. Let’s write out the details.
Proof: Suppose toward contradiction that $\mathbb{N}$ is bounded from above. By the Completeness Property of $\mathbb{R}$, $x = \sup \mathbb{N}$ exists. Since $x - 1$ is not an upper bound for $\mathbb{N}$, there is $n \in \mathbb{N}$ such that $x - 1 < n$. Then we have $x = x + (-1 + 1) = (x - 1) + 1 < n + 1$. Since $\mathbb{N}$ is closed under addition, $n + 1 \in \mathbb{N}$. So, $x$ is not an upper bound for $\mathbb{N}$, contradicting the fact that $x = \sup \mathbb{N}$. It follows that $\mathbb{N}$ is not bounded from above. So, for every $x \in \mathbb{R}$, there is $n \in \mathbb{N}$ such that $n > x$.
□
Theorem 5.10 (The Density Theorem): If $x, y \in \mathbb{R}$ with $x < y$, then there is $q \in \mathbb{Q}$ with $x < q < y$.
In other words, the Density Theorem says that between any two real numbers we can always find a rational number. We say that $\mathbb{Q}$ is dense in $\mathbb{R}$.
To help understand the proof, let’s first run a simple simulation using a specific example. Let’s let $x = \frac{16}{3}$ and $y = \frac{17}{3}$. We begin by subtracting to get $y - x = \frac{1}{3}$. This is the distance between $x$ and $y$. We wish to find a natural number $n$ such that $\frac{1}{n}$ is smaller than this distance. In other words, we want $\frac{1}{n} < \frac{1}{3}$, or equivalently, $n > 3$. So, we can let $n$ be any natural number greater than 3, say $n = 4$. We now want to “shift” $\frac{1}{n} = \frac{1}{4}$ to the right to get a rational number between $x$ and $y$. We can do this as follows. We multiply $n$ times $x$ to get $nx = 4 \cdot \frac{16}{3} = \frac{64}{3}$. We then let $m$ be the least integer greater than $nx$. So, $m = \frac{66}{3} = 22$. Finally, we let $q = \frac{m}{n} = \frac{22}{4} = \frac{11}{2}$. And we did it! Indeed, we have $\frac{16}{3} < \frac{11}{2} < \frac{17}{3}$. The reader should confirm that these inequalities hold. Let’s write out the details of the proof.
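The simulation above can be turned into a short procedure. The sketch below is a hedged illustration restricted to rational inputs with $0 \leq x < y$ (real numbers in general cannot be represented exactly); the function name `rational_between` is hypothetical, and it always picks the smallest valid $n$, whereas the text allows any natural number $n$ with $\frac{1}{n} < y - x$.

```python
import math
from fractions import Fraction

def rational_between(x, y):
    """Return a rational q with x < q < y, for rationals 0 <= x < y."""
    z = y - x                    # distance between x and y
    n = math.floor(1 / z) + 1    # a natural number with n > 1/z, so 1/n < z
    m = math.floor(n * x) + 1    # the least integer greater than n*x
    return Fraction(m, n)

q = rational_between(Fraction(16, 3), Fraction(17, 3))
print(q)                                      # 11/2
print(Fraction(16, 3) < q < Fraction(17, 3))  # True
```

With $x = \frac{16}{3}$ and $y = \frac{17}{3}$ the procedure chooses $n = 4$ and $m = 22$, reproducing the values found by hand.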
Proof: Let’s first consider the case where $0 \leq x < y$. Let $z = y - x = y + (-x)$. Since $\mathbb{R}$ has the additive inverse property and is closed under addition, $z \in \mathbb{R}$. Also, $z > 0$. By the Archimedean Property, there is $n \in \mathbb{N}$ such that $n > \frac{1}{z}$. Using Problem 5 (part (v)) in the problem set below, we have $\frac{1}{n} < z$. By the Archimedean Property once again, there is $m \in \mathbb{N}$ such that $m > nx$. Therefore, $\frac{m}{n} > x$ (Check this!). So, $\{m \in \mathbb{N} \mid \frac{m}{n} > x\} \neq \emptyset$. By the Well Ordering Principle, $\{m \in \mathbb{N} \mid \frac{m}{n} > x\}$ has a least element, let’s call it $k$. Since $k > 0$ (because $x \geq 0$ and $n > 0$) and $k$ is the least natural number such that $\frac{k}{n} > x$, it follows that $k - 1 \in \mathbb{N}$ and $\frac{k - 1}{n} \leq x$, or equivalently, $\frac{k}{n} - \frac{1}{n} \leq x$. Therefore, we have $\frac{k}{n} \leq x + \frac{1}{n} < x + z = x + (y - x) = y$. Thus, $x < \frac{k}{n} < y$. Since $k, n \in \mathbb{N}$, we have $\frac{k}{n} \in \mathbb{Q}$.
Now, we consider the case where $x < 0$ and $x < y$. By the Archimedean Property, there is $t \in \mathbb{N}$ such that $t > -x$. Then, we have $0 < x + t < y + t$. So, $x + t$ and $y + t$ satisfy the first case above. Thus, there is $q \in \mathbb{Q}$ with $x + t < q < y + t$. It follows that $x < q - t < y$. Since $t \in \mathbb{N}$, $-t \in \mathbb{Z}$. Since $\mathbb{Z} \subseteq \mathbb{Q}$, $-t \in \mathbb{Q}$. So, we have $q, -t \in \mathbb{Q}$. Since $\mathbb{Q}$ is closed under addition, $q - t = q + (-t) \in \mathbb{Q}$.
□
Problem Set 5
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. The addition and multiplication tables below are defined on the set $S = \{0, 1, 2\}$. Show that $(S, +, \cdot)$ does not define a field.
 + | 0 1 2
 0 | 0 1 2
 1 | 1 2 0
 2 | 2 0 1

 ⋅ | 0 1 2
 0 | 0 0 0
 1 | 0 1 2
 2 | 0 2 2
2. Let $F = \{0, 1\}$, where $0 \neq 1$. Show that there is exactly one field $(F, +, \cdot)$, where 0 is the additive identity and 1 is the multiplicative identity.
LEVEL 2
3. Let $(F, +, \cdot)$ be a field. Prove each of the following:
(i) If $a, b \in F$ with $a + b = b$, then $a = 0$.
(ii) If $a \in F$, $b \in F^*$, and $ab = b$, then $a = 1$.
(iii) If $a \in F$, then $a \cdot 0 = 0$.
(iv) If $a \in F^*$, $b \in F$, and $ab = 1$, then $b = \frac{1}{a}$.
(v) If $a, b \in F$ and $ab = 0$, then $a = 0$ or $b = 0$.
(vi) If $a \in F$, then $-a = -1a$.
(vii) $(-1)(-1) = 1$.
4. Let $(F, +, \cdot)$ be a field with $\mathbb{N} \subseteq F$. Prove that $\mathbb{Q} \subseteq F$.
LEVEL 3
5. Let $(F, \leq)$ be an ordered field. Prove each of the following:
(i) If $a, b \in F$, exactly one of the following holds: $a < b$, $a = b$, or $a > b$.
(ii) If $a, b \in F$, $a \leq b$, and $b \leq a$, then $a = b$.
(iii) If $a, b, c \in F$, $a < b$, and $b < c$, then $a < c$.
(iv) If $a, b, c \in F$, $a \leq b$, and $b \leq c$, then $a \leq c$.
(v) If $a, b \in F^+$ and $a > b$, then $\frac{1}{a} < \frac{1}{b}$.
(vi) If $a, b \in F$, then $a > b$ if and only if $-a < -b$.
(vii) If $a, b \in F$, then $a \geq b$ if and only if $-a \leq -b$.
6. Let $(F, +, \cdot)$ be a field. Show that $(F, \cdot)$ is a commutative monoid.
LEVEL 4
7. Prove that there is no smallest positive real number.
8. Let $a$ be a nonnegative real number. Prove that $a = 0$ if and only if $a$ is less than every positive real number. (Note: $a$ nonnegative means that $a$ is positive or zero.)
9. Prove that every rational number can be written in the form $\frac{m}{n}$, where $m \in \mathbb{Z}$, $n \in \mathbb{Z}^*$, and at least one of $m$ or $n$ is not even.
LEVEL 5
10. Show that every nonempty set of real numbers that is bounded below has a greatest lower bound in $\mathbb{R}$.
11. Show that between any two real numbers there is a real number that is not rational.
12. Let $T = \{x \in F \mid -2 < x \leq 2\}$. Prove $\sup T = 2$ and $\inf T = -2$.
CHALLENGE PROBLEM
13. Let $V = \{x \in F \mid x^2 < 2\}$ and let $a = \sup V$. Prove that $a^2 = 2$.
LESSON 6 – TOPOLOGY
THE TOPOLOGY OF $\mathbb{R}$
Intervals of Real Numbers
A set $I$ of real numbers is called an interval if any real number that lies between two numbers in $I$ is also in $I$. Symbolically, we can write
$\forall x, y \in I \ \forall z \in \mathbb{R} \ (x < z < y \rightarrow z \in I)$.
The expression above can be read “For all $x, y$ in $I$ and all $z \in \mathbb{R}$, if $x$ is less than $z$ and $z$ is less than $y$, then $z$ is in $I$.”
Example 6.1:
1. The set $A = \{0, 1\}$ is not an interval. $A$ consists of just the two real numbers 0 and 1. There are infinitely many real numbers between 0 and 1. For example, the real number $\frac{1}{2}$ satisfies $0 < \frac{1}{2} < 1$, but $\frac{1}{2} \notin A$.
2. $\mathbb{R}$ is an interval. This follows trivially from the definition. If we replace $I$ by $\mathbb{R}$, we get $\forall x, y \in \mathbb{R} \ \forall z \in \mathbb{R} \ (x < z < y \rightarrow z \in \mathbb{R})$. In other words, if we start with two real numbers, and take a real number between them, then that number is a real number (which we already said).
When we are thinking of $\mathbb{R}$ as an interval, we sometimes use the notation $(-\infty, \infty)$ and refer to this as the real line. The following picture gives the standard geometric interpretation of the real line.
[Figure: the real line]
In addition to the real line, there are 8 other types of intervals.
Open Interval: $(a, b) = \{x \in \mathbb{R} \mid a < x < b\}$
Closed Interval: $[a, b] = \{x \in \mathbb{R} \mid a \leq x \leq b\}$
Half-open Intervals: $(a, b] = \{x \in \mathbb{R} \mid a < x \leq b\}$
$[a, b) = \{x \in \mathbb{R} \mid a \leq x < b\}$
Infinite Open Intervals: $(a, \infty) = \{x \in \mathbb{R} \mid x > a\}$
$(-\infty, b) = \{x \in \mathbb{R} \mid x < b\}$
Infinite Closed Intervals: $[a, \infty) = \{x \in \mathbb{R} \mid x \geq a\}$
$(-\infty, b] = \{x \in \mathbb{R} \mid x \leq b\}$
It’s easy to check that each of these eight types of sets satisfies the definition of being an interval. Conversely, every interval has one of these nine forms. This will follow immediately from Theorem 6.1 and Problem 4 below.
Note that the first four intervals above (the open, closed, and two half-open intervals) are bounded. They are each bounded below by $a$ and bounded above by $b$. In fact, for each of these intervals, $a$ is the greatest lower bound and $b$ is the least upper bound. Using the notation from Lesson 5, we have for example, $a = \inf(a, b)$ and $b = \sup(a, b)$.
Example 6.2:
1. The half-open interval (– 2,1] = {๐ฅ ∈ โ | – 2 < ๐ฅ ≤ 1} has the following graph:
2. The infinite open interval (0, ∞) = {๐ฅ ∈ โ | ๐ฅ > 0} has the following graph:
Theorem 6.1: If an interval I is bounded, then there are a, b ∈ ℝ such that one of the following holds:
I = (a, b), I = [a, b], I = (a, b], or I = [a, b).
Analysis: We will prove this by letting ๐ = inf ๐ผ and ๐ = sup ๐ผ (in other words, ๐ is the greatest lower
bound of ๐ผ and ๐ is the least upper bound of ๐ผ), and then doing each of the following:
(1) We will show ๐ผ ⊆ [๐, ๐].
(2) We will show (๐, ๐) ⊆ ๐ผ.
(3) We will then look at 4 different cases. As one sample case, if ๐, ๐ ∈ ๐ผ, then we will have
๐ผ ⊆ [๐, ๐] and [๐, ๐] ⊆ ๐ผ. It then follows from the “Axiom of Extensionality” that ๐ผ = [๐, ๐].
Recall: Given sets ๐ and ๐, the Axiom of Extensionality says that ๐ and ๐ are the same set if and only
if ๐ and ๐ have precisely the same elements (See the technical note following Theorem 2.5 in Lesson
2). In symbols,
๐ = ๐ if and only if ∀๐ฅ(๐ฅ ∈ ๐ ↔ ๐ฅ ∈ ๐).
Since ∀๐ฅ(๐ฅ ∈ ๐ ↔ ๐ฅ ∈ ๐) is logically equivalent to ∀๐ฅ(๐ฅ ∈ ๐ → ๐ฅ ∈ ๐) ∧ ∀๐ฅ(๐ฅ ∈ ๐ → ๐ฅ ∈ ๐), we
have
๐ = ๐ if and only if ∀๐ฅ(๐ฅ ∈ ๐ → ๐ฅ ∈ ๐) and ∀๐ฅ(๐ฅ ∈ ๐ → ๐ฅ ∈ ๐).
Therefore, to show that ๐ = ๐, we can instead show that ๐ ⊆ ๐ and ๐ ⊆ ๐. This is the approach we
will take in the proof below.
Proof of Theorem 6.1: Let ๐ผ be a bounded interval. Since ๐ผ is bounded, by the Completeness of โ, ๐ผ has
a least upper bound ๐. By Problem 10 in Lesson 5, ๐ผ has a greatest lower bound ๐. If ๐ฅ ∈ ๐ผ, then by the
definitions of upper bound and lower bound, we have ๐ฅ ∈ [๐, ๐]. Since ๐ฅ was an arbitrary element of
๐ผ, ∀๐ฅ(๐ฅ ∈ ๐ผ → ๐ฅ ∈ [๐, ๐]). So, ๐ผ ⊆ [๐, ๐].
Now, let ๐ง ∈ (๐, ๐). It follows that ๐ < ๐ง < ๐. Since ๐ is the least upper bound of ๐ผ, ๐ง is not an upper
bound of ๐ผ. So, there is ๐ฆ ∈ ๐ผ with ๐ง < ๐ฆ. Since ๐ is the greatest lower bound of ๐ผ, ๐ง is not a lower
bound of ๐ผ. So, there is ๐ฅ ∈ ๐ผ with ๐ฅ < ๐ง. Since ๐ผ is an interval, ๐ฅ, ๐ฆ ∈ ๐ผ, and ๐ฅ < ๐ง < ๐ฆ, it follows that
๐ง ∈ ๐ผ. Since ๐ง was an arbitrary element of (๐, ๐), we have shown ∀๐ฅ(๐ฅ ∈ (๐, ๐) → ๐ฅ ∈ ๐ผ). So,
(๐, ๐) ⊆ ๐ผ.
We have shown that (๐, ๐) ⊆ ๐ผ and ๐ผ ⊆ [๐, ๐]. There are now 4 cases to consider.
Case 1: If both the greatest lower bound of I (namely, a) and the least upper bound of I (namely, b)
are elements of I, then we have [a, b] ⊆ I and I ⊆ [a, b]. So, I = [a, b].
Case 2: If a ∈ I and b ∉ I, then we have [a, b) ⊆ I and I ⊆ [a, b). So, I = [a, b).
Case 3: If a ∉ I and b ∈ I, then we have (a, b] ⊆ I and I ⊆ (a, b]. So, I = (a, b].
Case 4: If a ∉ I and b ∉ I, then we have (a, b) ⊆ I and I ⊆ (a, b). So, I = (a, b).
โก
Note: You will be asked to prove the analogous result for unbounded intervals in Problem 4 below.
Operations on Sets
In Lesson 2 we saw how to take the union and intersection of two sets. We now review the definitions
from that lesson and introduce a few more.
The union of the sets ๐ด and ๐ต, written ๐ด ∪ ๐ต, is the set of elements that are in ๐ด or ๐ต (or both).
๐ด ∪ ๐ต = {๐ฅ | ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต}
The intersection of ๐ด and ๐ต, written ๐ด ∩ ๐ต, is the set of elements that are simultaneously in ๐ด and ๐ต.
๐ด ∩ ๐ต = {๐ฅ | ๐ฅ ∈ ๐ด and ๐ฅ ∈ ๐ต}
The following Venn diagrams for the union and intersection of two sets can be useful for visualizing
these operations. As usual, U is some “universal” set that contains both A and B.
[Figure: Venn diagrams of A ∪ B and A ∩ B]
The difference A ∖ B is the set of elements that are in A and not in B.
A ∖ B = {x | x ∈ A and x ∉ B}
The symmetric difference between A and B, written A Δ B, is the set of elements that are in A or B,
but not both.
A Δ B = (A ∖ B) ∪ (B ∖ A)
Let’s also look at Venn diagrams for the difference and symmetric difference of two sets.
[Figure: Venn diagrams of A ∖ B and A Δ B]
Example 6.3: Let A = {0, 1, 2, 3, 4} and B = {3, 4, 5, 6}. We have
1. A ∪ B = {0, 1, 2, 3, 4, 5, 6}
2. A ∩ B = {3, 4}
3. A ∖ B = {0, 1, 2}
4. B ∖ A = {5, 6}
5. A Δ B = {0, 1, 2} ∪ {5, 6} = {0, 1, 2, 5, 6}
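The computations in Example 6.3 can be reproduced with Python's built-in set type. This is only an illustrative sketch; the variable names are not from the text.

```python
# Example 6.3 with Python's built-in set type.
A = {0, 1, 2, 3, 4}
B = {3, 4, 5, 6}

union = A | B          # elements in A or B (or both)
intersection = A & B   # elements in both A and B
a_minus_b = A - B      # elements in A and not in B
b_minus_a = B - A      # elements in B and not in A
sym_diff = A ^ B       # elements in A or B, but not both

# The symmetric difference agrees with (A \ B) ∪ (B \ A).
assert sym_diff == a_minus_b | b_minus_a

print(sorted(union), sorted(intersection), sorted(sym_diff))
```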
Example 6.4: Let A = (–2, 1] and B = (0, ∞). We have
1. A ∪ B = (–2, ∞)
2. A ∩ B = (0, 1]
3. A ∖ B = (–2, 0]
4. B ∖ A = (1, ∞)
5. A Δ B = (–2, 0] ∪ (1, ∞)
Note: If you have trouble seeing how to compute these, it may be helpful to draw the graphs of A and
B lined up vertically, and then draw vertical lines through the endpoints of each interval.
[Figure: graphs of A and B lined up vertically, with vertical lines through the endpoints]
The results follow easily by combining these graphs into a single graph using the vertical lines as guides.
For example, let’s look at ๐ด ∩ ๐ต in detail. We’re looking for all numbers that are in both ๐ด and ๐ต. The
two rightmost vertical lines drawn passing through the two graphs above isolate all those numbers
nicely. We see that all numbers between 0 and 1 are in the intersection. We should then think about
the two endpoints 0 and 1 separately. 0 ∉ ๐ต and therefore, 0 cannot be in the intersection of ๐ด and
๐ต. On the other hand, 1 ∈ ๐ด and 1 ∈ ๐ต. Therefore, 1 ∈ ๐ด ∩ ๐ต. So, we see that ๐ด ∩ ๐ต = (0,1].
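As a sketch of the endpoint reasoning used for the intersection of (–2, 1] and (0, ∞), the following Python code intersects two real intervals by taking the larger left endpoint and the smaller right endpoint, and keeping an endpoint only when both intervals contain it. The Interval class and the intersect function are hypothetical helpers introduced here for illustration; they are not part of the text.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """Hypothetical helper: an interval with endpoints lo <= hi (possibly infinite)."""
    lo: float
    hi: float
    lo_closed: bool = False
    hi_closed: bool = False

    def __contains__(self, x):
        above = x > self.lo or (self.lo_closed and x == self.lo)
        below = x < self.hi or (self.hi_closed and x == self.hi)
        return above and below


def intersect(p, q):
    """Return p ∩ q as an Interval, or None when the intersection is empty."""
    # Left endpoint: the larger one; on a tie it is included only if both include it.
    if p.lo > q.lo:
        lo, lo_closed = p.lo, p.lo_closed
    elif q.lo > p.lo:
        lo, lo_closed = q.lo, q.lo_closed
    else:
        lo, lo_closed = p.lo, p.lo_closed and q.lo_closed
    # Right endpoint: the smaller one, with the same tie rule.
    if p.hi < q.hi:
        hi, hi_closed = p.hi, p.hi_closed
    elif q.hi < p.hi:
        hi, hi_closed = q.hi, q.hi_closed
    else:
        hi, hi_closed = p.hi, p.hi_closed and q.hi_closed
    if lo > hi or (lo == hi and not (lo_closed and hi_closed)):
        return None
    return Interval(lo, hi, lo_closed, hi_closed)


A = Interval(-2, 1, hi_closed=True)   # (-2, 1]
B = Interval(0, math.inf)             # (0, ∞)
C = intersect(A, B)                   # (0, 1]
```

Here 0 is excluded because 0 ∉ B, while 1 is kept because it lies in both intervals.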
Unions and intersections have many nice algebraic properties such as commutativity (๐ด ∪ ๐ต = ๐ต ∪ ๐ด
and ๐ด ∩ ๐ต = ๐ต ∩ ๐ด), associativity ((๐ด ∪ ๐ต) ∪ ๐ถ = ๐ด ∪ (๐ต ∪ ๐ถ) and (๐ด ∩ ๐ต) ∩ ๐ถ = ๐ด ∩ (๐ต ∩ ๐ถ)), and
distributivity (๐ด ∩ (๐ต ∪ ๐ถ) = (๐ด ∩ ๐ต) ∪ (๐ด ∩ ๐ถ) and ๐ด ∪ (๐ต ∩ ๐ถ) = (๐ด ∪ ๐ต) ∩ (๐ด ∪ ๐ถ)).
As an example, let’s prove that the operation of forming unions is associative. You will be asked to
prove similar results in the problems below.
Theorem 6.2: The operation of forming unions is associative.
Note: Before beginning the proof, let’s draw Venn diagrams of the situation to convince ourselves that
the theorem is true.
[Figure: Venn diagrams of A ∪ B, B ∪ C, and (A ∪ B) ∪ C = A ∪ (B ∪ C)]
Proof of Theorem 6.2: Let ๐ด, ๐ต, and ๐ถ be sets, and let ๐ฅ ∈ (๐ด ∪ ๐ต) ∪ ๐ถ. Then ๐ฅ ∈ ๐ด ∪ ๐ต or ๐ฅ ∈ ๐ถ. If
๐ฅ ∈ ๐ถ, then ๐ฅ ∈ ๐ต or ๐ฅ ∈ ๐ถ. So, ๐ฅ ∈ ๐ต ∪ ๐ถ. Then ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต ∪ ๐ถ. So, ๐ฅ ∈ ๐ด ∪ (๐ต ∪ ๐ถ). If, on the
other hand, ๐ฅ ∈ ๐ด ∪ ๐ต, then ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต. If ๐ฅ ∈ ๐ด, then ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต ∪ ๐ถ. So, ๐ฅ ∈ ๐ด ∪ (๐ต ∪ ๐ถ).
If ๐ฅ ∈ ๐ต, then ๐ฅ ∈ ๐ต or ๐ฅ ∈ ๐ถ. So, ๐ฅ ∈ ๐ต ∪ ๐ถ. Then ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต ∪ ๐ถ. So, ๐ฅ ∈ ๐ด ∪ (๐ต ∪ ๐ถ). Since ๐ฅ
was arbitrary, we have shown ∀๐ฅ(๐ฅ ∈ (๐ด ∪ ๐ต) ∪ ๐ถ → ๐ฅ ∈ ๐ด ∪ (๐ต ∪ ๐ถ)). Therefore, we have shown
that (๐ด ∪ ๐ต) ∪ ๐ถ ⊆ ๐ด ∪ (๐ต ∪ ๐ถ).
A similar argument can be used to show ๐ด ∪ (๐ต ∪ ๐ถ) ⊆ (๐ด ∪ ๐ต) ∪ ๐ถ (the reader should write out the
details).
Since (๐ด ∪ ๐ต) ∪ ๐ถ ⊆ ๐ด ∪ (๐ต ∪ ๐ถ) and ๐ด ∪ (๐ต ∪ ๐ถ) ⊆ (๐ด ∪ ๐ต) ∪ ๐ถ, (๐ด ∪ ๐ต) ∪ ๐ถ = ๐ด ∪ (๐ต ∪ ๐ถ), and
therefore, the operation of forming unions is associative.
โก
Remember that associativity allows us to drop parentheses. So, we can now simply write ๐ด ∪ ๐ต ∪ ๐ถ
when taking the union of the three sets ๐ด, ๐ต, and ๐ถ.
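The algebraic properties listed before Theorem 6.2 can be spot-checked by brute force on a small universe. The sketch below tests every triple of subsets of {0, 1, 2}; this is a finite sanity check, not a proof, and the helper name subsets is an assumption made for illustration.

```python
from itertools import combinations, product


def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]


U = {0, 1, 2}
all_subsets = subsets(U)   # 2**3 = 8 subsets

for A, B, C in product(all_subsets, repeat=3):
    assert (A | B) | C == A | (B | C)          # associativity of union
    assert (A & B) & C == A & (B & C)          # associativity of intersection
    assert A & (B | C) == (A & B) | (A & C)    # intersection distributes over union
    assert A | (B & C) == (A | B) & (A | C)    # union distributes over intersection

checked = len(all_subsets) ** 3   # number of triples examined
```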
Recall from Lesson 2 that sets A and B are called disjoint or mutually exclusive if A ∩ B = ∅. For
example, the sets (–2, 0] and (1, ∞) are disjoint intervals. Here is a typical Venn diagram of disjoint
sets A and B.
[Figure: Venn diagram of disjoint sets A and B, with A ∩ B = ∅]
In topology, we will often want to look at unions and intersections of more than two sets. Therefore,
we make the following more general definitions.
Let X be a nonempty set of sets.
⋃X = {y | there is Y ∈ X with y ∈ Y}
and
⋂X = {y | for all Y ∈ X, y ∈ Y}.
If you’re having trouble understanding what these definitions are saying, you’re not alone. The notation
probably looks confusing, but the ideas behind these definitions are very simple. You have a whole
bunch of sets (possibly infinitely many). To take the union of all these sets, you simply throw all the
elements together into one big set. To take the intersection of all these sets, you take only the elements
that are in every single one of those sets.
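For a finite family of finite sets, the two definitions translate almost word for word into Python. The function names big_union and big_intersection are hypothetical, chosen here for illustration, and the second family is only an integer analogue of a nested chain of intervals.

```python
def big_union(family):
    """⋃X: everything that lies in at least one member of the family."""
    return {y for Y in family for y in Y}


def big_intersection(family):
    """⋂X for a nonempty family: everything that lies in every member."""
    members = list(family)
    if not members:
        raise ValueError("the family must be nonempty")
    first, rest = members[0], members[1:]
    return {y for y in first if all(y in Y for Y in rest)}


X = [{1, 2, 3, 4}, {2, 3, 4, 5}, {3, 4, 5, 6}]
union_of_X = big_union(X)                 # throw all the elements together
intersection_of_X = big_intersection(X)   # keep only the elements in every set

# A nested chain {0}, {0, 1}, {0, 1, 2}, ... (integers only, as an analogue).
chain = [set(range(0, b)) for b in range(1, 6)]
union_of_chain = big_union(chain)
intersection_of_chain = big_intersection(chain)
```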
Example 6.5:
1. Let A and B be sets and let X = {A, B}. Then
⋃X = {y | there is Y ∈ X with y ∈ Y} = {y | y ∈ A or y ∈ B} = A ∪ B.
⋂X = {y | for all Y ∈ X, y ∈ Y} = {y | y ∈ A and y ∈ B} = A ∩ B.
2. Let A, B, and C be sets, and let X = {A, B, C}. Then
⋃X = {y | there is Y ∈ X with y ∈ Y} = {y | y ∈ A, y ∈ B, or y ∈ C} = A ∪ B ∪ C.
⋂X = {y | for all Y ∈ X, y ∈ Y} = {y | y ∈ A, y ∈ B, and y ∈ C} = A ∩ B ∩ C.
3. Let X = {[0, r) | r ∈ ℝ+}. Then
⋃X = {y | there is Y ∈ X with y ∈ Y} = {y | there is r ∈ ℝ+ with y ∈ [0, r)} = [0, ∞).
⋂X = {y | for all Y ∈ X, y ∈ Y} = {y | for all r ∈ ℝ+, y ∈ [0, r)} = {0}.
Notes: (1) Examples 1 and 2 give a good idea of what ⋃X and ⋂X look like when X is finite. More
generally, if X = {A₁, A₂, …, Aₙ}, then ⋃X = A₁ ∪ A₂ ∪ ⋯ ∪ Aₙ and ⋂X = A₁ ∩ A₂ ∩ ⋯ ∩ Aₙ.
(2) As a specific example of Note 1, let A₁ = (–∞, 5], A₂ = (0, 5), A₃ = [2, 6), and A₄ = (4, 99]. Let
X = {A₁, A₂, A₃, A₄}. Then
⋃X = A₁ ∪ A₂ ∪ A₃ ∪ A₄ = (–∞, 5] ∪ (0, 5) ∪ [2, 6) ∪ (4, 99] = (–∞, 99].
⋂X = A₁ ∩ A₂ ∩ A₃ ∩ A₄ = (–∞, 5] ∩ (0, 5) ∩ [2, 6) ∩ (4, 99] = (4, 5).
If you have trouble seeing how to compute the intersection, it may help to line up the graphs of the
intervals, as was done in the Note following Example 6.4, and/or take the intersections two at a time:
(– ∞, 5] ∩ (0, 5) = (0, 5) because (0, 5) ⊆ (– ∞, 5].
(0, 5) ∩ [2,6) = [2, 5) (draw the line graphs if you don’t see this).
[2, 5) ∩ (4, 99] = (4,5) (again, draw the line graphs if you don’t see this).
(3) Let’s prove carefully that {๐ฆ | there is ๐ ∈ โ+ with ๐ฆ ∈ [0, ๐)} = [0, ∞).
For convenience, let’s let ๐ด = {๐ฆ | there is ๐ ∈ โ+ with ๐ฆ ∈ [0, ๐)}.
If ๐ฆ ∈ ๐ด, then there is ๐ ∈ โ+ with ๐ฆ ∈ [0, ๐). So, 0 ≤ ๐ฆ < ๐. In particular, ๐ฆ ≥ 0. So, ๐ฆ ∈ [0, ∞). Since
๐ฆ ∈ ๐ด was arbitrary, we have shown that ๐ด ⊆ [0, ∞).
Let ๐ฆ ∈ [0, ∞). Since (๐ฆ + 1) − ๐ฆ = 1 > 0, we have ๐ฆ + 1 > ๐ฆ. So, ๐ฆ ∈ [0, ๐ฆ + 1). Since ๐ฆ + 1 ∈ โ+ ,
๐ฆ ∈ ๐ด. Since ๐ฆ ∈ [0, ∞) was arbitrary, we have shown that [0, ∞) ⊆ ๐ด.
Since ๐ด ⊆ [0, ∞) and [0, ∞) ⊆ ๐ด, it follows that ๐ด = [0, ∞).
(4) Let’s also prove carefully that {๐ฆ | for all ๐ ∈ โ+ , ๐ฆ ∈ [0, ๐)} = {0}.
For convenience, let’s let ๐ต = {๐ฆ | for all ๐ ∈ โ+ , ๐ฆ ∈ [0, ๐)}.
If y ∈ B, then for all r ∈ ℝ+, y ∈ [0, r). So, for all r ∈ ℝ+, 0 ≤ y < r. So, y is a nonnegative real
number that is less than every positive real number. By Problem 8 in Problem Set 5, y = 0. Therefore,
y ∈ {0}. Since y ∈ B was arbitrary, we have shown that B ⊆ {0}.
Now, let ๐ฆ ∈ {0}. Then ๐ฆ = 0. For all ๐ ∈ โ+ , 0 ∈ [0, ๐). So, ๐ฆ ∈ ๐ต. It follows that {0} ⊆ ๐ต.
Since ๐ต ⊆ {0} and {0} ⊆ ๐ต, it follows that ๐ต = {0}.
(5) Note that the empty union is empty. Indeed, we have โ∅ = {๐ฆ | there is ๐ ∈ ∅ with ๐ฆ ∈ ๐} = ∅.
If X is a nonempty set of sets, we say that X is disjoint if ⋂X = ∅. We say that X is pairwise disjoint if
for all A, B ∈ X with A ≠ B, A and B are disjoint. For example, if we let X = {(n, n + 1) | n ∈ ℤ}, then
X is both disjoint and pairwise disjoint.
Are the definitions of disjoint and pairwise disjoint equivalent? You will be asked to answer this
question in Problem 5 below.
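For finite families, both definitions can be tested directly. The sketch below uses hypothetical function names and a family that happens to satisfy both conditions; it deliberately does not settle the question posed in Problem 5.

```python
from itertools import combinations


def is_disjoint(family):
    """True when no element belongs to every member of the (nonempty) family."""
    members = list({frozenset(Y) for Y in family})   # distinct members only
    return not frozenset.intersection(*members)


def is_pairwise_disjoint(family):
    """True when any two distinct members of the family have empty intersection."""
    members = list({frozenset(Y) for Y in family})
    return all(A.isdisjoint(B) for A, B in combinations(members, 2))


X = [{1, 2}, {3, 4}, {5, 6}]
both = (is_disjoint(X), is_pairwise_disjoint(X))
```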
Open and Closed Sets
A subset ๐ of โ is said to be open if for every real number ๐ฅ ∈ ๐, there is an open interval (๐, ๐) with
๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐.
In words, a set is open in โ if every number in the set has “some space” on both sides of that number
inside the set. If you think of each point in the set as an animal, then each animal in the set should be
able to move a little to the left and a little to the right without ever leaving the set. Another way to
think of this is that no number is on “the edge” or “the boundary” of the set, about to fall out of it.
Example 6.6:
1. Every bounded open interval is open. To see this, let X = (a, b) and let x ∈ X. Then X = (a, b)
itself is an open interval with x ∈ (a, b) and (a, b) ⊆ X. For example, (0, 1) and (–√2, 3/5) are
open sets.
2. We will prove in the theorems below that all open intervals are open sets. For example,
(– 2, ∞), (– ∞, 5), and (– ∞, ∞) are all open sets.
3. (0,1] is not an open set because the “boundary point” 1 is included in the set. If (a, b) is any
open interval containing 1, then (a, b) ⊈ (0,1] because there are numbers greater than 1 inside
(a, b). For example, let x = (1/2)(1 + b) (the average of 1 and b). Since b > 1, we have that
x > (1/2)(1 + 1) = (1/2) ⋅ 2 = 1. So, x > 1. Also, since 1 > a, x > a. Now, since 1 < b, we have that
x < (1/2)(b + b) = (1/2)(2b) = ((1/2) ⋅ 2)b = 1b = b. So, x ∈ (a, b).
4. We can use reasoning similar to that used in 3 to see that all half-open intervals and closed
intervals are not open sets.
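The argument in part 3 is constructive: given any open interval (a, b) containing 1, the average of 1 and b is a point of (a, b) that lies outside (0, 1]. The following sketch computes that witness with exact rational arithmetic; the function name witness_outside is an assumption made for illustration.

```python
from fractions import Fraction


def witness_outside(a, b):
    """Given a < 1 < b, return x = (1 + b)/2, a point of (a, b) that is not in (0, 1]."""
    a, b = Fraction(a), Fraction(b)
    if not (a < 1 < b):
        raise ValueError("expected a < 1 < b")
    x = (1 + b) / 2
    assert a < x < b   # x is in (a, b)
    assert x > 1       # so x is not in (0, 1]
    return x


x1 = witness_outside(0, 2)
x2 = witness_outside(Fraction(9, 10), Fraction(11, 10))
```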
Theorem 6.3: Let ๐ ∈ โ. The infinite interval (๐, ∞) is an open set.
The idea behind the proof is quite simple. If ๐ฅ ∈ (๐, ∞), then (๐, ๐ฅ + 1) is an open interval with ๐ฅ inside
of it and with (๐, ๐ฅ + 1) ⊆ (๐, ∞).
Proof of Theorem 6.3: Let ๐ฅ ∈ (๐, ∞) and let ๐ = ๐ฅ + 1.
Since ๐ฅ ∈ (๐, ∞), ๐ฅ > ๐. Since (๐ฅ + 1) − ๐ฅ = 1 > 0, we have ๐ = ๐ฅ + 1 > ๐ฅ.
So, we have ๐ < ๐ฅ < ๐. That is, ๐ฅ ∈ (๐, ๐). Also, (๐, ๐) ⊆ (๐, ∞). Since ๐ฅ ∈ (๐, ∞) was arbitrary, (๐, ∞)
is an open set.
โก
In Problem 6 below (part (i)), you will be asked to show that an interval of the form (– ∞, ๐) is also an
open set.
Theorem 6.4: ∅ and โ are both open sets.
Proof: The statement that ∅ is open is vacuously true (since ∅ has no elements, there is nothing to
check).
If ๐ฅ ∈ โ, then ๐ฅ ∈ (๐ฅ − 1, ๐ฅ + 1) and (๐ฅ − 1, ๐ฅ + 1) ⊆ โ. Since ๐ฅ was an arbitrary element of โ, we
have shown that for every ๐ฅ ∈ โ, there is an open interval (๐, ๐) with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ โ. So, โ
is open.
โก
Many authors define “open” in a slightly different way from the definition we’ve been using. This next
Theorem will show that the definition we have been using is equivalent to theirs.
Theorem 6.5: A subset ๐ of โ is open if and only if for every real number ๐ฅ ∈ ๐, there is a positive real
number ๐ such that (๐ฅ − ๐, ๐ฅ + ๐) ⊆ ๐.
Analysis: The harder direction of the proof is showing that if ๐ is open, then for every real number
๐ฅ ∈ ๐, there is a positive real number ๐ such that (๐ฅ − ๐, ๐ฅ + ๐) ⊆ ๐.
To see this, suppose that ๐ is open and let ๐ฅ ∈ ๐. Then there is an open interval (๐, ๐) with ๐ฅ ∈ (๐, ๐)
and (๐, ๐) ⊆ ๐. We want to replace the interval (๐, ๐) by an interval that has ๐ฅ right in the center.
The following picture should help us to come up with an argument.
In the picture, we have an open interval (๐, ๐), containing ๐ฅ. In this particular picture, ๐ฅ is a bit closer
to ๐ than it is to ๐. However, we should remember to be careful that our argument doesn’t assume this
(as we have no control over where ๐ฅ “sits” inside of (๐, ๐)).
In the picture, we see that ๐ฅ − ๐ is the distance from ๐ to ๐ฅ, and ๐ − ๐ฅ is the distance from ๐ฅ to ๐.
Since the distance from ๐ to ๐ฅ is smaller, let’s let ๐ be that smaller distance. In other words, we let
๐ = ๐ฅ − ๐. From the picture, it looks like the interval (๐ฅ − ๐, ๐ฅ + ๐) will be inside the interval (๐, ๐).
In general, if ๐ฅ is closer to ๐, we would let ๐ = ๐ฅ − ๐, and if ๐ฅ is closer to ๐, we would let ๐ = ๐ − ๐ฅ.
We can simply define ๐ to be the smaller of ๐ฅ − ๐ and ๐ − ๐ฅ. That is, ๐ = min{๐ฅ − ๐, ๐ − ๐ฅ}. From the
picture, it seems like with this choice of ๐, the interval (๐ฅ − ๐, ๐ฅ + ๐) should give us what we want.
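The choice of radius described in this analysis, the smaller of x − a and b − x, can be tried out numerically. The sketch below uses exact fractions and checks the chain of inequalities a ≤ x − ε < x < x + ε ≤ b; the function name centered_radius is hypothetical.

```python
from fractions import Fraction


def centered_radius(a, b, x):
    """For a < x < b, return eps = min(x - a, b - x), so that (x - eps, x + eps) ⊆ (a, b)."""
    a, b, x = (Fraction(v) for v in (a, b, x))
    if not (a < x < b):
        raise ValueError("expected a < x < b")
    eps = min(x - a, b - x)
    assert eps > 0
    assert a <= x - eps < x < x + eps <= b
    return eps


eps_left = centered_radius(0, 1, Fraction(1, 4))   # x is closer to a
eps_right = centered_radius(0, 10, 7)              # x is closer to b
```

The code makes no assumption about whether x sits closer to a or to b, which matches the caution raised in the analysis.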
Proof of Theorem 6.5: Let ๐ be an open subset of โ and let ๐ฅ ∈ ๐. Then there is an open interval (๐, ๐)
with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐. Let ๐ = min{๐ฅ − ๐, ๐ − ๐ฅ}. We claim that (๐ฅ − ๐, ๐ฅ + ๐) is an open
interval containing ๐ฅ and contained in (๐, ๐). We need to show ๐ ≤ ๐ฅ − ๐ < ๐ฅ < ๐ฅ + ๐ ≤ ๐.
Since ๐ = min{๐ฅ − ๐, ๐ − ๐ฅ}, ๐ ≤ ๐ฅ − ๐. So, – ๐ ≥ – (๐ฅ − ๐). It follows that
(๐ฅ − ๐) − ๐ ≥ (๐ฅ − (๐ฅ − ๐)) − ๐ = (๐ฅ − ๐ฅ + ๐) − ๐ = ๐ − ๐ = 0.
So, ๐ฅ − ๐ ≥ ๐.
Since ๐ = min{๐ฅ − ๐, ๐ − ๐ฅ}, ๐ ≤ ๐ − ๐ฅ. So, – ๐ ≥ – (๐ − ๐ฅ). It follows that
๐ − (๐ฅ + ๐) = ๐ − ๐ฅ − ๐ ≥ ๐ − ๐ฅ − (๐ − ๐ฅ) = 0.
So, ๐ ≥ ๐ฅ + ๐, or equivalently, ๐ฅ + ๐ ≤ ๐.
Note that ๐ฅ > ๐, so that ๐ฅ − ๐ > 0, and ๐ฅ < ๐, so that ๐ − ๐ฅ > 0. It follows that ๐ > 0.
We have ๐ฅ − (๐ฅ − ๐) = ๐ > 0, so that ๐ฅ > ๐ฅ − ๐. We also have (๐ฅ + ๐) − ๐ฅ = ๐ > 0, so that
๐ฅ + ๐ > ๐ฅ.
We have shown ๐ ≤ ๐ฅ − ๐ < ๐ฅ < ๐ฅ + ๐ ≤ ๐, as desired.
Since (๐ฅ − ๐, ๐ฅ + ๐) ⊆ (๐, ๐) and (๐, ๐) ⊆ ๐, by the transitivity of ⊆ (Theorem 2.3 from Lesson 2), we
have (๐ฅ − ๐, ๐ฅ + ๐) ⊆ ๐.
The converse is immediate since for ๐ฅ ∈ ๐, (๐ฅ − ๐, ๐ฅ + ๐) is an open interval containing ๐ฅ.
โก
The basic definition of a topological space involves open sets, unions, and intersections. We’re not
going to talk about general topological spaces in this lesson (we will look at them in Lesson 14), but in
the spirit of the subject, we will prove some results about unions and intersections of open sets in โ.
Theorem 6.6: The union of two open sets in โ is an open set in โ.
Proof: Let ๐ด and ๐ต be open sets in โ, and let ๐ฅ ∈ ๐ด ∪ ๐ต. Then ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต. Without loss of
generality, we may assume that ๐ฅ ∈ ๐ด (see the Note below). Since ๐ด is open in โ, there is an interval
(๐, ๐) with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐ด. By Theorem 2.4, ๐ด ⊆ ๐ด ∪ ๐ต. Since ⊆ is transitive (Theorem 2.3),
(๐, ๐) ⊆ ๐ด ∪ ๐ต. Therefore, ๐ด ∪ ๐ต is open.
โก
Note: In the proof of Theorem 6.6, we used the expression “Without loss of generality.” This expression
can be used when an argument can be split up into 2 or more cases, and the proof of each of the cases
is nearly identical.
For Theorem 6.6, the two cases are (i) ๐ฅ ∈ ๐ด and (ii) ๐ฅ ∈ ๐ต. The argument for case (ii) is the same as
the argument for case (i), essentially word for word—only the roles of ๐ด and ๐ต are interchanged.
Example 6.7: (– 5, 2) is open by part 1 of Example 6.6 and (7, ∞) is open by Theorem 6.3. Therefore,
by Theorem 6.6, (– 5, 2) ∪ (7, ∞) is also open.
If you look at the proof of Theorem 6.6 closely, you should notice that the proof would still work if we
were taking a union of more than 2 sets. In fact, any union of open sets is open, as we now prove.
Theorem 6.7: Let ๐ฟ be a set of open subsets of โ. Then โ๐ฟ is open.
Proof: Let ๐ฟ be a set of open subsets of โ and let ๐ฅ ∈ โ๐ฟ. Then ๐ฅ ∈ ๐ด for some ๐ด ∈ ๐ฟ. Since ๐ด is open
in โ, there is an interval (๐, ๐) with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐ด. By Problem 9 below (part (i)), we have
๐ด ⊆ โ๐ฟ. Since ⊆ is transitive (Theorem 2.3), (๐, ๐) ⊆ โ๐ฟ. Therefore, โ๐ฟ is open.
โก
Example 6.8:
1. (1,2) ∪ (2,3) ∪ (3,4) ∪ (4, ∞) is open.
2. ℝ ∖ ℤ is open because it is a union of open intervals. It looks like this:
⋯ ∪ (–2, –1) ∪ (–1, 0) ∪ (0, 1) ∪ (1, 2) ∪ ⋯
ℝ ∖ ℤ can also be written as
⋃{(n, n + 1) | n ∈ ℤ} or ⋃_{n ∈ ℤ} (n, n + 1)
3. If we take the union of all intervals of the form (1/(n + 1), 1/n) for positive integers n, we get an open
set. We can visualize this open set as follows:
⋃{(1/(n + 1), 1/n) | n ∈ ℤ+} = ⋯ ∪ (1/5, 1/4) ∪ (1/4, 1/3) ∪ (1/3, 1/2) ∪ (1/2, 1)
Theorem 6.8: Every open set in โ can be expressed as a union of bounded open intervals.
The main idea of the argument will be the following. Every real number that is in an open set is inside
an open interval that is a subset of the set. Just take the union of all these open intervals (one interval
for each real number in the set).
Proof of Theorem 6.8: Let X be an open set in ℝ. Since X is open, for each x ∈ X, there is a bounded open interval
(a_x, b_x) with x ∈ (a_x, b_x) and (a_x, b_x) ⊆ X. Let Y = {(a_x, b_x) | x ∈ X}. We will show that X = ⋃Y.
First, let x ∈ X. Then x ∈ (a_x, b_x). Since (a_x, b_x) ∈ Y, x ∈ ⋃Y. Since x ∈ X was arbitrary, X ⊆ ⋃Y.
Now, let x ∈ ⋃Y. Then there is z ∈ X with x ∈ (a_z, b_z). Since (a_z, b_z) ⊆ X, x ∈ X. Since x ∈ ⋃Y was
arbitrary, ⋃Y ⊆ X.
Since X ⊆ ⋃Y and ⋃Y ⊆ X, it follows that X = ⋃Y.
โก
Theorem 6.9: The intersection of two open sets in โ is an open set in โ.
Proof: Let ๐ด and ๐ต be open sets in โ and let ๐ฅ ∈ ๐ด ∩ ๐ต. Then ๐ฅ ∈ ๐ด and ๐ฅ ∈ ๐ต. Since ๐ด is open, there
is an open interval (๐, ๐) with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐ด. Since ๐ต is open, there is an open interval (๐, ๐)
with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐ต. Let ๐ถ = (๐, ๐) ∩ (๐, ๐). Since ๐ฅ ∈ (๐, ๐) and ๐ฅ ∈ (๐, ๐), ๐ฅ ∈ ๐ถ. By
Problem 6 below (part (ii)), ๐ถ is an open interval. By Problem 11 from Lesson 2 and part (ii) of Problem
3 below, ๐ถ ⊆ ๐ด and ๐ถ ⊆ ๐ต. It follows that ๐ถ ⊆ ๐ด ∩ ๐ต (Prove this!). Since ๐ฅ ∈ ๐ด ∩ ๐ต was arbitrary,
๐ด ∩ ๐ต is open.
โก
In Problem 6 below (part (iii)), you will be asked to show that the intersection of finitely many open
sets in ℝ is an open set in ℝ. In Problem 8, you will be asked to show that an arbitrary intersection of
open sets does not need to be open.
A subset ๐ of โ is said to be closed if โ โ ๐ is open.
โ โ ๐ is called the complement of ๐ in โ, or simply the complement of ๐. It consists of all real numbers
not in ๐.
Example 6.9:
1. Every closed interval is a closed set. For example, [0,1] is closed because its complement in ℝ
is ℝ ∖ [0,1] = (–∞, 0) ∪ (1, ∞). This is a union of open intervals, which is open.
Similarly, [3, ∞) is a closed set because ℝ ∖ [3, ∞) = (–∞, 3), which is open.
2. Half-open intervals are neither open nor closed. For example, we saw in Example 6.6 that (0,1]
is not an open set. We see that (0,1] is not closed by observing ℝ ∖ (0,1] = (–∞, 0] ∪ (1, ∞),
which is not open, because every open interval containing 0 also contains positive numbers less than 1.
3. ∅ is closed because ℝ ∖ ∅ = ℝ is open. ℝ is closed because ℝ ∖ ℝ = ∅ is open. ∅ and ℝ are the
only two sets of real numbers that are both open and closed.
Theorem 6.10: The intersection of two closed sets in โ is a closed set in โ.
Proof: Let ๐ด and ๐ต be closed sets in โ. Then โ โ ๐ด and โ โ ๐ต are open sets in โ. By Theorem 6.6 (or
6.7), (โ โ ๐ด) ∪ (โ โ ๐ต) is open in โ. Therefore, โ โ [(โ โ ๐ด) ∪ (โ โ ๐ต)] is closed in โ. So, it suffices
to show that ๐ด ∩ ๐ต = โ โ [(โ โ ๐ด) ∪ (โ โ ๐ต)]. Well, ๐ฅ ∈ ๐ด ∩ ๐ต if and only if ๐ฅ ∈ ๐ด and ๐ฅ ∈ ๐ต if and
only if ๐ฅ ∉ โ โ ๐ด and ๐ฅ ∉ โ โ ๐ต if and only if ๐ฅ ∉ (โ โ ๐ด) ∪ (โ โ ๐ต) if and only if
๐ฅ ∈ โ โ [(โ โ ๐ด) ∪ (โ โ ๐ต)]. So, ๐ด ∩ ๐ต = โ โ [(โ โ ๐ด) ∪ (โ โ ๐ต)], completing the proof.
โก
A similar argument can be used to show that the union of two closed sets in ℝ is a closed set in ℝ. This
result can be extended to the union of finitely many closed sets in ℝ with the help of Problem 6 below
(part (iii)). The dedicated reader should prove this. In Problem 10 below, you will be asked to show that
an arbitrary intersection of closed sets in ℝ is closed. In Problem 8, you will be asked to show that an
arbitrary union of closed sets does not need to be closed.
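The set identity at the heart of the proof of Theorem 6.10, that the intersection of two sets is the complement of the union of their complements, can be illustrated on a finite universe. In the sketch below a finite set U stands in for ℝ purely for illustration, and the names are assumptions rather than notation from the text.

```python
U = set(range(10))       # a small stand-in for the universe
A = {0, 2, 4, 6, 8}
B = {0, 3, 6, 9}


def complement(S):
    """Complement of S inside the universe U."""
    return U - S


lhs = A & B
rhs = complement(complement(A) | complement(B))
assert lhs == rhs
```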
Problem Set 6
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Draw Venn diagrams for (A ∖ B) ∖ C and A ∖ (B ∖ C). Are these two sets equal for all sets A,
B, and C? If so, prove it. If not, provide a counterexample.
2. Let A = {∅, {∅, {∅}}}, B = {∅, {∅}}, C = (–∞, 2], D = (–1, 3]. Compute each of the following:
(i) A ∪ B
(ii) A ∩ B
(iii) A ∖ B
(iv) B ∖ A
(v) A Δ B
(vi) C ∪ D
(vii) C ∩ D
(viii) C ∖ D
(ix) D ∖ C
(x) C Δ D
LEVEL 2
3. Prove the following:
(i) The operation of forming unions is commutative.
(ii) The operation of forming intersections is commutative.
(iii) The operation of forming intersections is associative.
4. Prove that if an interval I is unbounded, then I has one of the following five forms: (a, ∞),
(–∞, b), [a, ∞), (–∞, b], (–∞, ∞).
LEVEL 3
5. Prove or provide a counterexample:
(i) Every pairwise disjoint set of sets is disjoint.
(ii) Every disjoint set of sets is pairwise disjoint.
6. Prove the following:
(i) For all a ∈ ℝ, the infinite interval (–∞, a) is an open set in ℝ.
(ii) The intersection of two open intervals in ℝ is either empty or an open interval in ℝ.
(iii) The intersection of finitely many open sets in ℝ is an open set in ℝ.
7. Let A, B, and C be sets. Prove each of the following:
(i) A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C).
(ii) A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C).
(iii) C ∖ (A ∪ B) = (C ∖ A) ∩ (C ∖ B).
(iv) C ∖ (A ∩ B) = (C ∖ A) ∪ (C ∖ B).
LEVEL 4
8. Give an example of an infinite collection of open sets whose intersection is not open. Also, give
an example of an infinite collection of closed sets whose union is not closed. Provide a proof for
each example.
9. Let X be a nonempty set of sets. Prove the following:
(i) For all A ∈ X, A ⊆ ⋃X.
(ii) For all A ∈ X, ⋂X ⊆ A.
LEVEL 5
10. Prove that if X is a nonempty set of closed subsets of ℝ, then ⋂X is closed.
11. Let A be a set and let X be a nonempty collection of sets. Prove each of the following:
(i) A ∩ ⋃X = ⋃{A ∩ B | B ∈ X}
(ii) A ∪ ⋂X = ⋂{A ∪ B | B ∈ X}
(iii) A ∖ ⋃X = ⋂{A ∖ B | B ∈ X}
(iv) A ∖ ⋂X = ⋃{A ∖ B | B ∈ X}.
12. Prove that every closed set in ℝ can be written as an intersection ⋂X, where each element of X
is a union of at most 2 closed intervals.
CHALLENGE PROBLEM
13. Prove that every nonempty open set of real numbers can be expressed as a union of pairwise
disjoint open intervals.
LESSON 7 – COMPLEX ANALYSIS
THE FIELD OF COMPLEX NUMBERS
A Limitation of the Reals
In Lesson 5 we asked (and answered) the question “Why isn’t ℚ (the field of rational numbers)
enough?” We now ask the same question about ℝ, the field of real numbers.
A linear equation has the form ax + b = 0, where a ≠ 0. If we are working inside a field, then this
equation has the unique solution x = –ba⁻¹ = –b/a. For example, the equation 2x − 1 = 0 has the
unique solution x = 2⁻¹ = 1/2. Notice how important it is that we are working inside a field here. If we
were allowed to use only the properties of a commutative ring, then we might not be able to solve this
equation. For example, in ℤ (the ring of integers), the equation 2x − 1 = 0 has no solution.
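The contrast between solving a linear equation in a field and in a ring can be seen with exact rational arithmetic. In the sketch below, Python's Fraction type plays the role of the field of rational numbers; the function name solve_linear is hypothetical, and the integer search is a finite check over a window, not a proof.

```python
from fractions import Fraction


def solve_linear(a, b):
    """Solve ax + b = 0 over the rationals: x = -b * a^(-1), assuming a != 0."""
    a, b = Fraction(a), Fraction(b)
    if a == 0:
        raise ValueError("a must be nonzero")
    return -b / a


x = solve_linear(2, -1)   # the equation 2x - 1 = 0

# No integer in this window solves 2n - 1 = 0 (2n - 1 is always odd).
integer_solutions = [n for n in range(-100, 101) if 2 * n - 1 == 0]
```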
A quadratic equation has the form ax² + bx + c = 0, where a ≠ 0. Is working inside a field enough
to solve this equation? The answer is no! For example, a solution to the equation x² − 2 = 0 must
satisfy x² = 2. In Lesson 5, we proved that this equation cannot be solved in ℚ. This was one of our
main motivations for introducing ℝ. And, in fact, the equation x² − 2 = 0 can be solved in ℝ. However,
the equation x² + 1 = 0 cannot be solved in ℝ. This follows immediately from Theorem 5.2, which
says that if x is an element of an ordered field, then x² = x ⋅ x can never be negative.
Is there a field containing ℝ, where all quadratic equations can be solved? The answer is yes, and in
fact, we can do much better than that. In this lesson we will define a field containing the field of real
numbers such that every equation of the form aₙxⁿ + aₙ₋₁xⁿ⁻¹ + ⋯ + a₁x + a₀ = 0 (with n ≥ 1 and aₙ ≠ 0) has a solution.
Such an equation is called a polynomial equation, and a field in which every such polynomial equation
has a solution is called an algebraically closed field.
The Complex Field
The standard form of a complex number is a + bi, where a and b are real numbers. So, the set of
complex numbers is ℂ = {a + bi | a, b ∈ ℝ}.
If we identify 1 = 1 + 0i with the ordered pair (1, 0), and we identify i = 0 + 1i with the ordered pair
(0, 1), then it is natural to write the complex number a + bi as the point (a, b). Here is a reasonable
justification for this:
a + bi = a(1, 0) + b(0, 1) = (a, 0) + (0, b) = (a, b)
In this way, we can visualize a complex number as a point in the Complex Plane. A portion of the Complex
Plane is shown to the right with several complex numbers displayed as points of the form (x, y).
The complex plane is formed by taking two copies of the real line and placing one horizontally and the
other vertically. The horizontal copy of the real line is called the ๐ฅ-axis or the real axis (labeled ๐ฅ in the
above figure) and the vertical copy of the real line is called the ๐ฆ-axis or imaginary axis (labeled ๐ฆ in
the above figure). The two axes intersect at the point (0, 0). This point is called the origin.
We can also visualize the complex number a + bi as a directed line segment (or vector) starting at the
origin and ending at the point (a, b). Three examples are shown to the right.
If z = a + bi is a complex number, we call a the real part of z and b the imaginary part of z, and we
write a = Re z and b = Im z.
Two complex numbers are equal if and only if they have the same real part and the same imaginary
part. In other words,
a + bi = c + di if and only if a = c and b = d.
We add two complex numbers by simply adding their real parts and adding their imaginary parts. So,
(a + bi) + (c + di) = (a + c) + (b + d)i.
As a point, this sum is (a + c, b + d). We can visualize this sum as the vector starting at the origin that
is the diagonal of the parallelogram formed from the vectors a + bi and c + di. Here is an example
showing that (1 + 2i) + (–3 + i) = –2 + 3i.
The definition for multiplying two complex numbers is a bit more complicated:
(a + bi)(c + di) = (ac − bd) + (ad + bc)i.
Notes: (1) If b = 0, then we call a + bi = a + 0i = a a real number. Note that when we add or
multiply two real numbers, we always get another real number.
(a + 0i) + (b + 0i) = (a + b) + (0 + 0)i = (a + b) + 0i = a + b.
(a + 0i)(b + 0i) = (ab − 0 ⋅ 0) + (a ⋅ 0 + 0b)i = (ab − 0) + (0 + 0)i = ab + 0i = ab.
(2) If a = 0, then we call a + bi = 0 + bi = bi a pure imaginary number.
(3) i² = –1. To see this, note that i² = i ⋅ i = (0 + 1i)(0 + 1i), and we have
(0 + 1i)(0 + 1i) = (0 ⋅ 0 − 1 ⋅ 1) + (0 ⋅ 1 + 1 ⋅ 0)i = (0 − 1) + (0 + 0)i = –1 + 0i = –1.
(4) The definition of the product of two complex numbers is motivated by how multiplication should
behave in a field, together with replacing i² by –1. If we were to naïvely multiply the two complex
numbers, we would have
(a + bi)(c + di) = (a + bi)c + (a + bi)(di) = ac + bci + adi + bdi²
= ac + bci + adi + bd(–1) = ac + (bc + ad)i − bd = (ac − bd) + (ad + bc)i.
The dedicated reader should make a note of which field properties were used during this computation.
Those familiar with the mnemonic FOIL may notice that “FOILing” will always work to produce the
product of two complex numbers, provided we replace i² by –1 and simplify.
Example 7.1: Let z = 2 − 3i and w = –1 + 5i. Then
z + w = (2 − 3i) + (–1 + 5i) = (2 + (–1)) + (–3 + 5)i = 1 + 2i.
zw = (2 − 3i)(–1 + 5i) = (2(–1) − (–3)(5)) + (2 ⋅ 5 + (–3)(–1))i
= (–2 + 15) + (10 + 3)i = 13 + 13i.
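The definitions of addition and multiplication can be written out on ordered pairs and compared with Python's built-in complex type. This is a sketch for checking the arithmetic in Example 7.1 and the identity i² = –1; the function names add and mul are assumptions made for illustration.

```python
def add(z, w):
    """(a + bi) + (c + di) = (a + c) + (b + d)i, with numbers stored as pairs."""
    (a, b), (c, d) = z, w
    return (a + c, b + d)


def mul(z, w):
    """(a + bi)(c + di) = (ac - bd) + (ad + bc)i."""
    (a, b), (c, d) = z, w
    return (a * c - b * d, a * d + b * c)


i = (0, 1)
z, w = (2, -3), (-1, 5)     # z = 2 - 3i and w = -1 + 5i

i_squared = mul(i, i)       # i^2 = -1, stored as the pair (-1, 0)
z_plus_w = add(z, w)
z_times_w = mul(z, w)

# Cross-check against Python's built-in complex arithmetic.
assert complex(*z_plus_w) == complex(*z) + complex(*w)
assert complex(*z_times_w) == complex(*z) * complex(*w)
```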
With the definitions we just made for addition and multiplication, we get (โ, +, ⋅), the field of complex
numbers. See Lesson 5 if you need to review the definition of a field.
Theorem 7.1: (ℂ, +, ⋅) is a field.
The proof that (ℂ, +, ⋅) is a field is very straightforward and mostly uses the fact that (ℝ, +, ⋅) is a
field. For example, to verify that addition is commutative in ℂ, we have
(a + bi) + (c + di) = (a + c) + (b + d)i = (c + a) + (d + b)i = (c + di) + (a + bi).
We have a + c = c + a because a, c ∈ ℝ and addition is commutative in ℝ. For the same reason, we
have b + d = d + b.
We leave the full verification that (ℂ, +, ⋅) is a field as an exercise for the reader (Problem 2 below),
and simply note a few things of importance here:
• The identity for addition is 0 = 0 + 0i.
• The identity for multiplication is 1 = 1 + 0i.
• The additive inverse of z = a + bi is –z = –(a + bi) = –a − bi.
• The multiplicative inverse of z = a + bi ≠ 0 is z⁻¹ = a/(a² + b²) − (b/(a² + b²))i.
The reader is expected to verify all this in Problem 2.
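The formula for the multiplicative inverse can be checked on a sample value before attempting the general verification. The sketch below stores a complex number as a pair of exact fractions; the function names are hypothetical, and a single numeric check is of course not a proof.

```python
from fractions import Fraction


def mul(z, w):
    """(a + bi)(c + di) = (ac - bd) + (ad + bc)i, with numbers stored as pairs."""
    (a, b), (c, d) = z, w
    return (a * c - b * d, a * d + b * c)


def inverse(z):
    """Inverse of a nonzero z = a + bi: a/(a^2 + b^2) - (b/(a^2 + b^2))i."""
    a, b = (Fraction(v) for v in z)
    n = a * a + b * b
    if n == 0:
        raise ZeroDivisionError("0 has no multiplicative inverse")
    return (a / n, -b / n)


z = (2, -3)
z_inv = inverse(z)
product = mul(z, z_inv)   # should be the multiplicative identity 1 + 0i
```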
Remark: By Note 1 above, we see that (ℝ, +, ⋅) is a subfield of (ℂ, +, ⋅). That is, ℝ ⊆ ℂ and (ℝ, +, ⋅)
is a field with respect to the field operations of (ℂ, +, ⋅). (In other words, we don’t need to “change”
the definition of addition or multiplication to get the appropriate operations in ℝ; the operations are
already behaving correctly.) Subfields will be covered in more detail in Lesson 11.
Subtraction: If z, w ∈ ℂ, with z = a + bi and w = c + di, then we define the difference z − w by
z − w = z + (–w) = (a + bi) + (–c − di) = (a − c) + (b − d)i.
As a point, this difference is (a − c, b − d). Here is an example illustrating how subtraction works using
the computation (1 + 2i) − (2 − i) = –1 + 3i.
Observe how we first replaced 2 − i by –2 + i so that we could change the subtraction problem to the
addition problem: (1 + 2i) + (–2 + i). We then formed a parallelogram using 1 + 2i and –2 + i as
edges, and finally, drew the diagonal of that parallelogram to see the result.
Division: If z ∈ ℂ and w ∈ ℂ* with z = a + bi and w = c + di, then we define the quotient z/w by
z/w = zw⁻¹ = (a + bi)(c/(c² + d²) − (d/(c² + d²))i) = (ac + bd)/(c² + d²) + ((bc − ad)/(c² + d²))i.
The definition of division in a field unfortunately led to a messy looking formula. However, when
actually performing division, there is an easier way to think about it, as we will see below.
The conjugate of the complex number z = a + bi is the complex number z̄ = a − bi.
Notes: (1) To take the conjugate of a complex number, we simply negate the imaginary part of the
number and leave the real part as it is.
(2) If z = a + bi ≠ 0, then at least one of a or b is not zero. It follows that z̄ = a − bi is also not 0.
(3) The product of a complex number with its conjugate is always a nonnegative real number.
Specifically, if z = a + bi, then z z̄ = (a + bi)(a − bi) = (a² + b²) + (–ab + ab)i = a² + b².
(4) We can change the quotient z/w to standard form by multiplying the numerator and denominator by
w̄, the conjugate of w. So, if z = a + bi and w = c + di, then we have
z/w = (z w̄)/(w w̄) = ((a + bi)(c − di))/((c + di)(c − di)) = ((ac + bd) + (bc − ad)i)/(c² + d²) = (ac + bd)/(c² + d²) + ((bc − ad)/(c² + d²))i.
Example 7.2: Let z = 2 − 3i and w = –1 + 5i. Then
z̄ = 2 + 3i.
w̄ = –1 − 5i.
z/w = (z w̄)/(w w̄) = ((2 − 3i)(–1 − 5i))/((–1 + 5i)(–1 − 5i)) = ((–2 − 15) + (–10 + 3)i)/((–1)² + 5²) = (–17 − 7i)/(1 + 25) = –17/26 − (7/26)i.
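The conjugate method for division can be sketched in a few lines of Python. This is only an illustration: the function name `divide` is hypothetical, and the sketch assumes integer real and imaginary parts so that exact fractions can be used.

```python
from fractions import Fraction

def divide(a, b, c, d):
    """Return (real part, imaginary part) of (a + bi)/(c + di).

    Multiplies numerator and denominator by the conjugate c - di,
    so the denominator becomes the real number c^2 + d^2.
    Assumes a, b, c, d are integers (an assumption of this sketch).
    """
    denom = c * c + d * d
    if denom == 0:
        raise ZeroDivisionError("w must be nonzero")
    real = Fraction(a * c + b * d, denom)
    imag = Fraction(b * c - a * d, denom)
    return real, imag

# Example 7.2: z = 2 - 3i and w = -1 + 5i
real, imag = divide(2, -3, -1, 5)
print(real, imag)  # -17/26 -7/26
```

The result agrees with the standard form –17/26 − (7/26)i computed by hand in Example 7.2.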
Recall from Lesson 5 that in an ordered field, if a > 0 and b > 0, then a + b > 0 (Order Property 1)
and ab > 0 (Order Property 2). Also, for every element a, exactly one of the following holds: a > 0,
a = 0, or a < 0 (Order Property 3).
Theorem 7.2: The field of complex numbers cannot be ordered.
Proof: Suppose toward contradiction that < is an ordering of (ℂ, +, ⋅). Since i ≠ 0, Order Property 3
tells us that either i > 0 or i < 0.
If i > 0, then –1 = i² = i ⋅ i > 0 by Order Property 2.
If i < 0, then –i > 0, and therefore, –1 = i² = (–1)(–1)i ⋅ i = (–1i)(–1i) = (–i)(–i) > 0, again by
Order Property 2.
So, –1 > 0 and it follows that 1 = (–1)(–1) > 0, again by Order Property 2. Therefore, we have
–1 > 0 and 1 > 0, violating Order Property 3. So, (ℂ, +, ⋅) cannot be ordered.
□
Absolute Value and Distance
If x and y are real or complex numbers such that y = x², then we call x a square root of y. If x is a
positive real number, then we say that x is the positive square root of y and we write x = √y.
For positive real numbers, we will use the square root symbol only for the positive square root of the
number. For complex numbers, we will use the square root symbol for the principal square root of the
number. The concept of principal square root will be explained in Lesson 15.
Example 7.3:
1. Since 2² = 4, 2 ∈ ℝ, and 2 > 0, we see that 2 is the positive square root of 4 and we write
2 = √4.
2. We have (–2)² = 4, but –2 < 0, and so we do not write –2 = √4. However, –2 is still a square
root of 4, and we can write –2 = –√4.
3. Since i² = –1, we see that i is a square root of –1.
4. Since (–i)² = (–i)(–i) = (–1)(–1)i² = 1(–1) = –1, we see that –i is also a square root of
–1.
5. (1 + i)² = (1 + i)(1 + i) = (1 − 1) + (1 + 1)i = 0 + 2i = 2i. So, 1 + i is a square root of 2i.
The absolute value or modulus of the complex number z = a + bi is the nonnegative real number
|z| = √(a² + b²) = √((Re z)² + (Im z)²).
Note: If z = a + 0i = a is a real number, then |a| = √(a²). This is equal to a if a ≥ 0 and –a if a < 0.
For example, |4| = √(4²) = √16 = 4 and |–4| = √((–4)²) = √16 = 4 = –(–4).
The statement “|a| = –a for a < 0” often confuses students. This confusion is understandable, as a
minus sign is usually used to indicate that an expression is negative, whereas here we are negating a
negative number to make it positive. Unfortunately, this is the simplest way to say, “delete the minus
sign in front of the number” using basic notation.
Geometrically, the absolute value of a complex number z is the distance between the point z and the
origin.
Example 7.4: Which of the following complex numbers is closest to the origin? 1 + 2i, –3 + i, or
–2 + 3i?
|1 + 2i| = √(1² + 2²) = √(1 + 4) = √5
|–3 + i| = √((–3)² + 1²) = √(9 + 1) = √10
|–2 + 3i| = √((–2)² + 3²) = √(4 + 9) = √13
Since √5 < √10 < √13, we see that 1 + 2i is closest to the origin.
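As a quick numerical check of Example 7.4, the following sketch computes each modulus directly from the definition and picks the smallest. The function name `modulus` and the dictionary `points` are illustrative choices, not notation from the text.

```python
import math

def modulus(a, b):
    """Absolute value of a + bi, computed as sqrt(a^2 + b^2)."""
    return math.sqrt(a * a + b * b)

# The three numbers from Example 7.4, stored as (real part, imaginary part).
points = {"1 + 2i": (1, 2), "-3 + i": (-3, 1), "-2 + 3i": (-2, 3)}

# The number closest to the origin is the one with the smallest modulus.
closest = min(points, key=lambda name: modulus(*points[name]))
print(closest)  # 1 + 2i
```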
Notes: (1) Here we have used the following theorem: If a, b ∈ ℝ⁺, then a < b if and only if a² < b².
To see this, observe that a² < b² if and only if b² − a² > 0 if and only if (b + a)(b − a) > 0. Since
a > 0 and b > 0, by Order Property 1, a + b > 0. It follows that a² < b² if and only if b − a > 0 if and
only if b > a if and only if a < b.
Applying this theorem to 5 < 10 < 13, we get √5 < √10 < √13.
(2) The definition of the absolute value of a complex number is motivated by the Pythagorean Theorem.
As an example, look at –3 + i in the figure below. Observe that to get from the origin to the point
(–3, 1), we move to the left 3 units and then up 1 unit. This gives us a right triangle with legs of lengths
3 and 1. By the Pythagorean Theorem, the hypotenuse has length √(3² + 1²) = √(9 + 1) = √10.
The distance between the complex numbers z = a + bi and w = c + di is
d(z, w) = |z − w| = √((a − c)² + (b − d)²).
Geometrically, we can translate the vector z − w so that the directed line segment begins at the
terminal point of w and ends at the terminal point of z. Let’s look one more time at the figure we drew
for (1 + 2i) − (2 − i) = –1 + 3i and then translate the solution vector as we just suggested.
[Figure: the vector –1 + 3i translated to run from the point 2 − i to the point 1 + 2i; its length is labeled |–1 + 3i|.]
Notice that the expression for the distance between two complex numbers follows from a simple
application of the Pythagorean Theorem. Let’s continue to use the same example to help us see this.
[Figure: the same picture with a right triangle drawn on the translated vector; the legs are labeled 1 and 3 and the hypotenuse is labeled √10.]
In the figure above, we can get the lengths of the legs of the triangle either by simply counting the
units, or by subtracting the appropriate coordinates. For example, the length of the horizontal leg is
2 − 1 = 1 and the length of the vertical leg is 2 − (–1) = 2 + 1 = 3. We can then use the Pythagorean
Theorem to get the length of the hypotenuse of the triangle: c = √(1² + 3²) = √(1 + 9) = √10.
Compare this geometric procedure to the formula for distance given above.
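The distance formula can also be checked with Python's built-in complex numbers. In this small sketch the function name `distance` is an illustrative choice; it compares |z − w| with the coordinate formula for the example z = 1 + 2i and w = 2 − i.

```python
import math

def distance(z, w):
    """Distance between complex numbers z and w, defined as |z - w|."""
    return abs(z - w)

z = complex(1, 2)    # 1 + 2i
w = complex(2, -1)   # 2 - i

# The same value from the coordinate formula sqrt((a - c)^2 + (b - d)^2).
by_coordinates = math.sqrt((z.real - w.real) ** 2 + (z.imag - w.imag) ** 2)

print(math.isclose(distance(z, w), by_coordinates))  # True
print(math.isclose(distance(z, w), math.sqrt(10)))   # True
```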
While we’re on the subject of triangles, the next theorem involving arbitrary triangles is very useful.
Theorem 7.3 (The Triangle Inequality): For all z, w ∈ ℂ, |z + w| ≤ |z| + |w|.
Geometrically, the Triangle Inequality says that the length of the third side of a triangle is less than or
equal to the sum of the lengths of the other two sides of the triangle. We leave the proof as an exercise
(see Problem 4 below).
As an example, let’s look at the sum (1 + 2i) + (–3 + i) = –2 + 3i. In Example 7.4, we computed
|1 + 2i| = √5, |–3 + i| = √10, and |–2 + 3i| = √13.
Note that √5 + √10 > √4 + √9 = 2 + 3 = 5, whereas √13 < √16 = 4. So, we see that
|(1 + 2i) + (–3 + i)| = |–2 + 3i| = √13 < 4 < 5 < √5 + √10 = |1 + 2i| + |–3 + i|.
In the following picture, there are two triangles. We’ve put dark bold lines around the leftmost triangle
and labeled the sides with their lengths.
[Figure: two triangles formed by the vectors 1 + 2i, –3 + i, and –2 + 3i; the sides of the leftmost triangle are labeled with their lengths, including |1 + 2i|.]
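A numerical spot check is no substitute for the proof requested in Problem 4, but it can build confidence in the statement. The sketch below tests the inequality on the example above and on randomly chosen complex numbers. The helper name `triangle_holds` and the small tolerance (added to absorb floating-point rounding) are assumptions of this sketch.

```python
import random

def triangle_holds(z, w, tol=1e-12):
    """Check |z + w| <= |z| + |w|, allowing a tiny floating-point tolerance."""
    return abs(z + w) <= abs(z) + abs(w) + tol

# The example from the text: (1 + 2i) + (-3 + i) = -2 + 3i.
print(triangle_holds(complex(1, 2), complex(-3, 1)))  # True

# Random spot checks with a fixed seed so the run is repeatable.
rng = random.Random(0)
samples = [
    (complex(rng.uniform(-10, 10), rng.uniform(-10, 10)),
     complex(rng.uniform(-10, 10), rng.uniform(-10, 10)))
    for _ in range(1000)
]
print(all(triangle_holds(z, w) for z, w in samples))  # True
```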
Basic Topology of ℂ
A circle in the Complex Plane is the set of all points that are at a fixed distance from a fixed point. The
fixed distance is called the radius of the circle and the fixed point is called the center of the circle.
If a circle has radius r > 0 and center c = a + bi, then any point z = x + yi on the circle must satisfy
|z − c| = r, or equivalently, (x − a)² + (y − b)² = r².
Note: The equation |z − c| = r says “The distance between z and c is equal to r.” In other words, the
distance between any point on the circle and the center of the circle is equal to the radius of the circle.
Example 7.5: The circle with equation |z + 2 − i| = 2 has center c = –(2 − i) = –2 + i and radius r = 2.
Note: |z + 2 − i| = |z − (–2 + i)|. So, if we rewrite the equation as |z − (–2 + i)| = 2, it is easy to
pick out the center and radius of the circle.
A picture of the circle is shown to the right. The center is labeled and a typical radius is drawn.
An open disk in ℂ consists of all the points in the interior of a circle. If a is the center of the open disk
and r is the radius of the open disk, then any point z inside the disk satisfies |z − a| < r.
N_r(a) = {z ∈ ℂ | |z − a| < r} is also called the r-neighborhood of a.
Example 7.6: N_2(–2 + i) = {z ∈ ℂ | |z + 2 − i| < 2} is the 2-neighborhood of –2 + i. It consists of
all points inside the circle |z + 2 − i| = 2.
Notes: (1) A picture of the 2-neighborhood of –2 + i is shown to the right. The center is labeled and a
typical radius is drawn. We drew the boundary of the disk with dashes to indicate that points on the
circle are not in the neighborhood and we shaded the interior of the disk to indicate that every point
inside the circle is in the neighborhood.
(2) The definitions of open disk and r-neighborhood of a also make sense in ℝ, but the geometry looks
a bit different. An open disk in ℝ is simply an open interval. If x and a are real numbers, then we have
x ∈ N_r(a) ⇔ |x − a| < r ⇔ √((x − a)²) < r ⇔ 0 ≤ (x − a)² < r²
⇔ –r < x − a < r ⇔ a − r < x < a + r ⇔ x ∈ (a − r, a + r).
So, in ℝ, an r-neighborhood of a is the open interval N_r(a) = (a − r, a + r). Notice that the length
(or diameter) of this interval is 2r.
As an example, let’s draw a picture of N_2(1) = (1 − 2, 1 + 2) = (–1, 3). Observe that the center of
this open disk (or open interval or neighborhood) in ℝ is the real number 1, the radius of the open disk
is 2, and the diameter of the open disk (or length of the interval) is 4.
A closed disk is the interior of a circle together with the circle itself (the boundary is included). If a is
the center of the closed disk and r is the radius of the closed disk, then any point z inside the closed
disk satisfies |z − a| ≤ r.
Notes: (1) In this case, the circle itself would be drawn solid to indicate that all points on the circle are
included.
(2) Just like an open disk in ℝ is an open interval, a closed disk in ℝ is a closed interval.
(3) The reader is encouraged to draw a few open and closed disks in both ℂ and ℝ, and to write down
the corresponding sets of points using set-builder notation and, in the case of ℝ, interval notation.
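The difference between open and closed disks comes down to a strict versus a non-strict inequality, which a short sketch makes concrete. The function names `in_open_disk` and `in_closed_disk` are illustrative; the test point i is chosen because it lies exactly on the circle |z + 2 − i| = 2.

```python
def in_open_disk(z, a, r):
    """True if z lies in the open disk with center a and radius r."""
    return abs(z - a) < r

def in_closed_disk(z, a, r):
    """True if z lies in the closed disk with center a and radius r."""
    return abs(z - a) <= r

center = complex(-2, 1)   # the center -2 + i from Examples 7.5 and 7.6
on_circle = complex(0, 1) # the point i, at distance exactly 2 from the center

print(in_open_disk(center, center, 2))       # True: the center is inside
print(in_open_disk(on_circle, center, 2))    # False: boundary points are excluded
print(in_closed_disk(on_circle, center, 2))  # True: boundary points are included

# The same functions work for real numbers, where the disks are intervals.
print(in_open_disk(3, 1, 2), in_closed_disk(3, 1, 2))  # False True
```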
A punctured open disk consists of all the points in the interior of a circle except for the center of the
circle. If a is the center of the punctured open disk and r is the radius of the open disk, then any point
z inside the punctured disk satisfies |z − a| < r and z ≠ a.
Note that z ≠ a is equivalent to z − a ≠ 0. In turn, this is equivalent to |z − a| ≠ 0. Since |z − a| must
be nonnegative, |z − a| ≠ 0 is equivalent to |z − a| > 0 or 0 < |z − a|.
Therefore, a punctured open disk with center a and radius r consists of all points z that satisfy
0 < |z − a| < r.
N_r^⨀(a) = {z | 0 < |z − a| < r} is also called a deleted r-neighborhood of a.
Example 7.7: N_2^⨀(–2 + i) = {z ∈ ℂ | 0 < |z + 2 − i| < 2} is the deleted 2-neighborhood of –2 + i.
It consists of all points inside the circle |z + 2 − i| = 2, except for –2 + i.
Notes: (1) A picture of the deleted 2-neighborhood of –2 + i is shown to the right. Notice that this
time we excluded the center of the disk –2 + i, as this point is not included in the set.
(2) In ℝ, we have
N_r^⨀(a) = (a − r, a + r) ∖ {a} = (a − r, a) ∪ (a, a + r).
This is the open interval centered at a of length (or diameter) 2r with a removed.
Let’s draw a picture of N_2^⨀(1) = (–1, 3) ∖ {1} = (–1, 1) ∪ (1, 3).
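The condition 0 < |z − a| < r translates directly into a chained comparison. In this brief sketch the function name `in_deleted_neighborhood` is a hypothetical choice; the test values come from the deleted 2-neighborhood of 1 in the real numbers, which is (–1, 1) ∪ (1, 3).

```python
def in_deleted_neighborhood(z, a, r):
    """True if 0 < |z - a| < r, so z is within r of a but is not a itself."""
    return 0 < abs(z - a) < r

# Deleted 2-neighborhood of 1 in the reals.
print(in_deleted_neighborhood(0, 1, 2))   # True: 0 is in (-1, 1)
print(in_deleted_neighborhood(1, 1, 2))   # False: the center is removed
print(in_deleted_neighborhood(3, 1, 2))   # False: 3 is an excluded endpoint

# Deleted 2-neighborhood of -2 + i in the complex numbers.
print(in_deleted_neighborhood(complex(-2, 1), complex(-2, 1), 2))  # False
print(in_deleted_neighborhood(complex(-1, 1), complex(-2, 1), 2))  # True
```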
(3) Notice how all the topological definitions we are presenting make sense in both ℝ and ℂ, but the
geometry in each case looks different. You will continue to see this happen. In fact, these definitions
make sense for many, many sets and structures, all with their own “look.” In general, topology allows
us to make definitions and prove theorems that can be applied very broadly and used in many (if not
all) branches of mathematics.
A subset X of ℂ is said to be open if for every complex number z ∈ X, there is an open disk D with
z ∈ D and D ⊆ X.
In words, a set is open in ℂ if every point in the set has “space” all around it inside the set. If you think
of each point in the set as an animal, then each animal in the set should be able to move a little in any
direction it chooses without leaving the set. Another way to think of this is that no number is right on
“the edge” or “the boundary” of the set, about to fall out of it.
Example 7.8:
1. Every open disk D is an open set. To see this, simply observe that if z ∈ D, then D itself is an
open disk with z ∈ D and D ⊆ D.
2. A closed disk is not an open set because it contains its “boundary.” As an example, let’s look at
the closed unit disk D = {z ∈ ℂ | |z| ≤ 1}. Let’s focus on the point i. First note that i ∈ D
because |i| = √(0² + 1²) = √1 = 1 and 1 ≤ 1. Now, any open disk N containing i will contain
points above i. Let’s say (1 + ε)i ∈ N for some positive real number ε. Now, we have
|(1 + ε)i| = √(0² + (1 + ε)²) = 1 + ε, which is greater than 1. Therefore, (1 + ε)i ∉ D. It
follows that N ⊈ D, and so, D is not open.
3. We can use reasoning similar to that used in 2 to see that if we take any subset of a disk that
contains any points on the bounding circle, then that set will not be open.
4. ∅ and ℂ are both open. You will be asked to prove this in Problem 7 below (parts (i) and (ii)).
As we mentioned in Lesson 6 right before Theorem 6.5, many authors define “open” in a slightly
different way from the definition we’ve been using. Once again, let’s show that the definition we have
been using is equivalent to theirs.
Theorem 7.4: A subset X of ℂ is open if and only if for every complex number w ∈ X, there is a positive
real number d such that N_d(w) ⊆ X.
Analysis: The harder direction of the proof is showing that if X is open, then for every complex number
w ∈ X, there is a positive real number d such that N_d(w) ⊆ X.
To see this, suppose that X is open and let w ∈ X. Then there is an open disk D = {z ∈ ℂ | |z − a| < r}
with w ∈ D and D ⊆ X. We want to replace the disk D with a disk that has w right in the center.
To accomplish this, we let c be the distance from w to a. Then r − c is the distance from w to the
boundary of D. We will show that the disk with center w and radius r − c is a subset of D.
The picture to the right illustrates this idea. Notice that c + (r − c) = r, the radius of disk D.
Proof of Theorem 7.4: Let X be an open subset of ℂ and let w ∈ X. Then there is an open disk D with
w ∈ D and D ⊆ X.
Suppose that D has center a and radius r. So, D = {z ∈ ℂ | |z − a| < r}.
Let c = |w − a| and let d = r − c. Since w ∈ D, we have c < r, and so d > 0. We will show that
N_d(w) ⊆ D.
Let z ∈ N_d(w). Then |z − w| < d = r − c.
By the Triangle Inequality (and SACT; see Note 2 below),
|z − a| = |(z − w) + (w − a)| ≤ |z − w| + |w − a| < (r − c) + c = r.
So, z ∈ D. Since z was an arbitrary element of N_d(w), we showed that N_d(w) ⊆ D.
So, we have N_d(w) ⊆ D and D ⊆ X. By the transitivity of ⊆ (Theorem 2.3 from Lesson 2), we have
N_d(w) ⊆ X.
The converse is immediate since for w ∈ X, N_d(w) is an open disk containing w.
□
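The key step of the proof, that the disk of radius r − |w − a| centered at w stays inside D, can be illustrated numerically. The sketch below is only an illustration and proves nothing: the names `inner_radius` and `sample_in_disk`, the particular disk, and the point w are all hypothetical choices, and sampled points are kept slightly away from the boundary to avoid floating-point edge cases.

```python
import cmath
import random

def inner_radius(w, a, r):
    """Return d = r - |w - a|, the radius used in the proof (w must lie in D)."""
    c = abs(w - a)
    if c >= r:
        raise ValueError("w must lie inside the open disk")
    return r - c

def sample_in_disk(center, radius, rng):
    """Pick a point inside the open disk with the given center and radius."""
    rho = 0.999 * radius * rng.random()   # stay clear of the boundary
    theta = 2 * cmath.pi * rng.random()
    return center + cmath.rect(rho, theta)

a, r = complex(-2, 1), 2.0   # the disk D (illustrative values)
w = complex(-1, 1.5)         # a point of D that is not the center
d = inner_radius(w, a, r)

rng = random.Random(0)
# Every sampled point z with |z - w| < d should also satisfy |z - a| < r.
all_inside = all(abs(sample_in_disk(w, d, rng) - a) < r for _ in range(1000))
print(all_inside)  # True
```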
Notes: (1) The picture to the right shows how we used the Triangle Inequality. The three sides of the
triangle have lengths |z − w|, |w − a|, and |z − a|.
(2) Notice how we used SACT (the Standard Advanced Calculus Trick) here. Starting with z − a, we
wanted to make z − w and w − a “appear.” We were able to do this simply by subtracting and then
adding w between z and a. We often use this trick when applying the Triangle Inequality. SACT was
introduced in Lesson 4 (Note 7 following Example 4.5).
(3) The same proof used here can be used to prove Theorem 6.5. The geometry looks different (disks
and neighborhoods are open intervals instead of the interiors of circles, and points appear on the real
line instead of in the complex plane), but the argument is identical. Compare this proof to the proof we
used in Theorem 6.5.
A subset X of ℂ is said to be closed if ℂ ∖ X is open.
ℂ ∖ X is called the complement of X in ℂ, or simply the complement of X. It consists of all complex
numbers not in X.
Example 7.9:
1. Every closed disk is a closed set. For example, D = {z ∈ ℂ | |z| ≤ 1} is closed because its
complement in ℂ is ℂ ∖ D = {z ∈ ℂ | |z| > 1}. You will be asked to prove that this complement
is open in Problem 7 below (part (iii)).
2. If we take any subset of a closed disk that includes the interior of the disk, but is missing at least
one point on the bounding circle, then that set will not be closed. You will be asked to prove
this for the closed unit disk {z ∈ ℂ | |z| ≤ 1} in Problem 10 below.
3. ∅ is closed because ℂ ∖ ∅ = ℂ is open. ℂ is closed because ℂ ∖ ℂ = ∅ is open. ∅ and ℂ are the
only two sets of complex numbers that are both open and closed.
Problem Set 7
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Let z = –4 − i and w = 3 − 5i. Compute each of the following:
(i) z + w
(ii) zw
(iii) Im w
(iv) 2z − w
(v) w̄
(vi) z/w
(vii) |z|
(viii) the distance between z and w
LEVEL 2
2. Prove that (ℂ, +, ⋅) is a field.
3. Let z and w be complex numbers. Prove the following:
(i) Re z = (z + z̄)/2
(ii) Im z = (z − z̄)/(2i)
(iii) The conjugate of z + w is z̄ + w̄
(iv) The conjugate of zw is z̄ ⋅ w̄
(v) The conjugate of z/w is z̄/w̄
(vi) z z̄ = |z|²
(vii) |zw| = |z||w|
(viii) If w ≠ 0, then |z/w| = |z|/|w|
(ix) Re z ≤ |z|
(x) Im z ≤ |z|
LEVEL 3
4. Prove the Triangle Inequality (Theorem 7.3).
5. Let z and w be complex numbers. Prove ||z| − |w|| ≤ |z ± w| ≤ |z| + |w|.
6. A point w is an accumulation point of a set S of complex numbers if each deleted neighborhood
of w contains at least one point in S. Determine the accumulation points of each of the following
sets:
(i) {1/n | n ∈ ℤ⁺}
(ii) {i/n | n ∈ ℤ⁺}
(iii) {iⁿ | n ∈ ℤ⁺}
(iv) {iⁿ/n | n ∈ ℤ⁺}
(v) {z | |z| < 1}
(vi) {z | 0 < |z − 2| ≤ 3}
LEVEL 4
7. Determine if each of the following subsets of ℂ is open, closed, both, or neither. Give a proof in
each case.
(i) ∅
(ii) ℂ
(iii) {z ∈ ℂ | |z| > 1}
(iv) {z ∈ ℂ | Im z ≤ –2}
(v) {iⁿ | n ∈ ℤ⁺}
(vi) {z ∈ ℂ | 2 < |z − 2| < 4}
8. Prove the following:
(i) An arbitrary union of open sets in ℂ is an open set in ℂ.
(ii) A finite intersection of open sets in ℂ is an open set in ℂ.
(iii) An arbitrary intersection of closed sets in ℂ is a closed set in ℂ.
(iv) A finite union of closed sets in ℂ is a closed set in ℂ.
(v) Every open set in ℂ can be expressed as a union of open disks.
LEVEL 5
9. A complex number z is an interior point of a set S of complex numbers if there is a neighborhood
of z that contains only points in S, whereas w is a boundary point of S if each neighborhood of
w contains at least one point in S and one point not in S. Prove the following:
(i) A set S of complex numbers is open if and only if each point in S is an interior point of S.
(ii) A set of complex numbers is open if and only if it contains none of its boundary points.
(iii) A set of complex numbers is closed if and only if it contains all its boundary points.
10. Let D = {z ∈ ℂ | |z| ≤ 1} be the closed unit disk and let S be a subset of D that includes the
interior of the disk but is missing at least one point on the bounding circle of the disk. Show that
S is not a closed set.
11. Prove that a set of complex numbers is closed if and only if it contains all its accumulation points.
(See Problem 6 for the definition of an accumulation point.)
12. Prove that a set consisting of finitely many complex numbers is a closed set in ℂ. (Hint: Show
that a finite set has no accumulation points.)
LESSON 8 – LINEAR ALGEBRA
VECTOR SPACES
Vector Spaces Over Fields
Recall the following:
1. In previous lessons, we looked at three structures called fields: ℚ (the field of rational numbers),
ℝ (the field of real numbers), and ℂ (the field of complex numbers). Each of these fields comes
with two operations called addition and multiplication. Also, ℚ is a subfield of ℝ and ℝ is a
subfield of ℂ. This means that every rational number is a real number, every real number is a
complex number, and addition and multiplication in ℚ, ℝ, and ℂ all work the same way.
2. Fields have a particularly nice structure. When working in a field, we can perform all the
arithmetic and algebra that we remember from elementary and middle school. In particular,
we have closure, associativity, commutativity, identity elements, and inverse properties for
both addition and multiplication (with the exception that 0 has no multiplicative inverse), and
multiplication is distributive over addition.
3. The standard form of a complex number is a + bi, where a and b are real numbers. We add
two complex numbers using the rule (a + bi) + (c + di) = (a + c) + (b + d)i.
To give some motivation for the definition of a vector space, let’s begin with an example.
Example 8.1: Consider the set ℂ of complex numbers together with the usual definition of addition.
Let’s also consider another operation, which we will call scalar multiplication. For each k ∈ ℝ and
z = a + bi ∈ ℂ, we define kz to be ka + kbi.
The operation of scalar multiplication is a little different from other types of operations we have looked
at previously because instead of multiplying two elements from ℂ together, we are multiplying an
element of ℝ with an element of ℂ. In this case, we will call the elements of ℝ scalars.
Let’s observe that we have the following properties:
1. (ℂ, +) is a commutative group. In other words, for addition in ℂ, we have closure, associativity,
commutativity, an identity element (called 0), and the inverse property (the inverse of a + bi
is –a − bi). This follows immediately from the fact that (ℂ, +, ⋅) is a field. When we choose to
think of ℂ as a vector space, we will “forget about” the multiplication in ℂ, and just consider ℂ
together with addition. In doing so, we lose much of the field structure of the complex numbers,
but we retain the group structure of (ℂ, +).
2. ℂ is closed under scalar multiplication. That is, for all k ∈ ℝ and z ∈ ℂ, we have kz ∈ ℂ. To
see this, let z = a + bi ∈ ℂ and let k ∈ ℝ. Then, by definition, kz = ka + kbi. Since a, b ∈ ℝ,
and ℝ is closed under multiplication, ka ∈ ℝ and kb ∈ ℝ. It follows that ka + kbi ∈ ℂ.
3. 1z = z. To see this, consider 1 ∈ ℝ and let z = a + bi ∈ ℂ. Then, since 1 is the multiplicative
identity for ℝ, we have 1z = 1a + 1bi = a + bi = z.
4. For all j, k ∈ ℝ and z ∈ ℂ, (jk)z = j(kz) (Associativity of scalar multiplication). To see this,
let j, k ∈ ℝ and z = a + bi ∈ ℂ. Then since multiplication is associative in ℝ, we have
(jk)z = (jk)(a + bi) = (jk)a + (jk)bi = j(ka) + j(kb)i = j(ka + kbi) = j(kz).
5. For all k ∈ ℝ and z, w ∈ ℂ, k(z + w) = kz + kw (Distributivity of 1 scalar over 2 vectors). To
see this, let k ∈ ℝ and z = a + bi, w = c + di ∈ ℂ. Then since multiplication distributes over
addition in ℝ, we have
k(z + w) = k((a + bi) + (c + di)) = k((a + c) + (b + d)i) = k(a + c) + k(b + d)i
= (ka + kc) + (kb + kd)i = (ka + kbi) + (kc + kdi) = k(a + bi) + k(c + di) = kz + kw.
6. For all j, k ∈ ℝ and z ∈ ℂ, (j + k)z = jz + kz (Distributivity of 2 scalars over 1 vector). To see
this, let j, k ∈ ℝ and z = a + bi ∈ ℂ. Then since multiplication distributes over addition in ℝ,
we have
(j + k)z = (j + k)(a + bi) = (j + k)a + (j + k)bi = (ja + ka) + (jb + kb)i
= (ja + jbi) + (ka + kbi) = j(a + bi) + k(a + bi) = jz + kz.
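Properties 3 through 6 can be spot checked by machine. The sketch below represents a + bi as the pair (a, b) and uses exact rational numbers as a stand-in for the reals, which is an assumption made so that equality tests are exact. The function names `add`, `scale`, and `check_axioms` are illustrative, and passing such a check is evidence rather than a proof.

```python
from fractions import Fraction
import random

def add(z, w):
    """(a + bi) + (c + di), with complex numbers stored as pairs."""
    return (z[0] + w[0], z[1] + w[1])

def scale(k, z):
    """Scalar multiplication k(a + bi) = ka + kbi."""
    return (k * z[0], k * z[1])

def random_rational(rng):
    return Fraction(rng.randint(-20, 20), rng.randint(1, 20))

def check_axioms(trials=200, seed=0):
    """Spot check properties 3 to 6 on randomly chosen rational inputs."""
    rng = random.Random(seed)
    for _ in range(trials):
        j, k = random_rational(rng), random_rational(rng)
        z = (random_rational(rng), random_rational(rng))
        w = (random_rational(rng), random_rational(rng))
        if scale(1, z) != z:                                   # property 3
            return False
        if scale(j * k, z) != scale(j, scale(k, z)):           # property 4
            return False
        if scale(k, add(z, w)) != add(scale(k, z), scale(k, w)):  # property 5
            return False
        if scale(j + k, z) != add(scale(j, z), scale(k, z)):   # property 6
            return False
    return True

print(check_axioms())  # True
```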
Notes: (1) Since the properties listed in 1 through 6 above are satisfied, we say that ℂ is a vector space
over ℝ. We will give the formal definition of a vector space below.
(2) Note that a vector space consists of (i) a set of vectors (in this case ℂ), (ii) a field (in this case ℝ),
and (iii) two operations called addition and scalar multiplication.
(3) The operation of addition is a binary operation on the set of vectors, and the set of vectors together
with this binary operation forms a commutative group. In the previous example (Example 8.1), we have
that (ℂ, +) is a commutative group.
(4) Scalar multiplication is not a binary operation on the set of vectors. It takes pairs of the form (k, v),
where k is in the field and v is a vector, to a vector kv. Formally speaking, scalar multiplication is a
function f: 𝔽 × V → V, where 𝔽 is the field of scalars and V is the set of vectors (see the beginning of
Lesson 3 for a brief explanation of this notation).
(5) We started with the example of ℂ as a vector space over ℝ because it has a geometric interpretation
where we can draw simple pictures to visualize what the vector space looks like. Recall from Lesson 7
that we can think of the complex number a + bi as a directed line segment (which from now on we will
call a vector) in the complex plane that begins at the origin and terminates at the point (a, b).
For example, pictured to the right, we can see the vectors i = 0 + 1i, 1 + 2i, and 2 = 2 + 0i in the
complex plane.
We can visualize the sum of two vectors as the vector starting at the origin that is the diagonal of the
parallelogram formed from the original vectors. We see this in the first figure on the left below. In this
figure, we have removed the complex plane and focused on the vectors 1 + 2i and 2, together with
their sum (1 + 2i) + (2 + 0i) = (1 + 2) + (2 + 0)i = 3 + 2i.
A second way to visualize the sum of two vectors is to translate one of the vectors so that its initial
point coincides with the terminal point of the other vector. The sum of the two vectors is then the
vector whose initial point coincides with the initial point of the “unmoved” vector and whose terminal
point coincides with the terminal point of the “moved” vector. We see two ways to do this in the center
and rightmost figures below.
Technically speaking, the center figure shows the sum (1 + 2i) + 2 and the rightmost figure shows the
sum 2 + (1 + 2i). If we superimpose one figure on top of the other, we can see strong evidence that
commutativity holds for addition.
[Figures: three pictures of the vectors 1 + 2i and 2 and their sum 3 + 2i, each starting at (0, 0). Left: the parallelogram. Center and right: the two head-to-tail translations.]
We can visualize a scalar multiple of a vector as follows: (i) if k is a positive real number and z ∈ ℂ, then
the vector kz points in the same direction as z and has a length that is k times the length of z; (ii) if k
is a negative real number and z ∈ ℂ, then the vector kz points in the direction opposite of z and has a
length that is |k| times the length of z; (iii) if k = 0 and z ∈ ℂ, then kz is a point.
In the figures below, we have a vector z ∈ ℂ, together with several scalar multiples of z.
[Figures: a vector z together with the scalar multiples 2z, (1/2)z, –z = (–1)z, –(1/2)z, and –2z.]
We are now ready for the general definition of a vector space.
A vector space over a field 𝔽 is a set V together with a binary operation + on V (called addition) and
an operation called scalar multiplication satisfying:
(1) (V, +) is a commutative group.
(2) (Closure under scalar multiplication) For all k ∈ 𝔽 and v ∈ V, kv ∈ V.
(3) (Scalar multiplication identity) If 1 is the multiplicative identity of 𝔽 and v ∈ V, then 1v = v.
(4) (Associativity of scalar multiplication) For all j, k ∈ 𝔽 and v ∈ V, (jk)v = j(kv).
(5) (Distributivity of 1 scalar over 2 vectors) For all k ∈ 𝔽 and v, w ∈ V, k(v + w) = kv + kw.
(6) (Distributivity of 2 scalars over 1 vector) For all j, k ∈ 𝔽 and v ∈ V, (j + k)v = jv + kv.
Notes: (1) Recall from Lesson 3 that saying (V, +) is a commutative group means the following:
• (Closure) For all v, w ∈ V, v + w ∈ V.
• (Associativity) For all v, w, u ∈ V, (v + w) + u = v + (w + u).
• (Commutativity) For all v, w ∈ V, v + w = w + v.
• (Identity) There exists an element 0 ∈ V such that for all v ∈ V, 0 + v = v + 0 = v.
• (Inverse) For each v ∈ V, there is –v ∈ V such that v + (–v) = (–v) + v = 0.
(2) The fields that we are familiar with are ℚ (the field of rational numbers), ℝ (the field of real
numbers), and ℂ (the field of complex numbers). For our purposes here, we can always assume that 𝔽
is one of these three fields.
Let’s look at some basic examples of vector spaces.
Example 8.2:
1. Let ℝ² be the set of all ordered pairs of real numbers. That is, ℝ² = {(a, b) | a, b ∈ ℝ}. We
define addition by (a, b) + (c, d) = (a + c, b + d). We define scalar multiplication by
k(a, b) = (ka, kb) for each k ∈ ℝ. With these definitions, ℝ² is a vector space over ℝ.
Notice that ℝ² looks just like ℂ. In fact, (a, b) is sometimes used as another notation for a + bi.
Therefore, the verification that ℝ² is a vector space over ℝ is nearly identical to what we did in
Example 8.1 above.
We can visualize elements of ℝ² as points or vectors in a plane in exactly the same way that we
visualize complex numbers as points or vectors in the complex plane.
2. ℝ³ = {(a, b, c) | a, b, c ∈ ℝ} is a vector space over ℝ, where we define addition and scalar
multiplication by (a, b, c) + (d, e, f) = (a + d, b + e, c + f) and k(a, b, c) = (ka, kb, kc),
respectively.
We can visualize elements of ℝ³ as points in space in a way similar to visualizing elements of
ℝ² and ℂ as points in a plane.
3. More generally, we can let ℝⁿ = {(a₁, a₂, …, aₙ) | aᵢ ∈ ℝ for each i = 1, 2, …, n}. Then ℝⁿ is
a vector space over ℝ, where we define addition and scalar multiplication by
(a₁, a₂, …, aₙ) + (b₁, b₂, …, bₙ) = (a₁ + b₁, a₂ + b₂, …, aₙ + bₙ).
k(a₁, a₂, …, aₙ) = (ka₁, ka₂, …, kaₙ).
4. More generally still, if 𝔽 is any field (for our purposes, we can think of 𝔽 as ℚ, ℝ, or ℂ), we let
𝔽ⁿ = {(a₁, a₂, …, aₙ) | aᵢ ∈ 𝔽 for each i = 1, 2, …, n}. Then 𝔽ⁿ is a vector space over 𝔽, where
we define addition and scalar multiplication by
(a₁, a₂, …, aₙ) + (b₁, b₂, …, bₙ) = (a₁ + b₁, a₂ + b₂, …, aₙ + bₙ).
k(a₁, a₂, …, aₙ) = (ka₁, ka₂, …, kaₙ).
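The componentwise operations on n-tuples can be written out in a few lines. In this sketch Python tuples stand in for n-tuples and the function names `vec_add` and `scalar_mul` are illustrative. Because the same code works for rational, floating-point, and complex entries, it loosely mirrors the way the definition works over any of the fields ℚ, ℝ, or ℂ.

```python
def vec_add(u, v):
    """Componentwise sum of two n-tuples of the same length."""
    if len(u) != len(v):
        raise ValueError("both tuples must have the same length")
    return tuple(x + y for x, y in zip(u, v))

def scalar_mul(k, v):
    """Multiply every component of the n-tuple v by the scalar k."""
    return tuple(k * x for x in v)

print(vec_add((1, 2, 3), (4, 5, 6)))  # (5, 7, 9)
print(scalar_mul(2, (1, 2, 3)))       # (2, 4, 6)

# The same functions work with complex entries.
print(vec_add((1 + 2j, 3), (2, -1j)))  # ((3+2j), (3-1j))
```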
Notes: (1) Ordered pairs have the property that (a, b) = (c, d) if and only if a = c and b = d. So, for
example, (1, 2) ≠ (2, 1). Compare this to the unordered pair (or set) {1, 2}. Recall that a set is
determined by its elements and not the order in which the elements are listed. So, {1, 2} = {2, 1}.
We will learn more about ordered pairs in Lesson 10.
(2) (a₁, a₂, …, aₙ) is called an n-tuple. So, ℝⁿ consists of all n-tuples of elements from ℝ, and more
generally, 𝔽ⁿ consists of all n-tuples of elements from the field 𝔽.
For example, (3, 2 − i, √2 + √3i, –3i) ∈ ℂ⁴ and (1, 1/2, 1/3, 1/4, 1/5, 1/6, 1/7, 1/8) ∈ ℚ⁸ (and since
ℚ⁸ ⊆ ℝ⁸ ⊆ ℂ⁸, we can also say that this 8-tuple is in ℝ⁸ or ℂ⁸).
(3) Similar to what we said in Note 1, we have (a₁, a₂, …, aₙ) = (b₁, b₂, …, bₙ) if and only if aᵢ = bᵢ for
all i = 1, 2, …, n. So, for example, (2, 5, √2, √2) and (2, √2, 5, √2) are distinct elements from ℝ⁴.
(4) You will be asked to verify that 𝔽ⁿ is a vector space over the field 𝔽 in Problem 3 below. Unless
stated otherwise, from now on we will always consider the vector space 𝔽ⁿ to be over the field 𝔽.
Let’s look at a few other examples of vector spaces.
Example 8.3:
1. Let M be the set of all 2 × 2 matrices of real numbers, that is, the set of all arrays

   [ a  b ]
   [ c  d ]

with a, b, c, d ∈ ℝ. We add two matrices using the rule

   [ a  b ]   [ e  f ]   [ a + e  b + f ]
   [ c  d ] + [ g  h ] = [ c + g  d + h ],

and we multiply a matrix by a real number using the rule

     [ a  b ]   [ ka  kb ]
   k [ c  d ] = [ kc  kd ].

It is straightforward to check that M is a vector space over ℝ.
2. For m, n ∈ ℤ⁺, an m × n matrix over a field 𝔽 is a rectangular array with m rows and n columns,
and entries in 𝔽. For example, the matrix
$A = \begin{bmatrix} 5 & 2 & \frac{1}{5} \\ -3 & \sqrt{3} & 7 \end{bmatrix}$
is a 2 × 3 matrix over ℝ. We will generally use a capital letter to represent a matrix, and the
corresponding lowercase letter with double subscripts to represent the entries of the matrix. We use
the first subscript for the row and the second subscript for the column. Using the matrix A above as
an example, we see that $a_{21} = -3$ because the entry in row 2 and column 1 is –3. Similarly, we have
$a_{11} = 5$, $a_{12} = 2$, $a_{13} = \frac{1}{5}$, $a_{22} = \sqrt{3}$, and $a_{23} = 7$.
Let $M_{mn}^{\mathbb{F}}$ be the set of all m × n matrices over the field 𝔽. We add two matrices $A, B \in M_{mn}^{\mathbb{F}}$ to
get $A + B \in M_{mn}^{\mathbb{F}}$ using the rule $(a + b)_{ij} = a_{ij} + b_{ij}$. We multiply a matrix $A \in M_{mn}^{\mathbb{F}}$ by a
scalar $k \in \mathbb{F}$ using the rule $(ka)_{ij} = k a_{ij}$.
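For example, if we let A be the matrix above and $B = \begin{bmatrix} 2 & -5 & \frac{4}{5} \\ -1 & -\sqrt{3} & 1 \end{bmatrix}$, then we have
$A + B = \begin{bmatrix} 7 & -3 & 1 \\ -4 & 0 & 8 \end{bmatrix}$ and $2A = \begin{bmatrix} 10 & 4 & \frac{2}{5} \\ -6 & 2\sqrt{3} & 14 \end{bmatrix}$.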
Notice that we get the entry in the first row and first column of A + B as follows:
$(a + b)_{11} = a_{11} + b_{11} = 5 + 2 = 7$
Similarly, we get the other two entries in the first row like this:
$(a + b)_{12} = a_{12} + b_{12} = 2 + (-5) = -3$
$(a + b)_{13} = a_{13} + b_{13} = \frac{1}{5} + \frac{4}{5} = \frac{5}{5} = 1$
I leave it to the reader to write out the details for computing the entries in the second row of
A + B.
We get the entries in the first row of 2A as follows:
$(2a)_{11} = 2a_{11} = 2 \cdot 5 = 10$
$(2a)_{12} = 2a_{12} = 2 \cdot 2 = 4$
$(2a)_{13} = 2a_{13} = 2 \cdot \frac{1}{5} = \frac{2}{5}$
I leave it to the reader to write out the details for computing the entries in the second row of
2A.
With the operations of addition and scalar multiplication defined as we have above, it is not too
hard to show that $M_{mn}^{\mathbb{F}}$ is a vector space over 𝔽.
3. Let P = {ax² + bx + c | a, b, c ∈ ℝ} be the set of polynomials of degree at most 2 with real
coefficients. We define addition and scalar multiplication (with scalars in ℝ) on this set of
polynomials as follows:
(ax² + bx + c) + (dx² + ex + f) = (a + d)x² + (b + e)x + (c + f).
k(ax² + bx + c) = (ka)x² + (kb)x + (kc).
For example, if p(x) = 2x² + 3x − 5 and q(x) = –5x + 4, then p(x), q(x) ∈ P and we have
p(x) + q(x) = (2x² + 3x − 5) + (–5x + 4) = 2x² − 2x − 1.
3p(x) = 3(2x² + 3x − 5) = 6x² + 9x − 15.
It is straightforward to check that P is a vector space over ℝ.
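The entrywise rules in Example 8.3 can be tried out directly. The following is a minimal Python sketch, not part of the text's development: the function names are hypothetical, matrices are stored as lists of rows, and polynomials ax² + bx + c are stored as coefficient triples (a, b, c). Only the first rows of A and B are used, as 1 × 3 matrices, so that all entries are rational and the arithmetic is exact.

```python
from fractions import Fraction


def mat_add(a, b):
    """Entrywise sum of two matrices of the same size: (a + b)_ij = a_ij + b_ij."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def mat_scale(k, a):
    """Scalar multiple of a matrix: (ka)_ij = k * a_ij."""
    return [[k * x for x in row] for row in a]


def poly_add(p, q):
    """Add two polynomials stored as coefficient triples (a, b, c)."""
    return tuple(x + y for x, y in zip(p, q))


def poly_scale(k, p):
    """Multiply a polynomial stored as a coefficient triple by the scalar k."""
    return tuple(k * x for x in p)


# First rows of A and B from Example 8.3, viewed as 1 x 3 matrices over Q.
A_row = [[Fraction(5), Fraction(2), Fraction(1, 5)]]
B_row = [[Fraction(2), Fraction(-5), Fraction(4, 5)]]

sum_row = mat_add(A_row, B_row)     # first row of A + B
double_row = mat_scale(2, A_row)    # first row of 2A

# p(x) = 2x^2 + 3x - 5 and q(x) = -5x + 4 from part 3 of the example.
p = (2, 3, -5)
q = (0, -5, 4)
p_plus_q = poly_add(p, q)
three_p = poly_scale(3, p)

print(sum_row, double_row, p_plus_q, three_p)
```

Using Fraction instead of floating-point numbers keeps 1/5 + 4/5 equal to exactly 1, which matches the hand computation in the example.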
Subspaces
Let V be a vector space over a field 𝔽. A subset U of V is called a subspace of V, written U ≤ V, if it is
also a vector space with respect to the same operations of addition and scalar multiplication as they
were defined in V.
Notes: (1) Recall from Note 2 following Example 3.3 that a universal statement is a statement that
describes a property that is true for all elements without mentioning the existence of any new
elements. A universal statement begins with the quantifier ∀ (“For all”) and never includes the
quantifier ∃ (“There exists” or “There is”).
Properties defined by universal statements are closed downwards. This means that if a property
defined by a universal statement is true in V and U is a subset of V, then the property is true in U as
well.
For example, the statement for commutativity is ∀v, w(v + w = w + v). This is read “For all v and w,
v + w = w + v.” The quantifier ∀ is referring to whichever set we are considering. If we are thinking
about the set V, then we mean “For all v and w in V, v + w = w + v.” If we are thinking about the set
U, then we mean “For all v and w in U, v + w = w + v.”
If we assume that + is commutative in V and U ⊆ V, we can easily show that + is also commutative in
U. To see this, let v, w ∈ U. Since U ⊆ V, we have v, w ∈ V. Since + is commutative in V, we have
v + w = w + v. Since v and w were arbitrary elements in U, we see that + is commutative in U.
(2) Associativity, commutativity, and distributivity are all defined by universal statements, and
therefore, when checking if U is a subspace of V, we do not need to check any of these properties;
they will always be satisfied in the subset U.
(3) The identity property for addition is not defined by a universal statement. It begins with the
existential quantifier ∃ “There is.” Therefore, we do need to check that the identity 0 is in a subset U
of V when determining if U is a subspace of V. However, once we have checked that 0 is there, we do
not need to check that it satisfies the property of being an identity. As long as 0 ∈ U (the same 0 from
V), then it will behave as an identity because the defining property of 0 contains only the quantifier ∀.
(4) The inverse property for addition will always be true in a subset U of a vector space V that is closed
under scalar multiplication. To see this, we use the fact that –1v = –v for all v in a vector space (see
Problem 4 (iv) below).
(5) Since the multiplicative identity 1 comes from the field 𝔽 and not the vector space V, and we are
using the same field for the subset U, we do not need to check the scalar multiplication identity when
verifying that U is a subspace of V.
(6) The main issue when checking if a subset U of V is a subspace of V is closure. For example, we need
to make sure that whenever we add 2 vectors in U, we get a vector that is also in U. If we were to take
an arbitrary subset of V, then there is no reason this should happen. For example, let’s consider the
vector space ℂ over the field ℝ. Let A = {2 + bi | b ∈ ℝ}. A is a subset of ℂ, but A is not a subspace of
ℂ. To see this, we just need a single counterexample. 2 + i ∈ A, but (2 + i) + (2 + i) = 4 + 2i ∉ A
(because the real part is 4 and not 2).
(7) Notes 1 through 6 above tell us that to determine if a subset U of a vector space V is a subspace of
V, we need only check that 0 ∈ U, and U is closed under addition and scalar multiplication.
(8) The statements for closure, as we have written them, do look a lot like universal statements. For
example, the statement for closure under addition is “For all v, w ∈ V, v + w ∈ V.” The issue here is
that the set V is not allowed to be explicitly mentioned in the formula. It needs to be understood.
For example, we saw in Note 1 that the statement for commutativity can be written as
“∀v, w(v + w = w + v).” The quantifier ∀ (for all) can be applied to any set for which there is a notion
of addition defined. We also saw that if the statement is true in V, and U is a subset of V, then the
statement will be true in U.
With the statement of closure, to eliminate the set V from the formula, we would need to say
something like, “For all x and y, x + y exists.” However, there is no way to say “exists” using just logical
notation without talking about the set we wish to exist inside of.
We summarize these notes in the following theorem.
Theorem 8.1: Let V be a vector space over a field 𝔽 and let U ⊆ V. Then U ≤ V if and only if (i) 0 ∈ U,
(ii) for all v, w ∈ U, v + w ∈ U, and (iii) for all v ∈ U and k ∈ 𝔽, kv ∈ U.
Proof: Let V be a vector space over a field 𝔽, and U ⊆ V.
If U is a subspace of V, then by definition of U being a vector space, (i), (ii), and (iii) hold.
Now suppose that (i), (ii), and (iii) hold.
By (ii), + is a binary operation on U.
Associativity and commutativity of + are defined by universal statements, and therefore, since they
hold in V and U ⊆ V, they hold in U.
We are given that 0 ∈ U. If v ∈ U, then since U ⊆ V, v ∈ V. Since 0 is the additive identity for V,
0 + v = v + 0 = v. Since v ∈ U was arbitrary, the additive identity property holds in U.
Let v ∈ U. Since U ⊆ V, v ∈ V. Therefore, there is –v ∈ V such that v + (–v) = (–v) + v = 0. By (iii),
–1v ∈ U and by Problem 4 (part (iv)), –1v = –v. Since v ∈ U was arbitrary, the additive inverse
property holds in U.
So, (U, +) is a commutative group.
By (iii), U is closed under scalar multiplication.
Associativity of scalar multiplication and both types of distributivity are defined by universal
statements, and therefore, since they hold in V and U ⊆ V, they hold in U.
Finally, if v ∈ U, then since U ⊆ V, v ∈ V. So, 1v = v, and the scalar multiplication identity property
holds in U.
Therefore, U ≤ V.
□
Example 8.4:
1. Let V = ℝ² = {(a, b) | a, b ∈ ℝ} be the vector space over ℝ with the usual definitions of
addition and scalar multiplication, and let U = {(a, 0) | a ∈ ℝ}. If (a, 0) ∈ U, then a, 0 ∈ ℝ,
and so (a, 0) ∈ V. Thus, U ⊆ V. The 0 vector of V is (0, 0), which is in U. If (a, 0), (b, 0) ∈ U and
k ∈ ℝ, then (a, 0) + (b, 0) = (a + b, 0) ∈ U and k(a, 0) = (ka, 0) ∈ U. It follows from
Theorem 8.1 that U ≤ V.
This subspace U of ℝ² looks and behaves just like ℝ, the set of real numbers. More specifically,
we say that U is isomorphic to ℝ. Most mathematicians identify this subspace U of ℝ² with ℝ,
and just call it ℝ. See Lesson 11 for a precise definition of “isomorphic.”
In general, it is common practice for mathematicians to call various isomorphic copies of certain
structures by the same name. As a generalization of this example, if m < n, then we can say
ℝᵐ ≤ ℝⁿ by identifying (a₁, a₂, …, aₘ) ∈ ℝᵐ with the vector (a₁, a₂, …, aₘ, 0, 0, …, 0) ∈ ℝⁿ
that has a tail end of n − m zeros. For example, we may say that (2, √2, 7, –1/2, 0, 0, 0) is in ℝ⁴,
even though it is technically in ℝ⁷. With this type of identification, we have ℝ⁴ ≤ ℝ⁷.
2. Let ๐ = โ3 = {(๐, ๐, ๐) | ๐, ๐, ๐ ∈ โ} be the vector space over โ with the usual definitions of
addition and scalar multiplication and let ๐ = {(๐, ๐, ๐) ∈ โ3 | ๐ = ๐ + 2๐}. Let’s check that
๐ ≤ ๐.
It’s clear that ๐ ⊆ ๐. Since 0 = 0 + 2 ⋅ 0, we see that the zero vector (0, 0, 0) is in ๐. Let
(๐, ๐, ๐), (๐, ๐, ๐) ∈ ๐ and ๐ ∈ โ. Then we have
(๐, ๐, ๐) + (๐, ๐, ๐) = (๐, ๐, ๐ + 2๐) + (๐, ๐, ๐ + 2๐) = (๐ + ๐, ๐ + ๐, (๐ + ๐) + 2(๐ + ๐)).
๐(๐, ๐, ๐) = ๐(๐, ๐, ๐ + 2๐) = (๐๐, ๐๐, ๐๐ + 2๐๐).
These vectors are both in ๐, and so, by Theorem 8.1, ๐ ≤ ๐.
3. Consider V = ℂ as a vector space over ℝ in the usual way and let U = {z ∈ ℂ | Re z = 1}. Then
U ⊆ V, but U ≰ V because the zero vector is not in U. After all, 0 = 0 + 0i, and so, Re 0 = 0.
4. Let P = {ax² + bx + c | a, b, c ∈ ℝ} be the vector space of polynomials of degree at most 2 with
real coefficients over ℝ, and let U = {p(x) ∈ P | p(5) = 0}. Let’s check that U ≤ P (note that if
p(x) = ax² + bx + c, then p(5) = 25a + 5b + c).
It’s clear that U ⊆ P. The zero polynomial p(x) = 0 satisfies p(5) = 0, and so, the zero vector
is in U. Let p(x), q(x) ∈ U and k ∈ ℝ. Then we have p(5) + q(5) = 0 + 0 = 0, so that
p(x) + q(x) ∈ U, and we have kp(5) = k ⋅ 0 = 0, so that kp(x) ∈ U. By Theorem 8.1, U ≤ P.
5. Every vector space is a subspace of itself, and the vector space consisting of just the 0 vector
from the vector space V is a subspace of V.
In other words, for any vector space V, V ≤ V and {0} ≤ V.
The empty set, however, can never be a subspace of a vector space because it doesn’t contain
a zero vector.
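The three conditions of Theorem 8.1 lend themselves to a quick computational spot check. The sketch below is only an illustration under stated assumptions: the helper names are hypothetical, rational numbers (Python's Fraction) stand in for real scalars so that the arithmetic is exact, and random sampling can only find counterexamples. A result of None is evidence that the conditions hold, never a proof.

```python
import random
from fractions import Fraction


def vec_add(v, w):
    """Componentwise sum of two tuples of the same length."""
    return tuple(x + y for x, y in zip(v, w))


def vec_scale(k, v):
    """Multiply every component of the tuple v by the scalar k."""
    return tuple(k * x for x in v)


def rand_frac(rng):
    """A random rational number, used as a stand-in for a real scalar."""
    return Fraction(rng.randint(-20, 20), rng.randint(1, 10))


def spot_check_subspace(contains, zero, sample, trials=200, seed=0):
    """Return a message naming the first condition of Theorem 8.1 seen to fail, or None."""
    rng = random.Random(seed)
    if not contains(zero):
        return "(i) fails: zero vector is not in the subset"
    for _ in range(trials):
        v, w, k = sample(rng), sample(rng), rand_frac(rng)
        if not contains(vec_add(v, w)):
            return "(ii) fails: not closed under addition"
        if not contains(vec_scale(k, v)):
            return "(iii) fails: not closed under scalar multiplication"
    return None


# Part 2 of Example 8.4: triples (a, b, c) with c = a + 2b.
def sample_plane(rng):
    a, b = rand_frac(rng), rand_frac(rng)
    return (a, b, a + 2 * b)


plane_result = spot_check_subspace(
    lambda v: v[2] == v[0] + 2 * v[1], (0, 0, 0), sample_plane
)

# Part 3 of Example 8.4: complex numbers stored as (real part, imaginary part)
# with real part equal to 1.
line_result = spot_check_subspace(
    lambda z: z[0] == 1, (0, 0), lambda rng: (Fraction(1), rand_frac(rng))
)

print(plane_result)
print(line_result)
```

The first check finds no failure, in line with part 2 of the example, and the second stops at condition (i), as in part 3.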
Theorem 8.2: Let V be a vector space over a field 𝔽 and let U and W be subspaces of V. Then U ∩ W
is a subspace of V.
Proof: Let V be a vector space over a field 𝔽 and let U and W be subspaces of V. Since U ≤ V, 0 ∈ U.
Since W ≤ V, 0 ∈ W. So, 0 ∈ U ∩ W. Let v, w ∈ U ∩ W. So, v, w ∈ U and v, w ∈ W. Since U ≤ V and
W ≤ V, v + w ∈ U and v + w ∈ W. Therefore, v + w ∈ U ∩ W. Let v ∈ U ∩ W and k ∈ 𝔽. Then
v ∈ U and v ∈ W. Since U ≤ V and W ≤ V, kv ∈ U and kv ∈ W. So, kv ∈ U ∩ W. By Theorem 8.1,
U ∩ W ≤ V.
□
Bases
Let V be a vector space over a field 𝔽, let v, w ∈ V, and j, k ∈ 𝔽. The expression jv + kw is called a
linear combination of the vectors v and w. We call the scalars j and k weights.
Example 8.5: Let V = ℝ² = {(a, b) | a, b ∈ ℝ} be the vector space over ℝ with the usual definitions
of addition and scalar multiplication. Let v = (1, 0), w = (0, 1), j = 4, and k = –2. We have
jv + kw = 4(1, 0) − 2(0, 1) = (4, 0) + (0, –2) = (4, –2).
It follows that the vector (4, –2) is a linear combination of the vectors (1, 0) and (0, 1) with weights 4
and –2, respectively.
If v, w ∈ V, where V is a vector space over a field 𝔽, then the set of all linear combinations of v and
w is called the span of v and w. Symbolically, we have span{v, w} = {jv + kw | j, k ∈ 𝔽}.
Example 8.6: In Example 8.5, we saw that (4, –2) can be written as a linear combination of the vectors
(1, 0) and (0, 1). It follows that (4, –2) ∈ span{(1, 0), (0, 1)}.
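The computation in Example 8.5 can be reproduced with a few lines of Python. This is a small sketch with a hypothetical function name; vectors are stored as tuples of equal length and weights as a list.

```python
def linear_combination(weights, vectors):
    """Return k1*v1 + k2*v2 + ... + kn*vn for vectors given as equal-length tuples."""
    dim = len(vectors[0])
    return tuple(
        sum(k * v[i] for k, v in zip(weights, vectors)) for i in range(dim)
    )


# Example 8.5: weights 4 and -2 applied to (1, 0) and (0, 1).
result = linear_combination([4, -2], [(1, 0), (0, 1)])
print(result)
```

Because the function returns one specific element of span{(1, 0), (0, 1)}, exhibiting weights that produce a given vector is exactly what it takes to show that the vector lies in the span.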
Theorem 8.3: Let V = ℝ² = {(a, b) | a, b ∈ ℝ} be the vector space over ℝ with the usual definitions
of addition and scalar multiplication. Then span{(1, 0), (0, 1)} = ℝ².
Proof: Let v ∈ span{(1, 0), (0, 1)}. Then there are weights a, b ∈ ℝ with v = a(1, 0) + b(0, 1). So, we
have v = a(1, 0) + b(0, 1) = (a, 0) + (0, b) = (a, b). Since a, b ∈ ℝ, we have v = (a, b) ∈ ℝ². Since
v ∈ span{(1, 0), (0, 1)} was arbitrary, span{(1, 0), (0, 1)} ⊆ ℝ².
Now, let v ∈ ℝ². Then there are a, b ∈ ℝ with v = (a, b) = (a, 0) + (0, b) = a(1, 0) + b(0, 1). Since
we have expressed v as a linear combination of (1, 0) and (0, 1), we see that v ∈ span{(1, 0), (0, 1)}.
Since v ∈ ℝ² was arbitrary, ℝ² ⊆ span{(1, 0), (0, 1)}.
Since span{(1, 0), (0, 1)} ⊆ ℝ² and ℝ² ⊆ span{(1, 0), (0, 1)}, we have span{(1, 0), (0, 1)} = ℝ².
□
If v, w ∈ V, where V is a vector space over a field 𝔽, then we say that v and w are linearly
independent if neither vector is a scalar multiple of the other one. Otherwise, we say that v and w are
linearly dependent.
Example 8.7:
1. The vectors (1, 0) and (0, 1) are linearly independent in ℝ² because for any k ∈ ℝ, we have
k(1, 0) = (k, 0) ≠ (0, 1) and k(0, 1) = (0, k) ≠ (1, 0).
2. The vectors (1, 2) and (–3, –6) are linearly dependent in ℝ² because (–3, –6) = –3(1, 2).
If v, w ∈ V, where V is a vector space over a field 𝔽, then we say that {v, w} is a basis of V if v and w
are linearly independent and span{v, w} = V.
Example 8.8:
1. In Example 8.7, we saw that the vectors (1, 0) and (0, 1) are linearly independent in ℝ². By
Theorem 8.3, span{(1, 0), (0, 1)} = ℝ². It follows that {(1, 0), (0, 1)} is a basis of ℝ².
2. In Example 8.7, we saw that the vectors (1, 2) and (–3, –6) are linearly dependent in ℝ². It
follows that {(1, 2), (–3, –6)} is not a basis of ℝ².
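For two vectors in the plane, the scalar-multiple condition can be tested without searching for the scalar. The sketch below uses a shortcut that does not appear in the text: v = (v₁, v₂) and w = (w₁, w₂) are linearly dependent exactly when v₁w₂ − v₂w₁ = 0. The function name is hypothetical, and the test is meant for exact entries such as integers or fractions.

```python
def dependent_in_plane(v, w):
    """True when one of the pairs v, w is a scalar multiple of the other.

    Shortcut (an assumption of this sketch, not a result from the text):
    v and w are linearly dependent exactly when v[0]*w[1] - v[1]*w[0] == 0.
    """
    return v[0] * w[1] - v[1] * w[0] == 0


# The two pairs of vectors from Example 8.7.
first_pair = dependent_in_plane((1, 0), (0, 1))
second_pair = dependent_in_plane((1, 2), (-3, -6))
print(first_pair, second_pair)
```

To see why the shortcut is reasonable, note that if w = kv, then v₁w₂ − v₂w₁ = kv₁v₂ − kv₂v₁ = 0. Conversely, if the quantity is 0 and, say, v₁ ≠ 0, then w = (w₁/v₁)v.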
We would like to generalize the notion of linear dependence to more than two vectors. The definition
of one vector being a scalar multiple of the other isn’t quite good enough to do that. The following
theorem gives us an alternative definition of linear dependence that generalizes nicely.
Theorem 8.4: Let V be a vector space over a field 𝔽 and let v, w ∈ V. Then v and w are linearly
dependent if and only if there are j, k ∈ 𝔽, not both 0, such that jv + kw = 0.
Proof: Let v, w ∈ V, and suppose that v and w are linearly dependent. Then one vector is a scalar
multiple of the other. Without loss of generality, we may assume that there is c ∈ 𝔽 with v = cw. Then
we have 1v + (–c)w = 0. So, if we let j = 1 and k = –c, then jv + kw = 0, and j = 1 ≠ 0.
Now suppose that there are j, k ∈ 𝔽, not both 0, such that jv + kw = 0. Without loss of generality,
assume that j ≠ 0. Then we have jv = –kw, and so, v = –(k/j)w. So, v is a scalar multiple of w.
Therefore, v and w are linearly dependent.
□
Note: See the Note following Theorem 6.6 in Lesson 6 for an explanation of the expression “Without
loss of generality,” and how to properly use it in a proof.
We will now extend the notions of linear dependence and independence to more than two vectors.
Let V be a vector space over a field 𝔽, let v₁, v₂, …, vₙ ∈ V, and k₁, k₂, …, kₙ ∈ 𝔽. The expression
k₁v₁ + k₂v₂ + ⋯ + kₙvₙ is called a linear combination of the vectors v₁, v₂, …, vₙ. We call the scalars
k₁, k₂, …, kₙ weights.
Example 8.9: Let V = ℝ³ = {(a, b, c) | a, b, c ∈ ℝ} be the vector space over ℝ with the usual
definitions of addition and scalar multiplication. Let v₁ = (1, 0, 0), v₂ = (0, 1, 0), v₃ = (0, 0, 1),
k₁ = 3, k₂ = –5, and k₃ = 6. We have
k₁v₁ + k₂v₂ + k₃v₃ = 3(1, 0, 0) − 5(0, 1, 0) + 6(0, 0, 1)
= (3, 0, 0) + (0, –5, 0) + (0, 0, 6) = (3, –5, 6).
It follows that the vector (3, –5, 6) is a linear combination of the vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1)
with weights 3, –5, and 6, respectively.
If v₁, v₂, …, vₙ ∈ V, where V is a vector space over a field 𝔽, then the set of all linear combinations of
v₁, v₂, …, vₙ is called the span of v₁, v₂, …, vₙ. Symbolically, we have
span{v₁, v₂, …, vₙ} = {k₁v₁ + k₂v₂ + ⋯ + kₙvₙ | k₁, k₂, …, kₙ ∈ 𝔽}.
Example 8.10: In Example 8.9, we saw that (3, –5, 6) can be written as a linear combination of the
vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1). It follows that (3, –5, 6) ∈ span{(1, 0, 0), (0, 1, 0), (0, 0, 1)}.
Theorem 8.5: Let V = ℝⁿ = {(a₁, a₂, …, aₙ) | a₁, a₂, …, aₙ ∈ ℝ} be the vector space over ℝ with the
usual definitions of addition and scalar multiplication. Then
span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)} = ℝⁿ.
Proof: Let v ∈ span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)}. Then there are weights
k₁, k₂, …, kₙ ∈ ℝ with v = k₁(1, 0, 0, …, 0) + k₂(0, 1, 0, …, 0) + ⋯ + kₙ(0, 0, 0, …, 1). So, we have
v = (k₁, 0, 0, …, 0) + (0, k₂, 0, …, 0) + ⋯ + (0, 0, 0, …, kₙ) = (k₁, k₂, …, kₙ). Since k₁, k₂, …, kₙ ∈ ℝ,
we have v = (k₁, k₂, …, kₙ) ∈ ℝⁿ. Since v ∈ span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)} was
arbitrary, span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)} ⊆ ℝⁿ.
Now, let v ∈ ℝⁿ. Then there are a₁, a₂, …, aₙ ∈ ℝ with
v = (a₁, a₂, …, aₙ) = (a₁, 0, 0, …, 0) + (0, a₂, 0, …, 0) + ⋯ + (0, 0, 0, …, aₙ)
= a₁(1, 0, 0, …, 0) + a₂(0, 1, 0, …, 0) + ⋯ + aₙ(0, 0, 0, …, 1).
Since we have expressed v as a linear combination of (1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1),
we see that v ∈ span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)}. Since v ∈ ℝⁿ was arbitrary, we
have ℝⁿ ⊆ span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)}.
Therefore, span{(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)} = ℝⁿ.
□
If v₁, v₂, …, vₙ ∈ V, where V is a vector space over a field 𝔽, then we say that v₁, v₂, …, vₙ are linearly
dependent if there exist weights k₁, k₂, …, kₙ ∈ 𝔽, with at least one weight nonzero, such that
k₁v₁ + k₂v₂ + ⋯ + kₙvₙ = 0. Otherwise, we say that v₁, v₂, …, vₙ are linearly independent.
Notes: (1) v₁, v₂, …, vₙ are linearly independent if whenever we write k₁v₁ + k₂v₂ + ⋯ + kₙvₙ = 0,
it follows that all the weights k₁, k₂, …, kₙ are 0.
(2) We will sometimes call the expression k₁v₁ + k₂v₂ + ⋯ + kₙvₙ = 0 a dependence relation. If any
of the weights k₁, k₂, …, kₙ are nonzero, then we say that the dependence relation is nontrivial.
Example 8.11:
1. The three vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1) are linearly independent in ℝ³. To see this,
note that we have
k₁(1, 0, 0) + k₂(0, 1, 0) + k₃(0, 0, 1) = (k₁, 0, 0) + (0, k₂, 0) + (0, 0, k₃) = (k₁, k₂, k₃).
So, k₁(1, 0, 0) + k₂(0, 1, 0) + k₃(0, 0, 1) = (0, 0, 0) if and only if (k₁, k₂, k₃) = (0, 0, 0) if and
only if k₁ = 0, k₂ = 0, and k₃ = 0.
2. A similar computation shows that the n vectors (1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)
are linearly independent in ℝⁿ.
3. The vectors (1, 2, 3), (–2, 4, 3), and (1, 10, 12) are linearly dependent in ℝ³. To see this, note
that 3(1, 2, 3) + (–2, 4, 3) = (3, 6, 9) + (–2, 4, 3) = (1, 10, 12), and therefore,
3(1, 2, 3) + (–2, 4, 3) − (1, 10, 12) = 0.
This gives us a nontrivial dependence relation because we have at least one nonzero weight (in
fact, all three weights are nonzero). The weights are 3, 1, and –1.
If v₁, v₂, …, vₙ ∈ V, where V is a vector space over a field 𝔽, then we say that {v₁, v₂, …, vₙ} is a basis
of V if v₁, v₂, …, vₙ are linearly independent and span{v₁, v₂, …, vₙ} = V.
Example 8.12:
1. In Example 8.11, we saw that the vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1) are linearly
independent in ℝ³. By Theorem 8.5, span{(1, 0, 0), (0, 1, 0), (0, 0, 1)} = ℝ³. It follows that
{(1, 0, 0), (0, 1, 0), (0, 0, 1)} is a basis of ℝ³.
Similarly, {(1, 0, 0, …, 0), (0, 1, 0, …, 0), …, (0, 0, 0, …, 1)} is a basis of ℝⁿ.
2. In Example 8.11, we saw that the vectors (1, 2, 3), (–2, 4, 3), and (1, 10, 12) are linearly
dependent in ℝ³. It follows that {(1, 2, 3), (–2, 4, 3), (1, 10, 12)} is not a basis of ℝ³.
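Checking linear independence for more than two vectors by hand means solving for the weights. One mechanical way to do this, which is not the method used in the text, is Gaussian elimination: a list of vectors is linearly independent exactly when row reduction produces as many pivots as there are vectors. The sketch below assumes that standard fact, uses hypothetical function names, and works over the rationals with Python's Fraction so that no rounding occurs.

```python
from fractions import Fraction


def rank(vectors):
    """Number of pivots found when the vectors are row reduced (Gaussian elimination)."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    pivots = 0
    for col in range(len(rows[0])):
        # Look for a row, at or below the current pivot position, that is nonzero here.
        pivot_row = next(
            (r for r in range(pivots, len(rows)) if rows[r][col] != 0), None
        )
        if pivot_row is None:
            continue
        rows[pivots], rows[pivot_row] = rows[pivot_row], rows[pivots]
        # Clear the entries below the pivot.
        for r in range(pivots + 1, len(rows)):
            factor = rows[r][col] / rows[pivots][col]
            rows[r] = [x - factor * y for x, y in zip(rows[r], rows[pivots])]
        pivots += 1
        if pivots == len(rows):
            break
    return pivots


def linearly_independent(vectors):
    """True when no nontrivial dependence relation exists among the vectors."""
    return rank(vectors) == len(vectors)


standard = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]
dependent = [(1, 2, 3), (-2, 4, 3), (1, 10, 12)]

standard_ok = linearly_independent(standard)
dependent_ok = linearly_independent(dependent)

# The dependence relation from Example 8.11, with weights 3, 1, and -1.
relation = tuple(3 * a + 1 * b - 1 * c for a, b, c in zip(*dependent))

print(standard_ok, dependent_ok, relation)
```

The relation computed at the end is the one found by hand in part 3 of Example 8.11, and it comes out as the zero vector.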
Problem Set 8
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Determine if each of the following subsets of โ2 is a subspace of โ2 :
(i)
๐ด = {(๐ฅ, ๐ฆ) | ๐ฅ + ๐ฆ = 0}
(ii)
๐ต = {(๐ฅ, ๐ฆ) | ๐ฅ๐ฆ = 0}
(iii) ๐ถ = {(๐ฅ, ๐ฆ) |2๐ฅ = 3๐ฆ}
(iv) ๐ท = {(๐ฅ, ๐ฆ) | ๐ฅ ∈ โ}
2. For each of the following, determine if the given pair of vectors ๐ฃ and ๐ค are linearly independent
or linearly dependent in the given vector space ๐:
2
2
1
(i)
๐ = โ4 , ๐ฃ = (3, 2, 2, – 1), ๐ค = (– 1, – 3 , – 3 , – 3)
(ii)
๐ = โ3 , ๐ฃ = (1, √2, 1), ๐ค = (√2, 2, √2)
(iii) ๐ = โ5 , ๐ฃ = (1, ๐, 2– ๐, 0, 3๐), ๐ค = (– ๐, 1, – 1 − 2๐, 0, 3)
(iv)
(v)
โ
๐ = ๐22
,๐ฃ=
๐
[๐
2
1
๐
],
๐ค
=
[
1
3๐
2
๐
๐
] (๐ ≠ 0, ๐ ≠ ๐)
3
๐ = {๐๐ฅ 2 + ๐๐ฅ + ๐ | ๐, ๐, ๐ ∈ โ}, ๐ฃ = ๐ฅ, ๐ค = ๐ฅ 2
LEVEL 2
3. Let 𝔽 be a field. Prove that 𝔽ⁿ is a vector space over 𝔽.
4. Let V be a vector space over 𝔽. Prove each of the following:
(i) For every v ∈ V, –(–v) = v.
(ii) For every v ∈ V, 0v = 0.
(iii) For every k ∈ 𝔽, k ⋅ 0 = 0.
(iv) For every v ∈ V, –1v = –v.
LEVEL 3
5. Let V be a vector space over a field 𝔽 and let X be a set of subspaces of V. Prove that ⋂X is a
subspace of V.
6. Prove that a finite set with at least two vectors is linearly dependent if and only if one of the
vectors in the set can be written as a linear combination of the other vectors in the set.
LEVEL 4
7. Let U and W be subspaces of a vector space V. Determine necessary and sufficient conditions
for U ∪ W to be a subspace of V.
8. Give an example of vector spaces U and V with U ⊆ V such that U is closed under scalar
multiplication, but U is not a subspace of V.
LEVEL 5
9. Let S be a set of two or more linearly dependent vectors in a vector space V. Prove that there is
a vector v in the set so that span S = span S ∖ {v}.
10. Prove that a finite set of vectors S in a vector space V is a basis of V if and only if every vector
in V can be written uniquely as a linear combination of the vectors in S.
11. Let S = {v₁, v₂, …, vₘ} be a set of linearly independent vectors in a vector space V and let
T = {w₁, w₂, …, wₙ} be a set of vectors in V such that span T = V. Prove that m ≤ n.
12. Let B be a basis of a vector space V with n vectors. Prove that any other basis of V also has n
vectors.
LESSON 9 – LOGIC
LOGICAL ARGUMENTS
Statements and Substatements
In Lesson 1, we introduced propositional variables such as p, q, and r to represent the building blocks
of statements (or propositions).
We now define the set of statements a bit more formally as follows:
1. We have a list of symbols p, q, r, … called propositional variables, each of which is a statement
(these are the atomic statements).
2. Whenever φ is a statement, (¬φ) is a statement.
3. Whenever φ and ψ are statements, (φ ∧ ψ), (φ ∨ ψ), (φ → ψ), and (φ ↔ ψ) are statements.
Notes: (1) For easier readability, we will always drop the outermost pair of parentheses. For example,
we will write (p ∧ q) as p ∧ q, and we will write (p → (q ∨ r)) as p → (q ∨ r).
(2) Also, for easier readability, we will often drop the parentheses around (¬φ) to get ¬φ. For
example, we will write (p ∧ (¬q)) as p ∧ ¬q. Notice that we dropped the outermost pair of
parentheses to get p ∧ (¬q), and then we dropped the parentheses around ¬q.
(3) When we apply the negation symbol two or more times in a row, we will not drop parentheses. For
example, (¬(¬φ)) will be written as ¬(¬φ) and not as ¬¬φ.
(4) φ is called a substatement of (¬φ). For example, p is a substatement of ¬p (¬p is the abbreviated
version of (¬p)). Similarly, φ and ψ are substatements of (φ ∧ ψ), (φ ∨ ψ), (φ → ψ), and (φ ↔ ψ).
For example, p and q are both substatements of p ↔ q. Also, if φ is a substatement of ψ and ψ is a
substatement of τ, then we will consider φ to be a substatement of τ. For example, p is a substatement
of ¬(¬p) because p is a substatement of ¬p and ¬p is a substatement of ¬(¬p).
(5) Although we are abbreviating statements by eliminating parentheses, it is important to realize that
those parentheses are there. If we were to use ∧ to form a new statement from p → q and r, it would
be incorrect to write p → q ∧ r. This expression is meaningless, as we do not know whether to apply
p → q or q ∧ r first. The correct expression is (p → q) ∧ r. This is now an acceptable abbreviation for
the statement ((p → q) ∧ r).
Notice that p, q, r, and p → q are all substatements of (p → q) ∧ r, whereas q ∧ r is not a
substatement of (p → q) ∧ r.
Example 9.1: Let p, q, and r be propositional variables. Then we have the following:
1. p, q, and r are statements.
2. (p → q) is a statement (by 3 above). Using Note 1, we will abbreviate this statement as p → q.
p and q are both substatements of p → q.
Example 9.2: Let’s find the substatements of ((๐ → ๐) ∨ ¬๐) ↔ ¬(๐ ∧ ๐) .
Solution: The substatements are ๐, ๐, ๐, ¬๐, ๐ → ๐, (๐ → ๐) ∨ ¬๐, ๐ ∧ ๐, and ¬(๐ ∧ ๐).
Note: The given statement is an abbreviation for(((๐ → ๐) ∨ (¬๐)) ↔ (¬(๐ ∧ ๐))). This is much
harder to read, and shows why we like to use abbreviations.
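The recursive definition of statements translates naturally into a small tree data type, and the substatements of a statement are then the nodes below the root. The following Python sketch is illustrative only: the class and function names are hypothetical, and the statement built at the end is one reading of the statement in Example 9.2, namely ((p → q) ∨ ¬r) ↔ ¬(q ∧ r). The choice of variables inside the final conjunction is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Var:
    """An atomic statement (propositional variable)."""
    name: str

    def __str__(self):
        return self.name


@dataclass(frozen=True)
class Not:
    """The statement (¬φ) built from a statement φ."""
    arg: object

    def __str__(self):
        return f"(¬{self.arg})"


@dataclass(frozen=True)
class Bin:
    """A statement (φ op ψ), where op is one of ∧, ∨, →, ↔."""
    op: str
    left: object
    right: object

    def __str__(self):
        return f"({self.left} {self.op} {self.right})"


def substatements(s):
    """All substatements of s, including substatements of substatements."""
    if isinstance(s, Var):
        return set()
    parts = [s.arg] if isinstance(s, Not) else [s.left, s.right]
    found = set(parts)
    for part in parts:
        found |= substatements(part)
    return found


p, q, r = Var("p"), Var("q"), Var("r")
stmt = Bin("↔", Bin("∨", Bin("→", p, q), Not(r)), Not(Bin("∧", q, r)))
subs = substatements(stmt)

print(stmt)  # the fully parenthesized form
for s in sorted(subs, key=lambda t: len(str(t))):
    print(s)
```

Printing the statement shows every pair of parentheses that the abbreviation conventions allow us to drop, and the set of substatements has eight members, matching the count in the solution.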
Logical Equivalence
Let φ and ψ be statements. We say that φ and ψ are logically equivalent, written φ ≡ ψ, if every truth
assignment of the propositional variables appearing in either φ or ψ (or both) leads to the same truth
value for both statements.
Example 9.3: Let p be a propositional variable, let φ = p, and let ψ = ¬(¬p). If p ≡ T, then φ ≡ T
and ψ ≡ ¬(¬T) ≡ ¬F ≡ T. If p ≡ F, then φ ≡ F and ψ ≡ ¬(¬F) ≡ ¬T ≡ F. So, both possible truth
assignments of p lead to the same truth value for φ and ψ. It follows that φ ≡ ψ (φ and ψ are logically
equivalent).
Notes: (1) One way to determine if two statements φ and ψ are logically equivalent is to draw the truth
table for each statement. We would generally put all the information into a single table. If the columns
corresponding to φ and ψ are a perfect match, then φ ≡ ψ.
Here is a truth table with columns for φ = p and ψ = ¬(¬p).
p    ¬p   ¬(¬p)
T    F    T
F    T    F
Observe that the first column gives the truth values for φ, the third column gives the truth values for
ψ, and both these columns are identical. It follows that φ ≡ ψ.
(2) The logical equivalence p ≡ ¬(¬p) is called the law of double negation.
Example 9.4: Let p and q be propositional variables, let φ = ¬(p ∧ q), and let ψ = ¬p ∨ ¬q. If p ≡ F
or q ≡ F, then φ ≡ ¬F ≡ T and ψ ≡ T (because ¬p ≡ T or ¬q ≡ T). If p ≡ T and q ≡ T, then
φ ≡ ¬T ≡ F and ψ ≡ F ∨ F ≡ F. So, all four possible truth assignments of p and q lead to the same
truth value for φ and ψ. It follows that φ ≡ ψ.
Notes: (1) Here is a truth table with columns for φ = ¬(p ∧ q) and ψ = ¬p ∨ ¬q.
p    q    ¬p   ¬q   p ∧ q   ¬(p ∧ q)   ¬p ∨ ¬q
T    T    F    F    T       F          F
T    F    F    T    F       T          T
F    T    T    F    F       T          T
F    F    T    T    F       T          T
Observe that the sixth column gives the truth values for φ, the seventh column gives the truth values
for ψ, and both these columns are identical. It follows that φ ≡ ψ.
(2) The logical equivalence ¬(p ∧ q) ≡ ¬p ∨ ¬q is one of De Morgan’s laws.
(3) There are two De Morgan’s laws. The second one is ¬(p ∨ q) ≡ ¬p ∧ ¬q. I leave it to the reader
to verify this equivalence.
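Comparing truth-table columns is mechanical, so it is easy to automate. The sketch below is one possible way to do it in Python, with hypothetical names: each statement is represented as a function of its propositional variables, and two statements are declared logically equivalent when they agree under every truth assignment.

```python
from itertools import product


def equivalent(phi, psi, num_vars):
    """True when phi and psi have the same truth value under every truth assignment."""
    return all(
        phi(*vals) == psi(*vals)
        for vals in product([True, False], repeat=num_vars)
    )


# Law of double negation: p ≡ ¬(¬p)
double_negation = equivalent(lambda p: p, lambda p: not (not p), 1)

# De Morgan's laws
de_morgan_1 = equivalent(
    lambda p, q: not (p and q), lambda p, q: (not p) or (not q), 2
)
de_morgan_2 = equivalent(
    lambda p, q: not (p or q), lambda p, q: (not p) and (not q), 2
)

# A tempting but incorrect "law": ¬(p ∧ q) versus ¬p ∧ ¬q
not_a_law = equivalent(
    lambda p, q: not (p and q), lambda p, q: (not p) and (not q), 2
)

print(double_negation, de_morgan_1, de_morgan_2, not_a_law)
```

The last check fails because the two statements differ when exactly one of p and q is true, which is why the connective must flip when a negation is distributed.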
List 9.1: Here is a list of some useful logical equivalences. The reader should verify each of these by
drawing a truth table or by using arguments similar to those used in Examples 9.3 and 9.4 (see Problem
2 below).
1. Law of double negation: p ≡ ¬(¬p)
2. De Morgan’s laws:
¬(p ∧ q) ≡ ¬p ∨ ¬q
¬(p ∨ q) ≡ ¬p ∧ ¬q
3. Commutative laws:
p ∧ q ≡ q ∧ p
p ∨ q ≡ q ∨ p
4. Associative laws:
(p ∧ q) ∧ r ≡ p ∧ (q ∧ r)
(p ∨ q) ∨ r ≡ p ∨ (q ∨ r)
5. Distributive laws:
p ∧ (q ∨ r) ≡ (p ∧ q) ∨ (p ∧ r)
p ∨ (q ∧ r) ≡ (p ∨ q) ∧ (p ∨ r)
6. Identity laws:
p ∧ T ≡ p        p ∧ F ≡ F
p ∨ T ≡ T        p ∨ F ≡ p
7. Negation laws:
p ∧ ¬p ≡ F
p ∨ ¬p ≡ T
8. Redundancy laws:
p ∧ p ≡ p
p ∨ p ≡ p
9. Absorption laws:
(p ∨ q) ∧ p ≡ p
(p ∧ q) ∨ p ≡ p
10. Law of the conditional: p → q ≡ ¬p ∨ q
11. Law of the contrapositive: p → q ≡ ¬q → ¬p
12. Law of the biconditional: p ↔ q ≡ (p → q) ∧ (q → p)
Notes: (1) Although this is a fairly long list of laws, a lot of it is quite intuitive. For example, in English
the word “and” is commutative. If the statement “I have a cat and I have a dog” is true, then the
statement “I have a dog and I have a cat” is also true. So, it’s easy to see that p ∧ q ≡ q ∧ p (the first
law in 3 above). As another example, the statement “I have a cat and I do not have a cat” could never
be true. So, it’s easy to see that p ∧ ¬p ≡ F (the first law in 7 above).
(2) The law of the conditional allows us to replace the conditional statement p → q by the more
intuitive statement ¬p ∨ q. We can think of the conditional statement p → q as having the hypothesis
(or premise or assumption) p and the conclusion q. The disjunctive form ¬p ∨ q tells us quite explicitly
that a conditional statement is true if and only if the hypothesis p is false or the conclusion q is true.
(3) A statement that has truth value T for all truth assignments of the propositional variables is called
a tautology. A statement that has truth value F for all truth assignments of the propositional variables
is called a contradiction.
In laws 6 and 7 above, we can replace T by any tautology and F by any contradiction, and the law still
holds. For example, since p ↔ ¬p is a contradiction, by the fourth identity law, q ∨ (p ↔ ¬p) ≡ q.
(4) It’s worth observing that if φ and ψ are statements, then φ ≡ ψ if and only if φ ↔ ψ is a tautology.
This follows from the fact that φ ↔ ψ ≡ T if and only if φ and ψ have the same truth value.
For example, (p → q) ↔ (¬q → ¬p) is a tautology. This follows from the law of the contrapositive and
the remark in the last paragraph. Let’s look at the complete truth table for this example.
p    q    ¬p   ¬q   p → q   ¬q → ¬p   (p → q) ↔ (¬q → ¬p)
T    T    F    F    T       T         T
T    F    F    T    F       F         T
F    T    T    F    T       T         T
F    F    T    T    T       T         T
Notice how the columns for (p → q) and (¬q → ¬p) have the same truth values. So, it should be
obvious that the column for (p → q) ↔ (¬q → ¬p) will have only T’s.
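A tautology can be recognized by computing the final column of its truth table and confirming that every entry is T. The following Python sketch does this for the contrapositive example; the helper names are hypothetical, and the conditional is computed through the law of the conditional (¬p ∨ q).

```python
from itertools import product


def implies(a, b):
    """Truth value of a → b, computed as ¬a ∨ b (law of the conditional)."""
    return (not a) or b


def iff(a, b):
    """Truth value of a ↔ b: true exactly when a and b have the same truth value."""
    return a == b


def final_column(stmt, num_vars):
    """Truth values of stmt, listed in the usual row order (T before F)."""
    return [stmt(*vals) for vals in product([True, False], repeat=num_vars)]


def is_tautology(stmt, num_vars):
    return all(final_column(stmt, num_vars))


def is_contradiction(stmt, num_vars):
    return not any(final_column(stmt, num_vars))


# (p → q) ↔ (¬q → ¬p)
contrapositive_is_taut = is_tautology(
    lambda p, q: iff(implies(p, q), implies(not q, not p)), 2
)

# (p → q) ↔ (q → p) compares a conditional with its converse; not a tautology.
converse_is_taut = is_tautology(
    lambda p, q: iff(implies(p, q), implies(q, p)), 2
)

# p ↔ ¬p is a contradiction.
p_iff_not_p = is_contradiction(lambda p: iff(p, not p), 1)

for row in product([True, False], repeat=2):
    value = iff(implies(*row), implies(not row[1], not row[0]))
    print(*("T" if x else "F" for x in (*row, value)))
```

The printed rows reproduce the first two columns and the last column of the table above.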
The following three additional laws of logical equivalence will be used freely (often without mention):
1. Law of transitivity of logical equivalence: Let ๐, ๐, and ๐ be statements such that ๐ ≡ ๐ and
๐ ≡ ๐. Then ๐ ≡ ๐.
2. Law of substitution of logical equivalents: Let ๐, ๐, and ๐ be statements such that ๐ ≡ ๐ and
๐ is a substatement of ๐. Let ๐ ∗ be the sentence formed by replacing ๐ by ๐ inside of ๐. Then
๐ ∗ ≡ ๐.
3. Law of substitution of sentences: Let ๐ and ๐ be statements such that ๐ ≡ ๐, let ๐ be a
propositional variable, and let ๐ be a statement. Let ๐ ∗ and ๐ ∗ be the sentences formed by
replacing every instance of ๐ with ๐ in ๐ and ๐, respectively. Then ๐ ∗ ≡ ๐ ∗ .
Example 9.5:
1. Since ๐ ≡ ¬(¬๐) (by the law of double negation), we have ๐ ∧ ๐ ≡ ๐ ∧ ¬(¬๐). Here we have
used the law of substitution of logical equivalents with ๐ = ๐, ๐ = ¬(¬๐), ๐ = ๐ ∧ ๐, and
๐ ∗ = ๐ ∧ ¬(¬๐).
2. Let’s show that the negation of the conditional statement ๐ → ๐ is logically equivalent to the
statement ๐ ∧ ¬๐.
We have ¬(๐ → ๐) ≡ ¬(¬๐ ∨ ๐) ≡ ¬(¬๐) ∧ ¬๐ ≡ ๐ ∧ ¬๐. Here we have used the law of
substitution of logical equivalents together with the law of the conditional, the second De
Morgan’s law, the law of double negation, and the law of transitivity of logical equivalence.
3. Since ๐ → ๐ ≡ ¬๐ ∨ ๐ (by the law of the conditional), (๐ ∧ ๐) → (๐ ∨ ๐) ≡ ¬(๐ ∧ ๐) ∨ (๐ ∨ ๐).
Here we have used the law of substitution of sentences twice. We replaced the propositional
variable ๐ by the statement ๐ ∧ ๐, and then we replaced the propositional variable ๐ by the
statement ๐ ∨ ๐.
Notes: (1) If you think about the equivalence ¬(๐ → ๐) ≡ ๐ ∧ ¬๐ from part 2 of Example 9.5 for a
moment, you will realize that it makes perfect sense. Again, we can think of the conditional statement
๐ → ๐ as having the hypothesis ๐ and the conclusion ๐. We know the only way to make a conditional
statement false is to make the hypothesis true and the conclusion false.
So, to make the negation of the conditional statement true, we would do the same thing. In other
words, the negation of the conditional is true if p is true and q is false, or equivalently, if p ∧ ¬q is true.
In summary, the logical equivalence ¬(p → q) ≡ p ∧ ¬q says that a conditional statement is false if
and only if the hypothesis is true and the conclusion is false.
(2) By the second associative law, (p ∨ q) ∨ r ≡ p ∨ (q ∨ r). So, we can write p ∨ q ∨ r because
whichever way we choose to think about it (p ∨ q first or q ∨ r first), we get the same truth values.
In part 3 of Example 9.5, we saw that (p ∧ q) → (p ∨ q) ≡ ¬(p ∧ q) ∨ (p ∨ q). By our remarks in the
last paragraph, we can write ¬(p ∧ q) ∨ (p ∨ q) as ¬(p ∧ q) ∨ p ∨ q without causing any confusion.
Example 9.6: Let’s show that the statement p ∧ [(p ∧ ¬q) ∨ q] is logically equivalent to the atomic
statement p.
Solution:
p ∧ [(p ∧ ¬q) ∨ q] ≡ p ∧ [q ∨ (p ∧ ¬q)] ≡ p ∧ [(q ∨ p) ∧ (q ∨ ¬q)] ≡ p ∧ [(q ∨ p) ∧ T]
≡ p ∧ (q ∨ p) ≡ (q ∨ p) ∧ p ≡ (p ∨ q) ∧ p ≡ p
So, we see that p ∧ [(p ∧ ¬q) ∨ q] is logically equivalent to the atomic statement p.
Notes: (1) For the first equivalence, we used the second commutative law.
(2) For the second equivalence, we used the second distributive law.
(3) For the third equivalence, we used the second negation law.
(4) For the fourth equivalence, we used the first identity law.
(5) For the fifth equivalence, we used the first commutative law.
(6) For the sixth equivalence, we used the second commutative law.
(7) For the last equivalence, we used the first absorption law.
(8) We also used the law of transitivity of logical equivalence and the law of substitution of logical
equivalents several times.
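The chain of equivalences in Example 9.6 can also be double-checked by brute force. The following Python sketch compares the statement p ∧ [(p ∧ ¬q) ∨ q] with the atomic statement p under every truth assignment. The helper name `equivalent` is our own choice for illustration and is not notation from this book.

```python
from itertools import product

def equivalent(f, g, num_vars):
    """Return True if f and g have the same truth value under every assignment."""
    return all(f(*row) == g(*row)
               for row in product([True, False], repeat=num_vars))

# Left side of Example 9.6: p AND [(p AND NOT q) OR q]
lhs = lambda p, q: p and ((p and not q) or q)
# Right side: the atomic statement p
rhs = lambda p, q: p

print(equivalent(lhs, rhs, 2))
```

A check like this confirms the final result only. It does not tell us which laws justify each step, so it complements the derivation rather than replacing it.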
Validity in Sentential Logic
A logical argument or proof consists of premises (statements that we are given) and conclusions
(statements we are not given).
One way to write an argument is to list the premises and conclusions vertically with a horizontal line
separating the premises from the conclusions. If there are two premises φ and ψ, and one conclusion
τ, then the argument would look like this:
φ
ψ
--------
τ
Example 9.7: Let’s take p → q and p to be premises and q to be a conclusion. Here is the argument.
p → q
p
--------
q
A logical argument is valid if every truth assignment that makes all premises true also makes all the
conclusions true. A logical argument that is not valid is called invalid or a fallacy.
There are several ways to determine if a logical argument is valid. We will give three methods in the
next example.
Example 9.8: Let’s show that the logical argument given in Example 9.7 is valid. The premises are
p → q and p, and the conclusion is q.
p → q
p
--------
q
Solution: Let’s use a truth table to illustrate the three methods.
p    q    p → q    (p → q) ∧ p    [(p → q) ∧ p] → q
T    T      T           T                  T
T    F      F           F                  T
F    T      T           F                  T
F    F      T           F                  T
There are several ways to use this truth table to see that the logical argument is valid.
Method 1: We use only the first three columns. We look at each row where both premises (columns 1
and 3) are true. Only the first row satisfies this. Since the conclusion (column 2) is also true in the first
row, the logical argument is valid. Symbolically, we write p → q, p ⊢ q, and we say that {p → q, p}
tautologically implies q.
Method 2: We can take the conjunction of the premises, as we did in column 4. We look at each row
where this conjunction is true. Again, only the first row satisfies this. Since the conclusion (column 2) is
also true in the first row, the logical argument is valid. Symbolically, we write (p → q) ∧ p ⊢ q, and we
say that (p → q) ∧ p tautologically implies q.
Method 3: We can use the conjunction of the premises as the hypothesis of the conditional with the
appropriate conclusion, as we did in column 5. We now check that this statement is a tautology.
Symbolically, we can write ⊢ [(p → q) ∧ p] → q (this can be read “[(p → q) ∧ p] → q is a tautology”).
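The three methods can be mirrored in a few lines of Python. This is only an illustrative sketch: the names `implies`, `premises`, `conjunction`, and `method1` through `method3` are our own, and each check simply loops over the four rows of the truth table.

```python
from itertools import product

def implies(a, b):
    """Truth value of the conditional a -> b."""
    return (not a) or b

rows = list(product([True, False], repeat=2))  # all truth assignments (p, q)

premises = [lambda p, q: implies(p, q), lambda p, q: p]
conclusion = lambda p, q: q
conjunction = lambda p, q: implies(p, q) and p

# Method 1: in every row where each premise is true, the conclusion is true.
method1 = all(conclusion(p, q) for p, q in rows
              if all(f(p, q) for f in premises))
# Method 2: in every row where the conjunction of the premises is true,
# the conclusion is true.
method2 = all(conclusion(p, q) for p, q in rows if conjunction(p, q))
# Method 3: the conditional [(p -> q) and p] -> q is true in every row.
method3 = all(implies(conjunction(p, q), conclusion(p, q)) for p, q in rows)

print(method1, method2, method3)
```

All three checks agree for modus ponens, as the discussion above predicts.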
Notes: (1) A valid argument is called a rule of inference. The rule of inference in this example is called
modus ponens.
(2) We didn’t need to draw a whole truth table to verify that the argument presented here was valid.
For example, for Method 1, we could argue as follows: If p ≡ T and p → q ≡ T, then we must have
q ≡ T because if q were false, we would have p → q ≡ T → F ≡ F.
(3) p and q could be any statements here. For example, suppose p is the statement “Pigs have wings,”
and q is the statement “Pigs can fly.” Then the argument looks like this:
If pigs have wings, then they can fly
Pigs have wings
--------
Pigs can fly
This seems like a good time to point out that just because a logical argument is valid, it does not mean
that the conclusion is true. We have shown in the solution above that this argument is valid. However,
I think we can all agree that pigs cannot fly!
(4) We say that a logical argument is sound if it is valid and all the premises are true. Note 3 above
gives an example of an argument that is valid, but not sound.
Every tautology gives us at least one rule of inference.
Example 9.9: Recall the first De Morgan’s law: ¬(p ∧ q) ≡ ¬p ∨ ¬q. This law gives us the following
two rules of inference.
¬(p ∧ q)          ¬p ∨ ¬q
--------          --------
¬p ∨ ¬q           ¬(p ∧ q)
To show that an argument is invalid, we need only produce a single truth assignment that makes all the
premises true and the conclusion (or one of the conclusions) false. Such a truth assignment is called a
counterexample.
Example 9.10: The following invalid argument is called the fallacy of the converse.
p → q
--------
q → p
To see that this argument is invalid, we will find a counterexample. Here we can use the truth
assignment p ≡ F, q ≡ T. We then have that p → q ≡ F → T ≡ T and q → p ≡ T → F ≡ F.
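Finding a counterexample is a finite search, so it can be automated. The sketch below lists every truth assignment that makes all premises true and the conclusion false. The function name `counterexamples` is hypothetical, chosen for this illustration.

```python
from itertools import product

def implies(a, b):
    """Truth value of the conditional a -> b."""
    return (not a) or b

def counterexamples(premises, conclusion, num_vars):
    """List each truth assignment with all premises true and the conclusion false."""
    return [row for row in product([True, False], repeat=num_vars)
            if all(f(*row) for f in premises) and not conclusion(*row)]

# Fallacy of the converse: premise p -> q, conclusion q -> p.
found = counterexamples([lambda p, q: implies(p, q)],
                        lambda p, q: implies(q, p), 2)
print(found)  # [(False, True)], that is, p is false and q is true
```

An argument is valid exactly when this list is empty, so the same function can be used to test validity.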
Notes: (1) Consider the conditional statement p → q. The statement q → p is called the converse of
the original conditional statement. The argument in this example shows that the converse of a
conditional statement is not logically equivalent to the original conditional statement.
(2) The statement ¬p → ¬q is called the inverse of the original conditional statement. This statement
is also not logically equivalent to the original conditional statement. The reader should write down the
fallacy of the inverse and give a counterexample to show that it is invalid (as we did above for the
converse).
(3) The statement ¬q → ¬p is called the contrapositive of the original conditional statement. By the
law of the contrapositive, this statement is logically equivalent to the original conditional statement.
The reader should write down the law of the contrapositive as a rule of inference, as was done for the
first De Morgan’s law in Example 9.9.
List 9.2: Here is a list of some useful rules of inference that do not come from logical equivalences. The
reader should verify that each of the logical arguments given here is valid (see Problem 6 below).
Rule of inference             Premises                        Conclusion
Modus Ponens                  p → q,  p                       q
Modus Tollens                 p → q,  ¬q                      ¬p
Disjunctive Syllogism         p ∨ q,  ¬p                      q
Hypothetical Syllogism        p → q,  q → r                   p → r
Conjunctive Introduction      p,  q                           p ∧ q
Disjunctive Introduction      p                               p ∨ q
Biconditional Introduction    p → q,  q → p                   p ↔ q
Constructive Dilemma          p → q,  r → s,  p ∨ r           q ∨ s
Conjunctive Elimination       p ∧ q                           p
Disjunctive Resolution        p ∨ q,  ¬p ∨ r                  q ∨ r
Biconditional Elimination     p ↔ q                           p → q
Destructive Dilemma           p → q,  r → s,  ¬q ∨ ¬s         ¬p ∨ ¬r
A derivation is a valid logical argument such that each conclusion follows from the premises and
conclusions above it using a rule of inference.
When creating a derivation, we will label each premise and conclusion with a number and state the
rule of inference and numbers that are used to derive each conclusion.
Example 9.11: Let’s give a derivation of the following logical argument.
¬p
¬p → ¬q
--------
¬q ∨ r
Solution:
1   ¬p             Premise
2   ¬p → ¬q        Premise
--------
3   ¬q             Modus ponens (2, 1)
4   ¬q ∨ r         Disjunctive introduction (3)
Notes: (1) We started by listing the premises above the line.
(2) If we let φ = ¬p and ψ = ¬q, then by modus ponens, we have φ → ψ, φ ⊢ ψ. So, we can write
ψ ≡ ¬q as the third line of the derivation. We applied modus ponens to the sentences in lines 2 and 1
to derive ¬q.
(3) If we let φ = ¬q, then by disjunctive introduction, we have φ ∨ r = ¬q ∨ r. So, we can write
¬q ∨ r as the fourth line of the derivation. We applied disjunctive introduction to the sentence in line
3 to derive ¬q ∨ r.
Example 9.12: Let’s determine if the following logical argument is valid.
If cats hiss and purr, then dogs can talk.
Cats hiss.
Dogs cannot talk.
Therefore, cats do not purr.
Solution: Let h represent “Cats hiss,” let p represent “Cats purr,” and let t represent “Dogs can talk.”
We now give a derivation showing that the argument is valid.
1   (h ∧ p) → t     Premise
2   h               Premise
3   ¬t              Premise
--------
4   ¬(h ∧ p)        Modus tollens (1, 3)
5   ¬h ∨ ¬p         De Morgan’s law (4)
6   ¬(¬h)           Law of double negation (2)
7   ¬p              Disjunctive syllogism (5, 6)
Note: The derivation in the solution above shows us that the logical argument is valid. However, notice
that the statement we derived is false. After all, cats do purr. So, although the logical argument is valid,
it is not sound (see Note 4 following Example 9.8). This means that one of the premises must be false.
Which one is it? Well, cats do hiss and dogs cannot talk. So, the false statement must be “If cats hiss
and purr, then dogs can talk.” If it’s not clear to you that this statement is false, use the law of the
conditional to rewrite it as “It is not the case that cats both hiss and purr, or dogs can talk.” Since cats
do hiss and purr, the statement “It is not the case that cats both hiss and purr” is false. Since dogs
cannot talk, the statement “Dogs can talk” is also false. Therefore, the disjunction of those two
statements is false.
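The difference between validity and soundness in Example 9.12 can be seen in a short sketch. We assume the real-world truth values discussed in the note (cats hiss, cats purr, dogs cannot talk). The variable names `valid`, `actual`, and `sound` are ours.

```python
from itertools import product

def implies(a, b):
    """Truth value of the conditional a -> b."""
    return (not a) or b

# h: "Cats hiss", p: "Cats purr", t: "Dogs can talk"
premises = [lambda h, p, t: implies(h and p, t),
            lambda h, p, t: h,
            lambda h, p, t: not t]
conclusion = lambda h, p, t: not p

# Valid: every assignment making all premises true makes the conclusion true.
valid = all(conclusion(*row)
            for row in product([True, False], repeat=3)
            if all(f(*row) for f in premises))

# Assumed real-world truth values: cats hiss and purr, dogs cannot talk.
actual = (True, True, False)
# Sound: valid, and every premise is actually true.
sound = valid and all(f(*actual) for f in premises)

print(valid, sound)  # True False
```

The argument is valid, but the first premise is false under the assumed real-world values, so the argument is not sound.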
Problem Set 9
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Let ๐ be the following statement: (๐ ∧ ¬๐) ↔ ¬[๐ ∨ (¬๐ → ๐)].
(i)
The statement ๐ is abbreviated. Write ๐ in its unabbreviated form.
(ii)
Write down all the substatements of ๐ in both abbreviated and unabbreviated form.
2. Verify all the logical equivalences given in List 9.1.
LEVEL 2
3. Let φ, ψ, and τ be statements. Prove that φ ⊢ ψ and ψ ⊢ τ implies φ ⊢ τ.
4. Let φ and ψ be statements. Prove that φ ⊢ ψ if and only if φ → ψ is a tautology.
LEVEL 3
5. Determine if each of the following statements is a tautology, a contradiction, or neither.
(i)
๐∧๐
(ii)
๐ ∧ ¬๐
(iii) (๐ ∨ ¬๐) → (๐ ∧ ¬๐)
(iv) ¬(๐ ∨ ๐) ↔ (¬๐ ∧ ¬๐)
(v)
๐ → (¬๐ ∧ ๐)
(vi) (๐ ↔ ๐) → (๐ → ๐)
6. Verify all the rules of inference given in List 9.2.
LEVEL 4
7. Determine whether each of the following logical arguments is valid or invalid. If the argument is
valid, provide a deduction. If the argument is invalid, provide a counterexample.
I
๐∨๐
๐
๐
II
¬(๐ ∧ ๐)
๐
¬๐
III
¬๐
๐∨๐
๐ → ¬๐
¬๐
IV
๐→ ๐
๐ → ¬๐
๐→๐
8. Simplify each statement.
(i)
๐ ∨ (๐ ∧ ¬๐)
(ii)
(๐ ∧ ๐) ∨ ¬๐
(iii) ¬๐ → (¬๐ → ๐)
(iv) (๐ ∧ ¬๐) ∨ ๐
(v)
[(๐ ∧ ๐) ∨ ๐] ∧ [(๐ ∨ ๐) ∧ ๐]
LEVEL 5
9. Determine if the following logical argument is valid. If the argument is valid, provide a
deduction. If the argument is invalid, provide a counterexample.
If a piano has 88 keys, then the box is empty.
If a piano does not have 88 keys, then paintings are white.
If we are in immediate danger, then the box is not empty.
Therefore, paintings are white or we are not in immediate danger.
10. Determine if the following logical argument is valid. If the argument is valid, provide a
deduction. If the argument is invalid, provide a counterexample.
Tangs have fangs or tings have wings.
It is not the case that tangs have fangs and tings do not have wings.
It is not the case that tangs do not have fangs and tings have wings.
Therefore, tangs have fangs and either tings have wings or tangs do not have
fangs.
LESSON 10 – SET THEORY
RELATIONS AND FUNCTIONS
Relations
An unordered pair is a set with 2 elements. Recall that a set doesn’t change if we write the elements
in a different order or if we write the same element multiple times. For example, {0, 1} = {1, 0} and
{0, 0} = {0}.
We now define the ordered pair (x, y) in such a way that (y, x) will not be the same as (x, y). The
simplest way to define a set with this property is as follows:
(x, y) = {{x}, {x, y}}
We now show that with this definition, the ordered pair behaves as we would expect.
Theorem 10.1: (x, y) = (z, w) if and only if x = z and y = w.
Part of the proof of this theorem is a little trickier than expected. Assuming that (x, y) = (z, w), there
are actually two cases to consider: x = y and x ≠ y. If x = y, then (x, y) is a set with just one element.
Indeed, (x, x) = {{x}, {x, x}} = {{x}, {x}} = {{x}}. So, the only element of (x, x) is {x}. Watch carefully
how this plays out in the proof.
Proof of Theorem 10.1: First suppose that x = z and y = w. Then by direct substitution, {x} = {z} and
{x, y} = {z, w}. So, (x, y) = {{x}, {x, y}} = {{z}, {z, w}} = (z, w).
Conversely, suppose that (x, y) = (z, w). Then {{x}, {x, y}} = {{z}, {z, w}}. There are two cases to
consider.
Case 1: If x = y, then {{x}, {x, y}} = {{x}}. So, {{x}} = {{z}, {z, w}}. It follows that {z} = {x} and
{z, w} = {x}. Since {z, w} = {x}, we must have z = x and w = x. Therefore, x, y, z, and w are all equal.
In particular, x = z and y = w.
Case 2: If x ≠ y, then {x, y} is a set with two elements. So, {x, y} cannot be equal to {z} (because {z}
has just one element). Therefore, we must have {x, y} = {z, w}. It then follows that {x} = {z}. So, we
have x = z. Since x = z and {x, y} = {z, w}, we must have y = w.
□
Note: (x, y) is an abbreviation for the set {{x}, {x, y}}. In the study of Set Theory, every object can be
written as a set like this. It’s often convenient to use abbreviations, but we should always be aware
that if necessary, we can write any object in its unabbreviated form.
We can extend the idea of an ordered pair to an ordered n-tuple. An ordered 3-tuple (also called an
ordered triple) is defined by (x, y, z) = ((x, y), z), an ordered 4-tuple is (x, y, z, w) = ((x, y, z), w),
and so on.
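The set-theoretic definition of an ordered pair can be modeled directly with Python’s `frozenset`. This is a minimal sketch: the function names `pair` and `triple` are ours, and we use integers as the elements only for convenience.

```python
def pair(x, y):
    """Ordered pair (x, y) = {{x}, {x, y}}, built from immutable sets."""
    return frozenset({frozenset({x}), frozenset({x, y})})

def triple(x, y, z):
    """Ordered triple (x, y, z) = ((x, y), z)."""
    return pair(pair(x, y), z)

print(pair(0, 1) == pair(1, 0))  # False: the order matters
print(len(pair(5, 5)))           # 1: (x, x) collapses to {{x}}
```

The second line of output reflects the case x = y treated in the proof of Theorem 10.1: the pair (x, x) is a set with just one element.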
Example 10.1: Let’s write the ordered triple (x, y, z) in its unabbreviated form (take a deep breath!).
(x, y, z) = ((x, y), z) = {{(x, y)}, {(x, y), z}} = {{{{x}, {x, y}}}, {{{x}, {x, y}}, z}}
The Cartesian product of the sets A and B, written A × B, is the set of ordered pairs (a, b) with a ∈ A
and b ∈ B. Symbolically, we have
A × B = {(a, b) | a ∈ A ∧ b ∈ B}.
Example 10.2:
1. Let A = {0, 1, 2} and B = {a, b}. Then A × B = {(0, a), (0, b), (1, a), (1, b), (2, a), (2, b)}.
2. Let C = ∅ and D = {a, b, c, d}. Then C × D = ∅.
3. Let ๐ธ = {∅} and ๐น = {Δ, โ}. Then ๐ธ × ๐น = {(∅, Δ), (∅,โ)}.
We can extend the definition of the Cartesian product to more than two sets in the obvious way:
A × B × C = {(a, b, c) | a ∈ A ∧ b ∈ B ∧ c ∈ C}
A × B × C × D = {(a, b, c, d) | a ∈ A ∧ b ∈ B ∧ c ∈ C ∧ d ∈ D}
Example 10.3:
1. {๐} × {1} × {Δ} × {∝} = {(๐, 1, Δ, ∝)}
2. {0} × {0, 1} × {1} × {0, 1} × {0} = {(0, 0, 1, 0, 0), (0, 0, 1, 1, 0), (0, 1, 1, 0, 0), (0, 1, 1, 1, 0)}
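For finite sets, Cartesian products can be computed with `itertools.product` from the Python standard library. In this sketch, Python tuples stand in for ordered n-tuples (they are flat rather than nested pairs), and the function name `cartesian` is our own.

```python
from itertools import product

def cartesian(*sets):
    """Cartesian product of finitely many finite sets, as a set of tuples."""
    return set(product(*sets))

A = {0, 1, 2}
B = {"a", "b"}
print(len(cartesian(A, B)))   # 6 ordered pairs
print(cartesian(set(), B))    # set(): a product with the empty set is empty
print(sorted(cartesian({0}, {0, 1}, {1}, {0, 1}, {0})))
```

The last line reproduces the four 5-tuples listed in part 2 of Example 10.3.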
We abbreviate Cartesian products of sets with themselves using exponents.
A² = A × A
A³ = A × A × A
A⁴ = A × A × A × A
Example 10.4:
1. ℝ² = ℝ × ℝ = {(x, y) | x, y ∈ ℝ} is the set of ordered pairs of real numbers.
2. ℕ⁵ = ℕ × ℕ × ℕ × ℕ × ℕ = {(a, b, c, d, e) | a, b, c, d, e ∈ ℕ} is the set of ordered 5-tuples of
natural numbers.
3. {0, 1}² = {0, 1} × {0, 1} = {(0, 0), (0, 1), (1, 0), (1, 1)}.
A binary relation on a set A is a subset of A² = A × A. Symbolically, we have
R is a binary relation on A if and only if R ⊆ A × A.
We will usually abbreviate (a, b) ∈ R as aRb.
Example 10.5:
1. Let R = {(a, b) ∈ ℕ × ℕ | a < b}. For example, we have (0, 1) ∈ R because 0 < 1. However,
(1, 1) ∉ R because 1 ≮ 1. We abbreviate (0, 1) ∈ R by 0R1.
Observe that R ⊆ ℕ × ℕ, and so, R is a binary relation on ℕ.
We would normally use the name < for this relation R. So, we have (0, 1) ∈ <, which we
abbreviate as 0 < 1, and we have (1, 1) ∉ <, which we abbreviate as 1 ≮ 1.
2. There are binary relations <, ≤, >, ≥ defined on ℕ, ℤ, ℚ, and ℝ. For example, if we consider
> ⊆ ℚ², we have (17/2, –3/5) ∈ >, or equivalently, 17/2 > –3/5.
3. Let R = {((a, b), (c, d)) ∈ (ℤ × ℤ*)² | ad = bc}. Then R is a binary relation on ℤ × ℤ*. For
example, (1, 2)R(2, 4) because 1 ⋅ 4 = 2 ⋅ 2. However, (1, 2) is not related to (2, 5) by R because
1 ⋅ 5 ≠ 2 ⋅ 2. Compare this to the rational number system where we have 1/2 = 2/4 because
1 ⋅ 4 = 2 ⋅ 2, but 1/2 ≠ 2/5 because 1 ⋅ 5 ≠ 2 ⋅ 2.
We say that a binary relation R on A is
• reflexive if for all a ∈ A, (a, a) ∈ R.
• symmetric if for all a, b ∈ A, (a, b) ∈ R implies (b, a) ∈ R.
• transitive if for all a, b, c ∈ A, (a, b), (b, c) ∈ R implies (a, c) ∈ R.
• antireflexive if for all a ∈ A, (a, a) ∉ R.
• antisymmetric if for all a, b ∈ A, (a, b) ∈ R and (b, a) ∈ R implies a = b.
Example 10.6:
1. Let A be any set, and let R = {(a, b) ∈ A² | a = b}. Then R is reflexive (a = a), symmetric (if
a = b, then b = a), transitive (if a = b and b = c, then a = c), and antisymmetric (trivially). If
A ≠ ∅, then this relation is not antireflexive because a ≠ a is false for any a ∈ A.
2. The binary relations ≤ and ≥ defined in the usual way on ℤ are transitive (if a ≤ b and b ≤ c,
then a ≤ c, and similarly for ≥), reflexive (a ≤ a and a ≥ a), and antisymmetric (if a ≤ b and
b ≤ a, then a = b, and similarly for ≥). These relations are not symmetric (for example, 1 ≤ 2,
but 2 ≰ 1). These relations are not antireflexive (for example, 1 ≤ 1 is true).
Any relation that is transitive, reflexive, and antisymmetric is called a partial ordering.
3. The binary relations < and > defined on ℤ are transitive (if a < b and b < c, then a < c, and
similarly for >), antireflexive (a ≮ a and a ≯ a), and antisymmetric (this is vacuously true
because a < b and b < a can never occur). These relations are not symmetric (for example,
1 < 2, but 2 ≮ 1). These relations are not reflexive (for example, 1 < 1 is false).
Any relation that is transitive, antireflexive, and antisymmetric is called a strict partial ordering.
4. Let R = {(0, 0), (0, 2), (2, 0), (2, 2), (2, 3), (3, 2), (3, 3)} be a relation on ℕ. Then it is easy to
see that R is symmetric. R is not reflexive because 1 ∈ ℕ, but (1, 1) ∉ R (however, if we were
to consider R as a relation on {0, 2, 3} instead of on ℕ, then R would be reflexive). R is not
transitive because we have (0, 2), (2, 3) ∈ R, but (0, 3) ∉ R. R is not antisymmetric because
we have (2, 3), (3, 2) ∈ R and 2 ≠ 3. R is not antireflexive because (0, 0) ∈ R.
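For a finite relation, each of the five properties above can be checked mechanically. The sketch below represents a relation as a Python set of pairs and tests the relation from part 4 of Example 10.6. The function names are ours, and the finite set {0, 1, 2, 3} is used as a stand-in for ℕ when checking reflexivity.

```python
def is_reflexive(R, A):
    return all((a, a) in R for a in A)

def is_antireflexive(R, A):
    return all((a, a) not in R for a in A)

def is_symmetric(R):
    return all((b, a) in R for (a, b) in R)

def is_antisymmetric(R):
    return all(a == b for (a, b) in R if (b, a) in R)

def is_transitive(R):
    # Whenever (a, b) and (b, d) are both in R, (a, d) must be in R.
    return all((a, d) in R for (a, b) in R for (c, d) in R if b == c)

R = {(0, 0), (0, 2), (2, 0), (2, 2), (2, 3), (3, 2), (3, 3)}

print(is_symmetric(R))                # True
print(is_reflexive(R, {0, 2, 3}))     # True
print(is_reflexive(R, {0, 1, 2, 3}))  # False: (1, 1) is missing
print(is_transitive(R))               # False: (0, 2), (2, 3) in R, (0, 3) not
print(is_antisymmetric(R))            # False: (2, 3), (3, 2) in R and 2 != 3
print(is_antireflexive(R, {0, 2, 3})) # False: (0, 0) is in R
```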
We can extend the idea of a binary relation on a set A to an n-ary relation on A. For example, a 3-ary
relation (or ternary relation) on A is a subset of A³ = A × A × A. More generally, we have that R is an
n-ary relation on A if and only if R ⊆ Aⁿ. A 1-ary relation (or unary relation) on A is just a subset of A.
Example 10.7: Let R = {(x, y, z) ∈ ℤ³ | x + y = z}. Then R is a ternary (or 3-ary) relation on ℤ. We
have, for example, (1, 2, 3) ∈ R (because 1 + 2 = 3) and (1, 2, 4) ∉ R (because 1 + 2 ≠ 4).
Equivalence Relations and Partitions
A binary relation R on a set A is an equivalence relation if R is reflexive, symmetric, and transitive.
Example 10.8:
1. The most basic equivalence relation on a set A is the relation R = {(a, b) ∈ A² | a = b} (the
equality relation). We already saw in part 1 of Example 10.6 that this relation is reflexive,
symmetric, and transitive.
2. Another trivial equivalence relation on a set A is the set A². Since every ordered pair
(a, b) is in A², reflexivity, symmetry, and transitivity can never fail.
3. We say that integers a and b have the same parity if they are both even or both odd. Define ≡₂
on ℤ by ≡₂ = {(a, b) ∈ ℤ² | a and b have the same parity}. It is easy to see that ≡₂ is reflexive
(a ≡₂ a because every integer has the same parity as itself), ≡₂ is symmetric (if a ≡₂ b, then a
has the same parity as b, so b has the same parity as a, and therefore, b ≡₂ a), and ≡₂ is
transitive (if a ≡₂ b and b ≡₂ c, then a, b, and c all have the same parity, and so, a ≡₂ c).
Therefore, ≡₂ is an equivalence relation.
Another way to say that a and b have the same parity is to say that b − a is divisible by 2, or
equivalently, 2 | b − a (see Lesson 4). This observation allows us to generalize the notion of
having the same parity. For example, ≡₃ = {(a, b) ∈ ℤ² | 3 | b − a} is an equivalence relation,
and more generally, for each n ∈ ℤ⁺, ≡ₙ = {(a, b) ∈ ℤ² | n | b − a} is an equivalence relation. I
leave the proof that ≡ₙ is reflexive, symmetric, and transitive on ℤ as an exercise (see Problem
4 in the problem set below).
4. Consider the relation R = {((a, b), (c, d)) ∈ (ℤ × ℤ*)² | ad = bc} defined in part 3 of Example
10.5. Since ab = ba, we see that (a, b)R(a, b), and therefore, R is reflexive. If (a, b)R(c, d),
then ad = bc. Therefore, cb = da, and so, (c, d)R(a, b). Thus, R is symmetric. Finally, suppose
that (a, b)R(c, d) and (c, d)R(e, f). Then ad = bc and cf = de. So, adcf = bcde. Using the
fact that (ℤ, +, ⋅) is a commutative ring, we get cd(af − be) = adcf − bcde = 0. If a = 0,
then bc = 0, and so, c = 0 (because b ≠ 0). So, de = 0, and therefore, e = 0 (because d ≠ 0).
So, af = be (because they’re both 0). If a ≠ 0, then c ≠ 0 (because ad = bc and d ≠ 0). Since
d ≠ 0 as well, cd ≠ 0. Therefore, af − be = 0, and so, af = be. Since a = 0 and a ≠ 0 both
lead to af = be, we have (a, b)R(e, f). So, R is transitive. Since R is reflexive, symmetric, and
transitive, it follows that R is an equivalence relation.
Recall: (1) If X is a nonempty set of sets, we say that X is pairwise disjoint if for all A, B ∈ X with
A ≠ B, A and B are disjoint (A ∩ B = ∅).
(2) If X is a nonempty set of sets, then union X is defined by ⋃X = {y | there is Y ∈ X with y ∈ Y}.
A partition of a set S is a set of pairwise disjoint nonempty subsets of S whose union is S. Symbolically,
X is a partition of S if and only if
∀A ∈ X(A ≠ ∅ ∧ A ⊆ S) ∧ ∀A, B ∈ X(A ≠ B → A ∩ B = ∅) ∧ ⋃X = S.
Example 10.9:
1. Let 𝔼 = {2k | k ∈ ℤ} be the set of even integers and let 𝕆 = {2k + 1 | k ∈ ℤ} be the set of odd
integers. Then X = {𝔼, 𝕆} is a partition of ℤ. We can visualize this partition as follows:
ℤ = {…, –4, –2, 0, 2, 4, …} ∪ {…, –3, –1, 1, 3, 5, …}
2. Let A = {3k | k ∈ ℤ}, B = {3k + 1 | k ∈ ℤ}, and C = {3k + 2 | k ∈ ℤ}. Then X = {A, B, C} is a
partition of ℤ. A rigorous proof of this requires results similar to those given in Example 4.7 and
the notes following (or you can wait for the Division Algorithm, which will be presented in
Lesson 12). We can visualize this partition as follows:
ℤ = {…, –6, –3, 0, 3, 6, …} ∪ {…, –5, –2, 1, 4, 7, …} ∪ {…, –4, –1, 2, 5, 8, …}
3. For each n ∈ ℤ, let Aₙ = (n, n + 1]. Then X = {Aₙ | n ∈ ℤ} is a partition of ℝ. We can visualize
this partition as follows:
ℝ = ⋯ ∪ (−2, −1] ∪ (−1, 0] ∪ (0, 1] ∪ (1, 2] ∪ (2, 3] ∪ ⋯
4. For each r ∈ ℝ, let Aᵣ = {r + bi | b ∈ ℝ}. Then X = {Aᵣ | r ∈ ℝ} is a partition of ℂ.
5. The only partition of ∅ is ∅.
6. The only partition of the one element set {a} is {{a}}.
7. The partitions of the two element set {a, b} with a ≠ b are {{a}, {b}} and {{a, b}}.
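For finite sets, the three conditions in the definition of a partition can be tested directly. In the sketch below, a candidate partition is a Python set of frozensets, and the function name `is_partition` is our own. Since ℤ is infinite, we test the even/odd partition on a finite window of integers only, which is an assumption made purely for illustration.

```python
def is_partition(X, S):
    """Check that X is a set of pairwise disjoint nonempty subsets of S with union S."""
    blocks = list(X)
    nonempty_subsets = all(len(A) > 0 and A <= S for A in blocks)
    pairwise_disjoint = all(blocks[i].isdisjoint(blocks[j])
                            for i in range(len(blocks))
                            for j in range(i + 1, len(blocks)))
    covers = set().union(*blocks) == S
    return nonempty_subsets and pairwise_disjoint and covers

S = set(range(-6, 7))                       # finite window of the integers
evens = frozenset(n for n in S if n % 2 == 0)
odds = frozenset(n for n in S if n % 2 == 1)

print(is_partition({evens, odds}, S))                       # True
print(is_partition({frozenset({1, 2}), frozenset({2, 3})},
                   {1, 2, 3}))                              # False: blocks overlap
print(is_partition(set(), set()))                           # True: part 5 above
```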
We will now explore the relationship between equivalence relations and partitions. Let’s begin with an
example.
Example 10.10: Consider the equivalence relation ≡₂ from part 3 of Example 10.8, defined by a ≡₂ b
if and only if a and b have the same parity, and the partition {𝔼, 𝕆} of ℤ from part 1 of Example 10.9.
For this partition, we are thinking of ℤ as the union of the even and odd integers:
ℤ = {…, –4, –2, 0, 2, 4, …} ∪ {…, –3, –1, 1, 3, 5, …}
Observe that a and b are in the same member of the partition if and only if a ≡₂ b. For example,
–8 ≡₂ 4 and –8, 4 ∈ 𝔼, whereas –8 ≢₂ 3 and –8 ∈ 𝔼, 3 ∈ 𝕆. In fact, 𝔼 = {n ∈ ℤ | n ≡₂ 0} and
𝕆 = {n ∈ ℤ | n ≡₂ 1}. We call 𝔼 the equivalence class of 0 and we call 𝕆 the equivalence class of 1.
Let ~ be an equivalence relation on a set S. If x ∈ S, the equivalence class of x, written [x], is the set
[x] = {y ∈ S | x ~ y}.
Example 10.10 continued: We have [0] = {y ∈ ℤ | 0 ≡₂ y} = 𝔼. Observe that [2] = [0], and in fact, if
n is any even integer, then [n] = [0] = 𝔼. Similarly, if n is any odd integer, then [n] = [1] = 𝕆.
Example 10.11: Recall that the power set of A, written 𝒫(A), is the set consisting of all subsets of A.
𝒫(A) = {X | X ⊆ A}
For example, if A = {a, b, c}, then 𝒫(A) = {∅, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c}}. We can define
a binary relation ~ on 𝒫(A) by X ~ Y if and only if |X| = |Y| (X and Y have the same number of
elements). It is easy to see that ~ is an equivalence relation on 𝒫(A). There are four equivalence
classes.
[∅] = {∅}
[{a}] = {{a}, {b}, {c}}
[{a, b}] = {{a, b}, {a, c}, {b, c}}
[{a, b, c}] = {{a, b, c}}
Notes: (1) {a} ~ {b} ~ {c} because each of these sets has one element. It follows that {a}, {b}, and {c}
are all in the same equivalence class. Above, we chose to use {a} as the representative for this
equivalence class. This is an arbitrary choice. In fact, [{a}] = [{b}] = [{c}].
Similarly, [{a, b}] = [{a, c}] = [{b, c}].
(2) The empty set is the only subset of A with 0 elements. Therefore, the equivalence class of ∅ contains
only itself. Similarly, the equivalence class of A = {a, b, c} contains only itself.
(3) Notice that the four equivalence classes are pairwise disjoint, nonempty, and their union is 𝒫(A).
In other words, the equivalence classes form a partition of 𝒫(A).
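The equivalence classes in Example 10.11 can be computed by grouping the subsets of a three-element set by size. This is a sketch for finite sets only. The function names `power_set` and `equivalence_classes` are ours, and the strings "a", "b", "c" stand in for the elements of the set.

```python
from itertools import combinations

def power_set(A):
    """Set of all subsets of the finite set A, each stored as a frozenset."""
    items = sorted(A)
    return {frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)}

def equivalence_classes(S, related):
    """The set of classes [x] = {y in S | x ~ y}, one for each x in S."""
    return {frozenset(y for y in S if related(x, y)) for x in S}

P = power_set({"a", "b", "c"})
classes = equivalence_classes(P, lambda X, Y: len(X) == len(Y))

print(len(P))                            # 8 subsets
print(sorted(len(c) for c in classes))   # [1, 1, 3, 3]: sizes of the four classes
```

Because equal classes collapse to a single element of a Python set, different representatives of the same class are counted only once.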
Theorem 10.2: Let P be a partition of a set S. Then there is an equivalence relation ~ on S for which
the elements of P are the equivalence classes of ~. Conversely, if ~ is an equivalence relation on a set
S, then the equivalence classes of ~ form a partition of S.
You will be asked to prove Theorem 10.2 in Problem 17 below.
Important note: We will sometimes want to define relations or operations on equivalence classes.
When we do this, we must be careful that what we are defining is well-defined. For example, consider
the equivalence relation ≡₂ on ℤ, and let X = {[0], [1]} be the set of equivalence classes.
Let’s attempt to define a relation on X by [x]R[y] if and only if x < y. Is [0]R[1] true? It looks like it is
because 0 < 1. But this isn’t the end of the story. Since [0] = [2], if [0]R[1], then we must also have
[2]R[1] (by a direct substitution). But 2 ≮ 1! So, [2]R[1] is false. To summarize, [0]R[1] should be true
and [2]R[1] should be false, but [0]R[1] and [2]R[1] represent the same statement. So, R is not a
well-defined relation on X.
As another example, let’s attempt to define an operation +: X × X → X by [x] + [y] = [x + y]. This is
a well-defined operation. We proved in Theorem 4.1 from Lesson 4 that the sum of two even integers
is even. Similar arguments can be used to show that the sum of two odd integers is even and that the
sum of an odd integer and an even integer is odd. These results can now be used to show that the
operation + is well-defined. For example, if [x] = [0] and [y] = [0], then x and y are even. By Theorem
4.1, it follows that x + y is even. So, [x + y] = [0]. Since [0 + 0] = [0], [x] + [y] = [0] + [0]. The
reader should check the other three cases to finish verifying that + is well-defined on X.
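The idea of being well-defined can be probed numerically: a definition on equivalence classes is acceptable only if changing representatives never changes the answer. The sketch below tests both attempted definitions on a finite sample of integers. The helper `cls` (which represents the class [n] by the remainder n % 2) and the names `add_ok` and `less_ok` are our own. A finite sample can expose a failure, but it is only evidence, not a proof, when no failure is found.

```python
def cls(n):
    """Represent the class [n] of n under 'same parity' by the remainder n % 2."""
    return n % 2

reps = range(-6, 7)  # a finite sample of integers

# All choices of representatives with [x1] = [x2] and [y1] = [y2].
same_classes = [(x1, x2, y1, y2)
                for x1 in reps for x2 in reps for y1 in reps for y2 in reps
                if cls(x1) == cls(x2) and cls(y1) == cls(y2)]

# [x] + [y] = [x + y]: the resulting class should not depend on representatives.
add_ok = all(cls(x1 + y1) == cls(x2 + y2) for x1, x2, y1, y2 in same_classes)

# [x] R [y] if and only if x < y: the truth value should not depend on representatives.
less_ok = all((x1 < y1) == (x2 < y2) for x1, x2, y1, y2 in same_classes)

print(add_ok, less_ok)  # True False
```

The second check fails for the reason given above: 0 and 2 represent the same class, yet 0 < 1 is true while 2 < 1 is false.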
This principle applies whenever there are elements in a set that can be represented in more than one
way. Let’s take the set of rational numbers as an example. Each rational number has infinitely many
representations. For example, 1/2 = 2/4 = 3/6 = ⋯. When verifying that (ℚ, +, ⋅) is a field (see
Problems 9 and 11 from Lesson 3), were you careful to check that addition and multiplication are
well-defined on ℚ? If not, you may want to go back and do so now. Also take a look at Theorem 5.1.
Orderings
A binary relation ≤ on a set A is a partial ordering on A if ≤ is reflexive, antisymmetric, and transitive
on A. If we replace “reflexive” by “antireflexive,” then we call the relation a strict partial ordering on
A (we would normally use the symbol < instead of ≤ for a strict partial ordering).
A partially ordered set (or poset) is a pair (A, ≤), where A is a set and ≤ is a partial ordering on A.
Similarly, a strict poset is a pair (A, <), where A is a set and < is a strict partial ordering on A.
Example 10.12:
1. The usual ordering ≤ on ℤ = {…, –3, –2, –1, 0, 1, 2, 3, …} is a partial ordering, and the ordering
< on ℤ is a strict partial ordering. See Example 10.6 (parts 2 and 3).
2. If A is a set, then (𝒫(A), ⊆) is a poset. Since every set is a subset of itself, ⊆ is reflexive. If
X, Y ∈ 𝒫(A) with X ⊆ Y and Y ⊆ X, then X = Y by the Axiom of Extensionality (see the
Technical note after Theorem 2.5 in Lesson 2). So, ⊆ is antisymmetric. By Theorem 2.3, ⊆ is
transitive. The following tree diagrams give visual representations of this poset when A = {a, b}
and A = {a, b, c}. For a detailed explanation of these diagrams, see Example 2.8 in Lesson 2.
[Figure: two tree diagrams, with the levels listed from top to bottom.]
      {a, b}                      {a, b, c}
    {a}    {b}            {a, b}   {a, c}   {b, c}
       ∅                    {a}      {b}      {c}
                                      ∅
Let (A, ≤) be a poset. We say that a, b ∈ A are comparable if a ≤ b or b ≤ a. The poset satisfies the
comparability condition if every pair of elements in A is comparable. A poset that satisfies the
comparability condition is called a linearly ordered set (or totally ordered set). Similarly, a strict
linearly ordered set (A, <) satisfies trichotomy: If a, b ∈ A, then a < b, a = b, or b < a.
Example 10.13:
1. (ℕ, ≤), (ℤ, ≤), (ℚ, ≤), and (ℝ, ≤) are linearly ordered sets. Problem 5 from Lesson 5 (parts (i),
(ii), and (iv)) shows that (ℚ, ≤) and (ℝ, ≤) are linearly ordered.
Similarly, (ℕ, <), (ℤ, <), (ℚ, <), and (ℝ, <) are strict linearly ordered sets.
2. If A has at least two elements, then (𝒫(A), ⊆) is not linearly ordered. Indeed, if a, b ∈ A with
a ≠ b, then {a} ⊈ {b} and {b} ⊈ {a}. See either of the tree diagrams above at the end of
Example 10.12.
Functions
Let A and B be sets. f is a function from A to B, written f: A → B, if the following two conditions hold.
1. f ⊆ A × B.
2. For all a ∈ A, there is a unique b ∈ B such that (a, b) ∈ f.
If f: A → B, the domain of f, written dom f, is the set A, and the range of f, written ran f, is the set
{f(a) | a ∈ A}. Observe that ran f ⊆ B. The set B is sometimes called the codomain of f. When we
know that f is a function, we will abbreviate (a, b) ∈ f by f(a) = b.
Example 10.14:
1. f = {(0, a), (1, a)} is a function with dom f = {0, 1} and ran f = {a}. Instead of (0, a) ∈ f, we
will usually write f(0) = a. Similarly, instead of (1, a) ∈ f, we will write f(1) = a. Here is a
visual representation of this function.
[Figure: arrow diagram of f, with arrows from 0 to a and from 1 to a.]
This function f: {0, 1} → {a} is called a constant function because the range of f consists of a
single element.
Note also that f is a binary relation on the set {0, 1, a}. In general, a function f: A → B is a
binary relation on A ∪ B.
2. If a ≠ b, then g = {(0, a), (0, b)} is not a function because it violates the second condition in
the definition of being a function. It is, however, a binary relation on {0, a, b}.
[Figure: arrow diagram of g, with arrows from 0 to a and from 0 to b.]
3. h = {(a, b) | a, b ∈ ℝ ∧ a > 0 ∧ a² + b² = 2} is a relation on ℝ that is not a function. (1, 1)
and (1, –1) are both elements of h, violating the second condition in the definition of a
function. See the figure below on the left. Notice how a vertical line hits the graph twice.
[Figure: graph of h on the left, with the points (1, 1) and (1, –1) marked, and graph of k on the right.]
4. k = {(a, b) | a, b ∈ ℝ ∧ b > 0 ∧ a² + b² = 2} is a function. See the figure above on the right.
To see that the second condition in the definition of a function is satisfied, suppose that (a, b)
and (a, c) are both in k. Then a² + b² = 2, a² + c² = 2, and b and c are both positive. It
follows that b² = c², and since b and c are both positive, we have b = c.
We have dom k = (–√2, √2) and ran k = (0, √2]. So, k: (–√2, √2) → (0, √2].
5. A function with domain ℕ is called an infinite sequence. For example, let f: ℕ → {0, 1} be
defined by f(n) = 0 if n is even and f(n) = 1 if n is odd. A nice way to visualize an infinite
sequence is to list the “outputs” of the sequence in order in parentheses. So, we may write f as
(0, 1, 0, 1, 0, 1, …). In general, if A is a nonempty set and f: ℕ → A is a sequence, then we can
write f as (f(0), f(1), f(2), …).
Similarly, a finite sequence is a function with domain {0, 1, …, n − 1} for some n. For example,
the sequence (0, 2, 4, 6, 8, 10) is the function h: {0, 1, 2, 3, 4, 5} → ℕ defined by h(k) = 2k. If
the domain of a finite sequence is {0, 1, …, n − 1}, we say that the length of the sequence is n.
Observe how a finite sequence with domain {0, 1, …, n − 1} and range A looks just like an
n-tuple in Aⁿ. In fact, it’s completely natural to identify a finite sequence of length n with the
corresponding n-tuple. So, (0, 2, 4, 6, 8, 10) can be thought of as a 6-tuple from ℕ⁶, or as the
function h: {0, 1, 2, 3, 4, 5} → ℕ defined by h(k) = 2k.
Informally, we can think of an infinite sequence as an infinite length tuple. As one more
example, (1, 2, 4, 8, 16, 32, …) represents the sequence g: ℕ → ℕ defined by g(n) = 2ⁿ.
Note: In the study of set theory, we define the natural numbers by letting 0 = ∅, 1 = {0}, 2 = {0, 1},
3 = {0, 1, 2},… and so on. In general, the natural number ๐ is the set of all its predecessors. Specifically,
๐ = {0, 1, 2, … , ๐ − 1}. Using this notation, we can say that a finite sequence of length ๐ is a function
๐: ๐ → ๐ด for some set ๐ด. For example, the function โ above has domain 6, so that โ: 6 → โ.
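The identification of a finite sequence with a tuple, and the set-theoretic coding of natural numbers described in the Note, can both be tried out on small cases. The Python sketch below is illustrative only. The names `as_tuple` and `von_neumann` are hypothetical helpers, and `frozenset` is used simply because Python sets can only contain hashable elements.

```python
def as_tuple(sequence, length):
    """List the outputs sequence(0), ..., sequence(length - 1) as a tuple."""
    return tuple(sequence(i) for i in range(length))


def h(n):
    """The finite sequence from the example: n is sent to 2n."""
    return 2 * n


def von_neumann(n):
    """Build the natural number n as the set of all its predecessors."""
    number = frozenset()              # 0 is the empty set
    for _ in range(n):
        number = number | {number}    # successor: add the number itself as an element
    return number
```

With these definitions, `as_tuple(h, 6)` produces the 6-tuple of outputs of the sequence, and `von_neumann(6)` is a set with exactly 6 elements, namely the codes of 0 through 5.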
A function $f: A \to B$ is injective (or one-to-one), written $f: A \hookrightarrow B$, if for all $a, b \in A$, if $a \neq b$, then
$f(a) \neq f(b)$. In this case, we call $f$ an injection.
Note: The contrapositive of the statement “If $a \neq b$, then $f(a) \neq f(b)$” is “If $f(a) = f(b)$, then
$a = b$.” So, we can say that a function $f: A \to B$ is injective if for all $a, b \in A$, if $f(a) = f(b)$, then
$a = b$.
A function $f: A \to B$ is surjective (or onto $B$), written $f: A \twoheadrightarrow B$, if for all $b \in B$, there is an $a \in A$ such
that $f(a) = b$. In this case, we call $f$ a surjection.
A function $f: A \to B$ is bijective, written $f: A \cong B$, if $f$ is both an injection and a surjection. In this case,
we call $f$ a bijection.
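For a function on a finite set, these three properties can be tested directly. The sketch below stores a function as a Python dictionary from inputs to outputs. The helper names are hypothetical, and the codomain has to be passed in separately because a set of pairs does not record it.

```python
def is_injective(f):
    """No two inputs share an output."""
    outputs = list(f.values())
    return len(outputs) == len(set(outputs))


def is_surjective(f, codomain):
    """Every element of the codomain is an output of some input."""
    return set(codomain) <= set(f.values())


def is_bijective(f, codomain):
    """Both an injection and a surjection."""
    return is_injective(f) and is_surjective(f, codomain)


# The function {(0, a), (1, a)} from part 1 of Example 10.14,
# with the string "a" standing in for the object a.
f = {0: "a", 1: "a"}
```

Here `is_injective(f)` fails, and `is_surjective(f, {"a"})` holds while `is_surjective(f, {"a", "b"})` does not, so the answer to the surjectivity question changes with the codomain supplied.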
Example 10.15:
1. ๐ = {(0, ๐), (1, ๐)} from part 1 of Example 10.14 is not an injective function because ๐(0) = ๐,
๐(1) = ๐, and 0 ≠ 1. If we think of ๐ as ๐: {0, 1} → {๐}, then ๐ is surjective. However, if we
think of ๐ as ๐: {0, 1} → {๐, ๐}, then ๐ is not surjective. So, surjectivity depends upon the
codomain of the function.
2. $g = \{(a, b) \mid a, b \in \mathbb{R} \wedge b > 0 \wedge a^2 + b^2 = 2\}$ from part 4 of
Example 10.14 is not an injective function. For example,
$(1, 1) \in g$ because $1^2 + 1^2 = 1 + 1 = 2$ and $(-1, 1) \in g$
because $(-1)^2 + 1^2 = 1 + 1 = 2$. Notice how a horizontal
line hits the graph twice. If we think of $g$ as a function from
$(-\sqrt{2}, \sqrt{2})$ to $\mathbb{R}^+$, then $g$ is not surjective. For example,
$2 \notin \operatorname{ran} g$ because for any $a \in \mathbb{R}$, $a^2 + 2^2 = a^2 + 4 \geq 4$,
and so, $a^2 + 2^2$ cannot be equal to 2. However, if instead we
consider $g$ as a function with codomain $(0, \sqrt{2}\,]$, that is
$g: (-\sqrt{2}, \sqrt{2}) \to (0, \sqrt{2}\,]$, then $g$ is surjective. Indeed, if
$0 < b \leq \sqrt{2}$, then $0 < b^2 \leq 2$, and so, $a^2 = 2 - b^2 \geq 0$.
Therefore, $a = \sqrt{2 - b^2}$ is a real number such that $g(a) = b$.
[Figure: graph of $g$, with the points $(-1, 1)$ and $(1, 1)$ marked.]
3. Define $f: \mathbb{R} \to \mathbb{R}$ by $f(x) = 7x - 3$. Then $f$ is injective because if
$f(a) = f(b)$, we then have $7a - 3 = 7b - 3$. Using the fact that $\mathbb{R}$ is
a field, we get $7a = 7b$ (by the additive inverse property), and then
$a = b$ (by the multiplicative inverse property). Also, $f$ is surjective
because if $b \in \mathbb{R}$, then $\frac{b+3}{7} \in \mathbb{R}$ (because $\mathbb{R}$ is a field) and
\[ f\left(\frac{b+3}{7}\right) = 7\left(\frac{b+3}{7}\right) - 3 = (b + 3) - 3 = b + (3 - 3) = b + 0 = b. \]
Therefore, $f$ is bijective. See the image to the right for a visual
representation of $\mathbb{R}^2$ and the graph of the function $f$.
Notice that any vertical line will hit the graph of ๐ exactly once because
๐ is a function with domain โ. Also, any horizontal line will hit the
graph exactly once because ๐ is bijective. Injectivity ensures that each horizontal line hits the
graph at most once and surjectivity ensures that each horizontal line hits the graph at least
once.
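The surjectivity computation in part 3 can be replayed with exact arithmetic. The Python sketch below is a spot check on a few rational sample values, not a proof. It uses `fractions.Fraction` so that no rounding occurs, which means only rational inputs are tested even though the function is defined on all real numbers. The names `f_inverse` and `samples` are hypothetical.

```python
from fractions import Fraction


def f(x):
    """The function from part 3: x is sent to 7x - 3."""
    return 7 * x - 3


def f_inverse(b):
    """The preimage (b + 3)/7 found in the surjectivity argument."""
    return (Fraction(b) + 3) / 7


# A few hypothetical sample values, chosen arbitrarily.
samples = [Fraction(-5, 2), Fraction(0), Fraction(4), Fraction(22, 7)]
round_trips = [f(f_inverse(b)) == b and f_inverse(f(b)) == b for b in samples]
```

Every entry of `round_trips` is expected to be true, matching the displayed computation.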
If ๐: ๐ด → ๐ต is bijective, we define ๐ −1 : ๐ต → ๐ด, the inverse of ๐, by ๐ −1 = {(๐, ๐) | (๐, ๐) ∈ ๐}. In other
words, for each ๐ ∈ ๐ต, ๐ −1 (๐) = “the unique ๐ ∈ ๐ด such that ๐(๐) = ๐.”
Notes: (1) Let ๐: ๐ด → ๐ต be bijective. Since ๐ is surjective, for each ๐ ∈ ๐ต, there is an ๐ ∈ ๐ด such that
๐(๐) = ๐. Since ๐ is injective, there is only one such value of ๐.
(2) The inverse of a bijective function is also bijective.
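Since the inverse is defined by reversing every ordered pair, it is easy to compute for a finite bijection. The sketch below is a minimal illustration using a Python dictionary. The helper name `inverse` and the sample bijection are hypothetical, and the error raised for a non-injective input is a design choice of this sketch rather than part of the definition.

```python
def inverse(f):
    """Reverse each pair (a, b) of a finite function stored as a dict."""
    reversed_pairs = {b: a for a, b in f.items()}
    if len(reversed_pairs) != len(f):
        # Two inputs shared an output, so the reversed pairs are not a function.
        raise ValueError("f is not injective")
    return reversed_pairs


# Sample bijection from {0, 1} to {"a", "b"}.
f = {0: "a", 1: "b"}
f_inv = inverse(f)
```

Reversing the pairs a second time recovers the original function, in line with Note (2).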
Example 10.16:
1. Define ๐: {0, 1} → {๐, ๐} by ๐ = {(0, ๐), (1, ๐)}. Then ๐ is a bijection and ๐ −1 : {๐, ๐} → {0, 1} is
defined by ๐ −1 = {(๐, 0), (๐, 1)}. Observe that ๐ −1 is also a bijection.
2. Let $\mathbb{E} = \{0, 2, 4, 6, 8, \ldots\}$ be the set of even natural numbers and let $\mathbb{O} = \{1, 3, 5, 7, 9, \ldots\}$ be the
set of odd natural numbers. The function $f: \mathbb{E} \to \mathbb{O}$ defined by $f(n) = n + 1$ is a bijection with
inverse $f^{-1}: \mathbb{O} \to \mathbb{E}$ defined by $f^{-1}(n) = n - 1$.
3. If $X$ and $Y$ are sets, we define ${}^{X}Y$ to be the set of functions from $X$ to $Y$. Symbolically, we have
\[ {}^{X}Y = \{f \mid f: X \to Y\}. \]
For example, if $A = \{a, b\}$ and $B = \{0, 1\}$, then ${}^{A}B$ has 4 elements (each element is a function
from $A$ to $B$). The elements are $f_1 = \{(a, 0), (b, 0)\}$, $f_2 = \{(a, 0), (b, 1)\}$, $f_3 = \{(a, 1), (b, 0)\}$,
and $f_4 = \{(a, 1), (b, 1)\}$. Here is a visual representation of these four functions.
[Figure: arrow diagrams of the four functions $f_1$, $f_2$, $f_3$, and $f_4$ from $A$ to $B$.]
Define $F: {}^{A}B \to \mathcal{P}(A)$ by $F(f) = \{x \in A \mid f(x) = 1\}$.
So, $F(f_1) = \emptyset$, $F(f_2) = \{b\}$, $F(f_3) = \{a\}$, and $F(f_4) = \{a, b\}$.
Since $\mathcal{P}(A) = \{\emptyset, \{a\}, \{b\}, \{a, b\}\}$, we see that $F$ is a bijection from ${}^{A}B$ to $\mathcal{P}(A)$.
The inverse of $F$ is the function $F^{-1}: \mathcal{P}(A) \to {}^{A}B$ defined by
\[ F^{-1}(C)(x) = \begin{cases} 0 & \text{if } x \notin C, \\ 1 & \text{if } x \in C. \end{cases} \]
So, we see that $F^{-1}(\emptyset) = f_1$, $F^{-1}(\{b\}) = f_2$, $F^{-1}(\{a\}) = f_3$, and $F^{-1}(\{a, b\}) = f_4$.
4. For $A \neq \emptyset$ and $B = \{0, 1\}$, the function $F: {}^{A}B \to \mathcal{P}(A)$ defined by $F(f) = \{x \in A \mid f(x) = 1\}$
is always a bijection.
To see that $F$ is injective, let $f, g \in {}^{A}B$ with $f \neq g$. Since $f$ and $g$ are different, there is some
$a \in A$ such that either $f(a) = 0$, $g(a) = 1$ or $f(a) = 1$, $g(a) = 0$. Without loss of generality,
assume that $f(a) = 0$, $g(a) = 1$. Since $f(a) = 0$, $a \notin F(f)$. Since $g(a) = 1$, $a \in F(g)$. So,
$F(f) \neq F(g)$. Since $f \neq g$ implies $F(f) \neq F(g)$, $F$ is injective.
To see that $F$ is surjective, let $C \in \mathcal{P}(A)$, so that $C \subseteq A$. Define $f \in {}^{A}B$ by
\[ f(x) = \begin{cases} 0 & \text{if } x \notin C, \\ 1 & \text{if } x \in C. \end{cases} \]
Then $x \in F(f)$ if and only if $f(x) = 1$ if and only if $x \in C$. So, $F(f) = C$. Since $C \in \mathcal{P}(A)$ was
arbitrary, $F$ is surjective.
As in 3, the inverse of $F$ is the function $F^{-1}: \mathcal{P}(A) \to {}^{A}B$ defined by
\[ F^{-1}(C)(x) = \begin{cases} 0 & \text{if } x \notin C, \\ 1 & \text{if } x \in C. \end{cases} \]
Notes: (1) See the Note following Theorem 6.6 in Lesson 6 for an explanation of the expression
“Without loss of generality,” and how to properly use it in a proof.
(2) As in the note following Example 10.14, using the notation ๐ = {0, 1, 2, … , ๐ − 1}, we have just
shown that for any nonempty set ๐ด, there is a bijection ๐: ๐ด2 → ๐ซ(๐ด).
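For a small finite set, the bijection $F$ from parts 3 and 4 of Example 10.16 can be checked by brute force. The Python sketch below enumerates every function from a three-element set into {0, 1} and applies $F$ to each. The helper names and the three-element sample set are hypothetical choices for illustration.

```python
from itertools import combinations, product


def all_functions(A, B):
    """Every function from A to B, each stored as a dict."""
    inputs = sorted(A)
    return [dict(zip(inputs, outputs))
            for outputs in product(sorted(B), repeat=len(inputs))]


def F(f):
    """Send a 0/1-valued function to the set of inputs it maps to 1."""
    return frozenset(x for x, value in f.items() if value == 1)


def power_set(A):
    """All subsets of A, as frozensets."""
    elements = sorted(A)
    return {frozenset(c)
            for size in range(len(elements) + 1)
            for c in combinations(elements, size)}


A = {"a", "b", "c"}                  # hypothetical sample set
functions = all_functions(A, {0, 1})
images = [F(f) for f in functions]
```

In this sample there are 8 functions, their images under `F` are pairwise distinct, and together the images make up the whole power set of `A`.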
Given functions $f: A \to B$ and $g: B \to C$, the composite of $f$ and $g$, written $g \circ f: A \to C$, is defined by
$(g \circ f)(a) = g(f(a))$ for all $a \in A$. Symbolically, we have
\[ g \circ f = \{(a, c) \in A \times C \mid \text{there is a } b \in B \text{ such that } (a, b) \in f \text{ and } (b, c) \in g\}. \]
We can visualize the composition of two functions ๐ and ๐ as follows.
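[Figure: the sets $A$, $B$, and $C$, with an arrow labeled $f$ from $a$ to $f(a)$, an arrow labeled $g$ from $f(a)$ to $g(f(a))$, and an arrow labeled $g \circ f$ from $a$ directly to $(g \circ f)(a)$.]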
In the picture above, sets ๐ด, ๐ต, and ๐ถ are drawn as different shapes simply to emphasize that they can
all be different sets. Starting with an arbitrary element ๐ ∈ ๐ด, we have an arrow showing ๐ being
mapped by ๐ to ๐(๐) ∈ ๐ต and another arrow showing ๐(๐) being mapped by ๐ to ๐(๐(๐)) ∈ ๐ถ. There
is also an arrow going directly from ๐ ∈ ๐ด to (๐ โ ๐)(๐) = ๐(๐(๐)) in ๐ถ. Note that the only way we
know how to get from ๐ to (๐ โ ๐)(๐) is to first travel from ๐ to ๐(๐), and then to travel from ๐(๐) to
๐(๐(๐)).
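The two descriptions of the composite, the rule $(g \circ f)(a) = g(f(a))$ and the set of pairs, can be compared on a small example. In the Python sketch below the helper names and the sample functions are hypothetical.

```python
def compose(g, f):
    """The composite g after f, computed by the rule a -> g(f(a))."""
    return {a: g[f[a]] for a in f}


def compose_pairs(g_pairs, f_pairs):
    """The composite as a set of pairs (a, c) linked through some middle element b."""
    return {(a, c)
            for (a, b) in f_pairs
            for (b2, c) in g_pairs
            if b == b2}


# Hypothetical sample functions f: {1, 2, 3} -> {"x", "y"} and g: {"x", "y"} -> {True, False}.
f = {1: "x", 2: "y", 3: "x"}
g = {"x": True, "y": False}

by_rule = compose(g, f)
by_pairs = compose_pairs(set(g.items()), set(f.items()))
```

Both computations describe the same function: the pairs in `by_pairs` are exactly the input-output pairs of `by_rule`.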
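Example 10.17: Define $f: \mathbb{Q} \to \mathbb{R}$ by $f(x) = x\sqrt{2}$ and define $g: \mathbb{R} \to \{0, 1\}$ by
\[ g(x) = \begin{cases} 0 & \text{if } x \in \mathbb{Q}, \\ 1 & \text{if } x \in \mathbb{R} \setminus \mathbb{Q}. \end{cases} \]
Then $g \circ f: \mathbb{Q} \to \{0, 1\}$ is defined by
\[ (g \circ f)(x) = \begin{cases} 0 & \text{if } x = 0, \\ 1 & \text{if } x \in \mathbb{Q} \setminus \{0\}. \end{cases} \]
To see this, observe that $(g \circ f)(0) = g(f(0)) = g(0\sqrt{2}) = g(0) = 0$ because $0 \in \mathbb{Q}$. If $x \in \mathbb{Q} \setminus \{0\}$,
then $x\sqrt{2} \notin \mathbb{Q}$ because if $y = x\sqrt{2} \in \mathbb{Q}$, then since $\mathbb{Q}$ is a field, $\sqrt{2} = x^{-1}y \in \mathbb{Q}$, which we know to be
false. So, $(g \circ f)(x) = g(f(x)) = g(x\sqrt{2}) = 1$.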
It will be important to know that when we take the composition of bijective functions, we always get a
bijective function. We will prove this in two steps. We will first show that the composition of injective
functions is injective. We will then show that the composition of surjective functions is surjective.
Theorem 10.3: If $f: A \hookrightarrow B$ and $g: B \hookrightarrow C$, then $g \circ f: A \hookrightarrow C$.
Note: We are given that $f$ and $g$ are injections, and we want to show that $g \circ f$ is an injection. We can
show this directly using the definition of injectivity, or we can use the contrapositive of the definition
of injectivity. Let’s do it both ways.
Direct proof of Theorem 10.3: Suppose that $f: A \hookrightarrow B$ and $g: B \hookrightarrow C$, and let $x, y \in A$ with $x \neq y$. Since
$f$ is injective, $f(x) \neq f(y)$. Since $g$ is injective, $g(f(x)) \neq g(f(y))$. So, $(g \circ f)(x) \neq (g \circ f)(y)$. Since
$x, y \in A$ were arbitrary, $g \circ f: A \hookrightarrow C$.
โก
Contrapositive proof of Theorem 10.3: Suppose that $f: A \hookrightarrow B$ and $g: B \hookrightarrow C$, let $x, y \in A$, and suppose
that $(g \circ f)(x) = (g \circ f)(y)$. Then $g(f(x)) = g(f(y))$. Since $g$ is injective, $f(x) = f(y)$. Since $f$ is
injective, $x = y$. Since $x, y \in A$ were arbitrary, $g \circ f: A \hookrightarrow C$.
โก
Theorem 10.4: If $f: A \twoheadrightarrow B$ and $g: B \twoheadrightarrow C$, then $g \circ f: A \twoheadrightarrow C$.
Proof: Suppose that $f: A \twoheadrightarrow B$ and $g: B \twoheadrightarrow C$, and let $c \in C$. Since $g$ is surjective, there is $b \in B$ with
$g(b) = c$. Since $f$ is surjective, there is $a \in A$ with $f(a) = b$. So, $(g \circ f)(a) = g(f(a)) = g(b) = c$.
Since $c \in C$ was arbitrary, $g \circ f$ is surjective.
โก
Corollary 10.5: If ๐: ๐ด ≅ ๐ต and ๐: ๐ต ≅ ๐ถ, then ๐ โ ๐: ๐ด ≅ ๐ถ.
Proof: Suppose that ๐: ๐ด ≅ ๐ต and ๐: ๐ต ≅ ๐ถ. Then ๐ and ๐ are injective. By Theorem 10.3, ๐ โ ๐ is
injective. Also, ๐ and ๐ are surjective. By Theorem 10.4, ๐ โ ๐ is surjective. Since ๐ โ ๐ is both injective
and surjective, ๐ โ ๐ is bijective.
โก
Note: A corollary is a theorem that follows easily from a theorem or theorems that have already been
proved.
If $A$ is any set, then we define the identity function on $A$, written $i_A: A \to A$, by $i_A(a) = a$ for all $a \in A$.
Note that the identity function on $A$ is a bijection from $A$ to itself.
Theorem 10.6: If $f: A \cong B$, then $f^{-1} \circ f = i_A$ and $f \circ f^{-1} = i_B$.
Proof: Let $a \in A$ with $f(a) = b$. Then $f^{-1}(b) = a$, and so, $(f^{-1} \circ f)(a) = f^{-1}(f(a)) = f^{-1}(b) = a$.
Since $i_A(a) = a$, we see that $(f^{-1} \circ f)(a) = i_A(a)$. Since $a \in A$ was arbitrary, $f^{-1} \circ f = i_A$.
Now, let $b \in B$. Since $f: A \cong B$, there is a unique $a \in A$ with $f(a) = b$. Equivalently, $f^{-1}(b) = a$. We
have $(f \circ f^{-1})(b) = f(f^{-1}(b)) = f(a) = b$. Since $i_B(b) = b$, we see that $(f \circ f^{-1})(b) = i_B(b)$.
Since $b \in B$ was arbitrary, $f \circ f^{-1} = i_B$.
โก
Equinumerosity
We say that two sets ๐ด and ๐ต are equinumerous, written ๐ด ~ ๐ต if there is a bijection ๐: ๐ด ≅ ๐ต.
It is easy to see that ~ is an equivalence relation. For any set ๐ด, the identity function ๐๐ด : ๐ด → ๐ด is a
bijection, showing that ~ is reflexive. For sets ๐ด and ๐ต, if ๐: ๐ด ≅ ๐ต, then ๐ −1 : ๐ต ≅ ๐ด, showing that ~ is
symmetric. For sets ๐ด, ๐ต, and ๐ถ, if ๐: ๐ด ≅ ๐ต and ๐: ๐ต ≅ ๐ถ, then ๐ โ ๐: ๐ด ≅ ๐ถ by Corollary 10.5, showing
that ~ is transitive.
Example 10.18:
1. Let ๐ด = {anteater, elephant, giraffe} and ๐ต = {apple, banana, orange}. Then ๐ด ~ ๐ต. We can
define a bijection ๐: ๐ด ≅ ๐ต by ๐(anteater) = apple, ๐(elephant) = banana, and
๐(giraffe) = orange. This is not the only bijection from ๐ด to ๐ต, but we need only find one (or
prove one exists) to show that the sets are equinumerous.
2. At this point it should be easy to see that two finite sets are equinumerous if and only if they
have the same number of elements. It should also be easy to see that a finite set can never be
equinumerous with an infinite set.
3. Let โ = {0, 1, 2, 3, 4 … } be the set of natural numbers and ๐ผ = {0, 2, 4, 6, 8 … } the set of even
natural numbers. Then โ ~ ๐ผ. We can actually see a bijection between these two sets just by
looking at the sets themselves.
0 1 2 3 4 5 6…
0 2 4 6 8 10 12 …
The function ๐: โ → ๐ผ defined by ๐(๐) = 2๐ is an explicit bijection. To see that ๐ maps โ into
๐ผ, just observe that if ๐ ∈ โ, then 2๐ ∈ ๐ผ by the definition of an even integer (see Lesson 4). ๐
is injective because if ๐(๐) = ๐(๐), then 2๐ = 2๐, and so, ๐ = ๐. Finally, ๐ is surjective
because if ๐ ∈ ๐ผ, then there is ๐ ∈ โ such that ๐ = 2๐. So, ๐(๐) = 2๐ = ๐.
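4. $\mathbb{N} \sim \mathbb{Z}$ via the bijection $f: \mathbb{N} \cong \mathbb{Z}$ defined by
\[ f(n) = \begin{cases} \dfrac{n}{2} & \text{if } n \text{ is even,} \\[4pt] -\dfrac{n+1}{2} & \text{if } n \text{ is odd.} \end{cases} \]
Let’s look at this correspondence visually:
0 1 2 3 4 5 6 …
0 –1 1 –2 2 –3 3 …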
Many students get confused here because they are under the misconception that the integers
should be written “in order.” However, when checking to see if two sets are equinumerous, we
do not include any other structure. In other words, we are just trying to “pair up” elements—it
does not matter how we do so.
You will be asked to verify that the function ๐ defined above is a bijection in Problem 8 below.
5. For ๐ด any nonempty set, ๐ด{0, 1} ~ ๐ซ(๐ด). We showed this in part 4 of Example 10.16.
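The bijections in parts 3 and 4 of Example 10.18 can be explored on an initial segment of the natural numbers. The Python sketch below is only a finite sanity check, not a proof of bijectivity. The names `to_even` and `to_integer` and the cutoff `N` are hypothetical.

```python
def to_even(n):
    """Part 3: n is sent to 2n."""
    return 2 * n


def to_integer(n):
    """Part 4: n/2 for even n, and -(n + 1)/2 for odd n."""
    if n % 2 == 0:
        return n // 2
    return -((n + 1) // 2)


N = 50  # hypothetical cutoff for the finite check
first_outputs = [to_integer(n) for n in range(7)]
outputs = [to_integer(n) for n in range(2 * N + 1)]
```

On the inputs 0 through 2N, the values of `to_integer` are pairwise distinct and fill out exactly the integers from -N to N, which is the pattern shown in the visual correspondence.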
We say that a set is countable if it is equinumerous with a subset of โ. It’s easy to visualize a countable
set because a bijection from a subset of โ to a set ๐ด generates a list. For example, the set ๐ผ can be
listed as 0, 2, 4, 6, … and the set โค can be listed as 0, – 1, 1, – 2, 2, … (see Example 10.18 above).
There are two kinds of countable sets: finite sets and denumerable sets. We say that a set is
denumerable if it is countably infinite.
At this point, you may be asking yourself if all infinite sets are denumerable. If this were the case, then
we would simply have finite sets and infinite sets, and that would be the end of it. However, there are
in fact infinite sets that are not denumerable. An infinite set that is not denumerable is uncountable.
Theorem 10.7 (Cantor’s Theorem): If ๐ด is any set, then ๐ด is not equinumerous with ๐ซ(๐ด).
Analysis: How can we prove that ๐ด is not equinumerous with ๐ซ(๐ด)? Well, we need to show that there
does not exist a bijection from ๐ด to ๐ซ(๐ด). Recall that a bijection is a function which is both an injection
and a surjection. So, we will attempt to show that there do not exist any surjections from ๐ด to ๐ซ(๐ด).
To do this, we will take an arbitrary function ๐: ๐ด → ๐ซ(๐ด), and then argue that ๐ is not surjective. We
will show that ran ๐ ≠ ๐ซ(๐ด) by finding a set ๐ต ∈ ๐ซ(๐ด) โ ran ๐. In words, we will find a subset of ๐ด
that is not in the range of ๐.
Let’s begin by looking at โ, the set of natural numbers. Given a specific function ๐: โ → ๐ซ(โ), it’s not
too hard to come up with a set ๐ต ∈ ๐ซ(โ) โ ran ๐. Let’s choose a specific such ๐ and use this example
to try to come up with a procedure for describing the set ๐ต.
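\[ \begin{aligned}
f(0) &= \{\mathbf{0}, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, \ldots\} \\
f(1) &= \{0, \mathbf{1}, 3, 4, 5, 6, 7, 8, 9, 10, \ldots\} \\
f(2) &= \{0, 1, 4, 5, 6, 7, 8, 9, 10, \ldots\} \\
f(3) &= \{0, 1, 4, 6, 7, 8, 9, 10, \ldots\} \\
f(4) &= \{0, 1, \mathbf{4}, 6, 8, 9, 10, \ldots\} \\
&\;\;\vdots
\end{aligned} \]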
Technical note: Recall that a prime number is a natural number with exactly two factors, 1 and itself.
The set of prime numbers looks like this: {2, 3, 5, 7, 11, 13, 17, … }. The function ๐: โ → ๐ซ(โ) that we
chose to use here is defined by ๐(๐) = {๐ ∈ โ | ๐ is not equal to one of the first ๐ prime numbers}.
Notice how ๐(0) is just the set โ of all natural numbers, ๐(1) is the set of all natural numbers except
2 (we left out the first prime), ๐(2) is the set of all natural numbers except 2 and 3 (we left out the first
two primes), and so on. Prime numbers will be covered in detail in Lesson 12.
Observe that the “inputs” of our function are natural numbers, and the “outputs” are sets of natural
numbers. So, it’s perfectly natural to ask the question “Is ๐ in ๐(๐)?”
For example, we see that 0 ∈ ๐(0), 1 ∈ ๐(1), and 4 ∈ ๐(4) (indicated in bold in the definition of the
function above). However, we also see that 2 ∉ ๐(2) and 3 ∉ ๐(3).
Let’s let ๐ต be the set of natural numbers ๐ that are not inside their images. Symbolically, we have
๐ต = {๐ ∈ โ | ๐ ∉ ๐(๐)}.
Which natural numbers are in the set ๐ต? Well, we already said that 0 ∈ ๐(0). It follows that 0 ∉ ๐ต.
Similarly, 1 ∉ ๐ต and 4 ∉ ๐ต, but 2 ∈ ๐ต and 3 ∈ ๐ต.
Why did we choose to define $B$ this way? The reason is that we are trying to make sure that $B$
cannot be equal to $f(n)$ for any $n$. Since $0 \in f(0)$, but $0 \notin B$, it follows that $f(0)$ and $B$ are different
sets because they differ by at least one element, namely 0. Similarly, since $1 \in f(1)$, but $1 \notin B$, $B$
cannot be equal to $f(1)$. What about 2? Well $2 \notin f(2)$, but $2 \in B$. Therefore, $B \neq f(2)$ as well… and
so on down the line. We intentionally chose to make $B$ disagree with $f(n)$ for every natural number $n$,
ensuring that $B$ will not be in the range of $f$.
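The membership questions for this particular function can be answered by a short computation. The Python sketch below rebuilds the function from the technical note (leave out the first n primes) and lists the elements of the resulting set that lie below 20. The helper names `first_primes` and `in_image` and the cutoff 20 are hypothetical.

```python
def first_primes(count):
    """The first `count` prime numbers, found by trial division."""
    primes = []
    candidate = 2
    while len(primes) < count:
        if all(candidate % p != 0 for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def in_image(k, n):
    """Is k an element of f(n), the naturals other than the first n primes?"""
    return k not in first_primes(n)


# The natural numbers n below 20 that are NOT elements of their own image f(n).
diagonal_below_20 = [n for n in range(20) if not in_image(n, n)]
```

For this choice of function the diagonal set consists exactly of the primes, because a prime p is always among the first p primes while a non-prime is never left out of any image.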
I think we are now ready to prove the theorem.
Proof of Theorem 10.7: Let $f: A \to \mathcal{P}(A)$, and let $B = \{a \in A \mid a \notin f(a)\}$. Suppose toward
contradiction that $B \in \operatorname{ran} f$. Then there is $a \in A$ with $f(a) = B$. But then we have $a \in B$ if and only
if $a \notin f(a)$ if and only if $a \notin B$. This contradiction tells us that $B \notin \operatorname{ran} f$, and so, $f$ is not surjective.
Since $f: A \to \mathcal{P}(A)$ was arbitrary, there does not exist a surjection from $A$ to $\mathcal{P}(A)$, and therefore, there
is no bijection from $A$ to $\mathcal{P}(A)$. So, $A$ is not equinumerous with $\mathcal{P}(A)$.
โก
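For a very small set, the argument in the proof can be confirmed exhaustively: every function from the set to its power set misses its own diagonal set. The Python sketch below does this for a three-element set. It is an illustration of the proof on one finite case, and the helper names and the sample set are hypothetical.

```python
from itertools import combinations, product


def power_set(A):
    """All subsets of A, as a list of frozensets."""
    elements = list(A)
    return [frozenset(c)
            for size in range(len(elements) + 1)
            for c in combinations(elements, size)]


def diagonal_set(f):
    """The set of inputs a that are not elements of their own image f(a)."""
    return frozenset(a for a in f if a not in f[a])


A = [0, 1, 2]                       # hypothetical sample set
subsets = power_set(A)
# Every function from A to its power set, each stored as a dict.
all_maps = [dict(zip(A, choice)) for choice in product(subsets, repeat=len(A))]
diagonal_always_missed = all(diagonal_set(f) not in f.values() for f in all_maps)
```

There are 8 subsets and therefore 512 functions to examine, and in each case the diagonal set is absent from the range.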
So, for example, โ is not equinumerous with ๐ซ(โ). Which of these two sets is the “bigger” one? Let’s
consider the function ๐: โ → ๐ซ(โ) defined by ๐(๐) = {๐}. This function looks like this:
0 1 2 3 4…
{0} {1} {2} {3} {4} …
Observe that we are matching up each natural number with a subset of natural numbers (a very simple
subset consisting of just one natural number) in a way so that different natural numbers get matched
with different subsets. In other words, we defined an injective function from โ to ๐ซ(โ). It seems like
there are lots of subsets of โ that didn’t get mapped to (for example, all infinite subsets of โ). So, it
seems that โ is a “smaller” set than ๐ซ(โ).
We use the notation $A \preceq B$ if there is an injective function from $A$ to $B$.
\[ A \preceq B \text{ if and only if } \exists f\,(f: A \hookrightarrow B) \]
We write $A \prec B$ if $A \preceq B$ and $A \nsim B$.
So, for example, $\mathbb{N} \prec \mathcal{P}(\mathbb{N})$.
Theorem 10.8: If $A$ is any set, then $A \prec \mathcal{P}(A)$.
Proof: The function $f: A \to \mathcal{P}(A)$ defined by $f(a) = \{a\}$ is injective. So, $A \preceq \mathcal{P}(A)$. By Theorem 10.7,
$A \nsim \mathcal{P}(A)$. It follows that $A \prec \mathcal{P}(A)$.
โก
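Example 10.19: If we let $A = \mathcal{P}(\mathbb{N})$, we can apply Theorem 10.8 to this set $A$ to see that
$\mathcal{P}(\mathbb{N}) \prec \mathcal{P}(\mathcal{P}(\mathbb{N}))$. Continuing in this fashion, we get a sequence of increasingly larger sets.
\[ \mathbb{N} \prec \mathcal{P}(\mathbb{N}) \prec \mathcal{P}(\mathcal{P}(\mathbb{N})) \prec \mathcal{P}(\mathcal{P}(\mathcal{P}(\mathbb{N}))) \prec \cdots \]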
If ๐ด and ๐ต are arbitrary sets, in general it can be difficult to determine if ๐ด and ๐ต are equinumerous by
producing a bijection. Luckily, the next theorem provides an easier way.
Theorem 10.9 (The Cantor-Schroeder-Bernstein Theorem): If $A$ and $B$ are sets such that $A \preceq B$ and
$B \preceq A$, then $A \sim B$.
Note: At first glance, many students think that Theorem 10.9 is obvious and that the proof must be
trivial. This is not true. The theorem says that if there is an injective function from ๐ด to ๐ต and another
injective function from ๐ต to ๐ด, then there is a bijective function from ๐ด to ๐ต. This is a deep result, which
is far from obvious. Constructing a bijection from two arbitrary injections is not an easy thing to do. I
suggest that the reader take a few minutes to try to do it, if for no other reason than to convince
themselves that the proof is difficult. I leave the proof itself as an optional exercise.
Example 10.20: Let’s use Theorem 10.9 to prove that the open interval of real numbers (0, 1) is
equinumerous to the closed interval of real numbers [0, 1].
Analysis: Since (0, 1) ⊆ [0, 1], there is an obvious injective function ๐: (0, 1) → [0, 1] (just send each
element to itself).
The harder direction is finding an injective function $g$ from $[0, 1]$ into $(0, 1)$.
We will do this by drawing a line segment with endpoints $(0, \frac{1}{4})$ and $(1, \frac{3}{4})$.
This will give us a bijection from $[0, 1]$ to $[\frac{1}{4}, \frac{3}{4}]$. We can visualize this bijection
using the graph to the right. We will write an equation for this line in the
slope-intercept form $y = mx + b$. Here $m$ is the slope of the line and $b$ is
the $y$-intercept of the line. We can use the graph to see that $b = \frac{1}{4}$ and
\[ m = \frac{\text{rise}}{\text{run}} = \frac{\frac{3}{4} - \frac{1}{4}}{1 - 0} = \frac{2}{4} = \frac{1}{2}. \]
So, we define $g: [0, 1] \to (0, 1)$ by $g(x) = \frac{1}{2}x + \frac{1}{4}$.
Let’s write out the details of the proof.
Proof: Let $f: (0, 1) \to [0, 1]$ be defined by $f(x) = x$. Clearly, $f$ is injective, so that $(0, 1) \preceq [0, 1]$.
Next, we define $g: [0, 1] \to \mathbb{R}$ by $g(x) = \frac{1}{2}x + \frac{1}{4}$. If $0 \leq x \leq 1$, then $0 \leq \frac{1}{2}x \leq \frac{1}{2}$, and therefore,
$\frac{1}{4} \leq \frac{1}{2}x + \frac{1}{4} \leq \frac{3}{4}$. Since $0 < \frac{1}{4}$ and $\frac{3}{4} < 1$, we have $0 < g(x) < 1$. Therefore, $g: [0, 1] \to (0, 1)$. If $x \neq x'$,
then $\frac{1}{2}x \neq \frac{1}{2}x'$, and so, $g(x) = \frac{1}{2}x + \frac{1}{4} \neq \frac{1}{2}x' + \frac{1}{4} = g(x')$. This shows that $g$ is injective. It follows that
$[0, 1] \preceq (0, 1)$.
Since $(0, 1) \preceq [0, 1]$ and $[0, 1] \preceq (0, 1)$, it follows from the Cantor-Schroeder-Bernstein Theorem that
$(0, 1) \sim [0, 1]$.
โก
Notes: (1) If ๐ด ⊆ ๐ต, then the function ๐: ๐ด → ๐ต defined by ๐(๐) = ๐ for all ๐ ∈ ๐ด is always injective. It
is called the inclusion map.
(2) It is unfortunate that the same notation is used for points and open intervals. Normally this isn’t an
issue, but in this particular example both usages of this notation appear. Take another look at the
analysis above and make sure you can see when the notation (๐, ๐) is being used for a point and when
it is being used for an open interval.
(3) We could have used any interval $[a, b]$ with $0 < a < b < 1$ in place of $[\frac{1}{4}, \frac{3}{4}]$.
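The two properties of the map in Example 10.20 that the proof relies on (its values stay strictly between 0 and 1, and distinct inputs give distinct outputs) can be spot-checked with exact rational arithmetic. The Python sketch below samples only finitely many rational points of [0, 1], so it is a sanity check rather than a proof. The sample grid is a hypothetical choice.

```python
from fractions import Fraction


def g(x):
    """The map from Example 10.20: x is sent to (1/2)x + 1/4."""
    return Fraction(1, 2) * x + Fraction(1, 4)


# Hypothetical sample grid: 0, 1/100, 2/100, ..., 1.
samples = [Fraction(k, 100) for k in range(101)]
values = [g(x) for x in samples]

stays_inside_open_interval = all(0 < v < 1 for v in values)
distinct_on_samples = len(set(values)) == len(samples)
```

The endpoints 0 and 1 are sent to 1/4 and 3/4, so the closed interval lands well inside the open one.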
Problem Set 10
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. For each set $A$ below, evaluate (i) $A^2$; (ii) $\mathcal{P}(A)$; (iii) ${}^{A}A$.
1. ๐ด = ∅
2. ๐ด = {∅}
3. ๐ด = {0, 1}
4. ๐ด = ๐ซ({∅})
2. Find all partitions of the three-element set {๐, ๐, ๐} and the four-element set {๐, ๐, ๐, ๐}.
LEVEL 2
3. For ๐, ๐ ∈ โ, we will say that ๐ divides ๐, written ๐|๐, if there is a natural number ๐ such that
๐ = ๐๐. Notice that | is a binary relation on โ. Prove that (โ, | ) is a partially ordered set, but it
is not a linearly ordered set.
4. Prove that for each ๐ ∈ โค+ , ≡๐ (see part 3 of Example 10.8) is an equivalence relation on โค.
5. Let ๐ด, ๐ต, and ๐ถ be sets. Prove the following:
(i) If $A \subseteq B$, then $A \preceq B$.
(ii) $\preceq$ is transitive.
(iii) $\prec$ is transitive.
(iv) If $A \preceq B$ and $B \prec C$, then $A \prec C$.
(v) If $A \prec B$ and $B \preceq C$, then $A \prec C$.
6. Let ๐ด and ๐ต be sets such that ๐ด ⊆ ๐ต. Prove that ๐ซ(๐ด) โผ ๐ซ(๐ต).
LEVEL 3
7. For ๐, ๐ ∈ โโ, define ๐ โผ ๐ if and only if for all ๐ฅ ∈ โ, ๐(๐ฅ) ≤ ๐(๐ฅ). Is ( โโ, โผ) a poset? Is it
a linearly ordered set? What if we replace โผ by โผ∗ , where ๐ โผ∗ ๐ if and only if there is an ๐ฅ ∈ โ
such that ๐(๐ฅ) ≤ ๐(๐ฅ)?
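8. Prove that the function $f: \mathbb{N} \to \mathbb{Z}$ defined by
\[ f(n) = \begin{cases} \dfrac{n}{2} & \text{if } n \text{ is even} \\[4pt] -\dfrac{n+1}{2} & \text{if } n \text{ is odd} \end{cases} \]
is a bijection.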
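9. Define $\mathcal{P}_n(\mathbb{N})$ for each $n \in \mathbb{N}$ by $\mathcal{P}_0(\mathbb{N}) = \mathbb{N}$ and $\mathcal{P}_{n+1}(\mathbb{N}) = \mathcal{P}(\mathcal{P}_n(\mathbb{N}))$ for $n \geq 0$. Find a set $B$
such that for all $n \in \mathbb{N}$, $\mathcal{P}_n(\mathbb{N}) \prec B$.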
10. Prove that if ๐ด ~ ๐ต and ๐ถ ~ ๐ท, then ๐ด × ๐ถ ~ ๐ต × ๐ท.
LEVEL 4
11. Define a partition ๐ท of โ such that ๐ท ~ โ and for each ๐ ∈ ๐ท, ๐ ~ โ.
12. Prove that a countable union of countable sets is countable.
13. Let ๐ด and ๐ต be sets such that ๐ด ~ ๐ต. Prove that ๐ซ(๐ด) ~ ๐ซ(๐ต).
14. Prove the following:
(i)
โ × โ ~ โ.
(ii)
โ ~ โ.
(iii) Any two intervals of real numbers are equinumerous (including โ itself).
(iv)
โ
โ ~ ๐ซ(โ).
15. Prove that {๐ด ∈ ๐ซ(โ) | ๐ด is infinite} is uncountable.
16. For ๐, ๐ ∈ โโ, define ๐ <∗ ๐ if and only if there is ๐ ∈ โ such that for all ๐ > ๐, ๐(๐) < ๐(๐).
(i)
Is ( โโ, <∗ ) a strict poset?
(ii)
Is ( โโ, <∗ ) a strict linearly ordered set?
(iii) Let โฑ = {๐๐ : โ → โ | ๐ ∈ โ} be a countable set of functions. Must there be a function
๐ ∈ โโ such that for all ๐ ∈ โ, ๐๐ <∗ ๐?
17. Let ๐ท be a partition of a set ๐. Prove that there is an equivalence relation ~ on ๐ for which the
elements of ๐ท are the equivalence classes of ~. Conversely, if ~ is an equivalence relation on a
set ๐, prove that the equivalence classes of ~ form a partition of ๐.
LEVEL 5
18. Prove that if ๐ด ~ ๐ต and ๐ถ ~ ๐ท, then ๐ด๐ถ ~ ๐ต๐ท.
19. Prove that for any sets ๐ด, ๐ต, and ๐ถ, ๐ต×๐ถ๐ด ~ ๐ถ( ๐ต๐ด).
20. Prove the following:
(i)
๐ซ(โ) ~ {๐ ∈ โโ | ๐ is a bijection}.
(ii)
โ
โ โ โโ, given that โ ~ ๐ซ(โ) .
CHALLENGE PROBLEM
21. Prove the Cantor-Schroeder-Bernstein Theorem.
LESSON 11 – ABSTRACT ALGEBRA
STRUCTURES AND HOMOMORPHISMS
Structures and Substructures
An ๐-ary relation on a set ๐ is a subset of ๐ ๐ . We usually use the expressions unary, binary, and ternary
in place of 1-ary, 2-ary, and 3-ary. Note that a unary relation on ๐ is simply a subset of ๐. We do not
define a 0-ary relation.
Example 11.1: Let โค = {… , – 3, – 2, – 1, 0, 1, 2, 3, … } be the set of integers. The set โ = {0, 1, 2, 3, … }
of natural numbers is a unary relation on โค. In other words, โ ⊆ โค. Some examples of binary relations
on $\mathbb{Z}$ are the linear orderings $<$, $\leq$, $>$, and $\geq$ (see Example 10.5 (part 2)) and the equivalence relations
$\equiv_n = \{(a, b) \in \mathbb{Z}^2 \mid n \mid b - a\}$ (see Example 10.8 (part 3)). $R = \{(x, y, z) \in \mathbb{Z}^3 \mid x + y = z\}$ is an
example of a ternary relation on $\mathbb{Z}$ (see Example 10.7).
An ๐-ary operation on a set ๐ is a function from ๐ ๐ to ๐. We also define a 0-ary operation to simply be
an element of ๐. We will usually call a 0-ary operation a constant in ๐.
Example 11.2: Let โ be the set of real numbers. Negation is an example of a unary operation on โ.
This is the operation that maps each ๐ฅ ∈ โ to – ๐ฅ. Addition, subtraction, and multiplication are
examples of binary operations on โ. 0 is an example of a 0-ary operation on โ or a constant in โ.
A finitary relation is an ๐-ary relation for some ๐ ∈ โ∗ . A finitary operation is an ๐-ary operation for
some ๐ ∈ โ.
A structure is a set together with a collection of finitary operations and relations defined on the set.
The set is called the domain of the structure.
Example 11.3:
1. Semigroups, monoids, and groups are structures of the form (๐,โ), where ๐ is a set and โ is a
binary operation on ๐.
We may want to view a monoid as a structure of the form (๐,โ, ๐) and a group as a structure of
the form (๐,โ, −1 , ๐), where ๐ is a constant called the identity element of the monoid or group
and −1 is the unary inverse operator.
2. Rings and fields are structures of the form (๐, +, ⋅), where ๐ is a set, and + and ⋅ are binary
operations on ๐. Again, we may want to include additional operations (see part 4 of Example
11.5 and also part 4 of Example 11.6 below).
3. Ordered rings and fields are structures of the form (๐, +, ⋅, ≤), where ๐ is a set, + and ⋅ are
binary operations on ๐, and ≤ is a binary relation on ๐.
4. Every set without any operations and relations is a structure. For example, โ, โค, โ, โ, and โ
are structures. (Notice that we abbreviate the structure (๐) as ๐.)
5. We can view a vector space $(V, \oplus)$ over the field $(F, +, \cdot)$ as $(V \cup F, V, F, R_\oplus, R_+, R_\cdot, R_\star)$,
where $V$ and $F$ are unary relations, and $R_\oplus$, $R_+$, $R_\cdot$, $R_\star$ are the following ternary relations:
\[ R_\oplus = \{(x, y, z) \in V^3 \mid x \oplus y = z\} \qquad R_\cdot = \{(x, y, z) \in F^3 \mid x \cdot y = z\} \]
\[ R_+ = \{(x, y, z) \in F^3 \mid x + y = z\} \qquad R_\star = \{(x, y, z) \in F \times V \times V \mid xy = z\} \]
Notice that we had to use ternary relations instead of binary functions for the four operations
because the definition of a structure demands that functions be defined on $(V \cup F)^2$. However,
none of the functions are defined on $(V \cup F)^2$. Indeed, $\oplus$ is defined only on $V^2$, $+$ and $\cdot$ are
defined only on $F^2$, and scalar multiplication is defined on $F \times V$.
We will sometimes use a fraktur letter (such as $\mathfrak{A}$, $\mathfrak{B}$, $\mathfrak{C}$) for the name of a structure if we want to be
clear that we are talking about the whole structure and not just the underlying set. For example, we
might write $\mathfrak{G} = (G, \star)$ for a group $\mathfrak{G}$ with underlying set $G$ and group operation $\star$.
Notes: (1) A finitary operation on a set ๐ is a function ๐: ๐ ๐ → ๐ for some ๐ ∈ โ. There are two
important facts implied by this definition:
1. The operation ๐ is defined for every ๐-tuple (๐1 , ๐2 , … , ๐๐ ) ∈ ๐ ๐ .
2. The set ๐ is closed under ๐.
(2) A finitary relation on a set $S$ is a subset $R$ of $S^n$ for some $n \in \mathbb{N}^*$. We have more flexibility with
relations than we do with operations. For example, an $(n + 1)$-ary relation can be used to define a
partial $n$-ary function. Suppose we want a structure that consists of the set of integers $\mathbb{Z}$ together with
the partial function defined on only the even integers that divides each even integer by 2. We can
define a relation $R = \{(2k, k) \mid k \in \mathbb{Z}\}$. The structure $(\mathbb{Z}, R)$ consists of the set of integers together
with the function $f: 2\mathbb{Z} \to \mathbb{Z}$ defined by $f(n) = \frac{n}{2}$ ($2\mathbb{Z}$ is the set of even integers). Notice that we
defined a unary partial function on $\mathbb{Z}$ by using a binary relation.
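The idea of coding a partial function by a relation can be tried out on a finite window of integers. The Python sketch below builds the pairs (2k, k) for k in a bounded range and looks up the value of the partial function at a given integer. The bounded window, the name `halve`, and the use of `None` for "undefined" are assumptions of this sketch.

```python
# The relation {(2k, k)}, restricted to a hypothetical finite window of k.
window = range(-10, 11)
R = {(2 * k, k) for k in window}


def halve(n):
    """Value of the partial function coded by R at n, or None if undefined."""
    outputs = [b for (a, b) in R if a == n]
    return outputs[0] if outputs else None
```

Within the window, even integers have exactly one partner in the relation and odd integers have none, so `halve(6)` returns 3 while `halve(7)` returns `None`.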
We say that structures $\mathfrak{A}$ and $\mathfrak{B}$ have the same type if they have the same number of $n$-ary operations
for each $n \in \mathbb{N}$, and the same number of $n$-ary relations for each $n \in \mathbb{N}^*$ (recall that $\mathbb{N}^* = \mathbb{N} \setminus \{0\}$ is
the set of nonzero natural numbers).
Example 11.4:
1. (โ, ≤), (๐ซ(โ), ⊆), and for each ๐ ∈ โ∗ , (โค, ≡๐ ) all have the same type because they each have
exactly one binary relation.
2. (โค, +) and (โค, +, 0) have different types. The first structure has one binary operation and
nothing else. The second structure has a binary operation and a constant (or a 0-ary operation).
Both of these are different ways of describing the group of integers under addition. The second
way is specifically mentioning the identity element, while the first is not. Another structure (of
yet another type) that describes the same group is (โค, +, – , 0), where – is the unary additive
inverse operator.
Note: For structures with only finitely many operations and relations, the definition we gave of being
of the same type is adequate. However, for structures with infinitely many operations and/or relations,
we should be a little more careful with what we mean by “the same number.” A better definition in
this case is that for each $n \in \mathbb{N}$, the set of $n$-ary operations in $\mathfrak{A}$ is equinumerous with the set of $n$-ary
operations in $\mathfrak{B}$, and for each $n \in \mathbb{N}^*$, the set of $n$-ary relations in $\mathfrak{A}$ is equinumerous with the set of
$n$-ary relations in $\mathfrak{B}$. See Lesson 10 for more information on equinumerosity.
$\mathfrak{A}$ is a substructure of $\mathfrak{B}$, written $\mathfrak{A} \subseteq \mathfrak{B}$, if
1. $\mathfrak{A}$ and $\mathfrak{B}$ have the same type.
2. $A \subseteq B$.
3. If $f$ is an $n$-ary operation, and $(a_1, a_2, \ldots, a_n) \in A^n$, then $f_A(a_1, a_2, \ldots, a_n) = f_B(a_1, a_2, \ldots, a_n)$.
4. If $R$ is an $n$-ary relation, and $(a_1, a_2, \ldots, a_n) \in A^n$, then $R_A(a_1, a_2, \ldots, a_n)$ if and only if
$R_B(a_1, a_2, \ldots, a_n)$.
Notes: (1) Part 1 of the definition says that in order for $\mathfrak{A}$ to be a substructure of $\mathfrak{B}$, the two structures
must have the same number of $n$-ary operations and $n$-ary relations for each $n$. For example, $(\mathbb{N}, +)$ is
a substructure of $(\mathbb{Z}, +)$, written $(\mathbb{N}, +) \subseteq (\mathbb{Z}, +)$, but $(\mathbb{N}, +)$ is not a substructure of $(\mathbb{Z}, +, 0)$.
(2) The notation in 3 and 4 might look confusing at first. Let’s clarify with an example of each. Suppose
that $f$ is addition, so that $f(a_1, a_2) = a_1 + a_2$. Then 3 says that if $\mathfrak{A} \subseteq \mathfrak{B}$ and we choose $a_1$ and $a_2$
from $A$, then we get the same result whether we add $a_1$ and $a_2$ in $A$ or $B$. We might write this as
$a_1 +_A a_2 = a_1 +_B a_2$. Now suppose that $R$ is $<$, so that $R(a_1, a_2)$ means $a_1 < a_2$. Then 4 says that if
$\mathfrak{A} \subseteq \mathfrak{B}$ and we choose $a_1$ and $a_2$ from $A$, then $a_1 <_A a_2$ if and only if $a_1 <_B a_2$.
Example 11.5:
1. Let $(S, \star)$ be a semigroup. A substructure $(T, \star)$ of $(S, \star)$ is called a subsemigroup. Notice that
$T \subseteq S$ and the operation $\star$ must be the same for both structures. Also, $\star$ is a binary operation
on $T$, which means that $T$ is closed under $\star$. Is $\star$ associative in $T$? Recall from Note 2 following
Example 3.3 in Lesson 3 that associativity is closed downwards. In other words, since $\star$ is
associative in $S$ and $T \subseteq S$, it follows that $\star$ is associative in $T$. We just showed that a
subsemigroup of a semigroup is itself a semigroup.
For example, let $\mathfrak{A} = (\mathbb{N}, +)$ and let $\mathfrak{B} = (\mathbb{E}, +)$, where $\mathbb{E} = \{2k \mid k \in \mathbb{N}\}$ is the set of even
natural numbers. Then $\mathfrak{B} \subseteq \mathfrak{A}$. That is, $\mathfrak{B}$ is a subsemigroup of $\mathfrak{A}$.
On the other hand, if we let $\mathbb{O} = \{2k + 1 \mid k \in \mathbb{N}\}$, then $(\mathbb{O}, +)$ is not even a structure because
$+$ is not a binary operation on $\mathbb{O}$. For example, $3, 5 \in \mathbb{O}$, but $3 + 5 \notin \mathbb{O}$.
2. Let $(M, \star, e)$ be a monoid, where $e$ is the identity of $M$. A substructure $(N, \star, e)$ of $(M, \star, e)$ is
called a submonoid. Notice that the operation $\star$ and the identity $e$ must be the same for both
structures. As we saw in 1 above, $N$ is closed under $\star$ and $\star$ is associative in $N$. We just showed
that a submonoid of a monoid is itself a monoid.
Note that a substructure $(N, \star)$ of a monoid $(M, \star)$ is a subsemigroup of $(M, \star)$, but may or may
not be a submonoid of $(M, \star)$. For example, let $C = \mathbb{N} \setminus \{0, 1\} = \{2, 3, 4, \ldots\}$ be the set of
natural numbers with 0 and 1 removed. Then $(C, \cdot)$ is a subsemigroup of the monoid $(\mathbb{N}, \cdot)$,
but $(C, \cdot)$ is not a submonoid of $(\mathbb{N}, \cdot)$ because $C$ is missing the multiplicative identity 1.
If $(M, \star)$ is a monoid with identity $e$, we can define a submonoid to be a substructure $(N, \star)$ of
$(M, \star)$ such that $N$ contains $e$. In other words, if we wish to leave the identity out of the
structure, we need to explicitly mention that the domain of the substructure contains the
identity in order to guarantee that we get a submonoid. For example, if we let $\mathfrak{A} = (\mathbb{N}, +)$ and
$\mathfrak{B} = (\mathbb{E}, +)$, we see that $\mathfrak{B}$ is a submonoid of $\mathfrak{A}$ because $\mathbb{E} \subseteq \mathbb{N}$ is closed under $+$ and $0 \in \mathbb{E}$.
3. Let $(G, \star, {}^{-1}, e)$ be a group, where ${}^{-1}$ is the unary inverse operator and $e$ is the identity of $G$. A
substructure $(H, \star, {}^{-1}, e)$ of $(G, \star, {}^{-1}, e)$ is called a subgroup. Notice that the operations $\star$ and
${}^{-1}$, and the identity $e$ must be the same for both structures. As we saw in 1 and 2 above, $H$ is
closed under $\star$ and $\star$ is associative in $H$. By making the unary inverse operator part of the
structure, we have guaranteed that the inverse property holds for the substructure. So, a
subgroup of a group is itself a group.
Also note that if $\star$ is commutative in $G$, then $\star$ is commutative in $H$. Commutativity is closed
downwards for the same reason that associativity is closed downwards (once again, see Note 2
following Example 3.3 in Lesson 3).
For example, let $\mathfrak{A} = (\mathbb{Z}, +, -, 0)$ and let $\mathfrak{B} = (2\mathbb{Z}, +, -, 0)$, where $2\mathbb{Z} = \{2k \mid k \in \mathbb{Z}\}$ is the set
of even integers. Then $\mathfrak{B}$ is a subgroup of $\mathfrak{A}$. More generally, for any positive integer $n$, we can
let $n\mathbb{Z} = \{nk \mid k \in \mathbb{Z}\}$. The structure $(n\mathbb{Z}, +, -, 0)$ is a subgroup of the group $(\mathbb{Z}, +, -, 0)$.
Note that a substructure $(H, \star)$ of a group $(G, \star)$ is a subsemigroup of $(G, \star)$, but may or may not
be a subgroup of $(G, \star)$, as we saw in 2 above. Furthermore, a substructure $(H, \star, e)$ of a group
$(G, \star, e)$ is a submonoid of $(G, \star, e)$ but still may not be a subgroup of $(G, \star, e)$. For example,
$(\mathbb{N}, +, 0)$ is a substructure of the group $(\mathbb{Z}, +, 0)$ that is not a subgroup of $(\mathbb{Z}, +, 0)$ (it is a
submonoid though). We need to include the unary inverse operator in the structure to
guarantee that a substructure of a group will be a subgroup.
If $(G, \star)$ is a group with identity $e$, we can define a subgroup to be a substructure $(H, \star)$ of
$(G, \star)$ such that $H$ contains $e$ and for all $x \in H$, $x^{-1} \in H$ (in other words, we need to insist that
$H$ is closed under taking inverses). These conditions can be used in place of including symbols
for inverse and identity in the structure itself. For example, if we let $\mathfrak{A} = (\mathbb{R}^*, \cdot)$ and
$\mathfrak{B} = (\mathbb{Q}^*, \cdot)$, we see that $\mathfrak{B}$ is a subgroup of $\mathfrak{A}$ because $\mathbb{Q}^* \subseteq \mathbb{R}^*$, $1 \in \mathbb{Q}^*$, and $\mathbb{Q}^*$ is closed
under taking multiplicative inverses.
If the operation is understood, we can simplify notation even further. We may write $H \le G$ and
say that $H$ is a subgroup of $G$. What we mean by this is that $(H, \star, {}^{-1}, e)$ is a substructure of
$(G, \star, {}^{-1}, e)$, or equivalently, $(H, \star)$ is a substructure of $(G, \star)$ such that the identity of $G$ is in $H$
and $H$ is closed under taking inverses.
We use the same notation for other structures as well. Just be careful about one thing. When
we write $A \le B$, we don’t just mean that the structure $\mathfrak{A}$ is a substructure of the structure $\mathfrak{B}$.
We also mean that the structure $\mathfrak{A}$ has all the properties we need for the type of structure
under discussion. For example, if we are talking about groups under addition, then we would
not write $\mathbb{N} \le \mathbb{Z}$. However, if we are talking about monoids under addition, then we could write
$\mathbb{N} \le \mathbb{Z}$.
4. Let $(R, +, \cdot, -, 1)$ be a ring, where $-$ is the unary additive inverse operator and 1 is the
multiplicative identity of $R$. A substructure $(S, +, \cdot, -, 1)$ of $(R, +, \cdot, -, 1)$ is called a subring.
Notice that the operations $+$, $\cdot$, and $-$, and the multiplicative identity 1 must be the same for
both structures. By the definition of a structure, $S$ is closed under $+$, $\cdot$, and $-$.
You may be wondering why we didn’t put a constant for 0 in the structure. The reason is
that we don’t need to. Since $1 \in S$ and $S$ is closed under addition and the additive inverse, we have
$0 = 1 + (-1) \in S$. Associativity of addition and multiplication, commutativity of addition, and
distributivity all hold in $S$ because these properties are closed downwards (see Note 2 following
Example 3.3 in Lesson 3). It follows that a subring is itself a ring.
Alternatively, we can say that $(S, +, \cdot)$ is a subring of $(R, +, \cdot)$ if $(S, +, \cdot)$ is a substructure of
$(R, +, \cdot)$ such that $S$ contains 1 and for all $x \in S$, $-x \in S$ (in other words, we need to insist that
$S$ is closed under taking additive inverses).
As we discussed above, we may write $S \le R$ for “$S$ is a subring of $R$” if it is clear that we are talking
about the ring structures of $S$ and $R$.
For example, $(\mathbb{Z}, +, \cdot)$ is a subring of the fields $(\mathbb{Q}, +, \cdot)$, $(\mathbb{R}, +, \cdot)$, and $(\mathbb{C}, +, \cdot)$.
$(\mathbb{Z}, +, \cdot)$ has no subring other than itself. To see this, let $A \le \mathbb{Z}$. First note that the multiplicative
identity $1 \in A$. Using closure of addition and the principle of mathematical induction, we can
then show that each positive integer is in $A$ (for example, $2 = 1 + 1$). Since $A$ is closed under
the additive inverse of $\mathbb{Z}$, for each positive integer $n$, $-n \in A$. It follows that $A = \mathbb{Z}$. (Note that
we know that $0 \in A$ because we have already shown that 0 is in any subring of a ring.)
5. Let $(F, +, \cdot, -, {}^{-1}, 0, 1)$ be a field, where $-$ and ${}^{-1}$ are the unary additive inverse and
multiplicative inverse operators, respectively, and 0 and 1 are the additive and multiplicative
identities of $F$, respectively. Note that technically speaking, ${}^{-1}$ must be expressed as the binary
relation ${}^{-1} = \{(x, y) \mid y = x^{-1}\}$ because ${}^{-1}$ isn’t defined for $x = 0$. A substructure
$(K, +, \cdot, -, {}^{-1}, 0, 1)$ of $(F, +, \cdot, -, {}^{-1}, 0, 1)$ is a subfield provided that the domain and range of
the multiplicative inverse relation ${}^{-1}$ are both $K^*$. Notice that the operations $+$, $\cdot$, $-$, the
relation ${}^{-1}$, and the identities 0 and 1 must be the same for both structures. By the definition
of a structure, $K$ is closed under $+$, $\cdot$, and $-$. Associativity and commutativity of addition and
multiplication, and distributivity all hold in $K$ because these properties are closed downwards
(see Note 2 following Example 3.3 in Lesson 3). It follows that a subfield is itself a field.
Alternatively, we can say that $(K, +, \cdot)$ is a subfield of $(F, +, \cdot)$ if $(K, +, \cdot)$ is a substructure of
$(F, +, \cdot)$ such that $K$ contains 0 and 1, for all $x \in K$, $-x \in K$, and for all nonzero $x \in K$,
$x^{-1} \in K$ (in other words, we need to insist that $K$ is closed under taking additive inverses and
$K^*$ is closed under taking multiplicative inverses). We will write $K \le F$ when $K$ is a subfield of
$F$ and it is clear we are talking about the field structures of $K$ and $F$.
For example, $(\mathbb{Q}, +, \cdot)$ is a subfield of both $(\mathbb{R}, +, \cdot)$ and $(\mathbb{C}, +, \cdot)$, and $(\mathbb{R}, +, \cdot)$ is a subfield
of $(\mathbb{C}, +, \cdot)$.
6. If $(P, \le)$ is a partially ordered set, then a substructure $(Q, \le)$ of $(P, \le)$ will also be a partially
ordered set. This is because reflexivity, antisymmetry, and transitivity are all closed downwards.
Once again, see Note 2 following Example 3.3 in Lesson 3 for an explanation of this. Similarly,
any substructure of a linearly ordered set is linearly ordered, and similar results hold for strict
partial and linear orders.
For example, we have $(\mathbb{N}, \le) \subseteq (\mathbb{Z}, \le) \subseteq (\mathbb{Q}, \le) \subseteq (\mathbb{R}, \le)$, and each of these structures is a
linearly ordered set. Similarly, we have $(\mathbb{N}, <) \subseteq (\mathbb{Z}, <) \subseteq (\mathbb{Q}, <) \subseteq (\mathbb{R}, <)$.
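The closure claims in parts 1 and 2 of Example 11.5 can be spot-checked numerically. The following is only a sketch on finite samples (so it can refute closure but cannot prove it); the helper name `is_closed` and the sample bound 50 are assumptions made for illustration and do not come from the text.

```python
from itertools import product

def is_closed(sample, op, member):
    """Return True if op(a, b) satisfies `member` for every pair drawn from `sample`."""
    return all(member(op(a, b)) for a, b in product(sample, repeat=2))

def add(a, b):
    return a + b

def mul(a, b):
    return a * b

is_even = lambda n: n >= 0 and n % 2 == 0   # membership test for the even naturals
is_odd = lambda n: n >= 0 and n % 2 == 1    # membership test for the odd naturals
in_C = lambda n: n >= 2                     # membership test for N with 0 and 1 removed

evens = [n for n in range(50) if is_even(n)]
odds = [n for n in range(50) if is_odd(n)]
C = list(range(2, 50))

print(is_closed(evens, add, is_even))    # no counterexample among the sampled evens
print(is_closed(odds, add, is_odd))      # fails, for instance 3 + 5 = 8
print(is_closed(C, mul, in_C), in_C(1))  # closed under multiplication, but 1 is missing
```

The last line mirrors the subsemigroup that is not a submonoid: the operation restricts properly, yet the identity 1 is not in the domain.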
Homomorphisms
A homomorphism is a function from one structure to another structure of the same type that preserves
all the relations and functions of the structure (see the Note after Example 11.6 for a more rigorous
definition).
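Example 11.6:
1. Let $(S, \star)$ and $(T, \circ)$ be semigroups. A semigroup homomorphism is a function $f: S \to T$ such
that for all $a, b \in S$, $f(a \star b) = f(a) \circ f(b)$.
For example, let $\mathfrak{A} = (\mathbb{Z}^+, +)$, $\mathfrak{B} = (\mathbb{E}, \cdot)$, and let $f: \mathbb{Z}^+ \to \mathbb{E}$ be defined by $f(n) = 2^n$. For all
$n, m \in \mathbb{Z}^+$, we have $f(n + m) = 2^{n+m} = 2^n \cdot 2^m = f(n) \cdot f(m)$. Therefore, $f$ is a semigroup
homomorphism.
As another example, let $\mathfrak{A} = (\mathbb{N}, +)$, $\mathfrak{B} = (\{\mathrm{T}, \mathrm{F}\}, \vee)$, and let $g: \mathbb{N} \to \{\mathrm{T}, \mathrm{F}\}$ be defined by
$g(n) = \mathrm{T}$. For all $n, m \in \mathbb{N}$, we have $g(n + m) = \mathrm{T} = \mathrm{T} \vee \mathrm{T} = g(n) \vee g(m)$. Therefore, $g$ is a
semigroup homomorphism.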
2. Let $(M, \star, e_M)$ and $(N, \circ, e_N)$ be monoids, where $e_M$ and $e_N$ are the identities of $M$ and $N$,
respectively. A monoid homomorphism is a function $f: M \to N$ such that for all $a, b \in M$,
$f(a \star b) = f(a) \circ f(b)$ and $f(e_M) = e_N$.
Note that we need to include the identity element of a monoid as part of the structure for a
homomorphism to be a monoid homomorphism. Otherwise we get only a semigroup
homomorphism. The second example in part 1 above is a semigroup homomorphism, but not
a monoid homomorphism. Indeed, the identity of $(\mathbb{N}, +)$ is 0 and the identity of $(\{\mathrm{T}, \mathrm{F}\}, \vee)$ is
F, but the constant map of that example sends 0 to $\mathrm{T} \ne \mathrm{F}$.
On the other hand, if we change the domains of the structures in the first example from part 1
above slightly, we do get a monoid homomorphism. Let $\mathfrak{A} = (\mathbb{N}, +, 0)$, $\mathfrak{B} = (\mathbb{N}, \cdot, 1)$, and let
$f: \mathbb{N} \to \mathbb{N}$ be defined by $f(n) = 2^n$. For all $n, m \in \mathbb{N}$, $f(n + m) = f(n) \cdot f(m)$, as we saw
above, and $f(0) = 2^0 = 1$. Therefore, $f$ is a monoid homomorphism.
3. Let $(G, \star)$ and $(H, \circ)$ be groups. A group homomorphism is a function $f: G \to H$ such that for all
$g, h \in G$, $f(g \star h) = f(g) \circ f(h)$.
You may be asking why we are not including constant symbols for the identity like we did for
monoids. After all, we certainly want $f$ to take the identity of $G$ to the identity of $H$. And you
may also be asking why we are not including a unary operator symbol for taking the inverse, as
we certainly want $f(g^{-1}) = (f(g))^{-1}$. For structures $(G, \star, {}^{-1_G}, e_G)$ and $(H, \circ, {}^{-1_H}, e_H)$, we
can define a group homomorphism to be a function $f: G \to H$ such that for all $g, h \in G$,
$f(g \star h) = f(g) \circ f(h)$, for all $g \in G$, $f(g^{-1}) = (f(g))^{-1}$, and $f(e_G) = e_H$. However, it turns
out that this more complicated definition is equivalent to our first simpler one. In other words,
if $f: G \to H$ is a group homomorphism using the simpler definition, then $f$ already maps the
identity of $G$ to the identity of $H$, and $f$ already preserves inverses. We will prove these facts in
Theorems 11.1 and 11.2 below.
As an example, let $\mathfrak{A} = (\mathbb{Z}, +)$, $\mathfrak{B} = (\{1, -1\}, \cdot)$, and let $f: \mathbb{Z} \to \{1, -1\}$ be defined by
$$f(n) = \begin{cases} 1 & \text{if } n \text{ is even,} \\ -1 & \text{if } n \text{ is odd.} \end{cases}$$
There are four cases to consider. If $n$ and $m$ are both even, then
$n + m$ is even, and so, $f(n + m) = 1$ and $f(n) \cdot f(m) = 1 \cdot 1 = 1$. If $n$ and $m$ are both odd,
then $n + m$ is even, and so, $f(n + m) = 1$ and $f(n) \cdot f(m) = (-1) \cdot (-1) = 1$. If $n$ is even and
$m$ is odd, then $n + m$ is odd, and so, $f(n + m) = -1$ and $f(n) \cdot f(m) = 1 \cdot (-1) = -1$. Finally,
if $n$ is odd and $m$ is even, then $n + m$ is odd, and so, we have $f(n + m) = -1$ and
$f(n) \cdot f(m) = (-1) \cdot 1 = -1$. Therefore, $f$ is a group homomorphism.
Let’s look at another example. Let $\mathfrak{A} = (\mathbb{R}, +)$, $\mathfrak{B} = (\mathbb{R}, +)$, and let $f: \mathbb{R} \to \mathbb{R}$ be defined by
$f(x) = x^2$. Then $f$ is not a group homomorphism. To see this, we just need a single
counterexample. We have $f(1) = 1^2 = 1$, $f(2) = 2^2 = 4$, $f(1 + 2) = f(3) = 3^2 = 9$, and
$f(1) + f(2) = 1 + 4 = 5$. Since $f(1 + 2) \ne f(1) + f(2)$, $f$ fails to be a homomorphism.
4. Let $(R, +_R, \cdot_R, 1_R)$ and $(S, +_S, \cdot_S, 1_S)$ be rings, where $1_R$ and $1_S$ are the multiplicative identities
of $R$ and $S$, respectively. A ring homomorphism is a function $f: R \to S$ such that for all $a, b \in R$,
$f(a +_R b) = f(a) +_S f(b)$, $f(a \cdot_R b) = f(a) \cdot_S f(b)$, and $f(1_R) = 1_S$.
Notice that we did not include constant symbols for the additive identities of the rings and we
did not include unary operator symbols for taking the additive inverses of elements in the rings.
We will see in Theorems 11.1 and 11.2 below that with $f$ defined as above, it follows that for
all $a \in R$, $f(-a) = -f(a)$, and $f(0_R) = 0_S$.
Let’s look at an example. First note that if $R$ is a ring, then $R \times R$ with addition and multiplication
defined componentwise is also a ring. That is, for $a, b, c, d \in R$, we define addition and
multiplication by $(a, b) + (c, d) = (a + c, b + d)$ and $(a, b)(c, d) = (ac, bd)$. The verification
that $R \times R$ is a ring with these definitions is straightforward (see Problem 5 below). Let
$\mathfrak{A} = (\mathbb{Z} \times \mathbb{Z}, +, \cdot, (1, 1))$, $\mathfrak{B} = (\mathbb{Z}, +, \cdot, 1)$, and let $f: \mathbb{Z} \times \mathbb{Z} \to \mathbb{Z}$ be defined by $f((n, m)) = n$.
Then for all $n, m, j, k \in \mathbb{Z}$, we have $f((n, m) + (j, k)) = f((n + j, m + k)) = n + j$ and
$f((n, m)) + f((j, k)) = n + j$. We also have $f((n, m) \cdot (j, k)) = f((nj, mk)) = nj$ and
$f((n, m)) \cdot f((j, k)) = nj$. Finally, $f((1, 1)) = 1$. Therefore, $f$ is a ring homomorphism.
Let’s look at another example. Let $\mathfrak{A} = \mathfrak{B} = (\mathbb{Z}, +, \cdot, 1)$, and let $f: \mathbb{Z} \to \mathbb{Z}$ be defined by
$f(n) = 2n$. Then $f$ is not a ring homomorphism. To see this, we just need a single
counterexample. $f(3) = 2 \cdot 3 = 6$, $f(5) = 2 \cdot 5 = 10$, $f(3 \cdot 5) = f(15) = 2 \cdot 15 = 30$, and
$f(3) \cdot f(5) = 6 \cdot 10 = 60$. Since $f(3 \cdot 5) \ne f(3) \cdot f(5)$, $f$ fails to be a ring homomorphism.
Note, however, that $f$ is a group homomorphism from $(\mathbb{Z}, +)$ to itself. Indeed, if $n, m \in \mathbb{Z}$, then
$f(n + m) = 2(n + m) = 2n + 2m = f(n) + f(m)$.
5. A field homomorphism is the same as a ring homomorphism. The multiplicative inverse is
automatically preserved (see Theorem 11.2 below), and so, nothing additional needs to be
added to the definition.
6. Let $(A, \le_A)$ and $(B, \le_B)$ be partially ordered sets. An order homomorphism (also known as a
monotonic function) is a function $f: A \to B$ such that for all $x, y \in A$, $x \le_A y$ if and only if
$f(x) \le_B f(y)$.
For example, let $\mathfrak{A} = \mathfrak{B} = (\mathbb{N}, \le)$ and let $f: \mathbb{N} \to \mathbb{N}$ be defined by $f(n) = n + 3$. For all
$n, m \in \mathbb{N}$, we have $n \le m$ if and only if $n + 3 \le m + 3$ if and only if $f(n) \le f(m)$. Therefore,
$f$ is an order homomorphism.
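As another example, let $\mathfrak{A} = (\mathbb{Z}, \ge)$, $\mathfrak{B} = (\mathcal{P}(\mathbb{Z}), \subseteq)$, and let $f: \mathbb{Z} \to \mathcal{P}(\mathbb{Z})$ be defined by
$f(n) = \{k \in \mathbb{Z} \mid n \le k\}$. Let $m, n \in \mathbb{Z}$. We will show that $m \ge n$ if and only if the relationship
$\{k \in \mathbb{Z} \mid m \le k\} \subseteq \{k \in \mathbb{Z} \mid n \le k\}$ holds. Suppose that $m \ge n$ and let $j \in \{k \in \mathbb{Z} \mid m \le k\}$.
Then $j \ge m$. Since $m \ge n$, $j \ge n$, and so, $j \in \{k \in \mathbb{Z} \mid n \le k\}$. Now, let
$\{k \in \mathbb{Z} \mid m \le k\} \subseteq \{k \in \mathbb{Z} \mid n \le k\}$. Since $m \le m$, we have $m \in \{k \in \mathbb{Z} \mid m \le k\}$. So,
$m \in \{k \in \mathbb{Z} \mid n \le k\}$. Thus, $n \le m$, or equivalently, $m \ge n$. Therefore, $f$ is an order
homomorphism.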
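Several of the maps in Example 11.6 can be tested mechanically. The sketch below checks the defining equation on finite samples only, so a result of True is evidence rather than proof, while False exhibits a genuine counterexample. The helper `preserves` and the sample ranges are assumptions introduced here for illustration.

```python
from itertools import product

def preserves(f, op_dom, op_cod, sample):
    """Check f(a op_dom b) == f(a) op_cod f(b) for all pairs in the sample."""
    return all(f(op_dom(a, b)) == op_cod(f(a), f(b))
               for a, b in product(sample, repeat=2))

add = lambda a, b: a + b
mul = lambda a, b: a * b

power_of_two = lambda n: 2 ** n                   # (N, +, 0) to (N, *, 1)
parity_sign = lambda n: 1 if n % 2 == 0 else -1   # (Z, +) to ({1, -1}, *)
square = lambda x: x * x                          # (R, +) to (R, +), sampled on integers
double = lambda n: 2 * n                          # Z to Z

naturals = range(0, 12)
integers = range(-10, 11)

print(preserves(power_of_two, add, mul, naturals), power_of_two(0) == 1)
print(preserves(parity_sign, add, mul, integers))
print(preserves(square, add, add, integers))   # fails: f(1 + 2) = 9 but f(1) + f(2) = 5
print(preserves(double, add, add, integers), preserves(double, mul, mul, integers))
```

The last line shows the same function behaving as a group homomorphism for addition while failing the multiplicative condition required of a ring homomorphism.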
Note: Here is a more rigorous definition of a homomorphism.
If $\mathfrak{A}$ and $\mathfrak{B}$ are structures of the same type with underlying domains $A$ and $B$, then a homomorphism
is a function $f: A \to B$ such that for each $n \in \mathbb{N}$,
1. If $R$ is an $n$-ary relation, then $R_A(a_1, a_2, \ldots, a_n)$ if and only if $R_B(f(a_1), f(a_2), \ldots, f(a_n))$.
2. If $F$ is an $n$-ary function, then $f(F_A(a_1, a_2, \ldots, a_n)) = F_B(f(a_1), f(a_2), \ldots, f(a_n))$.
In particular, 2 implies that if $c$ is a constant, then $f(c_A) = c_B$.
Theorem 11.1: Let $(G, \star)$ and $(H, \circ)$ be groups with identities $e_G$ and $e_H$, respectively, and let
$f: G \to H$ be a group homomorphism. Then $f(e_G) = e_H$.
Proof: Since $e_G = e_G \star e_G$, we have $f(e_G) = f(e_G \star e_G) = f(e_G) \circ f(e_G)$. So,
$$\begin{aligned}
f(e_G) = f(e_G) \circ e_H &= f(e_G) \circ \left(f(e_G) \circ (f(e_G))^{-1}\right) \\
&= \left(f(e_G) \circ f(e_G)\right) \circ (f(e_G))^{-1} \\
&= f(e_G) \circ (f(e_G))^{-1} \\
&= e_H.
\end{aligned}$$
□
Notes: (1) The computations in the proof take place in the group $(H, \circ)$. In particular, $f(e_G) \in H$ and
$e_H \in H$. If the proof seems confusing because $f(e_G)$ appears so often, try making the substitutions
$h = f(e_G)$ and $e = e_H$. Notice that $h, e \in H$ and by the first line of the proof, $h = h \circ h$. The rest of the
proof then looks like this:
$$h = h \circ e = h \circ (h \circ h^{-1}) = (h \circ h) \circ h^{-1} = h \circ h^{-1} = e.$$
Remember that $h = f(e_G)$ and $e = e_H$. So, we have $f(e_G) = e_H$, as desired.
(2) $h = h \circ e$ because $e$ is the identity for $H$.
(3) $e = h \circ h^{-1}$ by the definition of inverse and because $e$ is the identity for $H$. From this equation, it
follows that $h \circ e = h \circ (h \circ h^{-1})$.
(4) $h \circ (h \circ h^{-1}) = (h \circ h) \circ h^{-1}$ because $\circ$ is associative in $H$.
(5) $h \circ h = h$ from the first line of the proof (this is equivalent to $f(e_G) \circ f(e_G) = f(e_G)$). It follows
that $(h \circ h) \circ h^{-1} = h \circ h^{-1}$.
(6) Finally, $h \circ h^{-1} = e$, again by the definition of inverse and because $e$ is the identity for $H$.
(7) If the group operation is addition, then we usually use the symbols $0_G$ and $0_H$ for the identities.
Theorem 11.2: Let $(G, \star)$ and $(H, \circ)$ be groups and let $f: G \to H$ be a group homomorphism. Then for
all $g \in G$, $f(g^{-1}) = (f(g))^{-1}$.
Proof: By Theorem 11.1, we have $f(e_G) = e_H$. So, for $g \in G$, we have
$$e_H = f(e_G) = f(g \star g^{-1}) = f(g) \circ f(g^{-1}).$$
Since $f(g) \circ f(g^{-1}) = e_H$, $f(g^{-1}) = (f(g))^{-1}$.
□
Notes: (1) $e_G = g \star g^{-1}$ by the definition of inverse and because $e_G$ is the identity for $G$. From this
equation, it follows that $f(e_G) = f(g \star g^{-1})$.
(2) $f(g \star g^{-1}) = f(g) \circ f(g^{-1})$ because $f$ is a homomorphism.
(3) In a group with identity $e$, if $xy = e$ and $yx = e$, then $y = x^{-1}$. We actually need to verify only one
of the equations $xy = e$ or $yx = e$ to determine that $y = x^{-1}$ (see Note 6 after the solution to Problem
7 in Problem Set 3 from Lesson 3). Letting $x = f(g)$, $y = f(g^{-1})$, and $e = e_H$, we showed in the proof
that $xy = e$. It follows that $y = x^{-1}$. That is, $f(g^{-1}) = (f(g))^{-1}$.
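Theorems 11.1 and 11.2 can be watched in action on a small finite case. The sketch below assumes the additive groups of integers mod 4 and mod 6 (these groups are chosen here for illustration and are not taken from the text), enumerates every function between them, keeps those that satisfy only the simple definition of a group homomorphism, and then confirms that each one sends 0 to 0 and preserves additive inverses.

```python
from itertools import product

n, m = 4, 6  # domain: integers mod 4; codomain: integers mod 6; both under addition

def is_hom(f):
    """f is a tuple with f[x] the image of x; test f(x + y) = f(x) + f(y)."""
    return all(f[(x + y) % n] == (f[x] + f[y]) % m
               for x in range(n) for y in range(n))

# Brute force over all 6**4 functions, keeping those that preserve addition.
homs = [f for f in product(range(m), repeat=n) if is_hom(f)]

for f in homs:
    assert f[0] == 0                                          # Theorem 11.1
    assert all(f[(-x) % n] == (-f[x]) % m for x in range(n))  # Theorem 11.2

print(homs)
```

Neither condition was imposed while filtering, yet both hold for every survivor, which is exactly what the two theorems predict.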
An isomorphism is a bijective homomorphism. If there is an isomorphism from a structure $\mathfrak{A}$ to a
structure $\mathfrak{B}$, then we say that $\mathfrak{A}$ and $\mathfrak{B}$ are isomorphic, and we write $\mathfrak{A} \cong \mathfrak{B}$. Mathematicians generally
consider isomorphic structures to be the same. Indeed, they behave identically. The only difference
between them is the “names” of the elements.
Example 11.7:
1. For $n \in \mathbb{Z}^+$, the function $f: \mathbb{Z} \to n\mathbb{Z}$ defined by $f(k) = nk$ is an isomorphism between the
groups $(\mathbb{Z}, +)$ and $(n\mathbb{Z}, +)$. It’s easy to see that $f$ is injective ($j \ne k \to nj \ne nk$) and surjective
(if $nk \in n\mathbb{Z}$, then $f(k) = nk$). If $j, k \in \mathbb{Z}$, then $f(j + k) = n(j + k) = nj + nk = f(j) + f(k)$.
It follows that $(\mathbb{Z}, +) \cong (n\mathbb{Z}, +)$.
Note that this map is not a ring isomorphism for $n > 1$. First, $(n\mathbb{Z}, +, \cdot)$ is technically not even
a ring for $n > 1$ because $1 \notin n\mathbb{Z}$. But it is “almost a ring.” In fact, the multiplicative identity
property is the only property that fails. See the notes following Theorem 11.4 for more details.
Let’s show that for $n > 1$, $f$ is not an isomorphism between the “almost rings” $(\mathbb{Z}, +, \cdot)$ and
$(n\mathbb{Z}, +, \cdot)$. Let’s use $2, 3 \in \mathbb{Z}$ to provide a counterexample: $f(2 \cdot 3) = f(6) = n \cdot 6 = 6n$ and
$f(2) \cdot f(3) = (n \cdot 2)(n \cdot 3) = 6n^2$. If $f(2 \cdot 3) = f(2) \cdot f(3)$, then $6n = 6n^2$, so that $n = n^2$.
This equation is equivalent to $n^2 - n = 0$, or $n(n - 1) = 0$. So, $n = 0$ or $n = 1$.
In fact, as “almost rings,” $(\mathbb{Z}, +, \cdot)$ is not isomorphic to $(n\mathbb{Z}, +, \cdot)$ at all for $n > 1$. If $g: \mathbb{Z} \to n\mathbb{Z}$
were an isomorphism, then $g(1) = nk$ for some $k \in \mathbb{Z}$. But also, since $g$ is a homomorphism,
$g(1) = g(1 \cdot 1) = g(1)g(1) = (nk)(nk) = n^2k^2$. So, $nk = n^2k^2$, and thus, $n = 0$, $k = 0$,
or $1 = nk$. If $k = 0$, then $g(1) = 0$, and so, $g(2) = g(1 + 1) = g(1) + g(1) = 0 + 0 = 0$.
So, $g$ is not injective. Since $n > 1$, $n \ne 0$ and $1 \ne nk$.
2. Recall that if $z = a + bi$ is a complex number, then the conjugate of $z$ is the complex number
$\overline{z} = a - bi$. The function $f: \mathbb{C} \to \mathbb{C}$ defined by $f(z) = \overline{z}$ is an isomorphism between the field
$(\mathbb{C}, +, \cdot)$ and itself. By Problem 3 (parts (iii) and (iv)) from Problem Set 7 in Lesson 7, we have
$$f(z + w) = \overline{z + w} = \overline{z} + \overline{w} = f(z) + f(w)$$
$$f(zw) = \overline{zw} = \overline{z} \cdot \overline{w} = f(z)f(w)$$
$$f(1) = \overline{1} = 1$$
Thus, $f$ is a homomorphism. Since for all $z \in \mathbb{C}$, $f(\overline{z}) = z$, $f$ is surjective. Since $z \ne w$ implies
that $\overline{z} \ne \overline{w}$, $f$ is injective. Therefore, $f$ is a bijective homomorphism, and so, $f$ is an
isomorphism.
An isomorphism from a structure to itself is called an automorphism. The identity function is
always an automorphism from any structure to itself. In the previous example, we described a
nontrivial automorphism from $\mathbb{C}$ to $\mathbb{C}$.
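Both parts of Example 11.7 lend themselves to a quick numerical check. The sketch below tests conjugation on a finite grid of complex numbers with small integer parts (chosen so that floating-point arithmetic is exact) and then reproduces the counterexample showing that multiplication by $n$ does not respect products. The grid size and the choice $n = 2$ are assumptions made only for illustration.

```python
from itertools import product

# Sample of complex numbers with small integer parts, so float arithmetic is exact.
sample = [complex(a, b) for a, b in product(range(-3, 4), repeat=2)]

conj = lambda z: z.conjugate()

additive = all(conj(z + w) == conj(z) + conj(w) for z, w in product(sample, repeat=2))
multiplicative = all(conj(z * w) == conj(z) * conj(w) for z, w in product(sample, repeat=2))
involution = all(conj(conj(z)) == z for z in sample)  # conj undoes itself, so it is a bijection

print(additive, multiplicative, conj(1 + 0j) == 1, involution)

# The group isomorphism k -> n*k from Z to nZ does not respect multiplication when n > 1.
n = 2
scale = lambda k: n * k
print(scale(2 * 3), scale(2) * scale(3))  # 6n versus 6n^2
```

Because conjugation is its own inverse, injectivity and surjectivity come for free, which matches the argument given in part 2.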
Images and Kernels
Let $f: A \to B$ be a homomorphism. The image of $f$ is the set $f[A] = \{f(x) \mid x \in A\}$ and the kernel of
$f$ is the set $\ker(f) = \{x \in A \mid f(x) = e_B\}$. In the case where $B$ has both an additive and a multiplicative
identity, $e_B$ will always be the additive identity (in other words, if $0, 1 \in B$, then the kernel of $f$ is
the set of all elements of $A$ that map to 0).
Theorem 11.3: Let $f: R \to S$ be a ring homomorphism. Then $f[R]$ is a subring of $S$.
Proof: Since $f(x) + f(y) = f(x + y)$ and $f(x)f(y) = f(xy)$, we see that $f[R]$ is closed under
addition and multiplication. Since $1_S = f(1_R)$, $1_S \in f[R]$. By Theorem 11.2, $-f(x) = f(-x)$ (this is
the conclusion of Theorem 11.2 when additive notation is used). So, for each element $f(x) \in f[R]$,
$-f(x) \in f[R]$. It follows that $f[R]$ is a subring of $S$.
□
Note: The same result holds if we replace “ring” by semigroup, monoid, group, or field. If $(S, \star)$ and
$(T, \circ)$ are semigroups, and $f: S \to T$ is a semigroup homomorphism, then $f(x) \circ f(y) = f(x \star y)$
shows that $f[S]$ is closed under $\circ$, and therefore, $f[S]$ is a subsemigroup of $T$.
Furthermore, if $(M, \star)$ and $(N, \circ)$ are monoids, and $f: M \to N$ is a monoid homomorphism, then by
definition, $f(e_M) = e_N$, and therefore, $f[M]$ is a submonoid of $N$.
If $(G, \star)$ and $(H, \circ)$ are groups, and $f: G \to H$ is a group homomorphism, then $f(e_G) = e_H$ by Theorem
11.1, and for all $g \in G$, $(f(g))^{-1} = f(g^{-1})$ by Theorem 11.2. Therefore, $f[G]$ is a subgroup of $H$.
If $(F, +, \cdot)$ and $(K, +, \cdot)$ are fields, and $f: F \to K$ is a field homomorphism, then for all $x \in F^*$,
$(f(x))^{-1} = f(x^{-1})$ by Theorem 11.2 again. Therefore, $f[F]$ is a subfield of $K$.
Theorem 11.4: Let $f: G \to H$ be a group homomorphism. Then $\ker(f)$ is a subgroup of $G$.
Proof: Let $x, y \in \ker(f)$. Then $f(x) = e_H$ and $f(y) = e_H$. So $f(x \star y) = f(x) \circ f(y) = e_H \circ e_H = e_H$.
Thus, $x \star y \in \ker(f)$. Since $f(e_G) = e_H$ (by Theorem 11.1), $e_G \in \ker(f)$. Suppose $x \in \ker(f)$. By
Theorem 11.2, we have $f(x^{-1}) = (f(x))^{-1} = e_H^{-1} = e_H$. So $x^{-1} \in \ker(f)$. Therefore, $\ker(f)$ is a
subgroup of $G$.
□
Notes: (1) The same result holds for semigroups and monoids. This should be clear from the proof.
(2) Let’s say that $(R, +, \cdot)$ is almost a ring if all the ring properties hold except the existence of a
multiplicative identity. Similarly, we will say that $(S, +, \cdot)$ is almost a subring of the ring $(R, +, \cdot)$ if all
the properties of being a subring hold except $S$ does not contain the multiplicative identity.
In this case, some authors use the word “rng.” They intentionally leave out the “i” in ring to help
remember that this structure has no multiplicative identity. In other words, 1 is missing from a rng.
(3) If $f: R \to S$ is a ring homomorphism, then unless $S$ is the trivial ring $\{0\}$, $\ker(f)$ is not a ring because
$f(1_R) = 1_S \ne 0_S$. So, $1_R \notin \ker(f)$. However, every other property holds and so $\ker(f)$ is almost a
subring of $R$. Indeed, if $x, y \in \ker(f)$, then
$$f(x + y) = f(x) + f(y) = 0_S + 0_S = 0_S \quad \text{and} \quad f(xy) = f(x)f(y) = 0_S \cdot 0_S = 0_S.$$
Also, $f(0_R) = 0_S$ by Theorem 11.1, and if $x \in \ker(f)$, then $f(-x) = -f(x) = -0_S = 0_S$ by Theorem
11.2 (this is the conclusion of Theorem 11.2 when additive notation is used).
(4) Some authors exclude the existence of a multiplicative identity from the definition of a ring. Note 3
gives a good justification for doing so. However, removing a property creates other complexities. So,
there is no right or wrong answer here. For us, rings will always include a multiplicative identity. If we
wish to exclude the multiplicative identity, we will call the structure “almost a ring.”
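To make the image and kernel concrete, we can revisit the projection $f((n, m)) = n$ from part 4 of Example 11.6. The sketch below works on a finite window of $\mathbb{Z} \times \mathbb{Z}$, so it only samples the kernel and image rather than computing them in full; the window bounds and helper names are assumptions made for illustration.

```python
from itertools import product

# The projection f((n, m)) = n from Z x Z to Z, with componentwise operations.
f = lambda p: p[0]
add = lambda p, q: (p[0] + q[0], p[1] + q[1])
mul = lambda p, q: (p[0] * q[0], p[1] * q[1])
neg = lambda p: (-p[0], -p[1])

window = list(product(range(-4, 5), repeat=2))  # finite sample of Z x Z
kernel = [p for p in window if f(p) == 0]       # sampled elements of ker(f)

in_kernel = lambda p: f(p) == 0
closed_add = all(in_kernel(add(p, q)) for p, q in product(kernel, repeat=2))
closed_mul = all(in_kernel(mul(p, q)) for p, q in product(kernel, repeat=2))
closed_neg = all(in_kernel(neg(p)) for p in kernel)

print(closed_add, closed_mul, closed_neg)    # the "almost a subring" properties
print(in_kernel((0, 0)), in_kernel((1, 1)))  # the additive identity is in, (1, 1) is not
print(sorted({f(p) for p in window}))        # image of the window
```

The kernel here consists of the pairs $(0, m)$. It passes every closure test but omits the multiplicative identity $(1, 1)$, in line with Note 3.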
Theorem 11.5: Let $f: G \to H$ be a group homomorphism. Then $\ker(f) = \{e_G\}$ if and only if $f$ is injective.
Proof: Suppose that $\ker(f) = \{e_G\}$, let $x, y \in G$, and let $f(x) = f(y)$. Then $f(x)(f(y))^{-1} = e_H$. It
follows from Theorem 11.2 that $f(xy^{-1}) = f(x)f(y^{-1}) = f(x)(f(y))^{-1} = e_H$. So, $xy^{-1} \in \ker(f)$.
Since $\ker(f) = \{e_G\}$, $xy^{-1} = e_G$. Therefore, $x = y$. Since $x, y \in G$ were arbitrary, $f$ is injective.
Conversely, suppose that $f$ is injective, and let $x \in \ker(f)$. Then $f(x) = e_H$. But also, by Theorem 11.1,
$f(e_G) = e_H$. So, $f(x) = f(e_G)$. Since $f$ is injective, $x = e_G$. Since $x \in \ker(f)$ was arbitrary, $\ker(f) \subseteq \{e_G\}$.
By Theorem 11.1, $f(e_G) = e_H$, so that $e_G \in \ker(f)$, and therefore, $\{e_G\} \subseteq \ker(f)$. It follows that
$\ker(f) = \{e_G\}$.
□
Note: The theorem also holds for ring homomorphisms. Specifically, if $f: R \to S$ is a ring
homomorphism, then $\ker(f) = \{0_R\}$ if and only if $f$ is injective. The proof is the same, except additive
notation should be used. Here is a sketch of the proof using additive notation:
If $\ker(f) = \{0_R\}$ and $f(x) = f(y)$, then $f(x + (-y)) = f(x) + f(-y) = f(x) - f(y) = 0_S$, so that
$x + (-y) \in \ker(f)$, and thus, $x + (-y) = 0_R$, and so, $x = y$.
Conversely, if $f$ is injective and $x \in \ker(f)$, then $f(x) = 0_S$. Since $f(0_R) = 0_S$ and $f$ is injective, we
have $x = 0_R$. So, $\ker(f) \subseteq \{0_R\}$. Also, $f(0_R) = 0_S$. So, $0_R \in \ker(f)$, and therefore, $\{0_R\} \subseteq \ker(f)$.
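The equivalence in Theorem 11.5 can be observed on a family of small examples. The sketch below assumes the additive group of integers mod 6 (a group chosen here for illustration, not one used in the text) and the maps $x \mapsto cx$ on it, each of which preserves addition mod 6. For every such map it compares "the kernel is trivial" with "the map is injective."

```python
n = 6  # integers mod 6 under addition

def hom(c):
    """The map x -> c*x mod n, stored as a list; each such map preserves addition mod n."""
    return [(c * x) % n for x in range(n)]

for c in range(n):
    f = hom(c)
    kernel = [x for x in range(n) if f[x] == 0]
    injective = len(set(f)) == n
    assert (kernel == [0]) == injective  # Theorem 11.5 on this example
    print(c, kernel, injective)
```

Only the multipliers 1 and 5 give a trivial kernel, and these are precisely the injective maps in the family.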
Normal Subgroups and Ring Ideals
Let $(G, \star)$ be a group and $h, k \in G$. We say that $k$ is a conjugate of $h$ if there is a $g \in G$ such that
$k = ghg^{-1}$ (as usual, we abbreviate $g \star h \star g^{-1}$ as $ghg^{-1}$).
If $(G, \star)$ is a group, we say that a subgroup $N$ of $G$ is normal, and write $N \triangleleft G$, if whenever $h \in N$ and
$k \in G$ is a conjugate of $h$, then $k \in N$. (In this case, we may say that $N$ is closed under conjugation.)
Example 11.8:
1. If $G$ is a commutative group, then every subgroup $H$ of $G$ is normal. Indeed, if $h \in H$ and $g \in G$,
then $ghg^{-1} = hgg^{-1} = he = h \in H$.
2. If $f: G \to H$ is a group homomorphism, then $\ker(f)$ is a normal subgroup of $G$. We already
showed in Theorem 11.4 that $\ker(f)$ is a subgroup of $G$. To see that $\ker(f) \triangleleft G$, let $h \in \ker(f)$
and let $g \in G$. Then $f(ghg^{-1}) = f(g)f(h)f(g^{-1}) = f(g)\,e\,(f(g))^{-1} = f(g)(f(g))^{-1} = e$,
where $e$ is the identity of $H$. So, $ghg^{-1} \in \ker(f)$.
3. Any group is a normal subgroup of itself. Indeed, if $h \in G$ and $g \in G$, then clearly $ghg^{-1} \in G$.
4. The trivial subgroup of a group $G$ consisting of just the identity $e$ is a normal subgroup of $G$.
Indeed, if $h \in \{e\}$ and $g \in G$, then $ghg^{-1} = geg^{-1} = gg^{-1} = e \in \{e\}$.
5. Let $A$ be a nonempty set. A bijection from $A$ to itself is called a permutation of $A$. Let $S(A)$ be
the set of permutations of $A$. Let’s check that $(S(A), \circ)$ is a group, where $\circ$ is the operation of
composition.
By Corollary 10.5 from Lesson 10, $S(A)$ is closed under $\circ$.
To see that $\circ$ is associative, let $f, g, h \in S(A)$ and let $a \in A$. Then
$$((f \circ g) \circ h)(a) = (f \circ g)(h(a)) = f(g(h(a))) = f((g \circ h)(a)) = (f \circ (g \circ h))(a).$$
Since $a \in A$ was arbitrary, $(f \circ g) \circ h = f \circ (g \circ h)$. So, $\circ$ is associative in $S(A)$.
Recall that the identity permutation $i_A$ is defined by $i_A(a) = a$ for all $a \in A$. If $a \in A$, then
$(i_A \circ f)(a) = i_A(f(a)) = f(a) = f(i_A(a)) = (f \circ i_A)(a)$. Since $a \in A$ was arbitrary, we have
$i_A \circ f = f$ and $f \circ i_A = f$.
Recall that for any permutation $f$ on $A$, there is an inverse permutation $f^{-1}$ satisfying
$f^{-1} \circ f = f \circ f^{-1} = i_A$ (by Theorem 10.6).
So, we have verified that $(S(A), \circ)$ is a group.
If $A = \{1, 2, \ldots, n\}$, then we define $S_n$ to be $S(A)$. For example, $S_3 = S(\{1, 2, 3\})$. We can
visualize each element of $S_3$ with a cycle diagram. Here are the six elements of $S_3$ visualized
this way.
[Figure: cycle diagrams on the points 1, 2, 3 for the six elements of $S_3$, labeled (1), (12), (13), (23), (123), and (132).]
The first diagram represents the identity permutation {(1, 1), (2, 2), (3, 3)}, where each
element is being mapped to itself. Technically, we should have an arrow from each point looping
back to itself. However, to avoid unnecessary clutter, we leave out arrows for elements that are
mapping to themselves. In cycle notation, we have (1)(2)(3), which we abbreviate as (1).
The second diagram represents the permutation {(1, 2), (2, 1), (3, 3)}, where 1 is being
mapped to 2, 2 is being mapped to 1, and 3 is being mapped to itself. Again, we leave out the
arrow from 3 to itself to avoid clutter, and we just put in the arrows from 1 to 2 and from 2 to
1. In cycle notation, we have (12)(3), which we abbreviate as (12). In this notation, (12)
represents a cycle. The cycle moves from left to right and the last element in the cycle connects
to the first. So, 1 maps to 2 and 2 maps to 1. Any element that does not appear in the cycle
notation maps to itself.
As one more example, in the cycle (123), 1 maps to 2, 2 maps to 3, and 3 maps to 1.
To compose two permutations in cycle notation, we write the one we want to apply first on the
right (just as we do in function notation). For example, let’s simplify (12)(13). Starting with 1,
we see that the rightmost cycle sends 1 to 3. The leftmost cycle sends 3 to itself, and so the
composition sends 1 to 3. Let’s do 2 next. The rightmost cycle sends 2 to itself, and then the
leftmost cycle sends 2 to 1. So, the composition sends 2 to 1. And finally, let’s look at 3. The
rightmost cycle sends 3 to 1, and then the leftmost cycle sends 1 to 2. So, the composition
sends 3 to 2. It follows that (12)(13) = (132).
Observe that the group $(S_3, \circ)$ is not commutative. For example, $(12)(13) = (132)$, whereas
$(13)(12) = (123)$.
Let’s consider the subgroups $H = \{(1), (123), (132)\}$ and $K = \{(1), (12)\}$. One of these is a
normal subgroup of $S_3$ and the other is not. You will be asked to verify that $H$ and $K$ are
subgroups of $S_3$ and to determine which one is normal and which one is not in Problem 2 below.
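The rule for composing permutations in cycle notation (apply the rightmost cycle first) is easy to get backwards, so here is a small sketch that carries it out mechanically. Permutations are stored as dictionaries on $\{1, 2, 3\}$; the helper names `from_cycle` and `compose` are assumptions introduced for this illustration.

```python
def from_cycle(cycle, n=3):
    """Turn a cycle such as (1, 2, 3) into a dict on {1, ..., n}; unlisted points are fixed."""
    perm = {k: k for k in range(1, n + 1)}
    for i, k in enumerate(cycle):
        perm[k] = cycle[(i + 1) % len(cycle)]  # the last entry wraps around to the first
    return perm

def compose(f, g):
    """Return the composition that applies g (the right factor) first, then f."""
    return {k: f[g[k]] for k in g}

c12, c13 = from_cycle((1, 2)), from_cycle((1, 3))
c123, c132 = from_cycle((1, 2, 3)), from_cycle((1, 3, 2))

print(compose(c12, c13) == c132)                # (12)(13) = (132)
print(compose(c13, c12) == c123)                # (13)(12) = (123)
print(compose(c12, c13) == compose(c13, c12))   # the two products differ
```

The same two helpers are enough to build the full multiplication table requested in Problem 2.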
Let $(R, +, \cdot)$ be a ring and let $A \subseteq R$. We say that $A$ absorbs $R$ if for every $a \in A$ and $x \in R$, $ax \in A$
and $xa \in A$.
Note: Since in a ring, multiplication is not necessarily commutative, both conditions $ax \in A$ and
$xa \in A$ may be necessary. In a commutative ring, either condition follows from the other.
If $(R, +, \cdot)$ is a ring, we say that a subset $I$ of $R$ is an ideal of $R$, and write $I \triangleleft R$, if $(I, +)$ is a subgroup
of $(R, +)$ and $I$ absorbs $R$.
Example 11.9:
1. Consider the ring $(\mathbb{Z}, +, \cdot)$. Then $(2\mathbb{Z}, +, \cdot)$ is an ideal of $\mathbb{Z}$ because $(2\mathbb{Z}, +)$ is a subgroup of
$(\mathbb{Z}, +)$ (see part 3 of Example 11.5) and when we multiply an even integer by any other integer,
we get an even integer (so, $2\mathbb{Z}$ absorbs $\mathbb{Z}$).
More generally, for each $n \in \mathbb{Z}^+$, $(n\mathbb{Z}, +, \cdot)$ is an ideal of $(\mathbb{Z}, +, \cdot)$.
2. If $f: R \to S$ is a ring homomorphism, then $\ker(f)$ is an ideal of $R$. We already showed in Note 3
following Theorem 11.4 that $(\ker(f), +)$ is a subgroup of $(R, +)$. To see that $\ker(f)$ absorbs
$R$, let $a \in \ker(f)$ and let $x \in R$. Then $f(ax) = f(a)f(x) = 0_S \cdot f(x) = 0_S$, so that
$ax \in \ker(f)$. Also, $f(xa) = f(x)f(a) = f(x) \cdot 0_S = 0_S$, so that $xa \in \ker(f)$.
3. Any ring is an ideal of itself. Indeed, if $a \in R$ and $x \in R$, then clearly $ax \in R$ and $xa \in R$.
4. $\{0_R\}$ is an ideal of $R$ because for all $x \in R$, $0_R \cdot x = 0_R$ and $x \cdot 0_R = 0_R$.
Problem Set 11
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Write the elements of $S_4$ in cycle notation.
2. Draw a group multiplication table for $S_3$. Let $H = \{(1), (123), (132)\}$ and $K = \{(1), (12)\}$.
Show that $H$ and $K$ are subgroups of $S_3$ and determine which of these is a normal subgroup of
$S_3$.
LEVEL 2
3. A Gaussian integer is a complex number of the form $a + bi$, where $a, b \in \mathbb{Z}$. Let $\mathbb{Z}[i]$ be the set
of Gaussian integers. Prove that $(\mathbb{Z}[i], +, \cdot)$ is a subring of $(\mathbb{C}, +, \cdot)$.
4. Let $(G, \star)$ be a group with $H$ a nonempty subset of $G$. Prove that $(H, \star)$ is a subgroup of $(G, \star)$ if
and only if for all $g, h \in H$, $g \star h^{-1} \in H$.
5. Let $(R, +, \cdot)$ be a ring and define addition and multiplication on $R \times R$ componentwise, as was
done in part 4 of Example 11.6. Prove that $(R \times R, +, \cdot)$ is a ring and that $(R, +, \cdot)$ is isomorphic
to a subring of $(R \times R, +, \cdot)$.
LEVEL 3
6. Prove that there are exactly two ring homomorphisms from $\mathbb{Z}$ to itself.
7. Prove the following:
(i) Ring isomorphism is an equivalence relation.
(ii) If we let $\mathrm{Aut}(R)$ be the set of automorphisms of a ring $R$, then $(\mathrm{Aut}(R), \circ)$ is a group, where $\circ$ is composition.
8. Let $G$ be a group with $H$ and $K$ subgroups of $G$, and let $G = H \cup K$. Prove that $H = G$ or $K = G$.
9. Prove that a commutative ring $R$ is a field if and only if the only ideals of $R$ are $\{0\}$ and $R$.
10. Prove that if $\mathbf{X}$ is a nonempty set of normal subgroups of a group $G$, then $\bigcap \mathbf{X}$ is a normal subgroup of $G$. Similarly, prove that if $\mathbf{X}$ is a nonempty set of ideals of a ring $R$, then $\bigcap \mathbf{X}$ is an ideal of $R$. Is the union of normal subgroups always a normal subgroup? Is the union of ideals always an ideal?
11. Let $\mathbb{Z}_n[x] = \{a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0 \mid a_0, a_1, \ldots, a_n \in \mathbb{Z}\}$. In other words, $\mathbb{Z}_n[x]$ consists of all polynomials of degree at most $n$. Prove that $(\mathbb{Z}_n[x], +)$ is a commutative group for $n = 0$, 1, and 2, where addition is defined in the “usual way.” Then prove that $\mathbb{Z}_0[x]$ is a subgroup of $\mathbb{Z}_1[x]$ and $\mathbb{Z}_1[x]$ is a subgroup of $\mathbb{Z}_2[x]$. What if we replace “all polynomials of degree at most $n$” with “all polynomials of degree $n$”?
LEVEL 4
12. Let $N$ be a normal subgroup of a group $G$. For each $g \in G$, let $gN = \{gx \mid x \in N\}$. Prove that $gN = hN$ if and only if $gh^{-1} \in N$. Let $G/N = \{gN \mid g \in G\}$. Prove that $(G/N, \circ)$ is a group, where $\circ$ is defined by $gN \circ hN = (gh)N$.
13. Let $I$ be an ideal of a ring $R$. For each $x \in R$, let $x + I = \{x + z \mid z \in I\}$. Prove that $x + I = y + I$ if and only if $x - y \in I$. Let $R/I = \{x + I \mid x \in R\}$. Prove that $(R/I, +, \cdot)$ is a ring, where addition and multiplication are defined by $(x + I) + (y + I) = (x + y) + I$ and $(x + I)(y + I) = xy + I$.
14. Let $\mathbb{Z}_n = \{[k] \mid k \in \mathbb{Z}\}$, where $[k]$ is the equivalence class of $k$ under the equivalence $\equiv_n$. Prove that $(\mathbb{Z}_n, +, \cdot)$ is a ring, where addition and multiplication are defined by $[x] + [y] = [x + y]$ and $[x] \cdot [y] = [xy]$. Then prove that $\mathbb{Z}/n\mathbb{Z} \cong \mathbb{Z}_n$. Find the ideals of $\mathbb{Z}/15\mathbb{Z}$ and $\mathbb{Z}_{15}$ and show that there is a natural one-to-one correspondence between them.
LEVEL 5
15. Let $\mathbb{Z}[x] = \{a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0 \mid n \in \mathbb{N} \wedge a_0, a_1, \ldots, a_n \in \mathbb{Z}\}$. $(\mathbb{Z}[x], +, \cdot)$ with addition and multiplication defined in the “usual way” is called the polynomial ring over $\mathbb{Z}$. Prove that $(\mathbb{Z}[x], +, \cdot)$ is a ring. Then prove that $(\mathbb{Z}_n[x], +, \cdot)$ is not a subring of $(\mathbb{Z}[x], +, \cdot)$ for any $n \in \mathbb{N}$. Let $R[x] = \{a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0 \mid n \in \mathbb{N} \wedge a_0, a_1, \ldots, a_n \in R\}$ for an arbitrary ring $R$. Is $(R[x], +, \cdot)$ necessarily a ring?
16. Let $N$ be a normal subgroup of the group $G$, and define $f: G \to G/N$ by $f(g) = gN$. Prove that $f$ is a surjective group homomorphism with kernel $N$. Conversely, prove that if $f: G \to H$ is a group homomorphism, then $G/\ker(f) \cong f[G]$.
17. Let $I$ be an ideal of a ring $R$, and define $f: R \to R/I$ by $f(x) = x + I$. Prove that $f$ is a surjective ring homomorphism with kernel $I$. Conversely, prove that if $f: R \to S$ is a ring homomorphism, then $R/\ker(f) \cong f[R]$.
18. Prove that ( โโ, +, ⋅) is a ring, where addition and multiplication are defined pointwise. Then
prove that for each ๐ฅ ∈ โ, ๐ผ๐ฅ = {๐ ∈ โโ | ๐(๐ฅ) = 0} is an ideal of โโ and the only ideal of โโ
containing ๐ผ๐ฅ and not equal to ๐ผ๐ฅ is โโ.
LESSON 12 – NUMBER THEORY
PRIMES, GCD, AND LCM
Prime Numbers
Recall that an integer $a$ is divisible by an integer $k$, written $k \mid a$, if there is another integer $b$ such that $a = kb$. We also say that $k$ is a factor of $a$, $k$ is a divisor of $a$, $k$ divides $a$, or $a$ is a multiple of $k$. For example, $7 \mid 21$ because $21 = 7 \cdot 3$. Also, see Examples 4.3 and 4.4 from Lesson 4.
Notes: (1) Every integer is divisible by 1. Indeed, if $n \in \mathbb{Z}$, then $n = 1 \cdot n$.
(2) Every integer is divisible by itself. Indeed, if $n \in \mathbb{Z}$, then $n = n \cdot 1$.
(3) It follows from Notes 1 and 2 above that every integer greater than 1 has at least 2 factors.
A prime number is a natural number with exactly two positive integer factors.
Notes: (1) An equivalent definition of a prime number is the following: A prime number is an integer
greater than 1 that is divisible only by 1 and itself.
(2) An integer greater than 1 that is not prime is called composite.
Example 12.1:
1. 0 is not prime because every positive integer is a factor of 0. Indeed, if $n \in \mathbb{Z}^+$, then $0 = n \cdot 0$, so that $n \mid 0$.
2. 1 is not prime because it has only one positive integer factor: if $1 = kb$ with $k > 0$, then $k = 1$ and $b = 1$.
3. The first ten prime numbers are 2, 3, 5, 7, 11, 13, 17, 19, 23, and 29.
4. 4 is not prime because 4 = 2 ⋅ 2. In fact, the only even prime number is 2 because by definition,
an even integer has 2 as a factor.
5. 9 is the first odd integer greater than 1 that is not prime. Indeed, 3, 5, and 7 are prime, but 9 is
not because 9 = 3 ⋅ 3.
6. The first ten composite numbers are 4, 6, 8, 9, 10, 12, 14, 15, 16, and 18.
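The definition of a prime number can be checked directly on small numbers. The following is a minimal Python sketch that counts positive factors; the helper names `positive_factors` and `is_prime` are illustrative choices and do not come from the text.

```python
def positive_factors(n):
    """Return the positive integer factors of a positive integer n."""
    return [d for d in range(1, n + 1) if n % d == 0]


def is_prime(n):
    """A natural number is prime if it has exactly two positive factors."""
    # 0 is excluded up front: every positive integer is a factor of 0.
    return n > 0 and len(positive_factors(n)) == 2


primes = [n for n in range(0, 30) if is_prime(n)]
# Composite means: greater than 1 and not prime.
composites = [n for n in range(2, 19) if not is_prime(n)]

print(primes)      # [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
print(composites)  # [4, 6, 8, 9, 10, 12, 14, 15, 16, 18]
```

Counting factors is slow for large inputs, but it mirrors the definition exactly, which is the point here.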
Two very important facts about prime numbers (that we will prove in this Lesson) are the following.
1. There are infinitely many prime numbers.
2. Every integer greater than 1 can be written uniquely as a product of prime numbers, up to the
order in which the factors are written.
The second fact is known as The Fundamental Theorem of Arithmetic. It is used often in many
branches of mathematics.
When we write an integer $n$ as a product of other integers, we call that product a factorization of $n$. If all the factors in the product are prime, we call the product a prime factorization of $n$.
Example 12.2:
1. 20 = 4 ⋅ 5 is a factorization of 20. This is not a prime factorization of 20 because 4 is not prime.
20 = 2 ⋅ 10 is another factorization of 20. This example shows that factorizations in general are
not unique.
2. An example of a prime factorization of 20 is 20 = 2 ⋅ 2 ⋅ 5. We can also write this prime
factorization as 2 ⋅ 5 ⋅ 2 or 5 ⋅ 2 ⋅ 2. So, you can see that if we consider different orderings of
the factors as different factorizations, then prime factorizations are not unique. This is why we
say that prime factorizations are unique, up to the order in which the factors are written.
3. A prime number is equal to its own prime factorization. In other words, we consider a prime
number to be a product of primes with just one factor in the product. For example, the prime
factorization of 2 is 2.
Recall from Lesson 4 that the Well Ordering Principle says that every nonempty subset of natural
numbers has a least element.
We will now use the Well Ordering Principle to prove half of the Fundamental Theorem of Arithmetic.
Theorem 12.1: Every integer greater than 1 can be written as a product of prime numbers.
Note that we left out the word “uniquely” here. The uniqueness is the second half of the Fundamental
Theorem of Arithmetic, which we will prove later in this lesson.
Analysis: We will prove this theorem by contradiction using the Well Ordering Principle. The idea is simple. If an integer $n$ greater than 1 is not prime, then it can be factored as $ab$ with $1 < a < n$ and $1 < b < n$. If $a$ and $b$ can be written as a product of primes, then so can $n$ because $n$ is simply the product of all the prime factors appearing in the factorizations of $a$ and $b$. For example, $6 = 2 \cdot 3$ and $20 = 2 \cdot 2 \cdot 5$. Therefore, we have $120 = 6 \cdot 20 = (2 \cdot 3) \cdot (2 \cdot 2 \cdot 5)$. Let’s write the proof.
Proof of Theorem 12.1: Suppose toward contradiction that there exists an integer greater than 1 that cannot be written as a product of prime numbers. By the Well Ordering Principle, there is a least such integer, let’s call it $n$. Since $n$ cannot be written as a product of prime numbers, in particular, $n$ is not prime. So, we can write $n = ab$ with $a, b \in \mathbb{N}$ and $1 < a < n$ and $1 < b < n$. Since $n$ is the least integer greater than 1 that cannot be written as a product of prime numbers, $a$ and $b$ can both be written as products of prime numbers. But then $n = ab$ is also a product of prime numbers, contradicting our choice of $n$. This contradiction shows that every integer greater than 1 can be written as a product of prime numbers.
□
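The proof of Theorem 12.1 is not constructive as written, but its idea (split a non-prime $n$ as $ab$ and factor each piece) translates into a recursive procedure. Here is a minimal Python sketch under that reading; the function names are hypothetical, and the search for a factor by trial division is an assumption of this sketch, not something the proof specifies.

```python
def smallest_proper_factor(n):
    """Return the least a with 1 < a < n and a | n, or None if n is prime."""
    for a in range(2, n):
        if n % a == 0:
            return a
    return None


def prime_factors(n):
    """Return a list of primes whose product is n, for an integer n > 1."""
    if n <= 1:
        raise ValueError("n must be an integer greater than 1")
    a = smallest_proper_factor(n)
    if a is None:
        # n is prime, so it is its own prime factorization.
        return [n]
    b = n // a  # n = a * b with 1 < a < n and 1 < b < n
    # Both a and b are smaller than n, so the recursion terminates.
    return prime_factors(a) + prime_factors(b)


print(prime_factors(120))  # [2, 2, 2, 3, 5]
```

The recursion terminates for the same reason the proof works: each recursive call is on an integer strictly between 1 and $n$.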
Notes: (1) Recall that a proof by contradiction works as follows:
1. We assume the negation of what we are trying to prove.
2. We use a logically valid argument to derive a statement which is false.
3. Since the argument was logically valid, the only possible error is our original assumption.
Therefore, the negation of our original assumption must be true.
The negation of the statement “Every integer greater than 1 can be written as a product of prime
numbers” is “There is an integer greater than 1 that cannot be written as a product of prime numbers.”
If we let $S = \{k \in \mathbb{N} \mid k > 1 \wedge k \text{ cannot be written as a product of prime numbers}\}$, then by our assumption, $S \neq \emptyset$. It follows from the Well Ordering Principle that $S$ has a least element, which in the proof above, we name $n$.
The argument then proceeds to factor $n$ as $ab$, where $a$ and $b$ are both greater than 1 and less than $n$. We can factor $n$ this way because $n$ is not prime.
Since $n$ is the least element of $S$, it follows that $a$ and $b$ are not in $S$. Therefore, $a$ and $b$ can be written as a product of prime numbers. But this immediately gives us a prime factorization of $n$, contradicting our original assumption.
Since every step of our argument was logically valid, the only thing that could have been wrong was
our original assumption. So, every integer greater than 1 can be written as a product of prime numbers.
(2) In general, if $P(x)$ is a property, then the negation of $\forall x(P(x))$ is $\exists x(\neg P(x))$. In other words, when we pass a negation symbol through a universal quantifier, the quantifier changes to an existential quantifier. So, $\neg \forall x(P(x)) \equiv \exists x(\neg P(x))$, where $\equiv$ is pronounced “is logically equivalent to.” For Theorem 12.1, the property $P(x)$ is $q(x) \to r(x)$, where $q(x)$ is “$x > 1$” and $r(x)$ is “$x$ can be written as a product of prime numbers.” Recall from part 2 of Example 9.5 in Lesson 9 that $\neg(q(x) \to r(x))$ is logically equivalent to $q(x) \wedge \neg r(x)$. So $\exists x(\neg P(x))$ says, “There is an integer $x$ such that $x > 1$ and $x$ cannot be written as a product of prime numbers.”
In general (although not needed here), we also have $\neg \exists x(P(x)) \equiv \forall x(\neg P(x))$.
Corollary 12.2: Every integer greater than 1 has a prime factor.
Proof: Let $n$ be an integer greater than 1. By Theorem 12.1, $n$ can be written as a product of prime numbers. Let $p$ be any of the prime numbers in that product. Then $p$ is a prime factor of $n$.
□
Theorem 12.3: There are infinitely many primes.
Analysis: Starting with a prime number $p$, we want to find a prime number greater than $p$. This will prove that there are infinitely many prime numbers, because if $P$ is a finite set of prime numbers, then the previous statement implies that we can find a prime number greater than the biggest number in the set $P$.
Now recall that if $n$ is a positive integer, then the number $n!$ (pronounced “$n$ factorial”) is defined by $n! = 1 \cdot 2 \cdots n$. For example, $3! = 1 \cdot 2 \cdot 3 = 6$ and $4! = 1 \cdot 2 \cdot 3 \cdot 4 = 24$.
If $n > 2$, then $n!$ is a number larger than $n$ that is divisible by every positive integer less than or equal to $n$. For example, $3! = 6$ is divisible by 1, 2, and 3, and $4! = 24$ is divisible by 1, 2, 3, and 4.
Now, for $n > 2$, $n!$ is certainly not prime. In fact, it has lots of factors! For example, $4! = 24$ has 8 factors (what are they?). Therefore, $n!$ itself won’t work for us. So, we add 1 to this number to get the number $M = n! + 1$.
By adding 1 to $n!$ to produce $M$, we have destroyed almost all the divisibility that we had. Specifically, $M$ is not divisible by any integer $k$ with $1 < k \le n$. To see this, let $k$ be an integer satisfying $1 < k \le n$. We know that there is an integer $r$ such that $n! = kr$ (because $n!$ is divisible by $k$). If $M$ were divisible by $k$, then there would be an integer $s$ such that $M = ks$. But then, by subtracting $n!$ from each side of the equation $M = n! + 1$, we get $1 = M - n! = ks - kr = k(s - r)$. Since $k > 1$ and $s - r$ is an integer, this is impossible! Therefore, $M$ is not divisible by $k$.
It would be nice if we could prove that $M$ is prime. Then $M$ would be a prime number greater than $n$, thus completing the proof. Sometimes $M$ does turn out to be prime. For example, if $n = 2$, then $M = 2! + 1 = 2 + 1 = 3$, which is prime. However, it is unfortunate for us that $M$ is not always prime. In Problem 6 below you will find values for $n$ for which $M$ is not prime.
However, even if $M$ is not prime, all is not lost. By Corollary 12.2, we know that $M$ has a prime factor, let’s call it $q$. We also know that $M$ is not divisible by any integer $k$ with $1 < k \le n$. It follows that $q$ is a prime number greater than $n$.
I think we’re ready to write out the proof.
Proof of Theorem 12.3: Let $P$ be a finite set of prime numbers with greatest member $p$ and let $M = p! + 1$. By Corollary 12.2, $M$ has a prime factor $q$. So, there is an integer $k$ such that $M = qk$. We show that $q > p$.
Suppose toward contradiction that $q \le p$. Then $q \mid p!$. So, there is an integer $r$ such that $p! = qr$. It follows that $1 = M - p! = qk - qr = q(k - r)$. So, $q = 1$, which contradicts that $q$ is prime.
It follows that $q > p$ and so, $q$ is greater than every prime number in $P$. Since $P$ was an arbitrary finite set of prime numbers, we have shown that there are infinitely many prime numbers.
□
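To see the argument of Theorem 12.3 in action, the following Python sketch computes $M = p! + 1$ for a few small primes $p$ and finds a prime factor of $M$. The helper `smallest_prime_factor` is a hypothetical name, and finding the factor by trial division is an assumption of this sketch (the proof only asserts that a prime factor exists).

```python
from math import factorial


def smallest_prime_factor(m):
    """Return the least divisor d >= 2 of an integer m >= 2.

    The least such divisor is always prime, since any proper factor
    of d would be a smaller divisor of m.
    """
    for d in range(2, m + 1):
        if m % d == 0:
            return d


for p in [2, 3, 5, 7]:
    M = factorial(p) + 1
    q = smallest_prime_factor(M)
    # The proof guarantees q > p, even when M itself is not prime.
    print(f"p = {p}, M = {M}, prime factor q = {q}, q > p: {q > p}")
```

For $p = 5$ this gives $M = 121 = 11 \cdot 11$, so $M$ is not prime, yet its prime factor 11 is still greater than 5.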
The Division Algorithm
In Lesson 4 (Example 4.7 and the notes following), we showed that every integer is even or odd, and never both. In other words, if $n \in \mathbb{Z}$, there are unique integers $k$ and $r$ such that $n = 2k + r$, where $r = 0$ or $r = 1$. We sometimes say, “When $n$ is divided by 2, $k$ is the quotient and $r$ is the remainder.”
Observe that when an integer $n$ is divided by 2, the quotient can be any integer, but the remainder can be only 0 or 1.
Example 12.3:
1. When 11 is divided by 2, the quotient is 5 and the remainder is 1. That is, 11 = 2 ⋅ 5 + 1.
2. When 20 is divided by 2, the quotient is 10 and the remainder is 0. That is, 20 = 2 ⋅ 10 + 0, or
equivalently, 20 = 2 ⋅ 10. Notice that in this case, 20 is divisible by 2.
3. When $-11$ is divided by 2, the quotient is $-6$ and the remainder is 1. That is, $-11 = 2(-6) + 1$. Compare this to the first example. Based on that example, most students would probably guess that the quotient here would turn out to be $-5$. But as you can see, that is not the case.
The Division Algorithm generalizes the notion of an integer $n$ being “even or odd” ($2k$ or $2k + 1$) to $n$ being equal to $mk + r$, where $0 \le r < m$.
For example, for $m = 3$, the Division Algorithm will tell us that every integer can be written uniquely in one of the three forms $3k$, $3k + 1$, or $3k + 2$. Observe that when an integer $n$ is divided by 3, the quotient can be any integer, but the remainder can be only 0, 1, or 2.
As one more example, for $m = 4$, the Division Algorithm will tell us that every integer can be written uniquely in one of the four forms $4k$, $4k + 1$, $4k + 2$, or $4k + 3$. Observe that when an integer $n$ is divided by 4, the quotient can be any integer, but the remainder can be only 0, 1, 2, or 3.
Example 12.4:
1. When 14 is divided by 3, the quotient is 4 and the remainder is 2. That is, 14 = 3 ⋅ 4 + 2.
2. When 36 is divided by 4, the quotient is 9 and the remainder is 0. That is, 36 = 4 ⋅ 9 + 0, or
equivalently, 36 = 4 ⋅ 9. Notice that in this case, 36 is divisible by 4.
3. When 17 is divided by 5, the quotient is 3 and the remainder is 2. That is, 17 = 5 ⋅ 3 + 2.
4. When $-17$ is divided by 5, the quotient is $-4$ and the remainder is 3. That is, $-17 = 5(-4) + 3$.
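The quotients and remainders in Examples 12.3 and 12.4 can be reproduced with Python's built-in `divmod`, which for a positive divisor uses floor division and so returns a remainder between 0 and the divisor minus 1. This is offered only as a quick check of the examples; the list name `examples` is an illustrative choice.

```python
# (dividend, divisor) pairs taken from Examples 12.3 and 12.4
examples = [(11, 2), (20, 2), (-11, 2), (14, 3), (36, 4), (17, 5), (-17, 5)]

for n, m in examples:
    k, r = divmod(n, m)  # k is the quotient, r is the remainder
    # Confirm the defining conditions of the Division Algorithm.
    assert n == m * k + r and 0 <= r < m
    print(f"{n} = {m}*({k}) + {r}")
```

Note that many other languages truncate toward zero instead, which would give a quotient of $-5$ and a remainder of $-1$ for $-11$ divided by 2; that convention does not match the Division Algorithm as stated here.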
Theorem 12.4 (The Division Algorithm): Let $n$ and $m$ be integers with $m > 0$. Then there are unique integers $k$ and $r$ such that $n = mk + r$ with $0 \le r < m$.
Many students find the standard proof of the Division Algorithm to be quite hard to follow. I know that
when I read the proof for the first time, I found it quite confusing. To better understand the argument,
let’s first run a couple of simulations using specific examples that mimic the proof.
Simulation 1: Let’s let $n = 7$ and $m = 2$. With these choices for $n$ and $m$, the Division Algorithm says that there are unique integers $k$ and $r$ such that $7 = 2k + r$ and $0 \le r < 2$ (in other words, $r = 0$ or $r = 1$).
Let’s look at the equation $7 = 2k + r$ in the form $7 - 2k = r$. In particular, let’s look at the possible values of $7 - 2k$ as $k$ ranges over all possible integers. Let’s do this by matching up each integer $k$ with the corresponding value of $7 - 2k$:
$$\begin{array}{c|ccccccccccc}
k & \cdots & -4 & -3 & -2 & -1 & 0 & 1 & 2 & \mathbf{3} & 4 & \cdots \\ \hline
7 - 2k & \cdots & 15 & 13 & 11 & 9 & 7 & 5 & 3 & \mathbf{1} & -1 & \cdots
\end{array}$$
Observe that the top row is simply “listing” all the integers. The “$\cdots$” to the left of $-4$ and to the right of 4 are there to indicate that this list keeps going infinitely in each direction. However, I did make sure to include the most important values in the visible part of our list.
We get each value in the bottom row by substituting the value above it for $k$ in the expression $7 - 2k$. For example, for $k = -4$, we have $7 - 2k = 7 - 2(-4) = 7 + 8 = 15$.
Notice that the values in the bottom row decrease by 2 units for each 1 unit increase in $k$. This is because $m = 2$.
We highlighted the column where $k = 3$ and $r = 7 - 2k = 1$. This is the column where the smallest nonnegative number appears in the bottom row. In other words, we let $r$ be the least nonnegative value of $7 - 2t$, as $t$ ranges over all the integers, and we let $k$ be the corresponding $t$-value.
In general, how do we know that these values exist?
Well, since $n \ge 0$ ($n = 7$ in this example), the expression $n - mt \ge 0$ when $t = 0$. It follows that the set $\{n - mt \mid t \in \mathbb{Z} \wedge n - mt \ge 0\} = \{7 - 2t \mid t \in \mathbb{Z} \wedge 7 - 2t \ge 0\}$ is not empty ($7 - 2 \cdot 0 = 7$ is in this set). So, we can invoke the Well Ordering Principle to get a least element $r$. In this simulation, $r$ will turn out to be 1 with a corresponding $k$-value of 3. (We will see what happens if $n < 0$ in the next simulation.)
By taking $r$ to be the least element from a set of natural numbers, we know that $r$ will be nonnegative. But how do we know that $r$ will be less than 2? We use the fact that the bottom row decreases by 2 units for each 1 unit increase in the top row.
Suppose we accidentally chose $r = 3$. Then we have $7 - 2k = 3$. If we subtract 2 from each side of this equation, we get $7 - 2k - 2 = 1$. Using distributivity, we have that $7 - 2k - 2$ is equal to $7 - 2(k + 1)$. So, $7 - 2(k + 1) = 1$. Looks like we chose the wrong value for $r$. What we just showed is that if we increase $k$ by 1 (from 2 to 3), we decrease $r$ by 2 (from 3 to 1).
In general, if $r \ge 2$, then we have $n - 2k \ge 2$, so that $n - 2k - 2 \ge 0$. Thus, $n - 2(k + 1) \ge 0$. But $n - 2(k + 1) = n - 2k - 2 < n - 2k$. This contradicts that $r$ was the least possible value of $n - 2t$ with $n - 2t \ge 0$. It follows that $r < 2$.
Now let’s check uniqueness. So, we have $7 = 2 \cdot 3 + 1$. How do we know that there aren’t two other numbers $k'$ and $r'$ with $0 \le r' < 2$ such that $7 = 2k' + r'$?
Well, if there were, then we would have $2 \cdot 3 + 1 = 2k' + r'$. Subtracting $2k'$ from each side of the equation and subtracting 1 from each side of the equation gives us $2 \cdot 3 - 2k' = r' - 1$. We now use the distributive property on the left to get $2(3 - k') = r' - 1$. This equation shows that 2 is a factor of $r' - 1$. $r'$ can’t be 0 because 2 is not a factor of $-1$. Therefore, $r' = 1$ (remember that 0 and 1 are the only two choices for $r'$). So, $2(3 - k') = 0$, and therefore, $3 - k' = 0$. So, $k' = 3$. Oh, look at that! $k'$ and $r'$ are the same as $k$ and $r$.
So, we just proved that there is exactly one way to write 7 in the form $2k + r$ with $k$ and $r$ integers and $0 \le r < 2$. We showed that $7 = 2 \cdot 3 + 1$ is the only way to do it.
Simulation 2: This time, let’s let $n = -4$ and $m = 3$. With these choices for $n$ and $m$, the Division Algorithm says that there are unique integers $k$ and $r$ such that $-4 = 3k + r$ and $0 \le r < 3$ (in other words, $r = 0$, $r = 1$, or $r = 2$).
Let’s look at the equation $-4 = 3k + r$ in the form $-4 - 3k = r$, and as we did in Simulation 1, let’s match up each integer $k$ with the corresponding value of $-4 - 3k$:
$$\begin{array}{c|ccccccccccc}
k & \cdots & -4 & -3 & \mathbf{-2} & -1 & 0 & 1 & 2 & 3 & 4 & \cdots \\ \hline
-4 - 3k & \cdots & 8 & 5 & \mathbf{2} & -1 & -4 & -7 & -10 & -13 & -16 & \cdots
\end{array}$$
This time, since $m = 3$, the values in the bottom row decrease by 3 units for each 1 unit increase in $k$.
We highlighted the column where $k = -2$ and $r = -4 - 3(-2) = -4 + 6 = 2$ because it is the column where the smallest nonnegative number appears in the bottom row. This time 2 is the smallest possible value of $r$, and this $r$-value corresponds to a $k$-value of $-2$.
Since $n < 0$ this time ($n = -4$ in this example), setting $t = 0$ in the expression $n - mt$ does not produce a nonnegative value. This time, we let $t = n$ to get $n - m \cdot n$ (specifically, for this simulation we set $t = -4$ to get $-4 - 3(-4) = -4 + 12 = 8$, which is greater than 0). It follows that the set $\{n - mt \mid t \in \mathbb{Z} \wedge n - mt \ge 0\} = \{-4 - 3t \mid t \in \mathbb{Z} \wedge -4 - 3t \ge 0\}$ is not empty. So, once again, we can invoke the Well Ordering Principle to get a least element $r$. In this simulation, $r$ will turn out to be 2 with a corresponding $k$-value of $-2$.
As in Simulation 1, it is clear that $r \ge 0$, and we use the fact that the bottom row decreases by 3 units for each 1 unit increase in the top row to show that $r < 3$.
Suppose we accidentally chose $r = 5$. Then we have $-4 - 3k = 5$. If we subtract 3 from each side of this equation, we get $-4 - 3k - 3 = 2$. But using distributivity, we have that $-4 - 3k - 3$ is equal to $-4 - 3(k + 1)$. So, $-4 - 3(k + 1) = 2$. What we just showed is that if we increase $k$ by 1 (from $-3$ to $-2$), we decrease $r$ by 3 (from 5 to 2).
In general, if $r \ge 3$, then we have $n - 3k \ge 3$, so that $n - 3k - 3 \ge 0$. Thus, $n - 3(k + 1) \ge 0$. But $n - 3(k + 1) = n - 3k - 3 < n - 3k$. This contradicts that $r$ was the least possible value of $n - 3t$ with $n - 3t \ge 0$. It follows that $r < 3$.
I leave it as an exercise for the reader to check uniqueness for this special case.
Let’s move on to the proof of the Theorem.
Proof of Theorem 12.4: Let $n, m \in \mathbb{Z}$ with $m > 0$, and let $S = \{n - mt \mid t \in \mathbb{Z} \wedge n - mt \ge 0\}$. To see that $S \neq \emptyset$, we consider two cases. If $n \ge 0$, then let $t = 0$, and we have $n - mt = n \in S$. If $n < 0$, then let $t = n$, so that we have $n - mt = n - mn = n(1 - m)$. Since $m \ge 1$, we have $1 - m \le 0$. It follows that $n(1 - m) \ge 0$, and so, $n - mt \in S$. In both cases, we have shown that $S \neq \emptyset$.
Since $S$ is a nonempty subset of natural numbers, by the Well Ordering Principle, $S$ has a least element $r = n - mk$, where $k \in \mathbb{Z}$. Since $S \subseteq \mathbb{N}$, $r \ge 0$. By adding $mk$ to each side of the equation, we have $n = mk + r$.
We need to show that $r < m$. Suppose toward contradiction that $r \ge m$. Substituting $n - mk$ for $r$ gives us $n - mk \ge m$. Subtracting $m$ from each side of this last inequality gives $(n - mk) - m \ge 0$. Now, since $m > 0$, $r > r - m = (n - mk) - m$. But $(n - mk) - m = n - mk - m = n - m(k + 1)$, and so, $(n - mk) - m$ is an element of $S$ smaller than $r$, contradicting $r$ being the least element of $S$. This contradiction tells us that we must have $r < m$.
We still need to prove that $k$ and $r$ are unique. Suppose that $n = mk_1 + r_1$ and $n = mk_2 + r_2$ with both $0 \le r_1 < m$ and $0 \le r_2 < m$. Without loss of generality, we may assume that $r_2 \ge r_1$.
By a simple substitution, $mk_1 + r_1 = mk_2 + r_2$. Subtracting $mk_2$ from each side of the equation and simultaneously subtracting $r_1$ from each side of the equation, we get $mk_1 - mk_2 = r_2 - r_1$. Factoring $m$ on the left gives $m(k_1 - k_2) = r_2 - r_1$, and we see that $m \mid r_2 - r_1$.
Since $r_2 \ge r_1$, we have $r_2 - r_1 \ge 0$. Since we have $r_1 \ge 0$ and $r_2 < m$, we have $r_2 - r_1 < m - 0 = m$. So, $m \mid r_2 - r_1$ and $0 \le r_2 - r_1 < m$. It follows that $r_2 - r_1 = 0$. So, $r_2 = r_1$. Finally, $r_2 = r_1$ and $mk_1 + r_1 = mk_2 + r_2$ together imply that $mk_1 = mk_2$, and so, $k_1 = k_2$.
□
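The existence half of the proof can be turned into a small search that follows the same steps as the two simulations: start from a value of $t$ that is known to land in $S$, then increase $t$ until the value drops below $m$. This is a minimal Python sketch under that reading; the function name `division_algorithm` and the variable names are illustrative, not the author's.

```python
def division_algorithm(n, m):
    """Return (k, r) with n == m*k + r and 0 <= r < m, for integers n, m with m > 0."""
    if m <= 0:
        raise ValueError("m must be positive")
    # Starting point used in the proof: t = 0 if n >= 0, and t = n if n < 0.
    # In both cases n - m*t >= 0, so n - m*t belongs to the set S.
    t = 0 if n >= 0 else n
    # Each increase of t by 1 lowers n - m*t by m. We stop at the least
    # element of S, which is the first value that is smaller than m.
    while n - m * t >= m:
        t += 1
    return t, n - m * t


print(division_algorithm(7, 2))   # (3, 1), as in Simulation 1
print(division_algorithm(-4, 3))  # (-2, 2), as in Simulation 2
```

The loop is far from efficient, but each pass corresponds to moving one column to the right in the tables from the simulations.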
GCD and LCM
Let $a$ and $b$ be two integers. An integer $j$ is a common divisor (or common factor) of $a$ and $b$ if $j$ is a factor of both $a$ and $b$. An integer $k$ is a common multiple of $a$ and $b$ if $k$ is a multiple of both $a$ and $b$.
Example 12.5: Let $a = 6$ and $b = 15$. The positive divisors of $a$ are $\mathbf{1}$, 2, $\mathbf{3}$, and 6. The positive divisors of $b$ are $\mathbf{1}$, $\mathbf{3}$, 5, and 15. Therefore, the positive common divisors of $a$ and $b$ are $\mathbf{1}$ and $\mathbf{3}$.
For each positive divisor there is a corresponding negative divisor. So, a complete list of the divisors of $a$ is 1, 2, 3, 6, $-1$, $-2$, $-3$, and $-6$, and a complete list of the divisors of $b$ is 1, 3, 5, 15, $-1$, $-3$, $-5$, and $-15$. Therefore, a complete list of the common divisors of $a$ and $b$ is $\mathbf{1}$, $\mathbf{3}$, $\mathbf{-1}$, and $\mathbf{-3}$.
If both $j$ and $-j$ are in a list, we will sometimes use the notation $\pm j$ instead of listing $j$ and $-j$ separately. In this example, we can say that the complete list of common divisors of $a$ and $b$ is $\pm 1, \pm 3$.
The multiples of $a$ are $\pm 6, \pm 12, \pm 18, \pm 24, \pm\mathbf{30}, \pm 36, \ldots$ and so on. The multiples of $b$ are $\pm 15, \pm\mathbf{30}, \pm 45, \pm\mathbf{60}, \ldots$ and so on. Therefore, the common multiples of $a$ and $b$ are $\pm\mathbf{30}, \pm\mathbf{60}, \pm\mathbf{90}, \pm\mathbf{120}, \ldots$ and so on.
Again, let $a$ and $b$ be distinct integers. The greatest common divisor (or greatest common factor) of $a$ and $b$, written $\gcd(a, b)$, is the largest common divisor of $a$ and $b$. The least common multiple of $a$ and $b$, written $\operatorname{lcm}(a, b)$, is the smallest positive common multiple of $a$ and $b$.
Example 12.6:
1. From Example 12.5, it’s easy to see that $\gcd(6, 15) = 3$ and $\operatorname{lcm}(6, 15) = 30$.
2. $\gcd(2, 3) = 1$ and $\operatorname{lcm}(2, 3) = 6$. More generally, if $p$ and $q$ are prime numbers with $p \neq q$, then $\gcd(p, q) = 1$ and $\operatorname{lcm}(p, q) = pq$.
3. $\gcd(4, 15) = 1$ and $\operatorname{lcm}(4, 15) = 60$. Observe that neither 4 nor 15 is prime, and yet their gcd is 1 and their lcm is the product of 4 and 15. This is because 4 and 15 have no common factors except for 1 and $-1$. We say that 4 and 15 are relatively prime.
Note that if $p$ and $q$ are prime numbers with $p \neq q$, then $p$ and $q$ are relatively prime.
We have the following more general result: if $a$ and $b$ are relatively prime integers, then $\gcd(a, b) = 1$ and $\operatorname{lcm}(a, b) = ab$ (see Theorem 12.10 below).
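The definitions of gcd and lcm can be applied literally by listing divisors and scanning multiples. The sketch below does exactly that for nonzero integers; the function names are hypothetical, and the brute-force approach is chosen because it follows the definitions, not because it is efficient.

```python
def divisors(n):
    """Return the set of positive divisors of a nonzero integer n."""
    n = abs(n)
    return {d for d in range(1, n + 1) if n % d == 0}


def gcd_by_definition(a, b):
    """Largest common divisor of nonzero integers a and b."""
    return max(divisors(a) & divisors(b))


def lcm_by_definition(a, b):
    """Smallest positive common multiple of nonzero integers a and b."""
    m = max(abs(a), abs(b))  # no smaller positive number can be a multiple of both
    while m % a != 0 or m % b != 0:
        m += 1
    return m


for a, b in [(6, 15), (2, 3), (4, 15)]:
    print(a, b, gcd_by_definition(a, b), lcm_by_definition(a, b))
```

For the three pairs in Example 12.6 this reproduces the values 3 and 30, 1 and 6, and 1 and 60.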
We can extend all these ideas to larger sets of numbers. Specifically, let $X$ be a finite set of integers containing at least one nonzero integer. Then the greatest common divisor of the integers in $X$, written $\gcd(X)$ (or $\gcd(a_1, a_2, \ldots, a_n)$, where $X = \{a_1, a_2, \ldots, a_n\}$), is the largest integer that divides every integer in the set $X$, and the least common multiple of the integers in $X$, written $\operatorname{lcm}(X)$ (or $\operatorname{lcm}(a_1, a_2, \ldots, a_n)$), is the smallest positive integer that each integer in the set $X$ divides.
For convenience, if $X$ contains only 0, we define $\gcd(X) = 0$.
Also, the integers in the set $X$ are said to be mutually relatively prime if $\gcd(X) = 1$. The integers in the set $X$ are said to be pairwise relatively prime if for each pair $a, b \in X$ with $a \neq b$, $\gcd(a, b) = 1$.
Example 12.7:
1. gcd(10, 15, 35) = 5 and lcm(10, 15, 35) = 210.
2. gcd(2, 3, 12) = 1 and lcm(2, 3, 12) = 12. Notice that here 2, 3, and 12 are mutually relatively
prime, but not pairwise relatively prime because for example, gcd(2, 12) = 2 ≠ 1.
3. gcd(10, 21, 143) = 1 and lcm(10, 21, 143) = 30,030. In this case, we have 10, 21, and 143
are pairwise relatively prime.
We have the following result: if $X = \{a_1, a_2, \ldots, a_n\}$ is a set of pairwise relatively prime integers, then $\gcd(X) = 1$ and $\operatorname{lcm}(X) = a_1 a_2 \cdots a_n$. The proof of this is left as an optional exercise for the reader. Also note that pairwise relatively prime implies mutually relatively prime.
4. For a set $X$ with just one element $a$, $\gcd(a) = a$ and $\operatorname{lcm}(a) = a$. In particular, $\gcd(0) = 0$ and $\operatorname{lcm}(0) = 0$.
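The difference between mutually and pairwise relatively prime is easy to test by machine. The following Python sketch uses the standard library's `math.gcd` together with `functools.reduce` and `itertools.combinations`; the three function names are illustrative, and reducing with a two-argument gcd is an assumption of this sketch (it relies on the fact that the gcd of a list can be computed two numbers at a time).

```python
from functools import reduce
from itertools import combinations
from math import gcd


def gcd_of_set(X):
    """gcd of a nonempty list of integers, computed two at a time."""
    return reduce(gcd, X)


def mutually_relatively_prime(X):
    """True if the gcd of the whole list is 1."""
    return gcd_of_set(X) == 1


def pairwise_relatively_prime(X):
    """True if every pair of distinct entries has gcd 1."""
    return all(gcd(a, b) == 1 for a, b in combinations(X, 2))


for X in [[10, 15, 35], [2, 3, 12], [10, 21, 143]]:
    print(X, gcd_of_set(X), mutually_relatively_prime(X), pairwise_relatively_prime(X))
```

The list `[2, 3, 12]` is the interesting case: it is mutually relatively prime but not pairwise relatively prime, as noted in part 2 of Example 12.7.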
Let $a, b \in \mathbb{Z}$. A linear combination of $a$ and $b$ is an expression of the form $ma + nb$ with $m, n \in \mathbb{Z}$. We call the integers $m$ and $n$ weights.
Example 12.8:
1. Since 5 ⋅ 10 − 2 ⋅ 15 = 50 − 30 = 20, we see that 20 is a linear combination of 10 and 15.
When we write 20 as 5 ⋅ 10 − 2 ⋅ 15, the weights are 5 and – 2.
This is not the only way to write 20 as a linear combination of 10 and 15. For example, we also
have – 1 ⋅ 10 + 2 ⋅ 15 = – 10 + 30 = 20. When we write 20 as – 1 ⋅ 10 + 2 ⋅ 15, the weights
are – 1 and 2.
2. Any number that is a multiple of either 10 or 15 is a linear combination of 10 and 15 because
we can allow weights to be 0. For example, 80 is a linear combination of 10 and 15 because
80 = 8 ⋅ 10 + 0 ⋅ 15.
Also, 45 is a linear combination of 10 and 15 because 45 = 0 ⋅ 10 + 3 ⋅ 15.
3. We will see in Theorem 12.5 below that $\gcd(a, b)$ can always be written as a linear combination of $a$ and $b$. For example, $\gcd(10, 15) = 5$, and we have $5 = -1 \cdot 10 + 1 \cdot 15$.
4. Using the same theorem mentioned in 3, if $a$ and $b$ are relatively prime, then 1 can be written as a linear combination of $a$ and $b$. For example, 4 and 15 are relatively prime and we have $1 = 4 \cdot 4 - 1 \cdot 15$.
Theorem 12.5: Let $a$ and $b$ be integers, at least one of which is not 0. Then $\gcd(a, b)$ is the least positive integer $k$ such that there exist $m, n \in \mathbb{Z}$ with $k = ma + nb$.
This theorem says two things. First, it says that $\gcd(a, b)$ can be written as a linear combination of $a$ and $b$. Second, it says that any positive integer smaller than $\gcd(a, b)$ cannot be written as a linear combination of $a$ and $b$.
Proof: We first prove the theorem for $a, b \in \mathbb{Z}^+$. So, let $a, b$ be positive integers and let $S$ be the set of all positive linear combinations of $a$ and $b$ with weights in $\mathbb{Z}$.
$$S = \{ma + nb \mid m, n \in \mathbb{Z} \wedge ma + nb > 0\}$$
Notice that $a, b \in S$ because $a = 1a + 0b$ and $b = 0a + 1b$. In particular, $S \neq \emptyset$. By the Well Ordering Principle, $S$ has a least element $k$. By the definition of $S$, there exist $m, n \in \mathbb{Z}$ with $k = ma + nb$.
By the Division Algorithm, there are $s, r \in \mathbb{Z}$ with $a = ks + r$ and $0 \le r < k$.
So, $r = a - ks = a - (ma + nb)s = a - msa - nsb = (1 - ms)a - (ns)b$. We see that $r$ is a linear combination of $a$ and $b$. Since $r < k$ and $r$ is a linear combination of $a$ and $b$, $r$ cannot be in $S$ (because $k$ is the least element of $S$). So, $r$ must be 0. It follows that $a = ks$. Therefore, $k \mid a$.
Replacing $a$ by $b$ in the last two paragraphs shows that $k \mid b$ as well. So, $k$ is a common divisor of $a$ and $b$. Now, if $c$ is another common divisor of $a$ and $b$, then by Problem 7 from Problem Set 4 in Lesson 4, $c$ is a divisor of any linear combination of $a$ and $b$. Since $k$ is a linear combination of $a$ and $b$, $c$ is a divisor of $k$. Since every common divisor of $a$ and $b$ is also a divisor of the positive integer $k$ (and is therefore at most $k$), it follows that $k = \gcd(a, b)$.
Since $ma = (-m)(-a)$ and $nb = (-n)(-b)$, the result holds whenever $a$ and $b$ are both nonzero.
Finally, suppose $a = 0$ or $b = 0$. Without loss of generality, let $a = 0$. Then $b \neq 0$. So, $\gcd(a, b) = b$ (or $-b$ if $b < 0$). We also have for any $m, n \in \mathbb{Z}$, $ma + nb = m \cdot 0 + nb = nb$. The least positive integer of the form $nb$ is $1 \cdot b = b$ (or $-1 \cdot b$ if $b < 0$). So, the result holds in this case as well.
□
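Theorem 12.5 can be spot-checked numerically by searching for the least positive linear combination and comparing it with the gcd. The sketch below is a brute-force search, not the method of the proof: the function name is hypothetical, and the finite search window `bound` is an assumption made so that the search terminates (it is large enough to contain suitable weights for the small inputs used here).

```python
from math import gcd


def least_positive_combination(a, b):
    """Search integer weights m, n in a bounded window and return (k, m, n),
    where k = m*a + n*b is the least positive value found."""
    bound = abs(a) + abs(b)  # assumed search window, not taken from the text
    best = None
    for m in range(-bound, bound + 1):
        for n in range(-bound, bound + 1):
            k = m * a + n * b
            if k > 0 and (best is None or k < best[0]):
                best = (k, m, n)
    return best


for a, b in [(10, 15), (4, 15), (6, 15), (0, -7)]:
    k, m, n = least_positive_combination(a, b)
    print(f"a={a}, b={b}: least positive combination {k} = {m}*{a} + {n}*{b}, gcd = {gcd(a, b)}")
```

The weights found are not unique, which agrees with part 1 of Example 12.8; only the least positive value $k$ is determined by $a$ and $b$.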
We’re almost ready to finish proving the Fundamental Theorem of Arithmetic. We will first prove two
preliminary results that will make the proof easier.
Theorem 12.6: Let $a, b, c \in \mathbb{Z}^+$ with $a$ and $b$ relatively prime and $a \mid bc$. Then $a \mid c$.
Proof: Let $a, b, c \in \mathbb{Z}^+$ with $a$ and $b$ relatively prime and let $a \mid bc$. Since $\gcd(a, b) = 1$, by Theorem 12.5, there are integers $m$ and $n$ with $1 = ma + nb$. Since $a \mid bc$, there is an integer $k$ such that $bc = ak$. Multiplying each side of the equation $1 = ma + nb$ by $c$ and using the distributive property, $c = c(ma + nb) = cma + cnb = mac + nbc = mac + nak = a(mc + nk)$. Since $c, k, m, n \in \mathbb{Z}$ and $\mathbb{Z}$ is closed under addition and multiplication, $mc + nk \in \mathbb{Z}$. Therefore, $a \mid c$.
□
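To make the algebra in the proof of Theorem 12.6 concrete, here is one worked instance. The numbers $a = 4$, $b = 15$, $c = 8$ are chosen only for illustration, and the names $m$, $n$ (weights) and $k$ (the integer with $bc = ak$) are labels used in this example.

```latex
\begin{align*}
a &= 4,\quad b = 15,\quad c = 8, \qquad bc = 120 = 4 \cdot 30 \quad (\text{so } k = 30),\\
1 &= 4 \cdot 4 + (-1) \cdot 15 \quad (\text{weights } m = 4,\ n = -1),\\
c &= c(ma + nb) = 8(4 \cdot 4) + (-1)(15 \cdot 8) = 4 \cdot 32 - 4 \cdot 30 = 4(32 - 30) = 4 \cdot 2.
\end{align*}
```

The last line exhibits $c = 8$ as $a$ times the integer $mc + nk = 32 - 30 = 2$, which is exactly the conclusion $a \mid c$.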
Theorem 12.7: Let $p$ be prime and let $a_1, a_2, \ldots, a_n$ be positive integers such that $p \mid a_1 a_2 \cdots a_n$. Then there is an integer $j$ with $1 \le j \le n$ such that $p \mid a_j$.
Proof: We will prove this theorem by induction on $n \ge 1$.
Base Case ($n = 1$): We are given that $p$ is prime, $a_1 \in \mathbb{Z}^+$, and $p \mid a_1$. Wait a sec… $p \mid a_1$ is the conclusion we were looking for. So, the theorem holds for $n = 1$.
Inductive Step: Let $k \in \mathbb{N}$ with $k \ge 1$ and assume that the result holds for $n = k$.
Let $p$ be prime and let $a_1, a_2, \ldots, a_k, a_{k+1}$ be positive integers such that $p \mid a_1 a_2 \cdots a_k a_{k+1}$. Since $p$ is prime, its only positive factors are 1 and $p$. Therefore, $\gcd(p, a_1 a_2 \cdots a_k)$ is either 1 or $p$.
If $\gcd(p, a_1 a_2 \cdots a_k) = 1$, then by Theorem 12.6, $p \mid a_{k+1}$. If $\gcd(p, a_1 a_2 \cdots a_k) = p$, then $p \mid a_1 a_2 \cdots a_k$, and by our inductive assumption, there is an integer $j$ with $1 \le j \le k$ such that $p \mid a_j$.
Therefore, the result holds for $n = k + 1$.
By the Principle of Mathematical Induction, the result holds for all $n \in \mathbb{N}$ with $n \ge 1$.
□
We are finally ready to finish the proof of the Fundamental Theorem of Arithmetic.
Theorem 12.8 (The Fundamental Theorem of Arithmetic): Every integer greater than 1 can be written
uniquely as a product of prime numbers, up to the order in which the factors are written.
Proof: By Theorem 12.1, every integer greater than 1 can be written as a product of prime numbers.
We need to show that any two such prime factorizations are equal. Assume toward contradiction that
๐ can be written in the following two different ways: ๐ = ๐1 ๐2 โฏ ๐๐ = ๐1 ๐2 โฏ ๐๐ , where
๐1, ๐2 , … , ๐๐ , ๐1 , ๐2 , … , ๐๐ are prime numbers. Without loss of generality, assume ๐1 ≤ ๐2 ≤ โฏ ≤ ๐๐
and ๐1 ≤ ๐2 ≤ โฏ ≤ ๐๐ . Also, by cancelling common primes on the left with common primes on the
right, we may assume that for all ๐ ≤ ๐ and ๐ ≤ ๐, ๐๐ ≠ ๐๐ . Suppose 1 ≤ ๐ ≤ ๐. Then ๐๐ |๐1 ๐2 โฏ ๐๐ .
Since ๐1 ๐2 โฏ ๐๐ = ๐1 ๐2 โฏ ๐๐ , we have ๐๐ |๐1 ๐2 โฏ ๐๐ . By Theorem 12.7, there is ๐ with 1 ≤ ๐ ≤ ๐ such
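that $p_i \mid q_j$. Since $q_j$ is prime and $p_i > 1$, it follows that $p_i = q_j$, contradicting our assumption that $p_i \neq q_j$ for all $i$ and $j$. So, there cannot exist two different prime factorizations of $n$.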
โก
Since prime factorizations are unique only up to the order in which the factors are written, there can
be many ways to write a prime factorization. For example, 10 can be written as 2 ⋅ 5 or 5 ⋅ 2. To make
things as simple as possible we always agree to use the canonical representation (or canonical form).
The word “canonical” is just a fancy name for “natural,” and the most natural way to write a prime
factorization is in increasing order of primes. So, the canonical representation of 10 is 2 · 5.
As another example, the canonical representation of 18 is $2 \cdot 3 \cdot 3$. We can tidy this up a bit by rewriting
$3 \cdot 3$ as $3^2$. So, the canonical representation of 18 is $2 \cdot 3^2$.
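The canonical representation can also be computed mechanically. The following is a minimal Python sketch, not a method taken from this lesson: the function names `canonical_factorization` and `format_factorization` are our own, and the approach is plain repeated division by 2, 3, 4, and so on (a composite trial divisor can never divide what is left, because its prime factors have already been divided out).

```python
def canonical_factorization(n):
    """Return the prime factorization of an integer n > 1 as a list of
    (prime, exponent) pairs, listed in increasing order of primes."""
    if n < 2:
        raise ValueError("n must be an integer greater than 1")
    factors = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            exponent = 0
            while n % p == 0:  # divide out every copy of p
                n //= p
                exponent += 1
            factors.append((p, exponent))
        p += 1
    if n > 1:  # whatever is left is a prime larger than all factors found
        factors.append((n, 1))
    return factors


def format_factorization(factors):
    """Render pairs such as [(2, 1), (3, 2)] as the string '2 · 3^2'."""
    return " · ".join(str(p) if e == 1 else f"{p}^{e}" for p, e in factors)


print(format_factorization(canonical_factorization(10)))  # 2 · 5
print(format_factorization(canonical_factorization(18)))  # 2 · 3^2
```

Because the trial divisors increase, the primes come out in increasing order automatically, which is exactly the canonical form.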
If you are new to factoring, you may find it helpful to draw a factor tree.
For example, here is a factor tree for 18:
        18
       /  \
     [2]   9
          / \
        [3] [3]
To draw this tree, we started by writing 18 as the product 2 · 9. We put a box around 2 because 2 is
prime and does not need to be factored any more. We then proceeded to factor 9 as 3 · 3. We put a
box around each 3 because 3 is prime. We now see that we are done, and the prime factorization can
be found by multiplying all the boxed numbers together. Remember that we will usually want the
canonical representation, and so, we write the final product in increasing order of primes.
By the Fundamental Theorem of Arithmetic above it does not matter how we factor the number—we
will always get the same canonical form. For example, here is a different factor tree for 18:
        18
       /  \
     [3]   6
          / \
        [2] [3]
Now, to prove that a positive integer ๐ is composite, we simply need to produce a factor of ๐ that is
different from 1 and ๐ itself. This may sound easy, but in practice, as we look at larger and larger values
of ๐ it can become very difficult to find factors of ๐. For example, the largest prime number that we
are currently aware of (at the time I am writing this book) is $2^{77,232,917} - 1$. This is an enormous number
with 23,249,425 digits. By Theorem 12.3, we know that there are prime numbers larger than this, but
we have not yet found one.
The following theorem provides a couple of tricks to help us (or a computer) determine if a positive
integer is prime more quickly.
Theorem 12.9: If ๐ is composite, then ๐ has a prime factor ๐ ≤ √๐.
Proof: Let ๐ be composite, so that there are integers ๐, ๐ with 1 < ๐, ๐ < ๐ and ๐ = ๐๐. If both ๐ and
๐ are greater than √๐, then we would have ๐ = ๐๐ > √๐ ⋅ √๐ = ๐, a contradiction. So, either ๐ ≤ √๐
or ๐ ≤ √๐. Without loss of generality, suppose that ๐ ≤ √๐. By Corollary 12.2, ๐ has a prime factor ๐.
Since $p$ is a factor of $a$ and $a$ is a factor of $n$, it follows that $p$ is a factor of $n$. Also, since $p$ is a factor of $a$
and $a \leq \sqrt{n}$, we have $p \leq a \leq \sqrt{n}$.
โก
Example 12.9:
1. Let’s determine if 187 is prime or composite. Since √187 < √196 = 14, by Theorem 12.9, we
need only check to see if 187 is divisible by 2, 3, 5, 7, 11, and 13. Checking each of these, we see
that 187 = 11 ⋅ 17. So, 187 is composite.
2. Let’s determine if 359 is prime or composite. Since √359 < √361 = 19, by Theorem 12.9, we
need only check to see if 359 is divisible by 2, 3, 5, 7, 11, 13, and 17. A quick check shows that
359 is not divisible by any of these numbers, and so, 359 is prime.
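The check in Example 12.9 is easy to hand to a computer. Here is a short Python sketch under one simplifying assumption: instead of testing only the primes up to √n, it tests every integer from 2 up to √n. This is harmless, because the smallest divisor of n greater than 1 is always prime. The function names are ours.

```python
import math


def smallest_factor_up_to_sqrt(n):
    """Return the smallest d with 2 <= d <= sqrt(n) that divides n,
    or None if there is no such d."""
    for d in range(2, math.isqrt(n) + 1):
        if n % d == 0:
            return d
    return None


def is_prime(n):
    """By Theorem 12.9, an integer n > 1 with no factor d satisfying
    2 <= d <= sqrt(n) cannot be composite."""
    return n > 1 and smallest_factor_up_to_sqrt(n) is None


print(smallest_factor_up_to_sqrt(187))  # 11
print(is_prime(187))                    # False
print(is_prime(359))                    # True
```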
Sometimes in a prime factorization we will want to make sure that we do not “skip” any primes, and
that each prime has a power.
For example, the canonical representation of 50 is $2 \cdot 5^2$. Note that we “skipped over” the prime 3 and
there is no exponent written for 2. We can easily give 2 an exponent by rewriting it as $2^1$, and since
$x^0 = 1$ for any nonzero $x$ (by definition), we can write $1 = 3^0$. Therefore, the prime factorization of
50 can be written as $2^1 \cdot 3^0 \cdot 5^2$.
This convention can be especially useful when comparing two or more positive integers or performing
an operation on two or more integers. We will say that $p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n}$ is a complete prime factorization
if $p_0, p_1, \ldots, p_n$ are the first $n + 1$ primes ($p_0 = 2$, $p_1 = 3$, and so on) and $a_0, a_1, \ldots, a_n \in \mathbb{N}$.
Example 12.10:
1. The prime factorization of 364 in canonical form is $2^2 \cdot 7 \cdot 13$. However, this is not a complete
factorization.
A complete factorization of 364 is $2^2 \cdot 3^0 \cdot 5^0 \cdot 7^1 \cdot 11^0 \cdot 13^1$. This is not the only complete
factorization of 364. Another one is $2^2 \cdot 3^0 \cdot 5^0 \cdot 7^1 \cdot 11^0 \cdot 13^1 \cdot 17^0$.
Given a complete factorization $p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n}$ of a positive integer, $p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n} p_{n+1}^0$ is another
complete factorization, and in fact, for any $k \in \mathbb{N}$, $p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n} p_{n+1}^0 p_{n+2}^0 \cdots p_{n+k}^0$ is also a
complete factorization of that same positive integer. In words, we can include finitely many
additional prime factors at the tail end of the original factorization all with exponent 0. Just be
careful not to skip any primes!
2. $2^0 \cdot 3^5 \cdot 5^0 \cdot 7^2 \cdot 11^0 \cdot 13^0 \cdot 17^2$ and $2^3 \cdot 3^1 \cdot 5^0 \cdot 7^0 \cdot 11^6$ are complete prime factorizations. In
many cases, it is useful to rewrite the second factorization as $2^3 \cdot 3^1 \cdot 5^0 \cdot 7^0 \cdot 11^6 \cdot 13^0 \cdot 17^0$.
This is also a complete prime factorization. However, this one has all the same prime factors as
the first number given.
Complete prime factorizations give us an easy way to compute greatest common divisors and least
common multiples of positive integers.
Suppose that $a = p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n}$ and $b = p_0^{b_0} p_1^{b_1} \cdots p_n^{b_n}$ are complete prime factorizations of $a$ and $b$.
Then we have
$$\gcd(a, b) = p_0^{\min\{a_0, b_0\}} p_1^{\min\{a_1, b_1\}} \cdots p_n^{\min\{a_n, b_n\}}$$
$$\operatorname{lcm}(a, b) = p_0^{\max\{a_0, b_0\}} p_1^{\max\{a_1, b_1\}} \cdots p_n^{\max\{a_n, b_n\}}.$$
Example 12.11: Let $a = 2 \cdot 5^2 \cdot 7$ and $b = 3 \cdot 5 \cdot 11^2$. We can rewrite $a$ and $b$ with the following
complete prime factorizations: $a = 2^1 \cdot 3^0 \cdot 5^2 \cdot 7^1 \cdot 11^0$ and $b = 2^0 \cdot 3^1 \cdot 5^1 \cdot 7^0 \cdot 11^2$. From these
factorizations, it is easy to compute $\gcd(a, b)$ and $\operatorname{lcm}(a, b)$.
$\gcd(a, b) = 2^0 \cdot 3^0 \cdot 5^1 \cdot 7^0 \cdot 11^0 = 5$ and $\operatorname{lcm}(a, b) = 2^1 \cdot 3^1 \cdot 5^2 \cdot 7^1 \cdot 11^2 = 127{,}050$.
Observe that in this example, $ab = 350 \cdot 1815 = 635{,}250 = 5 \cdot 127{,}050 = \gcd(a, b) \cdot \operatorname{lcm}(a, b)$.
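The min/max recipe of Example 12.11 can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names `first_primes`, `exponents`, and `gcd_lcm_from_factorizations` are ours, and the caller is assumed to supply enough primes to factor both numbers completely (otherwise an error is raised).

```python
import math


def first_primes(count):
    """Return the first `count` primes: 2, 3, 5, ..."""
    primes = []
    candidate = 2
    while len(primes) < count:
        if all(candidate % p != 0 for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def exponents(n, primes):
    """Exponents of a complete prime factorization of n over `primes`."""
    result = []
    for p in primes:
        e = 0
        while n % p == 0:
            n //= p
            e += 1
        result.append(e)
    if n != 1:
        raise ValueError("not enough primes to factor the number completely")
    return result


def gcd_lcm_from_factorizations(a, b, primes):
    """Take the min of each pair of exponents for the gcd, the max for the lcm."""
    ea, eb = exponents(a, primes), exponents(b, primes)
    g = math.prod(p ** min(x, y) for p, x, y in zip(primes, ea, eb))
    m = math.prod(p ** max(x, y) for p, x, y in zip(primes, ea, eb))
    return g, m


primes = first_primes(5)        # [2, 3, 5, 7, 11]
print(exponents(350, primes))   # [1, 0, 2, 1, 0]
print(exponents(1815, primes))  # [0, 1, 1, 0, 2]
g, m = gcd_lcm_from_factorizations(350, 1815, primes)
print(g, m)                     # 5 127050
print(g * m == 350 * 1815)      # True
```

In practice the Euclidean Algorithm is a much faster way to get the gcd, since it avoids factoring altogether; the sketch above is meant only to mirror the computation with exponents.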
We will now show that the equation ๐๐ = gcd(๐, ๐) ⋅ lcm(๐, ๐) is true for all positive integers ๐ and ๐.
Before we state and prove the theorem, note that min{๐ฅ, ๐ฆ} + max{๐ฅ, ๐ฆ} = ๐ฅ + ๐ฆ (check this!).
Theorem 12.10: Let ๐, ๐ ∈ โค+ . Then gcd(๐, ๐) ⋅ lcm(๐, ๐) = ๐๐.
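Proof: Let $a = p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n}$ and $b = p_0^{b_0} p_1^{b_1} \cdots p_n^{b_n}$ be complete prime factorizations of $a$ and $b$. Then
$$\begin{aligned}
\gcd(a, b) \cdot \operatorname{lcm}(a, b)
&= p_0^{\min\{a_0, b_0\}} p_1^{\min\{a_1, b_1\}} \cdots p_n^{\min\{a_n, b_n\}} \cdot p_0^{\max\{a_0, b_0\}} p_1^{\max\{a_1, b_1\}} \cdots p_n^{\max\{a_n, b_n\}} \\
&= p_0^{\min\{a_0, b_0\}} p_0^{\max\{a_0, b_0\}} p_1^{\min\{a_1, b_1\}} p_1^{\max\{a_1, b_1\}} \cdots p_n^{\min\{a_n, b_n\}} p_n^{\max\{a_n, b_n\}} \\
&= p_0^{\min\{a_0, b_0\} + \max\{a_0, b_0\}} p_1^{\min\{a_1, b_1\} + \max\{a_1, b_1\}} \cdots p_n^{\min\{a_n, b_n\} + \max\{a_n, b_n\}} \\
&= p_0^{a_0 + b_0} p_1^{a_1 + b_1} \cdots p_n^{a_n + b_n} \\
&= p_0^{a_0} p_0^{b_0} p_1^{a_1} p_1^{b_1} \cdots p_n^{a_n} p_n^{b_n} \\
&= p_0^{a_0} p_1^{a_1} \cdots p_n^{a_n} \cdot p_0^{b_0} p_1^{b_1} \cdots p_n^{b_n} \\
&= ab
\end{aligned}$$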
โก
We will finish this lesson with the Euclidean Algorithm. This is an algorithm for computing the gcd of
two positive integers. It also provides a method for expressing the gcd as a linear combination of the
two integers.
Theorem 12.11 (The Euclidean Algorithm): Let $a, b \in \mathbb{Z}^+$ with $a \geq b$. Let $r_0 = a$ and $r_1 = b$. Apply the
division algorithm to $r_0$ and $r_1$ to find integers $q_1$ and $r_2$ such that $r_0 = r_1 q_1 + r_2$, where $0 \leq r_2 < r_1$.
Iterate this process to get $r_j = r_{j+1} q_{j+1} + r_{j+2}$, where $0 \leq r_{j+2} < r_{j+1}$, for $j = 0, 1, \ldots, k - 1$, stopping at
the first $k$ for which $r_{k+1} = 0$. Then $\gcd(a, b) = r_k$.
You will be asked to prove the Euclidean Algorithm in Problem 12 below.
Example 12.12: Let’s use the Euclidean Algorithm to find gcd(305, 1040).
1040 = 305 ⋅ 3 + 125
305 = 125 ⋅ 2 + 55
125 = 55 ⋅ 2 + 15
55 = 15 ⋅ 3 + 10
15 = 10 ⋅ 1 + 5
10 = 5 ⋅ 2 + 0
So, gcd(305, 1040) = 5.
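The repeated divisions in Example 12.12 can be reproduced with a few lines of Python. This is a minimal sketch; the function name `euclidean_steps` is our own, and it assumes both inputs are positive integers.

```python
def euclidean_steps(a, b):
    """Run the Euclidean Algorithm on positive integers a and b.
    Returns the list of rows (dividend, divisor, quotient, remainder)
    together with gcd(a, b), the last nonzero remainder."""
    rows = []
    while b != 0:
        q, r = divmod(a, b)  # a = b * q + r with 0 <= r < b
        rows.append((a, b, q, r))
        a, b = b, r
    return rows, a


rows, g = euclidean_steps(1040, 305)
for dividend, divisor, q, r in rows:
    print(f"{dividend} = {divisor} ⋅ {q} + {r}")
print("gcd =", g)  # gcd = 5
```

The printed rows are the six equations of Example 12.12, in the same order.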
Notes: (1) In this example, we have ๐ = ๐0 = 1040 and ๐ = ๐1 = 305. By the Division Algorithm we
can write $1040 = 305 q_1 + r_2$, where $0 \leq r_2 < 305$. To find $q_1$, we are simply looking for the largest
integer ๐ such that 305๐ ≤ 1040. Well, 305 ⋅ 3 = 915 and 305 ⋅ 4 = 1220. So, 4 is too big and
therefore, we let ๐1 = 3. It follows that ๐2 = 1040 − 305 ⋅ 3 = 1040 − 915 = 125.
We now repeat the procedure using ๐1 = 305 and ๐2 = 125 to get 305 = 125 ⋅ 2 + 55. Notice that
125 ⋅ 3 = 375, which is too big because 375 > 305. This is why we let ๐2 = 2. It follows that
๐3 = 305 − 125 ⋅ 2 = 305 − 250 = 55.
Continuing this process, we eventually wind up with 10 = 5 ⋅ 2 + 0, so that ๐7 = 0. By Theorem 12.11,
gcd(305,1040) = ๐6 = 5.
(2) As we go through the algorithm, we get ๐0 = 1040, ๐1 = 305, ๐2 = 125, ๐3 = 55, ๐4 = 15,
๐5 = 10, ๐6 = 5, and ๐7 = 0.
We also get ๐1 = 3, ๐2 = 2, ๐3 = 2, ๐4 = 3, ๐5 = 1, and ๐6 = 2.
(3) We can now go backwards through the algorithm to express gcd(305, 1040) as a linear
combination of 305 and 1040.
We start with the second to last line (line 5): 15 = 10 ⋅ 1 + 5. We solve this equation for 5 to get
5 = 15 − 1 ⋅ 10.
Working backwards, we next look at line 4: 55 = 15 ⋅ 3 + 10. We solve this equation for 10 and then
substitute into the previous equation: 10 = 55 − 15 ⋅ 3. After substituting, we get
5 = 15 − 1 ⋅ 10 = 15 − 1(55 − 15 ⋅ 3)
We then distribute and group all the 15’s together and all the 55’s together. So, we have
5 = 15 − 1 ⋅ 10 = 15 − 1(55 − 15 ⋅ 3) = 15 − 1 ⋅ 55 + 3 ⋅ 15 = 4 ⋅ 15 − 1 ⋅ 55.
Line 3 is next: 125 = 55 ⋅ 2 + 15. We solve this equation for 15 to get 15 = 125 − 2 ⋅ 55. And once
again we now substitute into the previous equation to get
5 = 4 ⋅ 15 − 1 ⋅ 55 = 4(125 − 2 ⋅ 55) − 1 ⋅ 55 = 4 ⋅ 125 − 8 ⋅ 55 − 1 ⋅ 55 = 4 ⋅ 125 − 9 ⋅ 55.
Let’s go to line 2: 305 = 125 ⋅ 2 + 55. We solve this equation for 55 to get 55 = 305 − 2 ⋅ 125.
Substituting into the previous equation gives us
5 = 4 ⋅ 125 − 9 ⋅ 55 = 4 ⋅ 125 − 9(305 − 2 ⋅ 125)
= 4 ⋅ 125 − 9 ⋅ 305 + 18 ⋅ 125 = 22 ⋅ 125 − 9 ⋅ 305.
And finally line 1: 1040 = 305 ⋅ 3 + 125. Solving this equation for 125 gives us 125 = 1040 − 3 ⋅ 305.
Substituting into the previous equation gives
5 = 22 ⋅ 125 − 9 ⋅ 305 = 22(1040 − 3 ⋅ 305) − 9 ⋅ 305
= 22 ⋅ 1040 − 66 ⋅ 305 − 9 ⋅ 305 = 22 ⋅ 1040 − 75 ⋅ 305.
So, we see that gcd(305, 1040) = 5 = 22 ⋅ 1040 − 75 ⋅ 305 = – 75 ⋅ 305 + 22 ⋅ 1040.
(4) With a little practice, the computations done in Note 3 can be done fairly quickly. Here is what the
quicker computation might look like:
5 = 15 − 1 ⋅ 10 = 15 − 1 ⋅ (55 − 15 ⋅ 3) = 4 ⋅ 15 − 1 ⋅ 55 = 4(125 − 55 ⋅ 2) − 1 ⋅ 55 = 4 ⋅ 125 − 9 ⋅ 55
= 4 ⋅ 125 − 9(305 − 125 ⋅ 2) = 22 ⋅ 125 − 9 ⋅ 305 = 22(1040 − 305 ⋅ 3) − 9 ⋅ 305 = 22 ⋅ 1040 − 75 ⋅ 305
So, 5 = gcd(305, 1040) = – 75 ⋅ 305 + 22 ⋅ 1040.
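The back-substitution of Notes 3 and 4 can also be automated. The sketch below uses the standard iterative form of the extended Euclidean algorithm, which carries the coefficients forward through the divisions instead of working backwards from the last line; it is a swapped-in technique, not the procedure written out in the notes, although for this example it arrives at the same coefficients. The name `extended_gcd` is ours.

```python
def extended_gcd(a, b):
    """Return (g, m, n) with g = gcd(a, b) = m*a + n*b for positive a, b."""
    # Invariants maintained throughout the loop:
    #   old_r = old_m * a + old_n * b   and   r = m * a + n * b
    old_r, r = a, b
    old_m, m = 1, 0
    old_n, n = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_m, m = m, old_m - q * m
        old_n, n = n, old_n - q * n
    return old_r, old_m, old_n


g, m, n = extended_gcd(305, 1040)
print(g, m, n)               # 5 -75 22
print(m * 305 + n * 1040)    # 5
```

Keep in mind that the coefficients in such a linear combination are not unique, so a different method may produce a different (equally valid) pair.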
Problem Set 12
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Write each of the following positive integers as a product of prime factors in canonical form:
(i) 9
(ii) 13
(iii) 21
(iv) 30
(v) 44
(vi) 693
(vii) 67,500
(viii) 384,659
(ix) 9,699,690
2. List all prime numbers less than 100.
3. Find the gcd and lcm of each of the following sets of numbers:
(i) {4, 6}
(ii) {12, 180}
(iii) {2, 3, 5}
(iv) {14, 21, 77}
(v) {720, 2448, 5400}
(vi) {$2^{17} \cdot 5^4 \cdot 11^9 \cdot 23$, $2^5 \cdot 3^2 \cdot 7^4 \cdot 11^3 \cdot 13$}
LEVEL 2
4. Determine if each of the following numbers is prime:
(i) 101
(ii) 399
(iii) 1829
(iv) 1933
(v) 8051
(vi) 13,873
(vii) 65,623
5. Use the division algorithm to find the quotient and remainder when 723 is divided by 17.
6. For ๐ ∈ โค+ , let ๐๐ = ๐! + 1. Determine if ๐๐ is prime for ๐ = 1, 2, 3, 4, 5, 6, and 7.
LEVEL 3
7. Use the Euclidean Algorithm to find gcd(825, 2205). Then express gcd(825, 2205) as a linear
combination of 825 and 2205.
8. Prove that if $n \in \mathbb{Z}$ with $n > 1$, then $n^3 + 1$ is not prime.
9. Prove that gcd(๐, ๐) | lcm(๐, ๐).
10. Let ๐, ๐, ๐ ∈ โค. Prove that gcd(๐, ๐) = gcd(๐ + ๐๐, ๐).
11. Let ๐, ๐, ๐, ๐ ∈ โค with ๐ = ๐๐ + ๐. Prove that gcd(๐, ๐) = gcd(๐, ๐).
LEVEL 4
12. Prove the Euclidean Algorithm: Let $a, b \in \mathbb{Z}^+$ with $a \geq b$. Let $r_0 = a$ and $r_1 = b$. Apply the division
algorithm to $r_0$ and $r_1$ to find integers $q_1$ and $r_2$ such that $r_0 = r_1 q_1 + r_2$, where $0 \leq r_2 < r_1$.
Iterate this process to get $r_j = r_{j+1} q_{j+1} + r_{j+2}$, where $0 \leq r_{j+2} < r_{j+1}$, for $j = 0, 1, \ldots, k - 1$, stopping
at the first $k$ for which $r_{k+1} = 0$. Then $\gcd(a, b) = r_k$.
13. Prove that if ๐|๐ and ๐|๐, then lcm(๐, ๐) | ๐.
14. Suppose that ๐, ๐ ∈ โค+ , gcd(๐, ๐) = 1, and ๐|๐๐. Prove that there are integers ๐ and ๐ such that
๐ = ๐๐, ๐|๐, and ๐|๐.
15. A prime triple is a sequence of three prime numbers of the form ๐, ๐ + 2, and ๐ + 4. For
example, 3, 5, 7 is a prime triple. Prove that there are no other prime triples.
LEVEL 5
16. If ๐, ๐ ∈ โค+ and gcd(๐, ๐) = 1, find the following:
(i) $\gcd(a, a + 1)$
(ii) $\gcd(a, a + 2)$
(iii) $\gcd(3a + 2, 5a + 3)$
(iv) $\gcd(a + b, a - b)$
(v) $\gcd(a + 2b, 2a + b)$
17. Find the smallest ideal of โค containing 6 and 15. Find the smallest ideal of โค containing 2 and
3. In general, find the smallest ideal of โค containing ๐ and ๐, where ๐, ๐ ∈ โค.
18. Find all subgroups of (โค, +) and all submonoids of (โค, +).
LESSON 13 – REAL ANALYSIS
LIMITS AND CONTINUITY
Strips and Rectangles
A horizontal strip in โ × โ is a set of the form โ × (๐, ๐) = {(๐ฅ, ๐ฆ) | ๐ < ๐ฆ < ๐}.
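Example 13.1: The horizontal strips $\mathbb{R} \times (-2, 1)$ and $\mathbb{R} \times (2.25, 2.75)$ can be visualized in the $xy$-plane
as follows:
[Figure: two panels showing the horizontal strips $\mathbb{R} \times (-2, 1)$ and $\mathbb{R} \times (2.25, 2.75)$]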
Similarly, a vertical strip in โ × โ is a set of the form (๐, ๐) × โ = {(๐ฅ, ๐ฆ) | ๐ < ๐ฅ < ๐}.
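Example 13.2: The vertical strips $(-3, 0) \times \mathbb{R}$ and $(0.8, 1) \times \mathbb{R}$ can be visualized in the $xy$-plane as
follows:
[Figure: two panels showing the vertical strips $(-3, 0) \times \mathbb{R}$ and $(0.8, 1) \times \mathbb{R}$]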
We will say that the horizontal strip โ × (๐, ๐) contains ๐ฆ if ๐ฆ ∈ โ and ๐ < ๐ฆ < ๐. Otherwise, we will
say that the horizontal strip excludes ๐ฆ.
Similarly, we will say that the vertical strip (๐, ๐) × โ contains ๐ฅ if ๐ฅ ∈ โ and ๐ < ๐ฅ < ๐. Otherwise,
we will say that the vertical strip excludes ๐ฅ.
Example 13.3: The horizontal strip โ × (2.25, 2.75) contains 2.5 and excludes 3. One way to visualize
this is to draw the horizontal lines ๐ฆ = 2.5 and ๐ฆ = 3. Below in the figure on the left, we used a solid
line for the line ๐ฆ = 2.5 because it is contained in the horizontal strip and we used a dashed line for
the line ๐ฆ = 3 because it is not contained in the horizontal strip.
Similarly, the vertical strip (0.8, 1) × โ contains 0.9 and excludes 2. Again, we can visualize this by
drawing the vertical lines ๐ฅ = 0.9 and ๐ฅ = 2. These vertical lines are shown below in the figure on the
right.
An open rectangle is a set of the form (๐, ๐) × (๐, ๐) = {(๐ฅ, ๐ฆ) | ๐ < ๐ฅ < ๐ ∧ ๐ < ๐ฆ < ๐}. Note that
the open rectangle (๐, ๐) × (๐, ๐) is the intersection of the horizontal strip โ × (๐, ๐) and the vertical
strip (๐, ๐) × โ. We will say that an open rectangle traps the point (๐ฅ, ๐ฆ) if ๐ฅ, ๐ฆ ∈ โ and (๐ฅ, ๐ฆ) is in the
open rectangle. Otherwise, we will say that (๐ฅ, ๐ฆ) escapes from the open rectangle.
Example 13.4: The open rectangle $R = (-3, 0) \times (-2, 1)$ is the intersection of the horizontal strip
$H = \mathbb{R} \times (-2, 1)$ and the vertical strip $V = (-3, 0) \times \mathbb{R}$. So, $R = H \cap V$. The rectangle $R$ traps $(-1, 0)$,
whereas $(-2, 3)$ escapes from $R$. This can be seen in the figure below on the left.
The open rectangle $R = (0.8, 1) \times (2.25, 2.75)$ is the intersection of the horizontal strip
$H = \mathbb{R} \times (2.25, 2.75)$ and the vertical strip $V = (0.8, 1) \times \mathbb{R}$. So, $R = H \cap V$. The rectangle $R$ traps
$(0.9, 2.5)$, whereas $(0.9, 2)$ escapes from $R$. This can be seen in the figure below on the right.
Observe that in this example, I chose points that escape the given rectangles in the vertical direction.
They fall outside the rectangle because they’re too high or too low. This is the only type of escape that
we will be interested in here. We do not care about points that escape to the left or right of a rectangle.
Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, and let $R = (a, b) \times (c, d)$ be an open rectangle. We say that $R$ traps $f$ if for
all $x \in (a, b)$, $R$ traps $(x, f(x))$. Otherwise we say that $f$ escapes from $R$.
Example 13.5: Let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = x + 1$. Consider the open rectangles
$R = (0, 2) \times (1, 3)$ and $S = (0, 2) \times (0, 2)$. Then $R$ traps $f$, as can be seen in the figure below on the
left, whereas $f$ escapes from $S$, as can be seen in the figure below on the right. I put a box around the
points of the form $(x, f(x))$ that escape from $S$. For example, the point $(1.2, f(1.2)) = (1.2, 2.2)$
escapes from $S$ because $0 < 1.2 < 2$, but $f(1.2) = 2.2 \geq 2$.
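We can look for escaping points numerically. The Python sketch below samples evenly spaced values of x in (a, b) and reports every sampled point (x, f(x)) whose height falls outside (c, d). The name `escaping_points` and the sampling scheme are our own assumptions. Note the limitation: a sample can exhibit an escape, but an empty result only suggests that the rectangle traps the function; it does not prove it. Exact rational arithmetic (`Fraction`) is used so that rounding cannot affect the comparisons.

```python
from fractions import Fraction


def escaping_points(f, rect, samples=200):
    """Sample x in the open interval (a, b) and return the points (x, f(x))
    whose second coordinate is not in the open interval (c, d)."""
    (a, b), (c, d) = rect
    a, b = Fraction(a), Fraction(b)
    escaped = []
    for k in range(1, samples):  # k/samples runs strictly between 0 and 1
        x = a + (b - a) * Fraction(k, samples)
        y = f(x)
        if not (c < y < d):
            escaped.append((x, y))
    return escaped


def f(x):
    return x + 1


R = ((0, 2), (1, 3))  # the rectangle (0, 2) × (1, 3)
S = ((0, 2), (0, 2))  # the rectangle (0, 2) × (0, 2)

print(escaping_points(f, R))           # []
print(len(escaping_points(f, S)) > 0)  # True
```

For the rectangle S, the sampled point with x = 6/5 = 1.2 is among the escapees, matching the point (1.2, 2.2) from the example.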
When we are checking the limiting behavior near a real number ๐, we don’t care if the point (๐, ๐(๐))
escapes. Therefore, before we define a limit, we need to modify our definitions of “traps” and
“escapes” slightly to account for this.
Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, and let $R = (a, b) \times (c, d)$ be an open rectangle. We say that $R$ traps $f$
around $r$ if for all $x \in (a, b) \setminus \{r\}$, $R$ traps $(x, f(x))$. Otherwise, we say $f$ escapes from $R$ around $r$.
Limits and Continuity
Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, and let $r, L \in \mathbb{R}$. We say that the limit of $f$ as $x$ approaches $r$ is $L$, written
$\lim_{x \to r} f(x) = L$, if for every horizontal strip $H$ that contains $L$ there is a vertical strip $V$ that contains $r$
such that the rectangle $H \cap V$ traps $f$ around $r$.
Technical note: According to the definition of limit just given, in order for $\lim_{x \to r} f(x)$ to exist, the set $A$
needs to contain a deleted neighborhood of $r$, say $N_\epsilon^{\odot}(r) = (r - \epsilon, r) \cup (r, r + \epsilon)$. As an example,
suppose that $A = \{0\}$ and $f: A \to \mathbb{R}$ is defined by $f(0) = 1$. What is the value of $\lim_{x \to 0} f(x)$? Well, any
rectangle of the form $H \cap V$ does not trap any points of the form $(x, f(x))$ with $x \neq 0$ simply because
$f(x)$ is not defined when $x \neq 0$. Therefore, given a horizontal strip $H$, there is no vertical strip $V$ such
that $H \cap V$ traps $f$ around 0, and so, $\lim_{x \to 0} f(x)$ does not exist. This agrees with our intuition.
As a less extreme example, suppose that $A = \mathbb{Q}$ and $f: \mathbb{Q} \to \mathbb{R}$ is the constant function where
$f(x) = 1$ for all $x \in \mathbb{Q}$. Then for any $r \in \mathbb{R}$, we should probably have $\lim_{x \to r} f(x) = 1$. But if we use our
current definition of limit, then $\lim_{x \to r} f(x)$ does not exist. A more general definition of limit would yield
finite values for limits defined on certain sets (like $\mathbb{Q}$) that do not contain a neighborhood of $r$.
Specifically, we really should insist only that for each $\epsilon \in \mathbb{R}^+$, $A \cap ((r - \epsilon, r + \epsilon) \setminus \{r\}) \neq \emptyset$. The
definition of limit given above could be modified slightly to accommodate this more general situation.
For example, we could change “$R$ traps $f$ around $r$” to “for all $x \in A \cap ((a, b) \setminus \{r\})$, $R$ traps
$(x, f(x))$.” If we were to use this more general definition, it is very important that we also insist that
the set $A$ has the property given at the beginning of this paragraph. Otherwise, we would have an issue
with the function $f$ defined at the beginning of this note. The interested reader may want to investigate
this.
In this lesson, we will avoid these more complicated domains and stick with the simpler definition of
limit. Let’s just always assume that if $\lim_{x \to r} f(x)$ exists, then $f$ is defined on some deleted neighborhood
of $r$.
Example 13.6: Let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = x + 1$, let $r = 1.5$, and let $L = 1$. Let’s show that
$\lim_{x \to 1.5} f(x) \neq 1$.
If $H = \mathbb{R} \times (0, 2)$ and $V$ is any vertical strip that contains 1.5, then $H \cap V$ does not trap $f$ around 1.5.
Indeed, if $V = (a, b) \times \mathbb{R}$, then if we let $x = \frac{1}{2}(1.5 + b)$, we will show that $x \in (a, b)$ and
$f(x) = \frac{1}{2}(1.5 + b) + 1 > 2$ (see the figure to the right).
To see that $x \in (a, b)$, note that since $b > 1.5$, we have
$x = \frac{1}{2}(1.5 + b) > \frac{1}{2}(1.5 + 1.5) = \frac{1}{2} \cdot 3 = 1.5 > a$, and we have
$x = \frac{1}{2}(1.5 + b) < \frac{1}{2}(b + b) = \frac{1}{2} \cdot 2b = b$.
To see that $f(x) > 2$, note that
$f(x) = \frac{1}{2}(1.5 + b) + 1 > \frac{1}{2}(1.5 + 1.5) + 1 = \frac{1}{2} \cdot 3 + 1 = 1.5 + 1 = 2.5 > 2$.
So, what is $\lim_{x \to 1.5} f(x)$ equal to? From the picture above, a good guess would be 2.5. To verify that this
is true, let $H = \mathbb{R} \times (c, d)$ be a horizontal strip that contains 2.5. Next, let $V = (c - 1, d - 1) \times \mathbb{R}$. Since
$c < 2.5 < d$, we have $c - 1 < 1.5 < d - 1$, and so, $V$ contains 1.5. We
will show that $H \cap V = (c - 1, d - 1) \times (c, d)$ traps $f$ around 1.5. Let $x \in (c - 1, d - 1) \setminus \{1.5\}$, so
that $c - 1 < x < d - 1$ and $x \neq 1.5$. Adding 1 to each part of this sequence of inequalities gives
$c < x + 1 < d$, so that $c < f(x) < d$, or equivalently, $f(x) \in (c, d)$. Since $x \in (c - 1, d - 1) \setminus \{1.5\}$
and $f(x) \in (c, d)$, it follows that $(x, f(x)) \in (c - 1, d - 1) \times (c, d) = H \cap V$. Therefore, $H \cap V$ traps
$f$ around 1.5.
Notes: (1) The figures above give a visual representation of the argument just presented. In the figure
on the left, we let ๐ = 2 and ๐ = 3, so that ๐ป = โ × (2, 3). Our choice of ๐ is then (1, 2) × โ, and
therefore, ๐ป ∩ ๐ = (1, 2) × (2, 3). Now, if 1 < ๐ฅ < 2, then 2 < ๐ฅ + 1 < 3. So, (๐ฅ, ๐(๐ฅ)) ∈ ๐ป ∩ ๐.
In the figure on the right, we started with a thinner horizontal strip without being specific about its
exact definition. Notice that we then need to use a thinner vertical strip to prevent ๐ from escaping. If
the vertical strip were just a little wider on the right, then some points of the form (๐ฅ, ๐(๐ฅ)) would
escape the rectangle because they would be too high. If the vertical strip were just a little wider on the
left, then some points of the form (๐ฅ, ๐(๐ฅ)) would escape the rectangle because they would be too
low.
(2) Notice that in this example, the point (1.5, ๐(1.5)) itself always stays in the rectangle. In the
argument given, we excluded this point from consideration. Even if (1.5, ๐(1.5)) were to escape the
rectangle, it would not change the result here. We would still have $\lim_{x \to 1.5} f(x) = 2.5$. I indicated the
parts of the argument where $(1.5, f(1.5))$ was being excluded from consideration in Example 13.6
above by placing rectangles around that part of the text. If we delete all the parts of the argument
inside those rectangles, the resulting argument would still be correct. We will examine this situation
more carefully in the next example.
If we modify the definition of limit by getting rid of “around ๐,” insisting that ๐ ∈ ๐ด, and replacing ๐ฟ by
๐(๐), we get the definition of continuity. Specifically, we have the following definition.
Let ๐ด ⊆ โ, let ๐: ๐ด → โ, and let ๐ ∈ ๐ด. We say that the function ๐ is continuous at ๐ if for every
horizontal strip ๐ป that contains ๐(๐) there is a vertical strip ๐ that contains ๐ such that the rectangle
๐ป ∩ ๐ traps ๐.
Example 13.7:
1. If we delete all the text that I placed in rectangles in Example 13.6 above, then the resulting
argument shows that the function ๐ defined by ๐(๐ฅ) = ๐ฅ + 1 is continuous at ๐ฅ = 1.5.
To summarize, given a horizontal strip ๐ป containing ๐(1.5) = 2.5, we found a vertical strip ๐
containing 1.5 such that ๐ป ∩ ๐ traps ๐. Notice once again that in this example we do not
exclude ๐ฅ = 1.5 from consideration, and when we mention trapping ๐, we do not say “around
1.5.” We need to trap $(1.5, f(1.5)) = (1.5, 2.5)$ as well.
2. Let’s consider the function $g: \mathbb{R} \to \mathbb{R}$ defined by
$$g(x) = \begin{cases} x + 1 & \text{if } x \neq 1.5 \\ -2 & \text{if } x = 1.5 \end{cases}$$
This function is nearly identical to the function $f$ we have been discussing. It differs from the previous function
only at $x = 1.5$. It should follow that $\lim_{x \to 1.5} g(x) = \lim_{x \to 1.5} f(x)$. And, in fact it does. The same
exact argument that we gave in Example 13.6 shows that $\lim_{x \to 1.5} g(x) = 2.5$. The figures below
illustrate the situation.
This time however, we cannot delete the text inside the rectangles in Example 13.6. ๐ฅ = 1.5
needs to be excluded from consideration for the argument to go through. In the leftmost figure
below, we see that if ๐ป is the horizontal strip ๐ป = โ × (2, 3), then for any vertical strip
๐ = (๐, ๐) × โ that contains 1.5, the point (1.5, – 2) will escape the rectangle ๐ป ∩ ๐. Indeed,
๐ป ∩ ๐ = (๐, ๐) × (2, 3), and (1.5, ๐(1.5)) = (1.5, – 2) ∉ (๐, ๐) × (2, 3) because – 2 < 2. This
shows that $g$ is not continuous at $x = 1.5$.
[Figure: two panels, each showing the graph of $y = x + 1$ if $x \neq 1.5$, $y = -2$ if $x = 1.5$]
The strip game: Suppose we want to determine if $\lim_{x \to r} f(x) = L$. Consider the following game between
two players: Player 1 “attacks” by choosing a horizontal strip $H_0$ containing $L$. Player 2 then tries to
“defend” by choosing a vertical strip $V_0$ containing $r$ such that $H_0 \cap V_0$ traps $f$ around $r$. If Player 2
cannot find such a vertical strip, then Player 1 wins and $\lim_{x \to r} f(x) \neq L$. If Player 2 defends successfully,
then Player 1 chooses a new horizontal strip $H_1$ containing $L$. If Player 1 is smart, then he/she will
choose a “much thinner” horizontal strip that is contained in $H_0$ (compare the two figures above). The
thinner the strip, the harder it will be for Player 2 to defend. Player 2 once again tries to choose a
vertical strip $V_1$ such that $H_1 \cap V_1$ traps $f$ around $r$. This process continues indefinitely. Player 1 wins
the strip game if at some stage, Player 2 cannot defend successfully. Player 2 wins the strip game if he
or she defends successfully at every stage.
Player 1 has a winning strategy for the strip game if and only if $\lim_{x \to r} f(x) \neq L$, while Player 2 has a
winning strategy for the strip game if and only if $\lim_{x \to r} f(x) = L$.
Note that if it’s possible for Player 1 to win the strip game, then Player 1 can win with a single move—
just choose the horizontal strip immediately that Player 2 cannot defend against.
For example, if $f(x) = x + 1$, then $\lim_{x \to 1.5} f(x) \neq 1$. Player 1 can win the appropriate strip game
immediately by choosing the horizontal strip $H = \mathbb{R} \times (0, 2)$. Indeed, if Player 2 chooses any vertical
strip $V = (a, b) \times \mathbb{R}$ that contains 1.5, let $x \in (a, b)$ with $x > 1.5$. Then we have
$f(x) = x + 1 > 1.5 + 1 = 2.5 > 2$.
So, $(x, f(x))$ escapes $H \cap V$. In the figure to the right, we see that Player 1 has chosen $H = \mathbb{R} \times (0, 2)$
and Player 2 chose $V = (a, b) \times \mathbb{R}$ for some $a, b \in \mathbb{R}$ with $a < 1.5 < b$. The part of the line inside the
square is an illustration of where $f$ escapes $H \cap V$ between $a$ and $b$. Observe that no matter how much
thinner we try to make that vertical strip, if it contains 1.5, then it will contain a portion of the line
that is inside the square.
Now, if it’s possible for Player 2 to win the game, then we need to describe how Player 2 defends
against an arbitrary attack from Player 1. Suppose again that $f(x) = x + 1$ and we are trying to show
that $\lim_{x \to 1.5} f(x) = 2.5$. We have already seen how Player 2 can defend against an arbitrary attack from
Player 1 in Example 13.6. If at stage $n$, Player 1 attacks with the horizontal strip $H_n = \mathbb{R} \times (c, d)$, then
Player 2 can successfully defend with the vertical strip $V_n = (c - 1, d - 1) \times \mathbb{R}$.
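Player 2's strategy for f(x) = x + 1 at r = 1.5 can be played out on a computer. The sketch below is only an illustration: the names `defend` and `defense_holds` are ours, Player 1's attacks are assumed to be the symmetric strips around 2.5 with half-widths 1/10, 1/100, and so on, and the check is done on finitely many sample points, so it supports the argument of Example 13.6 without replacing it.

```python
from fractions import Fraction


def defend(c, d):
    """Player 2's reply to the attack H = R × (c, d): the vertical strip
    V = (c - 1, d - 1) × R, returned as the pair of endpoints."""
    return (c - 1, d - 1)


def defense_holds(c, d, r=Fraction(3, 2), samples=1000):
    """Check on sample points that H ∩ V traps f(x) = x + 1 around r."""
    a, b = defend(c, d)
    if not (a < r < b):  # the vertical strip must contain r
        return False
    for k in range(1, samples):
        x = a + (b - a) * Fraction(k, samples)
        if x == r:  # the point above r itself is excluded
            continue
        if not (c < x + 1 < d):
            return False
    return True


# Player 1 attacks with thinner and thinner strips around 2.5.
for n in range(1, 6):
    half_width = Fraction(1, 10 ** n)
    c, d = Fraction(5, 2) - half_width, Fraction(5, 2) + half_width
    print(n, defense_holds(c, d))
```

Each round prints True: shifting the attacked interval one unit to the left always produces a successful defense for this particular function.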
Equivalent Definitions of Limits and Continuity
The definitions of limit and continuity can be written using open intervals instead of strips. Specifically,
we have the following:
Theorem 13.1: Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, and let $r, L \in \mathbb{R}$. The following are equivalent:
1. $\lim_{x \to r} f(x) = L$.
2. For every open interval $(c, d)$ with $L \in (c, d)$, there is an open interval $(a, b)$ with $r \in (a, b)$
such that whenever $x \in (a, b)$ and $x \neq r$, $f(x) \in (c, d)$.
3. For every positive real number $\epsilon$, there is a positive real number $\delta$ such that whenever
$x \in (r - \delta, r + \delta)$ and $x \neq r$, $f(x) \in (L - \epsilon, L + \epsilon)$.
This is the first Theorem where we want to prove more than two statements equivalent. We will do
this with the following chain: 1 → 2 → 3 → 1. In other words, we will assume statement 1 and use it to
prove statement 2. We will then assume statement 2 and use it to prove statement 3. Finally, we will
assume statement 3 and use it to prove statement 1.
Proof of Theorem 13.1: (1→ 2) Suppose that $\lim_{x \to r} f(x) = L$ and let $L \in (c, d)$. Then the horizontal strip
$\mathbb{R} \times (c, d)$ contains $L$. Since $\lim_{x \to r} f(x) = L$, there is a vertical strip $(a, b) \times \mathbb{R}$ that contains $r$ such that
the rectangle $R = (a, b) \times (c, d)$ traps $f$ around $r$. Since the vertical strip $(a, b) \times \mathbb{R}$ contains $r$,
$r \in (a, b)$. Since the rectangle $R$ traps $f$ around $r$, for all $x \in (a, b) \setminus \{r\}$, $R$ traps $(x, f(x))$. In other
words, whenever $x \in (a, b)$ and $x \neq r$, we have $(x, f(x)) \in (a, b) \times (c, d)$, and thus, $f(x) \in (c, d)$.
(2→ 3) Suppose 2 holds and let $\epsilon$ be a positive real number. Then $L - \epsilon < L < L + \epsilon$, or equivalently,
$L \in (L - \epsilon, L + \epsilon)$. By 2, there is an open interval $(a, b)$ with $r \in (a, b)$ such that whenever $x \in (a, b)$
and $x \neq r$, we have $f(x) \in (L - \epsilon, L + \epsilon)$. Let $\delta = \min\{r - a, b - r\}$. Since $a < r < b$, we have $\delta > 0$.
Since $\delta \leq r - a$, we have
$-\delta \geq -(r - a) = -r + a$. Therefore, $r - \delta \geq r + (-r + a) = a$. Furthermore, since $\delta \leq b - r$, we
have $r + \delta \leq r + (b - r) = b$. So, $(r - \delta, r + \delta) \subseteq (a, b)$. If $x \in (r - \delta, r + \delta)$ and $x \neq r$, then since
$(r - \delta, r + \delta) \subseteq (a, b)$, $x \in (a, b)$. Therefore, $f(x) \in (L - \epsilon, L + \epsilon)$.
(3→ 1) Suppose 3 holds and $H = \mathbb{R} \times (c, d)$ is a horizontal strip that contains $L$. Since $c < L < d$, we
have $L - c > 0$ and $d - L > 0$. Therefore, $\epsilon = \min\{L - c, d - L\} > 0$. So, there is $\delta > 0$ such that
whenever $x \in (r - \delta, r + \delta)$ and $x \neq r$, then $f(x) \in (L - \epsilon, L + \epsilon)$. Let $V = (r - \delta, r + \delta) \times \mathbb{R}$. Then
$V$ contains $r$. We now show that $H \cap V = (r - \delta, r + \delta) \times (c, d)$ traps $f$ around $r$. Let
$x \in (r - \delta, r + \delta)$ with $x \neq r$. Then $f(x) \in (L - \epsilon, L + \epsilon)$. So, $f(x) > L - \epsilon \geq L - (L - c) = c$ and
$f(x) < L + \epsilon \leq L + (d - L) = d$. Therefore, $f(x) \in (c, d)$, and so, $H \cap V$ traps $f$ around $r$.
โก
Notes: (1) ๐ and ๐ฟ are Greek letters pronounced “epsilon” and “delta,” respectively. Mathematicians
tend to use these two symbols to represent arbitrarily small numbers.
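(2) If $a \in \mathbb{R}$ and $\epsilon > 0$, then the $\epsilon$-neighborhood of $a$ is the interval $N_\epsilon(a) = (a - \epsilon, a + \epsilon)$ and the
deleted $\epsilon$-neighborhood of $a$ is the “punctured” interval $N_\epsilon^{\odot}(a) = (a - \epsilon, a) \cup (a, a + \epsilon)$. We can
visualize the deleted $\epsilon$-neighborhood $N_\epsilon^{\odot}(a)$ as follows:
[Figure: number line with the points $a - \epsilon$, $a$, and $a + \epsilon$ marked]
For a specific example, let’s look at $N_2^{\odot}(1) = (1 - 2, 1) \cup (1, 1 + 2) = (-1, 1) \cup (1, 3)$.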
(3) The third part of Theorem 13.1 can be written in terms of neighborhoods as follows:
“For every positive real number ๐, there is a positive real number ๐ฟ such that whenever ๐ฅ ∈ ๐๐ฟโจ (๐),
๐(๐ฅ) ∈ ๐๐ (๐ฟ).”
(4) ๐ฅ ∈ (๐ − ๐, ๐ + ๐) is equivalent to ๐ − ๐ < ๐ฅ < ๐ + ๐. If we subtract ๐ from each part of this
inequality, we get – ๐ < ๐ฅ − ๐ < ๐. This last expression is equivalent to |๐ฅ − ๐| < ๐. So, we have the
following sequence of equivalences:
๐ฅ ∈ ๐๐ (๐) ⇔ ๐ฅ ∈ (๐ − ๐, ๐ + ๐) ⇔ ๐ − ๐ < ๐ฅ < ๐ + ๐ ⇔ |๐ฅ − ๐| < ๐.
(5) ๐ฅ ≠ ๐ is equivalent to ๐ฅ − ๐ ≠ 0. Since the absolute value of a real number can never be negative,
๐ฅ − ๐ ≠ 0 is equivalent to |๐ฅ − ๐| > 0. This can also be written 0 < |๐ฅ − ๐|. So, we have the following
sequence of equivalences:
๐ฅ ∈ ๐๐โจ (๐) ⇔ ๐ฅ ∈ (๐ − ๐, ๐) ∪ (๐, ๐ + ๐) ⇔ 0 < |๐ฅ − ๐| < ๐.
(6) The third part of Theorem 13.1 can be written using absolute values as follows:
“For every positive real number ๐, there is a positive real number ๐ฟ such that whenever
0 < |๐ฅ − ๐| < ๐ฟ, |๐(๐ฅ) − ๐ฟ| < ๐.”
(7) We can abbreviate the expression from Note 6 using quantifiers as follows:
∀๐ > 0 ∃๐ฟ > 0 (0 < |๐ฅ − ๐| < ๐ฟ → |๐(๐ฅ) − ๐ฟ| < ๐)
We will refer to this expression as the ๐ − ๐น definition of a limit.
For each equivalent formulation of a limit, we have a corresponding formulation for the definition of
continuity.
Theorem 13.2: Let ๐ด ⊆ โ, let ๐: ๐ด → โ, and let ๐ ∈ ๐ด. The following are equivalent:
1. ๐ is continuous at ๐.
2. For every open interval (๐, ๐) with ๐(๐) ∈ (๐, ๐), there is an open interval (๐, ๐) with ๐ ∈ (๐, ๐)
such that whenever ๐ฅ ∈ (๐, ๐), ๐(๐ฅ) ∈ (๐, ๐).
3. For every positive real number ๐, there is a positive real number ๐ฟ such that whenever
๐ฅ ∈ (๐ − ๐ฟ, ๐ + ๐ฟ), ๐(๐ฅ) ∈ (๐(๐) − ๐, ๐(๐) + ๐).
4. ∀๐ > 0 ∃๐ฟ > 0 (|๐ฅ − ๐| < ๐ฟ → |๐(๐ฅ) − ๐(๐)| < ๐).
The proof of Theorem 13.2 is left to the reader. It is very similar to the proof of Theorem 13.1.
Basic Examples
Example 13.8: Let’s use the $\epsilon$-$\delta$ definition of a limit to prove that $\lim_{x \to 1}(2x + 1) = 3$.
Analysis: Given $\epsilon > 0$, we need to find $\delta > 0$ so that $0 < |x - 1| < \delta$ implies $|(2x + 1) - 3| < \epsilon$. First note that $|(2x + 1) - 3| = |2x - 2| = |2(x - 1)| = |2||x - 1| = 2|x - 1|$. So, $|(2x + 1) - 3| < \epsilon$ is equivalent to $|x - 1| < \frac{\epsilon}{2}$. Therefore, $\delta = \frac{\epsilon}{2}$ should work.
Proof: Let $\epsilon > 0$ and let $\delta = \frac{\epsilon}{2}$. Suppose that $0 < |x - 1| < \delta$. Then we have
$$|(2x + 1) - 3| = |2x - 2| = |2(x - 1)| = |2||x - 1| = 2|x - 1| < 2\delta = 2 \cdot \frac{\epsilon}{2} = \epsilon.$$
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ (0 < |x - 1| < \delta \to |(2x + 1) - 3| < \epsilon)$.
Therefore, $\lim_{x \to 1}(2x + 1) = 3$. □
Notes: (1) Even though we’re using the “$\epsilon$-$\delta$ definition” instead of the “strip definition,” we can still visualize the situation in terms of the strip game. When we say “Let $\epsilon > 0$,” we can think of this as Player 1 “attacking” with the horizontal strip $H = \mathbb{R} \times (3 - \epsilon, 3 + \epsilon)$. In the proof above, Player 2 is then “defending” with the vertical strip $V = (1 - \frac{\epsilon}{2}, 1 + \frac{\epsilon}{2}) \times \mathbb{R}$. This defense is successful because when $1 - \frac{\epsilon}{2} < x < 1 + \frac{\epsilon}{2}$, we have $2 - \epsilon < 2x < 2 + \epsilon$, and so, $3 - \epsilon < 2x + 1 < 3 + \epsilon$, or equivalently, $2x + 1 \in (3 - \epsilon, 3 + \epsilon)$. In other words, for $x \in (1 - \frac{\epsilon}{2}, 1 + \frac{\epsilon}{2})$, $H \cap V$ traps $f$.
(2) Instead of playing the strip game, we can play the $\epsilon$-$\delta$ game. The idea is the same. Suppose we are trying to figure out if $\lim_{x \to r} f(x) = L$. Player 1 “attacks” by choosing a positive number $\epsilon$. This is equivalent to Player 1 choosing the horizontal strip $H = \mathbb{R} \times (L - \epsilon, L + \epsilon)$. Player 2 then tries to “defend” by finding a positive number $\delta$. This is equivalent to Player 2 choosing the vertical strip $V = (r - \delta, r + \delta) \times \mathbb{R}$. The defense is successful if whenever $x \in (r - \delta, r + \delta)$, $x \ne r$, we have $f(x) \in (L - \epsilon, L + \epsilon)$. This is equivalent to $H \cap V$ trapping $f$ around $r$.
The figure to the right shows what happens during one round of the $\epsilon$-$\delta$ game corresponding to checking if $\lim_{x \to 1}(2x + 1) = 3$. In the figure, Player 1 chooses $\epsilon = 0.5$, so that $L - \epsilon = 3 - 0.5 = 2.5$ and $L + \epsilon = 3 + 0.5 = 3.5$. Notice how we drew the corresponding horizontal strip $H = \mathbb{R} \times (2.5, 3.5)$. According to our proof, Player 2 chooses $\delta = \frac{\epsilon}{2} = \frac{0.5}{2} = 0.25$. So $r - \delta = 1 - 0.25 = 0.75$ and $r + \delta = 1 + 0.25 = 1.25$. Notice how we drew the corresponding vertical strip $V = (0.75, 1.25) \times \mathbb{R}$. Also notice how the rectangle $H \cap V$ traps $f$.
(3) Observe that the value for ๐ฟ that Player 2 chose here is the largest value of ๐ฟ that would result in a
successful defense. If we widen the vertical strip at all on either side, then ๐ would escape from the
resulting rectangle. However, any smaller value of ๐ฟ will still work. If we shrink the vertical strip, then
๐ is still trapped. After all, we have less that we need to trap.
(4) In the next round, Player 1 will want to choose a smaller value for $\epsilon$. If Player 1 chooses a larger value for $\epsilon$, then the same $\delta$ that was already played will work to defend against that larger $\epsilon$. But for this problem, it doesn’t matter how small a value for $\epsilon$ Player 1 chooses: Player 1 simply cannot win. All Player 2 needs to do is defend with $\delta = \frac{\epsilon}{2}$ (or any smaller positive number).
(5) Essentially the same argument can be used to show that the function $f$ defined by $f(x) = 2x + 1$ is continuous at $x = 1$. Simply replace the expression $0 < |x - 1| < \delta$ by the expression $|x - 1| < \delta$ everywhere it appears in the proof. The point is that $f(1) = 2 \cdot 1 + 1 = 3$. Since this value is equal to $\lim_{x \to 1}(2x + 1)$, we don’t need to exclude $x = 1$ from consideration when trying to trap $f$.
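The defense $\delta = \frac{\epsilon}{2}$ from Example 13.8 can be spot-checked numerically. The sketch below is only an illustration and not part of the proof: the helper name `traps` and the sampling scheme are assumptions made here, and checking finitely many points can expose a failed defense but can never establish a limit.

```python
from fractions import Fraction

def f(x):
    return 2 * x + 1

def traps(eps, delta, r=Fraction(1), L=Fraction(3), samples=1000):
    """Spot-check |f(x) - L| < eps at sample points with 0 < |x - r| < delta."""
    for k in range(1, samples):
        t = Fraction(k, samples)  # 0 < t < 1, so 0 < |x - r| < delta below
        for x in (r - t * delta, r + t * delta):
            if abs(f(x) - L) >= eps:
                return False
    return True

eps = Fraction(1, 2)
print(traps(eps, eps / 2))          # Player 2's defense, delta = 1/4
print(traps(eps, Fraction(3, 10)))  # a slightly wider vertical strip
```

The first call reports a successful defense and the second a failed one, in line with Note 3: widening the vertical strip beyond $\frac{\epsilon}{2}$ lets $f$ escape. Exact rational arithmetic is used so that rounding does not affect the comparisons.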
Example 13.9: Let’s use the $\epsilon$-$\delta$ definition of a limit to prove that $\lim_{x \to 3}(x^2 - 2x + 1) = 4$.
Analysis: This is quite a bit more difficult than Example 13.8.
Given ๐ > 0, we need to find ๐ฟ > 0 so that 0 < |๐ฅ − 3| < ๐ฟ implies |(๐ฅ 2 − 2๐ฅ + 1) − 4| < ๐. First
note that |(๐ฅ 2 − 2๐ฅ + 1) − 4| = |๐ฅ 2 − 2๐ฅ − 3| = |(๐ฅ − 3)(๐ฅ + 1)| = |๐ฅ − 3||๐ฅ + 1|. Therefore,
|(๐ฅ 2 − 2๐ฅ + 1) − 4| < ๐ is equivalent to |๐ฅ − 3||๐ฅ + 1| < ๐.
There is a small complication here. The $|x - 3|$ is not an issue because we’re going to be choosing $\delta$ so that this expression is small enough. But to make the argument work we also need to keep $|x + 1|$ from getting too large. Remember from Note 3 after Example 13.8 that if we find a value for $\delta$ that works, then any smaller positive number will work too. This allows us to start by assuming that $\delta$ is at most some fixed positive number of our choosing. So, let’s just assume that $\delta \le 1$ and see what effect that has on $|x + 1|$.
Well, if ๐ฟ ≤ 1 and 0 < |๐ฅ − 3| < ๐ฟ, then |๐ฅ − 3| < 1. Therefore, – 1 < ๐ฅ − 3 < 1. We now add 4 to
each part of this inequality to get 3 < ๐ฅ + 1 < 5. Since – 5 < 3, this implies that – 5 < ๐ฅ + 1 < 5,
which is equivalent to |๐ฅ + 1| < 5.
So, if we assume that $\delta \le 1$, then $|(x^2 - 2x + 1) - 4| = |x - 3||x + 1| < \delta \cdot 5 = 5\delta$. Therefore, if we want to make sure that $|(x^2 - 2x + 1) - 4| < \epsilon$, then it suffices to choose $\delta$ so that $5\delta \le \epsilon$, as long as we also have $\delta \le 1$. So, we will let $\delta = \min\{1, \frac{\epsilon}{5}\}$.
Proof: Let $\epsilon > 0$ and let $\delta = \min\{1, \frac{\epsilon}{5}\}$. Suppose that $0 < |x - 3| < \delta$. Then since $\delta \le 1$, we have $|x - 3| < 1$, and so, $|x + 1| < 5$ (see the algebra in the analysis above). Also, since $\delta \le \frac{\epsilon}{5}$, we have $|x - 3| < \frac{\epsilon}{5}$. It follows that
$$|(x^2 - 2x + 1) - 4| = |x^2 - 2x - 3| = |x - 3||x + 1| < \frac{\epsilon}{5} \cdot 5 = \epsilon.$$
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ (0 < |x - 3| < \delta \to |(x^2 - 2x + 1) - 4| < \epsilon)$.
Therefore, $\lim_{x \to 3}(x^2 - 2x + 1) = 4$. □
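As a sanity check on the choice $\delta = \min\{1, \frac{\epsilon}{5}\}$ in Example 13.9, the following sketch samples points with $0 < |x - 3| < \delta$ and tests the inequality. The names `g`, `delta_for`, and `check` are hypothetical helpers introduced only for this illustration; sampling is evidence, not a proof.

```python
from fractions import Fraction

def g(x):
    return x * x - 2 * x + 1

def delta_for(eps):
    # The choice made in the proof: delta = min{1, eps/5}.
    return min(Fraction(1), eps / 5)

def check(eps, samples=500):
    """Spot-check |g(x) - 4| < eps at sample points with 0 < |x - 3| < delta."""
    delta = delta_for(eps)
    for k in range(1, samples):
        t = Fraction(k, samples)
        for x in (3 - t * delta, 3 + t * delta):
            if abs(g(x) - 4) >= eps:
                return False
    return True

for eps in (Fraction(100), Fraction(1), Fraction(1, 1000)):
    print(eps, delta_for(eps), check(eps))
```

For a large $\epsilon$ such as 100, the cap $\delta \le 1$ is what applies; for small $\epsilon$, the value $\frac{\epsilon}{5}$ takes over.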
Example 13.10: Let ๐, ๐ ∈ โ with ๐ ≠ 0. Let’s use the ๐ − ๐ฟ definition of continuity to prove that the
function ๐: โ → โ defined by ๐(๐ฅ) = ๐๐ฅ + ๐ is continuous everywhere.
A function of the form ๐(๐ฅ) = ๐๐ฅ + ๐, where ๐, ๐ ∈ โ and ๐ ≠ 0 is called a linear function. So, we
will now show that every linear function is continuous everywhere.
Analysis: Given $a \in \mathbb{R}$ and $\epsilon > 0$, we will find $\delta > 0$ so that $|x - a| < \delta$ implies $|f(x) - f(a)| < \epsilon$. First note that $|f(x) - f(a)| = |(mx + b) - (ma + b)| = |mx - ma| = |m||x - a|$. Therefore, $|f(x) - f(a)| < \epsilon$ is equivalent to $|x - a| < \frac{\epsilon}{|m|}$. So, $\delta = \frac{\epsilon}{|m|}$ should work.
Proof: Let $a \in \mathbb{R}$, let $\epsilon > 0$, and let $\delta = \frac{\epsilon}{|m|}$. Suppose that $|x - a| < \delta$. Then we have
$$|f(x) - f(a)| = |(mx + b) - (ma + b)| = |mx - ma| = |m||x - a| < |m|\delta = |m| \cdot \frac{\epsilon}{|m|} = \epsilon.$$
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ (|x - a| < \delta \to |f(x) - f(a)| < \epsilon)$. Therefore, $f$ is continuous at $x = a$. Since $a \in \mathbb{R}$ was arbitrary, $f$ is continuous everywhere. □
Notes: (1) We proved ∀๐ ∈ โ ∀๐ > 0 ∃๐ฟ > 0 ∀๐ฅ ∈ โ(|๐ฅ − ๐| < ๐ฟ → |๐(๐ฅ) − ๐(๐)| < ๐). In words,
we proved that for every real number ๐, given a positive real number ๐, we can find a positive real
number ๐ฟ such that whenever the distance between ๐ฅ and ๐ is less than ๐ฟ, the distance between ๐(๐ฅ)
and $f(a)$ is less than $\epsilon$. And of course, a simpler way to say this is “for every real number $a$, $f$ is continuous at $a$,” or “$\forall a \in \mathbb{R}$ ($f$ is continuous at $a$).”
(2) If we move the expression ∀๐ ∈ โ next to ∀๐ฅ ∈ โ, we get a concept that is stronger than continuity.
We say that a function ๐: ๐ด → โ is uniformly continuous on ๐ด if
∀๐ > 0 ∃๐ฟ > 0 ∀๐, ๐ฅ ∈ ๐ด (|๐ฅ − ๐| < ๐ฟ → |๐(๐ฅ) − ๐(๐)| < ๐).
(3) As a quick example of uniform continuity, every linear function is uniformly continuous on โ. We
can see this by modifying the proof above just slightly:
New proof: Let $\epsilon > 0$ and let $\delta = \frac{\epsilon}{|m|}$. Let $a, x \in \mathbb{R}$ and suppose that $|x - a| < \delta$. Then we have
$$|f(x) - f(a)| = |(mx + b) - (ma + b)| = |mx - ma| = |m||x - a| < |m|\delta = |m| \cdot \frac{\epsilon}{|m|} = \epsilon.$$
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ \forall a, x \in \mathbb{R}\ (|x - a| < \delta \to |f(x) - f(a)| < \epsilon)$.
Therefore, $f$ is uniformly continuous on $\mathbb{R}$.
(4) The difference between continuity and uniform continuity on a set ๐ด can be described as follows:
In both cases, an ๐ is given and then a ๐ฟ is chosen. For continuity, for each value of ๐ฅ, we can choose a
different ๐ฟ. For uniform continuity, once we choose a ๐ฟ for some value of ๐ฅ, we need to be able to use
the same ๐ฟ for every other value of ๐ฅ in ๐ด.
In terms of strips, once a horizontal strip is given, we need to be more careful how we choose a vertical
strip. As we check different ๐ฅ-values, we can move the vertical strip left and right. However, we are not
allowed to decrease the width of the vertical strip.
Try to come up with a function that is continuous on a set ๐ด, but not uniformly continuous on ๐ด. This
will be explored a little more in the problem set below.
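Note 3's claim that a single $\delta$ serves every point for a linear function can be illustrated with a small experiment. In the sketch below, the slope, intercept, and list of centers are arbitrary choices made for this example, and `uniform_delta_works` is a hypothetical helper; the point is only that $\delta = \frac{\epsilon}{|m|}$ is computed once, without looking at $a$.

```python
from fractions import Fraction

def uniform_delta_works(m, b, eps, centers, samples=200):
    """Spot-check that delta = eps/|m| works at every center a in `centers`."""
    delta = eps / abs(m)  # depends on eps only, not on the center a

    def f(x):
        return m * x + b

    for a in centers:
        for k in range(1, samples):
            t = Fraction(k, samples)
            for x in (a - t * delta, a + t * delta):
                if abs(f(x) - f(a)) >= eps:
                    return False
    return True

centers = [Fraction(n) for n in range(-1000, 1001, 250)]
print(uniform_delta_works(Fraction(-3), Fraction(2), Fraction(1, 10), centers))
```

Moving the center left or right corresponds to sliding the vertical strip without changing its width, as described in Note 4.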
Limit and Continuity Theorems
Theorem 13.3: Let $A, B \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, $g: B \to \mathbb{R}$, let $r \in \mathbb{R}$, and suppose that $\lim_{x \to r}[f(x)]$ and $\lim_{x \to r}[g(x)]$ are both finite real numbers. Then $\lim_{x \to r}[f(x) + g(x)] = \lim_{x \to r}[f(x)] + \lim_{x \to r}[g(x)]$.
Analysis: If $\lim_{x \to r}[f(x)] = L$, then given $\epsilon > 0$, there is $\delta > 0$ such that $0 < |x - r| < \delta$ implies $|f(x) - L| < \epsilon$. If $\lim_{x \to r}[g(x)] = K$, then given $\epsilon > 0$, there is $\delta > 0$ such that $0 < |x - r| < \delta$ implies $|g(x) - K| < \epsilon$. We should acknowledge something here. If we are given a single positive real value
for ๐, there is no reason that we would necessarily choose the same ๐ฟ for both ๐ and ๐. However, using
the fact that once we find a ๐ฟ that works, any smaller ๐ฟ will also work, it is easy to see that we could
choose a single value for ๐ฟ that would work for both ๐ and ๐. This should be acknowledged in some
way in the proof. There are several ways to work this into the argument. The way we will handle this is
to use ๐ฟ1 for ๐ and ๐ฟ2 for ๐, and then let ๐ฟ be the smaller of ๐ฟ1 and ๐ฟ2 .
Next, recall from Theorem 7.3 from Lesson 7 that the Triangle Inequality says that for all ๐ฅ, ๐ฆ ∈ โ,
|๐ฅ + ๐ฆ| ≤ |๐ฅ| + |๐ฆ|. (The theorem is stated to be true for all complex numbers, but since โ ⊆ โ, it is
equally true for all real numbers.) After assuming 0 < |๐ฅ − ๐| < ๐ฟ , we will use the Triangle Inequality
to write
|๐(๐ฅ) + ๐(๐ฅ) − (๐ฟ + ๐พ)| = |(๐(๐ฅ) − ๐ฟ) + (๐(๐ฅ) − ๐พ)| ≤ |๐(๐ฅ) − ๐ฟ| + |๐(๐ฅ) − ๐พ| < ๐ + ๐ = 2๐.
It seems that we wound up with 2๐ on the right-hand side instead of ๐. Now, if ๐ is an arbitrarily small
positive real number, then so is 2๐, and vice versa. So, getting 2๐ on the right-hand side instead of ๐
really isn’t too big of a deal. However, to be rigorous, we should prove that it is okay. There are at least
two ways we can handle this. One possibility is to prove a theorem that says 2๐ works just as well as ๐.
A second possibility (and the way I usually teach it in basic analysis courses) is to edit the original $\epsilon$’s, so it all works out to $\epsilon$ in the end. The idea is simple. If $\epsilon$ is a positive real number, then so is $\frac{\epsilon}{2}$. So, after we are given $\epsilon$, we can pretend that Player 1 (in the $\epsilon$-$\delta$ game) is “attacking” with $\frac{\epsilon}{2}$ instead. Let’s see how this all plays out in the proof.
Proof: Suppose that $\lim_{x \to r}[f(x)] = L$ and $\lim_{x \to r}[g(x)] = K$, and let $\epsilon > 0$. Since $\lim_{x \to r}[f(x)] = L$, there is $\delta_1 > 0$ such that $0 < |x - r| < \delta_1$ implies $|f(x) - L| < \frac{\epsilon}{2}$. Since $\lim_{x \to r}[g(x)] = K$, there is $\delta_2 > 0$ such that $0 < |x - r| < \delta_2$ implies $|g(x) - K| < \frac{\epsilon}{2}$. Let $\delta = \min\{\delta_1, \delta_2\}$ and suppose that $0 < |x - r| < \delta$. Then since $\delta \le \delta_1$, $|f(x) - L| < \frac{\epsilon}{2}$. Since $\delta \le \delta_2$, $|g(x) - K| < \frac{\epsilon}{2}$. By the Triangle Inequality, we have
$$|f(x) + g(x) - (L + K)| = |(f(x) - L) + (g(x) - K)| \le |f(x) - L| + |g(x) - K| < \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon.$$
So, $\lim_{x \to r}[f(x) + g(x)] = L + K = \lim_{x \to r}[f(x)] + \lim_{x \to r}[g(x)]$. □
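The recipe in the proof of Theorem 13.3 (answer each function with $\frac{\epsilon}{2}$, then take the smaller $\delta$) can be tried on a concrete pair of functions. The sketch below assumes, purely for illustration, $f(x) = 2x + 1$ and $g(x) = x^2 - 2x + 1$ at $r = 3$, where the limits are 7 and 4; the individual $\delta$ rules are the ones suggested by Examples 13.8 and 13.9, and all helper names are hypothetical.

```python
from fractions import Fraction

def f(x):
    return 2 * x + 1          # limit 7 as x -> 3

def g(x):
    return x * x - 2 * x + 1  # limit 4 as x -> 3

def delta_f(eps):
    # |f(x) - 7| = 2|x - 3|, so delta = eps/2 works for f.
    return eps / 2

def delta_g(eps):
    # As in Example 13.9: delta = min{1, eps/5} works for g.
    return min(Fraction(1), eps / 5)

def delta_sum(eps):
    # Answer each function with eps/2, then take the minimum.
    return min(delta_f(eps / 2), delta_g(eps / 2))

def check(eps, samples=500):
    """Spot-check |f(x) + g(x) - 11| < eps for sampled 0 < |x - 3| < delta."""
    delta = delta_sum(eps)
    for k in range(1, samples):
        t = Fraction(k, samples)
        for x in (3 - t * delta, 3 + t * delta):
            if abs(f(x) + g(x) - 11) >= eps:
                return False
    return True

print(all(check(Fraction(1, n)) for n in (1, 10, 1000)))
```

As before, passing a finite sample is a plausibility check only; the Triangle Inequality argument is what actually proves the theorem.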
Theorem 13.4: Let $A, B \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, $g: B \to \mathbb{R}$, let $r \in \mathbb{R}$, and suppose that $\lim_{x \to r}[f(x)]$ and $\lim_{x \to r}[g(x)]$ are both finite real numbers. Then $\lim_{x \to r}[f(x)g(x)] = \lim_{x \to r}[f(x)] \cdot \lim_{x \to r}[g(x)]$.
Analysis: As in Theorem 13.3, we let $\lim_{x \to r}[f(x)] = L$ and $\lim_{x \to r}[g(x)] = K$. If $\epsilon > 0$ is given, we will find a single $\delta > 0$ such that $0 < |x - r| < \delta$ implies $|f(x) - L| < \epsilon$ and $|g(x) - K| < \epsilon$ (like we did for
Theorem 13.3). Now, we want to show that whenever 0 < |๐ฅ − ๐| < ๐ฟ, |๐(๐ฅ)๐(๐ฅ) − ๐ฟ๐พ| < ๐. This is
quite a bit more challenging than anything we had to do in Theorem 13.3.
To show that |๐(๐ฅ)๐(๐ฅ) − ๐ฟ๐พ| < ๐ we will apply the Standard Advanced Calculus Trick (SACT – see
Note 7 following Example 4.5 from Lesson 4). We would like for |๐(๐ฅ) − ๐ฟ| and |๐(๐ฅ) − ๐พ| to appear
as factors in our expression. To make this happen, we subtract ๐ฟ๐(๐ฅ) from ๐(๐ฅ)๐(๐ฅ) to get
๐(๐ฅ)๐(๐ฅ) − ๐ฟ๐(๐ฅ) = (๐(๐ฅ) − ๐ฟ)๐(๐ฅ). To “undo the damage,” we then add back ๐ฟ๐(๐ฅ). The
application of SACT together with the Triangle Inequality looks like this:
|๐(๐ฅ)๐(๐ฅ) − ๐ฟ๐พ| = |(๐(๐ฅ)๐(๐ฅ) − ๐ฟ๐(๐ฅ)) + (๐ฟ๐(๐ฅ) − ๐ฟ๐พ)|
≤ |๐(๐ฅ)๐(๐ฅ) − ๐ฟ๐(๐ฅ)| + |๐ฟ๐(๐ฅ) − ๐ฟ๐พ| = |๐(๐ฅ) − ๐ฟ||๐(๐ฅ)| + |๐ฟ||๐(๐ฅ) − ๐พ|
< ๐|๐(๐ฅ)| + |๐ฟ|๐ = ๐(|๐(๐ฅ)| + |๐ฟ|).
Uh oh! How can we possibly get rid of |๐(๐ฅ)| + |๐ฟ|? We have seen how to handle a constant multiple
of ๐ in the proof of Theorem 13.3. But this time we are multiplying ๐ by a function of ๐ฅ. We will resolve
this issue by making sure we choose ๐ฟ small enough so that ๐(๐ฅ) is sufficiently bounded.
We do this by taking a specific value for $\epsilon$, and then using the fact that $\lim_{x \to r}[g(x)] = K$ to come up with a $\delta > 0$ and a bound $M$ for $g$ on the deleted $\delta$-neighborhood of $r$. For simplicity, let’s choose $\epsilon = 1$. Then since $\lim_{x \to r}[g(x)] = K$, we can find $\delta > 0$ such that $0 < |x - r| < \delta$ implies $|g(x) - K| < 1$. Now,
|๐(๐ฅ) − ๐พ| < 1 ⇔ – 1 < ๐(๐ฅ) − ๐พ < 1 ⇔ ๐พ − 1 < ๐(๐ฅ) < ๐พ + 1. For example, if ๐พ = 5, we would
have 4 < ๐(๐ฅ) < 6. Since this implies – 6 < ๐(๐ฅ) < 6, or equivalently, |๐(๐ฅ)| < 6, we could choose
๐ = 6. If, on the other hand, ๐พ = – 3, we would have – 4 < ๐(๐ฅ) < – 2. Since this implies
– 4 < ๐(๐ฅ) < 4, or equivalently, |๐(๐ฅ)| < 4, we could choose ๐ = 4. In general, we will let
๐ = max{|๐พ − 1|, |๐พ + 1|}.
We will now be able to get $|f(x)g(x) - LK| < \epsilon(|g(x)| + |L|) < \epsilon(M + |L|)$. Great! Now it looks just like the situation we had in Theorem 13.3. The number $M + |L|$ looks messier, but it is just a number, and so we can finish cleaning up the argument by replacing Player 1’s $\epsilon$-attacks by $\frac{\epsilon}{M + |L|}$.
Proof: Suppose that $\lim_{x \to r}[f(x)] = L$ and $\lim_{x \to r}[g(x)] = K$, and let $\epsilon > 0$. Since $\lim_{x \to r}[g(x)] = K$, there is $\delta_1 > 0$ such that $0 < |x - r| < \delta_1$ implies $|g(x) - K| < 1$. Now, $|g(x) - K| < 1$ is equivalent to $-1 < g(x) - K < 1$, or by adding $K$, $K - 1 < g(x) < K + 1$. Let $M = \max\{|K - 1|, |K + 1|\}$. Then, $0 < |x - r| < \delta_1$ implies $-M < g(x) < M$, or equivalently, $|g(x)| < M$. Note also that $M > 0$. Therefore, $M + |L| > 0$.
Now, since $\lim_{x \to r}[f(x)] = L$, there is $\delta_2 > 0$ such that $0 < |x - r| < \delta_2$ implies $|f(x) - L| < \frac{\epsilon}{M + |L|}$. Since $\lim_{x \to r}[g(x)] = K$, there is $\delta_3 > 0$ such that $0 < |x - r| < \delta_3$ implies $|g(x) - K| < \frac{\epsilon}{M + |L|}$. Let $\delta = \min\{\delta_1, \delta_2, \delta_3\}$ and suppose that $0 < |x - r| < \delta$. Then since $\delta \le \delta_1$, $|g(x)| < M$. Since $\delta \le \delta_2$, $|f(x) - L| < \frac{\epsilon}{M + |L|}$. Since $\delta \le \delta_3$, $|g(x) - K| < \frac{\epsilon}{M + |L|}$. By the Triangle Inequality (and SACT), we have
$$|f(x)g(x) - LK| = |(f(x)g(x) - Lg(x)) + (Lg(x) - LK)|$$
$$\le |f(x)g(x) - Lg(x)| + |Lg(x) - LK| = |f(x) - L||g(x)| + |L||g(x) - K|$$
$$< \frac{\epsilon}{M + |L|} \cdot M + |L| \cdot \frac{\epsilon}{M + |L|} = \frac{\epsilon}{M + |L|}(M + |L|) = \epsilon.$$
So, $\lim_{x \to r}[f(x)g(x)] = LK = \lim_{x \to r}[f(x)] \cdot \lim_{x \to r}[g(x)]$. □
Limits Involving Infinity
Recall that a horizontal strip in โ × โ is a set of the form โ × (๐, ๐) = {(๐ฅ, ๐ฆ) | ๐ < ๐ฆ < ๐} and a
vertical strip is a set of the form (๐, ๐) × โ = {(๐ฅ, ๐ฆ) | ๐ < ๐ฅ < ๐}. If we allow ๐ and/or ๐ to take on
the value – ∞ (in which case we say that the strip contains – ∞) and we allow ๐ and/or ๐ to take on
the value +∞ (in which case we say that the strip contains +∞), we can extend our definition of limit
to handle various situations involving infinity.
Example 13.11: Let’s take a look at the horizontal strip โ × (1, +∞) and the vertical strip
(– ∞, – 2) × โ in the ๐ฅ๐ฆ-plane. These can be visualized as follows:
โ × (1, +∞)
(– ∞, – 2) × โ
The horizontal strip โ × (1, +∞) contains +∞ and the vertical strip (– ∞, – 2) × โ contains – ∞.
Note: Strips that contain +∞ or – ∞ are usually called half planes. Here, we will continue to use the
expression “strip” because it allows us to handle all types of limits (finite and infinite) without having
to discuss every case individually.
By allowing strips to contain +∞ or – ∞, intersections of horizontal and vertical strips can now be
unbounded. The resulting open rectangles (๐, ๐) × (๐, ๐) can have ๐ and/or ๐ taking on the value – ∞
and ๐ and/or ๐ taking on the value +∞.
Example 13.12: Consider the horizontal strip $H = \mathbb{R} \times (1, +\infty)$ and the vertical strip $V = (-2, -1) \times \mathbb{R}$. The intersection of these strips is the open rectangle $R = (-2, -1) \times (1, +\infty)$. The rectangle $R$ traps $(-1.5, 3)$, whereas $(-1.5, 0)$ escapes from $R$. This can be seen in the figure below on the left. Also, consider the horizontal strip $H = \mathbb{R} \times (1, +\infty)$ and the vertical strip $V = (-\infty, -2) \times \mathbb{R}$. The intersection of these strips is the open rectangle $S = (-\infty, -2) \times (1, +\infty)$. The rectangle $S$ traps $(-3, 2)$, whereas $(-3, -1)$ escapes from $S$. This can be seen in the figure below on the right.
When we allow +∞ and – ∞, the definitions of “trap” and “escape” are just about the same. We just
need to make the following minor adjustment.
Small technicality: If ๐ = +∞ or ๐ = – ∞, then we define ๐น traps ๐ around ๐ to simply mean that ๐น
traps ๐. In other words, when checking a limit that is approaching +∞ or – ∞, we do not exclude any
point from consideration as we would do if ๐ were a finite real number.
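Example 13.13: Let $f: \mathbb{R} \setminus \{0\} \to \mathbb{R}$ be defined by $f(x) = \frac{1}{x^2}$, let $r = 0$, and let $L = +\infty$. Let’s show that $\lim_{x \to 0} f(x) = +\infty$. Let $H = \mathbb{R} \times (M, +\infty)$ be a horizontal strip that contains $+\infty$. We may assume that $M > 0$: if $M \le 0$, then $f(x) > 0 \ge M$ for every $x \ne 0$, and any vertical strip containing $0$ works. Next, let $V = (-\frac{1}{\sqrt{M}}, \frac{1}{\sqrt{M}}) \times \mathbb{R}$. We will show that $H \cap V = (-\frac{1}{\sqrt{M}}, \frac{1}{\sqrt{M}}) \times (M, +\infty)$ traps $f$ around $0$. Let $x \in (-\frac{1}{\sqrt{M}}, \frac{1}{\sqrt{M}}) \setminus \{0\}$, so that $-\frac{1}{\sqrt{M}} < x < \frac{1}{\sqrt{M}}$ and $x \ne 0$. Then $-\frac{1}{\sqrt{M}} < x < 0$ or $0 < x < \frac{1}{\sqrt{M}}$. In either case, $x^2 < \frac{1}{M}$, and therefore, $M < \frac{1}{x^2} = f(x)$. Since $x \in (-\frac{1}{\sqrt{M}}, \frac{1}{\sqrt{M}}) \setminus \{0\}$ and $f(x) \in (M, +\infty)$, it follows that $(x, f(x)) \in (-\frac{1}{\sqrt{M}}, \frac{1}{\sqrt{M}}) \times (M, +\infty) = H \cap V$. Therefore, $H \cap V$ traps $f$ around $0$. So, $\lim_{x \to 0} f(x) = +\infty$.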
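The vertical strip of half-width $\frac{1}{\sqrt{M}}$ used in Example 13.13 can be spot-checked with floating-point arithmetic. This sketch assumes $M > 0$, and the helper `check` and the sampled values of $M$ are choices made only for this illustration. Sample points stay strictly inside the strip, so $f(x)$ exceeds $M$ by a margin far larger than any rounding error.

```python
import math

def f(x):
    return 1 / x**2

def check(M, samples=1000):
    """Spot-check f(x) > M at sample points with 0 < |x| < 1/sqrt(M), for M > 0."""
    half_width = 1 / math.sqrt(M)
    for k in range(1, samples):
        t = k / samples  # 0 < t < 1
        for x in (-t * half_width, t * half_width):
            if not f(x) > M:
                return False
    return True

print(all(check(M) for M in (1, 2, 100, 10**6)))
```

The point $x = 0$ is never sampled, matching the requirement that the rectangle trap $f$ around $0$ rather than at $0$.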
Example 13.14: Let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = -x + 3$, let $r = +\infty$, and let $L = -\infty$. Let’s show that $\lim_{x \to +\infty} f(x) = -\infty$. Let $H = \mathbb{R} \times (-\infty, M)$ be a horizontal strip that contains $-\infty$. Next, let $V = (3 - M, +\infty) \times \mathbb{R}$. We will show that $H \cap V = (3 - M, +\infty) \times (-\infty, M)$ traps $f$ around $+\infty$ (or more simply, that $(3 - M, +\infty) \times (-\infty, M)$ traps $f$). Let $x \in (3 - M, +\infty)$, so that $x > 3 - M$. Then we have $-x < M - 3$, and so, $f(x) = -x + 3 < M$. Since $x \in (3 - M, +\infty)$ and $f(x) \in (-\infty, M)$, it follows that $(x, f(x)) \in (3 - M, +\infty) \times (-\infty, M) = H \cap V$. Therefore, $H \cap V$ traps $f$. So, $\lim_{x \to +\infty} f(x) = -\infty$.
We can find equivalent definitions for limits involving infinity on a case-by-case basis. We will do one
example here and you will look at others in Problem 15 in the problem set below.
Theorem 13.5: $\lim_{x \to r} f(x) = +\infty$ if and only if $\forall M > 0\ \exists \delta > 0\ (0 < |x - r| < \delta \to f(x) > M)$.
Proof: Suppose that $\lim_{x \to r} f(x) = +\infty$ and let $M > 0$. Let $H = \mathbb{R} \times (M, +\infty)$. Since $\lim_{x \to r} f(x) = +\infty$, there is a vertical strip $V = (a, b) \times \mathbb{R}$ that contains $r$ such that the rectangle $(a, b) \times (M, +\infty)$ traps $f$ around $r$. Let $\delta = \min\{r - a, b - r\}$, and let $0 < |x - r| < \delta$. Then $x \ne r$ and $-\delta < x - r < \delta$. So, $r - \delta < x < r + \delta$. Since $\delta \le r - a$, we have $a \le r - \delta$. Since $\delta \le b - r$, we have $b \ge r + \delta$. Therefore, $a < x < b$, and so, $x \in (a, b)$. Since $x \ne r$, $x \in (a, b)$, and $(a, b) \times (M, +\infty)$ traps $f$ around $r$, we have $f(x) \in (M, +\infty)$. Thus, $f(x) > M$.
Conversely, suppose that $\forall M > 0\ \exists \delta > 0\ (0 < |x - r| < \delta \to f(x) > M)$. Let $H = \mathbb{R} \times (m, +\infty)$ be a horizontal strip containing $+\infty$ and let $M = \max\{m, 1\}$. Then there is $\delta > 0$ such that $0 < |x - r| < \delta$ implies $f(x) > M$. Let $V = (r - \delta, r + \delta) \times \mathbb{R}$ and let $R = H \cap V = (r - \delta, r + \delta) \times (m, +\infty)$. We show that $R$ traps $f$ around $r$. Indeed, if $x \in (r - \delta, r + \delta)$ and $x \ne r$, then $0 < |x - r| < \delta$ and so, $f(x) > M$. So, $(x, f(x)) \in (r - \delta, r + \delta) \times (M, +\infty) \subseteq (r - \delta, r + \delta) \times (m, +\infty)$ (because $m \le M$). So, $R$ traps $f$ around $r$. □
One-sided Limits
Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, and let $r \in \mathbb{R}$ and $L \in \mathbb{R} \cup \{-\infty, +\infty\}$. We say that the limit of $f$ as $x$ approaches $r$ from the right is $L$, written $\lim_{x \to r^+} f(x) = L$, if for every horizontal strip $H$ that contains $L$ there is a vertical strip $V$ of the form $(r, b) \times \mathbb{R}$ such that the rectangle $H \cap V$ traps $f$.
Example 13.15: Let $f: \mathbb{R} \setminus \{1\} \to \mathbb{R}$ be defined by $f(x) = \frac{1}{x - 1}$, let $r = 1$, and let $L = +\infty$. Let’s show that $\lim_{x \to 1^+} f(x) = +\infty$. Let $H = \mathbb{R} \times (m, +\infty)$ be a horizontal strip that contains $+\infty$ and let $M = \max\{1, m\}$. Let $V = (1, \frac{1}{M} + 1) \times \mathbb{R}$. We will show that $H \cap V = (1, \frac{1}{M} + 1) \times (m, +\infty)$ traps $f$. Let $x \in (1, \frac{1}{M} + 1)$, so that $1 < x < \frac{1}{M} + 1$. Then we have $0 < x - 1 < \frac{1}{M}$, and so, $\frac{1}{x - 1} > M \ge m$. So, $f(x) > m$. Since $x \in (1, \frac{1}{M} + 1)$ and $f(x) \in (m, +\infty)$, $(x, f(x)) \in (1, \frac{1}{M} + 1) \times (m, +\infty) = H \cap V$. Therefore, $H \cap V$ traps $f$. So, $\lim_{x \to 1^+} f(x) = +\infty$.
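The role of $M = \max\{1, m\}$ in Example 13.15 is to keep $\frac{1}{M}$ defined and positive even when the strip's lower edge $m$ is zero or negative. The sketch below spot-checks the conclusion $f(x) > m$ for several values of $m$; the helper `check` and the chosen values are assumptions made only for this illustration.

```python
from fractions import Fraction

def f(x):
    return 1 / (x - 1)

def check(m, samples=500):
    """Spot-check f(x) > m at sample points in (1, 1/M + 1), where M = max{1, m}."""
    M = max(Fraction(1), m)
    for k in range(1, samples):
        x = 1 + Fraction(k, samples) / M  # 1 < x < 1/M + 1
        if not f(x) > m:
            return False
    return True

print(all(check(Fraction(m)) for m in (-5, 0, 1, 7, 1000)))
```

Only points to the right of 1 are sampled, which is exactly what distinguishes this one-sided limit from the two-sided case.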
Theorem 13.6: $\lim_{x \to r^+} f(x) = L$ ($L$ real) if and only if $\forall \epsilon > 0\ \exists \delta > 0\ (0 < x - r < \delta \to |f(x) - L| < \epsilon)$.
Proof: Suppose that $\lim_{x \to r^+} f(x) = L$ and let $\epsilon > 0$. Let $H = \mathbb{R} \times (L - \epsilon, L + \epsilon)$. Since $\lim_{x \to r^+} f(x) = L$, there is a vertical strip $V = (r, b) \times \mathbb{R}$ such that the rectangle $H \cap V = (r, b) \times (L - \epsilon, L + \epsilon)$ traps $f$. Let $\delta = b - r$, and let $0 < x - r < \delta$. Then $r < x < b$ and so, $x \in (r, b)$. Since $(r, b) \times (L - \epsilon, L + \epsilon)$ traps $f$, we have $f(x) \in (L - \epsilon, L + \epsilon)$. Thus, $L - \epsilon < f(x) < L + \epsilon$, or equivalently, $|f(x) - L| < \epsilon$.
Conversely, suppose that $\forall \epsilon > 0\ \exists \delta > 0\ (0 < x - r < \delta \to |f(x) - L| < \epsilon)$. Let $H = \mathbb{R} \times (c, d)$ be a horizontal strip containing $L$ and let $\epsilon = \min\{L - c, d - L\}$. Then there is $\delta > 0$ such that $0 < x - r < \delta$ implies $|f(x) - L| < \epsilon$. Let $V = (r, r + \delta) \times \mathbb{R}$ and $R = H \cap V = (r, r + \delta) \times (c, d)$. We show that $R$ traps $f$. If $x \in (r, r + \delta)$, then $r < x < r + \delta$, or equivalently, $0 < x - r < \delta$. So, $|f(x) - L| < \epsilon$. Therefore, $-\epsilon < f(x) - L < \epsilon$, or equivalently, $L - \epsilon < f(x) < L + \epsilon$. Thus, $(x, f(x)) \in (r, r + \delta) \times (L - \epsilon, L + \epsilon) \subseteq (r, r + \delta) \times (c, d)$ (Check this!). So, $R$ traps $f$. □
Problem Set 13
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = 5x - 1$.
(i) Prove that $\lim_{x \to 3} f(x) = 14$.
(ii) Prove that $f$ is continuous on $\mathbb{R}$.
2. Let $k, r \in \mathbb{R}$ and let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = k$. Prove that $\lim_{x \to r}[f(x)] = k$.
3. Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, let $k, r \in \mathbb{R}$, and suppose that $\lim_{x \to r}[f(x)]$ is a finite real number. Prove that $\lim_{x \to r}[kf(x)] = k \lim_{x \to r}[f(x)]$.
LEVEL 2
4. Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, and let $r \in \mathbb{R}$. Prove that $f$ is continuous at $r$ if and only if $\lim_{x \to r}[f(x)] = f(r)$.
5. Prove that every polynomial function ๐: โ → โ is continuous on โ.
LEVEL 3
6. Let $g: \mathbb{R} \to \mathbb{R}$ be defined by $g(x) = 2x^2 - 3x + 7$.
(i) Prove that $\lim_{x \to 1} g(x) = 6$.
(ii) Prove that $g$ is continuous on $\mathbb{R}$.
7. Suppose that ๐, ๐: โ → โ, ๐ ∈ โ, ๐ is continuous at ๐, and ๐ is continuous at ๐(๐). Prove that
๐ โ ๐ is continuous at ๐.
LEVEL 4
8. Let $h: \mathbb{R} \to \mathbb{R}$ be defined by $h(x) = \frac{x^3 - 4}{x^2 + 1}$. Prove that $\lim_{x \to 2} h(x) = \frac{4}{5}$.
9. Let $f: (0, \infty) \to \mathbb{R}$ be defined by $f(x) = \sqrt{x}$.
(i) Prove that $\lim_{x \to 25} f(x) = 5$.
(ii) Prove that $f$ is continuous on $(0, \infty)$.
(iii) Is $f$ uniformly continuous on $(0, \infty)$?
10. Let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = x^2$. Prove that $f$ is continuous on $\mathbb{R}$, but not uniformly continuous on $\mathbb{R}$.
11. Prove that if $\lim_{x \to r}[f(x)] > 0$, then there is a deleted neighborhood $N$ of $r$ such that $f(x) > 0$ for all $x \in N$.
12. Let $A \subseteq \mathbb{R}$, let $f: A \to \mathbb{R}$, let $r \in \mathbb{R}$, and suppose that $\lim_{x \to r}[f(x)]$ is a finite real number. Prove that there is $M \in \mathbb{R}$ and an open interval $(a, b)$ containing $r$ such that $|f(x)| \le M$ for all $x \in (a, b) \setminus \{r\}$.
13. Let $A \subseteq \mathbb{R}$, let $f, g, h: A \to \mathbb{R}$, let $r \in \mathbb{R}$, let $f(x) \le g(x) \le h(x)$ for all $x \in A \setminus \{r\}$, and suppose that $\lim_{x \to r}[f(x)] = \lim_{x \to r}[h(x)] = L$. Prove that $\lim_{x \to r}[g(x)] = L$.
LEVEL 5
14. Let $A \subseteq \mathbb{R}$, let $f, g: A \to \mathbb{R}$ such that $g(x) \ne 0$ for all $x \in A$, let $r \in \mathbb{R}$, and suppose that $\lim_{x \to r}[f(x)]$ and $\lim_{x \to r}[g(x)]$ are both finite real numbers such that $\lim_{x \to r}[g(x)] \ne 0$. Prove that $\lim_{x \to r}\left[\frac{f(x)}{g(x)}\right] = \frac{\lim_{x \to r} f(x)}{\lim_{x \to r} g(x)}$.
15. Give a reasonable equivalent definition for each of the following limits (like what was done in Theorem 13.5). $r$ and $L$ are finite real numbers.
(i) $\lim_{x \to r} f(x) = -\infty$
(ii) $\lim_{x \to +\infty} f(x) = L$
(iii) $\lim_{x \to -\infty} f(x) = L$
(iv) $\lim_{x \to +\infty} f(x) = +\infty$
(v) $\lim_{x \to +\infty} f(x) = -\infty$
(vi) $\lim_{x \to -\infty} f(x) = +\infty$
(vii) $\lim_{x \to -\infty} f(x) = -\infty$
16. Let $f(x) = -x^2 + x + 1$. Use the $M$-$K$ definition of an infinite limit (that you came up with in Problem 15) to prove $\lim_{x \to +\infty} f(x) = -\infty$.
17. Give a reasonable definition for each of the following limits (like what was done in Theorem 13.6). $r$ and $L$ are finite real numbers.
(i) $\lim_{x \to r^-} f(x) = L$
(ii) $\lim_{x \to r^+} f(x) = +\infty$
(iii) $\lim_{x \to r^+} f(x) = -\infty$
(iv) $\lim_{x \to r^-} f(x) = +\infty$
(v) $\lim_{x \to r^-} f(x) = -\infty$
18. Use the $M$-$\delta$ definition of a one-sided limit (that you came up with in Problem 17) to prove that $\lim_{x \to 3^-} \frac{1}{x - 3} = -\infty$.
19. Let $f(x) = \frac{x + 1}{(x - 1)^2}$. Prove that
(i) $\lim_{x \to +\infty} f(x) = 0$.
(ii) $\lim_{x \to 1^+} f(x) = +\infty$.
20. Let $f: \mathbb{R} \to \mathbb{R}$ be defined by $f(x) = \begin{cases} 0 & \text{if } x \text{ is rational} \\ 1 & \text{if } x \text{ is irrational.} \end{cases}$ Prove that for all $r \in \mathbb{R}$, $\lim_{x \to r}[f(x)]$ does not exist.
LESSON 14 – TOPOLOGY
SPACES AND HOMEOMORPHISMS
Topological Spaces
A topological space consists of a set ๐ together with a collection of “open” subsets of ๐. Before we give
the formal definition of “open,” let’s quickly review a standard example that most of us are somewhat
familiar with.
Consider the set โ of real numbers and call a subset ๐ of โ open if for every real number ๐ฅ ∈ ๐, there
is an open interval (๐, ๐) with ๐ฅ ∈ (๐, ๐) and (๐, ๐) ⊆ ๐. We were first introduced to this definition of
an open set in Lesson 6. In that same lesson, we showed that ∅ and โ are both open sets (Theorem
6.4), we proved that an arbitrary union of open sets is open (Theorem 6.7), and we proved that a finite
intersection of open sets is open (Theorem 6.9 and part (iii) of Problem 6 from that lesson). As it turns
out, with this definition of open, every open set can be expressed as a union of open intervals (Theorem
6.8).
In this lesson we will move to a more general setting and explore arbitrary sets together with various
collections of “open” subsets of these sets. Let’s begin by giving the formal definition of a topological
space.
Let ๐ be a set and let ๐ฏ be a collection of subsets of ๐. ๐ฏ is said to be a topology on ๐ if the following
three properties are satisfied:
1. ∅ ∈ ๐ฏ and ๐ ∈ ๐ฏ.
2. If ๐ฟ ⊆ ๐ฏ, then โ๐ฟ ∈ ๐ฏ (๐ฏ is closed under taking arbitrary unions).
3. If ๐ ⊆ ๐ฏ and ๐ is finite, then โ๐ ∈ ๐ฏ (๐ฏ is closed under taking finite intersections).
A topological space is a pair (๐, ๐ฏ), where ๐ is a set and ๐ฏ is a topology on ๐. We will call the elements
of ๐ฏ open sets. Complements of elements of ๐ฏ will be called closed sets (๐ด is closed if and only if ๐ โ ๐ด
is open).
We may sometimes refer to the topological space ๐. When we do, there is a topology ๐ฏ on ๐ that we
are simply not mentioning explicitly.
Example 14.1:
1. Let ๐ = {๐} be a set consisting of just the one element ๐. There is just one topology on ๐. It is
the topology ๐ฏ = {∅, {๐}}.
Note that the power set of ๐ is ๐ซ(๐) = {∅, {๐}} and ๐ซ(๐ซ(๐)) = {∅, {∅}, {{๐}}, {∅, {๐}}}. Notice
that the topology ๐ฏ = {∅, {๐}} is an element of ๐ซ(๐ซ(๐)). However, the other three elements
of ๐ซ(๐ซ(๐)) are not topologies on ๐ = {๐}.
In general, for any set ๐, a topology on ๐ is a subset of ๐ซ(๐), or equivalently, an element of
๐ซ(๐ซ(๐)). If ๐ ≠ ∅, then not every element of ๐ซ(๐ซ(๐)) will be a topology on ๐.
For example, if $S = \{a\}$, then $\emptyset$, $\{\emptyset\}$, and $\{\{a\}\}$ are all elements of $\mathcal{P}(\mathcal{P}(S))$ that are not
topologies on ๐. ∅ and {∅} fail to be topologies on {๐} because they do not contain {๐}, while
{{๐}} fails to be a topology on {๐} because it does not contain ∅.
2. Let ๐ = {๐, ๐} be a set consisting of the two distinct elements ๐ and ๐. There are four topologies
on ๐: ๐ฏ1 = {∅, {๐, ๐}}, ๐ฏ2 = {∅, {๐}, {๐, ๐}}, ๐ฏ3 = {∅, {๐}, {๐, ๐}}, and ๐ฏ4 = {∅, {๐}, {๐}, {๐, ๐}}.
We can visualize these topologies as follows.
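[Figure: diagrams of the four topologies $\mathcal{T}_1$, $\mathcal{T}_2$, $\mathcal{T}_3$, and $\mathcal{T}_4$ on $\{a, b\}$.]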
Notice that all four topologies in the figure have the elements ๐ and ๐ inside a large circle
because ๐ = {๐, ๐} is in all four topologies. Also, it is understood that ∅ is in all the topologies.
๐ฏ1 is called the trivial topology (or indiscrete topology) on ๐ because it contains only ∅ and ๐.
๐ฏ4 is called the discrete topology on ๐, as it contains every subset of ๐. The discrete topology is
just ๐ซ(๐) (the power set of ๐).
The topologies ๐ฏ2 , ๐ฏ3 , and ๐ฏ4 are finer than the topology ๐ฏ1 because ๐ฏ1 ⊆ ๐ฏ2 , ๐ฏ1 ⊆ ๐ฏ3 and
๐ฏ1 ⊆ ๐ฏ4 . We can also say that ๐ฏ1 is coarser than ๐ฏ2 , ๐ฏ3 , and ๐ฏ4 . Similarly, ๐ฏ4 is finer than ๐ฏ2 and
๐ฏ3 , or equivalently, ๐ฏ2 and ๐ฏ3 are coarser than ๐ฏ4 . The topologies ๐ฏ2 and ๐ฏ3 are incomparable.
Neither one is finer than the other. To help understand the terminology “finer” and “coarser,”
we can picture the open sets as a pile of rocks. If we were to smash that pile of rocks (the open
sets) with a hammer, the rocks will break into smaller pieces (creating more open sets), and the
pile of rocks (the topology) will have been made “finer.”
Note that for any set ๐, the discrete topology is always the finest topology and the trivial
topology is always the coarsest.
3. Let ๐ = {๐, ๐, ๐} be a set consisting of the three distinct elements ๐, ๐, and ๐. There are 29
topologies on ๐. Let’s look at a few of them.
We have the trivial topology ๐ฏ1 = {∅, {๐, ๐, ๐}}.
If we throw in just a singleton set (a set consisting of just one element), we get the three
topologies ๐ฏ2 = {∅, {๐}, {๐, ๐, ๐}}, ๐ฏ3 = {∅, {๐}, {๐, ๐, ๐}}, ๐ฏ4 = {∅, {๐}, {๐, ๐, ๐}}.
Note that we can’t throw in just two singleton sets. For example, {∅, {๐}, {๐}, {๐, ๐, ๐}} is not a
topology on ๐. Do you see the problem? It’s not closed under taking unions: {๐} and {๐} are
there, but {๐, ๐} = {๐} ∪ {๐} is not! However, ๐ฏ5 = {∅, {๐}, {๐}, {๐, ๐}, {๐, ๐, ๐}} is a topology on
๐.
Here are a few chains of topologies on ๐ written in order from the coarsest to the finest
topology (chains are linearly ordered subsets of {๐ฏ | ๐ฏ is a topology on ๐}).
{∅, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐, ๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐}, {๐, ๐}, {๐, ๐, ๐}}
⊆ {∅, {๐}, {๐}, {๐, ๐}, {๐, ๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐}, {๐}, {๐, ๐}, {๐, ๐}, {๐, ๐}, {๐, ๐, ๐}}
{∅, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐, ๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐}, {๐, ๐}, {๐, ๐}, {๐, ๐, ๐}}
⊆ {∅, {๐}, {๐}, {๐}, {๐, ๐}, {๐, ๐}, {๐, ๐}, {๐, ๐, ๐}}
{∅, {๐, ๐, ๐}} ⊆ {∅, {๐, ๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐, ๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐}, {๐, ๐}, {๐, ๐, ๐}}
⊆ {∅, {๐}, {๐}, {๐, ๐}, {๐, ๐}, {๐, ๐, ๐}} ⊆ {∅, {๐}, {๐}, {๐}, {๐, ๐}, {๐, ๐}, {๐, ๐}, {๐, ๐, ๐}}
Below is a picture of all 29 topologies on {a, b, c} (to avoid clutter we left out the names of the
elements). Again, a large circle surrounds a, b, and c in all cases because S = {a, b, c} is in all 29
topologies. Also, it is understood that the empty set is in all these topologies.
I organized these topologies by the number of sets in each topology. The lowest row consists of
just the trivial topology. The next row up consists of the topologies with just one additional set
(three sets in total because ∅ and S are in every topology), and so on.
Below we see a visual representation of the three chains described above. As each path moves
from the bottom to the top of the picture, we move from coarser to finer topologies.
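The count of 29 can be checked by brute force. The following is a minimal sketch, not part of the original text: the helper names powerset and is_topology are hypothetical, and the three elements are represented by the characters "a", "b", and "c". It tests every collection of subsets of a three-element set against the topology axioms.

```python
from itertools import combinations

def powerset(items):
    """All subsets of a finite collection, as a list of frozensets."""
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_topology(S, T):
    """Check the topology axioms for a finite collection T of subsets of S."""
    S, T = frozenset(S), {frozenset(U) for U in T}
    if frozenset() not in T or S not in T or not all(U <= S for U in T):
        return False
    # For a finite collection, closure under pairwise unions and pairwise
    # intersections gives closure under all unions and finite intersections.
    return all(U | V in T and U & V in T for U in T for V in T)

S = frozenset("abc")
# powerset(S) has 8 subsets, so there are 2^8 = 256 candidate collections.
topologies = [T for T in powerset(powerset(S)) if is_topology(S, T)]
print(len(topologies))  # 29

not_closed = [set(), {"a"}, {"b"}, {"a", "b", "c"}]          # misses {a, b}
T5 = [set(), {"a"}, {"b"}, {"a", "b"}, {"a", "b", "c"}]
print(is_topology(S, not_closed), is_topology(S, T5))  # False True
```

The same check rejects the collection with two singletons and no union, and accepts the five-set topology discussed above.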
4. Let S = ℝ and let 𝒯 = {X ⊆ ℝ | ∀x ∈ X ∃a, b ∈ ℝ (x ∈ (a, b) ∧ (a, b) ⊆ X)}. In other words,
we are defining a subset of ℝ to be open as we did in Lesson 6. That is, a subset X of ℝ is open
if for every real number x ∈ X, there is an open interval (a, b) with x ∈ (a, b) and (a, b) ⊆ X.
By Theorem 6.4, ∅, ℝ ∈ 𝒯. By Theorem 6.7, 𝒯 is closed under taking arbitrary unions. By
Problem 6 from Problem Set 6 (part (iii)), 𝒯 is closed under taking finite intersections. It follows
that 𝒯 is a topology on ℝ. This topology is called the standard topology on ℝ.
5. Let S = ℂ and let 𝒯 = {X ⊆ ℂ | ∀z ∈ X ∃a ∈ ℂ ∃r ∈ ℝ⁺ (z ∈ N_r(a) ∧ N_r(a) ⊆ X)}. In other
words, we are defining a subset of ℂ to be open as we did in Lesson 7. That is, a subset X of ℂ
is open if for every complex number z ∈ X, there is an open disk (or neighborhood) N_r(a) with
z ∈ N_r(a) and N_r(a) ⊆ X. By Example 7.8 (part 4), ∅, ℂ ∈ 𝒯. By Problem 8 in Problem Set 7
(parts (i) and (ii)), 𝒯 is closed under taking arbitrary unions and finite intersections. It follows
that 𝒯 is a topology on ℂ. This topology is called the standard topology on ℂ.
Note: Recall that for a ∈ ℂ and r ∈ ℝ⁺, the r-neighborhood of a, written N_r(a), is the open disk with
center a and radius r. That is, N_r(a) = {z ∈ ℂ | |z − a| < r}. See Lesson 7 for details.
Bases
If (S, 𝒯) is a topological space, then a basis for the topology 𝒯 is a subset ℬ ⊆ 𝒯 such that every
element of 𝒯 can be written as a union of elements from ℬ. We say that 𝒯 is generated by ℬ or ℬ
generates 𝒯.
Notes: (1) Given a topological space (S, 𝒯), it can be cumbersome to describe all the open sets in 𝒯.
However, it is usually not too difficult to describe a topology in terms of its basis elements.
(2) If (S, 𝒯) is a topological space and ℬ is a basis for 𝒯, then 𝒯 = {⋃𝑿 | 𝑿 ⊆ ℬ}.
(3) More generally, if 𝒳 is any collection of subsets of S, then we can say that 𝒳 generates
{⋃𝑿 | 𝑿 ⊆ 𝒳}. However, this set will not always be a topology on S.
Example 14.2:
1. Let S = {a, b, c} and 𝒯 = {∅, {a}, {b}, {a, b}, {b, c}, {a, b, c}}. The set ℬ = {{a}, {b}, {b, c}} is a
basis for 𝒯. Indeed, we have {a} = ⋃{{a}}, {b} = ⋃{{b}}, {a, b} = ⋃{{a}, {b}} = {a} ∪ {b},
{b, c} = ⋃{{b, c}}, {a, b, c} = ⋃{{a}, {b, c}} = {a} ∪ {b, c}, and ∅ = ⋃∅.
(Note that ⋃∅ = {y | there is Y ∈ ∅ with y ∈ Y} = ∅. It follows that ∅ does not need to be
included in a basis.)
We can visualize the basis ℬ and the topology 𝒯 that is generated by ℬ as follows.
[Figure: two diagrams, one labeled ℬ and one labeled 𝒯]
We know ∅ ∈ 𝒯 (even though ∅ is not indicated in the picture of 𝒯) because 𝒯 is a topology.
On the other hand, it is unclear from the picture of ℬ whether ∅ ∈ ℬ. However, it doesn’t really
matter. Since ∅ is equal to an empty union, ∅ will always be generated from ℬ anyway.
There can be more than one basis for the same topology. Here are a few more bases for the
topology 𝒯 just discussed (are there any others?):
ℬ₁ = {{a}, {b}, {a, b}, {b, c}}
ℬ₂ = {{a}, {b}, {a, b}, {b, c}, {a, b, c}}
ℬ₃ = {∅, {a}, {b}, {b, c}}
ℬ₄ = 𝒯 = {∅, {a}, {b}, {a, b}, {b, c}, {a, b, c}}
2. Let S = {a, b, c} and let 𝒳 = {{a}, {b}}. In this case, 𝒳 generates {∅, {a}, {b}, {a, b}}. This set is
not a topology on S because S = {a, b, c} is not in the set. The reason that 𝒳 failed to generate
a topology on S is that it didn’t completely “cover” S. Specifically, c is not in any set in 𝒳.
In general, if an element x from a set S does not appear in any of the sets in a set 𝒳, then no
matter how large a union we take from 𝒳, we will never be able to generate a set from 𝒳 with
x in it, and therefore, 𝒳 will not generate a topology on S (although it might generate a
topology on a subset of S).
3. Let S = {a, b, c} and 𝒳 = {{a, b}, {b, c}}. In this case, 𝒳 generates {∅, {a, b}, {b, c}, {a, b, c}}.
This set is also not a topology because {a, b} ∩ {b, c} = {b} is not in the set. In other words, the
set is not closed under finite intersections.
In general, if there are two sets A and B in 𝒳 and an element x ∈ A ∩ B such that no set C ∈ 𝒳
satisfies both x ∈ C and C ⊆ A ∩ B, then the set generated by 𝒳 will
not be closed under finite intersections, and therefore, 𝒳 will not generate a topology on S.
Note that A ∩ B itself does not necessarily need to be in 𝒳. However, for each x ∈ A ∩ B there
does need to be a set C ∈ 𝒳 with x ∈ C and C ⊆ A ∩ B.
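The three parts of Example 14.2 can be reproduced mechanically. The sketch below is illustrative only: the function name generate is hypothetical, and it simply forms every union ⋃𝑿 with 𝑿 a subcollection of the given finite collection.

```python
from itertools import combinations

def generate(collection):
    """Return {⋃Y | Y ⊆ collection} for a finite collection of finite sets."""
    sets = [frozenset(A) for A in collection]
    generated = set()
    for r in range(len(sets) + 1):
        for Y in combinations(sets, r):
            # The empty subcollection (r = 0) contributes the empty union ∅.
            generated.add(frozenset().union(*Y))
    return generated

S = frozenset("abc")
T = generate([{"a"}, {"b"}, {"b", "c"}])        # part 1: a basis
print(sorted("".join(sorted(U)) for U in T))    # ['', 'a', 'ab', 'abc', 'b', 'bc']

X2 = generate([{"a"}, {"b"}])                   # part 2: does not cover S
print(S in X2)                                  # False

X3 = generate([{"a", "b"}, {"b", "c"}])         # part 3: misses {b}
print(frozenset("b") in X3)                     # False
```

Note that ∅ appears in every generated set because the empty subcollection is always allowed, which matches the remark that ∅ never needs to be listed in a basis.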
Parts 2 and 3 from Example 14.2 show us that not every collection 𝒳 of subsets of a set S is the basis
for a topology on S. Let’s see if we can find conditions on a collection 𝒳 of subsets of S that will
guarantee that 𝒳 is a basis for a topology on S.
We say that 𝒳 covers S if every element of S belongs to at least one member of 𝒳. Symbolically, we
have
∀x ∈ S ∃A ∈ 𝒳 (x ∈ A).
We say that 𝒳 has the intersection containment property on S if every element of S that is in the
intersection of two sets in 𝒳 is also in some set in 𝒳 that is contained in that intersection.
∀x ∈ S ∀A, B ∈ 𝒳 (x ∈ A ∩ B → ∃C ∈ 𝒳 (x ∈ C ∧ C ⊆ A ∩ B)).
Example 14.3:
1. Once again, let S = {a, b, c}. 𝒳₁ = {{a}, {a, b}} does not cover S because c ∈ S does not belong
to any member of 𝒳₁. 𝒳₁ does have the intersection containment property. For example, the only
element of {a} ∩ {a, b} = {a} is a, and a ∈ {a} ∈ 𝒳₁ and {a} is contained in {a} ∩ {a, b}. Notice that the
set that 𝒳₁ generates is {∅, {a}, {a, b}}. This set is not a topology on S because S = {a, b, c} is
not in this set. However, it is a topology on {a, b}.
𝒳₂ = {{a, b}, {b, c}} covers S, but does not have the intersection containment property.
Indeed, {a, b} ∩ {b, c} = {b} and b ∈ {b}, but neither set in 𝒳₂ is contained in {b}. Notice that the
set that 𝒳₂ generates is {∅, {a, b}, {b, c}, {a, b, c}}. This set is not a topology on S because
{a, b} ∩ {b, c} = {b} is not in the set.
ℬ = {{b}, {a, b}, {b, c}} covers S and has the intersection containment property. The set that ℬ
generates is the topology 𝒯 = {∅, {b}, {a, b}, {b, c}, {a, b, c}}.
We can visualize the sets 𝒳₁, 𝒳₂, ℬ, and 𝒯 as follows.
[Figure: four diagrams, labeled 𝒳₁, 𝒳₂, ℬ, and 𝒯]
2. Let S = ℝ and let ℬ = {(a, b) | a, b ∈ ℝ ∧ a < b} be the set of open intervals with endpoints
in ℝ. ℬ covers ℝ because if x ∈ ℝ, then x ∈ (x − 1, x + 1) ∈ ℬ. ℬ also has the intersection
containment property. Indeed, if x ∈ ℝ and (a, b), (c, d) ∈ ℬ with x ∈ (a, b) ∩ (c, d), then
x ∈ (a, b) ∩ (c, d) = (e, f), where e = max{a, c} and f = min{b, d} (see part (ii) of Problem
6 from Problem Set 6) and (e, f) ∈ ℬ. In fact, ℬ is a basis on ℝ that generates the standard
topology of ℝ.
To see that ℬ generates the standard topology on ℝ, let 𝒯 be the standard topology on ℝ and
let 𝒯′ be the topology generated by ℬ. First, let X ∈ 𝒯. By Theorem 6.8, X can be expressed as
a union of bounded open intervals. So, X ∈ 𝒯′. Since X ∈ 𝒯 was arbitrary, 𝒯 ⊆ 𝒯′. Now, let
X ∈ 𝒯′. Then X is a union of bounded open intervals, say X = ⋃Y. Let x ∈ X. Since X = ⋃Y,
x ∈ ⋃Y. So, x ∈ (a, b) for some (a, b) ∈ Y. Since (a, b) ∈ Y, (a, b) ⊆ ⋃Y = X. Therefore,
X ∈ 𝒯. Since X ∈ 𝒯′ was arbitrary, 𝒯′ ⊆ 𝒯. Since 𝒯 ⊆ 𝒯′ and 𝒯′ ⊆ 𝒯, 𝒯′ = 𝒯.
3. Let S = ℝ and let 𝒳 = {(−∞, a) | a ∈ ℝ} ∪ {(b, ∞) | b ∈ ℝ}. 𝒳 covers ℝ because if x ∈ ℝ,
then x ∈ (x − 1, ∞) ∈ 𝒳. However, 𝒳 does not have the intersection containment property.
For example, 0 ∈ (−∞, 1) ∩ (−1, ∞) = (−1, 1), but there is no set in 𝒳 contained in (−1, 1).
The set generated by 𝒳 is 𝒳 ∪ {(−∞, a) ∪ (b, ∞) | a, b ∈ ℝ ∧ a ≤ b} ∪ {∅, ℝ} (the case a = b
gives the sets ℝ ∖ {a}). This set is not
closed under finite intersections, and therefore, it is not a topology on ℝ.
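For finite collections, the two conditions introduced before Example 14.3 can be tested directly from their symbolic definitions. This is a small sketch under stated assumptions: covers and has_icp are hypothetical names, and the collections are the finite ones from part 1 of the example.

```python
def covers(S, X):
    """∀x ∈ S ∃A ∈ X (x ∈ A)."""
    return all(any(x in A for A in X) for x in S)

def has_icp(X):
    """∀A, B ∈ X ∀x ∈ A ∩ B ∃C ∈ X (x ∈ C ∧ C ⊆ A ∩ B)."""
    return all(
        any(x in C and C <= (A & B) for C in X)
        for A in X for B in X for x in (A & B)
    )

S = frozenset("abc")
X1 = [frozenset("a"), frozenset("ab")]
X2 = [frozenset("ab"), frozenset("bc")]
B = [frozenset("b"), frozenset("ab"), frozenset("bc")]

print(covers(S, X1), has_icp(X1))  # False True
print(covers(S, X2), has_icp(X2))  # True False
print(covers(S, B), has_icp(B))    # True True
```

The infinite collections in parts 2 and 3 cannot be enumerated this way; for those, the arguments given in the example are needed.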
Based on the previous examples, the next theorem should come as no surprise.
Theorem 14.1: Let S be a nonempty set and let ℬ be a collection of subsets of S. ℬ is a basis for a
topology on S if and only if ℬ covers S and ℬ has the intersection containment property on S.
Note: The set generated by ℬ is {⋃𝑿 | 𝑿 ⊆ ℬ}. This set can also be written in the alternative form
{A ⊆ S | ∀x ∈ A ∃B ∈ ℬ (x ∈ B ∧ B ⊆ A)}. You will be asked to verify that these two sets are equal in
Problem 6 below. We will use this alternative form of the set generated by ℬ in the proof of Theorem
14.1.
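On a small finite set, both the theorem and the alternative form in the Note can be sanity-checked exhaustively. The sketch below is not a proof; it only confirms the statements for S = {a, b, c}. All helper names (powerset, generate, alternative_form, and so on) are hypothetical.

```python
from itertools import combinations

def powerset(items):
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_topology(S, T):
    if frozenset() not in T or S not in T:
        return False
    return all(U | V in T and U & V in T for U in T for V in T)

def generate(B):
    """{⋃Y | Y ⊆ B}."""
    return {frozenset().union(*Y) for Y in powerset(B)}

def alternative_form(S, B):
    """{A ⊆ S | ∀x ∈ A ∃C ∈ B (x ∈ C ∧ C ⊆ A)}."""
    return {A for A in powerset(S)
            if all(any(x in C and C <= A for C in B) for x in A)}

def covers(S, B):
    return all(any(x in A for A in B) for x in S)

def has_icp(B):
    return all(any(x in C and C <= (A1 & A2) for C in B)
               for A1 in B for A2 in B for x in (A1 & A2))

S = frozenset("abc")
collections = powerset(powerset(S))   # all 256 collections of subsets of S

forms_agree = all(generate(B) == alternative_form(S, B) for B in collections)
theorem_holds = all(
    is_topology(S, generate(B)) == (covers(S, B) and has_icp(B))
    for B in collections
)
print(forms_agree, theorem_holds)  # True True
```

Here "ℬ is a basis for a topology on S" is interpreted as "the set generated by ℬ is a topology on S", which is how the proof of the theorem uses the phrase.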
Proof of Theorem 14.1: Suppose that ℬ covers S and ℬ has the intersection containment property on
S. The set generated by ℬ is 𝒯 = {A ⊆ S | ∀x ∈ A ∃B ∈ ℬ (x ∈ B ∧ B ⊆ A)}. Let’s check that 𝒯 is a
topology on S.
Since A = ∅ vacuously satisfies the condition ∀x ∈ A ∃B ∈ ℬ (x ∈ B ∧ B ⊆ A), we have ∅ ∈ 𝒯.
To see that S ∈ 𝒯, let x ∈ S. Since ℬ covers S, there is B ∈ ℬ such that x ∈ B and B ⊆ S. So, S ∈ 𝒯.
Let 𝑿 ⊆ 𝒯 and let x ∈ ⋃𝑿. Then there is A ∈ 𝑿 with x ∈ A. Since 𝑿 ⊆ 𝒯, A ∈ 𝒯. So, there is B ∈ ℬ
such that x ∈ B and B ⊆ A. Since B ⊆ A and A ⊆ ⋃𝑿, B ⊆ ⋃𝑿. It follows that the condition
∀x ∈ ⋃𝑿 ∃B ∈ ℬ (x ∈ B ∧ B ⊆ ⋃𝑿) is satisfied. So, ⋃𝑿 ∈ 𝒯.
We now prove by induction on n ∈ ℕ that for n ≥ 2, the intersection of n sets in 𝒯 is also in 𝒯.
Base Case (n = 2): Let A₁, A₂ ∈ 𝒯 and let x ∈ A₁ ∩ A₂. Then there are B₁, B₂ ∈ ℬ with x ∈ B₁, x ∈ B₂,
B₁ ⊆ A₁ and B₂ ⊆ A₂. Since x ∈ B₁ and x ∈ B₂, x ∈ B₁ ∩ B₂. Since ℬ has the intersection containment
property, there is C ∈ ℬ such that x ∈ C and C ⊆ B₁ ∩ B₂. Since B₁ ⊆ A₁ and B₂ ⊆ A₂, C ⊆ A₁ ∩ A₂.
Therefore, A₁ ∩ A₂ ∈ 𝒯.
Inductive Step: Suppose that the intersection of k sets in 𝒯 is always in 𝒯. Let A₁, A₂, … , Aₖ, Aₖ₊₁ ∈ 𝒯.
By the inductive hypothesis, A₁ ∩ A₂ ∩ ⋯ ∩ Aₖ ∈ 𝒯. If we let C = A₁ ∩ A₂ ∩ ⋯ ∩ Aₖ and D = Aₖ₊₁,
then we have C, D ∈ 𝒯. By the base case, C ∩ D ∈ 𝒯. It follows that
A₁ ∩ A₂ ∩ ⋯ ∩ Aₖ ∩ Aₖ₊₁ = (A₁ ∩ A₂ ∩ ⋯ ∩ Aₖ) ∩ Aₖ₊₁ = C ∩ D ∈ 𝒯.
Since ∅, S ∈ 𝒯, 𝒯 is closed under arbitrary unions, and 𝒯 is closed under finite intersections, it follows
that 𝒯 is a topology. By the note following the statement of Theorem 14.1, ℬ generates 𝒯.
Conversely, suppose that ℬ is a basis for a topology 𝒯 on S. Since 𝒯 is a topology on S, S ∈ 𝒯. Since ℬ
is a basis for 𝒯, S = ⋃𝒳 for some 𝒳 ⊆ ℬ. Let x ∈ S. Then x ∈ ⋃𝒳. So, there is A ∈ 𝒳 with x ∈ A.
Since 𝒳 ⊆ ℬ, A ∈ ℬ. Since x ∈ S was arbitrary, ℬ covers S.
Let x ∈ A₁ ∩ A₂, where A₁, A₂ ∈ ℬ. Then A₁, A₂ ∈ 𝒯, and since 𝒯 is a topology on S, A₁ ∩ A₂ ∈ 𝒯.
Since ℬ is a basis for 𝒯, A₁ ∩ A₂ = ⋃𝒳 for some 𝒳 ⊆ ℬ. It follows that x ∈ ⋃𝒳. So, there is C ∈ 𝒳
with x ∈ C. Since 𝒳 ⊆ ℬ, C ∈ ℬ. Also, C ⊆ ⋃𝒳 = A₁ ∩ A₂. Since A₁, A₂ ∈ ℬ and x ∈ A₁ ∩ A₂ were arbitrary,
ℬ has the intersection containment property.
□
Example 14.4:
1. If S is any set, then ℬ = {S} is a basis for the trivial topology on S. Note that {S} covers S and
{S} has the intersection containment property on S (there is just one instance to check:
S ∩ S = S and S ∈ {S}).
2. If S is any set, then ℬ = {{x} | x ∈ S} is a basis for the discrete topology on S. ℬ covers S
because if x ∈ S, then {x} ∈ ℬ and x ∈ {x}. ℬ has the intersection containment
property because distinct sets in ℬ are disjoint, and {x} ∩ {x} = {x} ∈ ℬ for each x ∈ S.
3. Let S = ℝ and let ℬ = {(a, b) | a, b ∈ ℝ ∧ a < b}. We saw in Example 14.3 (part 2) that ℬ
covers ℝ and that ℬ has the intersection containment property on ℝ. It follows that ℬ is a basis
for a topology on ℝ. In fact, we already saw in the same Example that ℬ generates the standard
topology on ℝ.
The basis ℬ just described is uncountable because ℝ is uncountable and the function f: ℝ → ℬ
defined by f(r) = (r, r + 1) is injective. Does ℝ with the standard topology have a countable
basis? In fact, it does! Let ℬ′ = {(a, b) | a, b ∈ ℚ ∧ a < b}. In Problem 9 below you will be asked
to show that ℬ′ is countable and that ℬ′ is a basis for ℝ with the standard topology.
4. We saw in part 3 of Example 14.3 that 𝒳 = {(−∞, a) | a ∈ ℝ} ∪ {(b, ∞) | b ∈ ℝ} does not have
the intersection containment property. It follows from Theorem 14.1 that 𝒳 is not a basis for a
topology on ℝ.
However, ℬ* = {(a, ∞) | a ∈ ℝ} covers ℝ and has the intersection containment property.
Therefore, ℬ* is a basis for a topology 𝒯* on ℝ. Since every set of the form (a, ∞) is open in the
standard topology, 𝒯* is coarser than the standard topology on ℝ. Since no bounded open
interval is in 𝒯*, we see that 𝒯* is strictly coarser than the standard topology on ℝ.
Note: Although the set 𝒳 = {(−∞, a) | a ∈ ℝ} ∪ {(b, ∞) | b ∈ ℝ} is not a basis for a topology on ℝ, if
we let ℬ be the collection of all finite intersections of sets in 𝒳, then ℬ does form a basis for a
topology on ℝ (because 𝒳 covers ℝ). In this case, we call 𝒳 a subbasis for the topology generated
by ℬ. Since every bounded open interval is in ℬ, it is not hard to see that ℬ generates the standard
topology on ℝ. We can also say that the standard topology on ℝ is generated by the subbasis 𝒳.
Types of Topological Spaces
A topological space (S, 𝒯) is a T₀-space (or Kolmogorov space) if for all x, y ∈ S with x ≠ y, there is
U ∈ 𝒯 such that either x ∈ U and y ∉ U or x ∉ U and y ∈ U.
In other words, in a T₀-space, given any two distinct elements, there is an open set that
contains one of the elements and excludes the other. In the picture on the right we
see two typical elements a and b in a T₀-space. We have drawn an open set
containing a and excluding b. There does not need to be an open set containing b
and excluding a (although there can be).
Example 14.5:
1. Let S = {a, b} where a ≠ b. S together with the trivial topology {∅, {a, b}} is not a T₀-space. In
fact, the trivial topology on any set with more than one element is not a T₀-space.
{a, b} together with the discrete topology {∅, {a}, {b}, {a, b}} is a T₀-space because the open
set {a} satisfies a ∈ {a} and b ∉ {a}. In fact, the discrete topology on any set is a T₀-space.
The other two topologies on {a, b} are also T₀-spaces. For example, {∅, {a}, {a, b}} is a T₀-space
because {a} is open, a ∈ {a} and b ∉ {a}.
2. Let S = ℝ and let 𝒯 be the topology generated by the basis {(a, ∞) | a ∈ ℝ}. Then (S, 𝒯) is a
T₀-space. If a, b ∈ ℝ with a < b, then U = (a, ∞) is an open set with b ∈ U and a ∉ U.
3. If (S, 𝒯) is a T₀-space and 𝒯′ is finer than 𝒯, then (S, 𝒯′) is also a T₀-space. Indeed, if U ∈ 𝒯
with x ∈ U and y ∉ U, then since 𝒯′ is finer than 𝒯, we have U ∈ 𝒯′.
For example, since the standard topology on ℝ is finer than the topology generated by
{(a, ∞) | a ∈ ℝ}, ℝ together with the standard topology on ℝ is a T₀-space.
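Part 1 of this example is small enough to check by hand, but the definition also translates directly into code. The following sketch uses a hypothetical function is_T0 and represents the two points by the characters "a" and "b"; it tests all four topologies on a two-element set.

```python
def is_T0(S, T):
    """For all x ≠ y in S, some open set contains exactly one of x and y."""
    return all(
        any((x in U) != (y in U) for U in T)
        for x in S for y in S if x != y
    )

S = frozenset("ab")
empty, a, b = frozenset(), frozenset("a"), frozenset("b")

trivial = {empty, S}
only_a = {empty, a, S}
only_b = {empty, b, S}
discrete = {empty, a, b, S}

print([is_T0(S, T) for T in (trivial, only_a, only_b, discrete)])
# [False, True, True, True]
```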
A topological space (S, 𝒯) is a T₁-space (or Fréchet space or Tikhonov space) if for all x, y ∈ S with
x ≠ y, there are U, V ∈ 𝒯 such that x ∈ U and y ∉ U and x ∉ V and y ∈ V.
In the picture on the right we see two typical elements a and b in a T₁-space. We
have drawn an open set containing a and excluding b and an open set containing b
and excluding a. These two open sets do not need to be disjoint. The smaller dots in
the picture are representing some elements of the space other than a and b.
Example 14.6:
1. S = {a, b} together with the discrete topology {∅, {a}, {b}, {a, b}} is a T₁-space because the
open sets {a} and {b} satisfy a ∈ {a}, b ∉ {a}, b ∈ {b}, and a ∉ {b}. In fact, the discrete
topology on any set is a T₁-space.
It should be clear from the definitions that every T₁-space is a T₀-space. It follows that the trivial
topology on any set with more than one element is not a T₁-space.
The other two topologies on {a, b} are not T₁-spaces. For example,
{∅, {a}, {a, b}} is not a T₁-space because the only open set containing b also
contains a.
In fact, the only topology on any finite set that is T₁ is the discrete topology.
To see this, let 𝒯 be a topology on a finite set S that is T₁ and let x ∈ S. For
each y ∈ S with y ≠ x, there is an open set U_y such that x ∈ U_y and y ∉ U_y. It follows that
U = ⋂{U_y | y ∈ S ∧ y ≠ x} is open (because S is finite, U is a finite intersection of open sets)
and it is easy to see that U = {x}. So, every one point set is open. Thus, 𝒯 is generated by the
one point sets, and therefore, 𝒯 is the discrete topology on S.
2. Let S = ℝ and let 𝒯 be the topology generated by the basis {(a, ∞) | a ∈ ℝ}. Then (S, 𝒯) is not
a T₁-space. To see this, let x, y ∈ ℝ with x < y. Let U be an open set containing x, say
U = (a, ∞). Since x < y and a < x, we have a < y, and so, y ∈ U. Therefore, there is no open
set U with x ∈ U and y ∉ U.
It’s worth noting that the topology generated by {(a, ∞) | a ∈ ℝ} is {(a, ∞) | a ∈ ℝ} ∪ {∅, ℝ}.
3. Let S = ℝ and let 𝒯 be the topology generated by the basis ℬ = {X ⊆ ℝ | ℝ ∖ X is finite}. 𝒯 is
called the cofinite topology on ℝ. I leave it to the reader to verify that ℬ generates a topology
on ℝ that is strictly coarser than the standard topology (Problem 3 below). It’s easy to see that
(S, 𝒯) is a T₁-space. Indeed, if a, b ∈ ℝ with a ≠ b, then let U = ℝ ∖ {b} and V = ℝ ∖ {a}.
4. If (S, 𝒯) is a T₁-space and 𝒯′ is finer than 𝒯, then (S, 𝒯′) is also a T₁-space. Indeed, if U, V ∈ 𝒯
with x ∈ U and y ∉ U and x ∉ V and y ∈ V, then since 𝒯′ is finer than 𝒯, we have U, V ∈ 𝒯′.
For example, since the standard topology on ℝ is finer than the cofinite topology on ℝ, it follows
that ℝ together with the standard topology on ℝ is a T₁-space.
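The claim from part 1 that a T₁ topology on a finite set must be discrete can be confirmed for a three-element set by enumeration. This is a sketch only, with hypothetical helper names, and it checks a single small case rather than the general statement.

```python
from itertools import combinations

def powerset(items):
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_topology(S, T):
    if frozenset() not in T or S not in T:
        return False
    return all(U | V in T and U & V in T for U in T for V in T)

def is_T1(S, T):
    """For every ordered pair x ≠ y, some open set contains x but not y."""
    return all(
        any(x in U and y not in U for U in T)
        for x in S for y in S if x != y
    )

S = frozenset("abc")
topologies = [T for T in powerset(powerset(S)) if is_topology(S, T)]
t1_topologies = [T for T in topologies if is_T1(S, T)]

discrete = frozenset(powerset(S))
print(len(t1_topologies), t1_topologies == [discrete])  # 1 True
```

Checking ordered pairs (x, y) and (y, x) separately is what supplies both open sets U and V required by the definition.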
Theorem 14.2: A topological space (S, 𝒯) is a T₁-space if and only if for all x ∈ S, {x} is a closed set.
Proof: Let (S, 𝒯) be a topological space. First, assume that (S, 𝒯) is a T₁-space and let x ∈ S. For each
y ∈ S with y ≠ x, there is an open set U_y with y ∈ U_y and x ∉ U_y. Then U = ⋃{U_y | y ∈ S ∧ y ≠ x}
is open (because U is a union of open sets). Let’s check that {x} = S ∖ U. Since x ∉ U_y for all y ≠ x,
x ∉ U. So, x ∈ S ∖ U. It follows that {x} ⊆ S ∖ U. If z ∈ S ∖ U, then z ∉ U. So, for all y ≠ x, z ∉ U_y.
Thus, for all y ≠ x, z ≠ y. Therefore, z = x, and so, z ∈ {x}. So, S ∖ U ⊆ {x}. Since {x} ⊆ S ∖ U and
S ∖ U ⊆ {x}, we have {x} = S ∖ U. Since U is open, {x} = S ∖ U is closed.
Conversely, suppose that for all x ∈ S, {x} is a closed set. Let x, y ∈ S with x ≠ y, let U = S ∖ {y}, and
let V = S ∖ {x}. Then U and V are open sets such that x ∈ U and y ∉ U and x ∉ V and y ∈ V. So, (S, 𝒯)
is a T₁-space.
□
A topological space (S, 𝒯) is a T₂-space (or Hausdorff space) if for all x, y ∈ S with x ≠ y, there are
U, V ∈ 𝒯 with x ∈ U, y ∈ V, and U ∩ V = ∅.
In the picture on the right we see two typical elements a and b in a T₂-space. We
have drawn disjoint open sets, one including a and the other including b. The smaller
dots in the picture represent some elements of the space other than a and b.
Example 14.7:
1. The discrete topology on any set is a T₂-space. Indeed, if a and b are distinct points from a
discrete space, then {a} and {b} are disjoint open sets with a ∈ {a} and b ∈ {b}.
It should be clear from the definitions that every T₂-space is a T₁-space. It follows that except
for the discrete topology, every other topology on a finite set is not a T₂-space.
2. The topological space (ℝ, 𝒯), where 𝒯 is the cofinite topology on ℝ (see part 3 of Example 14.6)
is not a T₂-space. Indeed, if U and V are open sets containing a, b ∈ ℝ, respectively, then
ℝ ∖ (U ∩ V) = (ℝ ∖ U) ∪ (ℝ ∖ V) (this is De Morgan’s law), which is finite. So, U ∩ V is
infinite, and therefore, nonempty.
3. The standard topologies on ℝ and ℂ are both T₂. The same argument can be used for both
(although the geometry looks very different).
Let S = ℝ or ℂ and let x, y ∈ S with x ≠ y. Let r = ½ d(x, y) = ½ |x − y|. Then U = N_r(x) and
V = N_r(y) are disjoint open sets with x ∈ U and y ∈ V.
In the picture below, we have drawn two typical real numbers x and y on the real line and then
separated them with the disjoint neighborhoods U = N_r(x) and V = N_r(y).
[Figure: the real line with the disjoint open intervals U = N_r(x) and V = N_r(y)]
In the picture to the right, we have drawn two typical complex numbers x and y in the complex
plane and then separated them with the disjoint neighborhoods U = N_r(x) and V = N_r(y).
[Figure: the complex plane with the disjoint open disks U = N_r(x) and V = N_r(y)]
Note once again that neighborhoods on the real line are open intervals, whereas neighborhoods in
the complex plane are open disks.
4. If (S, 𝒯) is a T₂-space and 𝒯′ is finer than 𝒯, then (S, 𝒯′) is also a T₂-space. Indeed, if U, V ∈ 𝒯 with
x ∈ U, y ∈ V, and U ∩ V = ∅, then since 𝒯′ is finer than 𝒯, we have U, V ∈ 𝒯′. Let’s look at an
example of this.
Let K = {1/n | n ∈ ℤ⁺}, ℬ = {(a, b) | a, b ∈ ℝ ∧ a < b} ∪ {(a, b) ∖ K | a, b ∈ ℝ ∧ a < b}. In
Problem 4 below, the reader will be asked to verify that ℬ is a basis for a topology 𝒯_K on ℝ.
Since 𝒯_K contains every basis element of the standard topology on ℝ, we see that 𝒯_K is finer
than the standard topology. It follows that (ℝ, 𝒯_K) is a T₂-space.
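The separation argument in part 3 is constructive, so it can be written as a short routine for the real line. This is a minimal sketch: the names separate and disjoint are hypothetical, and exact rational arithmetic (fractions.Fraction) is used only to avoid floating-point rounding in the illustration.

```python
from fractions import Fraction

def separate(x, y):
    """Return the open intervals N_r(x) and N_r(y) with r = |x - y| / 2."""
    if x == y:
        raise ValueError("the points must be distinct")
    r = abs(x - y) / 2
    return (x - r, x + r), (y - r, y + r)

def disjoint(I, J):
    """Open intervals (a, b) and (c, d) are disjoint iff max{a, c} >= min{b, d}."""
    return max(I[0], J[0]) >= min(I[1], J[1])

x, y = Fraction(1, 3), Fraction(5, 2)
U, V = separate(x, y)
print(U)               # (Fraction(-3, 4), Fraction(17, 12))
print(V)               # (Fraction(17, 12), Fraction(43, 12))
print(disjoint(U, V))  # True
```

The two intervals share the endpoint 17/12, but since both are open, neither contains it.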
A topological space (S, 𝒯) is a T₃-space (or Regular space) if (S, 𝒯) is a T₁-space and for every x ∈ S
and closed set X with X ⊆ S ∖ {x}, there are U, V ∈ 𝒯 with x ∈ U, X ⊆ V, and U ∩ V = ∅.
In the picture on the right we see a typical element a and a closed set K in a
T₃-space. We have drawn disjoint open sets, one including a and the other
containing K. The smaller dots in the picture represent some elements of the
space other than a that are not included in K. (Note that we replaced an arbitrary
closed set X with the specific closed set K, and similarly, we replaced x with a.)
Example 14.8:
1. The discrete topology on any set S is a T₃-space. Indeed, if x ∈ S and A is any subset of S ∖ {x}
(all subsets of S are closed), simply let U = {x} and V = A (all subsets of S are also open).
Some authors call a set clopen if it is both open and closed. If S is given the discrete topology,
then all subsets of S are clopen.
2. Every T₃-space is a T₂-space. This follows easily from the fact that a T₃-space is a T₁-space and
Theorem 14.2. It follows that except for the discrete topology, every other topology on a finite
set is not a T₃-space.
3. The standard topologies on ℝ and ℂ are both T₃. This follows from Problem 14 below.
4. Consider the T₂-space (ℝ, 𝒯_K) from part 4 of Example 14.7. Recall that K = {1/n | n ∈ ℤ⁺} and
(ℝ, 𝒯_K) has basis ℬ = {(a, b) | a, b ∈ ℝ ∧ a < b} ∪ {(a, b) ∖ K | a, b ∈ ℝ ∧ a < b}. Let x = 0
and A = K. ℝ ∖ K = (−∞, 0) ∪ [(−1, 1) ∖ K] ∪ (1, ∞), which is a union of three open sets, thus
open. Therefore, K is a closed set in this topology. Let U be an open set containing 0 and let V
be an open set containing K. For some ε > 0, (0, ε) ∖ K ⊆ U. By the Archimedean Property of
ℝ, there is n ∈ ℕ with n > 1/ε, or equivalently, 1/n < ε. There is 0 < δ ≤ ε − 1/n such that
(1/n − δ, 1/n + δ) ⊆ V. Let r be an irrational number in (1/n, 1/n + δ). Then r ∈ U ∩ V and therefore,
U ∩ V ≠ ∅. Since we cannot separate 0 and K with open sets, (ℝ, 𝒯_K) is not a T₃-space.
Unlike T₀, T₁, and T₂-spaces, T₃-spaces are not closed under upward refinement. In other
words, if (S, 𝒯) is a T₃-space and 𝒯′ is finer than 𝒯, then (S, 𝒯′) is not necessarily a T₃-space.
The topological space (ℝ, 𝒯_K) proves this.
Also, since (ℝ, 𝒯) is T₃, where 𝒯 is the standard topology on ℝ, but (ℝ, 𝒯_K) is not, the two
topological spaces cannot be the same. It follows that 𝒯_K is strictly finer than the standard
topology on ℝ.
A topological space (S, 𝒯) is a T₄-space (or Normal space) if (S, 𝒯) is a T₁-space
and for every pair X, Y of disjoint closed subsets of S, there are U, V ∈ 𝒯 with
X ⊆ U, Y ⊆ V, and U ∩ V = ∅.
In the picture on the right we see two closed sets K and L in a T₄-space. We have
drawn disjoint open sets, one containing K and the other containing L. The
smaller dots in the picture represent some elements of the space not included in
K or L. (Note that we replaced the arbitrary closed sets X and Y with specific closed sets K and L.)
Example 14.9:
1. The discrete topology on any set S is a T₄-space. Indeed, if A and B are disjoint closed subsets
of S, then A and B are also disjoint open subsets of S (because all subsets of S are both open
and closed).
Every T₄-space is a T₃-space. This follows easily from the fact that a T₄-space is a T₁-space and
Theorem 14.2. It follows that except for the discrete topology, every other topology on a finite
set is not a T₄-space.
2. The standard topologies on ℝ and ℂ are both T₄. This follows immediately from Problem 14
below.
3. In Problem 15 below, you will see a T₃-space that is not a T₄-space.
The definitions of T₀, T₁, T₂, T₃, and T₄ are called separation axioms because they all involve
“separating” points and/or closed sets from each other by open sets.
We will now look at two more types of topological spaces that appear frequently in mathematics.
A metric space is a pair (S, d), where S is a set and d is a function d: S × S → ℝ with the following
properties:
1. For all x, y ∈ S, d(x, y) = 0 if and only if x = y.
2. For all x, y ∈ S, d(x, y) = d(y, x).
3. For all x, y, z ∈ S, d(x, z) ≤ d(x, y) + d(y, z).
The function d is called a metric or distance function. It is a consequence of the definition that for all
x, y ∈ S, d(x, y) ≥ 0. You will be asked to prove this in Problem 2 below.
If (S, d) is a metric space, a ∈ S, and r ∈ ℝ⁺, then the open ball centered at a with radius r, written
B_r(a) (or B_r(a; d) if we need to distinguish this metric from other metrics), is the set of all elements
of S whose distance to a is less than r. That is,
B_r(a) = {x ∈ S | d(a, x) < r}.
The collection ℬ = {B_r(a) | a ∈ S ∧ r ∈ ℝ⁺} covers S. Indeed, if a ∈ S, then d(a, a) = 0 < 1, and so,
a ∈ B₁(a).
Also, the collection ℬ = {B_r(a) | a ∈ S ∧ r ∈ ℝ⁺} has the intersection containment property. To see
this, let x ∈ B_r(a) ∩ B_s(b) and k = min{r − d(a, x), s − d(b, x)}.
We have x ∈ B_k(x) because d(x, x) = 0 < k. Now, let y ∈ B_k(x). Then d(x, y) < k. So, we have
d(a, y) ≤ d(a, x) + d(x, y) < d(a, x) + k ≤ d(a, x) + r − d(a, x) = r.
So, y ∈ B_r(a). A similar argument shows that y ∈ B_s(b). So, y ∈ B_r(a) ∩ B_s(b). It follows that
B_k(x) ⊆ B_r(a) ∩ B_s(b).
[Figure: intersecting open balls B_r(a) and B_s(b), with the open ball B_k(x) inside the intersection]
This verifies that ℬ has the intersection containment property.
Since the collection of open balls covers S and has the intersection containment property, it follows
that this collection is a basis for a topology on S.
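The choice k = min{r − d(a, x), s − d(b, x)} can be tried out numerically in ℂ with d(z, w) = |z − w|. The sketch below is only a spot check on sampled points, not a proof; the centers, radii, and the function name inner_radius are hypothetical choices made for illustration.

```python
import cmath
import random

def d(z, w):
    """The usual distance on ℂ."""
    return abs(z - w)

def inner_radius(a, r, b, s, x):
    """k = min{r - d(a, x), s - d(b, x)} for a point x in both balls."""
    if not (d(a, x) < r and d(b, x) < s):
        raise ValueError("x must lie in both open balls")
    return min(r - d(a, x), s - d(b, x))

a, r = 0j, 2.0
b, s = 2 + 1j, 2.0
x = 1 + 0.5j
k = inner_radius(a, r, b, s, x)

# Sample points y with d(x, y) < k, staying slightly inside the ball
# so that floating-point rounding cannot push a sample onto the boundary.
random.seed(0)
samples = [
    x + cmath.rect(0.99 * k * random.random(), random.uniform(0, 2 * cmath.pi))
    for _ in range(1000)
]
print(all(d(a, y) < r and d(b, y) < s for y in samples))  # True
```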
Note: Open balls can be visualized as open intervals on the real line ℝ, open disks in the Complex Plane
ℂ (or ℝ²), or open balls in three-dimensional space ℝ³.
When proving theorems about metric spaces, it’s usually most useful to visualize open balls as open
disks in ℂ. This does not mean that all metric spaces look like ℂ. The visualization should be used as
evidence that a theorem might be true. Of course, a detailed proof still needs to be written.
This is exactly what we did when we drew the picture above. That picture represents the open balls
B_r(a) and B_s(b) as intersecting open disks. Inside this intersection, we can see the open ball B_k(x).
The reader may also want to draw another picture to help visualize the triangle inequality. A picture
similar to this is drawn to the right of Note 1 following the proof of Theorem 7.4 in Lesson 7.
A topological space (S, 𝒯) is metrizable if there is a metric d: S × S → ℝ such that 𝒯 is generated from
the open balls in (S, d). We also say that the metric d induces the topology 𝒯.
Example 14.10:
1. (ℂ, d) is a metric space, where d: ℂ × ℂ → ℝ is defined by d(z, w) = |z − w|. Let’s check that
the 3 properties of a metric space are satisfied. Property 3 is the Triangle Inequality (Theorem
7.3 and Problem 4 in Problem Set 7). Let’s verify the other two properties. Let z = a + bi and
w = c + di. Then d(z, w) = |z − w| = √((a − c)² + (b − d)²). So, d(z, w) = 0 if and only if
√((a − c)² + (b − d)²) = 0 if and only if (a − c)² + (b − d)² = 0 if and only if a − c = 0 and
b − d = 0 if and only if a = c and b = d if and only if z = w. So, property 1 holds. We have
d(z, w) = |z − w| = |−(w − z)| = |−1(w − z)| = |−1||w − z| = 1|w − z| = d(w, z).
Therefore, property 2 holds.
If z ∈ ℂ and r ∈ ℝ⁺, then the open ball B_r(z) is the set B_r(z) = {w ∈ ℂ | |z − w| < r}. This is
just an open disk in the complex plane, as we defined in Lesson 7.
Since the collection of open disks in the complex plane generates the standard topology on ℂ,
we see that ℂ with the standard topology is a metrizable space.
2. Similarly, (ℝ, d) is a metric space, where d: ℝ × ℝ → ℝ is defined by d(x, y) = |x − y|. The
proof is similar to the proof above for (ℂ, d).
In this case, the open ball B_r(a) is the open interval (a − r, a + r). To see this, observe that we
have
B_r(a) = {x ∈ ℝ | |x − a| < r} = {x ∈ ℝ | −r < x − a < r}
= {x ∈ ℝ | a − r < x < a + r} = (a − r, a + r).
Since the collection of bounded open intervals of real numbers generates the standard topology
on ℝ, we see that ℝ with the standard topology is a metrizable space.
3. Define the functions d₁ and d₂ from ℂ × ℂ to ℝ by d₁(z, w) = |Re z − Re w| + |Im z − Im w|
and d₂(z, w) = max{|Re z − Re w|, |Im z − Im w|}. In Problem 7 below, you will be asked to
verify that (ℂ, d₁) and (ℂ, d₂) are metric spaces that induce the standard topology on ℂ.
So, we see that a metrizable space can be induced by many different metrics.
The open balls B_r(a; d₁) and B_r(a; d₂) are both interiors of squares. For example, the unit open
ball in the metric d₁ is B₁(0; d₁) = {w ∈ ℂ | d₁(0, w) < 1} = {w ∈ ℂ | |Re w| + |Im w| < 1},
which is the interior of a square with vertices 1, i, −1, and −i. Similarly, the unit open ball in the
metric d₂ is B₁(0; d₂) = {w ∈ ℂ | d₂(0, w) < 1} = {w ∈ ℂ | max{|Re w|, |Im w|} < 1}, which
is the interior of a square with vertices 1 + i, −1 + i, −1 − i, and 1 − i.
[Figure: the unit open balls B₁(0; d₁) and B₁(0; d₂), each centered at 0. Marked points: ½ + ½i,
7/8 + i with max{7/8, 1} = 1, and 1 + ¾i with max{1, ¾} = 1]
4. We can turn any nonempty set S into a metric space by defining d: S × S → ℝ by
d(x, y) = 0 if x = y, and d(x, y) = 1 if x ≠ y.
Properties 1 and 2 are obvious. For Property 3, let x, y, z ∈ S. If x = z, then d(x, z) = 0, and
so, d(x, z) = 0 ≤ d(x, y) + d(y, z). If x ≠ z, then d(x, z) = 1. Also, y cannot be equal to both
x and z (otherwise y = x ∧ y = z → x = z). So, d(x, y) = 1 or d(y, z) = 1 (or both).
Therefore, d(x, y) + d(y, z) ≥ 1 = d(x, z).
If r > 1, then B_r(x) = S and if 0 < r ≤ 1, then B_r(x) = {x}. It follows that every singleton set
{x} is open and therefore, (S, d) induces the discrete topology on S.
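The metrics from parts 3 and 4 are easy to experiment with. The sketch below spot-checks the three metric properties on a small grid of complex numbers and on a three-element set; it does not replace the proofs. The function names (d1, d2, discrete, is_metric_on) and the sample points are hypothetical, and the grid uses only multiples of ½ so that the floating-point arithmetic is exact.

```python
def d1(z, w):
    return abs(z.real - w.real) + abs(z.imag - w.imag)

def d2(z, w):
    return max(abs(z.real - w.real), abs(z.imag - w.imag))

def discrete(x, y):
    return 0 if x == y else 1

def is_metric_on(points, d):
    """Check properties 1, 2, and 3 of a metric on a finite list of points."""
    return all(
        (d(x, y) == 0) == (x == y)
        and d(x, y) == d(y, x)
        and all(d(x, z) <= d(x, y) + d(y, z) for z in points)
        for x in points for y in points
    )

grid = [complex(p / 2, q / 2) for p in range(-2, 3) for q in range(-2, 3)]
print(is_metric_on(grid, d1), is_metric_on(grid, d2), is_metric_on("abc", discrete))
# True True True

# Membership in the unit open balls centered at 0:
print(d1(0, 0.5 + 0.25j) < 1, d1(0, 0.5 + 0.5j) < 1)     # True False
print(d2(0, 0.875 + 0.75j) < 1, d2(0, 0.875 + 1j) < 1)   # True False
```

The points ½ + ½i and 7/8 + i lie on the boundaries of the respective squares, so they are excluded from the open balls.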
Let (S, 𝒯) be a topological space. A collection 𝒞 of subsets of S is a covering of S (or we can say that 𝒞
covers S) if ⋃𝒞 = S. If 𝒞 consists of only open sets, then we will say that 𝒞 is an open covering of S.
A topological space (S, 𝒯) is compact if every open covering of S contains a finite subcollection that
covers S.
Example 14.11:
1. If S is a finite set, then for any topology 𝒯 on S, (S, 𝒯) is compact. After all, any open covering
of S is already finite.
2. If S is an infinite set and 𝒯 is the discrete topology on S, then (S, 𝒯) is not compact. Indeed,
{{x} | x ∈ S} is an open covering of S with no finite subcollection covering S.
3. (ℝ, 𝒯), where 𝒯 is the standard topology on ℝ, is not compact. Indeed, {(n, n + 2) | n ∈ ℤ} is
an open covering of ℝ with no finite subcollection covering ℝ.
4. The topological space (โ, ๐ฏ), where ๐ฏ is the cofinite topology on โ (see part 3 of Example 14.6)
is compact. To see this, let ๐ be an open covering of โ, and let ๐ด0 be any set in ๐. Then โ โ ๐ด0
is finite, say โ โ ๐ด0 = {๐1 , ๐2 , … , ๐๐ }. For each ๐ = 1, 2, … , ๐, let ๐ด๐ ∈ ๐ with ๐๐ ∈ ๐ด๐ . Then the
collection {๐ด0 , ๐ด1 , ๐ด2 , … , ๐ด๐ } is a finite subcollection from ๐ that covers โ.
There is actually nothing special about โ in this example. If ๐ is any set, we can define the
cofinite topology on ๐ to be the topology ๐ฏ generated from the basis {๐ ⊆ ๐ | ๐ โ ๐ is finite}.
If we replace โ by ๐ in the argument above, we see that the topological space (๐, ๐ฏ) is compact.
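The argument in part 4 of Example 14.11 is constructive, so it can be sketched in code. The Python below is a hedged illustration under stated assumptions: each nonempty open set of the cofinite topology is encoded by its finite complement (a `frozenset`), a finite list stands in for a covering that may be infinite in general, and the names `finite_subcover` and `covers_everything` are hypothetical.

```python
def finite_subcover(cover):
    """Pick a finite subcover from an open covering in the cofinite topology.

    Each open set is encoded by its finite complement (a frozenset of points).
    """
    cover = list(cover)
    a0 = cover[0]                 # any set A_0 in the covering
    chosen = [a0]
    for point in a0:              # the finitely many points that A_0 misses
        # Choose some set in the covering that contains this point,
        # that is, a set whose complement does not contain it.
        chosen.append(next(c for c in cover if point not in c))
    return chosen


def covers_everything(sets):
    """The sets cover the whole space exactly when no point is missed by all of them."""
    return not frozenset.intersection(*sets)


# Hypothetical covering, listed by complements.
cover = [frozenset({1, 2, 3}), frozenset({1}), frozenset({2, 5}),
         frozenset({3, 7}), frozenset({9})]
subcover = finite_subcover(cover)
print(len(subcover), covers_everything(subcover))
```

The subcover has at most one more set than the number of points missed by the first set chosen, which is the count that appears in the argument above.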
Continuous Functions and Homeomorphisms
If $f: X \to Y$ and $A \subseteq X$, then the image of $A$ under $f$ is the set $f[A] = \{f(x) \mid x \in A\}$. Similarly, if $B \subseteq Y$, then the inverse image of $B$ under $f$ is the set $f^{-1}[B] = \{x \in X \mid f(x) \in B\}$.
Let $(X, \mathcal{T})$ and $(Y, \mathcal{U})$ be topological spaces. A function $f: X \to Y$ is continuous if for each $V \in \mathcal{U}$, we have $f^{-1}[V] \in \mathcal{T}$.
Notes: (1) In words, a function from one topological space to another is continuous if the inverse image
of each open set is open.
(2) Continuity of a function may depend just as much on the two given topologies as it does on the
function ๐.
(3) As an example of Note 2, if ๐ is given the discrete topology, then any function ๐: ๐ → ๐ is
continuous. After all, every subset of ๐ is open in ๐, and therefore every subset of ๐ of the form
๐ −1 [๐], where ๐ is an open set in ๐, is open in ๐.
(4) As another example, if ๐ = {๐, ๐} is given the trivial topology, and ๐ = ๐ = {๐, ๐} is given the
discrete topology, then the identity function ๐๐ : ๐ → ๐ is not continuous. To see this, just note that {๐}
is open in ๐ (because every subset of ๐ is open), but ๐๐−1 ({๐}) = {๐} is not open in ๐ (because {๐} ≠ ∅
and {๐} ≠ ๐).
(5) Constant functions are always continuous. Indeed, let ๐ ∈ ๐ and suppose that ๐: ๐ → ๐ is defined
by ๐(๐ฅ) = ๐ for all ๐ฅ ∈ ๐. Let ๐ต ⊆ ๐. If ๐ ∈ ๐ต, then ๐ −1 [๐ต] = ๐ and if ๐ ∉ ๐ต, then ๐ −1 [๐ต] = ∅. Since
๐ and ∅ are open in any topology on ๐, ๐ is continuous.
(6) If โฌ is a basis for ๐ฐ, then to determine if ๐ is continuous, we need only check that for each ๐ ∈ โฌ,
we have ๐ −1 [๐] ∈ ๐ฏ. To see this, assume that for each ๐ ∈ โฌ, we have ๐ −1 [๐] ∈ ๐ฏ, and let ๐ ∈ ๐ฐ.
Since โฌ is a basis for ๐ฐ, ๐ = โ๐ฟ, for some subset ๐ฟ of โฌ. So, ๐ −1 [๐] = ๐ −1 [โ๐ฟ] = โ{๐ −1 [๐] | ๐ ∈ ๐}
(by part (ii) of Problem 1 below). Since ๐ฏ is a topology, it is closed under taking arbitrary unions, and
therefore, โ{๐ −1 [๐] | ๐ ∈ ๐} ∈ ๐ฏ.
Similarly, if ๐ฎ is a subbasis for ๐ฐ, then to determine if ๐ is continuous, we need only check that for each
๐ ∈ ๐ฎ, we have ๐ −1 [๐] ∈ ๐ฏ. To see this, let’s assume that for each ๐ ∈ ๐ฎ, we have ๐ −1 [๐] ∈ ๐ฏ and let
โฌ be the collection of all finite intersections of sets in ๐ฎ. Then โฌ is a basis for ๐ฐ. Let ๐ด ∈ โฌ. Then
๐ด = โ๐ฟ for some finite subset ๐ฟ of ๐ฎ. So, ๐ −1 [๐ด] = ๐ −1 [โ๐ฟ] = โ{๐ −1 [๐] | ๐ ∈ ๐} (Check this!).
Since ๐ฏ is a topology, it is closed under taking finite intersections, and so, โ{๐ −1 [๐] | ๐ ∈ ๐} ∈ ๐ฏ.
Example 14.12:
1. Let $(A, \mathcal{T})$ and $(B, \mathcal{U})$ be the topological spaces with sets $A = \{a, b\}$ and $B = \{1, 2, 3\}$ and topologies $\mathcal{T} = \{\emptyset, \{a\}, \{a, b\}\}$ and $\mathcal{U} = \{\emptyset, \{1, 2\}, \{1, 2, 3\}\}$. The function $f: A \to B$ defined by $f(a) = 1$ and $f(b) = 3$ is continuous because $f^{-1}[\{1, 2\}] = \{a\}$, which is open in $(A, \mathcal{T})$. On the other hand, the function $g: A \to B$ defined by $g(a) = 3$ and $g(b) = 1$ is not continuous because $g^{-1}[\{1, 2\}] = \{b\}$, which is not open in $(A, \mathcal{T})$. We can visualize these two functions as follows:
[Figure: arrow diagrams of the functions $f$ and $g$ from $A$ to $B$.]
2. Consider (โ, ๐ฏ) and (โ, ๐ฐ), where ๐ฏ is the standard topology on โ and ๐ฐ is the topology
generated by the basis {(๐, ∞) | ๐ ∈ โ}. To avoid confusion, let’s use the notation โ๐ฏ and โ๐ฐ
to indicate that we are considering โ with the topologies ๐ฏ and ๐ฐ, respectively. The identity
function ๐1 : โ๐ฏ → โ๐ฐ is continuous because ๐1−1 [(๐, ∞)] = (๐, ∞) is open in (โ, ๐ฏ) for every
๐ ∈ โ. However, the identity function ๐2 : โ๐ฐ → โ๐ฏ is not continuous because (0, 1) is open in
(โ, ๐ฏ), but ๐2−1 [(0, 1)] = (0, 1) is not open in (โ, ๐ฐ).
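3. Consider $(\mathbb{R}, \mathcal{T})$ and $(S, \mathcal{U})$, where $\mathcal{T}$ is the standard topology on $\mathbb{R}$, $S = \{a, b, c\}$, and $\mathcal{U}$ is the topology $\{\emptyset, \{a\}, \{a, b\}, \{a, b, c\}\}$. The function $f: \mathbb{R} \to S$ defined by
$$f(x) = \begin{cases} b & \text{if } x < 0 \\ c & \text{if } x \geq 0 \end{cases}$$
is continuous because $f^{-1}[\{a\}] = \emptyset$ and $f^{-1}[\{a, b\}] = (-\infty, 0)$ are both open in $(\mathbb{R}, \mathcal{T})$.
If we replace the topology $\mathcal{U}$ by the topology $\mathcal{V} = \{\emptyset, \{c\}, \{a, b, c\}\}$, then the same function $f$ is not continuous because $f^{-1}[\{c\}] = [0, \infty)$, which is not open in $(\mathbb{R}, \mathcal{T})$.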
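For finite spaces, the definition of continuity can be tested mechanically by computing every inverse image. The sketch below uses the data from part 1 of Example 14.12 (sets {a, b} and {1, 2, 3}, with f(a) = 1, f(b) = 3, g(a) = 3, g(b) = 1). The encoding of functions as dictionaries and the names `preimage` and `is_continuous` are assumptions made for illustration.

```python
def preimage(f, domain, target_subset):
    """Inverse image of `target_subset` under f, where f is a dict."""
    return frozenset(x for x in domain if f[x] in target_subset)


def is_continuous(f, domain, topology_dom, topology_cod):
    """f is continuous iff the inverse image of every open set is open."""
    return all(preimage(f, domain, v) in topology_dom for v in topology_cod)


A = frozenset({"a", "b"})
T = {frozenset(), frozenset({"a"}), A}                       # topology on A
U = {frozenset(), frozenset({1, 2}), frozenset({1, 2, 3})}   # topology on B

f = {"a": 1, "b": 3}
g = {"a": 3, "b": 1}

print(is_continuous(f, A, T, U))
print(is_continuous(g, A, T, U))
print(sorted(preimage(g, A, {1, 2})))
```

The last line prints the inverse image of {1, 2} under g, which is the set that fails to be open in the example.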
Let $(X, \mathcal{T})$ and $(Y, \mathcal{U})$ be topological spaces. A function $f: X \to Y$ is continuous at $x \in X$ if for each $V \in \mathcal{U}$ with $f(x) \in V$, there is $U \in \mathcal{T}$ with $x \in U$ such that $f[U] \subseteq V$.
Example 14.13:
1. Consider the functions ๐ and ๐ from part 1 of Example 14.12. They are pictured below.
Let’s check that ๐ is continuous at ๐. There are two open sets containing ๐(๐) = 1. The first
one is {1, 2}. The set {๐} is open and ๐[{๐}] = {1} ⊆ {1, 2}. The second open set containing 1
is {1, 2, 3}. We can use the open set {๐} again because ๐[{๐}] = {1} ⊆ {1,2,3}. Alternatively,
we can use the open set {๐, ๐} because ๐[{๐, ๐}] = {1, 3} ⊆ {1, 2, 3}.
Let’s also check that ๐ is continuous at ๐. The only open set containing ๐(๐) = 3 is {1, 2, 3}. We
have ๐ ∈ {๐, ๐} and ๐[{๐, ๐}] = {1, 3} ⊆ {1, 2, 3}.
The function ๐ is continuous at ๐ because the only open set containing ๐(๐) = 3 is {1, 2, 3} and
we have ๐ ∈ {๐} and ๐[{๐}] = {3} ⊆ {1, 2, 3}.
The function $g$ is not continuous at $b$. The open set $\{1, 2\}$ contains $g(b) = 1$. However, the only open set containing $b$ is $\{a, b\}$ and $g[\{a, b\}] = \{1, 3\} \nsubseteq \{1, 2\}$.
2. Define $h: \mathbb{R} \to \mathbb{R}$ by
$$h(x) = \begin{cases} x & \text{if } x < 0 \\ x + 1 & \text{if } x \geq 0. \end{cases}$$
Then $h$ is not continuous at 0. To see this, note that $h(0) = 1 \in (0, 2)$ and if $0 \in (c, d)$, then $h[(c, d)] = (c, 0) \cup [1, d + 1) \nsubseteq (0, 2)$ because $\frac{c}{2} \in (c, 0)$, so that $\frac{c}{2} < 0$, and therefore, $\frac{c}{2} \notin (0, 2)$.
If $a > 0$, then $h$ is continuous at $a$. To see this, let $(c, d)$ be an open interval containing $h(a) = a + 1$. Then $c < a + 1 < d$, and so, $c - 1 < a < d - 1$. Let $e = \max\{0, c - 1\}$. Then we have $e < a < d - 1$. So, $a \in (e, d - 1)$. Since $e \geq 0$, every element of $(e, d - 1)$ is positive, and so, $h[(e, d - 1)] = (e + 1, d)$. We now show that $(e + 1, d) \subseteq (c, d)$. Let $y \in (e + 1, d)$. Then $e + 1 < y < d$. Since $e \geq c - 1$, $e + 1 \geq c$. Thus, $c < y < d$, and therefore, $y \in (c, d)$. It follows that $h[(e, d - 1)] \subseteq (c, d)$.
Also, if $a < 0$, then $h$ is continuous at $a$. To see this, let $(c, d)$ be an open interval containing $h(a) = a$. Then $c < a < d$. Let $e = \min\{0, d\}$. Then we have $c < a < e$. So, $a \in (c, e)$. Finally, note that $h[(c, e)] = (c, e) \subseteq (c, d)$.
We will see in Theorem 14.4 below that if $f: \mathbb{R} \to \mathbb{R}$, where $\mathbb{R}$ is given the standard topology, then the topological definition of continuity here agrees with all the equivalent definitions of continuity from Lesson 13.
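For the finite spaces in part 1 of Example 14.13, continuity at a point can be checked by searching over all open sets. The sketch below is an illustration only: it re-encodes the sets {a, b} and {1, 2, 3} and the functions f and g from Example 14.12, and the helper names `image` and `is_continuous_at` are hypothetical.

```python
def image(f, subset):
    """Image of `subset` under f, where f is a dict."""
    return frozenset(f[x] for x in subset)


def is_continuous_at(f, x, topology_dom, topology_cod):
    """Check: for every open V containing f(x), some open U containing x has f[U] inside V."""
    return all(
        any(x in u and image(f, u) <= v for u in topology_dom)
        for v in topology_cod
        if f[x] in v
    )


A = frozenset({"a", "b"})
T = {frozenset(), frozenset({"a"}), A}
U = {frozenset(), frozenset({1, 2}), frozenset({1, 2, 3})}
f = {"a": 1, "b": 3}
g = {"a": 3, "b": 1}

for name, func in (("f", f), ("g", g)):
    for point in sorted(A):
        print(name, point, is_continuous_at(func, point, T, U))
```

The output agrees with the four checks carried out by hand: f is continuous at both points, while g is continuous at a but not at b.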
Theorem 14.3: Let (๐, ๐ฏ) and (๐, ๐ฐ) be topological spaces and let ๐: ๐ → ๐. Then ๐ is continuous if
and only if ๐ is continuous at each ๐ฅ ∈ ๐.
Proof: Let (๐, ๐ฏ) and (๐, ๐ฐ) be topological spaces and let ๐: ๐ → ๐. First, suppose that ๐ is continuous.
Let ๐ฅ ∈ ๐ and let ๐ ∈ ๐ฐ with ๐(๐ฅ) ∈ ๐. Since ๐ is continuous, ๐ −1 [๐] ∈ ๐ฏ. If we let ๐ = ๐ −1 [๐], then
by part (i) of Problem 1 below, we have ๐[๐] = ๐[๐ −1 [๐]] ⊆ ๐.
Conversely, suppose that ๐ is continuous at each ๐ฅ ∈ ๐. Let ๐ ∈ ๐ฐ. If ๐ −1 [๐] = ∅, then ๐ −1 [๐] ∈ ๐ฏ
because every topology contains the empty set. If ๐ −1 [๐] ≠ ∅, let ๐ฅ ∈ ๐ −1 [๐]. Then ๐(๐ฅ) ∈ ๐. So,
there is ๐๐ฅ ∈ ๐ฏ with ๐ฅ ∈ ๐๐ฅ such that ๐[๐๐ฅ ] ⊆ ๐. Let ๐ = โ{๐๐ฅ | ๐ฅ ∈ ๐ −1 [๐]}. Since ๐ is a union of
open sets, ๐ ∈ ๐ฏ. We will show that ๐ = ๐ −1 [๐]. Let ๐ง ∈ ๐. Then there is ๐ฅ ∈ ๐ with ๐ง ∈ ๐๐ฅ . So, we
have ๐(๐ง) ∈ ๐[๐๐ฅ ]. Since ๐[๐๐ฅ ] ⊆ ๐, ๐(๐ง) ∈ ๐. Thus, ๐ง ∈ ๐ −1 [๐]. Since ๐ง ∈ ๐ was arbitrary, we have
shown that ๐ ⊆ ๐ −1 [๐]. Now, let ๐ง ∈ ๐ −1 [๐]. Then ๐(๐ง) ∈ ๐. So, ๐ง ∈ ๐๐ง . Since ๐๐ง ⊆ ๐, we have
๐ง ∈ ๐. Since ๐ง ∈ ๐ −1 [๐] was arbitrary, we have shown that ๐ −1 [๐] ⊆ ๐. Since ๐ ⊆ ๐ −1 [๐] and
๐ −1 [๐] ⊆ ๐, we have ๐ = ๐ −1 [๐].
□
We now give an ๐ − ๐ฟ definition of continuity for metrizable topological spaces.
Theorem 14.4: Let $(X, \mathcal{T})$ and $(Y, \mathcal{U})$ be metrizable topological spaces where $\mathcal{T}$ and $\mathcal{U}$ are induced by the metrics $d$ and $\rho$, respectively. $f: X \to Y$ is continuous at $x \in X$ if and only if for all $\epsilon > 0$ there is $\delta > 0$ such that $d(x, y) < \delta$ implies $\rho(f(x), f(y)) < \epsilon$.
Proof: Let $(X, \mathcal{T})$ and $(Y, \mathcal{U})$ be topological spaces with corresponding metrics $d$ and $\rho$ and let $x \in X$.
First, suppose that $f: X \to Y$ is continuous at $x \in X$ and let $\epsilon > 0$. $f(x) \in B_\epsilon(f(x))$ and $B_\epsilon(f(x))$ is open in $\mathcal{U}$. Since $f$ is continuous at $x$, there is $U \in \mathcal{T}$ with $x \in U$ such that $f[U] \subseteq B_\epsilon(f(x))$. Since the open balls form a basis for $\mathcal{T}$, we can find $\delta > 0$ such that $B_\delta(x) \subseteq U$ (Why?). It follows that $f[B_\delta(x)] \subseteq f[U]$ and so, $f[B_\delta(x)] \subseteq B_\epsilon(f(x))$. Now, if $d(x, y) < \delta$, then $y \in B_\delta(x)$. So, $f(y) \in f[B_\delta(x)]$. Since $f[B_\delta(x)] \subseteq B_\epsilon(f(x))$, we have $f(y) \in B_\epsilon(f(x))$. So, $\rho(f(x), f(y)) < \epsilon$.
Conversely, suppose that for all $\epsilon > 0$ there is $\delta > 0$ such that $d(x, y) < \delta$ implies $\rho(f(x), f(y)) < \epsilon$. Let $V \in \mathcal{U}$ with $f(x) \in V$. Since the open balls form a basis for $\mathcal{U}$, there is $\epsilon > 0$ such that $f(x) \in B_\epsilon(f(x))$ and $B_\epsilon(f(x)) \subseteq V$ (Why?). Choose $\delta > 0$ such that $d(x, y) < \delta$ implies $\rho(f(x), f(y)) < \epsilon$. Let $U = B_\delta(x)$. Then $U \in \mathcal{T}$ and $x \in U$. We show that $f[U] \subseteq V$. Let $y \in f[U]$. Then there is $z \in U$ with $y = f(z)$. Since $z \in U = B_\delta(x)$, $d(x, z) < \delta$. Therefore, $\rho(f(x), f(z)) < \epsilon$. So, $f(z) \in B_\epsilon(f(x))$. Since $B_\epsilon(f(x)) \subseteq V$, $f(z) \in V$. Since $y = f(z)$, we have $y \in V$, as desired. □
Note: If we consider a function ๐: โ → โ with the metric ๐(๐ฅ, ๐ฆ) = |๐ฅ − ๐ฆ|, Theorem 14.4 shows that
all our definitions of continuity given in Lesson 13 are equivalent to the topological definitions given
here.
Let $(X, \mathcal{T})$ and $(Y, \mathcal{U})$ be topological spaces. A function $f: X \to Y$ is a homeomorphism if $f$ is a bijection such that $U \in \mathcal{T}$ if and only if $f[U] \in \mathcal{U}$.
Notes: (1) If ๐: ๐ → ๐ is a bijection, then every subset ๐ ⊆ ๐ can be written as ๐[๐] for exactly one
subset ๐ ⊆ ๐. If ๐ is also continuous, then given ๐ ⊆ ๐ with ๐[๐] ∈ ๐ฐ, we have ๐ = ๐ −1 [๐[๐]] ∈ ๐ฏ.
Conversely, suppose that ๐ is a bijection such that for every subset ๐ of ๐, ๐[๐] ∈ ๐ฐ implies ๐ ∈ ๐ฏ.
Then, given ๐ ∈ ๐ฐ, since there is ๐ ⊆ ๐ with ๐ = ๐[๐], by our assumption, we have
๐ −1 [๐] = ๐ −1 [๐[๐]] = ๐ ∈ ๐ฏ, showing that ๐ is continuous. It follows that ๐ is a continuous bijection
if and only if ๐ is a bijection such that ∀๐ ⊆ ๐(๐[๐] ∈ ๐ฐ → ๐ ∈ ๐ฏ).
(2) Similarly, ๐: ๐ → ๐ is a bijective function with continuous inverse ๐ −1 : ๐ → ๐ if and only if ๐ is a
bijection such that ∀๐ ⊆ ๐(๐ ∈ ๐ฏ → ๐[๐] ∈ ๐ฐ).
(3) Notes 1 and 2 tell us that ๐: ๐ → ๐ is a homeomorphism if and only if ๐ is a continuous bijective
function with a continuous inverse.
(4) Since a homeomorphism is bijective, it provides a one to one correspondence between the elements
of ๐ and the elements of ๐. However, a homeomorphism does much more than this. It also provides a
one to one correspondence between the sets in ๐ฏ and the sets in ๐ฐ.
(5) A homeomorphism between two topological spaces is analogous to an isomorphism between two algebraic structures (see Lesson 11). From the topologist’s point of view, if there is a homeomorphism from one space to another, the two topological spaces are indistinguishable.
We say that two topological spaces (๐, ๐ฏ) and (๐, ๐ฐ) are homeomorphic or topologically equivalent
if there is a homeomorphism ๐: ๐ → ๐.
Example 14.14:
1. Let ๐ = {๐, ๐}, ๐ฏ = {∅, {๐}, {๐, ๐}}, and ๐ฐ = {∅, {๐}, {๐, ๐}}. The map ๐: ๐ → ๐ defined by
๐(๐) = ๐ and ๐(๐) = ๐ is a homeomorphism from (๐, ๐ฏ) to (๐, ๐ฐ). Notice that the inverse
image of the open set {๐} ∈ ๐ฐ is the open set {๐} ∈ ๐ฏ. This shows that ๐ is continuous.
Conversely, the image of the open set {๐} ∈ ๐ฏ is the open set {๐} ∈ ๐ฐ. This shows that ๐ −1 is
continuous. Since ๐ is also a bijection, we have shown that ๐ is a homeomorphism. On the other
hand, the identity function ๐: ๐ → ๐ defined by ๐(๐) = ๐ and ๐(๐) = ๐ is not a
homeomorphism because it is not continuous. For example, the inverse image of the open set
{๐} ∈ ๐ฐ is the set {๐} which is not in the topology ๐ฏ. We can visualize these two functions as
follows:
Notice that ๐ and ๐ are both bijections from ๐ to ๐, but only the function ๐ also gives a one to
one correspondence between the open sets of the topology (๐, ๐ฏ) and the open sets of the
topology (๐, ๐ฐ).
The homeomorphism ๐ shows that (๐, ๐ฏ) and (๐, ๐ฐ) are topologically equivalent. So, up to
topological equivalence, there are only three topologies on a set with two elements: the trivial
topology, the discrete topology, and the topology with exactly three open sets.
2. Let ๐ = {๐, ๐, ๐}, ๐ฏ = {∅, {๐}, {๐, ๐}, {๐, ๐, ๐}}, and ๐ฐ = {∅, {๐, ๐}, {๐, ๐, ๐}}. Then the identity
function ๐: ๐ → ๐ is a continuous bijection from (๐, ๐ฏ) to (๐, ๐ฐ). Indeed, the inverse image of
the open set {๐, ๐} ∈ ๐ฐ is the open set {๐, ๐} ∈ ๐ฏ. However, ๐ is not a homeomorphism
because ๐ −1 is not continuous. The set {๐} is open in ๐ฏ, but its image ๐[{๐}] = {๐} is not open
in ๐ฐ.
3. We saw in part 3 of Example 14.1 that there are 29 topologies on a set with three elements.
However, up to topological equivalence, there are only 9. Below is a visual representation of
the 9 distinct topologies on the set ๐ = {๐, ๐, ๐}, up to topological equivalence.
The dedicated reader should verify that each of the other 20 topologies is topologically equivalent to one of these and that no two topologies displayed here are topologically equivalent.
4. Consider $\mathbb{R}$ together with the standard topology. Define $f: \mathbb{R} \to \mathbb{R}$ by $f(x) = 2x + 3$. Let’s check that $f$ is a homeomorphism. If $x \neq y$, then $2x \neq 2y$, and so, $2x + 3 \neq 2y + 3$. Therefore, $\forall x, y \in \mathbb{R}(x \neq y \to f(x) \neq f(y))$. That is, $f$ is injective. Next, if $y \in \mathbb{R}$, let $x = \frac{y - 3}{2}$. Then $f(x) = f\left(\frac{y - 3}{2}\right) = 2\left(\frac{y - 3}{2}\right) + 3 = (y - 3) + 3 = y$. So, $\forall y \in \mathbb{R}\, \exists x \in \mathbb{R}(f(x) = y)$. That is, $f$ is surjective. Now, let $(a, b)$ be a bounded open interval. $f^{-1}[(a, b)] = \left(\frac{a - 3}{2}, \frac{b - 3}{2}\right)$, which is open. Since the bounded open intervals form a basis for the standard topology, $f$ is continuous. Also, $f[(a, b)] = (2a + 3, 2b + 3)$, which is open. So, $f^{-1}$ is continuous. Since $f$ is a continuous bijection with a continuous inverse, $f$ is a homeomorphism.
5. Consider (โ, ๐ฏ) and (โ, ๐ฐ), where ๐ฏ is the standard topology on โ and ๐ฐ is the topology
generated by the basis {(๐, ∞) | ๐ ∈ โ}. We saw in part 2 of Example 14.12 that the identity
function ๐: โ๐ฏ → โ๐ฐ is continuous because ๐ −1 [(๐, ∞)] = (๐, ∞) is open in (โ, ๐ฏ) for every
๐ ∈ โ. However, this function is not a homeomorphism because ๐ −1 is not continuous. For
example, (0, 1) is open in (โ, ๐ฏ), but ๐[(0, 1)] = (0, 1) is not open in (โ, ๐ฐ).
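The counts quoted in part 3 of Example 14.14 (29 topologies on a three-element set, and 9 up to topological equivalence) can be reproduced by exhaustive search. The Python below is a sketch under stated assumptions: the three points are labeled 0, 1, 2, two topologies on the same set are treated as equivalent when some bijection of the set carries one onto the other, and all function names are hypothetical.

```python
from itertools import combinations, permutations

POINTS = (0, 1, 2)  # stand-ins for the three elements of the set


def powerset(points):
    return [frozenset(c) for r in range(len(points) + 1)
            for c in combinations(points, r)]


def is_topology(family, whole):
    """On a finite set, closure under pairwise unions and intersections suffices."""
    if frozenset() not in family or whole not in family:
        return False
    return all((u | v) in family and (u & v) in family
               for u in family for v in family)


def all_topologies(points):
    whole = frozenset(points)
    subsets = powerset(points)
    found = []
    for r in range(len(subsets) + 1):
        for family in combinations(subsets, r):
            family = frozenset(family)
            if is_topology(family, whole):
                found.append(family)
    return found


def push_forward(topology, bijection):
    """Image of every open set under the bijection (given as a dict)."""
    return frozenset(frozenset(bijection[x] for x in u) for u in topology)


def homeomorphism_classes(points):
    remaining = set(all_topologies(points))
    classes = []
    while remaining:
        t = remaining.pop()
        orbit = {push_forward(t, dict(zip(points, p)))
                 for p in permutations(points)}
        remaining -= orbit
        classes.append(orbit)
    return classes


print(len(all_topologies(POINTS)))
print(len(homeomorphism_classes(POINTS)))
```

Grouping by bijections is exactly the homeomorphism condition: a bijection is a homeomorphism when it carries the open sets of one topology onto the open sets of the other.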
A topological property or topological invariant is a property that is preserved under homeomorphisms.
More specifically, we say that property ๐ is a topological property if whenever the topological space
(๐, ๐ฏ) has property ๐ and (๐, ๐ฐ) is topologically equivalent to (๐, ๐ฏ), then (๐, ๐ฐ) also has property ๐.
In Problem 5 below, you will be asked to show that compactness is a topological property. As another
example, let’s show that the property of being a ๐2 -space is a topological property.
Theorem 14.5: Let (๐, ๐ฏ) be a ๐2 -space and let (๐, ๐ฐ) be topologically equivalent to (๐, ๐ฏ). Then (๐, ๐ฐ)
is a ๐2 -space.
Proof: Let (๐, ๐ฏ) be a ๐2 -space and let ๐: ๐ → ๐ be a homeomorphism. Let ๐ฅ, ๐ฆ ∈ ๐ with ๐ฅ ≠ ๐ฆ. Since
๐ is bijective, there are ๐ง, ๐ค ∈ ๐ with ๐ง ≠ ๐ค such that ๐(๐ง) = ๐ฅ and ๐(๐ค) = ๐ฆ. Since (๐, ๐ฏ) is a
๐2 -space, there are open sets ๐, ๐ ∈ ๐ฏ with ๐ง ∈ ๐, ๐ค ∈ ๐, and ๐ ∩ ๐ = ∅. Since ๐ is a
homeomorphism, ๐[๐], ๐[๐] ∈ ๐ฐ. We also have ๐ฅ = ๐(๐ง) ∈ ๐[๐] and ๐ฆ = ๐(๐ค) ∈ ๐[๐]. We show
that ๐[๐] ∩ ๐[๐] = ∅. If not, there is ๐ ∈ ๐[๐] ∩ ๐[๐]. So, there are ๐ ∈ ๐ and ๐ ∈ ๐ with ๐(๐) = ๐
and ๐(๐) = ๐. So, ๐(๐) = ๐(๐). Since ๐ is injective, ๐ = ๐. But then ๐ ∈ ๐ ∩ ๐, contradicting that
๐ ∩ ๐ = ∅. It follows that ๐[๐] ∩ ๐[๐] = ∅. Therefore, (๐, ๐ฐ) is a ๐2 -space.
□
The dedicated reader might want to show that each of the other separation axioms ($T_0$ through $T_4$) is a topological property and that metrizability is a topological property.
Problem Set 14
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Let $f: A \to B$ and let $\boldsymbol{X}$ be a nonempty collection of subsets of $B$. Prove the following:
(i) For any $V \in \boldsymbol{X}$, $f[f^{-1}[V]] \subseteq V$.
(ii) $f^{-1}[\bigcup \boldsymbol{X}] = \bigcup \{f^{-1}[V] \mid V \in \boldsymbol{X}\}$.
2. Let (๐, ๐) be a metric space. Prove that for all ๐ฅ ∈ ๐, ๐(๐ฅ, ๐ฅ) ≥ 0.
LEVEL 2
3. Prove that โฌ = {๐ ⊆ โ | โ โ ๐ is finite} generates a topology ๐ฏ on โ that is strictly coarser than
the standard topology. ๐ฏ is called the cofinite topology on โ.
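4. Let $K = \left\{\frac{1}{n} \mid n \in \mathbb{Z}^+\right\}$, $\mathcal{B} = \{(a, b) \mid a, b \in \mathbb{R} \wedge a < b\} \cup \{(a, b) \setminus K \mid a, b \in \mathbb{R} \wedge a < b\}$. Prove that $\mathcal{B}$ is a basis for a topology $\mathcal{T}_K$ on $\mathbb{R}$ that is strictly finer than the standard topology on $\mathbb{R}$.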
LEVEL 3
5. Let (๐พ, ๐ฏ) and (๐ฟ, ๐ฐ) be topological spaces with (๐พ, ๐ฏ) compact and let ๐: ๐พ → ๐ฟ be a
homeomorphism. Prove that (๐ฟ, ๐ฐ) is compact.
6. Let ๐ be a nonempty set and let โฌ be a collection of subsets of ๐. Prove that the set generated by
โฌ, {โ๐ฟ | ๐ฟ ⊆ โฌ}, is equal to {๐ด ⊆ ๐ | ∀๐ฅ ∈ ๐ด ∃๐ต ∈ โฌ(๐ฅ ∈ ๐ต ∧ ๐ต ⊆ ๐ด)}.
7. Define the functions ๐1 and ๐2 from โ × โ to โ by ๐1 (๐ง, ๐ค) = |Re ๐ง − Re ๐ค| + |Im ๐ง − Im ๐ค|
and ๐2 (๐ง, ๐ค) = max{|Re ๐ง − Re ๐ค|, |Im ๐ง − Im ๐ค|}. Prove that (โ, ๐1 ) and (โ, ๐2 ) are metric
spaces such that ๐1 and ๐2 induce the standard topology on โ.
8. Let (๐, ๐ฏ) be a topological space and let ๐ด ⊆ ๐. Prove that ๐ฏ๐ด = {๐ด ∩ ๐ | ๐ ∈ ๐ฏ} is a topology
on ๐ด. Then prove that if โฌ is a basis for ๐ฏ, then โฌ๐ด = {๐ด ∩ ๐ต| ๐ต ∈ โฌ} is a basis for ๐ฏ๐ด . ๐ฏ๐ด is
called the subspace topology on ๐ด.
LEVEL 4
9. Let โฌ ′ = {(๐, ๐) | ๐, ๐ ∈ โ ∧ ๐ < ๐}. Prove that โฌ′ is countable and that โฌ′ is a basis for a
topology on โ. Then show that the topology generated by โฌ′ is the standard topology on โ.
10. Let $(S, \mathcal{T})$ be a $T_2$-space and $A \subseteq S$. Prove that $(A, \mathcal{T}_A)$ is a $T_2$-space (see Problem 8 for the definition of $\mathcal{T}_A$). Determine if the analogous statement is true for $T_3$-spaces.
11. Let (๐1 , ๐ฏ1 ) and (๐2 , ๐ฏ2 ) be topological spaces. Let โฌ = {๐ × ๐ | ๐ ∈ ๐ฏ1 ∧ ๐ ∈ ๐ฏ2 }. Prove that โฌ
is a basis for a topology ๐ฏ on ๐1 × ๐2 , but in general, โฌ itself is not a topology on ๐1 × ๐2 . Then
prove that if โฌ1 is a basis for ๐ฏ1 and โฌ2 is a basis for ๐ฏ2 , then ๐ = {๐ × ๐ | ๐ ∈ โฌ1 ∧ ๐ ∈ โฌ2 } is
a basis for ๐ฏ. The topology ๐ฏ is called the product topology on ๐1 × ๐2.
LEVEL 5
12. Let (๐1 , ๐ฏ1 ) and (๐2 , ๐ฏ2 ) be ๐2 -spaces. Prove that ๐1 × ๐2 with the product topology (as defined
in Problem 11) is also a ๐2 -space. Determine if the analogous statement is true for ๐3 -spaces.
13. Let ๐๐ฟ be the set generated by the half open intervals of the form [๐, ๐) with ๐, ๐ ∈ โ. Show that
๐๐ฟ is a topology on โ that is strictly finer than the standard topology on โ and incomparable with
the topology ๐ฏ๐พ .
14. Prove that every metrizable space is ๐4 .
15. Consider the topological space (โ, ๐ฏ๐ฟ ). Prove that โ2 with the corresponding product topology
(as defined in Problem 11) is a ๐3 -space, but not a ๐4 -space.
16. Let (๐1 , ๐ฏ1 ) and (๐2 , ๐ฏ2 ) be metrizable spaces. Prove that ๐1 × ๐2 with the product topology is
metrizable. Use this to show that (โ, ๐ฏ๐ฟ ) is not metrizable.
LESSON 15 – COMPLEX ANALYSIS
COMPLEX VALUED FUNCTIONS
The Unit Circle
Recall from Lesson 7 that a circle in the Complex Plane is the set of all points that are at a fixed distance
(called the radius of the circle) from a fixed point (called the center of the circle).
The circumference of a circle is the distance around the circle.
If $C$ and $C'$ are the circumferences of two circles with radii $r$ and $r'$, respectively, then it turns out that $\frac{C}{2r} = \frac{C'}{2r'}$. In other words, the value of the ratio $\frac{\text{Circumference}}{2(\text{radius})}$ is independent of the circle that we use to form this ratio. We leave the proof of this fact for the interested reader to investigate themselves. We call the common value of this ratio $\pi$ (pronounced “pi”). So, we have $\frac{C}{2r} = \pi$, or equivalently,
$$C = 2\pi r.$$
Example 15.1: The unit circle is the circle with radius 1 and center (0, 0). The equation of this circle is $|z| = 1$. If we write $z$ in the standard form $z = x + yi$, we see that $|z| = \sqrt{x^2 + y^2}$, and so, the equation of the unit circle can also be written $x^2 + y^2 = 1$.
To the right is a picture of the unit circle in the Complex Plane.
[Figure: the unit circle $|z| = 1$ in the Complex Plane.]
The circumference of the unit circle is $2\pi \cdot 1 = 2\pi$.
An angle in standard position consists of two rays, both of which
have their initial point at the origin, and one of which is the
positive ๐ฅ-axis. We call the positive ๐ฅ-axis the initial ray and we
call the second ray the terminal ray. The radian measure of the
angle is the part of the circumference of the unit circle beginning
at the point (1, 0) on the positive ๐ฅ-axis and eventually ending at the point on the unit circle intercepted
by the second ray. If the motion is in the counterclockwise direction, the radian measure is positive and
if the motion is in the clockwise direction, the radian measure is negative.
Example 15.2: Let’s draw a few angles where the terminal ray lies along the line ๐ฆ = ๐ฅ.
Observe that in the leftmost picture, the arc intercepted by the angle has a length that is one-eighth of the circumference of the circle. Since the circumference of the unit circle is $2\pi$ and the motion is in the counterclockwise direction, the angle has a radian measure of $\frac{2\pi}{8} = \frac{\pi}{4}$.
Similarly, in the center picture, the arc intercepted by the angle has a length that is seven-eighths of the circumference of the circle. This time the motion is in the clockwise direction, and so, the radian measure of the angle is $-\frac{7}{8} \cdot 2\pi = -\frac{7\pi}{4}$.
In the rightmost picture, the angle consists of a complete rotation, tracing out the entire circumference of the circle, followed by tracing out an additional length that is one-eighth the circumference of the circle. Since the motion is in the counterclockwise direction, the radian measure of the angle is $2\pi + \frac{2\pi}{8} = \frac{8\pi}{4} + \frac{\pi}{4} = \frac{9\pi}{4}$.
Let’s find the point of intersection of the unit circle with the terminal ray of the angle $\frac{\pi}{4}$ that lies along the line with equation $y = x$ (as shown in the leftmost figure from Example 15.2 above). If we call this point $(a, b)$, then we have $b = a$ (because $(a, b)$ is on the line $y = x$) and $a^2 + b^2 = 1$ (because $(a, b)$ is on the unit circle). Replacing $b$ by $a$ in the second equation gives us $a^2 + a^2 = 1$, or equivalently, $2a^2 = 1$. So, $a^2 = \frac{1}{2}$. The two solutions to this equation are $a = \pm\sqrt{\frac{1}{2}} = \pm\frac{\sqrt{1}}{\sqrt{2}} = \pm\frac{1}{\sqrt{2}}$. From the picture, it should be clear that we are looking for the positive solution, so that $a = \frac{1}{\sqrt{2}}$. Since $b = a$, we also have $b = \frac{1}{\sqrt{2}}$. Therefore, the point of intersection is $\left(\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}}\right)$.
Notes: (1) The number $\frac{1}{\sqrt{2}}$ can also be written in the form $\frac{\sqrt{2}}{2}$. To see that these two numbers are equal, observe that we have
$$\frac{1}{\sqrt{2}} = \frac{1}{\sqrt{2}} \cdot 1 = \frac{1}{\sqrt{2}} \cdot \frac{\sqrt{2}}{\sqrt{2}} = \frac{1 \cdot \sqrt{2}}{\sqrt{2} \cdot \sqrt{2}} = \frac{\sqrt{2}}{2}.$$
(2) In the figure below on the left, we see a visual representation of the circle, the given angle, and the desired point of intersection.
[Figure, left: the unit circle, the angle $\frac{\pi}{4}$, and the point $\left(\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}}\right)$. Figure, right: the unit circle with the four labeled points $\left(\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}}\right)$, $\left(-\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}}\right)$, $\left(-\frac{1}{\sqrt{2}}, -\frac{1}{\sqrt{2}}\right)$, and $\left(\frac{1}{\sqrt{2}}, -\frac{1}{\sqrt{2}}\right)$.]
(3) In the figure above on the right, we have divided the Complex Plane into eight regions using the lines with equations $y = x$ and $y = -x$ (together with the $x$- and $y$-axes). We then used the symmetry of the circle to label the four points of intersection of the unit circle with each of these two lines.
If $\theta$ (pronounced “theta”) is the radian measure of an angle in standard position such that the terminal ray intersects the unit circle at the point $(x, y)$, then we will say that $W(\theta) = (x, y)$. This expression defines a function $W: \mathbb{R} \to \mathbb{R} \times \mathbb{R}$ called the wrapping function. Observe that the inputs of the wrapping function are real numbers, which we think of as the radian measure of angles in standard position. The outputs of the wrapping function are pairs of real numbers, which we think of as points in the Complex Plane. Also, observe that the range of the wrapping function is the unit circle.
We now define the cosine and sine of the angle $\theta$ by $\cos\theta = x$ and $\sin\theta = y$, where $W(\theta) = (x, y)$. For convenience, we also define the tangent of the angle by $\tan\theta = \frac{\sin\theta}{\cos\theta} = \frac{y}{x}$.
Notes: (1) The wrapping function is not one to one. For example, $W\left(\frac{\pi}{2}\right) = (0, 1)$ and $W\left(\frac{5\pi}{2}\right) = (0, 1)$. However, $\frac{\pi}{2} \neq \frac{5\pi}{2}$. There are actually infinitely many real numbers that map to $(0, 1)$ under the wrapping function. Specifically, $W\left(\frac{\pi}{2} + 2k\pi\right) = (0, 1)$ for every $k \in \mathbb{Z}$.
In general, each point on the unit circle is the image of infinitely many real numbers. Indeed, if $W(\theta) = (a, b)$, then $W(\theta + 2k\pi) = (a, b)$ for all $k \in \mathbb{Z}$.
(2) The wrapping function gives us a convenient way to associate an angle $\theta$ in standard position with the corresponding point $(x, y)$ on the unit circle. It is mostly used only as a notational convenience. We will usually be more interested in the expressions $\cos\theta = x$ and $\sin\theta = y$.
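The wrapping function can be explored numerically. The sketch below is illustrative rather than a definition: it borrows Python's built-in `math.cos` and `math.sin`, which presuppose the very functions that the text defines from the wrapping function, and the names `wrapping` and `close` are hypothetical. Comparisons use a tolerance because floating-point arithmetic is only approximate.

```python
import math


def wrapping(theta):
    """Return the point on the unit circle reached by the radian measure theta."""
    return (math.cos(theta), math.sin(theta))


def close(p, q, tol=1e-9):
    """Compare two points coordinate by coordinate, up to a small tolerance."""
    return all(abs(a - b) < tol for a, b in zip(p, q))


print(wrapping(math.pi / 4))
print(close(wrapping(math.pi / 2), (0.0, 1.0)))
print(close(wrapping(math.pi / 2), wrapping(5 * math.pi / 2)))
```

The last line illustrates that the wrapping function is not one to one: two different inputs land on the same point of the unit circle.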
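Example 15.3: Using the rightmost figure above, we can make the following computations:
$$\begin{aligned}
W\left(\tfrac{\pi}{4}\right) &= \left(\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}\right) & \cos\tfrac{\pi}{4} &= \tfrac{1}{\sqrt{2}} & \sin\tfrac{\pi}{4} &= \tfrac{1}{\sqrt{2}} \\
W\left(\tfrac{3\pi}{4}\right) &= \left(-\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}\right) & \cos\tfrac{3\pi}{4} &= -\tfrac{1}{\sqrt{2}} & \sin\tfrac{3\pi}{4} &= \tfrac{1}{\sqrt{2}} \\
W\left(\tfrac{5\pi}{4}\right) &= \left(-\tfrac{1}{\sqrt{2}}, -\tfrac{1}{\sqrt{2}}\right) & \cos\tfrac{5\pi}{4} &= -\tfrac{1}{\sqrt{2}} & \sin\tfrac{5\pi}{4} &= -\tfrac{1}{\sqrt{2}} \\
W\left(\tfrac{7\pi}{4}\right) &= \left(\tfrac{1}{\sqrt{2}}, -\tfrac{1}{\sqrt{2}}\right) & \cos\tfrac{7\pi}{4} &= \tfrac{1}{\sqrt{2}} & \sin\tfrac{7\pi}{4} &= -\tfrac{1}{\sqrt{2}}
\end{aligned}$$
It’s also easy to compute the cosine and sine of the four quadrantal angles $0$, $\frac{\pi}{2}$, $\pi$, and $\frac{3\pi}{2}$. Here we use the fact that the points $(1, 0)$, $(0, 1)$, $(-1, 0)$, and $(0, -1)$ lie on the unit circle.
$$\begin{aligned}
W(0) &= (1, 0) & \cos 0 &= 1 & \sin 0 &= 0 \\
W\left(\tfrac{\pi}{2}\right) &= (0, 1) & \cos\tfrac{\pi}{2} &= 0 & \sin\tfrac{\pi}{2} &= 1 \\
W(\pi) &= (-1, 0) & \cos\pi &= -1 & \sin\pi &= 0 \\
W\left(\tfrac{3\pi}{2}\right) &= (0, -1) & \cos\tfrac{3\pi}{2} &= 0 & \sin\tfrac{3\pi}{2} &= -1
\end{aligned}$$
Also, if we add any integer multiple of $2\pi$ to an angle, the cosine and sine of the new angle have the same values as the old angle. For example, $\cos\frac{9\pi}{4} = \cos\left(\frac{\pi}{4} + \frac{8\pi}{4}\right) = \cos\left(\frac{\pi}{4} + 2\pi\right) = \cos\frac{\pi}{4} = \frac{1}{\sqrt{2}}$. This is a direct consequence of the fact that $W(\theta + 2k\pi) = W(\theta)$ for all $k \in \mathbb{Z}$.
We can also compute the tangent of each angle by dividing the sine of the angle by the cosine of the angle. For example, we have
$$\tan\frac{\pi}{4} = \frac{\sin\frac{\pi}{4}}{\cos\frac{\pi}{4}} = \frac{\frac{1}{\sqrt{2}}}{\frac{1}{\sqrt{2}}} = 1.$$
Similarly, we have
$$\tan\frac{3\pi}{4} = -1 \qquad \tan\frac{5\pi}{4} = 1 \qquad \tan\frac{7\pi}{4} = -1 \qquad \tan 0 = 0 \qquad \tan\pi = 0$$
When $\theta = \frac{\pi}{2}$ or $\frac{3\pi}{2}$, $\tan\theta$ is undefined.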
Notes: (1) If $z = x + yi$ is any nonzero complex number, then the point $(x, y)$ lies on a circle of radius $r$ centered at the origin, where $r = |z| = \sqrt{x^2 + y^2}$. If $\theta$ is the radian measure of an angle in standard position such that the terminal ray intersects this circle at the point $(x, y)$, then it can be proved that the cosine and sine of the angle are equal to $\cos\theta = \frac{x}{r}$ and $\sin\theta = \frac{y}{r}$.
(2) It is standard to use the abbreviations $\cos^2\theta$ and $\sin^2\theta$ for $(\cos\theta)^2$ and $(\sin\theta)^2$, respectively. From the definition of cosine and sine, we have the following formula called the Pythagorean Identity:
$$\cos^2\theta + \sin^2\theta = 1$$
(3) Also, from the definition of cosine and sine, we have the following two formulas called the Negative Identities:
$$\cos(-\theta) = \cos\theta \qquad \sin(-\theta) = -\sin\theta.$$
Theorem 15.1: Let $\theta$ and $\phi$ be the radian measures of angles $A$ and $B$, respectively. Then we have
$$\cos(\theta + \phi) = \cos\theta\cos\phi - \sin\theta\sin\phi$$
$$\sin(\theta + \phi) = \sin\theta\cos\phi + \cos\theta\sin\phi.$$
Notes: (1) The two formulas appearing in Theorem 15.1 are called the Sum Identities. You will be asked
to prove Theorem 15.1 in Problem 14 below (parts (i) and (v)).
(2) Theorem 15.1 will be used to prove De Moivre’s Theorem (Theorem 15.2) below. De Moivre’s
Theorem provides a fast method for performing exponentiation of complex numbers.
(3) ๐ and ๐ are Greek letters pronounced “theta” and “phi,” respectively. These letters are often used
to represent angle measures. We may sometimes also use the capital versions of these letters, Θ and
Φ, especially when insisting that the radian measures of the given angles are between – ๐ and ๐.
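A numerical spot check is no substitute for the proof requested in Problem 14, but it can catch a sign error when the Sum Identities are being memorized. The sketch below is an assumption-laden illustration: the function name `sum_identities_hold` and the sample angles are hypothetical, and equality is tested only up to a floating-point tolerance.

```python
import math


def sum_identities_hold(theta, phi, tol=1e-12):
    """Check both Sum Identities at one pair of angles, up to rounding error."""
    cos_rhs = math.cos(theta) * math.cos(phi) - math.sin(theta) * math.sin(phi)
    sin_rhs = math.sin(theta) * math.cos(phi) + math.cos(theta) * math.sin(phi)
    return (math.isclose(math.cos(theta + phi), cos_rhs, abs_tol=tol)
            and math.isclose(math.sin(theta + phi), sin_rhs, abs_tol=tol))


samples = [0.0, math.pi / 4, math.pi / 2, -1.3, 2.5, 7.0]
print(all(sum_identities_hold(t, p) for t in samples for p in samples))
```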
Exponential Form of a Complex Number
The standard form (or rectangular form) of a complex number $z$ is $z = x + yi$, where $x$ and $y$ are real numbers. Recall from Lesson 7 that we can visualize the complex number $z = x + yi$ as the point $(x, y)$ in the Complex Plane.
If for $z \neq 0$, we let $r = |z| = |x + yi| = \sqrt{x^2 + y^2}$ and we let $\theta$ be the radian measure of an angle in standard position such that the terminal ray passes through the point $(x, y)$, then we see that $r$ and $\theta$ determine this point. So, we can also write this point as $(r, \theta)$.
[Figure: the point $(x, y)$ or $(r, \theta)$ in the Complex Plane, with horizontal distance $x$, vertical distance $y$, distance $r$ from the origin, and angle $\theta$.]
In Note 1 following Example 15.3, we saw that $\cos\theta = \frac{x}{r}$ and $\sin\theta = \frac{y}{r}$. By multiplying each side of the last two equations by $r$, we get $x = r\cos\theta$ and $y = r\sin\theta$. These equations allow us to rewrite the complex number $z = x + yi$ in the polar form $z = r\cos\theta + ri\sin\theta = r(\cos\theta + i\sin\theta)$.
If we also make the definition $e^{i\theta} = \cos\theta + i\sin\theta$, we can write the complex number $z = x + yi$ in the exponential form $z = re^{i\theta}$.
Recall from Lesson 7 that $r = |z|$ is called the absolute value or modulus of the complex number. We will call the angle $\theta$ an argument of the complex number and we may sometimes write $\theta = \arg z$.
Note that although $r = |z|$ and $\theta = \arg z$ uniquely determine a point $(r, \theta)$, there are infinitely many other values for $\arg z$ that represent the same point. Indeed, $(r, \theta + 2k\pi)$ represents the same point for each $k \in \mathbb{Z}$. However, there is a unique such value $\Theta$ for $\arg z$ such that $-\pi < \Theta \leq \pi$. We call this value $\Theta$ the principal argument of $z$, and we write $\Theta = \operatorname{Arg} z$.
Notes: (1) The definition $e^{i\theta} = \cos\theta + i\sin\theta$ is known as Euler’s formula.
(2) When written in exponential form, two complex numbers $z = re^{i\theta}$ and $w = se^{i\phi}$ are equal if and only if $r = s$ and $\theta = \phi + 2k\pi$ for some $k \in \mathbb{Z}$.
Example 15.4: Let’s convert the complex number $z = 1 + i$ to exponential form. To do this, we need to find $r$ and $\theta$. We have $r = |z| = \sqrt{1^2 + 1^2} = \sqrt{1 + 1} = \sqrt{2}$. Next, we have $\tan\theta = \frac{1}{1} = 1$. Since the point $(1, 1)$ lies in the first quadrant, it follows that $\theta = \frac{\pi}{4}$. So, in exponential form, we have $z = \sqrt{2}e^{\frac{\pi}{4}i}$.
Note: $\frac{\pi}{4}$ is the principal argument of $z = 1 + i$ because $-\pi < \frac{\pi}{4} \leq \pi$. When we write a complex number in exponential form, we will usually use the principal argument.
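The conversion in Example 15.4 can be cross-checked with Python's standard `cmath` module. This is a sketch, not part of the text's development: `cmath.polar` returns the modulus together with an angle in the range from −π to π, which plays the role of the principal argument here, and the wrapper name `exponential_form` is hypothetical.

```python
import cmath
import math


def exponential_form(z):
    """Return (r, theta) such that z is approximately r * e^(i * theta)."""
    return cmath.polar(z)


r, theta = exponential_form(1 + 1j)
print(r, math.sqrt(2))
print(theta, math.pi / 4)

# Rebuild z from its exponential form as a round-trip check.
print(r * cmath.exp(1j * theta))
```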
If $z \in \mathbb{C}$, we define $z^2$ to be the complex number $z \cdot z$. Similarly, $z^3 = z \cdot z \cdot z = z^2 \cdot z$. More generally, for $z \in \mathbb{C}$ and $n \in \mathbb{Z}$ we define $z^n$ as follows:
• For $n = 0$, $z^n = z^0 = 1$.
• For $n \in \mathbb{Z}^+$, $z^{n+1} = z^n \cdot z$.
• For $n \in \mathbb{Z}^-$, $z^n = (z^{-n})^{-1} = \frac{1}{z^{-n}}$.
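Due to the following theorem, it’s often easier to compute $z^n$ when $z$ is written in exponential form.
Theorem 15.2 (De Moivre’s Theorem): For all $n \in \mathbb{Z}$, $(e^{i\theta})^n = e^{i(n\theta)}$.
Proof: For $n = 0$, we have $(e^{i\theta})^0 = (\cos\theta + i\sin\theta)^0 = 1 = e^0 = e^{i(0\theta)}$.
We prove De Moivre’s Theorem for $n \in \mathbb{Z}^+$ by induction on $n$.
Base Case ($k = 1$): $(e^{i\theta})^1 = e^{i\theta} = e^{i(1\theta)}$.
Inductive Step: Assume that $k \geq 1$ and $(e^{i\theta})^k = e^{i(k\theta)}$. We then have
$$\begin{aligned}
(e^{i\theta})^{k+1} &= (\cos\theta + i\sin\theta)^{k+1} = (\cos\theta + i\sin\theta)^k(\cos\theta + i\sin\theta) = (e^{i\theta})^k(\cos\theta + i\sin\theta) \\
&= e^{i(k\theta)}(\cos\theta + i\sin\theta) = (\cos k\theta + i\sin k\theta)(\cos\theta + i\sin\theta) \\
&= [(\cos k\theta)(\cos\theta) - (\sin k\theta)(\sin\theta)] + [(\sin k\theta)(\cos\theta) + (\cos k\theta)(\sin\theta)]i \\
&= \cos((k + 1)\theta) + \sin((k + 1)\theta)i \quad \text{(by Theorem 15.1)} \\
&= e^{i((k+1)\theta)}.
\end{aligned}$$
By the Principle of Mathematical Induction, $(e^{i\theta})^n = e^{i(n\theta)}$ for all $n \in \mathbb{Z}^+$.
If $n < 0$, then
$$\begin{aligned}
(e^{i\theta})^n &= \frac{1}{(e^{i\theta})^{-n}} = \frac{1}{e^{i(-n\theta)}} = \frac{1}{\cos(-n\theta) + i\sin(-n\theta)} \\
&= \frac{1}{\cos(n\theta) - i\sin(n\theta)} \quad \text{(by the Negative Identities)} \\
&= \frac{1}{\cos(n\theta) - i\sin(n\theta)} \cdot \frac{\cos(n\theta) + i\sin(n\theta)}{\cos(n\theta) + i\sin(n\theta)} = \frac{\cos(n\theta) + i\sin(n\theta)}{\cos^2(n\theta) + \sin^2(n\theta)} \\
&= \cos(n\theta) + i\sin(n\theta) \quad \text{(by the Pythagorean Identity)} \\
&= e^{i(n\theta)}.
\end{aligned}$$
□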
Note: De Moivre’s Theorem generalizes to all $n \in \mathbb{C}$ with a small “twist.” In general, the expression $(e^{i\theta})^n$ may have multiple values, whereas $e^{i(n\theta)}$ takes on just one value. However, for all $n \in \mathbb{C}$, $(e^{i\theta})^n = e^{i(n\theta)}$ in the sense that $e^{i(n\theta)}$ is equal to one of the possible values of $(e^{i\theta})^n$.
As a very simple example, let $\theta = 0$ and $n = \frac{1}{2}$. Then $e^{i(n\theta)} = e^0 = 1$ and $(e^{i\theta})^n = 1^{\frac{1}{2}}$, which has two values: $1$ and $-1$ (because $1^2 = 1$ and $(-1)^2 = 1$). Observe that $e^{i(n\theta)}$ is equal to one of the two possible values of $(e^{i\theta})^n$.
We will not prove this more general result here.
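De Moivre’s Theorem and the note above can be checked numerically. The following is a minimal Python sketch and not part of the original text; the helper name `de_moivre_gap` and the sample angle 0.7 are illustrative assumptions. It compares both sides of the theorem for integer exponents and lists the two square roots of 1 mentioned in the note.

```python
import cmath

def de_moivre_gap(theta, n):
    """Return |(e^{i*theta})^n - e^{i*(n*theta)}| for an integer n."""
    lhs = cmath.exp(1j * theta) ** n      # left side: a power of e^{i*theta}
    rhs = cmath.exp(1j * n * theta)       # right side: e^{i*(n*theta)}
    return abs(lhs - rhs)

# Integer exponents (negative, zero, positive): the two sides agree up to rounding.
gaps = [de_moivre_gap(0.7, n) for n in range(-5, 6)]
max_gap = max(gaps)

# The "twist" for n = 1/2 and theta = 0: both 1 and -1 square to 1,
# while e^{i*(n*theta)} = e^0 = 1 is just one of those two values.
square_roots_of_one = [w for w in (1, -1) if w ** 2 == 1]
single_value = cmath.exp(1j * 0.5 * 0.0)
```

A floating-point comparison like this only illustrates the theorem for a few sample values; it is no substitute for the induction proof.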
Example 15.5: Let’s compute $(2 - 2i)^6$. If we let $z = 2 - 2i$, we have $\tan\theta = \frac{-2}{2} = -1$, so that $\theta = \frac{7\pi}{4}$ (Why?). Also, $r = |z| = \sqrt{2^2 + (-2)^2} = \sqrt{2^2(1 + 1)} = \sqrt{2^2 \cdot 2} = \sqrt{2^2} \cdot \sqrt{2} = 2\sqrt{2}$. So, in exponential form, $z = 2\sqrt{2}e^{\frac{7\pi}{4}i}$, and therefore,
$z^6 = (2\sqrt{2}e^{\frac{7\pi}{4}i})^6 = 2^6\sqrt{2}^6(e^{\frac{7\pi}{4}i})^6 = 64 \cdot 8e^{6(\frac{7\pi}{4})i} = 512e^{\frac{21\pi}{2}i} = 512e^{(\frac{\pi}{2} + 10\pi)i}$
$= 512e^{\frac{\pi}{2}i} = 512(\cos\frac{\pi}{2} + i\sin\frac{\pi}{2}) = 512(0 + i \cdot 1) = 512i$.
Recall that a square root of a complex number $z$ is a complex number $w$ such that $z = w^2$ (see Lesson 7). More generally, if $z \in \mathbb{C}$ and $n \in \mathbb{Z}^+$, we say that $w \in \mathbb{C}$ is an $n$th root of $z$ if $z = w^n$.
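Suppose that $z = re^{i\theta}$ and $w = se^{i\phi}$ are exponential forms of $z, w \in \mathbb{C}$ and that $w$ is an $n$th root of $z$. Let’s derive a formula for $w$ in terms of $r$ and $\theta$.
We have $w^n = s^n(e^{i\phi})^n = s^ne^{i(n\phi)}$. Since $z = w^n$, $re^{i\theta} = s^ne^{i(n\phi)}$. So, $s^n = r$ and $n\phi = \theta + 2k\pi$, where $k \in \mathbb{Z}$. Therefore, $s = \sqrt[n]{r}$ and $\phi = \frac{\theta + 2k\pi}{n} = \frac{\theta}{n} + \frac{2k\pi}{n}$ for $k \in \mathbb{Z}$. Thus, $w = \sqrt[n]{r}e^{i(\frac{\theta}{n} + \frac{2k\pi}{n})}$, $k \in \mathbb{Z}$.
If $k \ge n$, then
$\frac{\theta}{n} + \frac{2k\pi}{n} = \frac{\theta}{n} + \frac{2(n + k - n)\pi}{n} = \frac{\theta}{n} + \frac{2n\pi + 2(k - n)\pi}{n} = \frac{\theta}{n} + \frac{2n\pi}{n} + \frac{2(k - n)\pi}{n} = \frac{\theta}{n} + \frac{2(k - n)\pi}{n} + 2\pi$,
and therefore, $e^{i(\frac{\theta}{n} + \frac{2k\pi}{n})} = e^{i(\frac{\theta}{n} + \frac{2(k - n)\pi}{n})}$.
It follows that there are exactly $n$ distinct $n$th roots of $z$ given by $w = \sqrt[n]{r}e^{i(\frac{\theta}{n} + \frac{2k\pi}{n})}$, $k = 0, 1, \ldots, n - 1$.
The principal $n$th root of $z$, written $\sqrt[n]{z}$, is $\sqrt[n]{r}e^{i\frac{\theta}{n}}$, where $-\pi < \theta \le \pi$.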
Example 15.6: Let’s compute all the eighth roots of 1 (also called the 8th roots of unity). If $1 = w^8$, then $w = \sqrt[8]{1}e^{i(\frac{0}{8} + \frac{2k\pi}{8})} = e^{\frac{k\pi}{4}i}$ for $k = 0, 1, 2, 3, 4, 5, 6, 7$. Substituting each of these values for $k$ into the expression $e^{\frac{k\pi}{4}i}$ gives us the following 8 eighth roots of unity.
$1$, $\frac{1}{\sqrt{2}} + \frac{1}{\sqrt{2}}i$, $i$, $-\frac{1}{\sqrt{2}} + \frac{1}{\sqrt{2}}i$, $-1$, $-\frac{1}{\sqrt{2}} - \frac{1}{\sqrt{2}}i$, $-i$, $\frac{1}{\sqrt{2}} - \frac{1}{\sqrt{2}}i$
Note: Notice how the eight 8th roots of unity are uniformly distributed on the unit circle.
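The root formula $w = \sqrt[n]{r}e^{i(\frac{\theta}{n} + \frac{2k\pi}{n})}$, $k = 0, 1, \ldots, n - 1$, translates directly into a short computation. The sketch below is an illustration only; the function name `nth_roots` is an assumption, and it relies on Python's standard `cmath.polar`, which returns the modulus together with the principal argument.

```python
import cmath
import math

def nth_roots(z, n):
    """Return the n distinct nth roots of a nonzero complex number z."""
    r, theta = cmath.polar(z)        # z = r * e^{i*theta}, theta in (-pi, pi]
    s = r ** (1.0 / n)               # modulus of every root: the real nth root of r
    # One root for each k = 0, 1, ..., n - 1, with argument theta/n + 2*k*pi/n.
    return [cmath.rect(s, theta / n + 2 * k * math.pi / n) for k in range(n)]

eighth_roots = nth_roots(1, 8)       # the eight 8th roots of unity
fourth_roots_of_16 = nth_roots(16, 4)
```

The entry with $k = 0$ is the principal $n$th root, and consecutive entries differ in argument by $\frac{2\pi}{n}$, which is the uniform spacing on the circle noted above.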
Functions of a Complex Variable
We will be considering functions $f: A \to \mathbb{C}$, where $A \subseteq \mathbb{C}$. If $z \in A$, then $f(z) = w$ for some $w \in \mathbb{C}$. If we write both $z$ and $w$ in standard form, then we have $z = x + yi$ and $w = u + vi$ for some real numbers $x$, $y$, $u$, and $v$. Note that the values of $u$ and $v$ depend upon the values of $x$ and $y$. It follows that the complex function $f$ is equivalent to a pair of real functions $u, v: \mathbb{R}^2 \to \mathbb{R}$. That is, we have
$f(z) = f(x + yi) = u(x, y) + iv(x, y)$.
If we write $z$ in the exponential form $z = re^{i\theta}$, we have $f(z) = f(re^{i\theta}) = u(r, \theta) + iv(r, \theta)$.
Notes: (1) If $f: A \to \mathbb{C}$, $z = x + yi$ and $f(z) = u + vi$, then the function $f$ takes the point $(x, y)$ in the Complex Plane to the point $(u, v)$ in the Complex Plane.
Compare this to a real-valued function, where a point $x$ on the real line is taken to a point $y$ on the real line. The usual treatment here is to draw two real lines perpendicular to each other, label one of them the $x$-axis and the other the $y$-axis. This forms a plane and we can plot points $(x, f(x))$ in the usual way.
With complex-valued functions, we cannot visualize the situation in an analogous manner. The problem is that a visualization using this method would require us to plot points of the form $(x, y, u, v)$. So, we would need a four-dimensional version of the two-dimensional plane, but humans are capable of perceiving only three dimensions. Therefore, we will need to come up with other methods for visualizing complex-valued functions.
(2) One way to visualize a complex-valued function is to simply stay in the same plane and to analyze how a typical point moves or how a certain set is transformed. For example, let $f: \mathbb{C} \to \mathbb{C}$ be defined by $f(z) = z - 1$. Then the function $f$ takes the point $(x, y)$ to the point $(x - 1, y)$. That is, each point is shifted one unit to the left. Similarly, if $S \subseteq \mathbb{C}$, then each point of the set $S$ is shifted one unit to the left by the function $f$. Both these situations are demonstrated in the figure to the right.
This method may work well for very simple functions, but for more complicated functions, the method in Note 3 below will usually be preferable.
(3) A second way to visualize a complex-valued function is to draw two separate planes: an $xy$-plane and a $uv$-plane. We can then draw a point or a set in the $xy$-plane and its image under $f$ in the $uv$-plane. Let’s see how this works for the function $f$ defined by $f(z) = z - 1$ (the same function we used in Note 2).
Example 15.7:
1. Let $f(z) = z + i$.
If we write $z = x + yi$, then we have $f(x + yi) = x + yi + i = x + (y + 1)i$.
So, $u(x, y) = x$ and $v(x, y) = y + 1$.
Geometrically, $f$ is a translation. It takes any point $(x, y)$ in the Complex Plane and translates it up one unit. For example, the point $(1, 2)$ is translated to $(1, 3)$ under the function $f$ because $f(1 + 2i) = (1 + 2i) + i = 1 + 3i$. We can see this in the figure to the right.
Observe that any vertical line is mapped to itself under the function $f$. We can see this geometrically because given a vertical line in the Complex Plane, each point is just moved up one unit along that same vertical line. The vertical line in the figure on the right has equation $x = 1$. If we let $L$ be the set of points on the line $x = 1$, then we see that $f[L] = L$. In fact, the function $f$ maps $L$ bijectively onto $L$. It might be more precise to say that $f$ maps the vertical line $x = 1$ in the $xy$-plane to the vertical line $u = 1$ in the $uv$-plane.
If a subset $S$ of $\mathbb{C}$ satisfies $f[S] \subseteq S$, we will say that $S$ is invariant under the function $f$. If $f[S] = S$, then we will say that $S$ is surjectively invariant under $f$. So, in this example, we see that any vertical line $L$ is surjectively invariant under $f$.
A horizontal line, however, is not invariant under the function $f$. For example, the horizontal line $y = 1$ in the $xy$-plane is mapped bijectively to the horizontal line $v = 2$ in the $uv$-plane. We can visualize this mapping as follows:
[Figure: the horizontal line $y = 1$ in the $xy$-plane and the horizontal line $v = 2$ in the $uv$-plane.]
In fact, for any “shape” in the $xy$-plane, after applying the function $f$, we wind up with the same shape shifted up 1 unit in the $uv$-plane. We can even think of this function as shifting the whole plane up 1 unit. More specifically, the image of the $xy$-plane under $f$ is the entire $uv$-plane, where each point in the $xy$-plane is mapped to the point in the $uv$-plane that is shifted up 1 unit from the original point. So, $\mathbb{C}$ is surjectively invariant under $f$.
2. Let $g(z) = \bar{z}$.
If we write $z = x + yi$, then we have $g(x + yi) = x - yi$.
So, $u(x, y) = x$ and $v(x, y) = -y$.
Geometrically, $g$ is a reflection in the $x$-axis (or real axis). It takes any point $(x, y)$ in the Complex Plane and reflects it through the $x$-axis to the point $(x, -y)$. For example, the point $(1, 2)$ is reflected through the $x$-axis to the point $(1, -2)$ under the function $g$ because $g(1 + 2i) = 1 - 2i$. We can see this in the figure to the right.
Observe that the $x$-axis is invariant under $g$. To see this, note that any point on the $x$-axis has the form $(a, 0)$ for some $a \in \mathbb{R}$ and $g(a + 0i) = a - 0i = a = a + 0i$. Notice that $g$ actually maps each point on the $x$-axis to itself. Therefore, we call each point on the $x$-axis a fixed point of $g$.
It’s not hard to see that the subsets of $\mathbb{C}$ that are invariant under $g$ are precisely the subsets that are symmetric with respect to the $x$-axis. However, points above and below the $x$-axis are not fixed points of $g$, as they are reflected across the $x$-axis. The figure below should help to visualize this. Note that in this example, invariant is equivalent to surjectively invariant.
In the figure, the rectangle displayed is invariant under $g$. The fixed points of $g$ in the rectangle are the points on the $x$-axis. We see that points below the $x$-axis in the $xy$-plane are mapped to points above the $u$-axis in the $uv$-plane. A typical point below the $x$-axis and its image under $g$ above the $u$-axis are shown. Similarly, points above the $x$-axis in the $xy$-plane are mapped to points below the $u$-axis in the $uv$-plane.
3. Let $h(z) = iz$.
If we write $z = x + yi$, then we have $h(x + yi) = i(x + yi) = xi + yi^2 = xi - y = -y + xi$.
So, the function $h$ takes any point $(x, y)$ to the point $(-y, x)$. To understand what this means geometrically, it is useful to analyze what the image looks like in exponential form.
If we write $z = re^{i\theta}$, then we have $h(re^{i\theta}) = i(re^{i\theta}) = e^{i \cdot \frac{\pi}{2}}(re^{i\theta}) = re^{i \cdot \frac{\pi}{2}}e^{i\theta} = re^{i(\theta + \frac{\pi}{2})}$.
Notice that $r$ remains unchanged under this transformation. So, $h(z)$ is the same distance from the origin as $z$. However, the angle changes from $\theta$ to $\theta + \frac{\pi}{2}$. Geometrically, $h$ is a rotation about the origin by $\frac{\pi}{2}$ radians, or equivalently, 90°. As an example, the point $(1, 1)$ is rotated 90° about the origin to the point $(-1, 1)$ (see the figure to the right). We can see this in one of two ways. If we use the standard form of $1 + i$, then we have $h(1 + i) = -1 + i$. If we use exponential form, then by Example 15.4, $1 + i = \sqrt{2}e^{\frac{\pi}{4}i}$. So, $h(\sqrt{2}e^{\frac{\pi}{4}i}) = \sqrt{2}e^{(\frac{\pi}{4} + \frac{\pi}{2})i} = \sqrt{2}e^{\frac{3\pi}{4}i}$. Therefore, we have $u = \sqrt{2}\cos\frac{3\pi}{4} = \sqrt{2}(-\frac{1}{\sqrt{2}}) = -1$ and $v = \sqrt{2}\sin\frac{3\pi}{4} = \sqrt{2}(\frac{1}{\sqrt{2}}) = 1$. So, once again, $h(1 + i) = -1 + 1i = -1 + i$.
Observe that any circle centered at the origin is surjectively invariant under $h$ and the only fixed point of $h$ is the origin.
4. Let $k(z) = z^2$.
If we write $z = re^{i\theta}$, then we have $k(re^{i\theta}) = (re^{i\theta})^2 = r^2(e^{i\theta})^2 = r^2e^{i(2\theta)}$ by De Moivre’s Theorem.
Under this function, the modulus of the complex number $z$ is squared and the argument is doubled. As an example, let’s see what happens to the point $(1, 1)$ under this function. Changing to exponential form, by Example 15.4, we have $1 + i = \sqrt{2}e^{\frac{\pi}{4}i}$. So, $k(1 + i) = 2e^{\frac{\pi}{2}i}$. We see that the modulus of $k(1 + i)$ is $2$ and the argument of $k(1 + i)$ is $\frac{\pi}{2}$. So, in the Complex Plane, this is the point that is 2 units from the origin on the positive $y$-axis (because $W(\frac{\pi}{2}) = (0, 1)$ and $(0, 1)$ lies on the positive $y$-axis). In standard form, we have $k(1 + i) = 2i$.
The only fixed points of $k$ are $z = 0$ and $z = 1$. To see this, note that if $r^2e^{i(2\theta)} = re^{i\theta}$, then $r^2 = r$ and $2\theta = \theta + 2n\pi$ for some $n \in \mathbb{Z}$. The equation $r^2 = r$ is equivalent to $r^2 - r = 0$ or $r(r - 1) = 0$. So, $r = 0$ or $r = 1$. If $r = 0$, then $z = 0$. So, assume $r = 1$. We see that $2\theta = \theta + 2n\pi$ is equivalent to $\theta = 2n\pi$. So, $z = 1 \cdot e^{i(2n\pi)} = e^0 = 1$.
Observe that the unit circle is surjectively invariant under $k$. To see this, first note that if $z = re^{i\theta}$ lies on the unit circle, then $r = 1$ and $k(e^{i\theta}) = e^{i(2\theta)}$, which also has modulus 1. Furthermore, every point $z$ on the unit circle has the form $z = e^{i\theta}$ and $k(e^{i\frac{\theta}{2}}) = (e^{i\frac{\theta}{2}})^2 = e^{i\theta}$ by De Moivre’s Theorem.
What other subsets of $\mathbb{C}$ are surjectively invariant under $k$? Here are a few:
• The positive real axis: $\{z \in \mathbb{C} \mid \operatorname{Re} z > 0 \wedge \operatorname{Im} z = 0\}$
• The open unit disk: $\{z \in \mathbb{C} \mid |z| < 1\}$
• The complement of the open unit disk: $\{z \in \mathbb{C} \mid |z| \ge 1\}$
The dedicated reader should prove that these sets are surjectively invariant under $k$. Are there any other sets that are surjectively invariant under $k$? What about sets that are invariant, but not surjectively invariant?
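The four maps in Example 15.7 are simple enough to try out directly. The following Python sketch is an illustration only; the function names (`translate_up`, `reflect`, `rotate_quarter`, `square`) are assumptions chosen for readability and do not come from the text. It reproduces the sample points worked out in the example and spot-checks that squaring keeps points on the unit circle.

```python
import cmath
import math

def translate_up(z):
    """Part 1: z -> z + i, a shift up by one unit."""
    return z + 1j

def reflect(z):
    """Part 2: z -> conjugate of z, a reflection in the real axis."""
    return z.conjugate()

def rotate_quarter(z):
    """Part 3: z -> i*z, a rotation about the origin by pi/2 radians."""
    return 1j * z

def square(z):
    """Part 4: z -> z^2, squaring the modulus and doubling the argument."""
    return z * z

# Sample points taken from the example.
shifted = translate_up(1 + 2j)       # the point (1, 2) moves to (1, 3)
mirrored = reflect(1 + 2j)           # the point (1, 2) moves to (1, -2)
rotated = rotate_quarter(1 + 1j)     # the point (1, 1) moves to (-1, 1)
squared = square(1 + 1j)             # the point (1, 1) moves to (0, 2)

# Squaring sends sampled points of the unit circle back to the unit circle.
circle = [cmath.rect(1.0, 2 * math.pi * j / 12) for j in range(12)]
moduli_after_squaring = [abs(square(p)) for p in circle]
```

Checking finitely many points of the unit circle only illustrates invariance; the argument with De Moivre’s Theorem above is what establishes it.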
Limits and Continuity
Let $A \subseteq \mathbb{C}$, let $f: A \to \mathbb{C}$, let $L \in \mathbb{C}$, and let $a \in \mathbb{C}$ be a point such that $A$ contains some deleted neighborhood of $a$. We say that the limit of $f$ as $z$ approaches $a$ is $L$, written $\lim_{z \to a} f(z) = L$, if for every positive number $\epsilon$, there is a positive number $\delta$ such that $0 < |z - a| < \delta \to |f(z) - L| < \epsilon$.
Notes: (1) The statement of this definition of limit is essentially the same as the statement of the $\epsilon - \delta$ definition of a limit of a real-valued function (see Lesson 13). However, the geometry looks very different.
For a real-valued function, a deleted neighborhood of $a$ has the form $N_\epsilon^\odot(a) = (a - \epsilon, a) \cup (a, a + \epsilon)$ and we can visualize this neighborhood as follows:
[Figure: the real line with the open interval from $a - \epsilon$ to $a + \epsilon$ marked and the point $a$ removed.]
For a complex-valued function, a deleted neighborhood of $a$, say $N_\epsilon^\odot(a) = \{z \in \mathbb{C} \mid 0 < |z - a| < \epsilon\}$, is a punctured disk with center $a$. We can see a visualization of such a neighborhood to the right.
(2) In $\mathbb{R}$, there is a simple one to one correspondence between neighborhoods (open intervals) and (vertical or horizontal) strips. In $\mathbb{C}$ there is no such correspondence. Therefore, for complex-valued functions, we start right away with the $\epsilon - \delta$ definition.
(3) Recall that in $\mathbb{R}$, the expression $|x - a| < \delta$ is equivalent to $a - \delta < x < a + \delta$, or $x \in (a - \delta, a + \delta)$.
Also, the expression $0 < |x - a|$ is equivalent to $x - a \ne 0$, or $x \ne a$.
Therefore, $0 < |x - a| < \delta$ is equivalent to $x \in (a - \delta, a) \cup (a, a + \delta)$.
In $\mathbb{C}$, if we let $z = x + yi$ and $a = b + ci$, then
$|z - a| = |(x + yi) - (b + ci)| = |(x - b) + (y - c)i| = \sqrt{(x - b)^2 + (y - c)^2}$.
So, $|z - a| < \delta$ is equivalent to $(x - b)^2 + (y - c)^2 < \delta^2$. In other words, $(x, y)$ is inside the disk with center $(b, c)$ and radius $\delta$.
Also, we have
$0 < |z - a| \Leftrightarrow (x - b)^2 + (y - c)^2 \ne 0 \Leftrightarrow x - b \ne 0$ or $y - c \ne 0 \Leftrightarrow x \ne b$ or $y \ne c \Leftrightarrow z \ne a$.
Therefore, $0 < |z - a| < \delta$ is equivalent to “$z$ is in the punctured disk with center $a$ and radius $\delta$.”
(4) Similarly, in $\mathbb{R}$, we have that $|f(x) - L| < \epsilon$ is equivalent to $f(x) \in (L - \epsilon, L + \epsilon)$, while in $\mathbb{C}$, we have $|f(z) - L| < \epsilon$ is equivalent to “$f(z)$ is in the disk with center $L$ and radius $\epsilon$.”
(5) Just like for real-valued functions, we can think of determining if $\lim_{z \to a} f(z) = L$ as the result of an $\epsilon - \delta$ game. Player 1 “attacks” by choosing a positive number $\epsilon$. This is equivalent to Player 1 choosing the disk $N_\epsilon(L) = \{w \in \mathbb{C} \mid |w - L| < \epsilon\}$.
[Figure: the disk $N_\epsilon(L)$ with center $L$.]
Player 2 then tries to “defend” by finding a positive number $\delta$. This is equivalent to Player 2 choosing the punctured disk $N_\delta^\odot(a) = \{z \in \mathbb{C} \mid 0 < |z - a| < \delta\}$.
[Figure: the punctured disk $N_\delta^\odot(a)$ with center $a$ and the disk $N_\epsilon(L)$ with center $L$.]
The defense is successful if $z \in N_\delta^\odot(a)$ implies $f(z) \in N_\epsilon(L)$, or equivalently, $f[N_\delta^\odot(a)] \subseteq N_\epsilon(L)$.
[Figure: the punctured disk $N_\delta^\odot(a)$ with center $a$ and the disk $N_\epsilon(L)$ with center $L$.]
If Player 2 defends successfully, then Player 1 chooses a new positive number $\epsilon'$, or equivalently, a new neighborhood $N_{\epsilon'}(L) = \{w \in \mathbb{C} \mid |w - L| < \epsilon'\}$. If Player 1 is smart, then he or she will choose $\epsilon'$ to be less than $\epsilon$ (otherwise, Player 2 can use the same $\delta$). The smaller the value of $\epsilon'$, the smaller the neighborhood $N_{\epsilon'}(L)$, and the harder it will be for Player 2 to defend. Player 2 once again tries to choose a positive number $\delta'$ so that $f[N_{\delta'}^\odot(a)] \subseteq N_{\epsilon'}(L)$. This process continues indefinitely. Player 1 wins the $\epsilon - \delta$ game if at some stage, Player 2 cannot defend successfully. Player 2 wins the $\epsilon - \delta$ game if he or she defends successfully at every stage.
(6) If for a given $\epsilon > 0$, we have found a $\delta > 0$ such that $f[N_\delta^\odot(a)] \subseteq N_\epsilon(L)$, then any positive number smaller than $\delta$ works as well. Indeed, if $0 < \delta' < \delta$, then $N_{\delta'}^\odot(a) \subseteq N_\delta^\odot(a)$. It then follows that $f[N_{\delta'}^\odot(a)] \subseteq f[N_\delta^\odot(a)] \subseteq N_\epsilon(L)$.
Example 15.8: Let’s use the $\epsilon - \delta$ definition of limit to prove that $\lim_{z \to 3+6i}(\frac{iz}{3} + 2) = i$.
Analysis: Given $\epsilon > 0$, we will find $\delta > 0$ so that $0 < |z - (3 + 6i)| < \delta$ implies $|(\frac{iz}{3} + 2) - i| < \epsilon$. First note that
$|(\frac{iz}{3} + 2) - i| = |\frac{1}{3}(iz + 6) - \frac{1}{3}(3i)| = |\frac{1}{3}i(z - 6i - 3)| = |\frac{1}{3}i||z - 3 - 6i| = \frac{1}{3}|z - (3 + 6i)|$.
So, $|(\frac{iz}{3} + 2) - i| < \epsilon$ is equivalent to $|z - (3 + 6i)| < 3\epsilon$. Therefore, $\delta = 3\epsilon$ should work.
Proof: Let $\epsilon > 0$ and let $\delta = 3\epsilon$. Suppose that $0 < |z - (3 + 6i)| < \delta$. Then we have
$|(\frac{iz}{3} + 2) - i| = \frac{1}{3}|z - (3 + 6i)| < \frac{1}{3}\delta = \frac{1}{3}(3\epsilon) = \epsilon$.
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ (0 < |z - (3 + 6i)| < \delta \to |(\frac{iz}{3} + 2) - i| < \epsilon)$.
Therefore, $\lim_{z \to 3+6i}(\frac{iz}{3} + 2) = i$. □
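The choice $\delta = 3\epsilon$ from Example 15.8 can be spot-checked by sampling points of the punctured disk. This is a rough Python sketch under stated assumptions: the helper name `delta_defends`, the number of trials, and the fixed random seed are all illustrative, and random sampling can only fail to find a counterexample; it does not replace the proof.

```python
import cmath
import math
import random

def f(z):
    """The function from Example 15.8: f(z) = iz/3 + 2."""
    return 1j * z / 3 + 2

def delta_defends(eps, delta, a=3 + 6j, L=1j, trials=2000, seed=0):
    """Sample points z with 0 < |z - a| < delta and report whether
    |f(z) - L| < eps held for every sampled point."""
    rng = random.Random(seed)
    for _ in range(trials):
        # 1 - rng.random() lies in (0, 1], so the radius lies in (0, delta).
        radius = 0.999 * delta * (1.0 - rng.random())
        angle = 2 * math.pi * rng.random()
        z = a + cmath.rect(radius, angle)
        if not abs(f(z) - L) < eps:
            return False          # Player 2's defense failed at this z
    return True

good_choice = [delta_defends(eps, 3 * eps) for eps in (1.0, 0.1, 0.001)]
too_large = delta_defends(0.1, 0.4)   # delta = 4*eps is more than the analysis allows
```

With $\delta = 4\epsilon$ some sampled points land more than $3\epsilon$ from $3 + 6i$, and for those points $|f(z) - i| = \frac{1}{3}|z - (3 + 6i)|$ exceeds $\epsilon$.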
Example 15.9: Let’s use the $\epsilon - \delta$ definition of limit to prove that $\lim_{z \to i} z^2 = -1$.
Analysis: Given $\epsilon > 0$, we need to find $\delta > 0$ so that $0 < |z - i| < \delta$ implies $|z^2 - (-1)| < \epsilon$. First note that $|z^2 - (-1)| = |z^2 + 1| = |(z - i)(z + i)| = |z - i||z + i|$. Therefore, $|z^2 - (-1)| < \epsilon$ is equivalent to $|z - i||z + i| < \epsilon$.
As in Example 13.9 from Lesson 13, $|z - i|$ is not an issue because we’re going to be choosing $\delta$ so that this expression is small enough. But to make the argument work we also need to keep $|z + i|$ from getting too large. Remember from Note 6 above that if we find a value for $\delta$ that works, then any smaller positive number will work too. This allows us to start by assuming that $\delta$ is smaller than any positive number we choose. So, let’s just assume that $\delta \le 1$ and see what effect that has on $|z + i|$.
Well, if $\delta \le 1$ and $0 < |z - i| < \delta$, then $|z + i| = |(z - i) + 2i| \le |z - i| + |2i| < 1 + 2 = 3$. Here we used the Standard Advanced Calculus Trick (SACT) from Note 7 after Example 4.5 in Lesson 4, followed by the Triangle Inequality (Theorem 7.3), and then the computation $|2i| = |2||i| = 2 \cdot 1 = 2$.
So, if we assume that $\delta \le 1$, then $|z^2 - (-1)| = |z - i||z + i| < \delta \cdot 3 = 3\delta$. Therefore, if we want to make sure that $|z^2 - (-1)| < \epsilon$, then it suffices to choose $\delta$ so that $3\delta \le \epsilon$, as long as we also have $\delta \le 1$. So, we will let $\delta = \min\{1, \frac{\epsilon}{3}\}$.
Proof: Let $\epsilon > 0$ and let $\delta = \min\{1, \frac{\epsilon}{3}\}$. Suppose that $0 < |z - i| < \delta$. Then since $\delta \le 1$, we have $|z + i| = |(z - i) + 2i| \le |z - i| + |2i| = |z - i| + |2||i| = |z - i| + 2 < 1 + 2 = 3$, and therefore,
$|z^2 - (-1)| = |z^2 + 1| = |(z - i)(z + i)| = |z - i||z + i| < \delta \cdot 3 \le \frac{\epsilon}{3} \cdot 3 = \epsilon$.
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ (0 < |z - i| < \delta \to |z^2 - (-1)| < \epsilon)$. Therefore, $\lim_{z \to i} z^2 = -1$. □
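The two ingredients of Example 15.9, the bound $|z + i| < 3$ and the choice $\delta = \min\{1, \frac{\epsilon}{3}\}$, can be illustrated numerically. The sketch below is an assumption-laden illustration: the name `worst_case`, the sample count, and the factor 0.999 (used to stay strictly inside the disk) are not from the text. It walks around a circle of radius just under $\delta$ centered at $i$ and records the largest values of $|z^2 + 1|$ and $|z + i|$ it meets.

```python
import cmath
import math

def worst_case(eps, samples=720):
    """For delta = min(1, eps/3), sample a circle of radius 0.999*delta
    around i and return (delta, largest |z^2 + 1|, largest |z + i|)."""
    delta = min(1.0, eps / 3.0)
    radius = 0.999 * delta                    # strictly inside the punctured disk
    worst_error = 0.0
    worst_bound = 0.0
    for j in range(samples):
        z = 1j + cmath.rect(radius, 2 * math.pi * j / samples)
        worst_error = max(worst_error, abs(z * z + 1))   # |z^2 - (-1)|
        worst_bound = max(worst_bound, abs(z + 1j))      # the factor bounded by 3
    return delta, worst_error, worst_bound

results = {eps: worst_case(eps) for eps in (5.0, 1.0, 0.01)}
```

For large $\epsilon$ the cap $\delta \le 1$ is what matters, while for small $\epsilon$ the term $\frac{\epsilon}{3}$ takes over.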
Theorem 15.3: If $\lim_{z \to a} f(z)$ exists, then it is unique.
Proof: Suppose that $\lim_{z \to a} f(z) = L$ and $\lim_{z \to a} f(z) = K$. Let $\epsilon > 0$. Since $\lim_{z \to a} f(z) = L$, we can find $\delta_1 > 0$ such that $0 < |z - a| < \delta_1 \to |f(z) - L| < \frac{\epsilon}{2}$. Since $\lim_{z \to a} f(z) = K$, we can find $\delta_2 > 0$ such that $0 < |z - a| < \delta_2 \to |f(z) - K| < \frac{\epsilon}{2}$. Let $\delta = \min\{\delta_1, \delta_2\}$. Suppose that $0 < |z - a| < \delta$. Then
$|L - K| = |(f(z) - K) - (f(z) - L)|$ (SACT) $\le |f(z) - K| + |f(z) - L|$ (TI) $< \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon$.
Since $\epsilon$ was an arbitrary positive real number, by Problem 8 from Lesson 5, we have $|L - K| = 0$. So, $L - K = 0$, and therefore, $L = K$. □
Note: SACT stands for the Standard Advanced Calculus Trick and TI stands for the Triangle Inequality.
Example 15.10: Let’s show that $\lim_{z \to 0}(\frac{z}{\bar{z}})^2$ does not exist.
Proof: If we consider nonzero complex numbers of the form $x + 0i$, then $(\frac{z}{\bar{z}})^2 = (\frac{x + 0i}{x - 0i})^2 = (\frac{x}{x})^2 = 1^2 = 1$. Since every deleted neighborhood of 0 contains points of the form $x + 0i$, we see that if $\lim_{z \to 0}(\frac{z}{\bar{z}})^2$ exists, it must be equal to $1$.
Next, let’s consider nonzero complex numbers of the form $x + xi$. In this case, $(\frac{z}{\bar{z}})^2 = (\frac{x + xi}{x - xi})^2 = \frac{2x^2i}{-2x^2i} = -1$. Since every deleted neighborhood of 0 contains points of the form $x + xi$, we see that if $\lim_{z \to 0}(\frac{z}{\bar{z}})^2$ exists, it must be equal to $-1$.
By Theorem 15.3, the limit does not exist. □
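The two-path argument of Example 15.10 is easy to see numerically. In this small Python sketch the function name `q` and the particular sample values of $t$ are illustrative assumptions; it evaluates $(\frac{z}{\bar{z}})^2$ at points approaching 0 along the real axis and along the line $y = x$.

```python
def q(z):
    """The function from Example 15.10: (z / conjugate(z))^2, for z != 0."""
    return (z / z.conjugate()) ** 2

steps = (1e-1, 1e-3, 1e-6)

# Points of the form x + 0i approaching 0: every value is (essentially) 1.
along_real_axis = [q(complex(t, 0.0)) for t in steps]

# Points of the form x + xi approaching 0: every value is (essentially) -1.
along_diagonal = [q(complex(t, t)) for t in steps]
```

Since the values stay near $1$ on one path and near $-1$ on the other, no single number can serve as the limit, in line with Theorem 15.3.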
Define $d: \mathbb{C} \times \mathbb{C} \to \mathbb{R}$ by $d(z, w) = |z - w|$. By Example 14.10 (part 1), $(\mathbb{C}, d)$ is a metric space. So, by Theorem 14.4, we have the following definition of continuity for complex-valued functions:
Let $A \subseteq \mathbb{C}$, let $f: A \to \mathbb{C}$, and let $a \in A$ be a point such that $A$ contains some neighborhood of $a$. $f$ is continuous at $a$ if and only if for every positive number $\epsilon$, there is a positive number $\delta$ such that $|z - a| < \delta \to |f(z) - f(a)| < \epsilon$.
Example 15.11: Let $f: \mathbb{C} \to \mathbb{C}$ be defined by $f(z) = \frac{iz}{3} + 2$. In Example 15.8, we showed that $\lim_{z \to 3+6i} f(z) = i$. Since $f(3 + 6i) = \frac{i(3 + 6i)}{3} + 2 = \frac{3i - 6}{3} + 2 = \frac{3(i - 2)}{3} + 2 = i - 2 + 2 = i$, we see from the proof in Example 15.8 that if $|z - (3 + 6i)| < \delta$, then $|f(z) - f(3 + 6i)| = |(\frac{iz}{3} + 2) - i| < \epsilon$. It follows that $f$ is continuous at $3 + 6i$.
More generally, let’s show that for all $a \in \mathbb{C}$, $f$ is continuous at $a$.
Proof: Let $a \in \mathbb{C}$, let $\epsilon > 0$ and let $\delta = 3\epsilon$. Suppose that $|z - a| < \delta$. Then we have
$|f(z) - f(a)| = |(\frac{iz}{3} + 2) - (\frac{ia}{3} + 2)| = |\frac{i}{3}(z - a)| = |\frac{i}{3}||z - a| < \frac{1}{3}\delta = \frac{1}{3}(3\epsilon) = \epsilon$.
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ (|z - a| < \delta \to |f(z) - f(a)| < \epsilon)$.
Therefore, $f$ is continuous at $a$. □
Notes: (1) We proved $\forall a \in \mathbb{C}\ \forall \epsilon > 0\ \exists \delta > 0\ \forall z \in \mathbb{C}\ (|z - a| < \delta \to |f(z) - f(a)| < \epsilon)$. In words, we proved that for every complex number $a$, given a positive real number $\epsilon$, we can find a positive real number $\delta$ such that whenever the distance between $z$ and $a$ is less than $\delta$, the distance between $f(z)$ and $f(a)$ is less than $\epsilon$. And of course, a simpler way to say this is “for every complex number $a$, $f$ is continuous at $a$,” or “$\forall a \in \mathbb{C}$ ($f$ is continuous at $a$).”
(2) If we move the expression $\forall a \in \mathbb{C}$ next to $\forall z \in \mathbb{C}$, we get a concept that is stronger than continuity. We say that a function $f: A \to \mathbb{C}$ is uniformly continuous on $A$ if
$\forall \epsilon > 0\ \exists \delta > 0\ \forall a, z \in A\ (|z - a| < \delta \to |f(z) - f(a)| < \epsilon)$.
(3) As a quick example of uniform continuity, let’s prove that the function $f: \mathbb{C} \to \mathbb{C}$ defined by $f(z) = \frac{iz}{3} + 2$ is uniformly continuous on $\mathbb{C}$.
New proof: Let $\epsilon > 0$ and let $\delta = 3\epsilon$. Let $a, z \in \mathbb{C}$ and suppose that $|z - a| < \delta$. Then we have
$|f(z) - f(a)| = |(\frac{iz}{3} + 2) - (\frac{ia}{3} + 2)| = |\frac{i}{3}(z - a)| = |\frac{i}{3}||z - a| < \frac{1}{3}\delta = \frac{1}{3} \cdot 3\epsilon = \epsilon$.
Since $\epsilon > 0$ was arbitrary, we have $\forall \epsilon > 0\ \exists \delta > 0\ \forall a, z \in \mathbb{C}\ (|z - a| < \delta \to |f(z) - f(a)| < \epsilon)$.
Therefore, $f$ is uniformly continuous.
(4) The difference between continuity and uniform continuity on a set $A$ can be described as follows: In both cases, an $\epsilon$ is given and then a $\delta$ is chosen. For continuity, for each value of $a$, we can choose a different $\delta$. For uniform continuity, once we choose a $\delta$ for some value of $a$, we need to be able to use the same $\delta$ for every other value of $a$ in $A$.
In terms of disks, once a disk of radius $\epsilon$ is given, we need to be more careful how we choose our disk of radius $\delta$. As we check different $z$-values, we can translate our chosen disk as much as we like around the $xy$-plane. However, we are not allowed to decrease the radius of the disk.
The Riemann Sphere
We have used the symbols – ∞ and ∞ (or +∞) to describe unbounded intervals of real numbers, as
well as certain limits of real-valued functions. These symbols are used to express a notion of “infinity.”
If we pretend for a moment that we are standing on the real line at 0, and we begin walking to the
right, continuing indefinitely, then we might say we are walking toward ∞. Similarly, if we begin walking
to the left instead, continuing indefinitely, then we might say we are walking toward – ∞.
[Figure: the real line, with $-\infty$ to the left and $\infty$ to the right.]
We would like to come up with a coherent notion of infinity with respect to the Complex Plane. There is certainly more than one way to do this. A method that is most analogous to the picture described above would be to define a set of infinities $\{\infty_\theta \mid 0 \le \theta < 2\pi\}$, the idea being that for each angle $\theta$ in standard position, we have an infinity, $\infty_\theta$, describing where we would be headed if we were to start at the origin and then begin walking along the terminal ray of $\theta$, continuing indefinitely.
The method in the previous paragraph, although acceptable, has the disadvantage of having to deal
with uncountably many “infinities.” Instead, we will explore a different notion that involves just a single
point at infinity. The idea is relatively simple. Pretend you have a large sheet of paper balancing on the
palm of your hand. The sheet of paper represents the Complex Plane with the origin right at the center
of your palm. The palm of your hand itself represents the unit circle together with its interior.
Now, imagine using the pointer finger on your other hand to press down on the origin of that sheet of
paper (the Complex Plane), forcing your hand to form a unit sphere (reshaping the Complex Plane into
a unit sphere as well). Notice that the origin becomes the “south pole” of the sphere, while all the
“infinities” described in the last paragraph are forced together at the “north pole” of the sphere. Also,
notice that the unit circle stays fixed, the points interior to the unit circle form the lower half of the
sphere, and the points exterior to the unit circle form the upper half of the sphere with the exception
of the “north pole.”
When we visualize the unit sphere in this way, we refer to it as the Riemann Sphere.
Let’s let $\mathbb{S}^2$ be the Riemann Sphere and let’s officially define the north pole and south pole of $\mathbb{S}^2$ to be the points $N = (0, 0, 1)$ and $S = (0, 0, -1)$, respectively.
Also, since $\mathbb{S}^2$ is a subset of three-dimensional space (formally known as $\mathbb{R}^3$), while $\mathbb{C}$ is only two dimensional, let’s identify $\mathbb{C}$ with $\mathbb{C} \times \{0\}$ so that we write points in the Complex Plane as $(a, b, 0)$ instead of $(a, b)$. We can then visualize the Complex Plane as intersecting the Riemann Sphere in the unit circle. To the right we have a picture of the Riemann Sphere together with the Complex Plane.
[Figure: the Riemann Sphere $\mathbb{S}^2$ with north pole $N = (0, 0, 1)$ and south pole $S = (0, 0, -1)$, together with the Complex Plane $\mathbb{C}$.]
For each point $z$ in the Complex Plane, consider the line passing through the points $N$ and $z$. This line intersects $\mathbb{S}^2$ in exactly one point $P_z$ other than $N$. This observation allows us to define a bijection $f: \mathbb{C} \to \mathbb{S}^2 \setminus \{N\}$ by $f(z) = P_z$. An explicit definition of $f$ can be given by
$f(z) = \left(\frac{z + \bar{z}}{1 + |z|^2}, \frac{z - \bar{z}}{i(1 + |z|^2)}, \frac{|z|^2 - 1}{|z|^2 + 1}\right)$
Below is a picture of a point $z$ in the Complex Plane and its image $f(z) = P_z$ on the Riemann Sphere.
[Figure: the north pole $N$, a point $z$ in the Complex Plane, and its image $f(z) = P_z$ on the Riemann Sphere.]
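The explicit formula for $f$ can be tested against the geometric description. The Python sketch below is illustrative only: the function name `to_sphere` and the sample point $3 + 4i$ are assumptions. It computes $f(z)$ from the formula and compares it with the point $N + t(z - N)$ on the line through $N$ and $z$, where $t = \frac{2}{1 + |z|^2}$ is the parameter value (worked out here, not taken from the text) at which that line meets the sphere again.

```python
def to_sphere(z):
    """Map z in the Complex Plane to a point of the unit sphere using
    f(z) = ((z + zbar)/(1 + |z|^2), (z - zbar)/(i(1 + |z|^2)), (|z|^2 - 1)/(|z|^2 + 1))."""
    m = abs(z) ** 2
    zbar = z.conjugate()
    x1 = (z + zbar).real / (1 + m)            # z + zbar = 2 Re z is real
    x2 = ((z - zbar) / 1j).real / (1 + m)     # (z - zbar)/i = 2 Im z is real
    x3 = (m - 1) / (m + 1)
    return (x1, x2, x3)

south = to_sphere(0j)            # the origin goes to the south pole
fixed = to_sphere(1 + 0j)        # points of the unit circle stay where they are
p = to_sphere(3 + 4j)

# The image lies on the unit sphere ...
norm_squared = sum(c * c for c in p)

# ... and on the line through N = (0, 0, 1) and (3, 4, 0): N + t*((3, 4, 0) - N).
t = 2 / (1 + abs(3 + 4j) ** 2)
on_line = (t * 3, t * 4, 1 - t)

far_away = to_sphere(1e6 + 0j)   # points far from the origin land near N
```

The last line reflects the idea behind the single point at infinity: as $|z|$ grows, $f(z)$ approaches the north pole.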
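In Challenge Problem 21 below, you will be asked to verify that $f$ is a homeomorphism. If we let $\overline{\mathbb{C}} = \mathbb{C} \cup \{\infty\}$, then we can extend $f$ to a function $\bar{f}: \overline{\mathbb{C}} \to \mathbb{S}^2$ by defining $\bar{f}(\infty) = N$. $\overline{\mathbb{C}}$ is called the Extended Complex Plane. If we let $\mathcal{T}$ consist of all sets $U \subseteq \overline{\mathbb{C}}$ that are either open in $\mathbb{C}$ or have the form $U = V \cup \{\infty\}$, where $V$ is the complement of a closed and bounded set in $\mathbb{C}$, then $\mathcal{T}$ defines a topology on $\overline{\mathbb{C}}$, and $\bar{f}$ is a homeomorphism from $(\overline{\mathbb{C}}, \mathcal{T})$ to $(\mathbb{S}^2, \mathcal{U}_{\mathbb{S}^2})$, where $\mathcal{U}$ is the product topology on $\mathbb{R}^3$ with respect to the standard topology on $\mathbb{R}$.
Note: Subspace and product topologies were defined in Problems 8 and 11 in Lesson 14.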
If $\epsilon$ is a small positive number, then $\frac{1}{\epsilon}$ is a large positive number. We see that the set $N_{\frac{1}{\epsilon}} = \{z \mid |z| > \frac{1}{\epsilon}\}$ is a neighborhood of $\infty$ in the following sense. Notice that $N_{\frac{1}{\epsilon}}$ consists of all points outside of the circle of radius $\frac{1}{\epsilon}$ centered at the origin. The image of this set under $f$ is a deleted neighborhood of $N$.
We can now extend our definition of limit to include various infinite cases. We will do one example here and you will look at others in Problem 18 below.
$\lim_{z \to a} f(z) = \infty$ if and only if $\forall \epsilon > 0\ \exists \delta > 0\ (0 < |z - a| < \delta \to |f(z)| > \frac{1}{\epsilon})$.
Theorem 15.4: $\lim_{z \to a} f(z) = \infty$ if and only if $\lim_{z \to a} \frac{1}{f(z)} = 0$.
Proof: Suppose $\lim_{z \to a} f(z) = \infty$ and let $\epsilon > 0$. There is $\delta > 0$ so that $0 < |z - a| < \delta \to |f(z)| > \frac{1}{\epsilon}$. But, $|f(z)| > \frac{1}{\epsilon}$ is equivalent to $|\frac{1}{f(z)} - 0| < \epsilon$. So, $\lim_{z \to a} \frac{1}{f(z)} = 0$.
Now, suppose $\lim_{z \to a} \frac{1}{f(z)} = 0$ and let $\epsilon > 0$. There is $\delta > 0$ so that $0 < |z - a| < \delta \to |\frac{1}{f(z)} - 0| < \epsilon$. But, $|\frac{1}{f(z)} - 0| < \epsilon$ is equivalent to $|f(z)| > \frac{1}{\epsilon}$. So, $\lim_{z \to a} f(z) = \infty$. □
Problem Set 15
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
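1. In Problems 11 and 12 below, you will be asked to show that $W(\frac{\pi}{3}) = (\frac{1}{2}, \frac{\sqrt{3}}{2})$ and $W(\frac{\pi}{6}) = (\frac{\sqrt{3}}{2}, \frac{1}{2})$. Use this information to compute the sine, cosine, and tangent of each of the following angles:
(i) $\frac{\pi}{6}$
(ii) $\frac{\pi}{3}$
(iii) $\frac{2\pi}{3}$
(iv) $\frac{5\pi}{6}$
(v) $\frac{7\pi}{6}$
(vi) $\frac{4\pi}{3}$
(vii) $\frac{5\pi}{3}$
(viii) $\frac{11\pi}{6}$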
2. Use the sum identities (Theorem 15.1) to compute the cosine, sine, and tangent of each of the
following angles:
(i)
(ii)
5๐
12
๐
12
(iii)
11๐
(iv)
19๐
12
12
LEVEL 2
3. Each of the following complex numbers is written in exponential form. Rewrite each complex
number in standard form:
(i)
๐ ๐๐
(ii)
๐– 2 ๐
5๐
๐
(iii) 3๐ 4 ๐
๐
(iv) 2๐ 3 ๐
(v)
7๐
√2๐ 6
๐
5๐
(vi) ๐๐ – 4 ๐
19๐
(vii) ๐ 12
4. Each of the following complex numbers is written in standard form. Rewrite each complex
number in exponential form:
(i)
–1− ๐
(ii)
√3 + ๐
(iii) 1 − √3๐
√6 + √2
√6 − √2
) + ( 4 )๐
4
(iv) (
5. Write the following complex numbers in standard form:
√2
√2
4
(i)
( 2 + 2 ๐)
(ii)
(1 + √3๐)
5
LEVEL 3
6. Use De Moivre’s Theorem to prove the following identities:
(i)
cos 2๐ = cos 2 ๐ − sin2 ๐
(ii)
sin 2๐ = 2 sin ๐ cos ๐
(iii) cos 3๐ = cos 3 ๐ − 3 cos ๐ sin2 ๐
7. Suppose that ๐ง = ๐๐ ๐๐ and ๐ค = ๐ ๐ ๐๐ are complex numbers written in exponential form. Express
each of the following in exponential form. Provide a proof in each case:
(i)
๐ง๐ค
(ii)
๐ง
๐ค
8. Write each function in the form ๐(๐ง) = ๐ข(๐ฅ, ๐ฆ) + ๐๐ฃ(๐ฅ, ๐ฆ) and ๐(๐ง) = ๐ข(๐, ๐) + ๐๐ฃ(๐, ๐):
(i)
๐(๐ง) = 2๐ง 2 − 5
(ii)
๐(๐ง) = ๐ง
1
(iii) ๐(๐ง) = ๐ง 3 + ๐ง 2 + ๐ง + 1
9. Let ๐(๐ง) = ๐ฅ 2 − ๐ฆ 2 − 2๐ฅ + 2๐ฆ(๐ฅ + 1)๐. Rewrite ๐(๐ง) in terms of ๐ง.
10. Find all complex numbers that satisfy the given equation:
(i)
๐ง6 − 1 = 0
(ii)
๐ง4 + 4 = 0
LEVEL 4
11. Consider triangle ๐ด๐๐, where ๐ = (0, 0), ๐ด = (1, 0), and ๐ is the point on the unit circle so that
๐
angle ๐๐๐ด has radian measure 3 . Prove that triangle ๐ด๐๐ is equilateral, and then use this to prove
๐
1 √3
that ๐ ( 3 ) = (2 , 2 ). You may use the following facts about triangles: (i) The interior angle
measures of a triangle sum to ๐ radians; (ii) Two sides of a triangle have the same length if and
only if the interior angles of the triangle opposite these sides have the same measure; (iii) If two
sides of a triangle have the same length, then the line segment beginning at the point of
intersection of those two sides and terminating on the opposite base midway between the
endpoints of that base is perpendicular to that base.
๐
√3 1
12. Prove that ๐ (6 ) = ( 2 , 2). You can use facts (i), (ii), and (iii) described in Problem 11.
13. Let ๐ and ๐ be the radian measure of angles ๐ด and ๐ต, respectively. Prove the following identity:
cos(๐ − ๐) = cos ๐ cos ๐ + sin ๐ sin ๐
14. Let ๐ and ๐ be the radian measure of angles ๐ด and ๐ต, respectively. Prove the following identities:
(i)
cos(๐ + ๐) = cos ๐ cos ๐ − sin ๐ sin ๐
(ii)
cos(๐ − ๐) = – cos ๐
๐
(iii) cos ( 2 − ๐) = sin ๐
๐
(iv) sin ( − ๐) = cos ๐
2
(v)
sin(๐ + ๐) = sin ๐ cos ๐ + cos ๐ sin ๐
(vi) sin(๐ − ๐) = – sin ๐
15. Let ๐ง, ๐ค ∈ โ. Prove that arg ๐ง๐ค = arg ๐ง + arg ๐ค in the sense that if two of the three terms in the
equation are specified, then there is a value for the third term so that the equation holds. Similarly,
๐ง
prove that arg ๐ค = arg ๐ง − arg ๐ค. Finally, provide examples to show that the corresponding
equations are false if we replace “arg” by “Arg.”
LEVEL 5
16. Define the function ๐: โ → โ by ๐(๐ง) = ๐ง 2 . Determine the images under ๐ of each of the
following sets:
(i)
๐ด = {๐ฅ + ๐ฆ๐ | ๐ฅ 2 − ๐ฆ 2 = 1}
(ii)
๐ต = {๐ฅ + ๐ฆ๐ |๐ฅ > 0 ∧ ๐ฆ > 0 ∧ ๐ฅ๐ฆ < 1}
(iii) ๐ถ = {๐ฅ + ๐ฆ๐ | ๐ฅ ≥ 0 ∧ ๐ฆ ≥ 0}
(iv) ๐ท = {๐ฅ + ๐ฆ๐ | ๐ฆ ≥ 0}
17. Let ๐ด ⊆ โ, let ๐: ๐ด → โ, let ๐ฟ = ๐ + ๐๐ ∈ โ, and let ๐ = ๐ + ๐๐ ∈ โ be a point such that ๐ด
contains some deleted neighborhood of ๐. Suppose that ๐(๐ฅ + ๐ฆ๐) = ๐ข(๐ฅ, ๐ฆ) + ๐๐ฃ(๐ฅ, ๐ฆ). Prove
that lim ๐(๐ง) = ๐ฟ if and only if lim ๐ข(๐ฅ, ๐ฆ) = ๐ and lim ๐ฃ(๐ฅ, ๐ฆ) = ๐.
(๐ฅ,๐ฆ)→(๐,๐)
๐ง→๐
(๐ฅ,๐ฆ)→(๐,๐)
18. Give a reasonable definition for each of the following limits (like what was done right before Theorem 15.4). $L$ is a finite real number.
(i) $\lim_{z \to \infty} f(z) = L$
(ii) $\lim_{z \to \infty} f(z) = \infty$
19. Prove each of the following:
(i) $\lim_{z \to \infty} f(z) = L$ if and only if $\lim_{z \to 0} f(\frac{1}{z}) = L$
(ii) $\lim_{z \to \infty} f(z) = \infty$ if and only if $\lim_{z \to 0} \frac{1}{f(\frac{1}{z})} = 0$.
20. Let ๐, ๐: โ → โ be defined by ๐(๐ฅ) = cos ๐ฅ and ๐(๐ฅ) = sin ๐ฅ. Prove that ๐ and ๐ are uniformly
continuous on โ. Hint: Use the fact that the least distance between two points is a straight line.
CHALLENGE PROBLEM
21. Consider $\mathbb{C}$ with the standard topology and $\mathbb{S}^2$ with its subspace topology, where $\mathbb{S}^2$ is being considered as a subspace of $\mathbb{R}^3$. Let $f: \mathbb{C} \to \mathbb{S}^2 \setminus \{N\}$ be defined as follows:
$f(z) = \left(\frac{z + \bar{z}}{1 + |z|^2}, \frac{z - \bar{z}}{i(1 + |z|^2)}, \frac{|z|^2 - 1}{|z|^2 + 1}\right)$
Prove that $f$ is a homeomorphism.
LESSON 16 – LINEAR ALGEBRA
LINEAR TRANSFORMATIONS
Linear Transformations
Recall from Lesson 8 that a vector space over a field ๐ฝ is a set ๐ together with a binary operation + on
๐ (called addition) and an operation called scalar multiplication satisfying the following properties:
(1) (Closure under addition) For all $v, w \in V$, $v + w \in V$.
(2) (Associativity of addition) For all ๐ฃ, ๐ค, ๐ข ∈ ๐, (๐ฃ + ๐ค) + ๐ข = ๐ฃ + (๐ค + ๐ข).
(3) (Commutativity of addition) For all ๐ฃ, ๐ค ∈ ๐, ๐ฃ + ๐ค = ๐ค + ๐ฃ.
(4) (Additive identity) There exists an element 0 ∈ ๐ such that for all ๐ฃ ∈ ๐, 0 + ๐ฃ = ๐ฃ + 0 = ๐ฃ.
(5) (Additive inverse) For each ๐ฃ ∈ ๐, there is – ๐ฃ ∈ ๐ such that ๐ฃ + (– ๐ฃ) = (– ๐ฃ) + ๐ฃ = 0.
(6) (Closure under scalar multiplication) For all ๐ ∈ ๐ฝ and ๐ฃ ∈ ๐, ๐๐ฃ ∈ ๐.
(7) (Scalar multiplication identity) If 1 is the multiplicative identity of ๐ฝ and ๐ฃ ∈ ๐, then 1๐ฃ = ๐ฃ.
(8) (Associativity of scalar multiplication) For all ๐, ๐ ∈ ๐ฝ and ๐ฃ ∈ ๐, (๐๐)๐ฃ = ๐(๐๐ฃ).
(9) (Distributivity of 1 scalar over 2 vectors) For all ๐ ∈ ๐ฝ and ๐ฃ, ๐ค ∈ ๐, ๐(๐ฃ + ๐ค) = ๐๐ฃ + ๐๐ค.
(10) (Distributivity of 2 scalars over 1 vector) For all ๐, ๐ ∈ ๐ฝ and ๐ฃ ∈ ๐, (๐ + ๐)๐ฃ = ๐๐ฃ + ๐๐ฃ.
The simplest examples of vector spaces are โ๐ , โ๐ , and โ๐ , the vector spaces consisting of ๐-tuples of
rational numbers, real numbers, and complex numbers, respectively. As a specific example, we have
โ3 = {(๐ฅ, ๐ฆ, ๐ง) | ๐ฅ, ๐ฆ, ๐ง ∈ โ} with addition defined by (๐ฅ, ๐ฆ, ๐ง) + (๐ , ๐ก, ๐ข) = (๐ฅ + ๐ , ๐ฆ + ๐ก, ๐ง + ๐ข) and
scalar multiplication defined by ๐(๐ฅ, ๐ฆ, ๐ง) = (๐๐ฅ, ๐๐ฆ, ๐๐ง). Note that unless specified otherwise, we
would usually consider โ3 as a vector space over โ, so that the scalars ๐ are all real numbers.
Let $V$ and $W$ be vector spaces over a field $\mathbb{F}$, and let $T: V \to W$ be a function from $V$ to $W$.
We say that $T$ is additive if for all $u, v \in V$, $T(u + v) = T(u) + T(v)$.
We say that $T$ is homogeneous if for all $k \in \mathbb{F}$ and all $v \in V$, $T(kv) = kT(v)$.
$T$ is a linear transformation if it is additive and homogeneous.
Example 16.1:
1. Let $V = W = \mathbb{C}$ be vector spaces over $\mathbb{R}$ and define $T: \mathbb{C} \to \mathbb{C}$ by $T(z) = 5z$. We see that
$T(z + w) = 5(z + w) = 5z + 5w = T(z) + T(w)$. So, $T$ is additive. Furthermore, for $k \in \mathbb{R}$, we have
$T(kz) = 5(kz) = k(5z) = kT(z)$. So, $T$ is homogeneous. Therefore, $T$ is a linear
transformation.
More generally, for any vector space ๐ over โ and any ๐ ∈ โ, the function ๐: ๐ → ๐ defined
by ๐(๐ฃ) = ๐๐ฃ is a linear transformation. The verification is nearly identical to what we did in
the last paragraph. This type of linear transformation is called a dilation.
Note that if $m, b \in \mathbb{R}$ with $b \neq 0$, then the function $R: V \to V$ defined by $R(v) = mv + b$ is not
a linear transformation. To see this, observe that $R(2v) = m(2v) + b = 2mv + b$ and
$2R(v) = 2(mv + b) = 2mv + 2b$. If $R(2v) = 2R(v)$, then $2mv + b = 2mv + 2b$, or
equivalently, $b = 2b$. Subtracting $b$ from each side of this equation yields $b = 0$, contrary to
our assumption that $b \neq 0$. So, the linear functions that we learned about in high school are
usually not linear transformations. The only linear functions that are linear transformations are
the ones whose graphs pass through the origin (in other words, $b$ must be 0).
2. Let ๐ = โ4 and ๐ = โ3 be vector spaces over โ and define ๐: โ4 → โ3 by
๐((๐ฅ, ๐ฆ, ๐ง, ๐ค)) = (๐ฅ + ๐ง, 2๐ฅ − 3๐ฆ, 5๐ฆ − 2๐ค).
We have
๐((๐ฅ, ๐ฆ, ๐ง, ๐ค) + (๐ , ๐ก, ๐ข, ๐ฃ)) = ๐((๐ฅ + ๐ , ๐ฆ + ๐ก, ๐ง + ๐ข, ๐ค + ๐ฃ))
= ((๐ฅ + ๐ ) + (๐ง + ๐ข), 2(๐ฅ + ๐ ) − 3(๐ฆ + ๐ก), 5(๐ฆ + ๐ก) − 2(๐ค + ๐ฃ))
= ((๐ฅ + ๐ง) + (๐ + ๐ข), (2๐ฅ − 3๐ฆ) + (2๐ − 3๐ก), (5๐ฆ − 2๐ค) + (5๐ก − 2๐ฃ))
= (๐ฅ + ๐ง, 2๐ฅ − 3๐ฆ, 5๐ฆ − 2๐ค) + (๐ + ๐ข, 2๐ − 3๐ก, 5๐ก − 2๐ฃ)
= ๐((๐ฅ, ๐ฆ, ๐ง, ๐ค)) + ๐((๐ , ๐ก, ๐ข, ๐ฃ)).
So, ๐ is additive. Also, we have
๐(๐(๐ฅ, ๐ฆ, ๐ง, ๐ค)) = ๐((๐๐ฅ, ๐๐ฆ, ๐๐ง, ๐๐ค))
= (๐๐ฅ + ๐๐ง, 2(๐๐ฅ) − 3(๐๐ฆ), 5(๐๐ฆ) − 2(๐๐ค))
= (๐(๐ฅ + ๐ง), ๐(2๐ฅ − 3๐ฆ), ๐(5๐ฆ − 2๐ค))
= ๐(๐ฅ + ๐ง, 2๐ฅ − 3๐ฆ, 5๐ฆ − 2๐ค) = ๐๐((๐ฅ, ๐ฆ, ๐ง, ๐ค)).
So, $T$ is homogeneous. Therefore, $T$ is a linear transformation.
3. Let ๐ = โ2 and ๐ = โ be vector spaces over โ and define ๐: โ2 → โ by ๐((๐ฅ, ๐ฆ)) = ๐ฅ๐ฆ. Then
๐ is not a linear transformation. Indeed, consider (1, 0), (0, 1) ∈ โ2 . We have
๐((1,0) + (0, 1)) = ๐((1, 1)) = 1 ⋅ 1 = 1.
๐((1, 0)) + ๐((0, 1)) = 1 ⋅ 0 + 0 ⋅ 1 = 0 + 0 = 0.
So, ๐((1,0) + (0, 1)) ≠ ๐((1, 0)) + ๐((0, 1)). This shows that ๐ is not additive, and therefore,
๐ is not a linear transformation.
Observe that ๐ is also not homogeneous. To see this, consider (1, 1) ∈ โ2 and 2 ∈ โ. We have
๐(2(1, 1)) = ๐((2, 2)) = 2 ⋅ 2 = 4, but 2๐(1, 1) = 2(1 ⋅ 1) = 2 ⋅ 1 = 2.
In Problem 3 below, you will be asked to show that neither additivity nor homogeneity alone is enough
to guarantee that a function is a linear transformation.
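The checks carried out by hand in Example 16.1 can be spot-checked numerically. The following is a minimal sketch, not a proof: passing on finitely many samples does not establish linearity, although a single failure does disprove it. The helper names (`add`, `scale`, `looks_additive`, `looks_homogeneous`) and the name `P` for the product map of part 3 are illustrative choices, not notation from the text.

```python
from fractions import Fraction
from itertools import product

def T(v):
    """T((x, y, z, w)) = (x + z, 2x - 3y, 5y - 2w), as in part 2 of Example 16.1."""
    x, y, z, w = v
    return (x + z, 2 * x - 3 * y, 5 * y - 2 * w)

def P(v):
    """P((x, y)) = xy, as in part 3 of Example 16.1 (returned as a 1-tuple)."""
    x, y = v
    return (x * y,)

def add(u, v):
    """Componentwise vector addition."""
    return tuple(a + b for a, b in zip(u, v))

def scale(k, v):
    """Scalar multiplication of the vector v by k."""
    return tuple(k * a for a in v)

def looks_additive(f, samples):
    """Check f(u + v) == f(u) + f(v) on every pair of sample vectors."""
    return all(f(add(u, v)) == add(f(u), f(v))
               for u, v in product(samples, repeat=2))

def looks_homogeneous(f, samples, scalars):
    """Check f(k v) == k f(v) for every sample scalar and sample vector."""
    return all(f(scale(k, v)) == scale(k, f(v))
               for k in scalars for v in samples)

samples4 = [(1, 0, 0, 0), (0, 1, 0, 0), (2, -1, 3, 5), (Fraction(1, 2), 4, -7, 0)]
samples2 = [(1, 0), (0, 1), (1, 1)]
scalars = [0, 1, 2, -3, Fraction(5, 7)]

print(looks_additive(T, samples4), looks_homogeneous(T, samples4, scalars))
print(looks_additive(P, samples2), looks_homogeneous(P, samples2, scalars))
```

The samples for `P` include the same vectors used in the text, $(1, 0)$, $(0, 1)$, and $(1, 1)$, so the failures it reports are exactly the counterexamples given in part 3.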
Recall from Lesson 8 that if ๐ฃ, ๐ค ∈ ๐ and ๐, ๐ ∈ ๐ฝ, then ๐๐ฃ + ๐๐ค is called a linear combination of the
vectors ๐ฃ and ๐ค with weights ๐ and ๐. The next theorem says that a function is a linear transformation
if and only if it “behaves well” with respect to linear combinations.
Theorem 16.1: Let $V$ and $W$ be vector spaces over a field $\mathbb{F}$. A function $T: V \to W$ is a linear
transformation if and only if for all $v, w \in V$ and all $a, b \in \mathbb{F}$, $T(av + bw) = aT(v) + bT(w)$.
Proof: Suppose that $T: V \to W$ is a linear transformation, let $v, w \in V$, and let $a, b \in \mathbb{F}$. Since $T$ is
additive, $T(av + bw) = T(av) + T(bw)$. Since $T$ is homogeneous, $T(av) = aT(v)$ and
$T(bw) = bT(w)$. Therefore, $T(av + bw) = T(av) + T(bw) = aT(v) + bT(w)$, as desired.
Conversely, suppose that for all $v, w \in V$ and all $a, b \in \mathbb{F}$, $T(av + bw) = aT(v) + bT(w)$. Let $v, w \in V$ and let
$a = b = 1$. Then $T(v + w) = T(1v + 1w) = 1T(v) + 1T(w) = T(v) + T(w)$. Therefore, $T$ is
additive. Now, let $v \in V$ and $a \in \mathbb{F}$. Then $T(av) = T(av + 0v) = aT(v) + 0T(v) = aT(v)$.
Therefore, $T$ is homogeneous. It follows that $T$ is a linear transformation.
□
We can use induction to extend Theorem 16.1 to arbitrary linear combinations. If $v \in V$ can be written
as a linear combination of vectors $v_1, v_2, \ldots, v_n \in V$, then $T(v)$ is determined by $T(v_1), T(v_2), \ldots, T(v_n)$.
Specifically, if $v = c_1 v_1 + c_2 v_2 + \cdots + c_n v_n$, then we have
$T(v) = T(c_1 v_1 + c_2 v_2 + \cdots + c_n v_n) = c_1 T(v_1) + c_2 T(v_2) + \cdots + c_n T(v_n)$.
In particular, if $B = \{v_1, v_2, \ldots, v_n\}$ is a basis of $V$, then $T$ is completely determined by the values of
$T(v_1), T(v_2), \ldots, T(v_n)$.
Notes: (1) Recall from Lesson 8 that the vectors ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ ∈ ๐ are linearly independent if whenever
๐1 ๐ฃ1 + ๐2 ๐ฃ2 + โฏ + ๐๐ ๐ฃ๐ = 0, it follows that all the weights ๐1 , ๐2 , … , ๐๐ are 0.
(2) Also, recall that the set of all linear combinations of ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ ∈ ๐ is called the span of
๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ , written span{ ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ }.
(3) The set of vectors {๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ } is a basis of ๐ if ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ are linearly independent and
span{ ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ } = ๐.
In particular, if {๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ } is a basis of ๐, then every vector in ๐ can be written as a linear
combination of ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ .
So, if we know the values of ๐(๐ฃ1 ), ๐(๐ฃ2 ), … , ๐(๐ฃ๐ ), then we know the value of ๐(๐ฃ) for any ๐ฃ ∈ ๐, as
shown above.
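The following sketch illustrates how the values of a linear transformation on a basis determine it everywhere. It assumes the standard basis, so that the weights of a vector are simply its coordinates, and it reuses the images of the standard basis vectors of $\mathbb{R}^4$ under the transformation from part 2 of Example 16.1. The function name `extend_linearly` is a hypothetical helper, not notation from the text.

```python
from fractions import Fraction

def extend_linearly(images):
    """Return the map T with T(e_i) = images[i], where e_1, ..., e_n is the
    standard basis. T(v) is computed as c_1 T(e_1) + ... + c_n T(e_n),
    where c_1, ..., c_n are the coordinates of v."""
    m = len(images[0])

    def T(v):
        result = [Fraction(0)] * m
        for c, image in zip(v, images):
            for i in range(m):
                result[i] += c * image[i]
        return tuple(result)

    return T

# Images of (1,0,0,0), (0,1,0,0), (0,0,1,0), (0,0,0,1) under the map
# T((x, y, z, w)) = (x + z, 2x - 3y, 5y - 2w) from Example 16.1.
images = [(1, 2, 0), (0, -3, 5), (1, 0, 0), (0, 0, -2)]
T = extend_linearly(images)

# The extension agrees with the defining formula at (1, 2, 3, 4).
x, y, z, w = 1, 2, 3, 4
print(T((x, y, z, w)) == (x + z, 2 * x - 3 * y, 5 * y - 2 * w))
```

For a basis other than the standard one, the weights would first have to be found by solving a linear system; that step is omitted here.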
In other words, given a basis ๐ต of ๐, any function ๐: ๐ต → ๐ extends uniquely to a linear transformation
๐: ๐ → ๐.
Let ๐ and ๐ be vector spaces over a field ๐ฝ. We define โ(๐, ๐) to be the set of all linear
transformations from ๐ to ๐. Symbolically, โ(๐, ๐) = {๐: ๐ → ๐ | ๐ is a linear transformation}.
Theorem 16.2: Let ๐ and ๐ be vector spaces over a field ๐ฝ. Then โ(๐, ๐) is a vector space over ๐ฝ,
where addition and scalar multiplication are defined as follows:
๐ + ๐ ∈ โ(๐, ๐) is defined by (๐ + ๐)(๐ฃ) = ๐(๐ฃ) + ๐(๐ฃ) for ๐, ๐ ∈ โ(๐, ๐).
๐๐ ∈ โ(๐, ๐) is defined by (๐๐)(๐ฃ) = ๐๐(๐ฃ) for ๐ ∈ โ(๐, ๐) and ๐ ∈ ๐ฝ.
The reader will be asked to prove Theorem 16.2 in Problem 8 below.
If ๐, ๐, and ๐ are vector spaces over ๐ฝ, and ๐: ๐ → ๐, ๐: ๐ → ๐ are linear transformations, then the
composition ๐ โ ๐: ๐ → ๐ is a linear transformation, where ๐ โ ๐ is defined by (๐ โ ๐)(๐ฃ) = ๐(๐(๐ฃ))
for all ๐ฃ ∈ ๐. To see this, let ๐ฃ, ๐ค ∈ ๐ and ๐, ๐ ∈ ๐ฝ. Then we have
(๐ โ ๐)(๐๐ฃ + ๐๐ค) = ๐(๐(๐๐ฃ + ๐๐ค)) = ๐(๐๐(๐ฃ) + ๐๐(๐ค))
= ๐ (๐(๐(๐ฃ))) + ๐ (๐(๐(๐ค))) = ๐(๐ โ ๐)(๐ฃ) + ๐(๐ โ ๐)(๐ค).
Example 16.2: Let ๐: โ2 → โ3 be the linear transformation defined by ๐((๐ฅ, ๐ฆ)) = (๐ฅ, ๐ฅ + ๐ฆ, ๐ฆ) and
let ๐: โ3 → โ2 be the linear transformation defined by ๐((๐ฅ, ๐ฆ, ๐ง)) = (๐ง − ๐ฆ, ๐ฅ − ๐ง). Then
๐ โ ๐: โ2 → โ2 is a linear transformation and we have
(๐ โ ๐)((๐ฅ, ๐ฆ)) = ๐ (๐((๐ฅ, ๐ฆ))) = ๐((๐ฅ, ๐ฅ + ๐ฆ, ๐ฆ)) = (– ๐ฅ, ๐ฅ − ๐ฆ).
Notes: (1) In Example 16.2, the composition ๐ โ ๐: โ3 → โ3 is also a linear transformation and we have
(๐ โ ๐)((๐ฅ, ๐ฆ, ๐ง)) = ๐ (๐((๐ฅ, ๐ฆ, ๐ง))) = ๐((๐ง − ๐ฆ, ๐ฅ − ๐ง)) = (๐ง − ๐ฆ, ๐ฅ − ๐ฆ, ๐ฅ − ๐ง).
(2) In general, if ๐: ๐ → ๐, ๐: ๐ → ๐ are linear transformations, then ๐ โ ๐ is defined if and only if
๐ = ๐. So, just because ๐ โ ๐ is defined, it does not mean that ๐ โ ๐ is also defined. For example, if
๐: โ → โ2 and ๐: โ2 → โ3 , then ๐ โ ๐ is defined and ๐ โ ๐: โ → โ3 . However, ๐ โ ๐ is not defined.
The “outputs” of the linear transformation $S$ are ordered triples of real numbers, while the “inputs” of
the linear transformation $T$ are real numbers. They just don’t “match up.”
(3) If ๐ and ๐ are both linear transformations from a vector space ๐ to itself (that is ๐, ๐: ๐ → ๐), then
the compositions ๐ โ ๐ and ๐ โ ๐ will both also be linear transformations from ๐ to itself.
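The two compositions from Example 16.2 and Note 1 can be checked directly. This is a small sketch using the maps $T$ and $S$ exactly as defined in the example; the helper `compose` and the names `S_after_T` and `T_after_S` are illustrative.

```python
def T(v):
    """T((x, y)) = (x, x + y, y), mapping R^2 to R^3."""
    x, y = v
    return (x, x + y, y)

def S(v):
    """S((x, y, z)) = (z - y, x - z), mapping R^3 to R^2."""
    x, y, z = v
    return (z - y, x - z)

def compose(f, g):
    """Return the composition f o g, defined by (f o g)(v) = f(g(v))."""
    return lambda v: f(g(v))

S_after_T = compose(S, T)  # maps R^2 to R^2
T_after_S = compose(T, S)  # maps R^3 to R^3

x, y, z = 3, 5, 7
print(S_after_T((x, y)) == (-x, x - y))
print(T_after_S((x, y, z)) == (z - y, x - y, x - z))
```

Both comparisons use the closed formulas derived in the text, so the sketch only confirms those formulas at one sample point.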
By Note 3 above, in the vector space โ(๐, ๐), we can define a multiplication by ๐๐ = ๐ โ ๐. This
definition of multiplication gives โ(๐, ๐) a ring structure. In fact, with addition, scalar multiplication,
and composition as previously defined, โ(๐, ๐) is a structure called a linear algebra.
A linear algebra over a field ๐ฝ is a triple (๐ด, +, ⋅), where (๐ด, +) is a vector space over ๐ฝ, (๐ด, +, ⋅) is a
ring, and for all ๐ข, ๐ฃ ∈ ๐ด and ๐ ∈ ๐ฝ, ๐(๐ข๐ฃ) = (๐๐ข)๐ฃ = ๐ข(๐๐ฃ).
We will call the last property “compatibility of scalar and vector multiplication.”
Notes: (1) There are two multiplications defined in a linear algebra. As for a vector space, we have
scalar multiplication. We will refer to the ring multiplication as vector multiplication.
(2) Recall from Lesson 4 that a ring $(A, +, \cdot)$ satisfies the first 5 properties of a vector space listed above
(with $A$ in place of $V$) together with distributivity of multiplication over addition and the following
three additional properties of vector multiplication:
• (Closure) For all $u, v \in A$, $u \cdot v \in A$.
• (Associativity) For all $u, v, w \in A$, $(u \cdot v) \cdot w = u \cdot (v \cdot w)$.
• (Identity) There exists an element $1 \in A$ such that for all $v \in A$, $1 \cdot v = v \cdot 1 = v$.
Example 16.3:
1. (โ, +, ⋅) is a linear algebra over โ, where addition and multiplication are defined in the usual
way. In this example, scalar and vector multiplication are the same.
2. Similarly, (โ, +, ⋅) is a linear algebra over โ, where addition and multiplication are defined in
the usual way (see Lesson 7). Again, in this example, scalar and vector multiplication are the
same.
3. If ๐ is a vector space over a field ๐ฝ, then โ(๐, ๐) is a linear algebra over ๐ฝ, where addition and
scalar multiplication are defined as in Theorem 16.2, and vector multiplication is given by
composition of linear transformations. You will be asked to verify this in Problem 9 below.
Recall from Lesson 10 that a function ๐: ๐ด → ๐ต is injective if ๐, ๐ ∈ ๐ด and ๐ ≠ ๐ implies ๐(๐) ≠ ๐(๐).
Also, ๐ is surjective if for all ๐ ∈ ๐ต, there is ๐ ∈ ๐ด with ๐(๐) = ๐. A bijective function is one that is both
injective and surjective.
Also recall that a bijective function ๐ is invertible. The inverse of ๐ is then the function ๐ −1 : ๐ต → ๐ด
defined by ๐ −1 (๐) = “the unique ๐ ∈ ๐ด such that ๐(๐) = ๐.”
By Theorem 10.6 from Lesson 10, ๐ −1 โ ๐ = ๐๐ด and ๐ โ ๐ −1 = ๐๐ต , where ๐๐ด and ๐๐ต are the identity
functions on ๐ด and ๐ต, respectively. Furthermore, ๐ −1 is the only function that satisfies these two
equations. Indeed, if $h: B \to A$ also satisfies $h \circ f = i_A$ and $f \circ h = i_B$, then
$h = h \circ i_B = h \circ (f \circ f^{-1}) = (h \circ f) \circ f^{-1} = i_A \circ f^{-1} = f^{-1}$.
A bijection $T: V \to W$ that is also a linear transformation is called an isomorphism. If an isomorphism
$T: V \to W$ exists, we say that $V$ and $W$ are isomorphic. As is always the case with algebraic structures,
isomorphic vector spaces are essentially identical. The only difference between them is the “names”
of the elements. Isomorphisms were covered in more generality in Lesson 11.
If a bijective function happens to be a linear transformation between two vector spaces, it’s nice to
know that the inverse function is also a linear transformation. We prove this now.
Theorem 16.3: Let ๐: ๐ → ๐ be an invertible linear transformation. Then ๐ −1 : ๐ → ๐ is also a linear
transformation.
Proof: Let ๐: ๐ → ๐ be an invertible linear transformation, let ๐ข, ๐ฃ ∈ ๐, and let ๐, ๐ ∈ ๐ฝ. Then by the
linearity of ๐, we have
๐(๐๐ −1 (๐ข) + ๐๐ −1 (๐ฃ)) = ๐๐(๐ −1 (๐ข)) + ๐๐(๐ −1 (๐ฃ)) = ๐๐ข + ๐๐ฃ.
Since ๐ is injective, ๐๐ −1 (๐ข) + ๐๐ −1 (๐ฃ) is the unique element of ๐ whose image under ๐ is ๐๐ข + ๐๐ฃ.
By the definition of ๐ −1 , ๐ −1 (๐๐ข + ๐๐ฃ) = ๐๐ −1 (๐ข) + ๐๐ −1 (๐ฃ).
โก
Example 16.4:
1. Let $V = W = \mathbb{C}$ be vector spaces over $\mathbb{R}$ and define $T: \mathbb{C} \to \mathbb{C}$ by $T(z) = 5z$, as we did in part 1
of Example 16.1. If $z \neq w$, then $5z \neq 5w$, and so $T$ is injective. Also, if $w \in \mathbb{C}$, then we have
$T\left(\frac{1}{5}w\right) = 5\left(\frac{1}{5}w\right) = w$. So, $T$ is surjective. It follows that $T$ is invertible and that the inverse of
$T$ is defined by $T^{-1}(z) = \frac{1}{5}z$. By Theorem 16.3, $T^{-1}: \mathbb{C} \to \mathbb{C}$ is also a linear transformation. In
the terminology of Lesson 11, $T$ is an automorphism. In other words, $T$ is an isomorphism from
$\mathbb{C}$ to itself.
2. Let $V$ be a vector space over a field $\mathbb{F}$ with basis $\{v_1, v_2, v_3\}$. Then let $T: V \to \mathbb{F}^3$ be the unique
linear transformation such that $T(v_1) = (1, 0, 0)$, $T(v_2) = (0, 1, 0)$, and $T(v_3) = (0, 0, 1)$. In
other words, if $v \in V$, since $\{v_1, v_2, v_3\}$ is a basis of $V$, we can write $v = c_1 v_1 + c_2 v_2 + c_3 v_3$,
and $T$ is defined by $T(v) = c_1 T(v_1) + c_2 T(v_2) + c_3 T(v_3) = (c_1, c_2, c_3)$.
To see that $T$ is injective, suppose that $T(c_1 v_1 + c_2 v_2 + c_3 v_3) = T(d_1 v_1 + d_2 v_2 + d_3 v_3)$.
Then $(c_1, c_2, c_3) = (d_1, d_2, d_3)$. It follows that $c_1 = d_1$, $c_2 = d_2$, and $c_3 = d_3$. Therefore,
$c_1 v_1 + c_2 v_2 + c_3 v_3 = d_1 v_1 + d_2 v_2 + d_3 v_3$ and so, $T$ is injective.
Now, if $(a, b, c) \in \mathbb{F}^3$, then $T(a v_1 + b v_2 + c v_3) = (a, b, c)$ and so, $T$ is surjective. From this
computation, we also see that $T^{-1}: \mathbb{F}^3 \to V$ is defined by $T^{-1}((a, b, c)) = a v_1 + b v_2 + c v_3$.
It follows that $T: V \to \mathbb{F}^3$ is an isomorphism, so that $V$ is isomorphic to $\mathbb{F}^3$.
Essentially the same argument as above can be used to show that if $V$ is a vector space over a
field $\mathbb{F}$ with a basis consisting of $n$ vectors, then $V$ is isomorphic to $\mathbb{F}^n$.
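As an illustration of Theorem 16.3 and part 1 of Example 16.4, the sketch below checks on sample inputs that $T(z) = 5z$ and the candidate inverse $\frac{1}{5}z$ undo each other, and that the inverse respects linear combinations with real weights. Representing $a + bi$ as the pair `(a, b)` of exact fractions is an assumption made here to avoid rounding; the helper `lin_comb` is hypothetical.

```python
from fractions import Fraction

def T(z):
    """T(z) = 5z, with z = a + bi stored as the pair (a, b)."""
    a, b = z
    return (5 * a, 5 * b)

def T_inv(z):
    """Candidate inverse of T: T_inv(z) = (1/5) z."""
    a, b = z
    return (Fraction(a) / 5, Fraction(b) / 5)

def lin_comb(s, u, t, v):
    """Return s*u + t*v for real scalars s, t and pairs u, v."""
    return (s * u[0] + t * v[0], s * u[1] + t * v[1])

u = (Fraction(3), Fraction(-2))
v = (Fraction(1, 2), Fraction(7))
s, t = Fraction(4), Fraction(-1, 3)

# T_inv undoes T and T undoes T_inv on the sample point u.
print(T_inv(T(u)) == u and T(T_inv(u)) == u)
# T_inv(s u + t v) == s T_inv(u) + t T_inv(v), as Theorem 16.3 predicts.
print(T_inv(lin_comb(s, u, t, v)) == lin_comb(s, T_inv(u), t, T_inv(v)))
```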
Matrices
Recall from Lesson 8 that for $m, n \in \mathbb{Z}^+$, an $m \times n$ matrix over a field $\mathbb{F}$ is a rectangular array with $m$
rows and $n$ columns, and entries in $\mathbb{F}$. For example, the matrix
$$H = \begin{bmatrix} i & 2 - 5i & \frac{1}{5} \\ -1 & \sqrt{3} & 7 + i \end{bmatrix}$$
is a $2 \times 3$ matrix over $\mathbb{C}$. We will generally use a capital letter to represent a matrix, and the corresponding
lowercase letter with double subscripts to represent the entries of the matrix. We use the first subscript
for the row and the second subscript for the column. Using the matrix $H$ above as an example, we see
that $h_{11} = i$, $h_{12} = 2 - 5i$, $h_{13} = \frac{1}{5}$, $h_{21} = -1$, $h_{22} = \sqrt{3}$, and $h_{23} = 7 + i$.
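If $A$ is an $m \times n$ matrix, then we can visualize $A$ as follows:
$$A = \begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{bmatrix}$$
We let $M_{mn}^{\mathbb{F}}$ be the set of all $m \times n$ matrices over the field $\mathbb{F}$. Recall that we add two matrices
$A, B \in M_{mn}^{\mathbb{F}}$ to get $A + B \in M_{mn}^{\mathbb{F}}$ using the rule $(a + b)_{ij} = a_{ij} + b_{ij}$. We multiply a matrix $A \in M_{mn}^{\mathbb{F}}$
by a scalar $k \in \mathbb{F}$ using the rule $(ka)_{ij} = k a_{ij}$. We can visualize these computations as follows:
$$\begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{bmatrix} + \begin{bmatrix} b_{11} & \cdots & b_{1n} \\ \vdots & & \vdots \\ b_{m1} & \cdots & b_{mn} \end{bmatrix} = \begin{bmatrix} a_{11} + b_{11} & \cdots & a_{1n} + b_{1n} \\ \vdots & & \vdots \\ a_{m1} + b_{m1} & \cdots & a_{mn} + b_{mn} \end{bmatrix}$$
$$k \begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{bmatrix} = \begin{bmatrix} k a_{11} & \cdots & k a_{1n} \\ \vdots & & \vdots \\ k a_{m1} & \cdots & k a_{mn} \end{bmatrix}$$
With these operations of addition and scalar multiplication, $M_{mn}^{\mathbb{F}}$ is a vector space over $\mathbb{F}$.
We would now like to turn $M_{nn}^{\mathbb{F}}$ into a linear algebra over $\mathbb{F}$ by defining a vector multiplication in $M_{nn}^{\mathbb{F}}$.
Notice that we will not be turning all vector spaces $M_{mn}^{\mathbb{F}}$ into linear algebras. We will be able to do this
only when $m = n$. That is, the linear algebra will consist only of square matrices of a specific size.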
We first define the product of an $m \times n$ matrix with an $n \times p$ matrix, where $m, n, p$ are positive
integers. Notice that to take the product $AB$ we first insist that the number of columns of $A$ be equal
to the number of rows of $B$ (these are the “inner” two numbers in the expressions “$m \times n$” and
“$n \times p$”).
So, how do we actually multiply two matrices? This is a bit complicated and requires just a little practice.
Let’s begin by walking through an example while informally describing the procedure, so that we can
get a feel for how matrix multiplication works before getting caught up in the “messy looking”
definition.
Let $A = \begin{bmatrix} 0 & 1 \\ 3 & 2 \end{bmatrix}$ and $B = \begin{bmatrix} 1 & 2 & 0 \\ 0 & 3 & 6 \end{bmatrix}$. Notice that $A$ is a $2 \times 2$ matrix and $B$ is a $2 \times 3$ matrix. Since $A$
has 2 columns and $B$ has 2 rows, we will be able to multiply the two matrices.
For each row of the first matrix and each column of the second matrix, we add up the products entry
by entry. Let’s compute the product $AB$ as an example.
$$AB = \begin{bmatrix} 0 & 1 \\ 3 & 2 \end{bmatrix} \cdot \begin{bmatrix} 1 & 2 & 0 \\ 0 & 3 & 6 \end{bmatrix} = \begin{bmatrix} x & y & z \\ u & v & w \end{bmatrix}$$
Since $x$ is in the first row and first column, we use the first row of $A$ and the first column of $B$ to get
$x = \begin{bmatrix} 0 & 1 \end{bmatrix} \begin{bmatrix} 1 \\ 0 \end{bmatrix} = 0 \cdot 1 + 1 \cdot 0 = 0 + 0 = 0$.
Since $u$ is in the second row and first column, we use the second row of $A$ and the first column of $B$ to
get $u = \begin{bmatrix} 3 & 2 \end{bmatrix} \begin{bmatrix} 1 \\ 0 \end{bmatrix} = 3 \cdot 1 + 2 \cdot 0 = 3$.
The reader should attempt to follow this procedure to compute the values of the remaining entries.
The final product is
$$AB = \begin{bmatrix} 0 & 3 & 6 \\ 3 & 12 & 12 \end{bmatrix}$$
Notes: (1) The product of a $2 \times 2$ matrix and a $2 \times 3$ matrix is a $2 \times 3$ matrix.
(2) More generally, the product of an $m \times n$ matrix and an $n \times p$ matrix is an $m \times p$ matrix. Observe
that the innermost numbers (both $n$) must agree, and the resulting product has dimensions given by
the outermost numbers ($m$ and $p$).
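We formally define matrix multiplication as follows. Let $A$ be the $m \times n$ matrix
$$A = \begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{bmatrix}$$
and let $B$ be the $n \times p$ matrix
$$B = \begin{bmatrix} b_{11} & \cdots & b_{1p} \\ \vdots & & \vdots \\ b_{n1} & \cdots & b_{np} \end{bmatrix}.$$
We define the product $AB$ to be the $m \times p$ matrix
$$C = \begin{bmatrix} c_{11} & \cdots & c_{1p} \\ \vdots & & \vdots \\ c_{m1} & \cdots & c_{mp} \end{bmatrix}$$
such that
$$c_{ij} = a_{i1} b_{1j} + a_{i2} b_{2j} + \cdots + a_{in} b_{nj} = \sum_{k=1}^{n} a_{ik} b_{kj}.$$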
Notes: (1) The symbol $\Sigma$ is the Greek letter Sigma. In mathematics, this symbol is often used to denote
a sum. $\Sigma$ is generally used to abbreviate a very large sum or a sum of unknown length by specifying
what a typical term of the sum looks like. Let’s look at a simpler example first before we analyze the
more complicated one above:
$$\sum_{k=1}^{5} k^2 = 1^2 + 2^2 + 3^2 + 4^2 + 5^2 = 1 + 4 + 9 + 16 + 25 = 55.$$
The expression “$k = 1$” written underneath the symbol indicates that we get the first term of the sum
by replacing $k$ by 1 in the given expression. When we replace $k$ by 1 in the expression $k^2$, we get $1^2$.
For the second term, we simply increase $k$ by 1 to get $k = 2$. So, we replace $k$ by 2 to get $k^2 = 2^2$.
We continue in this fashion, increasing $k$ by 1 each time until we reach the number written above the
symbol. In this case, that is $k = 5$.
(2) Let’s now get back to the expression that we’re interested in.
$$c_{ij} = \sum_{k=1}^{n} a_{ik} b_{kj} = a_{i1} b_{1j} + a_{i2} b_{2j} + \cdots + a_{in} b_{nj}$$
Once again, the expression “$k = 1$” written underneath the symbol indicates that we get the first term
of the sum by replacing $k$ by 1 in the given expression. When we replace $k$ by 1 in the expression
$a_{ik} b_{kj}$, we get $a_{i1} b_{1j}$. Notice that this is the first term of $c_{ij}$.
For the second term, we simply increase $k$ by 1 to get $k = 2$. So, we replace $k$ by 2 to get $a_{i2} b_{2j}$.
We continue in this fashion, increasing $k$ by 1 each time until we reach the number written above the
symbol. In this case, that is $k = n$. So, the last term is $a_{in} b_{nj}$.
(3) In general, we get the entry $c_{ij}$ in the $i$th row and $j$th column of $C = AB$ by “multiplying” the $i$th
row of $A$ with the $j$th column of $B$. We can think of the computation like this:
$$\begin{bmatrix} a_{i1} & a_{i2} & \cdots & a_{in} \end{bmatrix} \begin{bmatrix} b_{1j} \\ b_{2j} \\ \vdots \\ b_{nj} \end{bmatrix} = a_{i1} b_{1j} + a_{i2} b_{2j} + \cdots + a_{in} b_{nj}$$
Notice how we multiply the leftmost entry $a_{i1}$ by the topmost entry $b_{1j}$. Then we move one step to
the right to $a_{i2}$ and one step down to $b_{2j}$ to form the next product, and so on.
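The formal definition translates almost word for word into code. The sketch below is one straightforward way to implement it, storing a matrix as a list of rows; the function name `mat_mul` is an illustrative choice. It also evaluates the sum $1^2 + 2^2 + \cdots + 5^2$ from Note 1.

```python
def mat_mul(A, B):
    """Return the product AB, where A is m x n and B is n x p.

    Entry (i, j) of the result is the sum over k of A[i][k] * B[k][j],
    which is the formula for c_ij (with indices starting at 0 instead of 1).
    """
    m, n, p = len(A), len(B), len(B[0])
    if any(len(row) != n for row in A):
        raise ValueError("columns of A must equal rows of B")
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(p)]
            for i in range(m)]

# The 2 x 2 and 2 x 3 matrices from the walk-through above.
A = [[0, 1], [3, 2]]
B = [[1, 2, 0], [0, 3, 6]]
print(mat_mul(A, B))  # [[0, 3, 6], [3, 12, 12]]

# The sum from Note 1: k^2 for k = 1, ..., 5.
print(sum(k ** 2 for k in range(1, 6)))  # 55
```

Calling `mat_mul(B, A)` raises `ValueError`, since $B$ has 3 columns while $A$ has only 2 rows.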
It is fairly straightforward to verify that with our definitions of addition, scalar multiplication, and matrix
multiplication, for each $n \in \mathbb{Z}^+$, $M_{nn}^{\mathbb{F}}$ is a linear algebra over $\mathbb{F}$. I leave this as an exercise for the reader.
Note that it is important that the number of rows and columns of our matrices are the same. Otherwise,
the matrix products will not be defined.
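Example 16.5:
1. $\begin{bmatrix} 1 & 2 & 3 & 4 \end{bmatrix} \cdot \begin{bmatrix} 5 \\ 1 \\ -2 \\ 3 \end{bmatrix} = [1 \cdot 5 + 2 \cdot 1 + 3(-2) + 4 \cdot 3] = [5 + 2 - 6 + 12] = [13]$.
We generally identify a $1 \times 1$ matrix with its only entry. So, $\begin{bmatrix} 1 & 2 & 3 & 4 \end{bmatrix} \cdot \begin{bmatrix} 5 \\ 1 \\ -2 \\ 3 \end{bmatrix} = 13$.
2. $\begin{bmatrix} 5 \\ 1 \\ -2 \\ 3 \end{bmatrix} \cdot \begin{bmatrix} 1 & 2 & 3 & 4 \end{bmatrix} = \begin{bmatrix} 5 & 10 & 15 & 20 \\ 1 & 2 & 3 & 4 \\ -2 & -4 & -6 & -8 \\ 3 & 6 & 9 & 12 \end{bmatrix}$.
Notice that $\begin{bmatrix} 1 & 2 & 3 & 4 \end{bmatrix} \cdot \begin{bmatrix} 5 \\ 1 \\ -2 \\ 3 \end{bmatrix} \neq \begin{bmatrix} 5 \\ 1 \\ -2 \\ 3 \end{bmatrix} \cdot \begin{bmatrix} 1 & 2 & 3 & 4 \end{bmatrix}$, and in fact, the two products do not even
have the same size. This shows that if $AB$ and $BA$ are both defined, then they do not need to
be equal.
3. $\begin{bmatrix} 1 & 2 \\ 0 & 1 \end{bmatrix} \cdot \begin{bmatrix} 0 & 2 \\ 3 & 2 \end{bmatrix} = \begin{bmatrix} 0 + 6 & 2 + 4 \\ 0 + 3 & 0 + 2 \end{bmatrix} = \begin{bmatrix} 6 & 6 \\ 3 & 2 \end{bmatrix}$.
$\begin{bmatrix} 0 & 2 \\ 3 & 2 \end{bmatrix} \cdot \begin{bmatrix} 1 & 2 \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} 0 + 0 & 0 + 2 \\ 3 + 0 & 6 + 2 \end{bmatrix} = \begin{bmatrix} 0 & 2 \\ 3 & 8 \end{bmatrix}$.
Notice that $\begin{bmatrix} 1 & 2 \\ 0 & 1 \end{bmatrix} \cdot \begin{bmatrix} 0 & 2 \\ 3 & 2 \end{bmatrix} \neq \begin{bmatrix} 0 & 2 \\ 3 & 2 \end{bmatrix} \cdot \begin{bmatrix} 1 & 2 \\ 0 & 1 \end{bmatrix}$.
This shows that even if $A$ and $B$ are square matrices of the same size, in general $AB \neq BA$. So,
matrix multiplication is not commutative. For $n \geq 2$, $M_{nn}^{\mathbb{F}}$ is a noncommutative linear algebra.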
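The products in Example 16.5 can be reproduced with a few lines of code. This is a sketch under the same list-of-rows representation of matrices used informally above; `mat_mul`, `row`, `col`, `P`, and `Q` are hypothetical names chosen for this illustration.

```python
def mat_mul(A, B):
    """Product of an m x n matrix A and an n x p matrix B (lists of rows)."""
    n = len(B)
    if any(len(r) != n for r in A):
        raise ValueError("columns of A must equal rows of B")
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(len(B[0]))]
            for i in range(len(A))]

row = [[1, 2, 3, 4]]          # a 1 x 4 matrix
col = [[5], [1], [-2], [3]]   # a 4 x 1 matrix

print(mat_mul(row, col))      # a 1 x 1 matrix
print(mat_mul(col, row))      # a 4 x 4 matrix

# The two square matrices from part 3 of the example.
P = [[1, 2], [0, 1]]
Q = [[0, 2], [3, 2]]
print(mat_mul(P, Q) == mat_mul(Q, P))  # False: the two products differ
```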
The Matrix of a Linear Transformation
Let $T \in \mathcal{L}(V, W)$ and let $B = \{v_1, v_2, \ldots, v_n\}$ and $C = \{w_1, w_2, \ldots, w_m\}$ be bases of $V$ and $W$,
respectively. Recall that $T$ is completely determined by the values of $T(v_1), T(v_2), \ldots, T(v_n)$.
Furthermore, since $T(v_1), T(v_2), \ldots, T(v_n) \in W$ and $C$ is a basis for $W$, each of $T(v_1), T(v_2), \ldots, T(v_n)$
can be written as a linear combination of the vectors in $C$. So, we have
$$\begin{aligned} T(v_1) &= a_{11} w_1 + a_{21} w_2 + \cdots + a_{m1} w_m \\ T(v_2) &= a_{12} w_1 + a_{22} w_2 + \cdots + a_{m2} w_m \\ &\ \ \vdots \\ T(v_j) &= a_{1j} w_1 + a_{2j} w_2 + \cdots + a_{mj} w_m \\ &\ \ \vdots \\ T(v_n) &= a_{1n} w_1 + a_{2n} w_2 + \cdots + a_{mn} w_m \end{aligned}$$
Here, we have $a_{ij} \in \mathbb{F}$ for each $i = 1, 2, \ldots, m$ and $j = 1, 2, \ldots, n$. We form the following matrix:
$$\mathcal{M}_T(B, C) = \begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{bmatrix}$$
$\mathcal{M}_T(B, C)$ is called the matrix of the linear transformation $T$ with respect to the bases $B$ and $C$.
Note: The coefficients in the expression $T(v_j) = a_{1j} w_1 + a_{2j} w_2 + \cdots + a_{mj} w_m$ become the $j$th
column of $\mathcal{M}_T(B, C)$. Your first instinct might be to form the row $[a_{1j} \ a_{2j} \ \cdots \ a_{mj}]$, but this is incorrect.
Pay careful attention to how we form $\mathcal{M}_T(B, C)$ in part 2 of Example 16.6 below to make sure that you
avoid this error.
Example 16.6:
1. Consider the linear transformation $T: \mathbb{C} \to \mathbb{C}$ from part 1 of Example 16.1. We are considering
$\mathbb{C}$ as a vector space over $\mathbb{R}$ and $T$ is defined by $T(z) = 5z$. Let’s use the standard basis for $\mathbb{C}$, so
that $B = C = \{1 + 0i, 0 + 1i\} = \{1, i\}$. We have
$T(1) = 5 = 5 \cdot 1 + 0 \cdot i$
$T(i) = 5i = 0 \cdot 1 + 5 \cdot i$
The matrix of $T$ with respect to the standard basis is $\mathcal{M}_T(\{1, i\}, \{1, i\}) = \begin{bmatrix} 5 & 0 \\ 0 & 5 \end{bmatrix}$.
In this case, since $T$ maps a vector space to itself and we are using the same
basis for both “copies” of $\mathbb{C}$, we can abbreviate $\mathcal{M}_T(\{1, i\}, \{1, i\})$ as $\mathcal{M}_T(\{1, i\})$. Furthermore,
since we are using the standard basis, we can abbreviate $\mathcal{M}_T(\{1, i\}, \{1, i\})$ even further as $\mathcal{M}_T$.
So, we can simply write $\mathcal{M}_T = \begin{bmatrix} 5 & 0 \\ 0 & 5 \end{bmatrix}$.
Now, let $z = a + bi \in \mathbb{C}$ and write $z$ as the column vector $z = \begin{bmatrix} a \\ b \end{bmatrix}$. We have
$$\mathcal{M}_T \cdot z = \begin{bmatrix} 5 & 0 \\ 0 & 5 \end{bmatrix} \begin{bmatrix} a \\ b \end{bmatrix} = \begin{bmatrix} 5a \\ 5b \end{bmatrix} = 5 \begin{bmatrix} a \\ b \end{bmatrix} = 5z = T(z).$$
So, multiplication on the left by $\mathcal{M}_T$ gives the same result as applying the transformation $T$.
2. Consider the linear transformation $T: \mathbb{R}^4 \to \mathbb{R}^3$ from part 2 of Example 16.1. We are considering
$\mathbb{R}^4$ and $\mathbb{R}^3$ as vector spaces over $\mathbb{R}$ and $T$ is defined by
$T((x, y, z, w)) = (x + z, 2x - 3y, 5y - 2w)$.
Let’s use the standard bases for $\mathbb{R}^4$ and $\mathbb{R}^3$, so that
$B = \{(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)\}$ and $C = \{(1, 0, 0), (0, 1, 0), (0, 0, 1)\}$.
We have
$T((1, 0, 0, 0)) = (1, 2, 0)$
$T((0, 1, 0, 0)) = (0, -3, 5)$
$T((0, 0, 1, 0)) = (1, 0, 0)$
$T((0, 0, 0, 1)) = (0, 0, -2)$
The matrix of $T$ with respect to the standard bases is
$$\mathcal{M}_T = \begin{bmatrix} 1 & 0 & 1 & 0 \\ 2 & -3 & 0 & 0 \\ 0 & 5 & 0 & -2 \end{bmatrix}$$
Once again, we abbreviate $\mathcal{M}_T(B, C)$ as $\mathcal{M}_T$ because we are using the standard bases.
Now, let $v = (x, y, z, w) \in \mathbb{R}^4$ and write $v$ as the column vector $v = \begin{bmatrix} x \\ y \\ z \\ w \end{bmatrix}$. We have
$$\mathcal{M}_T \cdot v = \begin{bmatrix} 1 & 0 & 1 & 0 \\ 2 & -3 & 0 & 0 \\ 0 & 5 & 0 & -2 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \\ w \end{bmatrix} = \begin{bmatrix} x + z \\ 2x - 3y \\ 5y - 2w \end{bmatrix} = T(v).$$
So, once again, multiplication on the left by $\mathcal{M}_T$ gives the same result as applying the
transformation $T$.
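The recipe "the image of the $j$th basis vector becomes the $j$th column" can be sketched in code for the standard bases. The helper names `standard_basis`, `matrix_of`, and `apply` are assumptions made for this illustration; the map `T` is the one from part 2 of Example 16.6.

```python
def T(v):
    """T((x, y, z, w)) = (x + z, 2x - 3y, 5y - 2w)."""
    x, y, z, w = v
    return (x + z, 2 * x - 3 * y, 5 * y - 2 * w)

def standard_basis(n):
    """Return the standard basis vectors e_1, ..., e_n as tuples."""
    return [tuple(1 if i == j else 0 for i in range(n)) for j in range(n)]

def matrix_of(T, n):
    """Matrix of T with respect to the standard bases (as a list of rows).

    T(e_j) is placed in column j, so entry (i, j) is the i-th coordinate
    of T(e_j).
    """
    columns = [T(e) for e in standard_basis(n)]
    m = len(columns[0])
    return [[columns[j][i] for j in range(n)] for i in range(m)]

def apply(M, v):
    """Multiply the matrix M by the column vector v."""
    return tuple(sum(a * x for a, x in zip(row, v)) for row in M)

M = matrix_of(T, 4)
print(M)  # [[1, 0, 1, 0], [2, -3, 0, 0], [0, 5, 0, -2]]
print(apply(M, (1, 2, 3, 4)) == T((1, 2, 3, 4)))  # True
```

Building the matrix from rows of coefficients instead of columns would produce the transpose, which is exactly the error the Note before Example 16.6 warns about.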
Let $V$ be a vector space over $\mathbb{F}$ with a finite basis. Then we say that $V$ is finite-dimensional. If
$B = \{v_1, v_2, \ldots, v_n\}$ is a basis of $V$, then by Problem 12 from Lesson 8, all bases of $V$ have $n$ elements. In this case, we
say that $V$ is $n$-dimensional, and we write $\dim V = n$.
Theorem 16.4: Let $V$ be an $n$-dimensional vector space over a field $\mathbb{F}$. Then there is a linear algebra
isomorphism $F: \mathcal{L}(V, V) \to M_{nn}^{\mathbb{F}}$.
You will be asked to prove Theorem 16.4 in Problem 15 below.
Images and Kernels
Let ๐: ๐ → ๐ be a linear transformation. The image (or range) of ๐ is the set ๐[๐] = {๐(๐ฃ) | ๐ฃ ∈ ๐}
and the kernel (or null space) of ๐ is the set ker(๐) = {๐ฃ ∈ ๐ | ๐(๐ฃ) = 0}.
Example 16.7: Let $T: \mathbb{R}^4 \to \mathbb{R}^3$ be defined by $T((x, y, z, w)) = (x + y, x - z, x + 2w)$. Let’s compute
$T[\mathbb{R}^4]$ and $\ker(T)$. First, $T[\mathbb{R}^4]$ consists of all vectors of the form
$(x + y, x - z, x + 2w) = (x + y)(1, 0, 0) + (x - z)(0, 1, 0) + (x + 2w)(0, 0, 1)$.
So, if $(v_1, v_2, v_3) \in \mathbb{R}^3$, let $x = 0$, $y = v_1$, $z = -v_2$, and $w = \frac{1}{2} v_3$. Then we see that
$(x + y)(1, 0, 0) + (x - z)(0, 1, 0) + (x + 2w)(0, 0, 1)$
$= v_1(1, 0, 0) + v_2(0, 1, 0) + v_3(0, 0, 1) = (v_1, v_2, v_3)$.
Therefore, $\mathbb{R}^3 \subseteq T[\mathbb{R}^4]$. Since it is clear that $T[\mathbb{R}^4] \subseteq \mathbb{R}^3$, we have $T[\mathbb{R}^4] = \mathbb{R}^3$.
Now, $(x, y, z, w) \in \ker(T)$ if and only if $(x + y, x - z, x + 2w) = (0, 0, 0)$ if and only if $x + y = 0$,
$x - z = 0$, and $x + 2w = 0$ if and only if $y = -x$, $z = x$, and $w = -\frac{x}{2}$ if and only if
$(x, y, z, w) = \left(x, -x, x, -\frac{x}{2}\right) = x\left(1, -1, 1, -\frac{1}{2}\right)$.
So, every element of $\ker(T)$ is a scalar multiple of $\left(1, -1, 1, -\frac{1}{2}\right)$. Thus, $\ker(T) \subseteq \text{span}\left\{\left(1, -1, 1, -\frac{1}{2}\right)\right\}$.
Conversely, an element of $\text{span}\left\{\left(1, -1, 1, -\frac{1}{2}\right)\right\}$ has the form $\left(v, -v, v, -\frac{1}{2}v\right)$, and we have
$T\left(\left(v, -v, v, -\frac{1}{2}v\right)\right) = \left(v - v, v - v, v + 2\left(-\frac{1}{2}v\right)\right) = (0, 0, 0)$. So, $\text{span}\left\{\left(1, -1, 1, -\frac{1}{2}\right)\right\} \subseteq \ker(T)$.
Therefore, $\ker(T) = \text{span}\left\{\left(1, -1, 1, -\frac{1}{2}\right)\right\}$.
Notice that $T[\mathbb{R}^4]$ is a subspace of $\mathbb{R}^3$ (in fact, $T[\mathbb{R}^4] = \mathbb{R}^3$) and $\ker(T)$ is a subspace of $\mathbb{R}^4$. Also, the
sum of the dimensions of $T[\mathbb{R}^4]$ and $\ker(T)$ is $3 + 1 = 4$, which is the dimension of $\mathbb{R}^4$. None of this is
a coincidence, as we will see in the next few theorems.
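The dimension count in Example 16.7 can be checked computationally. The sketch below relies on a standard fact that is not proved in this lesson: the dimension of the image of $T$ equals the number of pivots found when row reducing the matrix of $T$. The function name `rank` and the use of exact fractions are choices made for this illustration.

```python
from fractions import Fraction

def rank(M):
    """Number of pivots found by Gaussian elimination on the matrix M."""
    rows = [[Fraction(x) for x in row] for row in M]
    r = 0  # index of the next pivot row
    for c in range(len(rows[0])):
        # Look for a row at or below position r with a nonzero entry in column c.
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        # Clear the entries below the pivot.
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r

def T(v):
    """T((x, y, z, w)) = (x + y, x - z, x + 2w), as in Example 16.7."""
    x, y, z, w = v
    return (x + y, x - z, x + 2 * w)

# Matrix of T with respect to the standard bases (columns are T(e_j)).
M = [[1, 1, 0, 0],
     [1, 0, -1, 0],
     [1, 0, 0, 2]]

kernel_vector = (1, -1, 1, Fraction(-1, 2))
print(rank(M))           # dimension of the image
print(T(kernel_vector))  # the spanning vector of ker(T) is sent to zero
```

With rank 3 and a kernel spanned by a single nonzero vector, the dimensions add to 4, matching the count made at the end of the example.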
Theorem 16.5: Let ๐ and ๐ be vector spaces over a field ๐ฝ and let ๐: ๐ → ๐ be a linear transformation.
Then ๐[๐] ≤ ๐.
Proof: We have ๐(0) = ๐(0 + 0) = ๐(0) + ๐(0). Therefore, ๐(0) = 0. It follows that 0 ∈ ๐[๐].
Let ๐ค, ๐ก ∈ ๐[๐]. Then there are ๐ข, ๐ฃ ∈ ๐ with ๐(๐ข) = ๐ค and ๐(๐ฃ) = ๐ก. It then follows that
๐(๐ข + ๐ฃ) = ๐(๐ข) + ๐(๐ฃ) = ๐ค + ๐ก. So, ๐ค + ๐ก ∈ ๐[๐].
Let ๐ค ∈ ๐[๐] and ๐ ∈ ๐ฝ. Then there is ๐ข ∈ ๐ with ๐(๐ข) = ๐ค. We have ๐(๐๐ข) = ๐๐(๐ข) = ๐๐ค.
Therefore, ๐๐ค ∈ ๐[๐].
By Theorem 8.1 from Lesson 8, ๐[๐] ≤ ๐.
โก
Theorem 16.6: Let ๐ and ๐ be vector spaces over a field ๐ฝ and let ๐: ๐ → ๐ be a linear transformation.
Then ker(๐) ≤ ๐.
Proof: As in the proof of Theorem 16.5, we have ๐(0) = 0. So, 0 ∈ ker(๐).
Let ๐ข, ๐ฃ ∈ ker(๐). Then ๐(๐ข + ๐ฃ) = ๐(๐ข) + ๐(๐ฃ) = 0 + 0 = 0. So, ๐ข + ๐ฃ ∈ ker(๐).
Let ๐ข ∈ ker(๐) and ๐ ∈ ๐ฝ. Then ๐(๐๐ข) = ๐๐(๐ข) = ๐ ⋅ 0 = 0. Therefore, ๐๐ข ∈ ker(๐).
By Theorem 8.1 from Lesson 8, ker(๐) ≤ ๐.
โก
Theorem 16.7: Let ๐ and ๐ be vector spaces over a field ๐ฝ and let ๐: ๐ → ๐ be a linear transformation.
Then ker(๐) = {0} if and only if ๐ is injective.
Proof: Suppose that ker(๐) = {0}, let ๐ข, ๐ฃ ∈ ๐, and let ๐(๐ข) = ๐(๐ฃ). Then ๐(๐ข) − ๐(๐ฃ) = 0. It follows
that ๐(๐ข − ๐ฃ) = ๐(๐ข) − ๐(๐ฃ) = 0. So, ๐ข − ๐ฃ ∈ ker(๐). Since ker(๐) = {0}, ๐ข − ๐ฃ = 0. Therefore,
๐ข = ๐ฃ. Since ๐ข, ๐ฃ ∈ ๐ were arbitrary, ๐ is injective.
Conversely, suppose that $T$ is injective, and let $u \in \ker(T)$. Then $T(u) = 0$. But also, by the proof of
Theorem 16.5, $T(0) = 0$. So, $T(u) = T(0)$. Since $T$ is injective, $u = 0$. Since $u \in \ker(T)$ was arbitrary,
$\ker(T) \subseteq \{0\}$. Also, since $T(0) = 0$, we have $0 \in \ker(T)$, and so, $\{0\} \subseteq \ker(T)$. It
follows that $\ker(T) = \{0\}$.
□
If ๐ and ๐ are vector spaces over a field ๐ฝ, and ๐: ๐ → ๐ is a linear transformation, then the rank of
๐ is the dimension of ๐[๐] and the nullity of ๐ is the dimension of ker(๐).
Theorem 16.8: Let ๐ and ๐ be vector spaces over a field ๐ฝ with dim ๐ = ๐ and let ๐: ๐ → ๐ be a
linear transformation. Then rank ๐ + nullity ๐ = ๐.
Note: Before proving the theorem, let’s observe that in a finite-dimensional vector space $V$, any vectors
that are linearly independent can be extended to a basis of $V$.
To see this, let $v_1, v_2, \ldots, v_m$ be linearly independent and let $u_1, u_2, \ldots, u_k$ be any vectors such that
$\text{span}\{u_1, u_2, \ldots, u_k\} = V$. We will decide one by one if we should throw in or exclude each $u_j$.
Specifically, we start by first letting $B_0 = \{v_1, v_2, \ldots, v_m\}$ and then
$$B_1 = \begin{cases} B_0 & \text{if } u_1 \in \text{span } B_0. \\ B_0 \cup \{u_1\} & \text{if } u_1 \notin \text{span } B_0. \end{cases}$$
In general, for each $j = 1, 2, \ldots, k$, we let
$$B_j = \begin{cases} B_{j-1} & \text{if } u_j \in \text{span } B_{j-1}. \\ B_{j-1} \cup \{u_j\} & \text{if } u_j \notin \text{span } B_{j-1}. \end{cases}$$
By Problem 6 from Lesson 8, for each $j$, $B_j$ is linearly independent. Since for each $j$, $u_j \in \text{span } B_j$ and $B_j \subseteq B_k$,
$V = \text{span}\{u_1, u_2, \ldots, u_k\} = \text{span } B_k$. Therefore, $B_k$ is a basis of $V$.
Proof of Theorem 16.8: Suppose nullity $T = k$, where $0 \leq k \leq n$. Then there is a basis $\{v_1, v_2, \ldots, v_k\}$
of $\ker(T)$ (note that if $k = 0$, this basis is the empty set). In particular, the vectors $v_1, v_2, \ldots, v_k$ are
linearly independent. By the note above, we can extend these vectors to a basis $B$ of $V$, let’s say
$B = \{v_1, v_2, \ldots, v_k, u_1, u_2, \ldots, u_r\}$. So, we have $n = k + r$. Let’s show that $\{T(u_1), T(u_2), \ldots, T(u_r)\}$
is a basis of $T[V]$.
For linear independence of $T(u_1), T(u_2), \ldots, T(u_r)$, note that since $T$ is a linear transformation,
$c_1 T(u_1) + c_2 T(u_2) + \cdots + c_r T(u_r) = 0$ is equivalent to $T(c_1 u_1 + c_2 u_2 + \cdots + c_r u_r) = 0$, which is
equivalent to $c_1 u_1 + c_2 u_2 + \cdots + c_r u_r \in \ker(T)$. Since $\{v_1, v_2, \ldots, v_k\}$ is a basis of $\ker(T)$, we can
find weights $d_1, d_2, \ldots, d_k$ such that $c_1 u_1 + c_2 u_2 + \cdots + c_r u_r = d_1 v_1 + d_2 v_2 + \cdots + d_k v_k$. Since $B$ is
a basis of $V$, all weights (the $c_i$’s and $d_j$’s) are 0. So, $T(u_1), T(u_2), \ldots, T(u_r)$ are linearly independent.
To see that $T[V] = \text{span}\{T(u_1), T(u_2), \ldots, T(u_r)\}$, let $v \in V$. Since $B$ is a basis of $V$, we can write $v$ as
a linear combination $v = d_1 v_1 + d_2 v_2 + \cdots + d_k v_k + c_1 u_1 + c_2 u_2 + \cdots + c_r u_r$. Applying the linear
transformation $T$ gives us
$$\begin{aligned} T(v) &= T(d_1 v_1 + d_2 v_2 + \cdots + d_k v_k + c_1 u_1 + c_2 u_2 + \cdots + c_r u_r) \\ &= d_1 T(v_1) + d_2 T(v_2) + \cdots + d_k T(v_k) + c_1 T(u_1) + c_2 T(u_2) + \cdots + c_r T(u_r) \\ &= c_1 T(u_1) + c_2 T(u_2) + \cdots + c_r T(u_r). \end{aligned}$$
Note that $T(v_1), T(v_2), \ldots, T(v_k)$ are all 0 because $v_1, v_2, \ldots, v_k \in \ker(T)$.
Since each vector of the form $T(v)$ can be written as a linear combination of $T(u_1), T(u_2), \ldots, T(u_r)$,
we have shown that $T[V] = \text{span}\{T(u_1), T(u_2), \ldots, T(u_r)\}$.
Since $T(u_1), T(u_2), \ldots, T(u_r)$ are linearly independent and $T[V] = \text{span}\{T(u_1), T(u_2), \ldots, T(u_r)\}$, it
follows that $\{T(u_1), T(u_2), \ldots, T(u_r)\}$ is a basis of $T[V]$. Therefore, rank $T = r$, and so
rank $T$ + nullity $T = r + k = n$.
□
Eigenvalues and Eigenvectors
We now restrict our attention to linear transformations from a vector space to itself. For a vector space
๐, we will abbreviate the linear algebra โ(๐, ๐) by โ(๐).
If ๐ ≤ ๐, we say that ๐ is invariant under ๐ ∈ โ(๐) if ๐[๐] ⊆ ๐.
Example 16.8: Let ๐ be a vector space and let ๐ ∈ โ(๐).
1. {0} is invariant under ๐. Indeed, ๐(0) = 0 by the proof of Theorem 16.5.
2. ๐ is invariant under ๐. Indeed, if ๐ฃ ∈ ๐, then ๐(๐ฃ) ∈ ๐.
3. ker(๐) is invariant under ๐. To see this, let ๐ฃ ∈ ker(๐). Then ๐(๐ฃ) = 0 ∈ ker(๐).
4. $T[V]$ is invariant under $T$. To see this, let $w \in T[V]$. Since $T[V] \subseteq V$, we have $w \in V$, and so $T(w) \in T[V]$.
Let ๐ be a vector space over a field ๐ฝ. We call a subspace ๐ ≤ ๐ a simple subspace if it consists of all
scalar multiples of a single vector. In other words, ๐ is simple if there is a ๐ข ∈ ๐ such that
๐ = {๐๐ข | ๐ ∈ ๐ฝ}.
Theorem 16.9: Let ๐ be a vector space over a field ๐ฝ, let ๐ = {๐๐ข | ๐ ∈ ๐ฝ} be a simple subspace of ๐,
and let ๐ ∈ โ(๐). Then ๐ is invariant under ๐ if and only if there is ๐ ∈ ๐ฝ such that ๐(๐ข) = ๐๐ข.
Proof: Suppose that ๐ = {๐๐ข | ๐ ∈ ๐ฝ} is invariant under ๐. Then ๐(๐ข) ∈ ๐. It follows that ๐(๐ข) = ๐๐ข
for some ๐ ∈ ๐ฝ.
Conversely, suppose there is ๐ ∈ ๐ฝ such that ๐(๐ข) = ๐๐ข. Let ๐ฃ ∈ ๐. Then there is ๐ ∈ ๐ฝ such that
๐ฃ = ๐๐ข. Then ๐(๐ฃ) = ๐(๐๐ข) = ๐๐(๐ข) = ๐(๐๐ข) = (๐๐)๐ข ∈ ๐. Since ๐ฃ ∈ ๐ was arbitrary, ๐[๐] ⊆ ๐.
Therefore, ๐ is invariant under ๐.
โก
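Theorem 16.9 reduces invariance of a simple subspace to a single test: is $T(u)$ a scalar multiple of $u$? The sketch below runs that test for a hypothetical map $T((x, y)) = (2x + y, 3y)$ on $\mathbb{Q}^2$. The map and the helper names `apply` and `scalar_for` are illustrative assumptions, not taken from the text.

```python
from fractions import Fraction

def apply(matrix, v):
    """Apply the linear map given by `matrix` to the vector v."""
    return [sum(Fraction(a) * Fraction(b) for a, b in zip(row, v)) for row in matrix]

def scalar_for(u, image):
    """Return k with image == k*u if such a scalar exists, else None (u must be nonzero)."""
    index = next(i for i, x in enumerate(u) if x != 0)
    k = Fraction(image[index]) / Fraction(u[index])
    return k if all(Fraction(y) == k * Fraction(x) for x, y in zip(u, image)) else None

# Hypothetical map T((x, y)) = (2x + y, 3y), written as a matrix.
T = [[2, 1],
     [0, 3]]

k1 = scalar_for([1, 0], apply(T, [1, 0]))  # T((1, 0)) = (2, 0) = 2(1, 0)
k2 = scalar_for([1, 1], apply(T, [1, 1]))  # T((1, 1)) = (3, 3) = 3(1, 1)
k3 = scalar_for([0, 1], apply(T, [0, 1]))  # T((0, 1)) = (1, 3) is not a multiple of (0, 1)
print(k1, k2, k3)
```

So the simple subspaces spanned by $(1, 0)$ and $(1, 1)$ are invariant under this map, while the one spanned by $(0, 1)$ is not.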
Let ๐ be a vector space over a field ๐ฝ and let ๐ ∈ โ(๐). A scalar ๐ ∈ ๐ฝ is called an eigenvalue of ๐ if
there is a nonzero vector ๐ฃ ∈ ๐ such that ๐(๐ฃ) = ๐๐ฃ. The vector ๐ฃ is called an eigenvector of ๐.
Notes: (1) If $v$ is the zero vector, then $T(v) = T(0) = 0 = \lambda \cdot 0$ for every scalar $\lambda$. This is why we
exclude the zero vector from being an eigenvector. An eigenvector must be nonzero.
(2) If we let ๐ผ: ๐ → ๐ be the identity linear transformation defined by ๐ผ(๐ฃ) = ๐ฃ for all ๐ฃ ∈ ๐, then we
can write ๐๐ฃ as ๐๐ผ(๐ฃ). So, the equation ๐(๐ฃ) = ๐๐ฃ is equivalent to the equation (๐ − ๐๐ผ)(๐ฃ) = 0.
(3) It follows from Note 2 that ๐ is an eigenvalue of ๐ if and only if ker(๐ − ๐๐ผ) ≠ {0}. By Theorem
16.7, ๐ is an eigenvalue of ๐ if and only if ๐ − ๐๐ผ is not injective.
(4) By Note 2, $v$ is an eigenvector of $T$ corresponding to eigenvalue $\lambda$ if and only if $v$ is a nonzero vector
such that $(T - \lambda I)(v) = 0$. So, the eigenvectors of $T$ corresponding to $\lambda$ are precisely the nonzero vectors in
$\ker(T - \lambda I)$. By Theorem 16.6, $\ker(T - \lambda I)$ is a subspace of $V$. We call this subspace the eigenspace of $T$
corresponding to the eigenvalue $\lambda$.
Example 16.9:
1. Let ๐ be any vector space over a field ๐ฝ and let ๐ผ: ๐ → ๐ be the identity linear transformation.
Then for any ๐ฃ ∈ ๐, ๐ผ(๐ฃ) = ๐ฃ = 1๐ฃ. So, we see that 1 is the only eigenvalue of ๐ผ and every
nonzero vector ๐ฃ ∈ ๐ is an eigenvector of ๐ผ for the eigenvalue 1.
2. More generally, if ๐ ∈ ๐ฝ, then the linear transformation ๐๐ผ satisfies (๐๐ผ)(๐ฃ) = ๐๐ผ(๐ฃ) = ๐๐ฃ for
all ๐ฃ ∈ ๐. So, we see that ๐ is the only eigenvalue of ๐๐ผ and every nonzero vector ๐ฃ ∈ ๐ is an
eigenvector of ๐๐ผ for the eigenvalue ๐.
3. Consider $\mathbb{C}^2$ as a vector space over $\mathbb{C}$ and define $T: \mathbb{C}^2 \to \mathbb{C}^2$ by $T((z, w)) = (-w, z)$. Observe
that $\lambda = i$ is an eigenvalue of $T$ with corresponding eigenvector $(1, -i)$. Indeed, we have
$T((1, -i)) = (i, 1)$ and $i(1, -i) = (i, -i^2) = (i, 1)$. So, $T((1, -i)) = i(1, -i)$.
Let’s find all the eigenvalues of this linear transformation. We need to solve the equation
$T((z, w)) = \lambda(z, w)$, or equivalently, $(-w, z) = (\lambda z, \lambda w)$. Equating the first components and
second components gives us the two equations $-w = \lambda z$ and $z = \lambda w$. Solving the first equation
for $w$ yields $w = -\lambda z$. Substituting into the second equation gives us $z = \lambda(-\lambda z) = -\lambda^2 z$. So,
$z + \lambda^2 z = 0$. Using distributivity on the left-hand side of this equation gives $z(1 + \lambda^2) = 0$. So,
$z = 0$ or $1 + \lambda^2 = 0$. If $z = 0$, then $w = -\lambda \cdot 0 = 0$. So, $(z, w) = (0, 0)$. Since an eigenvector
must be nonzero, we reject $z = 0$. The equation $1 + \lambda^2 = 0$ has the two solutions $\lambda = i$ and
$\lambda = -i$. These are the two eigenvalues of $T$.
Next, let’s find the eigenvectors corresponding to the eigenvalue $\lambda = i$. In this case, we have
$T((z, w)) = i(z, w)$, or equivalently, $(-w, z) = (iz, iw)$. So, $-w = iz$ and $z = iw$. These two
equations are actually equivalent. Indeed, if we multiply each side of the second equation by $i$,
we get $iz = i^2 w$, or equivalently, $iz = -w$ or $-w = iz$.
So, we use only one of the equations, say $-w = iz$, or equivalently, $w = -iz$. So, the
eigenvectors of $T$ corresponding to the eigenvalue $\lambda = i$ are all nonzero vectors of the form
$(z, -zi)$. For example, letting $z = 1$, we see that $(1, -i)$ is an eigenvector corresponding to the
eigenvalue $\lambda = i$.
Let’s also find the eigenvectors corresponding to the eigenvalue $\lambda = -i$. In this case, we have
$T((z, w)) = -i(z, w)$, or equivalently, $(-w, z) = (-iz, -iw)$. So, $-w = -iz$ and $z = -iw$. Once
again, these two equations are equivalent. Indeed, if we multiply each side of the second
equation by $-i$, we get $-iz = i^2 w$, or equivalently, $-iz = -w$ or $-w = -iz$.
So, we use only one of the equations, say $-w = -iz$, or equivalently, $w = iz$. So, the
eigenvectors of $T$ corresponding to the eigenvalue $\lambda = -i$ are all nonzero vectors of the form
$(z, zi)$. For example, letting $z = 1$, we see that $(1, i)$ is an eigenvector corresponding to the
eigenvalue $\lambda = -i$.
Note that if we consider the vector space $\mathbb{R}^2$ over the field $\mathbb{R}$ instead of $\mathbb{C}^2$ over $\mathbb{C}$, then the
linear transformation $T: \mathbb{R}^2 \to \mathbb{R}^2$ defined by $T((z, w)) = (-w, z)$ has no eigenvalues (and
therefore, no eigenvectors). Algebraically, this follows from the fact that $1 + \lambda^2 = 0$ has no
real solutions.
It is also easy to see geometrically that this transformation has no eigenvalues. The given
transformation rotates any nonzero point $(z, w) \in \mathbb{R}^2$ counterclockwise by 90°. Since no
real multiple of a nonzero point $(z, w)$ results in such a rotation, we see that there is no eigenvalue.
The figure to the right shows how $T$ rotates the point $(1, 1)$ counterclockwise 90° to the point $(-1, 1)$.
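The eigenvector computations in part 3 of Example 16.9 can be checked with Python's built-in complex numbers. This is only a sketch: the function names `T` and `scale` are our own, and `1j` is Python's notation for $i$.

```python
def T(vector):
    """The map T((z, w)) = (-w, z) from part 3 of Example 16.9."""
    z, w = vector
    return (-w, z)

def scale(scalar, vector):
    """Multiply each component of the vector by the scalar."""
    return tuple(scalar * component for component in vector)

i = 1j
v_plus = (1, -i)   # claimed eigenvector for the eigenvalue i
v_minus = (1, i)   # claimed eigenvector for the eigenvalue -i

check_plus = T(v_plus) == scale(i, v_plus)      # T((1, -i)) = i(1, -i)
check_minus = T(v_minus) == scale(-i, v_minus)  # T((1, i)) = -i(1, i)

# Over the reals, the same rule rotates (1, 1) to (-1, 1).
rotated = T((1, 1))
print(check_plus, check_minus, rotated)
```

All values involved are built from $0$, $\pm 1$, and $\pm i$, so the floating-point comparisons here are exact.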
Let ๐ be a vector space over a field ๐ฝ, let ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ ∈ ๐, and ๐1 , ๐2 , … , ๐๐ ∈ ๐ฝ. Recall from Lesson 8
that the expression ๐1 ๐ฃ1 + ๐2 ๐ฃ2 + โฏ + ๐๐ ๐ฃ๐ is called a linear combination of the vectors ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐
with weights ๐1 , ๐2 , … , ๐๐ .
Also recall once more that ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ are linearly dependent if there exist weights ๐1 , ๐2 , … , ๐๐ ∈ ๐ฝ,
with at least one weight nonzero, such that ๐1 ๐ฃ1 + ๐2 ๐ฃ2 + โฏ + ๐๐ ๐ฃ๐ = 0. Otherwise, we say that
๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ are linearly independent.
In Problem 6 from Lesson 8, you were asked to prove that if a finite set of at least two vectors is linearly
dependent, then one of the vectors in the set can be written as a linear combination of the other
vectors in the set. To prove the next theorem (Theorem 16.11), we will need the following slightly
stronger result.
Lemma 16.10: Let ๐ be a vector space over a field ๐ฝ and let ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ ∈ ๐ be linearly dependent
with ๐ ≥ 2. Also assume that ๐ฃ1 ≠ 0. Then there is ๐ก ≤ ๐ such that ๐ฃ๐ก can be written as a linear
combination of ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ก−1.
Proof: Suppose that $v_1, v_2, \ldots, v_k$ are linearly dependent and $v_1 \neq 0$. Let $c_1 v_1 + c_2 v_2 + \cdots + c_k v_k = 0$
be a nontrivial dependence relation (in other words, not all the $c_i$ are 0). Since $v_1 \neq 0$, we must have
$c_j \neq 0$ for some $j \neq 1$ (otherwise $c_1 v_1 + c_2 v_2 + \cdots + c_k v_k = 0$ implies $c_1 v_1 = 0$, which implies that
$c_1 = 0$, contradicting that the dependence relation is nontrivial). Let $t$ be the largest value such that
$c_t \neq 0$, so that $t \geq 2$. Then we have $c_1 v_1 + c_2 v_2 + \cdots + c_k v_k = c_1 v_1 + c_2 v_2 + \cdots + c_t v_t + 0 v_{t+1} + \cdots + 0 v_k$, and so,
$c_1 v_1 + c_2 v_2 + \cdots + c_t v_t = 0$. Since $c_t \neq 0$, we can solve for $v_t$ to get
$$v_t = -\frac{c_1}{c_t} v_1 - \cdots - \frac{c_{t-1}}{c_t} v_{t-1}.$$
So, $v_t$ can be written as a linear combination of $v_1, v_2, \ldots, v_{t-1}$.
โก
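To make the choice of $t$ in this proof concrete, here is a small worked illustration. The four vectors in $\mathbb{R}^3$ are hypothetical and do not appear in the text.

```latex
% Hypothetical vectors in R^3:
%   v_1 = (1,0,0),  v_2 = (0,1,0),  v_3 = (1,1,0),  v_4 = (0,0,1).
% A nontrivial dependence relation, with c_1 = 1, c_2 = 1, c_3 = -1, c_4 = 0:
\[
  1 \cdot v_1 + 1 \cdot v_2 + (-1) \cdot v_3 + 0 \cdot v_4 = 0 .
\]
% The largest index with a nonzero weight is t = 3, and solving for v_3 gives
\[
  v_3 = -\frac{c_1}{c_3}\, v_1 - \frac{c_2}{c_3}\, v_2 = v_1 + v_2 .
\]
```

Note that $v_4$ is not a linear combination of $v_1, v_2, v_3$, so the index $t$ produced by the lemma need not be the last index $k$.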
Note: A lemma is a theorem whose primary purpose is to prove a more important theorem. Although
Lemma 16.10 is an important result in Linear Algebra, the main reason we are mentioning it now is to
help us prove the next theorem (Theorem 16.11).
Theorem 16.11: Let ๐ be a vector space over a field ๐ฝ, let ๐ ∈ โ(๐), and let ๐1 , ๐2 , … , ๐๐ be distinct
eigenvalues of ๐ with corresponding eigenvectors ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ . Then ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ are linearly
independent.
Proof: Suppose toward contradiction that $v_1, v_2, \ldots, v_k$ are linearly dependent. Let $t$ be the least integer
such that $v_t$ can be written as a linear combination of $v_1, v_2, \ldots, v_{t-1}$ (we can find such a $t$ by Lemma
16.10, which applies because the eigenvector $v_1$ is nonzero). Then there are weights $c_1, c_2, \ldots, c_{t-1}$ such that
$v_t = c_1 v_1 + c_2 v_2 + \cdots + c_{t-1} v_{t-1}$. Apply the linear transformation $T$ to each side of this last equation to get
$T(v_t) = T(c_1 v_1 + c_2 v_2 + \cdots + c_{t-1} v_{t-1}) = c_1 T(v_1) + c_2 T(v_2) + \cdots + c_{t-1} T(v_{t-1})$. Since each $v_i$ is
an eigenvector corresponding to eigenvalue $\lambda_i$, we have $\lambda_t v_t = c_1 \lambda_1 v_1 + c_2 \lambda_2 v_2 + \cdots + c_{t-1} \lambda_{t-1} v_{t-1}$.
We can also multiply each side of the equation $v_t = c_1 v_1 + c_2 v_2 + \cdots + c_{t-1} v_{t-1}$ by $\lambda_t$ to get the
equation $\lambda_t v_t = c_1 \lambda_t v_1 + c_2 \lambda_t v_2 + \cdots + c_{t-1} \lambda_t v_{t-1}$. We now subtract the first of these two equations from the second:
$$\begin{aligned}
\lambda_t v_t &= c_1 \lambda_t v_1 + c_2 \lambda_t v_2 + \cdots + c_{t-1} \lambda_t v_{t-1} \\
\lambda_t v_t &= c_1 \lambda_1 v_1 + c_2 \lambda_2 v_2 + \cdots + c_{t-1} \lambda_{t-1} v_{t-1} \\
0 &= c_1 (\lambda_t - \lambda_1) v_1 + c_2 (\lambda_t - \lambda_2) v_2 + \cdots + c_{t-1} (\lambda_t - \lambda_{t-1}) v_{t-1}
\end{aligned}$$
Since we chose $t$ to be the least integer such that $v_t$ can be written as a linear combination of
$v_1, v_2, \ldots, v_{t-1}$, it follows that $v_1, v_2, \ldots, v_{t-1}$ are linearly independent (otherwise, Lemma 16.10 would
produce a smaller such integer). Therefore, the constants
$c_1 (\lambda_t - \lambda_1), c_2 (\lambda_t - \lambda_2), \ldots, c_{t-1} (\lambda_t - \lambda_{t-1})$ are all 0. Since the eigenvalues are all distinct, we must
have $c_1 = c_2 = \cdots = c_{t-1} = 0$. Then $v_t = c_1 v_1 + c_2 v_2 + \cdots + c_{t-1} v_{t-1} = 0$, contradicting our
assumption that $v_t$ is an eigenvector. Therefore, $v_1, v_2, \ldots, v_k$ cannot be linearly dependent. So,
$v_1, v_2, \ldots, v_k$ are linearly independent.
โก
Let $A$ be a square matrix, say $A = \begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{n1} & \cdots & a_{nn} \end{bmatrix}$. The diagonal entries of $A$ are the entries
$a_{11}, a_{22}, \ldots, a_{nn}$. All other entries of $A$ are nondiagonal entries.
Example 16.10: The diagonal entries of the matrix $B = \begin{bmatrix} 1 & 5 & 2 \\ 3 & 6 & 0 \\ 2 & 9 & 8 \end{bmatrix}$ are $b_{11} = 1$, $b_{22} = 6$, and $b_{33} = 8$.
The nondiagonal entries of $B$ are $b_{12} = 5$, $b_{13} = 2$, $b_{21} = 3$, $b_{23} = 0$, $b_{31} = 2$, and $b_{32} = 9$.
A diagonal matrix is a square matrix that has every nondiagonal entry equal to 0.
Example 16.11: The matrix $B$ from Example 16.10 is not a diagonal matrix, while the matrices
$C = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 6 & 0 \\ 0 & 0 & 8 \end{bmatrix}$ and $D = \begin{bmatrix} 5 & 0 & 0 \\ 0 & -2 & 0 \\ 0 & 0 & 0 \end{bmatrix}$ are diagonal matrices.
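The definitions of diagonal entry and diagonal matrix translate directly into code. Below is a minimal sketch that represents a matrix as a list of rows and tests the matrices $B$, $C$, and $D$ from Examples 16.10 and 16.11. The function names `diagonal_entries` and `is_diagonal` are our own.

```python
def diagonal_entries(matrix):
    """Return the entries a_11, a_22, ..., a_nn of a square matrix."""
    return [matrix[i][i] for i in range(len(matrix))]

def is_diagonal(matrix):
    """Return True if the matrix is square and every nondiagonal entry is 0."""
    n = len(matrix)
    if any(len(row) != n for row in matrix):
        return False  # a diagonal matrix must be square
    return all(matrix[i][j] == 0
               for i in range(n) for j in range(n) if i != j)

B = [[1, 5, 2], [3, 6, 0], [2, 9, 8]]
C = [[1, 0, 0], [0, 6, 0], [0, 0, 8]]
D = [[5, 0, 0], [0, -2, 0], [0, 0, 0]]

print(diagonal_entries(B), is_diagonal(B), is_diagonal(C), is_diagonal(D))
```

Notice that $D$ is diagonal even though one of its diagonal entries is 0. The definition restricts only the nondiagonal entries.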
Let $V$ be a vector space. A linear transformation $T \in \mathcal{L}(V)$ is said to be diagonalizable if there is a basis
$B$ of $V$ for which $\mathcal{M}_T(B)$ is a diagonal matrix.
Example 16.12:
1. Consider โ as a vector space over โ and define ๐: โ → โ by ๐(๐ง) = 5๐ง, as we did in part 1 of
Example 16.1. The equation ๐(๐ง) = 5๐ง tells us that every nonzero vector ๐ง is an eigenvector
corresponding to the eigenvalue ๐ = 5. ๐(๐ง) = ๐๐ง is equivalent to 5๐ง = ๐๐ง or (๐ − 5)๐ง = 0.
So, ๐ = 5 is the only eigenvalue. In particular, the standard basis vectors 1 and ๐ are
eigenvectors corresponding to the eigenvalue ๐ = 5. We have
๐(1) = 5 = 5 ⋅ 1 + 0 ⋅ ๐.
๐(๐) = 5๐ = 0 ⋅ 1 + 5 ⋅ ๐.
So, as we saw in part 1 of Example 16.6, the matrix of $T$ with respect to the standard basis is
$\mathcal{M}_T = \begin{bmatrix} 5 & 0 \\ 0 & 5 \end{bmatrix}$, a diagonal matrix. Therefore, $T$ is diagonalizable.
2. Consider โ3 as a vector space over โ and define ๐: โ3 → โ3 by
๐((๐ฅ, ๐ฆ, ๐ง)) = (3๐ฅ + ๐ฆ, ๐ฆ − 2๐ง, 7๐ง).
Let’s find the eigenvalues and eigenvectors of ๐.
We start by solving the equation ๐((๐ฅ, ๐ฆ, ๐ง)) = ๐(๐ฅ, ๐ฆ, ๐ง). This equation is equivalent to the
three equations 3๐ฅ + ๐ฆ = ๐๐ฅ, ๐ฆ − 2๐ง = ๐๐ฆ, and 7๐ง = ๐๐ง. We work backwards. If ๐ง ≠ 0, we get
๐ = 7. If ๐ง = 0 and ๐ฆ ≠ 0, we get ๐ = 1. Finally, if ๐ง = 0, ๐ฆ = 0, and ๐ฅ ≠ 0, we get ๐ = 3.
So, the eigenvalues of ๐ are 7, 1, and 3.
If we let $\lambda = 7$, we get $3x + y = 7x$, $y - 2z = 7y$, and $7z = 7z$. The equation $y - 2z = 7y$ is
equivalent to the equation $6y = -2z$, or $y = -\frac{1}{3} z$. The equation $3x + y = 7x$ is equivalent to
$4x = y = -\frac{1}{3} z$, or $x = -\frac{1}{12} z$. So, if we let $z = -12$, we get the eigenvector $v_1 = (1, 4, -12)$.
If we let $\lambda = 1$, we get $3x + y = x$, $y - 2z = y$, and $7z = z$. The equation $7z = z$ is equivalent
to the equation $z = 0$. The equation $y - 2z = y$ is then equivalent to $y = y$. The equation
$3x + y = x$ is equivalent to $2x = -y$ or $x = -\frac{1}{2} y$. So, if we let $y = -2$, we get the eigenvector
$v_2 = (1, -2, 0)$.
If we let ๐ = 3, we get 3๐ฅ + ๐ฆ = 3๐ฅ, ๐ฆ − 2๐ง = 3๐ฆ, and 7๐ง = 3๐ง. The equation 7๐ง = 3๐ง is
equivalent to the equation ๐ง = 0. The equation ๐ฆ − 2๐ง = 3๐ฆ is then equivalent to ๐ฆ = 0. The
equation 3๐ฅ + ๐ฆ = 3๐ฅ is then equivalent to ๐ฅ = ๐ฅ. So, if we let ๐ฅ = 1, we get the eigenvector
๐ฃ3 = (1, 0, 0).
It follows that ๐ต = {(1, 4, – 12), (1, – 2, 0), (1, 0, 0)} is a basis of eigenvectors of ๐ and we have
๐((1, 4, – 12)) = 7(1, 4, – 12)
๐((1, – 2, 0)) = 1(1, – 2, 0)
๐((1, 0, 0)) = 3(1, 0, 0)
Therefore, the matrix of $T$ with respect to $B$ is $\mathcal{M}_T(B) = \begin{bmatrix} 7 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 3 \end{bmatrix}$.
Since $\mathcal{M}_T(B)$ is a diagonal matrix, $T$ is diagonalizable.
3. Recall from part 3 of Example 16.9, the linear transformation ๐: โ2 → โ2 defined by
๐((๐ฅ, ๐ฆ)) = (– ๐ฆ, ๐ฅ) (where โ2 is being viewed as a vector space over the field โ). We saw in
that example that this linear transformation has no eigenvalues. It follows that there is no basis
for โ2 such that the matrix of ๐ with respect to that basis is a diagonal matrix. In other words,
๐ is not diagonalizable.
However, in the same example, we saw that the linear transformation $T: \mathbb{C}^2 \to \mathbb{C}^2$ defined by
$T((z, w)) = (-w, z)$ (where $\mathbb{C}^2$ is being viewed as a vector space over the field
$\mathbb{C}$) has eigenvalues $i$ and $-i$ with corresponding eigenvectors $(1, -i)$
and $(1, i)$, respectively. So, we have
$T((1, -i)) = i(1, -i)$
$T((1, i)) = -i(1, i)$
So, the matrix of $T$ with respect to the basis $B = \{(1, -i), (1, i)\}$ is $\mathcal{M}_T(B) = \begin{bmatrix} i & 0 \\ 0 & -i \end{bmatrix}$, a
diagonal matrix. Therefore, $T$ is diagonalizable.
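The matrix found in part 2 of Example 16.12 can be recomputed mechanically. The sketch below assumes the usual convention that column $j$ of $\mathcal{M}_T(B)$ holds the coordinates of $T(b_j)$ with respect to $B$ (for a diagonal result the convention makes no difference). The helper name `coordinates` is our own, and exact arithmetic is done with `Fraction`.

```python
from fractions import Fraction

def T(v):
    """T((x, y, z)) = (3x + y, y - 2z, 7z) from part 2 of Example 16.12."""
    x, y, z = v
    return (3 * x + y, y - 2 * z, 7 * z)

def coordinates(basis, target):
    """Solve c1*b1 + ... + cn*bn = target by Gauss-Jordan elimination."""
    n = len(basis)
    # Augmented matrix: columns are the basis vectors, last column is the target.
    rows = [[Fraction(basis[j][i]) for j in range(n)] + [Fraction(target[i])]
            for i in range(n)]
    for col in range(n):
        # A pivot always exists because `basis` is assumed to be a basis.
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        lead = rows[col][col]
        rows[col] = [x / lead for x in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[col])]
    return [rows[i][n] for i in range(n)]

B = [(1, 4, -12), (1, -2, 0), (1, 0, 0)]

# Column j of the matrix is the coordinate vector of T(b_j) with respect to B.
columns = [coordinates(B, T(b)) for b in B]
matrix = [[columns[j][i] for j in range(3)] for i in range(3)]
print(matrix)
```

The off-diagonal entries vanish precisely because each basis vector is an eigenvector, so $T(b_j)$ is a multiple of $b_j$ alone.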
We finish with a theorem that gives a sufficient condition for a linear transformation to be
diagonalizable.
Theorem 16.12: Let ๐ be an ๐-dimensional vector space and let ๐ ∈ โ(๐) have ๐ distinct eigenvalues.
Then ๐ is diagonalizable.
Proof: Suppose that $\dim V = n$ and $T \in \mathcal{L}(V)$ has the $n$ distinct eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_n$, with
corresponding eigenvectors $v_1, v_2, \ldots, v_n$. By Theorem 16.11, $v_1, v_2, \ldots, v_n$ are linearly independent. By
the note following Theorem 16.8, $v_1, v_2, \ldots, v_n$ can be extended to a basis of $V$. However, a basis of $V$
has $n$ elements and therefore, $B = \{v_1, v_2, \ldots, v_n\}$ is already a basis of $V$. Since $T(v_1) = \lambda_1 v_1$,
$T(v_2) = \lambda_2 v_2$, …, $T(v_n) = \lambda_n v_n$, it follows that
$$\mathcal{M}_T(B) = \begin{bmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{bmatrix}.$$
Since $\mathcal{M}_T(B)$ is a diagonal matrix, $T$ is diagonalizable.
โก
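The condition in Theorem 16.12 is sufficient but not necessary. As a brief worked check, part 1 of Example 16.12 already gives a diagonalizable linear transformation on a 2-dimensional space with only one eigenvalue:

```latex
% From part 1 of Example 16.12: C viewed as a vector space over R, T(z) = 5z.
% dim C = 2 (standard basis {1, i}), and the only eigenvalue is 5:
\[
  T(z) = \lambda z \iff (\lambda - 5) z = 0 ,
  \qquad\text{so } \lambda = 5 \text{ whenever } z \neq 0 .
\]
% Nevertheless, the matrix with respect to the standard basis is diagonal:
\[
  \mathcal{M}_T = \begin{bmatrix} 5 & 0 \\ 0 & 5 \end{bmatrix} .
\]
```

So having $n$ distinct eigenvalues guarantees diagonalizability, but a diagonalizable linear transformation may have fewer than $n$ distinct eigenvalues.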
Problem Set 16
Full solutions to these problems are available for free download here:
www.SATPrepGet800.com/PMFBXSG
LEVEL 1
1. Let ๐ and ๐ be vector spaces over โ. Determine if each of the following functions is a linear
transformation:
(i) $f: \mathbb{R} \to \mathbb{R}$ defined by $f(x) = 2x + 1$
(ii) $g: \mathbb{R} \to \mathbb{R}^2$ defined by $g(x) = (2x, 3x)$
(iii) $h: \mathbb{R}^3 \to \mathbb{R}^3$ defined by $h((x, y, z)) = (x + y, x + z, z - y)$
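2. Compute each of the following:
(i) $\begin{bmatrix} 2 & 0 & -3 \\ 0 & 1 & 4 \end{bmatrix} \cdot \begin{bmatrix} 1 & 1 & 3 & 0 \\ 1 & -4 & 2 & 0 \\ 2 & 0 & 1 & -4 \end{bmatrix}$
(ii) $\begin{bmatrix} 3 & -1 & 5 \end{bmatrix} \cdot \begin{bmatrix} -4 \\ -7 \\ 2 \end{bmatrix}$
(iii) $\begin{bmatrix} -4 \\ -7 \\ 2 \end{bmatrix} \cdot \begin{bmatrix} 3 & -1 & 5 \end{bmatrix}$
(iv) $\begin{bmatrix} a & b & c \\ d & e & f \\ g & h & i \end{bmatrix} \cdot \begin{bmatrix} 1 & 0 & 1 \\ 0 & 2 & 0 \\ 3 & 1 & 4 \end{bmatrix}$.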
LEVEL 2
3. Consider $\mathbb{C}$ as a vector space over itself. Give an example of a function $f: \mathbb{C} \to \mathbb{C}$ such that $f$ is
additive, but not a linear transformation. Then give an example of vector spaces $V$ and $W$ and a
homogeneous function $g: V \to W$ that is not a linear transformation.
LEVEL 3
4. Let $P = \{ax^2 + bx + c \mid a, b, c \in \mathbb{R}\}$ be the vector space of polynomials of degree at most 2 with real
coefficients (see part 3 of Example 8.3 from Lesson 8). Define the linear transformation
$D: P \to P$ by $D(ax^2 + bx + c) = 2ax + b$. Find the matrix of $D$ with respect to each of the
following bases:
(i) The standard basis $B = \{1, x, x^2\}$
(ii) $C = \{x + 1, x^2 + 1, x^2 + x\}$
5. Let ๐ and ๐ be vector spaces with ๐ finite-dimensional, let ๐ ≤ ๐, and let ๐ ∈ โ(๐, ๐). Prove
that there is an ๐ ∈ โ(๐, ๐) such that ๐(๐ฃ) = ๐(๐ฃ) for all ๐ฃ ∈ ๐.
LEVEL 4
6. Let ๐: ๐ → ๐ be a linear transformation and let ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ ∈ ๐. Prove the following:
(i) If ๐ is injective and ๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ are linearly independent in ๐, then
๐(๐ฃ1 ), ๐(๐ฃ2 ), … , ๐(๐ฃ๐ ) are linearly independent in ๐.
(ii) If ๐ is surjective and span{๐ฃ1 , ๐ฃ2 , … , ๐ฃ๐ } = ๐, then span{๐(๐ฃ1 ), ๐(๐ฃ2 ), … , ๐(๐ฃ๐ )} = ๐.
7. Determine if each linear transformation is diagonalizable:
(i) $T: \mathbb{R}^2 \to \mathbb{R}^2$ defined by $T((x, y)) = (y, 2x)$
(ii) $T: \mathbb{C}^2 \to \mathbb{C}^2$ defined by $T((z, w)) = (z + iw, iz - w)$.
8. Let ๐ and ๐ be vector spaces over a field ๐ฝ. Prove that โ(๐, ๐) is a vector space over ๐ฝ, where
addition and scalar multiplication are defined as in Theorem 16.2.
9. Let ๐ be a vector space over a field ๐ฝ. Prove that โ(๐) is a linear algebra over ๐ฝ, where addition
and scalar multiplication are defined as in Theorem 16.2 and vector multiplication is given by
composition of linear transformations.
10. Let ๐: ๐ → ๐ and ๐: ๐ → ๐ be linear transformations such that ๐๐ = ๐๐ and ๐๐ = ๐๐ . Prove
that ๐ and ๐ are bijections and that ๐ = ๐ −1 .
11. Let ๐ and ๐ be finite-dimensional vector spaces and let ๐ ∈ โ(๐, ๐). Prove the following:
(i) If $\dim V < \dim W$, then $T$ is not surjective.
(ii) If $\dim V > \dim W$, then $T$ is not injective.
12. Prove that two finite-dimensional vector spaces over a field ๐ฝ are isomorphic if and only if they
have the same dimension.
13. Let $T \in \mathcal{L}(V)$ be invertible and let $\lambda \in \mathbb{F} \setminus \{0\}$. Prove that $\lambda$ is an eigenvalue of $T$ if and only if
$\frac{1}{\lambda}$ is an eigenvalue of $T^{-1}$.
LEVEL 5
14. Let ๐ be a vector space with dim ๐ > 1. Show that {๐ ∈ โ(๐) | ๐ is not invertible} โฐ โ(๐).
15. Let $V$ be an $n$-dimensional vector space over a field $\mathbb{F}$. Prove that there is a linear algebra
isomorphism $F: \mathcal{L}(V) \to M_{nn}^{\mathbb{F}}$.
INDEX
Abelian, 33
Abelian group, 35
Absolute value, 82
Absorption law, 109
Accessible space, 197
Accumulation point, 91
Additive function, 234
Algebraically closed field, 78
Almost ring, 146
Angle, 212
Angle in standard position, 212
Antireflexive, 120
Antisymmetric, 120
Archimedean Property, 60
Argument of a complex
number, 216
Associative, 32
Associative law, 109
Assumption, 109
Atomic statement, 9, 107
Automorphism, 146
Axiom of Extensionality, 26
Ball, 201
Base case, 44
Basis, 102, 104
Basis for a topology, 192
Biconditional, 11
Biconditional elimination, 114
Biconditional introduction, 114
Biconditional law, 109
Bijection, 126
Bijective function, 126
Binary connective, 10
Binary operation, 30
Binary relation, 119, 137
Boundary point, 92
Bounded, 58
Bounded above, 58
Bounded below, 58
Bounded interval, 64
Canonical form, 162
Canonical representation, 162
Cantor-Schroeder-Bernstein
Theorem, 133
Cantor’s Theorem, 131
Cardinality, 20
Cartesian product, 30, 119
Chains of topologies, 191
Circle, 85, 212
Circumference, 212
Clopen, 200
Closed disk, 86
Closed downwards, 33, 98
Closed interval, 64
Closed set, 74, 89, 189
Closing statement, 21
Closure, 31, 35
Coarser topology, 190
Codomain, 125
Cofinite topology, 198
Common divisor, 159
Common factor, 159
Common multiple, 159
Commutative, 33
Commutative group, 35
Commutative law, 109
Compact space, 203
Comparability condition, 124
Complement, 74, 89
Complete prime factorization,
164
Completeness, 58
Completeness Property, 60
Complex number, 78
Composite number, 152
Composite function, 129
Compound statement, 9
Conclusion, 109, 111
Conditional, 11
Conditional law, 109
Conjugate, 81, 147
Conjunction, 11
Conjunctive elimination, 114
Conjunctive introduction, 114
Connective, 9
Constant, 137
Constant function, 125
Constructive dilemma, 114
Continuity, 174, 177, 204, 226
Continuous at a point, 174,
177, 205, 226
Continuous function, 174, 204
Contradiction, 109
Contrapositive, 113
Contrapositive law, 109
Converse, 113
Corollary, 130
Cosine, 214
Countable, 131
Counterexample, 31
Cover of a topology, 194
Covering, 203
Cycle diagram, 148
Cycle notation, 148
De Moivre’s Theorem, 217
De Morgan’s laws, 11, 77, 109
Deleted neighborhood, 87
Dense, 61
Density Theorem, 61
Denumerable, 131
Dependence relation, 104
Derivation, 114
Destructive dilemma, 114
Diagonal entry, 250
Diagonal matrix, 250
Diagonalizable, 250
Difference identity, 232
Dilation, 234
Dimension, 244
Discrete topology, 190
Disjoint, 25, 70
Disjunction, 11
Disjunctive introduction, 114
Disjunctive resolution, 114
Disjunctive syllogism, 114
Disk, 86
Distance, 83
Distance function, 201
Distributive, 39, 77
Distributive law, 107
Distributivity, 40
Divides, 42, 152
Divisible, 42, 152
Divisibility, 41
Division Algorithm, 155, 156
Divisor, 42, 152
Domain, 125, 137
Double negation, 108
Eigenvalue, 247
Eigenvector, 247
Element, 19
Empty set, 20
Equinumerosity, 130
Equivalence class, 122
Equivalence relation, 121
Euclidean Algorithm, 165
Euler’s formula, 216
Even, 41, 47
Exclusive or, 17
Exponential form of a complex
number, 216
Extended Complex Plane, 229
Factor, 42, 152
Factor tree, 162
Factorial, 154
Factorization, 153
Fallacy, 112
Fallacy of the converse, 113
Fallacy of the inverse, 113
Fence-post formula, 20
Field, 41, 50
Field axioms, 50, 51
Field homomorphism, 143
Finer topology, 190
Finitary operation, 137
Finitary relation, 137
Finite-dimensional vector
space, 244
Finite sequence, 126
Fixed point, 221
Function, 124, 218
Fundamental Theorem of
Arithmetic, 152
Gaussian integer, 150
GCD, 159
Greatest common divisor, 159
Greatest common factor, 159
Greatest lower bound, 58
Group, 34
Group homomorphism, 142
Half-open interval, 64
Hausdorff space, 198
Homeomorphic spaces, 207
Homeomorphism, 207
Homogeneous function, 234
Homomorphism, 142
Horizontal strip, 169
Hypothesis, 109
Hypothetical syllogism, 114
Ideal, 149
Identity, 34
Identity function, 130
Identity law, 109
Image, 146, 204, 244
Imaginary part, 79
Implication, 11
Inclusion map, 134
Incomparable topologies, 190
Indiscrete topology, 190
Induced topology, 202
Induction, 43
Inductive hypothesis, 44
Inductive step, 44
Infimum, 58, 59
Infinite closed interval, 64
Infinite interval, 64
Infinite limit, 183
Infinite open interval, 64
Infinite sequence, 126
Infinite set, 19
Initial ray, 212
Injection, 126
Injective function, 126
Integer, 19
Interior point, 92
Intersection, 24, 66, 69
Intersection containment
property, 194
Interval, 64
Invalid, 112
Invariant, 220
Invariant subspace, 247
Inverse, 34, 113
Inverse function, 127
Invertible, 34
Isomorphism, 55, 145
Kernel, 146, 244
Kolmogorov space, 197
LCM, 159
Least common multiple, 159
Least upper bound, 58
Left distributivity, 39
Lemma, 250
Limit, 172, 176, 177, 223
Limits involving infinity, 183
Linear algebra, 237
Linear combination, 101, 103,
160
Linear dependence, 102, 104
Linear equation, 78
Linear function, 179
Linear independence, 102, 104
Linear transformation, 234
Linearly ordered set, 124
Logical argument, 111
Logical connective, 9
Logical equivalence, 108
Lower bound, 58
Matrix, 97, 239
Matrix addition, 97
Matrix of a linear
transformation, 242
Matrix multiplication, 240, 241
Matrix scalar multiplication, 97
Metric, 201
Metric space, 201
Metrizable space, 202
Modulus, 82
Modus ponens, 112, 114
Modus tollens, 114
Monoid, 34
Monoid homomorphism, 142
Monotonic function, 143
Multiple, 42, 152
Mutually exclusive, 25
Mutually relatively prime, 160
Natural number, 19
Negation, 11
Negation law, 109
Negative identities, 215
Neighborhood, 86
Nondiagonal entry, 250
Normal, 147
Normal space, 200
Normal subgroup, 147
North pole, 228
Null space, 244
Nullity, 246
Odd, 47
One-sided limit, 185
One-to-one function, 126
Onto, 126
Open ball, 201
Open covering, 203
Open disk, 86
Open interval, 64
Open rectangle, 170
Open set, 71, 87, 189
Opening statement, 21
Order homomorphism, 143
Ordered field, 52
Ordered pair, 118
Ordered ring, 52
Ordered tuple, 118
Ordering, 124
Pairwise disjoint, 70, 121
Pairwise relatively prime, 160
Parity, 121
Partial binary operation, 31
Partial ordering, 124
Partially ordered set, 124
Partition, 121
Permutation, 148
Point at infinity, 228
Polar form of a complex
number, 216
Polynomial equation, 78
Polynomial ring, 151
Poset, 124
Positive square root, 82
Power set, 23
Premise, 109, 111
Prime factorization, 153
Prime number, 152
Principle of Mathematical
Induction, 43
Principal root, 218
Product, 41
Product topology, 210
Proof, 111
Proof by contradiction, 44
Proof by contrapositive, 129
Proposition, 9, 107
Propositional variable, 10, 107
Punctured disk, 87
Pure imaginary number, 79
Pythagorean identity, 215
Pythagorean Theorem, 56
Quadrantal angles, 214
Quadratic equation, 78
Quotient, 35, 81, 155
Radian measure, 212
Range, 125, 244
Rank, 246
Rational number, 35
Ray, 212
Real number, 60, 79
Real part, 79
Redundancy law, 109
Reflection, 221
Reflexive, 29, 120
Regular space, 200
Relation, 119, 120, 137
Relatively prime, 159
Representative of equivalence
class, 123
Riemann sphere, 228
Right distributivity, 39
Ring, 39
Ring axioms, 40
Ring homomorphism, 143
Ring ideal, 149
Ring with identity, 40
Rng, 147
Root of a complex number, 218
Roots of unity, 218
Rotation, 222
Rule of inference, 112
SACT, 45
Scalar multiplication, 93, 95
Semigroup, 32
Semigroup homomorphism,
142
Semiring, 41
Separation axioms, 197
Sequence, 126
Set, 19
Set-builder notation, 20
Set complement, 74
Set difference, 66
Sigma notation, 241
Simple subspace, 247
Sine, 214
Soundness, 113
South pole, 228
Span, 101, 103
Square matrix, 240
Square root, 82, 218
Standard Advanced Calculus
Trick, 45
Standard form of a complex
number, 78, 216
Standard topology, 192
Statement, 9, 107
Strict linearly ordered set, 124
Strict partial ordering, 124
Strict partially ordered set, 124
Strict poset, 124
Strip, 169
Strong Induction, 49
Subbasis for a topology, 196
Subfield, 80, 141
Subgroup, 140
Submonoid, 139
Subring, 140
Subsemigroup, 139
Subset, 20
Subspace, 98
Subspace topology, 210
Substatement, 107
Substitution of logical
equivalents, 109
Substitution of sentences, 109
Substructure, 139
Sum, 41
Sum identities, 215
Summation, 241
Supremum, 59
Surjection, 126
Surjective function, 126
Surjectively invariant, 220
Symmetric, 29, 120
Symmetric difference, 66
Tangent, 214
Tautologically implies, 112
Tautology, 22, 109
Terminal ray, 212
Ternary relation, 120, 137
Theorem, 21
Tichonov space, 197
Topological equivalence, 207
Topological invariant, 209
Topological property, 209
Topological space, 189
Topology, 85, 189
Totally ordered set, 124
Transitive, 24, 120
Transitivity of logical
equivalence, 109
Translation, 220
Tree diagram, 23
Triangle Inequality, 84
Trichotomy, 124
Trigonometric functions, 214
Trivial topology, 190
Truth table, 12
Type, 138
Unary connective, 10
Unary relation, 120, 137
Uncountable, 131
Uniform continuity, 180, 227
Uniformly continuous, 180, 227
Union, 24, 66, 69
Unit circle, 212
Unital ring, 40
Universal Quantifier, 21
Universal set, 21
Universal statement, 33, 98
Unordered pair, 118
Upper bound, 58
Valid, 112
Vector, 79, 95
Vector multiplication, 237
Vector space, 93
Venn diagram, 21
Vertical strip, 169
Weight, 101, 103, 160
Well-defined, 123
Well Ordering Principle, 43
Without loss of generality, 73
Wrapping function, 214
About the Author
Dr. Steve Warner, a New York native, earned his Ph.D. at Rutgers University in Pure Mathematics in
May 2001. While a graduate student, Dr. Warner won the TA Teaching
Excellence Award.
After Rutgers, Dr. Warner joined the Penn State Mathematics
Department as an Assistant Professor and in September 2002, he
returned to New York to accept an Assistant Professor position at Hofstra
University. By September 2007, Dr. Warner had received tenure and was
promoted to Associate Professor. He has taught undergraduate and
graduate courses in Precalculus, Calculus, Linear Algebra, Differential
Equations, Mathematical Logic, Set Theory, and Abstract Algebra.
From 2003 – 2008, Dr. Warner participated in a five-year NSF grant, “The
MSTP Project,” to study and improve mathematics and science curriculum in poorly performing junior
high schools. He also published several articles in scholarly journals, specifically on Mathematical Logic.
Dr. Warner has nearly two decades of experience in general math tutoring and tutoring for
standardized tests such as the SAT, ACT, GRE, GMAT, and AP Calculus exams. He has tutored students
both individually and in group settings.
In February 2010 Dr. Warner released his first SAT prep book “The 32 Most Effective SAT Math
Strategies,” and in 2012 founded Get 800 Test Prep. Since then Dr. Warner has written books for the
SAT, ACT, SAT Math Subject Tests, AP Calculus exams, and GRE.
Dr. Steve Warner can be reached at
steve@SATPrepGet800.com
BOOKS BY DR. STEVE WARNER