Transition to Calculus Lecture Notes for Department of Mathematics

advertisement
Lecture Notes
for
Transition to Calculus
Department of Mathematics
University of North Dakota
Grand Forks, ND 58202
Fall 2003
Preface
This collection of lecture notes is designed for use in Arts and Sciences 250 - Transition to
Calculus, at the University of North Dakota. This course is not a comprehensive preparation
for calculus. It is designed for either the student who needs a few gaps filled in order to be
ready for calculus, or for the student who is in calculus, but needs to refresh their precalculus
skills.
c
°2003
University of North Dakota Mathematics Department
Permission is granted to copy, distribute and/or modify this document under the terms of
the GNU Free Documentation License, Version 1.2 or any later version published by the Free
Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover
Texts. A copy of the license is included at http://www.gnu.org/copyleft in the section
entitled ”GNU Free Documentation License.”
i
TABLE OF CONTENTS
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1. Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2. Polynomial Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3. Rational and Algebraic Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4. Partial Fraction Expansion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
5. Exponential and Logarithmic Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
6. Exponential Growth and Decay . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
7. Conic Sections. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .23
8. Basic Trigonometric Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
9. Special Values of Trigonometric Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
10. General Sinusoids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
11. Trigonometric Identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
12. Applications of Trigonometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
13. Polar Coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
14. Parametric Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
Appendices
A. The Real Number Line and the Cartesian Plane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
B. Lines. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66
C. Factoring Polynomials and Complex Arithmetic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
D. Solving Systems of Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . on line
E. Rotation of Axes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
ii
Introduction
We denote the set of real numbers by IR, and the Cartesian Plane by IR2 = IR × IR. For
more information on these sets see appendix A. Information on lines is contained in appendix
B. These appendices should be read carefully before proceeding.
A relation of real numbers is a subset of IR × IR. Des Cartes’ invention allows us to
represent any relation of real numbers graphically. That is, the graph of a relation, R, consists
of all ordered pairs (a, b) ∈ IR2 , where (a, b) ∈ R. Given a relation, R, its inverse relation,
denoted by R−1 is the relation R−1 = {(b, a)|(a, b) ∈ R}. The graph of R−1 will therefore be
the graph of R reflected across the line y = x.
Commonly a relation will be describable by a system of equations and inequalities. Thus
we may think of a relation as the set of pairs (a, b) for which the equations and inequalities
E1 (a, b), E2 (a, b), . . . , En (a, b) are all true. So for example E1 (x, y) may be x2 = y 2 , and
E2 (x, y) might be |x| ≤ 1. The relation defined by these is S = {(a, b)|a = ±b and − 1 ≤ a ≤
1}. If we put R1 = {(a, a)|a ∈ IR} and R2 = {(a, −a)|a ∈ IR}, then R = R1 ∪ R2 is the relation
defined by E1 . Moreover if we set T = {(x, y)| − 1 ≤ x ≤ 1}, then T is the relation defined by
E2 , and S = R ∩ T . Notice that S = S −1 .
In an attempt to deal with the general case we see that we must at least be able to
determine relations described by a single equation, rule, or formula. Moreover it will be
advantageous if one variable in the equation is completely determined by the other, since in
this case every single input will yield exactly one ordered pair as output. This explains why
we will spend most of our time considering functions.
Section 1: Functions
Given two sets of real numbers A and B, a function, f , is a relation of real numbers with
the property that for every a ∈ A, there is exactly one b ∈ B, so that (a, b) is in the relation.
Notationally we write f : A −→ B. Also when (a, b) is in the relation we write b = f (a). As
alluded to above, we will usually be able to describe f by an equation in x and y. But since
y is determined, given x, we can rewrite the equation to be y − f (x) = 0, where f (x) is some
formula, or rule dependent on x. Equivalently we write y = f (x).
Given a function y = f (x), where f : A −→ B, we call A the domain of f and B the
codomain. We write A = dom f . Graphically a ∈ dom f if and only if the vertical line x = a
1
Section 1: Functions
intersects the graph of f . In fact this vertical line test allows us to separate those relations
which are functions, from those which are not. A relation f is a function only if every vertical
line x = r intersects the graph of f at most once. The domain of a function can be equated
with those values of r for which the intersection is non-empty.
We will suppose that B = IR. However we are interested in the subset C of B consisting
of all b ∈ B so that b = f (a) for at least one a ∈ A. This subset is the range of f . We write
C = ran f . A number s ∈ IR is in the range of f only if the horizontal line y = s intersects
the graph of f at least once.
The end behavior of the functions describes how the output values tend as the inputs
become large in absolute value. We write x −→ ∞ to mean x grows without bound positively,
and x −→ −∞ to mean that x grows without bound negatively. In general the notation
E −→ F means that the expression E is becoming arbitrarily close to the expression F . A
line ax + by + c = 0 (not both a = 0 and b = 0) is an asymptote for a function if the graph of
the function becomes arbitrarily close to the graph of the line.
If a ∈ IR and f (a) ≤ f (b) for all b ∈ dom f , (a, f (a)) is a global minimum. If a ∈ IR
and f (a) ≤ f (b) for all b in an interval I containing a, then (a, f (a)) is a relative minimum.
Global maximum and relative maximum are defined similarly.
A function for which f (−x) = f (x) will have a graph which is symmetric about the y-axis.
Such a function is called an even function. A function for which f (−x) = −f (x) will have a
graph which is symmetric about the origin. Such a function is called odd.
An intersection of the graph of a function with a coordinate axis is called an intercept. A
function may have many x-intercepts, but has at most one y-intercept.
Given a function as a rule or formula we can use dom f , ran f , its end behavior, any
asymptotes it has, any symmetry it has, and its x-intercepts and y-intercept to help sketch
the graph of f . Conversely, given the graph of f we can describe each of the characteristics
above.
As the example in the introduction points out, we also need to be able to determine
how functions interact. To begin with, given real functions f : A −→ IR and g : C −→ IR
their sum is f + g : A ∩ C −→ IR defined by (f + g)(x) = f (x) + g(x). The difference, and
g
g
product are defined similarly. The ratio of g by f is denoted by . The domain of consists
f
f
2
Section 1: Functions
1
.
f
Finally, the composition of g by f is denoted by (f ◦ g)(x) = f (g(x)). The domain of f ◦ g is
of all x ∈ A ∩ C so that f (x) 6= 0. A special type of ratio is the reciprocal, denoted
{x ∈ dom g|g(x) ∈ dom f }.
An important case of the above operations occurs when y = g(x) = c, and c is a positive
real number. In this case dom g = IR and if f : A −→ IR, then A∩ dom g = A = dom f . Now
the function f1 (x) = (f +g)(x) has the same domain as f and its graph is {(x, f (x)+c)|x ∈ A}.
Which is to say that we may graph f1 by translating every point of the graph of f up c units
vertically. Similarly the graph of f2 = f − g can be made by translating every point of the
graph of f down c units. The graph of f3 = (f g) can be formed by scaling every second
coordinate of a point on the graph of f by a factor of c. Similarly the ratio of f by g can
be graphed by scaling all second coordinates of points on the graph of f by a factor of 1/c.
When g(x) = −1, the product f g may be graphed as the reflection of the graph of f across the
x-axis. More generally if g(x) = d 6= 0 we can graph the function f g by reflecting the graph
of f across the x-axis if necessary, and then scaling second coordinates by a factor of |d|.
Another special case occurs when g(x) = dx, d 6= 0, or g(x) = x − c, where c > 0. In the
second case when we compose g by f we have
dom (f ◦ g) = {x ∈ dom g|g(x) ∈ dom f } = {x|x − c ∈ dom f } = {x + c|x ∈ dom f }.
So the graph of this composition consists of {(x + c, f (x))|x ∈ dom f }. Which means that
to graph the composition, we simply translate the graph of f to the right c units. Similarly if
g(x) = x + c, where c > 0, then the graph of f ◦ g would be the graph of f shifted left c units.
In the first case when d > 0 the graph of f ◦ g is {( xd , f (x))|x ∈ A}. So this composition can
be graphed by compressing the scale of the x-axis by a factor of d. Notice that if 0 < d < 1,
this compression turns out to be expansion by a factor of 1/d. When g(x) = −1, the graph of
f ◦ g is the reflection of the graph of f across the y-axis. In general we reflect across the y-axis
if d < 0 and compress the scale of the x-axis by a factor of |d|.
Of very great interest is the possibility that g = f −1 . To begin with we must have
that f −1 is a function, and not simply a relation. This means that the graph of f −1 must
pass the vertical line test. Since the graph of f −1 is the graph of f reflected across the
line x = y we have that the graph of f must satisfy the horizontal line test. That is, every
horizontal line y = b intersects the graph of f at most once. In this case we say that f is a
3
Section 1: Functions
one-to-one function. Now for every point (a, b) on the graph of f , so that b = f (a), we have
a = f −1 (b). Consequently (f −1 ◦ f )(a) = f −1 (f (a)) = f −1 (b) = a. Also for b ∈ dom f −1 ,
(f ◦ f −1 )(b) = f (f −1 (b)) = f (a) = b.
One last characteristic of a function we will investigate is its difference quotient, which
f (x + h) − f (x)
is found by forming the expression
, where h is an infinitessimally small, but
h
nonzero, real number. The denominator is actually (x + h) − x, so the difference quotient is
the slope of the line through the points (x, f (x)), and (x + h, f (x + h)). This slope describes
how y = f (x) changes when x changes by h units.
To illustrate the ideas we have developed let us consider a function of the form y = f (x) =
xn , where n is a non-negative whole number. We call such a function a natural power function.
Henceforth we will dispense with the case n = 0, since we get f (x) = 1. So in the sequel unless
otherwise specified, natural power function will denote a power function f (x) = xn with n > 0.
Notice that xn = x · x · . . . · x, with n factors makes sense for any x ∈ IR. We can therefore
take dom f to be any subset of IR. Most often we will take dom f = IR.
Next, if n = 2m, where m is a positive whole number, then f (x) = xn = x2m = (x2 )m ≥ 0,
since x2 ≥ 0. Thus a function of this form has ran f ⊂ [0, ∞). Indeed graphing a few of these
leads us to conclude that ran f = [0, ∞). On the other hand when n = 2k + 1, where k is
a non-negative whole number, this is no longer true. In fact as x −→ ∞, xn −→ ∞ whether
n is even or odd. But as x −→ −∞, x2m −→ ∞ while x2k+1 −→ −∞. So when n is odd,
ran f = IR. Moreover we see that the end behavior of a natural power function is determined
by whether n is odd or even. You might also have guessed that f (x) = x2m is even while
f (x) = x2k+1 is odd.
Every natural power function passes through (0, 0). In fact if xn = 0 we deduce that
x = 0. So each natural power function f (x) = xn has exactly one x-intercept which coincides
with its y-intercept.
By definition, the reciprocal, g(x), of a natural power function f (x) = xn has domain
1
1
= n , which we also write as x−n . When n is even
IR − {0}. A formula for g is g(x) =
f (x)
x
ran g = (0, ∞). When n is odd, ran g = IR − {0}. As x −→ ∞, xn −→ ∞, when n > 0. The
reciprocals of arbitrarily large positive numbers are infinitessimally small positive numbers.
So as x −→ ∞, g(x) −→ 0. Likewise, the reciprocals of arbitrarily large negative numbers
are infinitessimally small negative numbers. Thus as x −→ −∞, g(x) −→ 0. So g(x) has the
4
Section 1: Functions
x-axis y = 0 as a horizontal asymptote. When n is even, g(x) gets arbitrarily close to 0 but
is positive so we write g(x) −→ 0+ . When n is odd, the values of g(x) are negative when x is
so as x −→ −∞, g(x) −→ 0− . By reversing our reasoning we realize that x = 0 is a vertical
asymptote of g(x). We now write g(x) −→ ∞, as x −→ 0+ . When n is even we get g(x) −→ ∞
as x −→ 0− . When n is odd we have g(x) −→ −∞ as x −→ 0− .
The functions of the form f (x) = xl where l is a whole number are the integral power
functions. From the preceding paragraphs we see that the integral power functions consist of
the natural power functions and their reciprocals.
Next, when n is odd, any integral power function passes both the vertical and horizontal
√
line tests. Therefore f −1 is a function. We use the formula f −1 (x) = n x. Alternatively we
1
have f −1 (x) = x n , from the multiplicative law of exponents. However when n 6= 0 is even,
f (x) fails the horizontal line test. In this case f −1 is a relation but not a function. In order
to partially correct this problem we restrict the domain of f to a set A for which the graph of
f will pass the horizontal line test. By convention for n even we take A = [0, ∞), for n > 0
and A = (0, ∞), for n < 0. For this restricted function, f −1 is now a function and we use the
same notation as the odd case.
When we repeatedly combine natural power functions (including f (x) = 1) using sums,
multiples, and differences we get the class of polynomial functions. When we compose integral
power functions and inverses of integral power functions we get the rational power functions.
√
For example if we compose f (x) = x3 by g(x) = x1/2 we get h(x) = (x3 )1/2 = x3/2 = ( x)3 .
Lastly, when f is a natural power function the difference quotient for f takes the form
(x + h)n − xn
xn + nxn−1 h + h2 k(x, h) − xn
f (x + h) − f (x)
=
=
= nxn−1 + h(k(x, h))
h
h
h
as long as h 6= 0, where h2 k(x, h) is the rest of the expansion of (x + h)n in terms of x and
h. So as h −→ 0, the difference quotient of f approaches nxn−1 . We can also algebraically
simplify difference quotient formulas for more general power functions.
Exercises:
1. Find a simplified formula for the difference quotient of
1
a) f (x) =
x
√
b) f (x) = x
c) f (x) = x2 − 4x + 6
5
Section 1: Functions
2. Read Section 2 and appendix C.
3. Let f (x) = ax2 + bx + c, with a 6= 0 and a, b, c ∈ IR.
a) Complete the square on x.
b) Use part a) to show that f is a shift and/or scale of y = g(x) = x2 .
c) Use part a) to show that f has either one double zero, two real zeroes, or no real zeroes.
6
Section 2: Polynomial Functions
To help you grasp the importance of this little section we encourage you to borrow a
non-reform calculus text from another student, or the local library. There should be a chapter
titled “Sequences and Series”. In this chapter will be a section titled “Taylor Polynomials”.
This section is not normally covered in calculus until the second semester, so you’ll have to
wait quite a while to appreciate the contents. The gist of the section is that any function can
be approximated relatively well using polynomials. This is enormously important, since polynomials require no division, and completely accurate division is not easy to do on a computer.
Also since polynomials are nothing more than linear combinations of natural power functions
they are quite simple to utilize.
Starting with a polynomial p(x) = an xn + an−1 xn−1 + . . . + a1 x + a0 , where an 6= 0 and
a0 , a1 , . . . , an ∈ IR are constants, it is fairly clear that dom p is any subset of IR. We usually
choose dom p = IR.
Computing the range of an arbitrary polynomial can be more of a problem. Sometimes
we can do it by inspection. The general computation often requires calculus.
Since 1 ≤ x implies 0 < x we can successively multiply both sides of 1 ≤ x by x without
changing the direction of the inequality. We arrive at the string of inequalities 1 ≤ x ≤ x2 ≤
x3 ≤ . . . ≤ xn . The point is that as x gets large, the leading term, an xn of a polynomial will
dominate all other terms. So the end behavior of a polynomial is the same as the end behavior
of its leading term. Thus if n is odd, the range of a polynomial will be IR. If n is even we may
need calculus to specify the range, but at least we know a polynomial’s end behavior. Since
n is an important parameter of a polynomial we give it a name, to wit the degree. Strictly
speaking the degree of the polynomial p(x) = 0 is not defined. Some texts will set the degree
of the zero to be −1, others will define its degree to be negative infinity.
The graph of a polynomial of degree n will “waggle” no more than n − 1 times. Peaks of
waggles are relative (sometimes global) maximums. Valleys of waggles are relative (sometimes
global) minimums. This behavior is another feature of polynomials describable after just a
modicum of calculus. So is the fact that polynomials have no asymptotes.
A polynomial is even if and only if its only nonzero coefficients have even subscripts. A
polynomial is odd if and only if its only nonzero coefficients have odd subscripts.
In between the ends, the behavior of a polynomial is mostly dictated by the fact that its
7
Section 2: Polynomial Functions
graph has no breaks, and that it can be factored into the form (see appendix C)
p(x) = an (x−r1 )e1 (x−r2 )e2 (. . .)(x−rk )ek (x2 +b1 x+c1 )f1 (x2 +b2 x+c2 )f2 . . . (x2 +bj x+cj )fj ,
where n = e1 + e2 + . . . + ek + 2(f1 + f2 + . . . + fj ), none of the quadratic factors crosses
the x-axis (so b2i − 4ci < 0), and the values r1 < r2 < . . . < rk indicate the x-coordinates of
p(x)’s x-intercepts. The values r1 , r2 , . . . , rk are the roots of the polynomial. The y-intercept
is (0, p(0)) = (0, a0 ).
When we let qi (x) = (x − ri )ei , for i = 1, 2, . . . , k, and pi (x) = p(x)/qi (x), we see that
“near x = ri ” p(x) will behave like a shift ri units right (left if ri is negative) of the function
axei , where a = pi (ri ). This phenomenon is called local behavior.
A fairly decent sketch of the graph of a polynomial can be constructed by piecing together
the local behavior (including the end-behavior) so that the end result is a smooth curve with
no breaks (this is using what is known as the Intermediate Value Theorem from calculus).
The reciprocal of a polynomial is a special kind of rational function. Rational functions
are the topic of the next section.
A polynomial will be invertible as a function only if n is odd and the graph has no
waggles. So a polynomial often has only a local inverse. In fact it can be somewhat difficult to
algebraically find a formula for the local inverse. Try for example q(x) = x4 − 4x2 + 2. Some
√
examples are easier, such as p(x) = x5 − 2. Here if y = x5 − 2, y + 2 = x5 . So x = 5 y + 2.
√
Therefore p−1 (x) = 5 x + 2.
The difference quotient of a polynomial is not too hard to compute given that the difference
quotient of a sum is the sum of the difference quotients, and that the difference quotient of
a constant times f is the constant times the difference quotient of f . So for example if
p(x) = 3x2 − 4x + 5 its difference quotient is
[3(x + h)2 − 4(x + h) + 5] − [3x2 − 4x + 5]
p(x + h) − p(x)
=
h
h
2
2
3(x + 2xh + h ) − 4x − 4h + 5 − 3x2 + 4x − 5
=
h
3x2 + 6xh + 3h2 − 4x − 4h + 5 − 3x2 + 4x − 5
=
h
2
6xh + 3h − 4h
= 6x + 3h − 4, as long as h 6= 0
=
h
8
Section 2: Polynomial Functions
Exercises:
1. For each polynomial function sketch a graph of the function by transforming the graph of
an integral power function.
a) f (x) = x3 − 2
b) f (x) = −3x4 + 6
c) f (x) = 2(x − 3)5 − 1
d) f (x) = −(2x − 4)2
2. Use the zeroes and end behavior of each polynomial function to sketch an approximation
of its graph.
a) f (x) = (x + 1)2 (x − 2)2
b) f (x) = x(x + 2)2
c) f (x) = −x(x − 1)(x + 2)
d) f (x) = (x + 1)(x − 2)2
3. A square box without a lid is to be constructed by cutting out four squares of side x from
the corners of a 30in. by 30 in. piece of cardboard. Determine the surface area of the box as
a function S of the variable x. Compute the surface area when x = 4. What is dom S?
9
Section 3: Rational and Algebraic Functions
A rational function is one which is defined as the ratio of polynomials. So if r(x) is a
p(x)
, where p(x) = pn xn + pn−1 xn−1 + . . . + p1 x + p0 and q(x) =
rational function, r(x) =
q(x)
qm xm + qm−1 xm−1 + . . . + q1 x + q0 6= 0 are polynomials of degree n and m respectively.
Write p(x) = pn (x − r1 )e1 (x − r2 )e2 . . . (x − rk )ek (x2 + a1 x + b1 )f1 . . . (x2 + aj x + bj )fj and
q(x) = qm (x − s1 )g1 (x − s2 )g2 . . . (x − sl )gl (x2 + c1 x + d1 )h1 . . . (x2 + ci x + di )hi as per the real
factors theorem. If p and q have a common irreducible quadratic factor, we may cancel it since
it is never zero for real inputs. If p and q have a common zero, a, then r(a) is not defined and
otherwise for x 6= a, r(x) is the same as the ratio of p(x)/(x − a) by q(x)/(x − a), continuing to
cancel linear factors of the form (x − a) until at most one of the numerator and denominator
has x − a as a factor. In the sequel we can therefore focus our attention on rational functions
where the numerator and denominator have no common factor (so r(x) is in lowest terms so
to speak).
If r(x) is a rational function in lowest terms, then r(x) is defined, unless q(x) = 0. So
the domain of r(x) is all real numbers which are not roots of q(x). The range of a rational
function cannot in general be computed without the aid of graphing or calculus.
Supposing that we’ve already canceled common factors and p(x) and q(x) are as in the
pn n−m
x
as |x| −→ ∞. So if n < m r(x) −→ 0 as
first paragraph, r(x) will behave like
qm
x −→ ±∞. Thus the graph has y = 0 as a horizontal asymptote in this case. If n = m
pn
pn
as |x| −→ ∞. So now the graph has the line y =
as horizontal asymptote.
r(x) −→
qm
qm
If n > m, by the division algorithm we can write p(x) = d(x)q(x) + s(x), where the degree
of s(x) is strictly less than the degree of q(x). (s(x) is not zero here since we have already
s(x)
s(x)
canceled common factors.) Now we write r(x) = d(x) +
. Since
−→ 0 as |x| −→ ∞,
q(x)
q(x)
r(x) has the same end behavior as d(x). The graphs of r(x) and d(x) will become arbitrarily
close as |x| −→ ∞. If n = m+1, then d(x) takes the form d1 x+d0 i.e. d(x) is a linear function.
The line y = d1 x + d0 will be an asymptote for the graph of r(x). We call such an asymptote
a slant asymptote to distinguish it from the vertical and horizontal asymptotes. In general a
rational function r will have a vertical asymptote whenever c is a root of its denominator. To
help draw the graph of r(x) we determine the behavior of r(x) as x −→ c+ and as x −→ c− .
Only in special cases can we determine maxima and minima for a rational function.
A rational function is even iff both its numerator and denominator are even, or both its
10
Section 3: Rational and Algebraic Functions
numerator and denominator are odd. A rational function is odd iff the numerator is odd and
the denominator is even or vice versa. A rational function in reduced form has x-intercepts
wherever its numerator is zero. The y-intercept, if it exists, is r(0). If we have compiled the
previous information for a rational function, we can usually present a good sketch of its graph.
x2 − 2x − 3
, we have p(x) = x2 − 2x − 3 = (x − 3)(x + 1) and q(x) =
x3 − 4x
x3 − 4x = x(x − 2)(x + 2). So dom r = (−∞, −2) ∪ (−2, 0) ∪ (0, 2) ∪ (2, ∞). Since the degree
Example 1 If r(x) =
of p is strictly less than the degree of q, r(x) has y = 0 as a horizontal asymptote. The lines
x = −2, x = 0 and x = 2 are vertical asymptotes for the graph of r(x).
As x −→ −2− , p(x) −→ 5 and q(x) −→ 0− so r(x) −→ −∞. As x −→ −2+ , p(x) −→ 5
and q(x) −→ 0+ so r(x) −→ ∞. As x −→ 0− , p(x) −→ −3 and q(x) −→ 0− so r(x) −→ −∞.
Similarly as x −→ 0+ , r(x) −→ ∞. Finally as x −→ 2− , r(x) −→ ∞ and as x −→ 2+ , r(x) −→
∞.
The function r(x) is neither even nor odd. It has x-intecepts when x = 3 and x = −1.
The graph does not intersect the y-axis since r(0) is not defined. To facilitate sketching a
graph of r(x) we also compute values for r(x) near the interesting values of x on the graph.
x
−3
r(x) −4/5
−3/2
6/7
−1/2
−14/15
1
4/3
5/2
−14/45
7/2
6/77
Not only do these function values help us decide how to scale our y-axis, they tell us that there
is basically no hope of showing these points exactly. So the scale on our y-axis needs to be
fine enough to approximate −14/45 as about −1/3, but no finer.
4
2
-4
-2
2
-2
-4
11
4
Section 3: Rational and Algebraic Functions
Rational functions are interesting in that the sum, difference, product, quotient and composition of rational functions are again rational functions. In fact rational functions are linear
combinations of horizontal shifts of integral power functions. The topic of the next section is
the decomposition of a given rational function into a sum of simpler rational functions.
In general the inverse (or partial inverse) of a rational function, is a rational function.
2x + 1
2y + 1
. Then to solve for f −1 algebraically we set x = f (y) =
.
x−1
y−1
This means x(y − 1) = 2y + 1. So xy − x = 2y + 1. If we collect all terms with y’s in them on
Example 2: Let f (x) =
the right hand side and move all other terms to the left hand side we arrive at xy − 2y = x + 1.
x+1
.
So y(x − 2) = x + 1. Therefore y = f −1 (x) =
x−2
Difference quotients of rational functions tend to get pretty messy, which motivates generating more powerful methods in calculus.
Linear combinations of rational power functions form the set of algebraic functions. After
some algebra an algebraic function can be place in the form f (x) = g(x)/h(x). As with rational
functions we will emphasize the analysis of algebraic functions where g and h have no common
factors. The domain of an algebraic function is then {x ∈ dom g ∩ dom h|h(x) 6= 0}. As
before we usually need more power to compute the range of a general algebraic function. End
behavior of an algebraic function is determined by the dominate terms from its numerator
and denominator. An algebraic function f intersects the x-axis only when g(x) is zero, and
x ∈ dom h. Also f will have a vertical asymptote when h(x) = 0, and x ∈ dom g.
√
(x − 1)
Example 3 Let f (x) = √
. So g(x) = x − 1 with dom g = IR, and h(x) = x2 − 4x,
x2 − 4x
√
with dom h = {x ∈ dom p(x)|p(x) ∈ dom s(x)} where p(x) = x2 − 4x and s(x) = x and
dom s = [0, ∞). So x ∈ dom h iff x2 − 4x ≥ 0. Which by means of a sign graph we find to
be equivalent to x ∈ (−∞, 0] ∪ [4, ∞). So dom g ∩ dom h = dom h, and
dom f = {x ∈ dom h|h(x) 6= 0} = (−∞, 0) ∪ (4, ∞).
x
x
As x −→ ±∞, f (x) behaves like √ =
. So f (x) −→ 1 as x −→ ∞, and f (x) −→ −1
2
|x|
x
as x −→ −∞. As x −→ 0− , g(x) −→ −1 and h(x) −→ 0+ so f (x) −→ −∞. As x −→
4+ , g(x) −→ 3 and h(x) −→ 0+ , so f (x) −→ ∞.
It is again true that we cannot determine extreme values without graphing/calculus.
Our function f is neither even, nor odd.
The graph of f neither intersects the x-axis, nor intersects the y-axis. After computing a
12
Section 3: Rational and Algebraic Functions
few function values we sketch the following graph.
6
4
2
-2
2
4
6
-2
-4
You might suppose that as the complexity of a function increases (polynomial to rational
function to algebraic function) so does the computation for finding f −1 algebraically, if it’s
even possible.
As with rational functions, algebraic functions have difference quotients that in general
are pretty nasty. Some are not too bad.
√
√
Example 4: Let f (x) = x + 3. Then f (x + h) = x + h + 3, and f (x + h) − f (x) =
√
√
√
x + h + 3 − x + 3. The trick to proceeding is to multiply by 1 in the form ( x + h + 3 +
√
√
√
x + 3)/( x + h + 3 + x + 3), since the resulting numerator is the difference of squares,
1
√
namely (x + h + 3) − (x + 3) = h. So the difference quotient reduces to √
.
x+h+3+ x+3
Exercises:
x2 − 1
. Find the domain of f . Then determine its end behavior,
x3 + 3x2 − 4x − 12
and all asymptotes to its graph. Next find f ’s intercepts. Use all of this information to sketch
1. Let f (x) =
and label a graph of f .
2. Determine a rational function which has vertical asymptotes at x = −1 and x = 2, a
horizontal asymptote at y = 2, and x-intercepts at 0 and 4.
x2 − 3x
3. Repeat exercise 1 for f (x) =
.
x+4
x−1
, or show it can’t be done.
4. Algebraically compute the inverse function for f (x) = √
x+1
1
.
5. Find a simplified form for the difference quotient of f (x) = 2
x −4
13
Section 3: Rational and Algebraic Functions
6. A rectangular region of pasture land is to be fenced using 2, 400 linear feet of fence. Say
that the length of the region is l and the width of the region is w. Express the relation between
l, w and 2, 400. Then express the area, A, of the region as
a) a function of l only.
b) a function of w only.
14
Section 4: Partial Fraction Expansion
As alluded to in the previous section, every rational function is writable as the sum of
p(x)
simpler rational functions. Given a rational function, r(x) =
, where p(x) and q(x) 6= 0
q(x)
are polynomials of degree n and m respectively, we first reduce to the case that the numerator
and denominator have no common factor (so r(x) is in lowest terms).
If n ≥ m, by the division algorithm we can write p(x) = d(x)q(x) + s(x), where the degree
of s(x) is strictly less than the degree of q(x). (s(x) is not zero here since we have already
s(x)
. So every rational function is
canceled common factors.) Now we write r(x) = d(x) +
q(x)
the sum of a polynomial and a proper rational function which is one where the degree of the
numerator is strictly less than the degree of the denominator.
The partial fraction expansion of a proper rational function with non-zero, non-constant
denominator q(x) = (x − s1 )g1 (x − s2 )g2 . . . (x − sl )gl (x2 + c1 x + d1 )h1 . . . (x2 + ci x + di )hi
factored as per the real factors theorem, is the expression of r as a sum of proper rational
functions whose denominators are powers of irreducible factors of q(x).
A theorem, which requires linear algebra to prove, is that there are k + 2l real constants
which make this expression unique, where k is the total number of real zeroes of q(x) and l is
the total number of irreducible quadratic factors of q(x), both numbers reflecting multiplicities.
If (x − s)e is one of the factors of q(x), then the partial fraction expansion of r will have e
Ai
, as i = 1, 2, . . . , e. For each factor (x2 + bx + c)f with b2 − 4c < 0
terms of the form
(x − s)i
Bj x + Cj
the partial fraction expansion of r will have f terms 2
as j = 1, 2, . . . , f .
(x + bx + c)j
x2 − 2x − 3
, we have p(x) = x2 − 2x − 3 and q(x) = x3 − 4x = x(x −
x3 − 4x
2)(x + 2). So r is a proper rational function whose denominator has three real zeroes all with
Example 1 If r(x) =
multiplicity one. So there are three real constants, call them A, B and C so that
r(x) =
A
B
C
+
+
x
x−2 x+2
To find these constants we clear denominators on both sides of the equation by multiplying
through by r(x)’s denominator q(x). So
hA
B
C i
x2 − 2x − 3 =
x(x − 2)(x + 2)
+
+
x
x−2 x+2
Ax(x − 2)(x + 2) Bx(x − 2)(x + 2) Cx(x − 2)(x + 2)
+
+
=
x
x−2
x+2
=A(x − 2)(x + 2) + Bx(x + 2) + Cx(x − 2)
15
Section 4: Partial Fraction Expansion
We then can equate coefficients to determine A, B and C, or we can substitute in three distinct
values for x to get a system of three linear equations in the three unknowns A, B and C. For
example if we set x = 0 in x2 − 2x − 3 = A(x − 2)(x + 2) + Bx(x + 2) + Cx(x − 2), then on
the left hand side we get −3 and on the right hand side we get A(−2)(2) = −4A. We deduce
that A = 3/4. If we next set x = 2 (is there a pattern here?) then on the left hand side we
get 4 − 4 − 3 = −3 and on the right hand side we get B · 2(4). So B = −3/8. Finally we set
x to be the root of x + 2, which is to say x = −2 and get 4 + 4 − 3 = 5 on the left hand side
and C(−2)(−4) = 8C on the right. So C = 5/8. We can check that when we start with
3/4
3/8
5/8
−
+
x
x−2 x+2
and put everything back over the common denominator x(x − 2)(x + 2), we get r(x).
If instead we equate coefficients we first expand the left hand side to
A(x2 − 4) + B(x2 + 2x) + C(x2 − 2x) = (A + B + C)x2 + (2B − 2C)x − 4A. So we arrive at
the three linear equations A + B + C = 1, 2B − 2C = −2, and −3 = −4A. Which is simply an
equivalent system of equations.
x2
, then there are four real numbers A, B, C and D so
(x − 2)2 (x2 + 3)
A
B
Cx + D
that r(x) =
+
+ 2
. Clearing denominators yields x2 = A(x − 2)(x2 +
2
x − 2 (x − 2)
x +3
3) + B(x2 + 3) + (Cx + D)(x − 2)2 Equating coefficients yields the system of equations
Example 2: Let r(x) =
0 =A + C
1 = − 2A + B − 4C + D
0 =3A + 4C − 4D
0 = − 6A + 3B + 4D
which can be solved by substitution, or elimination.
Equivalently we can set x = 2 to obtain the equation 4 = 7B. which means B = 4/7.
√
√
√
√
When x = 3i, we get (C 3i + D)( 3i − −2)2 = . . . = (D + 12C) + i 3(C − 4D) = −3.
So C − 4D = 0 and D + 12C = −3. Which by substitution gives 49D = −3. So D =
−3/49 and C = 4D = −12/49. Finally setting x = 1 gives 1 = −4A + 4B + (C + D).
Substituting B = 4/7, C = −12/49 and D = −3/49 gives 1 = −4A + 16/7 − 15/49. So
A = (1 − 16/7 + 15/49)/ − 4 = 12/49.
16
Section 4: Partial Fraction Expansion
Partial fraction expansions are useful as a tactical device in integrating rational functions
in the second semester of calculus, they are used in circuit analysis when network outputs are
expressed as products of transfer functions and inputs under the Laplace transform. They
are also used in combinatorics/discrete mathematics/computer science to solve recurrence
relations by the method of generating functions. (In fact the last application is simply the
discrete version of the Laplace transform procedure.)
Exercises: Find the partial fraction expansion of each rational function.
1. f (x) =
2. f (x) =
3. f (x) =
4. f (x) =
5. f (x) =
x2 − 1
x3 + 3x2 − 4x − 12
x+4
x2 − 3x
x−1
(x + 1)2
1
2
x −4
3x4 − 2x2 + 4
(x2 + 1)2 (x − 2)
17
Section 5: Exponential and Logarithmic Functions
If a is a positive real number we can use the laws of exponents, and the rules for rational
p
power functions to compute ap/q , where
∈ Q. It can be proven that for any irrational
q
number s there is a sequence r1 , r2 , . . . , rn , . . . of rational numbers with the property that as
m approaches ∞, rm approaches s. We then define the quantity as as the value which arm
approaches as m approaches ∞. The result is the base a exponential function y = f (x) = ax .
Using a sequence of rational values to compute the value of as , where s is irrational is
tantamount to plotting rational points on the graph and filling in the remainder of the graph
by connecting the dots with a smooth curve. This is actually how we plot these curves and
their shifts/scales.
For an exponential function the value a = 1 is uninteresting. Moreover the function
1
1
y = ax , where 0 < a < 1 takes the form y = ( )x = x , where b > 1 and a = 1/b. So any such
b
b
function can be analyzed as the reciprocal of an exponential function whose base is greater
than one.
Of all the exponential functions with base greater than one, the most special is the
natural exponential function y = ex , where e is an irrational number approximately equal
to 2.718281828. The actual value of e is the value (1 + h)1/h approaches as h approaches ∞.
This particular value is very special because the slope of a line tangent to the curve y = ex
at the point (r, er ) is simply the function value er . In fact this is the only base for which the
base a exponential function has this property.
Every exponential function y = ax , where a > 0, and a 6= 1 is one-to-one with domain
IR and range (0, ∞). If a > 1 as x −→ −∞, ax −→ 0+ , while as x −→ ∞, ax −→ ∞. So
every exponential function with base a > 1 has the y-axis as horizontal asymptote and no
vertical asymptote. The reciprocal of such an exponential function has bx −→ 0 as x −→ ∞,
and bx −→ ∞ as x −→ −∞. So it has opposite end behavior in one sense, and similar end
behavior in another.
Since the exponential functions with non-trivial base are all one-to-one, they have no
relative extrema. In fact their function values either always increase as x increases, or always
decrease as x increases. Therefore none of the basic exponential functions are even or odd.
The basic exponential functions are never zero, so the graphs never intersect the x-axis.
They all have y-intercept (0, 1). They are only distinguished from each other by their slopes.
18
Section 5: Exponential and Logarithmic Functions
The difference quotient of y = f (x) = ax reduces as follows
f (x + h) − f (x)
ax+h − ax
ax ah − ax
ah − 1
=
=
= ax
h
h
h
h
Set Ca to be the value so that (ah − 1)/h −→ Ca as h −→ 0. It is shown in calculus that
Ce = 1, moreover e is the only value for which this is true.
The inverse function of ax is the base a log function, denoted loga x, except when a = e
in which case we denote it ln x and call it the natural log function.
As inverses each pair of functions y = ax and y = loga x satisfy x = aloga x = loga ax .
Especially for each positive number a we have a = eln a . Thus by the multiplicative law of
exponents ax = ex ln a . So the base a exponential function is a horizontal scaling of ex by a
factor of 1/ ln a. So if we understand the natural exponential function, we should be able to
understand any other exponential function.
ln x
. So every logarithm function is a
ln a
vertical scaling of the natural log function. Thus it will suffice to understand the natural log
In a similar fashion we can show that loga x =
function. The natural log function has domain (0, ∞) and range IR. The end behavior and
asymptotes of ln x are found using the fact that its graph is the reflection of the graph of ex
across y = x. So as x −→ 0+ , ln x −→ −∞, and as x −→ ∞, ln x −→ ∞.
The natural log function has no relative extrema nor any symmetry for essentially the same
reason the function ex doesn’t. The natural log function has no y-intercept and x-intercept at
(1, 0). We have already discussed the inverse of the natural log. We choose not to discuss its
reciprocal or difference quotient.
However there is another reason the natural log function is so “natural.” For t > 1, ln t
can be defined as the area bounded by the curve y = 1/x, the x-axis, and the two vertical lines
x = 1 and x = t. This is sometimes shown in the second semester of calculus.
Finally we remark that the laws of exponents translate into the arithmetic properties of
the log functions. For example
Fact: For positive numbers a and b, ln(ab) = ln a + ln b.
Proof: We write ab = eln(ab) on the one hand. We write ab = eln a eln b = eln a+ln b by the
additive law of exponents on the other. Since ab = ab and ex is one-to-one, we are done.
The other arithmetic properties of ln x are proven similarly. They include ln ab = b ln a,
and ln ab = ln a − ln b.
19
Section 5: Exponential and Logarithmic Functions
Exercises:
1. Sketch and label a graph of each function
a) y = f (x) = 4x−2 + 1
1
b) y = f (x) = ( )x − 2
3
c) y = f (x) = 3e−2x
e) y = f (x) = ln(x + 1) − 2
d) y = f (x) = ex − e−x
2. Determine the value of a CD in the amount of $3000 that matures in ten years and pays
6% per year
a) twice a month, and b) once every 2 months.
3. Give the exact value of a) log27 3, and b) log3 27.
4. Use the properties of logarithms to write the expression ln
√
3x2
so that the result does
(x − 2)7
not contain logarithms of products, quotients, or powers.
1
5. Rewrite the expression ln(2x) − 2 ln(x + 1) + ln(x − 2) as a single logarithm.
3
6. Solve the equation ln(x − 2) + ln(2x − 3) = ln 3 for x.
20
Section 6: Exponential Growth and Decay
This section concerns the behavior of functions of the form y = ekx , where k ∈ IR − {0}.
If k < 0, then because ln x has range IR, we know that k = ln b, for some real number b.
From the graph of y = ln x we see that 0 < b < 1. So y = ekx = bx , where 0 < b < 1. Such
an exponential function is the reciprocal of an exponential function of the form y = ax , where
a = 1/b > 1. In a twist of fate if k > 0, then k = ln a for some a since ln x has range IR, and
from the graph we have a > 1.
This yields another explanation of why y = ex is the natural exponential function to work
with: The horizontal scale y = ekx of the natural exponential function y = ex decreases if
k < 0 and increases if k > 0. This feature allows us to easily model certain behaviors. The
two cases we model correspond to k > 0 which we call exponential growth, and k < 0 which
we call exponential decay.
For exponential growth we start with a quantity Q0 of some material (Q0 > 0). If the
amount of material grows exponentially as a function of time we get Q(t) = Q0 ekt , where the
value of k tells us how fast Q is growing. The function Q(t) is completely determined by Q0 and
k. In fact k can be determined by any two functional values Q(c) and Q(d), where c < d. Since
Q0 ekd
Q(d)
=
= ekd−kc . Applying ln to both
Q(c) = Q0 ekc and Q(d) = Q0 ekd we get that
Q(c)
Q0 ekc
Q(d)
ln Q(d) − ln Q(c)
sides gives ln
= ln Q(d) − ln Q(c) = kd − kc = k(d − c). So k =
. We
Q(c)
d−c
Q(c)
can also find Q0 since Q0 = kc . These simple functions can be used to model populations,
e
and principal in monetary accounts drawing interest compounded continuously.
Exponential decay works similarly except we model quantities which are decreasing. Common examples include modeling the quantity of radioactive substances, and Newton’s Law of
Cooling.
In the case of modeling the quantity of some radioactive substance, the constant k is
related to the half-life of the substance. The half-life, h, of a radioactive substance is the
1
amount of time it takes for a quantity measuring Q0 units to decay to Q0 units. This means
2
1
1
kh
kh
that Q0 = Q(h) = Q0 e . Which implies that = e . Applying ln to both sides gives
2
2
1
ln = ln 1 − ln 2 = − ln 2 = kh. So given one of h or k, we can compute the other.
2
In the case of exponential growth the corresponding idea to half-life, is called the doubling
time. In this case if d is the doubling time we get dk = ln 2.
21
Section 6: Exponential Growth and Decay
Exercises:
1. A biologist counts 6, 000 bacteria in a culture at 8:00 am. Eight hours later they count
18, 000 bacteria in the culture. Find an expression for the number of bacteria (in thousands)
as a function of time t (in hours), where t = 0 corresponds to 8:00 am. How long, in hours,
will it take for the population to reach 35, 000?
2. A 5kg mass of pretendium decays to 2.2 kg in 3 years. What is the half-life of pretendium
in years?
3. A bottle of water at 68◦ F is placed in a freezer whose temperature is 26◦ F. The temperature
of the water is measured as 54◦ F an hour later. From the time the bottle was initially placed
in the freezer, how long, in hours, will it take for the water to be frozen?
22
Section 7: Conic Sections
The general quadratic equation in x and y has the form Ax2 +Bxy+Cy 2 +Dx+Ey+F = 0,
where A, B, C, D, E and F are real numbers. One goal of this section is to determine what the
graph of such an equation looks like. We easily see that if all three of A, B and C are zero,
then we really have a linear equation as in appendix B. So we begin by supposing that not all
three of A, B and C are zero.
Next we realize that if there is no cross term (i.e. B=0), we can complete the squares on
x and y to simplify our task. If there is a cross term (B 6= 0), we can change our coordinate
system to one where there is no cross term. This involves rotating axes, a trick that the ancient
Greeks discovered. The details of this trick require some trigonometry. This is contained in
appendix E.
As soon as we know that it suffices to consider quadratic equations with no cross terms
we can begin looking at specific cases.
Subsection 1: Parabolas The first case is an equation of the form Ax2 +Cy 2 +Dx+Ey +F = 0,
where exactly one of A and C is zero.
Up to symmetry of argument we suppose that A 6= 0. Thus we have Ax2 +Dx+Ey+F = 0.
D
We next collect x-terms on one side of the equation to get Ey + F = −A(x2 + x). We
A
complete the square on x and simplify to
D
D2
D2
x+
−
)
A
4A2
4A2
D 2 D2
) +
= −A(x +
2A
4A
Ey + F = −A(x2 +
D2
D
and rewrite Ey + F −
= −A(x − h)2 .
2A
4A
If E = 0 we divide through by −A to get
We set h = −
D2
F
−
2
4A
A
2
D − 4AF
=
4A2
(x − h)2 =
√
D2 − 4AF
D2 − 4AF
Thus x − h = ±
=
±
. If D2 − 4AF = 0 we get the “double” line
4A2
2A
(x−h)2 = 0, which is x = h twice. If D2 −4AF < 0 we get no graph. Otherwise D2 −4AF > 0
r
and we have
√
√
D2 − 4AF
D2 − 4AF
and x = h −
x=h+
2A
2A
23
Section 7: Conic Sections
The graph of which is two distinct vertical lines. These three situations are what we call
degenerate cases.
In the nondegenerate case (E 6= 0) we move on by dividing through by E to write
y+
F
D2
−A
−
=
(x − h)2
E
4AE
E
−A
D2 − 4AF
and write y − k =
(x − h)2 . Finally we set c to be the value for
We set k =
4AE
E
1
1
−A
which
=
and get the standard equation of the parabola y − k = (x − h)2 . We can
4c
E
4c
graph this equation by shifting an appropriate scale of y = x2 . The point (h, k) is called the
vertex. The line x = h is called the axis, short for axis of symmetry. Due to an application in
optics the point (h, k + c) is called the focus. The line y = k − c is called the directrix.
A nondegenerate parabola, P may be defined as the set of points equidistant from the
focus and directrix. After translation we may suppose that the focus has coordinates (0, c)
and the directrix, l, has equation y = −c. So if P = {(x, y)|d((x, y), (0, c)) = d((x, y), l)}, we
p
p
have x2 + (y − c)2 = |y − (−c)| = (y + c)2 . So x2 + y 2 − 2cy + c2 = y 2 + 2cy + c2 . Which
becomes x2 = 4cy.
A parabola is also a conic section, the result of intersecting a plane with a double-napped
cone. Here we suppose that the plane intersects only one of the two nappes. Moreover the
plane is parallel to a generator of the cone.
The parabolas we derived above are functions. The graphs open up and down. If we had
worked on the case C 6= 0 instead we would have arrived eventually at the degenerate cases of
a double horizontal line, no graph, and two horizontal lines. We would have then arrived at the
1
the standard equation x − p = (y − q)2 . This is a relation whose graph is a nondegenerate
4c
parabola which opens left or right. Such a parabola has vertex at (p, q), focus at (p + c, q),
axis with equation y = q and x = p − c as directrix.
Subsection 2: Ellipses The second special case of Ax2 + Cy 2 + Dx + Ey + F = 0 to consider
is that neither of A and C is zero and that they are both positive or both negative. If both
are negative we have
−Ax2 − Cy 2 − Dx − Ey − F = 0,
with both −A and −C positive so let us suppose that A, C > 0.
24
Section 7: Conic Sections
We first complete the squares on x and y to write
A(x2 +
D
D2
E
E2
D2
E2
2
x+
)
+
C(y
+
y
+
)
+
(F
−
−
)=0
A
4A2
C
4C 2
4A 4C
D
E
D2
E2
,k=−
, and R = −F +
+
. Thus A(x − h)2 + C(y − k)2 = R.
2A
2C
4A 4C
Since A and C are strictly positive and the quantities (x−h)2 and (y −k)2 are nonnegative
We set h = −
the left hand side of the previous equation is nonnegative. So if R is negative we have a relation
with an empty graph. Also if R = 0 we must have that both (x − h)2 = 0 and (y − k)2 = 0.
This means that the graph of the relation consists of the single point (h, k). If R > 0 and
R
A = C then our equation becomes (x − h)2 + (y − k)2 = . This is the equation of a circle,
A
R
centered at (h, k) with radius squared . These are the degenerate cases.
A
In the nondegenerate case we have R > 0 and A 6= C. Up to symmetry of argument we
can suppose that A < C. Now we let a and b be the real positive numbers with
1
1
A
C
= , and 2 =
2
a
R
b
R
Our choice of A < C forces a > b, which is an important convention. Our equation now reads
(x − h)2
(y − k)2
+
=1
a2
b2
This is the standard form of the equation. Its graph is called an ellipse.
Similar to a parabola an ellipse can be defined as the set of points with sum of distances to
two foci (= focal points) a fixed constant. To verify this, again suppose that (h, k) = (0, 0) and
that the foci are at (±c, 0). Then if E = {(x, y) ∈ IR2 |d((x, y), (c, 0))+d((x, y), (−c, 0)) = k} we
p
p
have (x − c)2 + y 2 + (x + c)2 + y 2 = k. Notice that k > 2c which is the distance between
the foci. Also the points on the x-axis with coordinates (±a, 0) are on E iff a−c+a+c = 2a = k.
p
p
So now we can write (x − c)2 + y 2 = 2a − (x + c)2 + y 2 . Which by squaring both sides
p
gives (x − c)2 + y 2 = 4a2 − 4a (x + c)2 + y 2 + (x + c)2 + y 2 . We cancel the y 2 terms and
p
expand the terms (x ± c)2 to get x2 − 2xc + c2 = 4a2 − 4a (x + c)2 + y 2 + x2 + 2xc + c2 . Which
p
p
reduces to 0 = 4a2 − 4a (x + c)2 + y 2 + 4xc. We rewrite this as a (x + c)2 + y 2 = a2 + xc.
Squaring both sides gives a2 (x + c)2 + a2 y 2 = a4 + 2a2 xc + x2 c2 . Expanding the term (x + c)2
gives a2 x2 + 2a2 xc + a2 c2 + a2 y 2 = a4 + 2a2 xc + x2 c2 . Collecting x terms and cancelling the
2a2 xc terms leaves (a2 − c2 )x2 + a2 y 2 = a4 − a2 c2 = a2 (a2 − c2 ). Here we use the fact that
a > c > 0 and write b2 = a2 − c2 . Now b2 x2 + a2 y 2 = a2 b2 , which reduces to x2 /a2 + y 2 /b2 = 1.
25
Section 7: Conic Sections
In general the graph of an ellipse is an elongated circle. The graph has two axes of
symmetry = lines about which the graph is symmetric. These have equations x = h and y = k
respectively. The line y = k joins the two points (h − a, k) and (h + a, k) on the graph. These
are at distance 2a from each other. The other axis joins the two points (h, k − b) and (h, k + b)
which are at distance 2b from each other. So in the nondegenerate case we call y = k the
major axis, and x = h the minor axis.
If C < A our convention puts our standard equation in the form
(x − h)2
(y − k)2
+
=1
b2
a2
1
C
1
A
=
and 2 = . Now x = h is the major axis and y = k is the minor axis. The
2
a
R
b
R
foci are at (h, k ± c), where again a2 = b2 + c2 .
where
In either case the constant c is less than a. The ratio of c to a is a measure of how
noncircular the ellipse is. This measure is the eccentricity of the ellipse.
Finally we can realize an ellipse as a conic section where the plane intersects one nappe
and is not parallel to a generator of the cone.
Subsection 3: Hyperbolas The last case to consider is Ax2 + Cy 2 + Dx + Ey + F = 0 where one
of A and C is positive and the other is negative. We will start by supposing that we have an
equation of the form Ax2 −Cy 2 +Dx−Ey +F = 0, where A, C > 0. As in the previous section
we complete squares on x and y and simplify our equation by renaming certain quantities. We
arrive at A(x − h)2 − C(y − k)2 = T .
If T = 0 we get A(x − h)2 − C(y − k)2 = 0. We solve this for y in terms of x to get
C
C
y = k + (x − h) or y = k − (x − h). The graph is the union of the two intersecting lines.
A
A
This is the degenerate case.
1
A
1
C
If T > 0 we set a and b to be the positive constants so that 2 =
and 2 =
and get
a
T
b
T
the standard equation
(y − k)2
(x − h)2
−
=1
a2
b2
The line y = k serves as an axis of symmetry. The lines y = k ± a/b(x − h) serve as slant
asymptotes. The foci have coordinates (h ± c, k), where c2 = a2 + b2 .
1
C
1
A
If T < 0 we set a and b to be the positive constants so that 2 =
and 2 =
. The
a
−T
b
−T
standard equation now reads
(x − h)2
(y − k)2
−
=1
a2
b2
26
Section 7: Conic Sections
The graph of the equation is symmetric about x = h and asymptotically approaches the lines
y = k ± b/a(x − h). The foci have coordinates (h, k ± c), where c2 = a2 + b2 .
These are both examples of a hyperbola. A hyperbola is also defined as the set of points so
that the magnitude of the difference of distances to two foci is a fixed constant. For hyperbolas
the ratio of c to a is strictly larger than 1. This ratio is the eccentricity of the hyperbola. The
derivation of the equation of the hyperbola is left to the interested reader.
A hyperbola is also a conic section formed as the intersection of a double napped cone
with a plane which cuts both nappes.
Note on Conic Sections: The intent of this note is to describe the nomenclature used in the
section. To make a right circular cone we use two coplanar lines. We call one, a, the axis
and the other, l, the generator. We do not consider the case that a and l are perdendicular
nor the case that a = l. A right circular cone, C,is generated when l is revolved about a in
a circular fashion. If a and l are parallel, the result is degenerate. We call C a right circular
cylinder. Otherwise a and l intersect in a unique point f called the fulcrum. The result is a
double-napped cone. A conic section is the intersection of a plane P and a right circular cone
C.
If C is degenerate, one possibility is that the intersection of P and C is empty. (This is
either the empty parabola or the empty ellipse.) Another possibility is that P is tangent to
C, in which case the result is the “double” line parabola. When P is not tangent to C but
contains two generators for C we get the degenerate parabola consisting of two parallel lines.
If P is perdendicular to a the result is a circle. Otherwise P ∩ C is a nondegenerate ellipse.
If C is nondegenerate we also get a degenerate section if P contains the fulcrum f . If such
a plane misses both nappes we get a “point” ellipse. When P contains exactly one generator
we get the “double” line parabola. Finally, if P contains two generators (and a), we get the
degenerate hyperbola consisting of two intersecting lines.
If C is nondegenerate and P does not contain f , the only degenerate section occurs when
P is perpendicular to a. The result is a circle.
The last three cases occur when P is not perpendicular to a, does not contain f and C is
nondegenerate. These are the special conic sections which tend to have applications involving
their reflective properties.
We now see that every quadratic equation in x and y must represent a conic section and
27
Section 7: Conic Sections
vice-versa. Hence y = 1/x when written xy − 1 = 0 is recognized as a hyperbola with major
axis the line y = x. A rotation of axes by 45◦ will clear the cross term. So in retrospect we’ve
been discussing conic sections since day one.
Exercises:
1. Find the standard form of the equation for the parabola x2 + 4x − 4y + 8 = 0.
2. Sketch and label a graph of each parabola clearly indicating the vertex, focus, directrix and
axis.
1
(x + 1)2
2
1
b) y − 3 = (x + 1)2
4
1
c) y − 3 = (x + 1)2
8
3. Find the standard equation of a parabola with focus (−1, 2) and directrix x = 1.
a) y − 3 =
4. Sketch and label the graph of the parabola with equation 4x = (y − 1)2 clearly indicating
the vertex, focus, directrix and axis.
5. Sketch and label a graph of each conic section clearly indicating axes of symmetry, foci,
vertices and asymptotes if any.
a) 9x2 + 25y 2 + 36x − 50y − 164 = 0
b) 9x2 − 16y 2 + 36x + 32y − 124 = 0
c) −9x2 + 16y 2 − 36x − 32y − 164 = 0
6. Find the standard equation of an ellipse with foci at (−1, 14) and (−1, −10) which also has
one vertex at (−1, 15).
28
Section 8: Basic Trigonometric Functions
Our goal in this section is to define and develop the six basic trigonometric functions. We
start with two of the more important transcendental functions, namely the sine function and
the cosine function. These are universally denoted by y = sin x and y = cos x respectively.
In order to define sin x and cos x, we first need to agree on how we are going to measure
angles. An angle, θ, consists of two rays emanating from a common point called the vertex.
One ray is called the initial side, the other is the terminal side. Because we can choose where
to put the origin in the Cartesian plane and we can decide where the positive x-axis goes, we
will assume that the vertex of our angle is at the origin and that the initial side is along the
positive x-axis. This is called putting the angle in standard position. Finally, since only the
direction of the rays matters, we can chop off both rays at length one. Our angle theta is now
determined by the two points at the end of these line segments of length 1, both of which lie
on the unit circle whose equation is x2 + y 2 = 1. The point on the initial side is simply (1, 0).
Here we will denote the point on the terminal side by P (θ).
All we need to determine how to measure θ is to know how we generated the angle. That
is we need to know how we go from (1, 0) to P (θ). If we generate θ by moving from (1, 0)
counterclockwise (CCW), we say θ has positive measure. If we generate θ by moving from
(1, 0) clockwise (CW), we say that θ has negative measure. The radian measure of θ is the
length of the arc on the unit circle we travel from (1, 0) to P (θ) when generating θ. We use a
minus sign in front when θ is generated CW.
Since the unit circle has circumference 2π · 1 = 2π, once around the circle CCW is 2π
radians. Halfway around the circle CCW is therefore π radians. One fourth the way around
π
2π
= radians. So we see that P (θ) = P (ϕ) any time that θ and ϕ have
the circle CCW is
4
2
measures which differ by a multiple of 2π radians. So we write P (θ) = P (θ + 2kπ) for all
k ∈ ZZ. Two such angles are called coterminal.
We call the x-coordinate of P (θ) the cosine of θ and denote it by cos θ. The y-coordinate
of P (θ) is the sine of θ and is denoted by sin θ. It is fairly clear that these are functions of
θ where θ, as a length, can be any real number. Therefore both the sine function and cosine
function have domain IR, unless restricted for some reason.
Since all the points on the unit circle have coordinates bounded between 1 and −1, we
have that −1 ≤ sin θ ≤ 1 and −1 ≤ cos θ ≤ 1. By the horizontal line test every one of these
29
Section 8: Basic Trigonometric Functions
values is taken on, so the sine function and the cosine function both have range [−1, 1]. These
serve as our first examples of functions with bounded range. Meaning there is a number M
with f (x) ≤ M , for all x ∈ dom f .
Since P (θ) = P (θ + 2kπ) for all k ∈ ZZ, we have sin(θ) = sin(θ + 2kπ), and cos(θ) =
cos(θ + 2kπ) for all k ∈ ZZ. A function which repeats its values regularly is called periodic.
The period of a periodic function is the length of the smallest continuous input interval between
repetitions. Here we see that sin x and cos x have period 2π. Thus we would not be able to
describe the behavior of sin x as x −→ ∞ except to say that the values oscillate.
Neither the sine function nor the cosine function have any asymptote.
The cosine has maximum value 1 and attains this value at θ = 0 + 2kπ, where k ∈ ZZ. The
cosine has minimum value −1 and attains this at θ = π + 2lπ = (2l + 1)π, where l ∈ ZZ. The
sine function attains maximum value of 1 when θ = π/2 + 2kπ = (4k + 1)π/2, where k ∈ ZZ.
The minimum value of the sine function is −1 and occurs when θ = 3π/2 + 2lπ = (4l + 3)π/2,
where l ∈ ZZ.
For a given point P (θ) on the unit circle there are several other points which are natural
to consider. The first is the antipodal point on the opposite side of the unit circle. This
point has coordinates (− cos θ, − sin θ). It is also P (θ + π). We thus derive the relationship
π
sin(θ + π) = − sin θ. Another point to consider is radians CW from P (θ). It has coordinates
2
(sin θ, − cos θ) and is P (θ − π/2). Thus cos(θ − π/2) = sin θ, which means that the graph of
the sine function is the graph of the cosine function shifted right π/2 units. Equivalently the
graph of the cosine function is the graph of the sine function shifted left π/2 units and we have
that cos θ = sin(θ + π/2).
In short to determine the graphs of these functions, it suffices to graph the sine function
for 0 ≤ θ ≤ π and apply all the relations we found above. This is made even easier when we
use the fact that the unit circle is symmetric about the y-axis, giving us sin θ = sin(π − θ). So
we only need to determine values for 0 ≤ θ ≤ π/2.
Before we do so we remark that the x-intercepts of the sine function have x-coordinates
nπ, where n ∈ ZZ. The x-coordinates of the intercepts of the cosine function are (2m + 1)π/2,
where m ∈ ZZ.
Now sin 0 = 0, and sin π/2 = 1. Also as θ increases from 0 to π/2, the value of the sine
√
function will increase. In the next section it is derived that sin π/6 = 1/2, sin π/4 = 2/2,
30
Section 8: Basic Trigonometric Functions
and sin π/3 =
√
3/2. With just these five values (which you should memorize), we can sketch
a remarkably good graph of z = f (θ) = sin θ. We can also therefore graph its shift left by π/2
units, z = cos θ.
The reciprocal of the sine function is the cosecant function, and is denoted csc x. It has a
vertical asymptote when sin x = 0. It has a relative minimum when sin x = 1, and a relative
maximum when sin x = −1. It is also periodic with period 2π.
The reciprocal of the cosine function is the secant function, and is denoted sec x. It has
period 2π, and is a shift left π/2 units of the cosecant function.
The ratio of the sine function to the cosine function is the tangent function, denoted
tan x = sin x/ cos x. This function is zero when sin x = 0, i.e. x = nπ, where n ∈ ZZ. It has a
vertical asymptote whenever cos x = 0, which is when x = (2m + 1)π/2, where m ∈ ZZ. Since
−/− is +, the tangent function has period π. As the ratio of an odd function and an even
function the tangent function is odd. Finally as x −→ (π/2)− , sin x −→ 1, and cos x −→ 0+ ,
so tan x −→ ∞. On the other hand as x −→ (π/2)+ , tan x −→ −∞. The reciprocal of the
tangent function is the cotangent function, denoted cot x. Collectively these make up the six
basic trigonometric functions. We normally concentrate on sin x, cos x, tan x, and sec x.
We will discuss inverse trigonometric functions, and the difference quotients of trigonometric functions in later sections.
Several last points need to be made here. First of all, for any θ ∈ IR, there is a value r ∈ IR,
called the reference angle, so that 0 ≤ r ≤ π/2, with | sin(θ)| = sin r and cos r = | cos(θ)|. For
r 6= 0, π/2, there is therefore a unique right triangle of hypotenuse 1 which has base cos r and
height sin r. This is the reference triangle for θ. We investigate this connection with right
triangles further in a later section. Second, since the point P (θ) lies on the unit circle we have
that (sin θ)2 + (cos θ)2 = 1, for all θ ∈ IR. Since this follows from the Pythagorean Theorem,
this is called the Pythagorean Identity. This is one of the more important of hundreds of
trigonometric identities.
Exercise: Read sections 9 and 10.
31
Section 9: Special Values of Trigonometric Functions
A crucial theorem in trigonometry is Euclid’s Theorem on similar triangles. The triangle
with vertices at A, B and C is denoted by 4ABC. Two triangles 4ABC and 4DEF are
similar provided that their interior angles have the same measure. So the angle with vertex
at A has the same measure as the angle with vertex at D, etc. We state the theorem without
proof.
Theorem: (Euclid) 4ABC with side lengths a = |AB|, b = |BC|, and c = |AC| is similar to
b
c
a
4DEF with side lengths d = |DE|, e = |EF | and f = |DF | if and only if = = .
d
e
f
This theorem allows us to define the trigonometric functions using P (θ). First we draw
a line which touches the unit circle only at P (θ). Such a line is said to be tangent to the
curve at the point P (θ). (You might be asking how this can be done.) Except when the line
is horizontal or vertical (i.e. θ is a multiple of π/2) it has both an x-axis intercept D, and a
y-intercept E. Call the origin O, and put B = (cos θ, 0). Then 4OP B is similar to 4ODP .
|OP |
|OD|
1
|OD|
=
. That is,
=
. So since sec θ and cos θ have the same sign
Therefore
|OP |
|OB|
1
| cos θ|
the coordinates of D are (sec θ, 0). Similarly the coordinates of E are (0, csc θ).
A vertical line is tangent to the circle only when θ = kπ. For these values of θ the sine
function is zero. Hence the cosecant is undefined. Notice though that the coordinates of D
are still (sec θ, 0). The anagolous result holds true for horizontal tangents to the unit circle.
|P D|
|P B|
|P D|
| sin θ|
To define the tangent we note that
=
. Which is to say
=
. So
|OP |
|OB|
1
| cos θ|
the length from P to D measures the absolute value of the tangent of θ. In fact we see that
the tangent line to the curve at P (θ) is perpendicular to the line, l, through O and P , so the
slope of the tangent line is the negative reciprocal of the slope of l. But the slope of l is simply
sin θ
(rise over run)
. So the value of tan θ tells us how to draw the tangent line to the unit
cos θ
circle at P (θ)! The point is that the definitions we made for trigonometric functions in section
8 did not fall out of a hat, they have geometric significance.
Another application of Euclid’s Theorem is that given any right triangle with an included
angle θ, where 0 < θ < π/2, we can define the six trigonometric functions as ratios of the
lengths of the sides of the triangle. To do this draw the given triangle with the right angle in
the lower right hand corner and θ in the lower left corner. Let the length of the base be a,
the height be b, and the length of the hypotenuse be c. The other triangle is drawn with θ in
32
Section 9: Special Values of Trigonometric Functions
standard position in IR2 with vertices (0, 0), (cos θ, 0), and (cos θ, sin θ). This triangle has base
of length cos θ, height sin θ, and hypotenuse of length 1. By Euclid’s Theorem the ratios of
corresponding side lengths are equal so we have
1
sin θ
cos θ
=
=
c
b
a
So given two sides, for example a and c, we have an equation relating the two sides which can
rewritten as an equation describing their ratio. In this case 1/c = cos θ/a implies a/c = cos θ.
Equivalently c/a = 1/ cos θ = sec θ. The other formulas are b/c = sin θ, which is equivalent to
c/b = csc θ, and b/a = tan θ, which is equivalent to a/b = cot θ.
The famous mnemonic here is “soh-cah-toa” which is an acronym for “sine is opposite
over hypotenuse–cosine is adjacent over hypotenuse– tangent is opposite over adjacent.”
The sum of the measures of the angles of a triangle is 180◦ , which is π radians. So for a
right triangle with included angle θ the other nonright angle ϕ must have measure (π/2) − θ
radians. We call the two angles θ and ϕ complementary. Notice that the value of f (ϕ) = cof (θ)
for any of the six trigonometric functions f , when defined as ratios of lengths of sides. Since
the complement of the complement of θ is simply θ we get the mnemonic device “co-co=no-co”.
The fact that a triangle has three sides explains why there are three pairs of complementary
trigonometric functions.
Finally, to demonstrate the utility of these concepts we derive some special values of the
basic trigonometric functions.
√
First take an isosceles right triangle with side s. The hypotenuse has length s2 + s2 =
√
√
2s2 = s 2. Also the included acute angles must be equal and have measure π/4. Therefore
√
√
√
√
cos π/4 = s/s 2 = 1/ 2 = 2/2 = sin π/4. Also tan π/4 = 1 = cot π/4, and sec π/4 = 2 =
csc π/4. Moreover any value of θ with reference angle π/4 has |f (θ)| = f (π/4), where f is any
one of the six trigonometric functions. So to determine the actual value of f (θ) we simply need
to know whether its positive or negative, which can be done since we know what quadrant the
point P (θ) lies in, so we know the sign of its coordinates sin θ and cos θ. This is sufficient since
all of the other basic trigonometric functions are defined in terms of sine and cosine.
Second take an equilateral triangle of side s, drawn with horizontal base. The interior
angles are all π/3. If we drop a perpendicular from the top vertex to the base, we split the
triangle into two right triangles. The one on the left has base s/2 and hypotenuse s. By the
33
Section 9: Special Values of Trigonometric Functions
√
Pythagorean theorem the height is s 3/2. The complementary angle has measure π/6, so we
√
have cos π/6 = sin π/3 = 1/2, sin π/6 = cos π/3 = 3/2. The values of the other four basic
trigonometric functions are determined as before.
Identical to the first case, we can now find exact values of trigonometric functions for any
angle which has reference angle π/6, or reference angle π/3.
Example 1: Let’s find the exact value of some trigonometric functions at 4π/3. The reference
angle is π/3 and P (4π/3) is in the third quadrant where both x and y coordinates are negative.
√
Therefore cos 4π/3 = − cos π/3 = −1/2, and sin 4π/3 = − sin π/3 = − 3/2. Then for example
√
tan 4π/3 = 3, since the ratio of two negative numbers is positive.
Example 2: If we have a right triangle with base of length 12 and height 5, the Pythagorean
theorem gives 13 as the length of the hypotenuse. In this case if θ denotes the angle adjacent to
the base and hypotenuse, sin θ = 5/13, cos θ = 12/13, tan θ = 5/12, cot θ = 12/5, sec θ = 13/12
and csc θ = 13/5. Moreover we know how to find lots points on the unit circle which have a
similar reference triangle.
Example 3: If we have an isosceles right triangle of side 4, then if h denotes the length of the
√
√
hypotenuse we have 4/h = sin(π/4) = 2/2. Here we write 8 = h 2 by cross multiplying and
√
√
can solve for h = 8/ 2 = 4 2.
It seems reasonable that we should be able to perform exercise 3 for any angle, θ, where
0 < θ < π/2. And of course it is possible, if we have inverse trigonometric functions. As with
even power functions, due to their periodic nature trigonometric functions are not one-to-one.
Therefore if we wish to invert them we must restrict their domain to a subset where they are
one-to-one. In order to complete exercises like example 3 we want this restricted domain to
include (0, π/2) and to be of most value it would be as large as possible.
For y = sin x we use the convention that its domain is [−π/2, π/2] when discussing its
inverse function denoted arcsin x. Formally the function is defined by the equations y = arcsin x
iff x = sin y. Which means that we can draw a reference triangle with enclosed angle |y|,
√
opposite side of length |x| and hypotenuse of length 1. The remaining side has length 1 − x2
by the Pythagorean theorem. This construct allows us to compute tan arcsin x = tan y =
√
x/ 1 − x2 .
When we start instead with y = cos x we use the convention that its domain is [0, π] and
34
Section 9: Special Values of Trigonometric Functions
denote its inverse by arccos x.
The conventional domain for y = tan x as an invertible function is (−π/2, π/2). Now we
have y = arctan x iff x = tan y. So we can draw a reference triangle with enclosed angle |y|,
opposite side of length |x| and adjacent side of length 1. The hypotenuse therefore has length
√
√
x2 + 1, and we can write for example cos arctan x = cos y = 1/ x2 + 1. Notice that here the
cosine function always has positive value which agrees with our convention about the domain
of a composition of functions.
One last picture to consider is the one flowing from considering sec x as an invertible
function with inverse denoted arcsec x. Now we suppose that the domain of the secant function
is limited to [0, π/2) ∪ (π/2, π]. By definition y = arcsec x iff x = sec y. So we can draw a
right triangle with included angle |y|, hypotenuse of length x, adjacent side of length 1, and
√
√
opposite side of length x2 − 1. Now for example csc arcsec x = csc y = x/ x2 − 1.
The restricted domain for cot x as an invertible function is (0, π). For y = csc x to be
considered invertible our convention is to limit its domain to [−π/2, 0) ∪ (0, π/2].
Exercises:
1. Find the exact value of each of the following:
µ ¶
³π´
2π
a) sin
b) tan
3
3
µ
¶
µ ¶
−π
7π
d) csc
e) cos
6
6
µ ¶
3π
g) sin (π)
h) cot
4
j) cos (3π)
k) tan (π)
2. Find the exact value of each of the following:
a) tan arccos (−3/5)
b) sin arccot (1/2)
µ
¶
−5π
c) cos
4
µ
¶
−π
f) sin
3
µ ¶
3π
i) sec
2
³π ´
l) csc
6
c) sec arccsc (2/3)
3. Sketch the picture described in the second paragraph of this section. Label the points O,
P , B, D, and E.
4. Sketch and label a triangle in which:
a) x = sin y
b) x = tan y
35
Section 10: General Sinusoids
A general sinusoid is a function of the form y = k + A sin(B(x − c)) or y = k + A cos(B(x − c)),
where k, A, B and c are real numbers. Since the sine function and cosine function are horizontal
shifts of each other we will only consider the form y = k + A sin(B(x − c)). Such a function is a
shift right by c units (left if c < 0), of a compression horizontally by a factor of 1/B (expanded
if B < 1), of a vertical scale by A units, of a shift up by k units (down if k < 0) of the basic
function y = sin x. For such a function |A| is the amplitude. About the central line y = k
the sine wave will fluctuate up |A| units and down |A| units. All function values are bounded
below by k − |A| and above by k + |A|. The bounds are obtained when sin(B(x − c)) = ±1,
which means that B(x − c) = (2n + 1)π/2, where n is an integer. This is equivalent to
x = c + [(2n + 1)π/2B]. The general sinusoid is periodic with period 2π/B.
π
Example 1: Let’s sketch a graph of y = f (x) = 2 − 3 sin(3x − ). First we put the function
2
π
in standard form y = f (x) = 2 − 3 sin(3(x − )). Then it’s a matter of determining where
6
f takes on maximal and minimal values, since half-way between it will take value 2. The
extreme values happen when 3(x −
π
6)
= (2n + 1)π/2. So x −
π
6
= (2n + 1)π/6. Thus
6x − π = (2n + 1)π = 2nπ + π. So 6x = (2n + 2)π, or x = (n + 1)π/3. We see that f (π/3) = −1
and f (2π/3) = 5. So f (π/2) = 2, and the rest is determined by symmetry. Of course we need
to mark our x-axis off in units of π/6 to be able to clearly indicate where f takes on its extreme
values and its central values. The y-axis needs to be marked so that all values between −1
and 5 can be seen. Elongating the x-axis a little gives a sketch which is smoother – more like
the basic sine wave. We can also shorten the y-scale somewhat to acheive the same effect.
6
4
2
−π
− 2π
3
− π3
π
3
-2
36
2π
3
π
Section 10: General Sinusoids
It is also useful to be able to combine two (or more) general sinusoids arithmetically.
Given two sinusoids y = k1 + A1 sin(B1 (x − c1 )) and y = k2 + A2 sin(B2 (x − c2 )), their sum
is is shift up k1 + k2 units of y = A1 sin(B1 (x − c1 )) + A2 sin(B2 (x − c2 )). This function is
bounded since −(|A1 | + |A2 |) ≤ y ≤ |A1 | + |A2 |. The bounds may or may not be met. To
graph this function we find scales for the x-axis and y-axis that allow both simpler functions
to be graphed simultaneously. Then we add ordinates. Ordinate is a fancy name for the
y-coordinate.
Example 2: Sketch a graph of y = 3 sin x + sin(3x). First of all the outputs are bounded
between −4 and 4. Second the function y = 3 sin x has period 2π, and is simply a vertical
scale of the regular sine function. Third the function sin 3x has period 2π/3. So it is a
horizontal compression of the sine wave by a factor of 3. To show 2 periods of each component
function simultaneously we need an interval of x-values of length 4π. We might as well take
x ∈ [−2π, 2π] since the overall function is odd. We do need to mark off multiples of π/6 on the
x-axis in order to see all zeroes, and extreme values of sin 3x. We can also make two copies of
our axes, one for scratch work, the other for the final product (neatness counts).
4
2
π
−π
−2π
2π
-2
-4
Exercises:
1. For each of the following general sinusoidal functions identify the amplitude A, the vertical
shift k, the period p, and the phase shift c. Then sketch and label a graph of the function
on the given interval. In each instance choose scales for your axes which allow all important
details to be clearly shown. At least two periods of the function should be shown.
a) y = 2 − 3 cos(2πx +
π
)
2
37
Section 10: General Sinusoids
b) y = 1 + 2 sin(
x π
− )
2
2
c) y = −3 + sin(3x)
2. Sketch and label a graph of each of the functions below on the specified interval. In each
instance choose scales for your axes which allow all important details to be clearly shown. At
least two periods of the resulting function should be graphed.
a) y = 3 sin πx + 2 sin(2πx)
b) y = 2 cos(2x) + cos(4x)
38
Section 11: Trigonometric Identities
We’ve already seen many basic identities which trigonometric functions satisfy. For example if f is a basic trigonometric function, and g is its co-function, then f (x) = g(π/2−x), as long
as both sides of the equation make sense. These are called the co-function identities. Another
example are what are called the reciprocal identities, such as csc x = 1/ sin x. A third example is the quotient identity, tan x = sin x/ cos x. Finally, consider the Pythagorean identity
sin2 x + cos2 x = 1.
For each of these identities there is usually an equivalent way of stating the identity. For
example we may rewrite the Pythagorean identity as sin2 x = 1−cos2 x, or as cos2 x = 1−sin2 x.
We can even divide the equation through by cos2 x to obtain tan2 x+1 = sec2 x, after appealing
to algebra, the quotient identity, and a reciprocal identity. This latest version of the identity is
not true for all real x, but is true as long as cos2 x 6= 0, which means x 6= (2n+1)π/2. One game
we might play is to continue generating equivalent identities from old using other identities.
The result is a very impressive list of identities, too many for most people to memorize. So
instead of recommending that people memorize every form of an identity, we recommend that
they remember one form of the identity and develop the ability to re-derive the rest. This is
very much like the difference between giving someone a fish, and teaching them how to fish.
To build this skill a common practice is to use an exercise such as the following.
Example 1: Verify that 1 + cot2 x = csc2 x is an identity.
Solution :
cos2 x
sin2 x
sin2 x + cos2 x
=
sin2 x
1
=
sin2 x
= csc2 x
1 + cot2 x = 1 +
(quotient identity)
(algebra)
(Pythagorean identity)
(reciprocal identity)
Notice in our solution we did not start by affirming the conclusion. That is we did not
suppose that the identity was true. We simply started with the left hand side and continued
to apply identities until we arrived at the right hand side. Not only is this process logically
proper, it is clearly reversible. So when we verify an identity we will use this process. Next
39
Section 11: Trigonometric Identities
realize that every step was justified as either algebra or as the application of an identity. While
this is not mandatory it is suggested. Third, observe that we did not show every single step of
algebra. The real challenge is for each individual to learn to take as many steps as they need
to verify the identity. Some people are more adroit than others combining algebraic steps. So
two valid solutions might have a different number of delineated algebraic steps.
Now to demonstrate the power of this process let’s derive one identity and develop a
variety of other identities from it.
Example 2: cos(θ − φ) = cos θ cos φ + sin θ sin φ, for θ, φ ∈ IR.
Proof: Let θ, φ ∈ IR. On the unit circle the square of the distance from P (θ) to P (φ) is the
same as the square of the distance from P (θ − φ) to P (0). Therefore by the distance formula
we have
(cos θ − cos φ)2 + (sin θ − sin φ)2 = (cos(θ − φ) − 1)2 + (sin(θ − φ) − 0)2
After expansion the left hand side is cos2 θ − 2 cos θ cos φ + cos2 φ + sin2 θ − 2 sin θ sin φ + sin2 φ.
By applying the Pythagorean identity twice we get 2 − 2 cos θ cos φ − 2 sin θ sin φ. Meanwhile
the right hand side becomes cos2 (θ −φ)−2 cos(θ −φ)+1+sin2 (θ −φ). Which after applying the
Pythagorean identity once becomes 2 − 2 cos(θ − φ). Cancelling the 20 s and dividing through
by −2 yields the identity. This is called the difference identity for cosine.
From the difference identity for cosine we can derive the sum identity for cosine,
Theorem: cos(θ + φ) = cos θ cos φ − sin θ sin φ.
Proof: Here we simply write cos(θ + φ) = cos(θ − (−φ))
= cos θ cos(−φ) + sin θ sin(−φ)
= cos θ cos φ − sin θ sin φ
since cos(−φ) = cos φ and sin(−φ) = − sin φ.
Similarly we can prove the sum identity for sine.
Theorem: sin(θ + φ) = sin θ cos φ + sin φ cos θ
40
Section 11: Trigonometric Identities
Proof: We use sin((π/2) − x) = cos x, sin(−z) = − sin z and cos(y − (π/2)) = sin y.
π
sin(θ + φ) = cos((θ + φ) − )
2
π
= cos(θ + (φ − ))
2
π
π
= cos θ cos(φ − ) − sin θ sin(φ − )
2
2
π
= cos θ sin φ + sin θ sin( − φ)
2
= cos θ sin φ + sin θ cos φ
The rest is a matter of rearranging terms.
The difference identity for sine is sin(θ − φ) = sin θ cos φ − sin φ cos θ. We leave the verification of this identity as an exercise.
As interesting corollaries we get the double angle formulas which are
2 tan x
. For example
cos 2x = cos2 x − sin2 x, sin 2x = 2 sin x cos x, and tan 2x =
1 − tan2 x
Theorem: cos 2x = cos2 x − sin2 x
Proof: cos(2x) = cos(x + x) = cos x cos x − sin x sin x = cos2 x − sin2 x.
Continuing, we can produce cos 2x = cos2 x − sin2 x = cos2 x + sin2 x − sin2 x − sin2 x =
1 − 2 sin2 x. This is an equivalent version of the double angle formula for cosine. One can also
show that cos 2x = 2 cos2 x − 1.
Each of the last two versions of the double angle formula for cosine can be rewritten. For
1 − cos 2x
. Which means that the function sin2 x can be analyzed via the
example sin2 x =
2
cosine function using shifting and scaling. Or if we wish we can derive one of the half-angle
formulas.
r
1 − cos x
x
Theorem: sin = ±
2
2
Proof: Set y = x/2, so 2y = x. Then sin2 (x/2) = sin2 y = (1 − cos 2y)/2 = (1 − cos x)/2. The
result follows by taking square roots of both sides, realizing that the answer may be positive
or negative.
As a nice application of these formulas we have the following examples.
Example 3: Find the exact value of sin(5π/12).
41
Section 11: Trigonometric Identities
8π 3π
2π π
5π
= sin(
−
) = sin(
− )
12
12
12
3
4
2π
π
π
2π
= sin
cos − sin cos
3
4
4
3
√ √
√
3 2
21
=
−
2 2
2 2
√
√
6− 2
=
4
7π
Example 4: Find the exact value of cos
.
8
Solution: sin
Solution: Since 7π/8 is in the second
s quadrant
s
√
p
√
1 + cos 7π
1 + 22
2+ 2
7π
4
cos
=−
=−
=−
8
2
2
2
Example 5: If we know that cos θ = 4/5 and 3π/2 ≤ θ ≤ 2π then cos 2θ = 2 cos2 θ − 1 =
2(16/25) − 1 = 7/25. While because θ is in quadrant four the Pythagorean identity implies
sin θ = −3/5. So sin 2θ = 2 sin θ cos θ = −24/25. Therefore tan 2θ = −24/7, and we can also
compute exact values for csc 2θ and cot 2θ.
The previous example can be inverted by giving exact value of cos x, for example, and
quadrantal information about x, to enable using the half-angle formulas to compute exact
values of any of the trigonometric functions at
x
2.
Identities are also important since they may be used to simplify a problem. For example
a3
we put u = a tan θ. We find that a2 + u2 =
if we want to analyze the formula z = 2
(a + u2 )3/2
a2 + a2 tan2 θ = a2 (1 + tan2 θ) = a2 sec2 θ. So (a2 + u2 )3/2 = (a2 sec2 θ)3/2 = a3 sec3 θ,
and z = cos3 θ. That is, trigonometric identities and composition of functions may allow us
to change venue to a friendlier setting. To be sure no one has all trigonometric identities
memorized, and many of them are not initially obvious. However we can for instance go one
step further and write z = (cos 3θ + 3 cos θ)/4 which is a restatement of one of the homework
exercises. This version of z as a function of θ can be visualized using addition of ordinates.
Yet another application of trigonometric identities is in simplifying difference quotients of
sin h
−→ 1,
trigonometric functions. For example by graphing we can observe that as h −→ 0,
h
cos h − 1
while
−→ 0. We then have
h
sin(x + h) − sin x
sin x cos h + sin h cos x − sin x
=
h
h
³ cos h − 1 ´
³ sin h ´
= sin x
+ cos x
h
h
42
Section 11: Trigonometric Identities
So as h −→ 0, the difference quotient of the sine function approaches cos x.
As a final demonstration of the power of these identities you may now read appendix E
on rotation of axes to clear cross terms in general quadratic equations in two variables.
Exercises:
1. Verify the following trigonometric identities.
a) sin(θ − φ) = sin θ cos φ − sin φ cos θ
tan θ + tan φ
b) tan(θ + φ) =
1 − tan θ tan φ
c) sin 2x = 2 sin x cos x
2 tan x
d) tan 2x =
1 − tan2 x
1 + cos 2x
e) cos2 x =
2
3
f) cos 3x = 4 cos x − 3 cos x
r
x
1 + cos x
g) cos = ±
2
2
1 − cos x
x
h) tan =
2
sin x
1
i) sin α cos β = (sin(α − β) + sin(α + β))
2
1
j) cos α cos β = (cos(α − β) + cos(α + β))
2
1
k) sin α sin β = (cos(α − β) − cos(α + β))
2
2. Use trigonometric identities to find the exact value of each of the following.
π
12
7π
b) sec
12
5π
c) csc
8
a) cot
−4
and that θ is in quadrant III. Find the exact values of the
5
other five trigonometric functions at θ.
3
4. Given that θ is in quadrant I and that cos 2θ = − , find the exact values of the six
5
trigonometric functions at θ.
3. Using the fact that sin θ =
5. Read appendix E.
43
Section 12: Applications of Trigonometry
Whenever a section in a math book has applications in the title it means “word exercises.”
So now you know what you will mostly face in this section. The reason that many people have
difficulty with word problems is that their solutions tend to be more involved than the “plug
and chug” exercises otherwise found in text books. In fact the next to last part of a word
problem usually is completing a particular type of “plug and chug” exercise. The difficulty is
arriving at that stage. Here follows a short description of what is supposed to happen.
Step 1: Read the problem carefully, several times. Make sure you understand what is being
asked.
Step 2: Draw a picture or diagram.
Step 3: Label all known quantities including units, such as ft., lbs. etc. Usually word problems
do not contain superfluous information. Thus any information given is vital.
Step 4: Label all unknown quantities with distinct variables. In contrast to step 3, some
unknown quantities may be irrelevant. With practice should come some facility in deciding
which unknown quantities you can omit. Any unknown quantity which you are being asked
to find must be labeled.
Step 5: This is the hard part. Write down as many equations/relations/identities as you can
which relate the known and unknown quantities. Some of these will be irrelevant. As before,
with practice will come some ability to determine which equations are to be used. The hint is
that the equations usually have something to do with the title of the section.
Step 6: After eliminating as many variables as possible, hopefully you have reduced the word
problem to a “plug and chug” exercise.
Step 7: When you have solved the appropriate exercise, you should carefully and neatly
re-write your solution eliminating all irrelevant unknown quantities,labels,equations,relations
and identities. The finished product should look like the examples, possibly minus some of the
prose.
Before we get to the fun we need to develop a few more tools.
First consider a triangle drawn with a horizontal base and one vertex labelled B above it.
Label the left end point of the base A, the right end point C. Label the angle with vertex A,
α, the angle with vertex B, β, and the angle with vertex C, γ. Finally label the length of the
44
Section 12: Applications of Trigonometry
side opposite A, a, the length of the side opposite B, b, and the length of the third side c.
Drop a perpendicular from B to the line the base lies in. Call the height of the perpendicular h, and the intersection of the perpendicular with the base P . If the perpendicular coincides
with the side CB we have a right triangle and c2 = a2 + b2 . Otherwise the perpendicular splits
the triangle into two right triangles AP B and BP C.
The length from A to P is |c cos α|. Therefore the length from P to C is b − c cos α. Also
we have c2 = h2 + c2 cos2 α and a2 = h2 + (b − c cos α)2 . From the first equation we have
h2 = c2 − c2 cos2 α so by substitution
a2 = h2 + (b − c cos α)2
= c2 − c2 cos2 α + b2 − 2bc cos α + c2 cos2 α
= c2 + b2 − 2bc cos α
This is the law of cosines. Since the parts of the triangle can be rearranged we also have
b2 = a2 + c2 − 2ac cos β, and c2 = a2 + b2 − 2ab cos γ.
Another tool arises from this scenario since in the triangles above we have h = a sin γ =
sin γ
sin β
sin α
=
. In fact whatever this common ratio is it must equal
.
c sin α. Therefore
a
c
b
This is the law of sines.
Though this law is simpler, and therefore more easy to use, it is more dangerous to use
too. If we are only given a, c and α, then if a < c sin α, there is no triangle with these
properties. If a = c sin α , then we get a unique right triangle where γ is the right angle. When
a ≥ c(≥ c sin α) we again get only one triangle with these properties. For this triangle γ must
be acute. However when c sin α < a < c, there are two angles γ1 and γ2 , where γ1 + γ2 = π
and γ1 is obtuse while γ2 is acute. So when applying the law of sines we must be careful to
consider whether our answer makes sense physically.
Now for the fun. A fairly representative exercise whose solution uses trigonometry is the
following.
Example 1: An executive from a window washing company is putting in a bid for the weekly
washing of the windows of a skyscraper. To set his price he needs to know approximately how
tall the building is. His ex-wife works for the city planning office, so he doesn’t dare call them
to ask for the information. Instead he moves away from the building 101 feet and measures
the angle of elevation from that point to the roof of the building as 73◦ = 73π/180 radians.
45
Section 12: Applications of Trigonometry
Thus the height of the building h satisfies h/101 = tan(73◦ ). So h is about 101 tan(73◦ ) ≈ 330
feet.
Example 2: Fred and George are playing wallyball. George wants to pass the ball to Fred but
a very tall opposing player is in the way. Fred is standing 4 feet away from a wall which George
√
is standing 3 feet from. Fred and George are standing 145 feet apart and George decides
that he wants to make a chest-high carom-pass off the wall to Fred. How far along the wall
between them should George pass the ball.
Solution: Let the point where George is standing be labelled A, and the point where Fred is
standing B. Drop perpendiculars from A and B to the wall and label these points C and D
respectively. Let P be the point on the wall which George should hit. Denote the length from
P to C as x, so the length from P to D is 12 − x. Let θ be the angle with vertex P , initial side
along the wall, and terminal side P A. Basic physics tells us that θ should also be the angle with
x
12 − x
vertex P , one side along the wall, and the other side P B. Therefore = cot θ =
. Thus
3
4
1
4x = 3(12 − x) = 36 − 3x, which means x = 36/7 = 5 feet. By the way x/3 = 12/7 = cot θ,
7
so θ = arccot (12/7) ≈ 30.2◦ .
Example 3: A car leaves a little town on a road traveling due South at 75 mph. Twenty minutes
later a truck heads out of town traveling due East at 55 mph. How far apart in miles are the
vehicles forty-five minutes after the truck departed?
Solution: Let t = 0 correspond to when the truck departed and let us measure t in hours. Let
a(t) denote the distance in miles of the car to the town as a function of t, and b(t) denote the dis75
+ 75t = 25 + 75t
tance of the truck in miles to the town as a function of time. Then a(t) =
3
and b(t) = 55t. By the Pythagorean theorem the distance between them in miles as a function
of time t in hours is
d(t) =
So d(3/4) = 5
miles.
p
p
p
(25 + 75t)2 + (55t)2 = 625 + 3750t + 8650t2 = 5 25 + 150t + 346t2
p
√
√
25 + 150(3/4) + 346(9/16) = (5/4) 400 + 1800 + 3114 = (5/4) 5314 ≈ 91.1
Example 4: A passenger jet leaves an airport at 12:00 pm traveling due west at 430 mph. A
private jet leaves the same airport at 12:30 pm traveling 65◦ N of West at 552 mph. How far
apart in miles are the planes at 2:00 pm?
46
Section 12: Applications of Trigonometry
Solution: Let t denote time in hours and let t = 0 correspond to 12:30 pm. Let a(t) denote
the distance of the passenger plane to the airport in miles. Let b(t) denote the distance of
the private jet to the airport in miles. So similar to the previous example a(t) = 215 + 430t
and b(t) = 552t. By the law of cosines the distance squared between them at time t is
d2 (t) = (215 + 430t)2 + (552t)2 − 2(215 + 430t)(552t) cos(65◦ ). So when t = (3/2), the distance
squared is d2 (3/2) = (4 · 215)2 + (828)2 − 2(860)(828) cos(65◦ ). Therefore
p
d(3/2) = (4 · 215)2 + (828)2 − 2(860)(828) cos(65◦ ) ≈ 907 miles.
Example 5: A drug smuggler is traveling due North at 35 knots in a small boat. A coast guard
cutter is 125 nautical miles away on a heading 22◦ East of North from the smuggler. What
heading should the cutter take to intercept the smuggler, supposing that the cutter will run
at its top speed of 40 knots? How long will it take?
Solution: We could use the law of cosines here, but the law of sines is easier. Let A denote the
position of the smuggler, P denote the position of the cutter and I denote the point of intercept.
Let θ be the angle with vertex P and sides AP and IP . Let t denote time to intercept in hours.
sin θ
sin(22◦ )
=
So sin θ = (35/40) sin(22◦ ) = (7/8) sin(22◦ ). Which
Then by the law of sines
40t
35t
means that θ = arcsin((7/8) sin(22◦ )) ≈ 19◦ . So the cutter should head roughly 49◦ South of
sin(139◦ )
sin(22◦ )
West to intercept the smuggler. The time of intercept can be found using
=
125
35t
125 sin(22◦ )
≈ 2.04 hours.
So t =
35 sin(139◦ )
Example 6: A pilot is trying to get to an airport 362 miles away which lies on a major highway
that runs East and West. The control tower tells them to proceed on a heading 35◦ South
of West. The pilot knows the compass in the plane is not reliable so they make a guess at
the correct direction and proceed at 156 mph. Two hours later the pilot reaches the major
highway. How far from the airport is she?
Solution: Label the airport point A, the starting point of the plane P and the point where
the airplane reaches the highway C. Then the length of AP is 362 miles, and the length
of CP is 2 · 156 = 312 miles. The angle with vertex A and sides AP and AC has measure
35◦ . Since 362 sin(35◦ ) < 312 < 362 this is the ambiguous case of the law of sines. Still
if we let γ denote the angle with vertex C and sides CA and CP , the law of sines implies
sin γ = (362/312) sin(35◦ ) So γ = arcsin((181/156) sin(35◦ )) ≈ 41.7◦ or 138.3◦ . Unless the
pilot has a completely lousy sense of direction, γ is the larger angle. Therefore the distance
47
Section 12: Applications of Trigonometry
sin(6.7◦ )
sin(35◦ )
sin(6.7◦ )
=
. Which means d = 312
≈ 63.4 miles. If
d
312
sin(35◦ )
we choose the other value of γ we find d ≈ 529.4 miles!
d from A to P satisfies
As an example of a possible non-word exercise we have
Example 7: Show that for each A, B ∈ IR there is C, δ ∈ IR so that A sin x + B cos x =
C sin(x + δ).
Solution: Let C satisfy C 2 = A2 + B 2 , and δ = arctan(B/A). Draw a right reference triangle
with sides of length |A|, |B| and |C| and included angle |δ|. Then A = C cos δ and B = C sin δ
so that A sin x + B cos x = C cos δ sin x + C sin δ cos x = C sin(x + δ).
Exercises: Give exact answers if possible.
1. The Butler Office Building (BOB) is 847 feet high. An apartment building stands across
Main Avenue from BOB. Someone standing on the edge of the roof of the apartment building
closest to BOB measures the angle of elevation to the top of BOB as 45◦ . They measure the
angle of depression from their location to the foot of BOB as 60◦ . How far apart are the two
buildings, and how tall is the apartment building?
2. An Army Reserve Squad is dropped off at point B and ordered to march on a heading of
42◦ North of East through a dense forest to a bivouac site at point A, 23 miles away on a
North-South road. During the drop all their compasses are damaged. They take a guess at
their proper direction and march off. Marching at 6 miles per hour they arrive at the road in
1
4 hours. How far are they from the bivouac site? How far off was their estimated heading?
2
3. Two jets depart LAX simultaneously at midnight. The first flies on a heading 32◦ North of
East at 350 mph. The second flies on a heading 13◦ South of East at 467 mph. How far apart
are the planes at 2:20 am?
4. Show that for each A, B ∈ IR there is C, δ ∈ IR so that A sin x + B cos x = C cos(x − δ)
5. John has a tree in his back yard that he wants removed. The tree removal company charges
$25 a foot of height to remove it. John wants to know if he can afford it so he needs to know
the height of the tree. He stands 21 feet away from the tree on level ground and measures the
angle of elevation from himself to the top of the tree as 75◦ . About how much will it cost John
to have the tree removed?
48
Section 13: Polar Coordinates
A polar coordinate system is one in which there is a unique point in the plane called
the pole and a ray through the pole called the polar axis. A point, P , in the plane can be
coordinatized by its distance |r| to the pole and the measure of the angle θ from the polar axis
to the ray through the pole and P . Normally we take the pole to lie at (0, 0) in the Cartesian
coordinate system, and take the polar axis to be the positive x-axis. It is therefore common to
measure θ in radians. Of course if θ and φ are coterminal angles, then the polar coordinates
(r, θ) and (r, φ) describe the same point. So polar coordinates are not unique! In fact if we
take the convention that when r is negative we move in the antipodal direction of θ, then the
polar coordinates (r, θ) and (−r, θ ± π) represent the same point. To make our life easier we
normally restrict the values of θ to lie in the range (−π, π].
With these facts in mind we seek a way to convert polar coordinates and Cartesian (or rectangular) coordinates. The point P (θ) lies on the line through the origin and (r, θ). Therefore
by Euclid’s Theorem on similar triangles the Cartesian coordinates of the point are x = r cos θ,
and y = r sin θ. Very simple!
The conversion from rectangular coordinates to polar coordinates is not so smooth. When
x = 0, the polar coordinates of the point are (y, π/2). When y = 0, the polar coordinates
of the point are (x, 0). Otherwise x 6= 0 and the point with Cartesian coordinates (x, y) is a
vertex of the nondegenerate right triangle whose other two vertices are (0, 0) and (x, 0). By
p
the Pythagorean theorem the distance from (x, y) to the origin is r = x2 + y 2 . Let θ denote
the angle with vertex at the origin, initial side the positive x-axis and terminal side the line
segment between the origin and (x, y). If α is the reference angle for this value of θ, then
we know that tan α = |y|/|x| and in fact tan θ = y/x. So if −π/2 < θ < π/2, we can take
θ = arctan(y/x). In general as long as θ is coterminal with an angle in the domain of the
arctangent function, we have no problem. When θ is not coterminal with such and angle,
θ + (2n + 1)π is for some integer n. We can therefore describe the point with rectangular
coordinates (x, y), by the polar coordinates (−r, θ + (2n + 1)π).
A major application of polar coordinates is that the description of certain graphs in polar
coordinates lets us analyze them as functions, rather than relations. For example, x2 + y 2 = a2
has graph a circle with radius a centered at the origin. In polar coordinates x2 + y 2 = r2
(squaring the top equation on the right) so we can describe this circle as r = a, a very simple
49
Section 13: Polar Coordinates
formula in polar coordinates.
By convention we do try to write polar equations in the form r = f (θ). So r is the
dependent variable and θ is independent. In order to graph polar equations we use T-tables.
π π π π
The values we normally plug-in for θ are multiples of 0, , , , etc. That is values of θ
6 4 3 2
for which we can compute exact values for r. Standardly we keep the θ values in increasing
order and after connecting our dots we use arrows to display the behavior of the curve as θ
increases. This is called orienting the graph. This is a crucial part of the graph. If a > 0,
the polar equations r = a and r = −a both represent the circle centered at the origin with
radius a. The curve r = a is oriented positively, meaning that the motion along the curve as θ
increases is counter-clockwise. The curve r = −a is oriented negatively, since the motion along
the curve as θ increases is clockwise. These are therefore different curves in polar coordinates,
though they transform to the same curve in rectangular coordinates.
There is therefore some inherent danger when we attempt to convert equations in one
coordinate system to the other. When we convert from polar to rectangular we lose orientation.
When we convert from rectangular to polar we must often choose an orientation. The loose
convention we use is to opt for the positively oriented polar curve.
Example 1: Transform the rectangular equation y = 2x to a polar equation.
Solution: By substitution we have r sin θ = 2r cos θ. So if r 6= 0, sin θ = 2 cos θ. Equivalently
tan θ = 2, or θ = arctan 2. This works when r = 0 too, since we are at (0, θ) which is (0, 0)
in rectangular coordinates. This kind of polar curve is not orientable, since θ is fixed. This is
the polar equivalent of the vertical line in rectangular coordinates which has no slope.
Example 2: Transform the polar equation r = 2 sin θ to a rectangular equation.
Solution: Supposing that r 6= 0, we can multiply both sides by r to obtain r2 = 2r sin θ. By
substitution we have x2 + y 2 = 2y. By completing the square on y we get x2 + (y − 1)2 = 1.
Thus this polar equation describes a circle of radius 1 tangent to the pole, oriented positively.
Example 3: Transform the polar equation r = sin 2θ to a rectangular equation.
Solution: Again supposing that r 6= 0 we first write r3 = r2 sin 2θ = r2 (2 sin θ cos θ). Which
we rewrite as r3 = 2(r sin θ)(r cos θ). So by substitution (x2 + y 2 )3/2 = 2xy. Therefore
(x2 + y 2 )3 = 4x2 y 2 . But unless you know about the classification of general expressions of
total degree 6 in x and y, this equation will not help you sketch a graph of the curve. It does
50
Section 13: Polar Coordinates
enable us to realize that the un-oriented curve is symmetric with respect to the y-axis, the
x-axis, the line y = x, and with respect to the origin, since substituting in ±x for x and/or
±y for y leaves the equation unchanged. So if we figure out what the graph looks like in the
first quadrant, we can apply symmetry to determine the rest.
The values of θ we want to use to build a T-table are those for which we can compute
exact values of sin 2θ. So for example 2θ = π/6, 2θ = π/4,, 2θ = π/3, etc. So we build the
T-table
θ 0
r(θ) 0
π/12 √
π/8
1/2
2/2
π/6
√
3/2
π/4
1
Thus the graph of this polar equation is
θ=
π
3
θ=
π
6
1
We don’t always need to transform a polar equation to rectangular form to investigate
symmetry. If replacing θ with −θ leaves a polar equation unchanged, then its oriented graph
will have polar axis (x-axis) symmetry. If replacing θ by π − θ leaves a polar equation unchanged, then the oriented graph will have symmetry with respect to the line θ = π/2 (the
y-axis). Finally if replacing r by −r leaves a polar equation unchanged, then its oriented graph
will have polar (original) symmetry.
Example 3 indicates that these tests for polar symmetry are not as easy to implement
as tests for rectangular symmetry. So there can be value to transforming polar equations to
rectangular ones, even if the rectangular equations cannot be easily graphed.
But the real problem persists, what does the graph of a polar equation look like? Aside
from symmetry many polar equations have graphs which are periodic. This is because the
most common functions to plug θ into are trigonometric. In such cases we are interested to
51
Section 13: Polar Coordinates
know the range of θ values we must run through before the graph begins to repeat itself. In
the example above, whereas sin 2θ has period π viewing it as a rectangular equation relating
r and θ, the polar point (sin 2θ, θ) is antipodal to the polar point (sin(2(θ + π)), θ + π), since
sin 2(θ + π) = sin(2θ + 2π) = sin 2θ. So in this case the period of the polar curve turns out to
be 2π.
Example 4: Sketch a graph of the polar equation r = sin 3θ.
Solution: One can check that this graph will have y-axis symmetry. To determine further
symmetries we plot r versus θ rectangularly.
1
−π
− 2π
3
− π3
π
3
2π
3
π
-1
From this we observe that the polar graph will be symmetric about θ = π/6, about θ = π/2
and about θ = 5π/6. Also we note that the period of the curve will be π since sin(3(θ + π)) =
sin(3θ+3π) = sin 3θ cos 3π+sin 3π cos 3θ = − sin 3θ and therefore (sin 3θ, θ) and (− sin 3θ, θ+π)
are the same point. Moreover the part of the polar curve about π/6 in shape will be the same
as the part about π/2 with negative radii, and will be identical to that about 5π/6, back to
positive radii.
52
Section 13: Polar Coordinates
θ=
π
3
θ=
π
6
1
A general analysis leads one to recognize that a curve with polar equation r = a cos 2kθ,
where k is a whole number, will have a graph which is a rose with 4k petals of length a, period
2π, and x-axis symmetry. The graph of r = a sin 2kθ, is virtually identical except that it has
y-axis symmetry. In contrast the graph of the curve with polar equation r = a cos(2l + 1)θ,
where l is a whole number, will be a rose with 2l + 1 petals of length a, each traversed twice
as θ runs from 0 to 2π. This graph therefore has period π and also has x-axis symmetry. So
rose curves might be analogous to quadratic equations in two variables, in that their behavior
is completely describable by the form of the equation after it is put in standard form.
Another class of polar curves with this property are the limaçons. A limaçon is a polar
curve with equation of the form r = a + b cos θ, or r = a + b sin θ, where a and b are constants.
The second class behave like the first except that they have y-axis symmetry instead of x-axis
symmetry.
For the first class there are three distinct possibilities: 1) a < b, 2) a = b, and 3) a > b.
In each case the curve has period 2π. In the first case r = 0 twice in each period, since r = 0
implies that cos θ = −a/b < −1. Which means θ = arccos(−a/b), and 2π − arccos(−a/b). Also
the sign of r changes at θ and again at 2π − θ. Consequently the graph has an inner loop. We
distinguish this case by saying that the curve is a limaçon with an inner loop. In the second
case r = 0 only once per period, when θ is an odd multiple of π. The result is what we called a
cardioid. In the third case r is never zero and the graph looks rather like an egg, or a blunted
ellipse.
53
Section 13: Polar Coordinates
a<b
a=b
a>b
Another application of polar coordinates is that they allow us to easily define and describe
nondegenerate conic sections. This is a two-step process.
First we define a conic as the set of points in the plane so that the distance from any
point to a fixed point F , called the focus, is e times the distance from the point to a fixed line
l, called the directrix. When e = 1 we have by definition a parabola. It turns out that e is
exactly the eccentricity of the conic as discussed in section 7.
We can then derive that if F is the pole, any conic has equation of the form
r=
ed
,
1 + e cos θ
r=
ed
,
1 − e cos θ
r=
ed
,
1 + e sin θ
or r =
ed
1 − e sin θ
where d is the distance from F to l.
Exercises:
1. Plot the point with polar coordinates (5, 5π/6). b) Convert the polar coordinates to
rectangular coordinates. c) Give two other pairs of polar coordinates which represent the
same point, one pair with r > 0, the other with r < 0.
2. Convert the rectangular equation to a polar equation
a) y = x
b) x2 + y 2 = 4x
c) y = x2
3. Sketch and label a graph of each polar equation clearly indicating orientation.
a) r = 3 cos 3θ
b) r = −3 cos 3θ
c) r = 1 − 2 sin θ
d) r = −1 − 2 sin θ
e) r = 4 sin θ
4. Find a polar equation for the ellipse with focus at the pole, directrix with rectangular
1
equation x = 2 and eccentricity e = .
2
54
Section 14: Parametric Equations
The last topic for us to treat is parametric equations, so called because both rectangular
coordinates of a point on a curve are described by another quantity called the parameter. We
normally take our third variable to be t and think of t as denoting time. This is the situation
in many physical applications. A parametric curve is the set of all ordered pairs (x, y) where
x = g(t) and y = h(t), where g(t) and h(t) describe x and y respectively in terms of t.
An easy way to build a set of parametric equations is to start with y = f (x) and let
x = g(t) = t be the parameter so that y = f (t) = h(t). Any function can be parametrized in
this manner. There is a change in the graph of the curve in this case. As with polar curves,
every parametric curve is oriented. We place arrow marks on the curve to denote the motion
along the curve as the parameter t increases. Similarly any relation of the form x = k(y) can
be parametrized by y.
Every polar curve of the form r = f (θ) gives rise to a parametric curve parametrized
by θ. The parametric equations are x = r cos θ = f (θ) cos θ = g(θ), and y = r sin θ =
f (θ) sin θ = h(θ). This includes r = a, a circle with radius |a| centered at (0, 0). The standard
parametric equations for this curve are x = a cos θ and y = a sin θ. This works because
x2 + y 2 = a2 cos2 θ + a2 sin2 θ = a2 .
This construction generalizes to x = h + a cos t, y = k + b sin t which are parametric
equations for the ellipse (x − h)2 /a2 + (y − k)2 /b2 = 1. The sign of a and b don’t have to be
positive so there are really four possible sets of parametric equations of this form for this ellipse
x = h ± a cos t, and y = k ± b sin t. Since the crucial fact here is that sin2 t + cos2 t = 1, we can
also parametrize this ellipse via x = h ± a sin t, y = k ± b cos t. These eight sets of parametric
equations parametrize the same curve. They are distinct as is seen by considering which point
corresponds to (x(0), y(0)) and whether the curve is oriented positively or negatively.
If we tweak the Pythagorean identity to be in the form 1 = sec2 t − tan2 t, we can
parametrize the hyperbola (x − h)2 /a2 − (y − k)2 /b2 = 1 via x = h ± a sec t, and y =
k ± b tan t. In this case x = h ± a tan t and y = k ± b sec t parametrize the conjugate hy-
perbola (y − k)2 /b2 − (x − h)2 /a2 = 1.
et − e−t
et + e−t
e2t + 2 + e−2t
Define sinh t =
and cosh t =
, then cosh2 t =
, and
2
2
4
e2t − 2 + e−2t
sinh2 t =
. So cosh2 t − sinh2 t = 1. We can therefore also use these functions,
4
which are the hyperbolic sine and cosine functions, to parametrize a hyperbola.
55
Section 14: Parametric Equations
Since every parabola is either a function of the form y = k + (1/4c)(x − h)2 , or a relation
of the form x = h + (1/4c)(y − k)2 every parabola is parametrizable.
To summarize, every function is parametrizable, nearly every polar curve is parametrizable, and every conic section is parametrizable.
As we see with ellipses and hyperbolas, there are often many ways to parametrize a curve.
In fact if we have a set of parametric equations for a curve x = g(t) and y = h(t), then we can
reparametrize part or all of the curve by s where t = f (s), and f is a function of s. The new
parametric equations are x = g(t) = g(f (s)), and y = h(t) = h(f (s)).
Example 1: Let x = t and y = x2 = t2 be a set of parametric equations describing the parabola
y = x2 . If we set t = sin s, then x = t = sin s and y = t2 = sin2 s. Which means that the points
on the curve x = sin s, y = sin2 s are on the curve y = x2 . However not every point of y = x2
is on this parametric curve since −1 ≤ x = sin s ≤ 1, and 0 ≤ sin2 s = y ≤ 1. This example is
the first of a kind where we have the motion of the curve oscillating. If we want more or less
of the curve we can use x = c sin u, and y = c2 sin u. This parametric curve describes the path
of a particle that oscillates between (−c, c2 ) and (c, c2 ) along the path y = x2 .
12
10
8
6
4
2
-4
-2
2
4
x = 3 sin t, y = 9 sin2 t
Example 2: Let C denote the part of the ellipse x2 /4 + y 2 /9 = 1 in the first quadrant. We can
find sets of parametric equations for C in various ways.
First we can solve for y in terms of x, since this part of the ellipse satisfies the horizontal
line test. We get y 2 /9 = 1 − (x2 /4) = (4 − x2 )/4, so y 2 = (9/4)(4 − x2 ). Thus we can set x = t
√
and y = (3/2) 4 − t2 , where t ∈ [0, 2].
56
Section 14: Parametric Equations
We can reverse orientation by setting x = t = 2 − s. Now 4 − t2 = 4 − (2 − s)2 =
√
4 − (4 − 4s + s2 ) = 4s − s2 , so y = (3/2) 4s − s2 , and s ∈ [0, 2].
√
We can increase speed as it were by setting x = t = 2s, and y = 3 1 − s2 , where s ∈ [0, 1].
√
We can decrease speed by setting x = t = (1/2)s and y = (3/4) 16 − s2 , where s ∈ [0, 4].
p
√
We can call x = t = 2 sin θ, then y = (3/2) 4 − 4 sin2 θ = (3/2) 4 cos2 θ = 3 cos θ. Now
we take θ ∈ [0, π/2].
Example 3: Let C be the line segment between (1, −2) and (3, 6). The line through these points
has rectangular equation y = 4x − 6. We can therefore parametrize C by x = t, y = 4t − 6,
where t ∈ [1, 3].
If we set t = s + 1, then x = s + 1, and y = 4(s + 1) − 6 = 4s − 2, where s ∈ [0, 2].
If we then set s = 2u we get x = 2u + 1, and y = 8u − 2, where u ∈ [0, 1].
If we set u = 1 − v we reverse the orientation and have x = 3 − 2v, and y = 6 − 8v, where
v ∈ [0, 1].
If we have an object for which we know rectangular or polar equations, we can therefore
parametrize the curve fairly easily. One challenge may be to find the best parameterization
with respect to a given set of criteria. It might be instead that we want a parameterization
with a specific orientation, and with a specified domain for the parameter. The tricks above
work as long as we have at least one parameterization.
A problem that arises is that we won’t always be given the rectangular or polar equations
which a curve, or part of a curve satisfy. Instead we’ll begin with just a set of parametric
equations. In order to plot the graph of a set of parametric equations we can use a three
row/column table. The first row is labeled t and contains inputs. The second row consists of
the outputs x = g(t). The third column consists of the outputs y = h(t). The ordered pairs
(x, y) from the second and third columns belong on the graph. When we connect these dots
it is again important to orient the graph to indicate the behavior as t increases. The number
of values in the table varies from person to person. One should be careful to have a sufficient
number of values. Also the t-values should be carefully chosen so that, if possible, exact values
can be calculated for both x and y.
Even when one is careful mistakes can easily be made. Eliminating the parameter to find
an equation relating x and y is therefore a useful skill. We can then use what we know about
this relation to ensure we’ve got a proper graph.
57
Section 14: Parametric Equations
Example 4: Given the parametric equations x = tan θ + sec θ, and y = tan θ − sec θ, where
−π/2 < θ < π/2 let us sketch a graph of the curve. The first thing we might do is to compute
a three-row T-table where the input values of θ are ±π/6, ±π/4, ±π/3, and 0.
θ
−π/3
√
x(θ) 2 − √3
y(θ) −2 − 3
√−π/4
2−1
√
− 2−1
−π/6
√
3/3
√
− 3
0
1
−1
π/6
√
√3
− 3/3
√π/4
2+1
√
− 2+1
π/3√
2 + √3
−2 + 3
Plotting these points by hand and trying to connect the dots is dangerous. Much safer is to
realize that xy = tan2 θ − sec2 θ = −1 by a variation of the Pythagorean identity. So the points
of the parametric curve lie on the hyperbola xy = −1.
5
-5
5
-5
We may also need to remember that the parametric equations do not need to define the
whole rectangular curve.
√
Example 5: Given x = t and y = 1 − t, describe the curve parametrized.
Solution: To begin with the domain for the parameter is at most [0, ∞). Secondly we can solve
for t as a function of x to get t = x2 . So this set of parametric equations describes the right
half of the downward opening parabola y = 1 − x2 . The orientation has (0, 1) = (x(0), y(0))
and as t increases the point moves toward (∞, −∞).
Exercises:
1. Let x(t) = 1 + 4 sin t and y = −2 + 3 cos t. a) Find a rectangular equation for this relation.
b) Sketch and label a graph of the curve clearly indicating orientation.
2. Let x(t) = t + 2 and y(t) = 3t − 5. a) Find a rectangular equation for this relation. b)
Sketch and label a graph of the curve clearly indicating orientation.
58
Section 14: Parametric Equations
3. Find a set of parametric equations for the line through P = (1, −1) and Q = (−2, 4) so
that P = (x(0), y(0)) and Q = (x(1), y(1)).
4. Find a set of parametric equations for the line through P = (2, 1) and Q = (−2, 2) so that
Q = (x(1), y(1)) and P = (x(−1), y(−1)).
5. Find a set of parametric equations for the ellipse
(x(0), y(0)) and (0, −4) = (x(π/2), y(π/2)).
59
x2
(y + 1)2
+
= 1 so that (4, −1) =
16
9
Appendix A: The Real Line and the Cartesian Plane
The main purpose of this appendix is to present many properties of the real numbers you
hopefully already know. A second goal is to introduce you to specific notation. A third is to
give you a sense of just how highly developed the real number system is. Of course the same
is true of the Cartesian Plane.
Subsection 1: The Real Line
A natural number is a non-negative whole number. We use the set of natural numbers, IN,
for counting. It is useful to know that the sum or product of two natural numbers is again a
natural number. We say that IN is closed with respect to addition and multiplication. However
IN is not closed with respect to subtraction, since for example 2 − 4 = −2 and −2 is not a
non-negative whole number.
An integer is a whole number. The set of integers, ZZ, contains the set of natural numbers.
It is also closed with respect to addition and multiplication, and has the added advantage of
being closed with respect to subtraction. It is not closed with respect to division, since for
example 12 ÷ 5 = 2 52 is not a whole number.
The set of rational numbers, Q (q for quotient), consists of all ratios of integers with
non-zero denominators. Q is closed with respect to addition, multiplication, and subtraction.
Also as long as we don’t divide by zero, the ratio of two rational numbers is again a rational
number. In fact Q is the smallest set containing IN where we can perform all four basic binary
operations, barring only dividing by zero. End of story? NO.
The ancient Greeks knew that certain necessary numbers were not rational.
√
Fact: 2 is not rational.
√
√
p
Proof: Suppose 2 ∈ Q. Then 2 = , where p, q ∈ ZZ, and q 6= 0. We may also suppose
q
p
that p and q have no common prime divisor. We say that is in lowest terms. By squaring
q
p2
we obtain 2 = 2 . Clearing the denominator gives 2q 2 = p2 . From which we deduce that p2 is
q
even (it is divisible by 2). If p were odd, then p2 would also be odd. We therefore conclude that
p is even. Thus p = 2k for some whole number k. But this implies 2q 2 = p2 = (2k)2 = 4k 2 .
Which reduces to q 2 = 2k 2 . As above we deduce that q is even. But this is a contradiction
since now 2 is a common prime divisor of p and q. Since we arrived at a contradiction our
√
original hypothesis must be false. Therefore 2 6∈ Q.
60
Appendix A: The Real Line and the Cartesian Plane
To form the real numbers, IR, it is customary to consider a continuous horizontal line of
infinite length. One point on the line is designated as 0. Another point to its right is designated
as 1. The distance from 1 to 0 forms a unit by which all other distances are measured. For
any point x on the line exactly one of the following statements is true: 1) x is to the left of
0 (x is negative), 2) x is to the right of 0 (x is positive), or 3) x = 0. This property is called
the trichotomy law. We denote the distance from x to 0 by |x|. When x is to the left of
zero we have x = −|x|. Otherwise x = |x|. This basically defines the absolute value function
√
f (x) = |x| = x2 .
This model also gives rise to the notion of order on the line. We write a < b, for real
numbers a and b, when a − b = −|a − b|. Also a ≤ b is equivalent to b − a = |b − a|. The
trichotomy law generalizes in that if a and b are real then exactly one of a < b, b < a and a = b
is true. In fact one can prove that the order relations have the following properties.
Theorem: (Inequality Properties) For real numbers a, b and c
1) if a < b, then a + c < b + c
2) if a < b and 0 < c, then ac < bc
3) if a < b and c < 0, then bc < ac
To begin to understand the importance of this construction we first state and prove the
Pythagorean Theorem. The reader is strongly encouraged to draw the necessary figures to
illustrate the proof.
Theorem: For a right triangle with hypotenuse of length c and sides of lengths a and b, we
have a2 + b2 = c2 .
Proof: Let A = (a + b)2 be the area of a square of side a + b. Let B denote the area of a right
triangle with dimensions as per the theorem statement.
1) Draw a square S with side a + b and two sides horizontal. Let C1 denote the point on the
left hand side which is b units down from the top left corner. Let C2 be the point on the
bottom side which is b units to the right of the lower left corner. Let C3 be the point on the
right hand side which is b units up from the lower right hand corner. Let C4 denote the point
on the top side which is b units to the left of the upper right hand corner. Connect C1 and C2
by a line segment. Repeat by connecting C2 to C3 , C3 to C4 and C4 to C1 . S now contains
four non-overlapping right triangles whose sides measure a, b and c as above. The remaining
61
Appendix A: The Real Line and the Cartesian Plane
area inside S is the area of a square of side c. Therefore A = 4B + c2 .
2) Draw another square S 0 of side a + b and two sides horizontal. Let D1 = C1 and D2 = C2 .
Let D3 be the point on the right side of S 0 which lies b units down from the top right hand
corner. Let D4 be the point on the upper side of S 0 which lies b units to the right of the
upper left hand corner. With line segments connect D1 to D2 , D1 to D3 , D4 to D2 , and D4
to D3 . Now S 0 contains four non-overlapping right triangles with sides of lengths a, b and c.
The remaining area inside S 0 is the area of two squares, one of side a, and the other of side b.
Therefore A = 4B + a2 + b2 .
Since S and S 0 have the same area we deduce a2 + b2 = c2 .
Because
√
2 is the length of the hypotenuse of an isosceles right triangle with sides of unit
length, it is a real number. Real numbers which are not rational are called irrational. The set
√ √
of irrational numbers is not closed with respect to multiplication since for example 2 2 = 2.
Consequently we have no standard symbol for this set.
The absolute value function also plays a role in the definition of distance on the real
line. The distance from a to b, denoted d(a, b) equals |a − b|. Naturally d(a, b) = d(b, a) and
d(a, b) ≥ 0 with equality if and only if a = b. Seemingly innocuous is the statement that the
midpoint of the line segment joining a and b is (a + b)/2. This fact is used repeatedly in section
7.
Finally, the absolute value function satisfies the following properties.
Theorem: (Absolute Value Properties) For real numbers a, b and c
1) |ab| = |a||b|
¯ a ¯ |a|
¯ ¯
2) If b 6= 0, ¯ ¯ =
b
|b|
3) |c| < a if and only if −a < c < a
4) |c| ≤ a if and only if −a ≤ c ≤ a
3) |c| > a ≥ 0 if and only if c < −a or c > a
3) |c| ≥ a ≥ 0 if and only if c ≤ −a or c ≥ a
In turn the order relations are used to define intervals. For real numbers a and b, [a, b]
denotes all real numbers x with a ≤ x ≤ b. Similarly [a, b) = {x ∈ IR|a ≤ x < b}, (a, b] =
{x ∈ IR|a < x ≤ b}, and (a, b) = {x ∈ IR|a < x < b}. We use the infinity symbol, ∞ when
we have an interval of the real line which extends indefinitely in the positive direction. So
62
Appendix A: The Real Line and the Cartesian Plane
(a, ∞) = {x ∈ IR|a < x}. Also (−∞, b] = {x ∈ IR|x ≤ b}. It is to be understood that ∞ is a
symbol, not a real number.
Intervals are special subsets of the real line. Whenever we have sets A and B we denote
their union by A ∪ B. The union of two sets is the set whose elements are in at least one of
the two original sets. Also the intersection of two sets A ∩ B is the set whose elements are in
both A and in B. We can build more general subsets of IR using unions and intersections of
intervals.
The last computational tool we introduce in this subsection is the sign graph. The premise
behind this is contained in part 3 of the properties of inequalities. To paraphrase “minus times
minus is plus”, “minus times plus is minus”, and “plus times plus is plus”. So for example we
can determine all x ∈ IR so that x3 − 4x ≥ 0. To proceed we factor x3 − 4x = x(x2 − 4) =
x(x + 2)(x − 2). For each linear factor the trichotomy law allows us to determine intervals
where the factor is positive and negative. We combine these subsolutions according to the
rules above.
x
–
–
–
–
–
–
–
–
0
+ + + + + + + +
x−2
–
–
–
–
–
–
–
–
–
–
x+2
–
–
–
–
–
0
+
+
+ + + + + + + + +
x3 − 4x
–
|
–
–
|
–
0
|
+
+
|
+
0
|
-4
-3
-2
-1
–
0
–
–
|
1
–
–
0
0
|
2
+ + + +
+ + + +
|
|
3
4
So x3 − 4x ≥ 0 when x ∈ [−2, 0] ∪ [2, ∞).
Subsection 2: The Cartesian Plane
One of the greatest mathematical ideas of all time is due to René DesCartes. Given
an infinite flat plane choose a point O. Draw two perpendicular lines concurrent at O, one
horizontal and the other vertical. Call the horizontal line the x-axis and the vertical line the
y-axis. On the x-axis pick a point P to the right of O and let the distance from P to O
serve as 1 unit of length in the horizontal direction. Pick another point Q above O on the
y-axis and use the distance from O to Q as a unit of vertical distance. Any point in the
plane can then be assigned a unique ordered pair of Cartesian coordinates. To label a point
63
Appendix A: The Real Line and the Cartesian Plane
(a, b) means that it lies |a| units from O horizontally (to the left if a = −|a|, and to the
right if a = |a|.) and |b| units from O vertically (down if b = −|b|, and up otherwise). The
points with (a, b) = (|a|, |b|) lie in the first quadrant. Those with (a, b) = (−|a|, |b|) are in the
second quadrant. The third quadrant consists of those points with (a, b) = (−|a|, −|b|). The
points with (a, b) = (|a|, −|b|) make up the fourth quadrant. In essence each coordinate axis
is a copy of the real line. We denote the Cartesian plane by IR2 = IR × IR.
Starting with (x1 , y1 ) and (x2 , y2 ) in the Cartesian plane form a third point (x2 , y1 ). These
points form the vertices of a (possibly degenerate) right triangle. Measured against the x-axis
p
we see that the horizontal side of the triangle has length |x2 − x1 | = (x2 − x1 )2 . The vertical
p
side has length |y2 − y1 | = (y2 − y1 )2 . By the Pythagorean Theorem the distance between
p
the two points is d((x1 , y1 ), (x2 , y2 )) = (x2 − x1 )2 + (y2 − y1 )2 .
A circle is defined to be the set of points all of which have a prescribed distance from a
given point called the center. The equation for a circle follows from the distance formula. If
the center is the point (h, k) and the distance from any point on the circle to the center is |r|,
then the circle consists of all points (x, y) so that (x − h)2 + (y − k)2 = r2 .
In practice the equation of a circle rarely comes in this standard form. The standard
form can be recaptured by completing squares. The basic formula to remember is (z + a2 )2 =
z 2 + az + ( a2 )2 . Given an equation x2 + ax + y 2 + by = c, we strategically add zero two times.
The first time we add zero in the form ( a2 )2 − ( a2 )2 . The second time we add zero in the
form ( 2b )2 − ( 2b )2 . The result via substitution transforms the equation x2 + ax + y 2 + by = c
first to x2 + ax + ( a2 )2 − ( a2 )2 + y 2 + by = c. Which after adding ( a2 )2 to both sides becomes
(x+( a2 ))2 +y 2 +by = c+( a2 )2 . At this point we have completed the square in x. After completing
the square in y and simplifying we arrive at (x + ( a2 ))2 + (y + ( 2b ))2 = c + ( a2 )2 + ( 2b )2 .
We close this appendix with another seemingly innocent remark that the midpoint of the
line segment joining the points with coordinates (x1 , y1 ) and (x2 , y2 ) has coordinates
((x1 + x2 )/2, (y1 + y2 )/2).
Exercises:
1. Use interval notation to describe the values of x which satisfy x2 − 3x − 4 > 0.
2. Solve the inequality
1
2
≤
. Write your answer using interval notation.
x+2
x−3
3. a) Find the distance between the points (2, 1) and (−1, −3).
64
Appendix A: The Real Line and the Cartesian Plane
b) Find the midpoint of the line segment joining (2, 1) and (−1, −3).
4. a) Complete the squares to find the center and radius of the circle with equation x2 − 2x +
y 2 + 4y = 11.
b) Sketch a graph of the circle in part a). Choose equal scales for both axes.
c) Repeat part b) but choose the y-scale to be twice as long as the x-scale.
5. Find an equation of the circle with center (−3, 4) which is tangent to the y-axis.
65
Appendix B: Lines
A linear equation in x and y is of the form Ax + By = C, where we take A, B, C ∈ IR.
If A = B = 0, the equation is either inconsistent (like 2 = 0) and has empty graph, or is the
trivial equation 0 = 0 whose graph is all of IR2 . So let us suppose that at least one of A or B
is non-zero.
Thus if B = 0, we must have A 6= 0 and our equation is equivalent to x = C/A. The
graph of this contains all points (C/A, y) as y ∈ IR. So the graph is a vertical line.
Otherwise B 6= 0 and we have the equivalent equation y = (−A/B)x + (C/B). In which
case we set m = −A/B and b = (C/B). When x = 0 we see that y = b is the y-intercept
of the graph. Too if (x1 , y1 ) is on the graph we have y1 = mx1 + b. Therefore for w ∈ IR,
y1 + mw = mx1 + mw + b = m(x1 + w) + b which is to say that every point of the form
(x1 + w, y1 + mw) as w ∈ IR is on the graph. In fact if (x2 , y2 ) is a point on the graph other
than (x1 , y1 ), then simultaneously we have y1 = mx1 + b and y2 = mx2 + b. Subtracting the
first equation from the second gives y2 − y1 = mx2 − mx1 = m(x2 − x1 ). In this case when we
set w = x2 − x1 so that x2 = x1 + w, we have y2 − y1 = mw which simplifies to y2 = y1 + mw.
So the graph of the equation consists exactly of all points of the form (x1 + w, y1 + mw) as
w ∈ IR. We say that the equation is determined by one point and the value of m which we call
the slope. Moreover from y2 − y1 = m(x2 − x1 ) with (x1 , y1 ) 6= (x2 , y2 ) and B 6= 0 we have
that x2 6= x1 wherefore m = (y2 − y1 )/(x2 − x1 ). So the graph is also determined by any two
distinct points. Given two distinct points on the graph their midpoint is
(
x2 − x1
y 2 − y1
x2 − x1
x2 − x1
x1 + x2 y1 + y2
,
) = (x1 +
, y1 +
) = ((x1 +
, y1 + m
)
2
2
2
2
2
2
And since the shortest distance between points in IR2 is a straight line, the graph must contain
the line through the two points. Of course every point on the graph is also on the line as
argued above.
So every line in IR2 has an equation of the form Ax + By = C, where not both A and B
are zero. Every non-vertical line in IR2 also has an equation of the form y = mx + b. This is
the slope-intercept form. Equivalently we can describe the line as all points (x, y) satisfying
an equation of the form (y − y1 ) = m(x − x1 ). This is called the point-slope form.
We use the convention that a vertical line has infinite slope. A horizontal line has slope
0. This allows us to define parallel lines as those whose graphs have the same slope when
66
Appendix B: Lines
sketched on the same coordinate axes.
Two lines are perpendicular if they meet at right angles. Since the lines are not parallel,
they intersect at a point (x0 , y0 ). If one line is horizontal the other is vertical and vice versa.
When neither is horizontal, then both lines have non-zero slope. Suppose that the first line has
slope m and the second has slope n. For w = 1 we have that the first line contains the point
(x1 , y1 ) = (x0 + 1, y0 + m), while the second line contains the point (x2 , y2 ) = (x0 + 1, y0 + n).
The three points (x0 , y0 ), (x1 , y1 ), and (x2 , y2 ) form the vertices of a right triangle. Therefore
by the Pythagorean Theorem we have that
[(x1 − x0 )2 + (y1 − y0 )2 ] + [(x2 − x0 )2 + (y2 − y0 )2 ] = [(x2 − x1 )2 + (y2 − y1 )2 ]
But x1 − x0 = 1 = x2 − x0 , y1 − y0 = m, y2 − y0 = n, x2 − x1 = 0, and y2 − y1 = n − m. So
after substitution we have 12 + m2 + 12 + n2 = (n − m)2 = n2 − 2mn + m2 . Which implies
that mn = −1. So a line is perpendicular to a line with non-zero slope if and only if its slope
is the negative reciprocal of the first line’s slope.
Exercises:
1. Find the equation in slope-intercept form of the line
a) through (1, 1) and (2, 3)
b) through (0, −2) and (−3, −6)
c) through (1, 2) with slope 5
d) through (−1, 2) parallel to y = 3x − 4
e) through (0, 1) perpendicular to y = 3x + 4
2. Convert every equation from exercise 1 to point-slope form.
3. Graph the lines whose equations are given. Scales for your axes should be clearly labelled
and chosen so that all interesting features of the graph can be clearly seen.
a) 4x + 3y = 12
b) y = −2x + 4
c) x = 3
d) (y − 3) = 2(x − 1)
e) 4x − 3y = 12
67
Appendix C: Factoring Polynomials and Complex Arithmetic
This appendix deals with finding factors of polynomials. The first major theorem is the
division algorithm for polynomials which mirrors the division algorithm for integers.
Theorem: (The Division Algorithm) If a(x) and b(x) are polynomials with real coefficients,
then there exist unique polynomials q(x) and r(x) with real coefficients, so that
a(x) = b(x)q(x) + r(x) and either r = 0 or the degree of r is strictly less than the degree of b.
The polynomial q(x) is called the quotient and the polynomial r(x) is the remainder.
When b(x) is a non-zero polynomial and a(x) = b(x)q(x) (with remainder zero) we say that
b(x) divides a(x). We call b(x) a factor of a(x).
Theorem: (The Factor Theorem) A polynomial p(x) = an xn + an−1 xn−1 + . . . + a1 x + a0 ,
where a0 , a1 , . . . , an ∈ IR and an 6= 0 has p(c) = 0 if and only if x − c divides p(x).
Proof: By the division algorithm p(x) = (x − c)q(x) + r(x), where r(x) = 0 or its degree is
strictly less that the degree of (x − c), which is 1. So r(x) is zero, or a non-zero constant.
Now if we set x = c, we get p(c) = (c − c)q(c) + r(c) = r(c). So r is the constant p(c).
Thus r = 0 if and only if p(c) = 0 if and only if x − c divides p(x).
Of course we are interested in such linear factors of p(x), since c is a root of p(x). We
therefore would like to how many possible values of c there are. For this we need to know
Theorem: (The Fundamental Theorem of Algebra) A polynomial p(x) = an xn + an−1 xn−1 +
. . . + a1 x + a0 , where a0 , a1 , . . . , an ∈ IR and an 6= 0 can be written uniquely (up to reordering
terms) as p(x) = an (x − α1 )(x − α2 ) . . . (x − αn ), where the roots α1 , α2 , . . . , αn are complex
numbers.
So a polynomial of degree n has n complex roots counting multiplicity. To access the
theorem we therefore need some knowledge of the complex numbers. The complex numbers
are denoted by C and consist of all objects of the form z = a+bi, where i2 = −1. The real part
of z is a and we denote it by Re(z). The imaginary part of z is b and we denote it by Im(z).
So every complex number corresponds to an ordered pair (a, b). We can therefore think of
the complex numbers forming a plane where the x-axis is the real axis, and the y-axis is the
imaginary axis.
68
Appendix C: Factoring Polynomials and Complex Arithmetic
Complex numbers are operated on by addition, subtraction, or multiplication as if they
are linear polynomials. The reduction i2 = −1 is used, if necessary, after the operation. So
for example (2 + 3i) + (1 − 4i) = 3 − i, (−2 + i) − (3 − 2i) = 1 + 3i, and (2 + 3i)(−1 − i) =
−2 + (−2 + (−3))i − 3i2 = −2 − 5i − 3(−1) = 1 − 5i. Dividing a complex number by a nonzero
complex number can also be done. To do this we first introduce the conjugate of z = a + ib,
which is the complex number a − ib, and is denoted by z. Next we observe that zz reduces to
a2 + b2 , which we call the norm of z, and denote |z|. It is critical that |z| = 0 if and only if
z = 0. This follows since |z| is the square of the distance of the point z to the origin in the
complex plane. Also z = z if and only if z = Re(z). At any rate, if z 6= 0, we have 1/z = z/|z|.
So to divide by z, we multiply by its conjugate and divide by its norm (which is a real number).
Complex conjugation has several other important properties. If z = a + bi and w = c + di,
then z + w = (a + c) + (b + d)i = (a + c) − (b + d)i = (a − bi) + (c − di) = z + w. The conjugate
of a sum is the sum of the conjugates. Similarly one can show that the conjugate of a product
is the product of the conjugates. This applies to sums and products with more than two terms.
Especially we have that the conjugate of a power of z is the power of the conjugate, z n = (z)n .
Finally z + z = 2a ∈ IR for any z ∈ C.
Consequently we arrive at the following fact.
Theorem: If p(x) = an xn + an−1 xn−1 + . . . + a1 x + a0 is a degree n polynomial with
a0 , a1 , . . . , an ∈ IR and z ∈ C is a root of p(x), then so is z.
Proof: We have
0 = 0 = p(z) = an z n + an−1 z n−1 + . . . + a1 z + a0
= an z n + an−1 z n−1 + . . . + a1 z + a0
= an z n + an−1 z n−1 + . . . + a1 z 1 + a0
= an (z)n + an−1 (z)n−1 + . . . + a1 (z)1 + a0
= an (z)n + an−1 (z)n−1 + . . . + a1 (z)1 + a0 , since a0 , . . . , an ∈ IR
= p(z)
Thus if z ∈ C−IR is a root of a polynomial p(x) with real coefficients, then both (x−z) and
(x−z) are factors of p(x). Therefore p(x) has as factor the quadratic polynomial (x−z)(x−z) =
x2 − (z + z)x + zz = x2 − 2ax + |z|. This quadratic has real coefficients and no real roots. This
fact allows us to move from the Fundamental Theorem of Algebra to the theorem we need.
69
Appendix C: Factoring Polynomials and Complex Arithmetic
Theorem: (Real Factors Theorem) If p(x) = pn xn + pn−1 xn−1 + . . . + p1 x + p0 is a degree n polynomial with p0 , p1 , . . . , pn ∈ IR, then there exist k real numbers r1 , r2 , . . . , rk , not
necessarily distinct, and m quadratic polynomials with no real roots x2 + al x + bl , al , bl ∈
IR, a2l − 4bl < 0, l = 1, . . . , m, not necessarily distinct, so that n = k + 2m and p(x) =
pn (x − r1 )(x − r2 ) . . . (x − rk )(x2 + a1 x + b1 )(x2 + a2 x + b2 ) . . . (x2 + am x + bm ).
Proof: Since IR ⊂ C, by the Fundamental Theorem of Algebra p(x) has n complex roots if we
count multiplicities. For each non-real root z ∈ C − IR, its conjugate is also a root. Thus we
can combine these two linear factors to form a quadratic factor of the form described in the
theorem. After we have done this for all properly complex roots, the remaining roots must be
real. This forces n = k + 2m as in the theorem.
After collecting multiple occurences of linear and quadratic factors together we have
p(x) = pn (x−r1 )e1 (x−r2 )e2 (. . .)(x−rk )ek (x2 +a1 x+b1 )f1 (x2 +a2 x+b2 )f2 . . . (x2 +aj x+bj )fj ,
where n = e1 + e2 + . . . + ek + 2(f1 + f2 + . . . + fj ), and a2l − 4bl < 0, for l = 1, . . . .j. In this
form the linear and irreducible quadratic factors are distinct.
A proof of the fundamental theorem of algebra is beyond the scope of this course. In
fact most undergraduate math majors never see a proof. However it lies behind a common
application in calculus.
Fact: A polynomial p(x) with real coefficients and odd degree must have at least one real root.
Proof: If the degree of p(x) is n = 2r + 1 for some integer r, then 2r + 1 = k + 2m, where k is
the number of real roots p(x) has. Thus k = 2(r − m) + 1 is odd. In particular it cannot be
zero.
So the graph of any odd degree polynomial crosses the x-axis an odd number of times.
Also the graph of a polynomial with even degree intersects the x-axis an even number of times.
Unfortunately the theorem does not help us find these.
As an aid for picking out likely rational roots we have the rational root test.
Theorem: (Rational Root Test) If the rational number p/q in lowest terms is a root of the
polynomial p(x) = an xn + . . . + a1 x + a0 with real coefficients, and an 6= 0, then p divides a0
and q|an .
70
Appendix C: Factoring Polynomials and Complex Arithmetic
For fun, you might try to prove this theorem. This test alone does not necessarily solve
our problem. For example if p(x) = x2 − 30 the rational root test says that the possible
rational roots of p(x) are ±1, ±2, ±3, ±5, ±6, ±10, ±15, ±30. However we can easily factor
p(x) and see that neither of its roots is rational. So testing p(r) for each of the above values of
r would prove fruitless, except to provide points on the graph of the function. Similarly if we
attempted to synthetically divide p(x) by x − r for the values of r we would be largely wasting
our time. So the rational root test should be employed very carefully, hopefully utilizing
additional theorems.
One additional theorem is DesCartes’ Rule of Signs. If we take the form of p(x) as given
by the real factors theorem and expand it then we see that the coefficients of p(x) change
sign for each time p has a positive real root, plus possibly two more times for each irreducible
quadratic factor. That is, the number of positive real roots of p either equals the number of
sign changes in coefficients of p or less this by an even number. Also the number of negative
real roots of p either equals the number of sign changes in p(−x), or is less than this number
by an even number. This rule can therefore help us be a bit more selective as to which and
how many values from the rational root test we check.
Exercises:
1. Find the quotient q(x) and the remainder r(x) when a(x) = 3x3 − x2 + 4x + 2 is divided by
b(x) = 2x2 + x − 1.
2. Given that p(x) = x6 − 8x4 + 6x3 + 7x2 − 6x has x − 2 and x − 1 as factors, factor p(x)
completely as per the real factor theorem.
3. List all possible rational roots of p(x) = 2x3 − 5x2 + 7x − 21.
4. Find all rational zeroes of the polynomial p(x) = x4 − 3x3 − 2x2 + 5x + 3. Then determine
any irrational zeroes and factor p(x) completely.
5. Find a third degree polynomial with zeroes at x = 2, x = 0 and x = −4 whose x2 term has
coefficient 7.
6. If z = 1 − 5i and w = −2 + 2i, write z 3 − 2zw + w2 in the form a + bi.
7. Given that x = 2 + 3i is a solution of x4 − 4x3 + 14x2 − 4x + 13 = 0, find all solutions.
8. Find the polynomial with real coefficients and leading coefficient 1 which has roots at −1, 2
and (1 + i) and degree 4.
71
Appendix E: Rotation of Axes
Given a non-degenerate quadratic of the form Ax2 + Bxy + Cy 2 + Dx + Ey + F = 0 we
can rotate axes to get a new coordinate system, where the corresponding quadratic equation
in these variables has no cross term.
Essentially we want to hold the origin fixed and tilt our heads counter-clockwise at
π
an angle θ where 0 < θ < . The Greeks determined that the appropriate angle to use is
2
A−C
A−C
1
, which means θ satisfies cot 2θ =
.
θ = arccot
2
B
B
To see that this works, with our heads tilted at such an angle we draw in new axes, call
them the z and w-axes. So θ is the angle between the positive x-axis and the positive z-axis and
also the angle bewteen the positive y-axis and the postive w-axis. Given a point P = (x, y) let
p
r = x2 + y 2 denote its distance to the origin. Let α denote the angle between the positive
z-axis and the line, l, joining the origin and P . Note that θ + α is the angle between the
positive x-axis and l. From right triangle trigonometry we know that x = r cos(θ + α) and
y = r sin(θ + α). With respect to our second set of axes P has coordinates z = r cos α and
w = r sin α.
w
y
....
... ...
.......
........
...
...
.
...
...
...
...
...
...
...
.
.
...
...
...
.........
...
...
.........
...
...
...... .
.
.
...
.
......
.
.
.
.
.
.
.
.
...
.
.......
...
...
.......
...
...
...........
...
...
...... ...........
.
.
.
.
...
.
.
.
...
..
...
.......
...
...
..........
......
...
...
.. ........ ........
...
...
... .............
.
.
... .. .......
.
.
...
... .. ......
..
.............
........
.
..
......
(x, y)
z
α
θ
x
We use the sum formulae for sine and cosine to write
x = r cos(θ + α) = r(cos θ cos α − sin θ sin α) = z cos θ − w sin θ
y = r sin(θ + α) = r(sin θ cos α + cos θ sin α) = w cos θ + z sin θ
72
Appendix E: Rotation of Axes
We can temporarily let c = cos θ and s = sin θ, so x = zc − ws and y = wc + zs. Substitution
into our general equation yields
0 = Ax2 + Bxy + Cy 2 + Dx + Ey + F
= A(zc − ws)2 + B(zc − ws)(wc + zs) + C(wc + zs)2 + D(zc − ws) + E(wc + zs) + F
which is a general quadratic equation in z and w of the form Gz 2 + Hzw + Iw2 + Jz + Kw + F .
By expanding the expression and collecting coefficients we find that
G = Ac2 + Bsc + Cs2
I = As2 − Bsc + Cc2
J = Dc + Es
K = −Ds + Ec
H = −2(A − C)cs + B(c2 − s2 )
But since θ satisfies cot 2θ =
(A − C)
we have by the double angle identities
B
cot 2θ =
c2 − s2
(A − C)
cos 2θ
=
=
sin 2θ
2cs
B
Cross-multiplying gives us 2(A − C)cs = B(c2 − s2 ). Thus H = 0 as desired.
Exercises:
1. Find the angle θ by which rotation of axes will clear the cross term from the equation
x2 − 2xy + y 2 + 6 = 0.
2. Find the angle θ by which rotation of axes will clear the cross term from the equation
√
x2 + 3xy + 2y 2 + 14 = 0.
73
Download