Lecture Notes for Transition to Calculus Department of Mathematics University of North Dakota Grand Forks, ND 58202 Fall 2003 Preface This collection of lecture notes is designed for use in Arts and Sciences 250 - Transition to Calculus, at the University of North Dakota. This course is not a comprehensive preparation for calculus. It is designed for either the student who needs a few gaps filled in order to be ready for calculus, or for the student who is in calculus, but needs to refresh their precalculus skills. c 2003 University of North Dakota Mathematics Department Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included at http://www.gnu.org/copyleft in the section entitled ”GNU Free Documentation License.” i TABLE OF CONTENTS Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1. Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 2. Polynomial Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 3. Rational and Algebraic Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 4. Partial Fraction Expansion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 5. Exponential and Logarithmic Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 6. Exponential Growth and Decay . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 7. Conic Sections. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .23 8. Basic Trigonometric Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 9. Special Values of Trigonometric Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 10. General Sinusoids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36 11. Trigonometric Identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 12. Applications of Trigonometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 13. Polar Coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 14. Parametric Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 Appendices A. The Real Number Line and the Cartesian Plane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 B. Lines. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66 C. Factoring Polynomials and Complex Arithmetic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 D. Solving Systems of Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . on line E. Rotation of Axes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 ii Introduction We denote the set of real numbers by IR, and the Cartesian Plane by IR2 = IR × IR. For more information on these sets see appendix A. Information on lines is contained in appendix B. These appendices should be read carefully before proceeding. A relation of real numbers is a subset of IR × IR. Des Cartes’ invention allows us to represent any relation of real numbers graphically. That is, the graph of a relation, R, consists of all ordered pairs (a, b) ∈ IR2 , where (a, b) ∈ R. Given a relation, R, its inverse relation, denoted by R−1 is the relation R−1 = {(b, a)|(a, b) ∈ R}. The graph of R−1 will therefore be the graph of R reflected across the line y = x. Commonly a relation will be describable by a system of equations and inequalities. Thus we may think of a relation as the set of pairs (a, b) for which the equations and inequalities E1 (a, b), E2 (a, b), . . . , En (a, b) are all true. So for example E1 (x, y) may be x2 = y 2 , and E2 (x, y) might be |x| ≤ 1. The relation defined by these is S = {(a, b)|a = ±b and − 1 ≤ a ≤ 1}. If we put R1 = {(a, a)|a ∈ IR} and R2 = {(a, −a)|a ∈ IR}, then R = R1 ∪ R2 is the relation defined by E1 . Moreover if we set T = {(x, y)| − 1 ≤ x ≤ 1}, then T is the relation defined by E2 , and S = R ∩ T . Notice that S = S −1 . In an attempt to deal with the general case we see that we must at least be able to determine relations described by a single equation, rule, or formula. Moreover it will be advantageous if one variable in the equation is completely determined by the other, since in this case every single input will yield exactly one ordered pair as output. This explains why we will spend most of our time considering functions. Section 1: Functions Given two sets of real numbers A and B, a function, f , is a relation of real numbers with the property that for every a ∈ A, there is exactly one b ∈ B, so that (a, b) is in the relation. Notationally we write f : A −→ B. Also when (a, b) is in the relation we write b = f (a). As alluded to above, we will usually be able to describe f by an equation in x and y. But since y is determined, given x, we can rewrite the equation to be y − f (x) = 0, where f (x) is some formula, or rule dependent on x. Equivalently we write y = f (x). Given a function y = f (x), where f : A −→ B, we call A the domain of f and B the codomain. We write A = dom f . Graphically a ∈ dom f if and only if the vertical line x = a 1 Section 1: Functions intersects the graph of f . In fact this vertical line test allows us to separate those relations which are functions, from those which are not. A relation f is a function only if every vertical line x = r intersects the graph of f at most once. The domain of a function can be equated with those values of r for which the intersection is non-empty. We will suppose that B = IR. However we are interested in the subset C of B consisting of all b ∈ B so that b = f (a) for at least one a ∈ A. This subset is the range of f . We write C = ran f . A number s ∈ IR is in the range of f only if the horizontal line y = s intersects the graph of f at least once. The end behavior of the functions describes how the output values tend as the inputs become large in absolute value. We write x −→ ∞ to mean x grows without bound positively, and x −→ −∞ to mean that x grows without bound negatively. In general the notation E −→ F means that the expression E is becoming arbitrarily close to the expression F . A line ax + by + c = 0 (not both a = 0 and b = 0) is an asymptote for a function if the graph of the function becomes arbitrarily close to the graph of the line. If a ∈ IR and f (a) ≤ f (b) for all b ∈ dom f , (a, f (a)) is a global minimum. If a ∈ IR and f (a) ≤ f (b) for all b in an interval I containing a, then (a, f (a)) is a relative minimum. Global maximum and relative maximum are defined similarly. A function for which f (−x) = f (x) will have a graph which is symmetric about the y-axis. Such a function is called an even function. A function for which f (−x) = −f (x) will have a graph which is symmetric about the origin. Such a function is called odd. An intersection of the graph of a function with a coordinate axis is called an intercept. A function may have many x-intercepts, but has at most one y-intercept. Given a function as a rule or formula we can use dom f , ran f , its end behavior, any asymptotes it has, any symmetry it has, and its x-intercepts and y-intercept to help sketch the graph of f . Conversely, given the graph of f we can describe each of the characteristics above. As the example in the introduction points out, we also need to be able to determine how functions interact. To begin with, given real functions f : A −→ IR and g : C −→ IR their sum is f + g : A ∩ C −→ IR defined by (f + g)(x) = f (x) + g(x). The difference, and g g product are defined similarly. The ratio of g by f is denoted by . The domain of consists f f 2 Section 1: Functions 1 . f Finally, the composition of g by f is denoted by (f ◦ g)(x) = f (g(x)). The domain of f ◦ g is of all x ∈ A ∩ C so that f (x) 6= 0. A special type of ratio is the reciprocal, denoted {x ∈ dom g|g(x) ∈ dom f }. An important case of the above operations occurs when y = g(x) = c, and c is a positive real number. In this case dom g = IR and if f : A −→ IR, then A∩ dom g = A = dom f . Now the function f1 (x) = (f +g)(x) has the same domain as f and its graph is {(x, f (x)+c)|x ∈ A}. Which is to say that we may graph f1 by translating every point of the graph of f up c units vertically. Similarly the graph of f2 = f − g can be made by translating every point of the graph of f down c units. The graph of f3 = (f g) can be formed by scaling every second coordinate of a point on the graph of f by a factor of c. Similarly the ratio of f by g can be graphed by scaling all second coordinates of points on the graph of f by a factor of 1/c. When g(x) = −1, the product f g may be graphed as the reflection of the graph of f across the x-axis. More generally if g(x) = d 6= 0 we can graph the function f g by reflecting the graph of f across the x-axis if necessary, and then scaling second coordinates by a factor of |d|. Another special case occurs when g(x) = dx, d 6= 0, or g(x) = x − c, where c > 0. In the second case when we compose g by f we have dom (f ◦ g) = {x ∈ dom g|g(x) ∈ dom f } = {x|x − c ∈ dom f } = {x + c|x ∈ dom f }. So the graph of this composition consists of {(x + c, f (x))|x ∈ dom f }. Which means that to graph the composition, we simply translate the graph of f to the right c units. Similarly if g(x) = x + c, where c > 0, then the graph of f ◦ g would be the graph of f shifted left c units. In the first case when d > 0 the graph of f ◦ g is {( xd , f (x))|x ∈ A}. So this composition can be graphed by compressing the scale of the x-axis by a factor of d. Notice that if 0 < d < 1, this compression turns out to be expansion by a factor of 1/d. When g(x) = −1, the graph of f ◦ g is the reflection of the graph of f across the y-axis. In general we reflect across the y-axis if d < 0 and compress the scale of the x-axis by a factor of |d|. Of very great interest is the possibility that g = f −1 . To begin with we must have that f −1 is a function, and not simply a relation. This means that the graph of f −1 must pass the vertical line test. Since the graph of f −1 is the graph of f reflected across the line x = y we have that the graph of f must satisfy the horizontal line test. That is, every horizontal line y = b intersects the graph of f at most once. In this case we say that f is a 3 Section 1: Functions one-to-one function. Now for every point (a, b) on the graph of f , so that b = f (a), we have a = f −1 (b). Consequently (f −1 ◦ f )(a) = f −1 (f (a)) = f −1 (b) = a. Also for b ∈ dom f −1 , (f ◦ f −1 )(b) = f (f −1 (b)) = f (a) = b. One last characteristic of a function we will investigate is its difference quotient, which f (x + h) − f (x) is found by forming the expression , where h is an infinitessimally small, but h nonzero, real number. The denominator is actually (x + h) − x, so the difference quotient is the slope of the line through the points (x, f (x)), and (x + h, f (x + h)). This slope describes how y = f (x) changes when x changes by h units. To illustrate the ideas we have developed let us consider a function of the form y = f (x) = xn , where n is a non-negative whole number. We call such a function a natural power function. Henceforth we will dispense with the case n = 0, since we get f (x) = 1. So in the sequel unless otherwise specified, natural power function will denote a power function f (x) = xn with n > 0. Notice that xn = x · x · . . . · x, with n factors makes sense for any x ∈ IR. We can therefore take dom f to be any subset of IR. Most often we will take dom f = IR. Next, if n = 2m, where m is a positive whole number, then f (x) = xn = x2m = (x2 )m ≥ 0, since x2 ≥ 0. Thus a function of this form has ran f ⊂ [0, ∞). Indeed graphing a few of these leads us to conclude that ran f = [0, ∞). On the other hand when n = 2k + 1, where k is a non-negative whole number, this is no longer true. In fact as x −→ ∞, xn −→ ∞ whether n is even or odd. But as x −→ −∞, x2m −→ ∞ while x2k+1 −→ −∞. So when n is odd, ran f = IR. Moreover we see that the end behavior of a natural power function is determined by whether n is odd or even. You might also have guessed that f (x) = x2m is even while f (x) = x2k+1 is odd. Every natural power function passes through (0, 0). In fact if xn = 0 we deduce that x = 0. So each natural power function f (x) = xn has exactly one x-intercept which coincides with its y-intercept. By definition, the reciprocal, g(x), of a natural power function f (x) = xn has domain 1 1 IR − {0}. A formula for g is g(x) = = n , which we also write as x−n . When n is even f (x) x ran g = (0, ∞). When n is odd, ran g = IR − {0}. As x −→ ∞, xn −→ ∞, when n > 0. The reciprocals of arbitrarily large positive numbers are infinitessimally small positive numbers. So as x −→ ∞, g(x) −→ 0. Likewise, the reciprocals of arbitrarily large negative numbers are infinitessimally small negative numbers. Thus as x −→ −∞, g(x) −→ 0. So g(x) has the 4 Section 1: Functions x-axis y = 0 as a horizontal asymptote. When n is even, g(x) gets arbitrarily close to 0 but is positive so we write g(x) −→ 0+ . When n is odd, the values of g(x) are negative when x is so as x −→ −∞, g(x) −→ 0− . By reversing our reasoning we realize that x = 0 is a vertical asymptote of g(x). We now write g(x) −→ ∞, as x −→ 0+ . When n is even we get g(x) −→ ∞ as x −→ 0− . When n is odd we have g(x) −→ −∞ as x −→ 0− . The functions of the form f (x) = xl where l is a whole number are the integral power functions. From the preceding paragraphs we see that the integral power functions consist of the natural power functions and their reciprocals. Next, when n is odd, any integral power function passes both the vertical and horizontal √ line tests. Therefore f −1 is a function. We use the formula f −1 (x) = n x. Alternatively we 1 have f −1 (x) = x n , from the multiplicative law of exponents. However when n 6= 0 is even, f (x) fails the horizontal line test. In this case f −1 is a relation but not a function. In order to partially correct this problem we restrict the domain of f to a set A for which the graph of f will pass the horizontal line test. By convention for n even we take A = [0, ∞), for n > 0 and A = (0, ∞), for n < 0. For this restricted function, f −1 is now a function and we use the same notation as the odd case. When we repeatedly combine natural power functions (including f (x) = 1) using sums, multiples, and differences we get the class of polynomial functions. When we compose integral power functions and inverses of integral power functions we get the rational power functions. √ For example if we compose f (x) = x3 by g(x) = x1/2 we get h(x) = (x3 )1/2 = x3/2 = ( x)3 . Lastly, when f is a natural power function the difference quotient for f takes the form f (x + h) − f (x) (x + h)n − xn xn + nxn−1 h + h2 k(x, h) − xn = = = nxn−1 + h(k(x, h)) h h h as long as h 6= 0, where h2 k(x, h) is the rest of the expansion of (x + h)n in terms of x and h. So as h −→ 0, the difference quotient of f approaches nxn−1 . We can also algebraically simplify difference quotient formulas for more general power functions. Exercises: 1. Find a simplified formula for the difference quotient of 1 a) f (x) = x √ b) f (x) = x c) f (x) = x2 − 4x + 6 5 Section 1: Functions 2. Read Section 2 and appendix C. 3. Let f (x) = ax2 + bx + c, with a 6= 0 and a, b, c ∈ IR. a) Complete the square on x. b) Use part a) to show that f is a shift and/or scale of y = g(x) = x2 . c) Use part a) to show that f has either one double zero, two real zeroes, or no real zeroes. 6 Section 2: Polynomial Functions To help you grasp the importance of this little section we encourage you to borrow a non-reform calculus text from another student, or the local library. There should be a chapter titled “Sequences and Series”. In this chapter will be a section titled “Taylor Polynomials”. This section is not normally covered in calculus until the second semester, so you’ll have to wait quite a while to appreciate the contents. The gist of the section is that any function can be approximated relatively well using polynomials. This is enormously important, since polynomials require no division, and completely accurate division is not easy to do on a computer. Also since polynomials are nothing more than linear combinations of natural power functions they are quite simple to utilize. Starting with a polynomial p(x) = an xn + an−1 xn−1 + . . . + a1 x + a0 , where an 6= 0 and a0 , a1 , . . . , an ∈ IR are constants, it is fairly clear that dom p is any subset of IR. We usually choose dom p = IR. Computing the range of an arbitrary polynomial can be more of a problem. Sometimes we can do it by inspection. The general computation often requires calculus. Since 1 ≤ x implies 0 < x we can successively multiply both sides of 1 ≤ x by x without changing the direction of the inequality. We arrive at the string of inequalities 1 ≤ x ≤ x2 ≤ x3 ≤ . . . ≤ xn . The point is that as x gets large, the leading term, an xn of a polynomial will dominate all other terms. So the end behavior of a polynomial is the same as the end behavior of its leading term. Thus if n is odd, the range of a polynomial will be IR. If n is even we may need calculus to specify the range, but at least we know a polynomial’s end behavior. Since n is an important parameter of a polynomial we give it a name, to wit the degree. Strictly speaking the degree of the polynomial p(x) = 0 is not defined. Some texts will set the degree of the zero to be −1, others will define its degree to be negative infinity. The graph of a polynomial of degree n will “waggle” no more than n − 1 times. Peaks of waggles are relative (sometimes global) maximums. Valleys of waggles are relative (sometimes global) minimums. This behavior is another feature of polynomials describable after just a modicum of calculus. So is the fact that polynomials have no asymptotes. A polynomial is even if and only if its only nonzero coefficients have even subscripts. A polynomial is odd if and only if its only nonzero coefficients have odd subscripts. In between the ends, the behavior of a polynomial is mostly dictated by the fact that its 7 Section 2: Polynomial Functions graph has no breaks, and that it can be factored into the form (see appendix C) p(x) = an (x−r1 )e1 (x−r2 )e2 (. . .)(x−rk )ek (x2 +b1 x+c1 )f1 (x2 +b2 x+c2 )f2 . . . (x2 +bj x+cj )fj , where n = e1 + e2 + . . . + ek + 2(f1 + f2 + . . . + fj ), none of the quadratic factors crosses the x-axis (so b2i − 4ci < 0), and the values r1 < r2 < . . . < rk indicate the x-coordinates of p(x)’s x-intercepts. The values r1 , r2 , . . . , rk are the roots of the polynomial. The y-intercept is (0, p(0)) = (0, a0 ). When we let qi (x) = (x − ri )ei , for i = 1, 2, . . . , k, and pi (x) = p(x)/qi (x), we see that “near x = ri ” p(x) will behave like a shift ri units right (left if ri is negative) of the function axei , where a = pi (ri ). This phenomenon is called local behavior. A fairly decent sketch of the graph of a polynomial can be constructed by piecing together the local behavior (including the end-behavior) so that the end result is a smooth curve with no breaks (this is using what is known as the Intermediate Value Theorem from calculus). The reciprocal of a polynomial is a special kind of rational function. Rational functions are the topic of the next section. A polynomial will be invertible as a function only if n is odd and the graph has no waggles. So a polynomial often has only a local inverse. In fact it can be somewhat difficult to algebraically find a formula for the local inverse. Try for example q(x) = x4 − 4x2 + 2. Some √ examples are easier, such as p(x) = x5 − 2. Here if y = x5 − 2, y + 2 = x5 . So x = 5 y + 2. √ Therefore p−1 (x) = 5 x + 2. The difference quotient of a polynomial is not too hard to compute given that the difference quotient of a sum is the sum of the difference quotients, and that the difference quotient of a constant times f is the constant times the difference quotient of f . So for example if p(x) = 3x2 − 4x + 5 its difference quotient is p(x + h) − p(x) [3(x + h)2 − 4(x + h) + 5] − [3x2 − 4x + 5] = h h 2 2 3(x + 2xh + h ) − 4x − 4h + 5 − 3x2 + 4x − 5 = h 3x2 + 6xh + 3h2 − 4x − 4h + 5 − 3x2 + 4x − 5 = h 2 6xh + 3h − 4h = = 6x + 3h − 4, as long as h 6= 0 h 8 Section 2: Polynomial Functions Exercises: 1. For each polynomial function sketch a graph of the function by transforming the graph of an integral power function. a) f (x) = x3 − 2 b) f (x) = −3x4 + 6 c) f (x) = 2(x − 3)5 − 1 d) f (x) = −(2x − 4)2 2. Use the zeroes and end behavior of each polynomial function to sketch an approximation of its graph. a) f (x) = (x + 1)2 (x − 2)2 b) f (x) = x(x + 2)2 c) f (x) = −x(x − 1)(x + 2) d) f (x) = (x + 1)(x − 2)2 3. A square box without a lid is to be constructed by cutting out four squares of side x from the corners of a 30in. by 30 in. piece of cardboard. Determine the surface area of the box as a function S of the variable x. Compute the surface area when x = 4. What is dom S? 9 Section 3: Rational and Algebraic Functions A rational function is one which is defined as the ratio of polynomials. So if r(x) is a p(x) rational function, r(x) = , where p(x) = pn xn + pn−1 xn−1 + . . . + p1 x + p0 and q(x) = q(x) qm xm + qm−1 xm−1 + . . . + q1 x + q0 6= 0 are polynomials of degree n and m respectively. Write p(x) = pn (x − r1 )e1 (x − r2 )e2 . . . (x − rk )ek (x2 + a1 x + b1 )f1 . . . (x2 + aj x + bj )fj and q(x) = qm (x − s1 )g1 (x − s2 )g2 . . . (x − sl )gl (x2 + c1 x + d1 )h1 . . . (x2 + ci x + di )hi as per the real factors theorem. If p and q have a common irreducible quadratic factor, we may cancel it since it is never zero for real inputs. If p and q have a common zero, a, then r(a) is not defined and otherwise for x 6= a, r(x) is the same as the ratio of p(x)/(x − a) by q(x)/(x − a), continuing to cancel linear factors of the form (x − a) until at most one of the numerator and denominator has x − a as a factor. In the sequel we can therefore focus our attention on rational functions where the numerator and denominator have no common factor (so r(x) is in lowest terms so to speak). If r(x) is a rational function in lowest terms, then r(x) is defined, unless q(x) = 0. So the domain of r(x) is all real numbers which are not roots of q(x). The range of a rational function cannot in general be computed without the aid of graphing or calculus. Supposing that we’ve already canceled common factors and p(x) and q(x) are as in the pn n−m x as |x| −→ ∞. So if n < m r(x) −→ 0 as first paragraph, r(x) will behave like qm x −→ ±∞. Thus the graph has y = 0 as a horizontal asymptote in this case. If n = m pn pn r(x) −→ as |x| −→ ∞. So now the graph has the line y = as horizontal asymptote. qm qm If n > m, by the division algorithm we can write p(x) = d(x)q(x) + s(x), where the degree of s(x) is strictly less than the degree of q(x). (s(x) is not zero here since we have already s(x) s(x) canceled common factors.) Now we write r(x) = d(x) + . Since −→ 0 as |x| −→ ∞, q(x) q(x) r(x) has the same end behavior as d(x). The graphs of r(x) and d(x) will become arbitrarily close as |x| −→ ∞. If n = m+1, then d(x) takes the form d1 x+d0 i.e. d(x) is a linear function. The line y = d1 x + d0 will be an asymptote for the graph of r(x). We call such an asymptote a slant asymptote to distinguish it from the vertical and horizontal asymptotes. In general a rational function r will have a vertical asymptote whenever c is a root of its denominator. To help draw the graph of r(x) we determine the behavior of r(x) as x −→ c+ and as x −→ c− . Only in special cases can we determine maxima and minima for a rational function. A rational function is even iff both its numerator and denominator are even, or both its 10 Section 3: Rational and Algebraic Functions numerator and denominator are odd. A rational function is odd iff the numerator is odd and the denominator is even or vice versa. A rational function in reduced form has x-intercepts wherever its numerator is zero. The y-intercept, if it exists, is r(0). If we have compiled the previous information for a rational function, we can usually present a good sketch of its graph. x2 − 2x − 3 , we have p(x) = x2 − 2x − 3 = (x − 3)(x + 1) and q(x) = x3 − 4x x3 − 4x = x(x − 2)(x + 2). So dom r = (−∞, −2) ∪ (−2, 0) ∪ (0, 2) ∪ (2, ∞). Since the degree Example 1 If r(x) = of p is strictly less than the degree of q, r(x) has y = 0 as a horizontal asymptote. The lines x = −2, x = 0 and x = 2 are vertical asymptotes for the graph of r(x). As x −→ −2− , p(x) −→ 5 and q(x) −→ 0− so r(x) −→ −∞. As x −→ −2+ , p(x) −→ 5 and q(x) −→ 0+ so r(x) −→ ∞. As x −→ 0− , p(x) −→ −3 and q(x) −→ 0− so r(x) −→ −∞. Similarly as x −→ 0+ , r(x) −→ ∞. Finally as x −→ 2− , r(x) −→ ∞ and as x −→ 2+ , r(x) −→ ∞. The function r(x) is neither even nor odd. It has x-intecepts when x = 3 and x = −1. The graph does not intersect the y-axis since r(0) is not defined. To facilitate sketching a graph of r(x) we also compute values for r(x) near the interesting values of x on the graph. x −3 r(x) −4/5 −3/2 6/7 −1/2 −14/15 1 4/3 5/2 −14/45 7/2 6/77 Not only do these function values help us decide how to scale our y-axis, they tell us that there is basically no hope of showing these points exactly. So the scale on our y-axis needs to be fine enough to approximate −14/45 as about −1/3, but no finer. 4 2 -4 -2 2 -2 -4 11 4 Section 3: Rational and Algebraic Functions Rational functions are interesting in that the sum, difference, product, quotient and composition of rational functions are again rational functions. In fact rational functions are linear combinations of horizontal shifts of integral power functions. The topic of the next section is the decomposition of a given rational function into a sum of simpler rational functions. In general the inverse (or partial inverse) of a rational function, is a rational function. 2x + 1 2y + 1 . Then to solve for f −1 algebraically we set x = f (y) = . x−1 y−1 This means x(y − 1) = 2y + 1. So xy − x = 2y + 1. If we collect all terms with y’s in them on Example 2: Let f (x) = the right hand side and move all other terms to the left hand side we arrive at xy − 2y = x + 1. x+1 So y(x − 2) = x + 1. Therefore y = f −1 (x) = . x−2 Difference quotients of rational functions tend to get pretty messy, which motivates generating more powerful methods in calculus. Linear combinations of rational power functions form the set of algebraic functions. After some algebra an algebraic function can be place in the form f (x) = g(x)/h(x). As with rational functions we will emphasize the analysis of algebraic functions where g and h have no common factors. The domain of an algebraic function is then {x ∈ dom g ∩ dom h|h(x) 6= 0}. As before we usually need more power to compute the range of a general algebraic function. End behavior of an algebraic function is determined by the dominate terms from its numerator and denominator. An algebraic function f intersects the x-axis only when g(x) is zero, and x ∈ dom h. Also f will have a vertical asymptote when h(x) = 0, and x ∈ dom g. √ (x − 1) Example 3 Let f (x) = √ . So g(x) = x − 1 with dom g = IR, and h(x) = x2 − 4x, x2 − 4x √ with dom h = {x ∈ dom p(x)|p(x) ∈ dom s(x)} where p(x) = x2 − 4x and s(x) = x and dom s = [0, ∞). So x ∈ dom h iff x2 − 4x ≥ 0. Which by means of a sign graph we find to be equivalent to x ∈ (−∞, 0] ∪ [4, ∞). So dom g ∩ dom h = dom h, and dom f = {x ∈ dom h|h(x) 6= 0} = (−∞, 0) ∪ (4, ∞). x x As x −→ ±∞, f (x) behaves like √ = . So f (x) −→ 1 as x −→ ∞, and f (x) −→ −1 |x| x2 as x −→ −∞. As x −→ 0− , g(x) −→ −1 and h(x) −→ 0+ so f (x) −→ −∞. As x −→ 4+ , g(x) −→ 3 and h(x) −→ 0+ , so f (x) −→ ∞. It is again true that we cannot determine extreme values without graphing/calculus. Our function f is neither even, nor odd. The graph of f neither intersects the x-axis, nor intersects the y-axis. After computing a 12 Section 3: Rational and Algebraic Functions few function values we sketch the following graph. 6 4 2 -2 2 4 6 -2 -4 You might suppose that as the complexity of a function increases (polynomial to rational function to algebraic function) so does the computation for finding f −1 algebraically, if it’s even possible. As with rational functions, algebraic functions have difference quotients that in general are pretty nasty. Some are not too bad. √ √ Example 4: Let f (x) = x + 3. Then f (x + h) = x + h + 3, and f (x + h) − f (x) = √ √ √ x + h + 3 − x + 3. The trick to proceeding is to multiply by 1 in the form ( x + h + 3 + √ √ √ x + 3)/( x + h + 3 + x + 3), since the resulting numerator is the difference of squares, 1 √ namely (x + h + 3) − (x + 3) = h. So the difference quotient reduces to √ . x+h+3+ x+3 Exercises: x2 − 1 . Find the domain of f . Then determine its end behavior, x3 + 3x2 − 4x − 12 and all asymptotes to its graph. Next find f ’s intercepts. Use all of this information to sketch 1. Let f (x) = and label a graph of f . 2. Determine a rational function which has vertical asymptotes at x = −1 and x = 2, a horizontal asymptote at y = 2, and x-intercepts at 0 and 4. x2 − 3x 3. Repeat exercise 1 for f (x) = . x+4 x−1 4. Algebraically compute the inverse function for f (x) = √ , or show it can’t be done. x+1 1 5. Find a simplified form for the difference quotient of f (x) = 2 . x −4 13 Section 3: Rational and Algebraic Functions 6. A rectangular region of pasture land is to be fenced using 2, 400 linear feet of fence. Say that the length of the region is l and the width of the region is w. Express the relation between l, w and 2, 400. Then express the area, A, of the region as a) a function of l only. b) a function of w only. 14 Section 4: Partial Fraction Expansion As alluded to in the previous section, every rational function is writable as the sum of p(x) simpler rational functions. Given a rational function, r(x) = , where p(x) and q(x) 6= 0 q(x) are polynomials of degree n and m respectively, we first reduce to the case that the numerator and denominator have no common factor (so r(x) is in lowest terms). If n ≥ m, by the division algorithm we can write p(x) = d(x)q(x) + s(x), where the degree of s(x) is strictly less than the degree of q(x). (s(x) is not zero here since we have already s(x) canceled common factors.) Now we write r(x) = d(x) + . So every rational function is q(x) the sum of a polynomial and a proper rational function which is one where the degree of the numerator is strictly less than the degree of the denominator. The partial fraction expansion of a proper rational function with non-zero, non-constant denominator q(x) = (x − s1 )g1 (x − s2 )g2 . . . (x − sl )gl (x2 + c1 x + d1 )h1 . . . (x2 + ci x + di )hi factored as per the real factors theorem, is the expression of r as a sum of proper rational functions whose denominators are powers of irreducible factors of q(x). A theorem, which requires linear algebra to prove, is that there are k + 2l real constants which make this expression unique, where k is the total number of real zeroes of q(x) and l is the total number of irreducible quadratic factors of q(x), both numbers reflecting multiplicities. If (x − s)e is one of the factors of q(x), then the partial fraction expansion of r will have e Ai terms of the form , as i = 1, 2, . . . , e. For each factor (x2 + bx + c)f with b2 − 4c < 0 (x − s)i Bj x + Cj as j = 1, 2, . . . , f . the partial fraction expansion of r will have f terms 2 (x + bx + c)j x2 − 2x − 3 , we have p(x) = x2 − 2x − 3 and q(x) = x3 − 4x = x(x − x3 − 4x 2)(x + 2). So r is a proper rational function whose denominator has three real zeroes all with Example 1 If r(x) = multiplicity one. So there are three real constants, call them A, B and C so that r(x) = A B C + + x x−2 x+2 To find these constants we clear denominators on both sides of the equation by multiplying through by r(x)’s denominator q(x). So hA B C i x2 − 2x − 3 = x(x − 2)(x + 2) + + x x−2 x+2 Ax(x − 2)(x + 2) Bx(x − 2)(x + 2) Cx(x − 2)(x + 2) = + + x x−2 x+2 =A(x − 2)(x + 2) + Bx(x + 2) + Cx(x − 2) 15 Section 4: Partial Fraction Expansion We then can equate coefficients to determine A, B and C, or we can substitute in three distinct values for x to get a system of three linear equations in the three unknowns A, B and C. For example if we set x = 0 in x2 − 2x − 3 = A(x − 2)(x + 2) + Bx(x + 2) + Cx(x − 2), then on the left hand side we get −3 and on the right hand side we get A(−2)(2) = −4A. We deduce that A = 3/4. If we next set x = 2 (is there a pattern here?) then on the left hand side we get 4 − 4 − 3 = −3 and on the right hand side we get B · 2(4). So B = −3/8. Finally we set x to be the root of x + 2, which is to say x = −2 and get 4 + 4 − 3 = 5 on the left hand side and C(−2)(−4) = 8C on the right. So C = 5/8. We can check that when we start with 3/4 3/8 5/8 − + x x−2 x+2 and put everything back over the common denominator x(x − 2)(x + 2), we get r(x). If instead we equate coefficients we first expand the left hand side to A(x2 − 4) + B(x2 + 2x) + C(x2 − 2x) = (A + B + C)x2 + (2B − 2C)x − 4A. So we arrive at the three linear equations A + B + C = 1, 2B − 2C = −2, and −3 = −4A. Which is simply an equivalent system of equations. x2 , then there are four real numbers A, B, C and D so (x − 2)2 (x2 + 3) A B Cx + D that r(x) = + + 2 . Clearing denominators yields x2 = A(x − 2)(x2 + 2 x − 2 (x − 2) x +3 3) + B(x2 + 3) + (Cx + D)(x − 2)2 Equating coefficients yields the system of equations Example 2: Let r(x) = 0 =A + C 1 = − 2A + B − 4C + D 0 =3A + 4C − 4D 0 = − 6A + 3B + 4D which can be solved by substitution, or elimination. Equivalently we can set x = 2 to obtain the equation 4 = 7B. which means B = 4/7. √ √ √ √ When x = 3i, we get (C 3i + D)( 3i − −2)2 = . . . = (D + 12C) + i 3(C − 4D) = −3. So C − 4D = 0 and D + 12C = −3. Which by substitution gives 49D = −3. So D = −3/49 and C = 4D = −12/49. Finally setting x = 1 gives 1 = −4A + 4B + (C + D). Substituting B = 4/7, C = −12/49 and D = −3/49 gives 1 = −4A + 16/7 − 15/49. So A = (1 − 16/7 + 15/49)/ − 4 = 12/49. 16 Section 4: Partial Fraction Expansion Partial fraction expansions are useful as a tactical device in integrating rational functions in the second semester of calculus, they are used in circuit analysis when network outputs are expressed as products of transfer functions and inputs under the Laplace transform. They are also used in combinatorics/discrete mathematics/computer science to solve recurrence relations by the method of generating functions. (In fact the last application is simply the discrete version of the Laplace transform procedure.) Exercises: Find the partial fraction expansion of each rational function. 1. f (x) = 2. f (x) = 3. f (x) = 4. f (x) = 5. f (x) = x2 − 1 x3 + 3x2 − 4x − 12 x+4 x2 − 3x x−1 (x + 1)2 1 2 x −4 3x4 − 2x2 + 4 (x2 + 1)2 (x − 2) 17 Section 5: Exponential and Logarithmic Functions If a is a positive real number we can use the laws of exponents, and the rules for rational p power functions to compute ap/q , where ∈ Q. It can be proven that for any irrational q number s there is a sequence r1 , r2 , . . . , rn , . . . of rational numbers with the property that as m approaches ∞, rm approaches s. We then define the quantity as as the value which arm approaches as m approaches ∞. The result is the base a exponential function y = f (x) = ax . Using a sequence of rational values to compute the value of as , where s is irrational is tantamount to plotting rational points on the graph and filling in the remainder of the graph by connecting the dots with a smooth curve. This is actually how we plot these curves and their shifts/scales. For an exponential function the value a = 1 is uninteresting. Moreover the function 1 1 y = ax , where 0 < a < 1 takes the form y = ( )x = x , where b > 1 and a = 1/b. So any such b b function can be analyzed as the reciprocal of an exponential function whose base is greater than one. Of all the exponential functions with base greater than one, the most special is the natural exponential function y = ex , where e is an irrational number approximately equal to 2.718281828. The actual value of e is the value (1 + h)1/h approaches as h approaches ∞. This particular value is very special because the slope of a line tangent to the curve y = ex at the point (r, er ) is simply the function value er . In fact this is the only base for which the base a exponential function has this property. Every exponential function y = ax , where a > 0, and a 6= 1 is one-to-one with domain IR and range (0, ∞). If a > 1 as x −→ −∞, ax −→ 0+ , while as x −→ ∞, ax −→ ∞. So every exponential function with base a > 1 has the y-axis as horizontal asymptote and no vertical asymptote. The reciprocal of such an exponential function has bx −→ 0 as x −→ ∞, and bx −→ ∞ as x −→ −∞. So it has opposite end behavior in one sense, and similar end behavior in another. Since the exponential functions with non-trivial base are all one-to-one, they have no relative extrema. In fact their function values either always increase as x increases, or always decrease as x increases. Therefore none of the basic exponential functions are even or odd. The basic exponential functions are never zero, so the graphs never intersect the x-axis. They all have y-intercept (0, 1). They are only distinguished from each other by their slopes. 18 Section 5: Exponential and Logarithmic Functions The difference quotient of y = f (x) = ax reduces as follows f (x + h) − f (x) ax+h − ax ax ah − ax ah − 1 = = = ax h h h h Set Ca to be the value so that (ah − 1)/h −→ Ca as h −→ 0. It is shown in calculus that Ce = 1, moreover e is the only value for which this is true. The inverse function of ax is the base a log function, denoted loga x, except when a = e in which case we denote it ln x and call it the natural log function. As inverses each pair of functions y = ax and y = loga x satisfy x = aloga x = loga ax . Especially for each positive number a we have a = eln a . Thus by the multiplicative law of exponents ax = ex ln a . So the base a exponential function is a horizontal scaling of ex by a factor of 1/ ln a. So if we understand the natural exponential function, we should be able to understand any other exponential function. ln x . So every logarithm function is a ln a vertical scaling of the natural log function. Thus it will suffice to understand the natural log In a similar fashion we can show that loga x = function. The natural log function has domain (0, ∞) and range IR. The end behavior and asymptotes of ln x are found using the fact that its graph is the reflection of the graph of ex across y = x. So as x −→ 0+ , ln x −→ −∞, and as x −→ ∞, ln x −→ ∞. The natural log function has no relative extrema nor any symmetry for essentially the same reason the function ex doesn’t. The natural log function has no y-intercept and x-intercept at (1, 0). We have already discussed the inverse of the natural log. We choose not to discuss its reciprocal or difference quotient. However there is another reason the natural log function is so “natural.” For t > 1, ln t can be defined as the area bounded by the curve y = 1/x, the x-axis, and the two vertical lines x = 1 and x = t. This is sometimes shown in the second semester of calculus. Finally we remark that the laws of exponents translate into the arithmetic properties of the log functions. For example Fact: For positive numbers a and b, ln(ab) = ln a + ln b. Proof: We write ab = eln(ab) on the one hand. We write ab = eln a eln b = eln a+ln b by the additive law of exponents on the other. Since ab = ab and ex is one-to-one, we are done. The other arithmetic properties of ln x are proven similarly. They include ln ab = b ln a, and ln ab = ln a − ln b. 19 Section 5: Exponential and Logarithmic Functions Exercises: 1. Sketch and label a graph of each function a) y = f (x) = 4x−2 + 1 1 b) y = f (x) = ( )x − 2 3 c) y = f (x) = 3e−2x e) y = f (x) = ln(x + 1) − 2 d) y = f (x) = ex − e−x 2. Determine the value of a CD in the amount of $3000 that matures in ten years and pays 6% per year a) twice a month, and b) once every 2 months. 3. Give the exact value of a) log27 3, and b) log3 27. 4. Use the properties of logarithms to write the expression ln √ 3x2 so that the result does (x − 2)7 not contain logarithms of products, quotients, or powers. 1 5. Rewrite the expression ln(2x) − 2 ln(x + 1) + ln(x − 2) as a single logarithm. 3 6. Solve the equation ln(x − 2) + ln(2x − 3) = ln 3 for x. 20 Section 6: Exponential Growth and Decay This section concerns the behavior of functions of the form y = ekx , where k ∈ IR − {0}. If k < 0, then because ln x has range IR, we know that k = ln b, for some real number b. From the graph of y = ln x we see that 0 < b < 1. So y = ekx = bx , where 0 < b < 1. Such an exponential function is the reciprocal of an exponential function of the form y = ax , where a = 1/b > 1. In a twist of fate if k > 0, then k = ln a for some a since ln x has range IR, and from the graph we have a > 1. This yields another explanation of why y = ex is the natural exponential function to work with: The horizontal scale y = ekx of the natural exponential function y = ex decreases if k < 0 and increases if k > 0. This feature allows us to easily model certain behaviors. The two cases we model correspond to k > 0 which we call exponential growth, and k < 0 which we call exponential decay. For exponential growth we start with a quantity Q0 of some material (Q0 > 0). If the amount of material grows exponentially as a function of time we get Q(t) = Q0 ekt , where the value of k tells us how fast Q is growing. The function Q(t) is completely determined by Q0 and k. In fact k can be determined by any two functional values Q(c) and Q(d), where c < d. Since Q(d) Q0 ekd Q(c) = Q0 ekc and Q(d) = Q0 ekd we get that = = ekd−kc . Applying ln to both Q(c) Q0 ekc Q(d) ln Q(d) − ln Q(c) sides gives ln = ln Q(d) − ln Q(c) = kd − kc = k(d − c). So k = . We Q(c) d−c Q(c) can also find Q0 since Q0 = kc . These simple functions can be used to model populations, e and principal in monetary accounts drawing interest compounded continuously. Exponential decay works similarly except we model quantities which are decreasing. Common examples include modeling the quantity of radioactive substances, and Newton’s Law of Cooling. In the case of modeling the quantity of some radioactive substance, the constant k is related to the half-life of the substance. The half-life, h, of a radioactive substance is the 1 amount of time it takes for a quantity measuring Q0 units to decay to Q0 units. This means 2 1 1 kh kh that Q0 = Q(h) = Q0 e . Which implies that = e . Applying ln to both sides gives 2 2 1 ln = ln 1 − ln 2 = − ln 2 = kh. So given one of h or k, we can compute the other. 2 In the case of exponential growth the corresponding idea to half-life, is called the doubling time. In this case if d is the doubling time we get dk = ln 2. 21 Section 6: Exponential Growth and Decay Exercises: 1. A biologist counts 6, 000 bacteria in a culture at 8:00 am. Eight hours later they count 18, 000 bacteria in the culture. Find an expression for the number of bacteria (in thousands) as a function of time t (in hours), where t = 0 corresponds to 8:00 am. How long, in hours, will it take for the population to reach 35, 000? 2. A 5kg mass of pretendium decays to 2.2 kg in 3 years. What is the half-life of pretendium in years? 3. A bottle of water at 68◦ F is placed in a freezer whose temperature is 26◦ F. The temperature of the water is measured as 54◦ F an hour later. From the time the bottle was initially placed in the freezer, how long, in hours, will it take for the water to be frozen? 22 Section 7: Conic Sections The general quadratic equation in x and y has the form Ax2 +Bxy+Cy 2 +Dx+Ey+F = 0, where A, B, C, D, E and F are real numbers. One goal of this section is to determine what the graph of such an equation looks like. We easily see that if all three of A, B and C are zero, then we really have a linear equation as in appendix B. So we begin by supposing that not all three of A, B and C are zero. Next we realize that if there is no cross term (i.e. B=0), we can complete the squares on x and y to simplify our task. If there is a cross term (B 6= 0), we can change our coordinate system to one where there is no cross term. This involves rotating axes, a trick that the ancient Greeks discovered. The details of this trick require some trigonometry. This is contained in appendix E. As soon as we know that it suffices to consider quadratic equations with no cross terms we can begin looking at specific cases. Subsection 1: Parabolas The first case is an equation of the form Ax2 +Cy 2 +Dx+Ey +F = 0, where exactly one of A and C is zero. Up to symmetry of argument we suppose that A 6= 0. Thus we have Ax2 +Dx+Ey+F = 0. D We next collect x-terms on one side of the equation to get Ey + F = −A(x2 + x). We A complete the square on x and simplify to D D2 D2 x+ − ) A 4A2 4A2 D 2 D2 = −A(x + ) + 2A 4A Ey + F = −A(x2 + D D2 and rewrite Ey + F − = −A(x − h)2 . 2A 4A If E = 0 we divide through by −A to get We set h = − D2 F − 2 4A A 2 D − 4AF = 4A2 (x − h)2 = √ D 2 − 4AF D 2 − 4AF Thus x − h = ± = ± . If D 2 − 4AF = 0 we get the “double” line 4A2 2A (x−h)2 = 0, which is x = h twice. If D 2 −4AF < 0 we get no graph. Otherwise D 2 −4AF > 0 r and we have x=h+ √ D 2 − 4AF and x = h − 2A 23 √ D 2 − 4AF 2A Section 7: Conic Sections The graph of which is two distinct vertical lines. These three situations are what we call degenerate cases. In the nondegenerate case (E 6= 0) we move on by dividing through by E to write y+ D2 −A F − = (x − h)2 E 4AE E −A D 2 − 4AF We set k = and write y − k = (x − h)2 . Finally we set c to be the value for 4AE E 1 −A 1 which = and get the standard equation of the parabola y − k = (x − h)2 . We can 4c E 4c graph this equation by shifting an appropriate scale of y = x2 . The point (h, k) is called the vertex. The line x = h is called the axis, short for axis of symmetry. Due to an application in optics the point (h, k + c) is called the focus. The line y = k − c is called the directrix. A nondegenerate parabola, P may be defined as the set of points equidistant from the focus and directrix. After translation we may suppose that the focus has coordinates (0, c) and the directrix, l, has equation y = −c. So if P = {(x, y)|d((x, y), (0, c)) = d((x, y), l)}, we p p have x2 + (y − c)2 = |y − (−c)| = (y + c)2 . So x2 + y 2 − 2cy + c2 = y 2 + 2cy + c2 . Which becomes x2 = 4cy. A parabola is also a conic section, the result of intersecting a plane with a double-napped cone. Here we suppose that the plane intersects only one of the two nappes. Moreover the plane is parallel to a generator of the cone. The parabolas we derived above are functions. The graphs open up and down. If we had worked on the case C 6= 0 instead we would have arrived eventually at the degenerate cases of a double horizontal line, no graph, and two horizontal lines. We would have then arrived at the 1 the standard equation x − p = (y − q)2 . This is a relation whose graph is a nondegenerate 4c parabola which opens left or right. Such a parabola has vertex at (p, q), focus at (p + c, q), axis with equation y = q and x = p − c as directrix. Subsection 2: Ellipses The second special case of Ax2 + Cy 2 + Dx + Ey + F = 0 to consider is that neither of A and C is zero and that they are both positive or both negative. If both are negative we have −Ax2 − Cy 2 − Dx − Ey − F = 0, with both −A and −C positive so let us suppose that A, C > 0. 24 Section 7: Conic Sections We first complete the squares on x and y to write A(x2 + D2 E2 E2 D E D2 2 x+ ) + C(y + y + ) + (F − − )=0 A 4A2 C 4C 2 4A 4C D E D2 E2 ,k=− , and R = −F + + . Thus A(x − h)2 + C(y − k)2 = R. 2A 2C 4A 4C Since A and C are strictly positive and the quantities (x−h)2 and (y −k)2 are nonnegative We set h = − the left hand side of the previous equation is nonnegative. So if R is negative we have a relation with an empty graph. Also if R = 0 we must have that both (x − h)2 = 0 and (y − k)2 = 0. This means that the graph of the relation consists of the single point (h, k). If R > 0 and R A = C then our equation becomes (x − h)2 + (y − k)2 = . This is the equation of a circle, A R centered at (h, k) with radius squared . These are the degenerate cases. A In the nondegenerate case we have R > 0 and A 6= C. Up to symmetry of argument we can suppose that A < C. Now we let a and b be the real positive numbers with 1 A 1 C = , and 2 = 2 a R b R Our choice of A < C forces a > b, which is an important convention. Our equation now reads (x − h)2 (y − k)2 + =1 a2 b2 This is the standard form of the equation. Its graph is called an ellipse. Similar to a parabola an ellipse can be defined as the set of points with sum of distances to two foci (= focal points) a fixed constant. To verify this, again suppose that (h, k) = (0, 0) and that the foci are at (±c, 0). Then if E = {(x, y) ∈ IR2 |d((x, y), (c, 0))+d((x, y), (−c, 0)) = k} we p p have (x − c)2 + y 2 + (x + c)2 + y 2 = k. Notice that k > 2c which is the distance between the foci. Also the points on the x-axis with coordinates (±a, 0) are on E iff a−c+a+c = 2a = k. p p So now we can write (x − c)2 + y 2 = 2a − (x + c)2 + y 2 . Which by squaring both sides p gives (x − c)2 + y 2 = 4a2 − 4a (x + c)2 + y 2 + (x + c)2 + y 2 . We cancel the y 2 terms and p expand the terms (x ± c)2 to get x2 − 2xc + c2 = 4a2 − 4a (x + c)2 + y 2 + x2 + 2xc + c2 . Which p p reduces to 0 = 4a2 − 4a (x + c)2 + y 2 + 4xc. We rewrite this as a (x + c)2 + y 2 = a2 + xc. Squaring both sides gives a2 (x + c)2 + a2 y 2 = a4 + 2a2 xc + x2 c2 . Expanding the term (x + c)2 gives a2 x2 + 2a2 xc + a2 c2 + a2 y 2 = a4 + 2a2 xc + x2 c2 . Collecting x terms and cancelling the 2a2 xc terms leaves (a2 − c2 )x2 + a2 y 2 = a4 − a2 c2 = a2 (a2 − c2 ). Here we use the fact that a > c > 0 and write b2 = a2 − c2 . Now b2 x2 + a2 y 2 = a2 b2 , which reduces to x2 /a2 + y 2 /b2 = 1. 25 Section 7: Conic Sections In general the graph of an ellipse is an elongated circle. The graph has two axes of symmetry = lines about which the graph is symmetric. These have equations x = h and y = k respectively. The line y = k joins the two points (h − a, k) and (h + a, k) on the graph. These are at distance 2a from each other. The other axis joins the two points (h, k − b) and (h, k + b) which are at distance 2b from each other. So in the nondegenerate case we call y = k the major axis, and x = h the minor axis. If C < A our convention puts our standard equation in the form (y − k)2 (x − h)2 + =1 b2 a2 1 C 1 A = and 2 = . Now x = h is the major axis and y = k is the minor axis. The 2 a R b R foci are at (h, k ± c), where again a2 = b2 + c2 . where In either case the constant c is less than a. The ratio of c to a is a measure of how noncircular the ellipse is. This measure is the eccentricity of the ellipse. Finally we can realize an ellipse as a conic section where the plane intersects one nappe and is not parallel to a generator of the cone. Subsection 3: Hyperbolas The last case to consider is Ax2 + Cy 2 + Dx + Ey + F = 0 where one of A and C is positive and the other is negative. We will start by supposing that we have an equation of the form Ax2 −Cy 2 +Dx−Ey +F = 0, where A, C > 0. As in the previous section we complete squares on x and y and simplify our equation by renaming certain quantities. We arrive at A(x − h)2 − C(y − k)2 = T . If T = 0 we get A(x − h)2 − C(y − k)2 = 0. We solve this for y in terms of x to get C C y = k + (x − h) or y = k − (x − h). The graph is the union of the two intersecting lines. A A This is the degenerate case. 1 A 1 C and 2 = and get If T > 0 we set a and b to be the positive constants so that 2 = a T b T the standard equation (x − h)2 (y − k)2 − =1 a2 b2 The line y = k serves as an axis of symmetry. The lines y = k ± a/b(x − h) serve as slant asymptotes. The foci have coordinates (h ± c, k), where c2 = a2 + b2 . 1 C 1 A If T < 0 we set a and b to be the positive constants so that 2 = and 2 = . The a −T b −T standard equation now reads (y − k)2 (x − h)2 − =1 a2 b2 26 Section 7: Conic Sections The graph of the equation is symmetric about x = h and asymptotically approaches the lines y = k ± b/a(x − h). The foci have coordinates (h, k ± c), where c2 = a2 + b2 . These are both examples of a hyperbola. A hyperbola is also defined as the set of points so that the magnitude of the difference of distances to two foci is a fixed constant. For hyperbolas the ratio of c to a is strictly larger than 1. This ratio is the eccentricity of the hyperbola. The derivation of the equation of the hyperbola is left to the interested reader. A hyperbola is also a conic section formed as the intersection of a double napped cone with a plane which cuts both nappes. Note on Conic Sections: The intent of this note is to describe the nomenclature used in the section. To make a right circular cone we use two coplanar lines. We call one, a, the axis and the other, l, the generator. We do not consider the case that a and l are perdendicular nor the case that a = l. A right circular cone, C,is generated when l is revolved about a in a circular fashion. If a and l are parallel, the result is degenerate. We call C a right circular cylinder. Otherwise a and l intersect in a unique point f called the fulcrum. The result is a double-napped cone. A conic section is the intersection of a plane P and a right circular cone C. If C is degenerate, one possibility is that the intersection of P and C is empty. (This is either the empty parabola or the empty ellipse.) Another possibility is that P is tangent to C, in which case the result is the “double” line parabola. When P is not tangent to C but contains two generators for C we get the degenerate parabola consisting of two parallel lines. If P is perdendicular to a the result is a circle. Otherwise P ∩ C is a nondegenerate ellipse. If C is nondegenerate we also get a degenerate section if P contains the fulcrum f . If such a plane misses both nappes we get a “point” ellipse. When P contains exactly one generator we get the “double” line parabola. Finally, if P contains two generators (and a), we get the degenerate hyperbola consisting of two intersecting lines. If C is nondegenerate and P does not contain f , the only degenerate section occurs when P is perpendicular to a. The result is a circle. The last three cases occur when P is not perpendicular to a, does not contain f and C is nondegenerate. These are the special conic sections which tend to have applications involving their reflective properties. We now see that every quadratic equation in x and y must represent a conic section and 27 Section 7: Conic Sections vice-versa. Hence y = 1/x when written xy − 1 = 0 is recognized as a hyperbola with major axis the line y = x. A rotation of axes by 45◦ will clear the cross term. So in retrospect we’ve been discussing conic sections since day one. Exercises: 1. Find the standard form of the equation for the parabola x2 + 4x − 4y + 8 = 0. 2. Sketch and label a graph of each parabola clearly indicating the vertex, focus, directrix and axis. 1 (x + 1)2 2 1 b) y − 3 = (x + 1)2 4 1 c) y − 3 = (x + 1)2 8 3. Find the standard equation of a parabola with focus (−1, 2) and directrix x = 1. a) y − 3 = 4. Sketch and label the graph of the parabola with equation 4x = (y − 1)2 clearly indicating the vertex, focus, directrix and axis. 5. Sketch and label a graph of each conic section clearly indicating axes of symmetry, foci, vertices and asymptotes if any. a) 9x2 + 25y 2 + 36x − 50y − 164 = 0 b) 9x2 − 16y 2 + 36x + 32y − 124 = 0 c) −9x2 + 16y 2 − 36x − 32y − 164 = 0 6. Find the standard equation of an ellipse with foci at (−1, 14) and (−1, −10) which also has one vertex at (−1, 15). 28