Multivariable Calculus MA22S1 Dr Stephen Britton September 2013 Introduction The aim of this course to introduce you to the basics of multivariable calculus. The early part is dedicated to discussing curves and conic sections in two dimensions, and then moves to three dimensions, covering lines, planes and quadric surfaces. After this, we will move on to discussing vector-valued functions and the curves they represent. You will learn how to use differentiation to find tangent lines, normal vectors, slopes and more. The final part of the course deals with multiple integrals, specifically double and triple integrals to find areas and volumes. These notes are based on the textbook Calculus, Late Transcendentals, 10th edition, by Howard Anton, Irl Bivens and Stephen Davis, and published by Wiley. 1 1 Curves and Conic Sections in Two Dimensions To begin with in this course, we will only consider two directions. Until section 3, we can assume that everything is described in the xy-plane. 1.1 Parametric Curves If we consider a generic curve C in the xy-plane, is there a way to parameterise the motion using a single variable? We define the parametric equations of the motion of a particle as x = f (t) , y = g(t) , (1) where t is the parameter which describes where we are on the curve. The curve itself is called the graph of the parametric equations, or alternatively, the trajectory of the particle. An example is given in Figure 1. Generally y 3 2 Hx,yL 1 -2 2 -1 -2 4 6 8 10 x C -3 Figure 1: Trajectory C. speaking, t represents time in this course, but it could be any parameter that makes sense in a given problem. Often, t will be restricted to an interval [a, b], but if no interval is given, assume that the range is (−∞, ∞). This will be written x = f (t) , y = g(t) , (a ≤ t ≤ b) . (2) Problems will often ask you to sketch the graph of parametric equations. To do this, eliminate t. The resulting equation involving x and y should be something you can recognize. 2 Example: Sketch the graph of the equations x = 2 cos t , (0 ≤ t ≤ 2π) . y = 2 sin t , Solution: The easiest way to understand what this looks like is to notice that we can eliminate t by the doing the following x2 + y 2 = (2 cos t)2 + (2 sin t)2 = 4(cos2 t + sin2 t) = 4 . (3) This is clearly the equation of a circle of radius 2. To see what portion of the circle is graphed, lets look at a few values: t t=0: π t= : 2 t=π: 3π t= : 2 t = 2π : (x, y) (2, 0) (0, 2) (−2, 0) (0, −2) (2, 0) In other words, we have an entire circle, as is sketched in Figure 2. Example: Sketch the graph of the equations x = t − 1, y = 2t + 3 , (0 ≤ t ≤ 2π) . Solution: To eliminate t, solve both equations for t, i.e. t = x + 1, 1 t = (y − 3) . 2 We then equate these and simplify to get one equation. 1 x + 1 = (y − 3) 2 ⇒2x + 2 = y − 3 ⇒y = 2x + 5 . 3 2 Hx,yL 1 y t -2 -1 x 1 2 -1 -2 Figure 2: Circle of radius 2. y 15 10 5 -4 -2 2 4 x Figure 3: Plot of y = 2x + 5. We now can easily see that this is a line with slope 2 which intersects the y-axis at y = 5. It is plotted in Figure 3. Next, we define the orientation of a graph to be the direction of increasing parameter. In the example of the line above, the orientation is up the slope as that is the direction of increasing t. A curve with orientation is called a parametric curve. For the example of the circle, we had an anti-clockwise orientation. If we replaced t with −t, then x(−t) = 2 cos(−t) = 2 cos t, y(−t) = 2 sin(−t) = −2 sin t, which has clockwise orientation. 4 1.1.1 Expressing equations parametrically We can always describe the equation y = f (x) parametrically. The simplest way to do this is to take x = t and then y = f (t). Example: y = x2 + 1. Solution: x = t ⇒ y = t2 + 1. Of course, for something like a circle you can of course do the same. However, there are often better parameterisations to chose. In the case of a circle of radius a, the parameterisation x = a cos t, y = a sin t is often convenient. 1.1.2 Tangent lines for parametric curves Recall that the tangent line to a curve y = f (x) at a point P is the line that passes through P that has slope m = dfdt(x) . For a parametric curve, we can find the slope using the parameter t via the chain rule: dy dy/dt = . dx dx/dt (4) There are three special types of tangent lines: • If dy/dt = 0 and dx/dt 6= 0, the slope is 0 and we have a horizontal tangent line. • If dx/dt = 0 and dy/dt 6= 0, the slope is infinite and we have a vertical tangent line. • If dx/dt = 0 and dy/dt = 0, the slope is indeterminate and we have a singular point. These require special treatment. Let’s look at an example. Example: Find the tangent line of x = t, y = −t2 for t = 2. Solution: Of course, we can see that this is equivalent to y = −x2 , but let’s do this using the new method. First, the point we are interested in is P0 = (2, −4). We know that a line has an equation y = y0 + m(x − x0 ), and we can find the slope of the tangent lines since dy dy/dt −2t = = = −2t . dx dx/dt 1 5 (5) Therefore, at P0 m|P0 = −2(2) = −4 . (6) The equation of the tangent line is then y = −4 − 4(x − 2) = 4 − 4x . (7) This is sketched in Figure 4. y -4 -2 2 4 x H2,-4L -5 -10 -15 Figure 4: Tangent line to the function y = −x2 . 1.1.3 Arc Length of a Parametric Curve For a parametric curve x = x(t) , y = y(t) , (a ≤ t ≤ b) , (8) the arc length is given by Z b L= a s dx dt 2 + dy dt 2 dt . Example: Find the circumference of the circle x = a cos t , y = a sin t , (0 ≤ t ≤ 2π) . 6 (9) Solution: We find Z 2π p Z 2 2 L= (−a sin t) + (a cos t) dt = 0 2π at dt = 2πa . 0 If the curve doubles over, this will not work: in other words no point on the curve should be graphed by two values of t. In our example of the circle, if we integrated from 0 to 3π we would find 3πa, which is obviously not the circumference, because the values between 2π and 3π repeat those between 0 and π. 1.2 Polar Coordinates Up until now we have been dealing with the coordinates x and y, which are known as Cartesian coordinates or rectangular coordinates. We can equally describe a plane using any pair of coordinates so that each position on the plane is uniquely described by the pair. In particular a useful set of coordinates are polar coordinates. 1.2.1 Defining Polar Coordinates To define polar coordinates, we begin by fixing a point O, called the origin or pole. We then define a half-line (or ray) which begins at 0 and continues to infinity in a given direction, called the polar axis. This is shown in PHr,ΘL r Θ OriginHOL Polar axis Figure 5: Polar coordinates Figure 5. The distance from the origin to the point P is called the radial coordinate, r, and the angle that the line |OP | makes with the polar axis is called the angular coordinate, θ, more commonly known as the polar 7 angle. It is important to notice that the polar angle returns to its original position when the angle is 2π. Therefore, any angle greater than or equal to 2π is equivalent to an angle in the range (0 ≤ θ ≤ 2π). In fact, more generally, the angles θ − 2π n, θ and θ + 2π n are equivalent if n is an integer. 1.2.2 Relationship to Cartesian Coordinates It is quite easy to make a link between polar coordinates and Cartesian coordinates. The trick is to make the polar axis coincide with the x-axis. Then, we can see from Figure 6 that the following relationship holds y PHr,ΘL y=r sin Θ r Θ O x=r cos Θ x Figure 6: Polar coordinates in terms of Cartesian coordinates x = r cos θ , y = r sin θ , (10) y . x (11) which can also be written in the form r 2 = x2 + y 2 , tan θ = √ Example: Change the Cartesian coordinates (4, 4 3) into polar coordinates. Solution: √ r2 = 42 + (4 3)2 = 64 ⇒ r = 8 , 8 and √ 4 3 √ π tan θ = = 3⇒θ= . 4 3 The polar coordinates are therefore (r, θ) = (8, π/3). Example: Change the polar coordinates (3, 3π/4) into Cartesian coordinates. Solution: x = 3 cos 3π 3 = −√ , 4 2 and y = 3 sin 3 3π =√ , 4 2 √ √ The Cartesian coordinates are therefore (x, y) = (−3/ 2, 3/ 2). If you are asked to graph an equation in polar coordinates, the easiest thing 1.0 Π2 0.5 Π4 4 Π -1.0 -0.5 0.5 0 3 1.0 3 2 -0.5 1 2 1 3Π2 -1 -1.0 r=0 1 2 3 4 -1 Figure 7: Unit circle and the half-line θ = π4 . to do is to simply plot some points and see what happens. Later on, you may begin to recognise some of the graphs and be able to plot them from memory. For now, let’s look at some simple examples. If we fix r, say to r = 1, we get a circle, see Figure 7. Note also that once we hit θ = 2π, the graph repeats, and so we do not need to continue. Also in the same figure we see the plot of θ = π/4. We can either imagine that we can take negative values of r, 9 or as we would normally expect we could impose r ≥ 0 and plot a half-line, as is done here. Finally, let’s look at an example where both coordinates vary, say r = sin θ. Then we get the plot in Figure 8, where we see that the plot repeats values after θ = π, because the Cartesian coordinates are x = sin t cos θ, y = sin θ sin θ = sin2 θ ≥ 0. Π2 1.0 3Π4 -1.0 Π4 0.5 -0.5 0.5 Θ=0 or Θ=Π 1.0 Figure 8: Plot of r = sin θ. 1.2.3 Symmetry Tests Symmetry tests can be performed in polar coordinates by appropriate choice of shifts. • A curve in polar coordinates is symmetric about the x-axis if replacing θ with −θ leaves the equations unchanged. • A curve in polar coordinates is symmetric about the y-axis if replacing θ with π − θ leaves the equations unchanged. • A curve in polar coordinates is symmetric about the origin if replacing r with −r or equivalently θ with θ + π leaves the equations unchanged. Let’s look at an example. Example: Test the symmetry properties of r = sin θ. Solution: If we replace θ with −θ, we get sin(−θ) = − sin θ, so it is not symmetric about the x-axis. If we replace θ with π − θ, we get sin(π − θ) = sin θ, so it is symmetric about the y-axis. Finally, if we replace θ with θ + π we 10 1.0 1.0 Hr, ΘL Hr, Π-ΘL Hr, ΘL 0.5 -2 0.5 -1 1 -2 2 -1 1 -0.5 2 -0.5 Hr, -ΘL -1.0 -1.0 1.0 Hr, ΘL 0.5 -2 -1 Hr, Θ+ΠL or H-r, ΘL 1 2 -0.5 -1.0 Figure 9: Symmetry test get sin(θ + π) = − sin θ, or if we replace r with −r we get the same, so it is not symmetric about the origin. 1.2.4 Families of curves Circles: The following describe circles of radius a, r = a, r = 2a cos θ , r = 2a sin θ . (12) The first of these is a circle centered on the origin, the second centred on (a, 0), and the last centred on (0, a). Figure 10 shows these. Rose curves: The following describe rose curves for a > 0 and n integer, r = a sin nθ , r = a cos nθ . (13) If n is odd, the pattern is drawn when the values (0 ≤ θ < π) are taken, and for n even, the pattern is drawn when the values (0 ≤ θ < 2π) are taken. See what the look like: Section 10.2 of the text book has images. 11 1.5 1.5 1.0 1.0 0.5 0.5 0.5 -1.5 -1.0 -0.5 0.5 1.0 1.0 1.5 2.0 2.5 1.5 -0.5 -0.5 -1.0 -1.0 r=2a Cos Θ r=a -1.5 -1.5 2.5 2.0 1.5 1.0 0.5 -1.5 -1.0 -0.5 r=2a Sin Θ 0.5 1.0 1.5 -0.5 Figure 10: Families of circles Limaçons: The following describe limaçons for a > 0 and b > 0, r = a ± b sin nθ , r = a ± b cos nθ . (14) If a = b, these becomes cardioids. Again, see textbook for images. Spirals: Although there are many types of spiral, the only one we are interested in is the “Archimedean spiral”, which is the one you most likely imagine when someone says the word spiral. It has the equation √ r = a θ. (15) 12 1.2.5 Tangent Lines in Polar Coordinates If we take a general curve of the form r = f (θ) and write it in terms of its x and y coordinates, we would have x = f (θ) cos θ , y = f (θ) sin θ . (16) This is now a parametric curve in parameter θ, and so we can find the slope of the tangent line provided f (θ) is differentiable. We have dx dr = −f (θ) sin θ + f 0 (θ) cos θ = −r sin θ + cos θ , dθ dθ dr dy = f (θ) cos θ + f 0 (θ) sin θ = r cos θ + sin θ , dθ dθ (17) and the slope is dy = dx dy dθ dx dθ = dr r cos θ + dθ sin θ . dr −r sin θ + dθ cos θ (18) Example: Find the slope of the tangent line to the circle r = 2 cos θ at θ = π/3. Solution: We get dr dθ = −2 sin θ, and so 2 cos2 θ − 2 sin2 θ 2 cos 2θ dy = = = − cot 2θ . dx −2 sin θ cos θ − 2 sin θ cos θ −2 sin 2θ Therefore, the slope at θ = π/3 is m = − cot 1.2.6 2π 1 =√ . 3 3 Tangent Lines to Polar Curves at the Origin At the origin, we have r = 0 with angle θ0 , so assuming that origin, the equation becomes dr 0 + dθ sin θ0 dy = = tan θ0 . dr dx 0 + dθ cos θ0 dr dθ 6= 0 at the (19) The line with this slope is the line θ = θ0 , and so this line is tangent to the curve at the origin. To recall what this line looks like, return to section 1.2.2. 13 Example: Find the tangent lines at the origin of the four-petal rose r = sin 2θ . Solution: We must find all values of θ less than 2π for which r = 0. This gives us θ0 = 0, θ0 = π/2, θ0 = π, θ0 = 3π/2. These are the tangent lines and are shown in Figure 11. Note that the lines θ0 = 0 and θ0 = π coincide, as do θ0 = π/2 and θ0 = 3π/2. 1.0 Π 3Π Θ= , 2 2 0.5 Θ=0, Π -1.0 -0.5 0.5 -0.5 -1.0 Figure 11: r = sin 2θ 14 1.0 1.2.7 Arc Length in Polar Coordinates To find the arc length in terms of polar coordinates, we recall formula (9) and substitute the results of (17), giving 2 2 dx dy + dθ dθ 2 2 dr dr cos θ + r cos θ + sin θ = −r sin θ + dθ dθ dr (20) = r2 (sin2 θ + cos2 θ) + 2r cos θ sin θ dθ 2 dr dr − 2r cos θ sin θ + (sin2 θ + cos2 θ) dθ dθ 2 dr 2 , =r + dθ which results in Z β s r2 L= + α dr dθ 2 dθ (21) Let’s look at an example. Example: Find the arc length of the logarithmic spiral r = aebθ between θ = 0 and θ = π. Solution: Z πp (aebθ )2 + (abebθ )2 dθ L= Z0 π √ π a √ a√ = a 1 + b2 ebθ dθ = 1 + b2 ebθ 0 = 1 + b2 (ebπ − 1) . b b 0 1.2.8 (22) Area in Polar Coordinates In order to calculate area in polar coordinates, we need to find a way to parameterise the area. The first thing to remember is that any polar curve can be written in the form r = f (θ), and therefore we can parameterise in terms of θ. Then, if we choose two rays θ = α and θ = β such that the angles satisfy α < β ≤ α + 2π (in other words, the angles are different and are less than 2π apart, so that the area is only covered once), provided f (θ) 15 is continuous in this region, R, we can find the area. This concept is shown in Figure 12. We can obtain the area under the curve r = f (θ) by diving 6 5 r=fHΘL 4 3 R 2 Θ=Α Θ=Β -4 1 -2 0 2 4 Figure 12: Area in polar coordinates this region into wedges using a set of rays, θ = θ1 , θ = θ2 , · · · θ = θn−1 , which divide the region into small areas A1 , A2 , · · · An with angles ∆θ1 , ∆θ2 , · · · ∆θn . This is shown in Figure 13. The total area is then the sum of these small 6 5 r=fHΘL 4 A2 3 DΘ2 2 A1 DΘ1 An Θ=Β -4 -2 DΘn Θ=Α 1 0 2 4 Figure 13: Area in polar coordinates divided into wedges areas, A = A1 + A2 + · · · + An = n X Ak . (23) k=1 Each of these is the area of a sector, which has area given by 21 r2 θ. Using r = f (θ), and θ = θk for any ray in the wedge of area Ak , we arrive at the 16 discrete version of the area formula A= n X Ak ∼ k=1 n X 1 k=1 2 [f (θk )]2 ∆θk . (24) We can take the limit where these angles ∆θk vanish to give an integral form A = lim ∆k →0 n X 1 k=1 2 2 Z β [f (θk )] ∆θk = α 1 [f (θ)]2 dθ . 2 (25) Thus, the area of a region R under a continuous curve r = f (θ) and between θ = α and θ = β is Z β Z β 1 1 2 2 A= [f (θ)] dθ = r dθ . (26) α 2 α 2 The most difficult parts of this is generally finding the limits of integration. Sometimes you will be told explicitly. Important note: Always draw the region to help you identify the limits. Let’s look at a simple example. Example: Find the area of the circle r = 2a sin θ. Solution: This looks like the third types of circle Figure 10. Since we want the area of the whole circle, the limits are 0 ≤ θ ≤ π since the entire circle is above the x-axis. Note that for a circle centered on the origin r = a, the limits would be 0 ≤ θ ≤ 2π. The area is Z π Z π 1 2 2 (2a sin θ) dθ = 2a sin2 θ dθ A= 0 0 2 Z π 1 1 π 2 2 = 2a (1 − cos 2θ) dθ = 2a θ 0 − 0 = πa2 . 2 0 2 We can often make use of the symmetry of the region to simplify a problem. For example, the upper and lower semi-circles of a full circle are mirror images and obviously have the same area. Hence Z 2π Z π 1 2 1 2 a dθ = 2 a = πa2 , (27) 2 2 0 0 17 is clearly true. Let’s look at a more complicated example to help understand this. Example: Find the area of the cardioid r = 4 + 4 cos θ outside the circle r = 6. Solution: The region is the coloured region in Figure 14. The rays θ = θ1 5 -5 Θ=Θ2 5 -5 Θ=Θ1 Figure 14: Area of the cardiod r = 4 + 4 cos θ outside the circle r = 6 and θ = θ2 represent the limits of integration. We can find these limits by finding the intersections of the circle with the cardiod, 4 + 4 cos θ = 6 ⇒ cos θ = 1 π ⇒θ=± . 2 3 We can now see that there are two equally valid approaches: integrate from − π3 to π3 , or use symmetry to take twice the area from 0 to π3 . Obviously, we must subtract the area of the circle in this region from the area of the cardiod in this region. Let’s look at both methods. 18 Integration over whole region: Z π/3 Z π/3 1 2 1 2 (4 + 4 cos θ) dθ − (6) dθ A= −π/3 2 −π/3 2 Z π/3 1 = (16 + 32 cos θ + 16 cos2 θ − 36) dθ 2 −π/3 Z π/3 (16 cos θ + 8 cos2 θ − 10) dθ = −π/3 Z π/3 (16 cos θ + 4(1 + cos 2θ) − 10) dθ = −π/3 π/3 = (16 sin θ + 4(θ + 2 sin 2θ) − 10θ)−π/3 √ ! √ ! √ √ π π 1 3 π 3 π 1 3 3 = 16 +4 + − 10 +4 − − + 10 − − 16 2 3 2 2 3 2 3 2 2 3 √ = 18 3 − 4π . Notice that the first and second terms in the penultimate line are in fact the same. This is due to the symmetry about the x-axis. Using symmetry: Z π/3 A=2 (16 cos θ + 8 cos2 θ − 10) dθ 0 √ ! √ √ 3 π 1 3 π = 2 16 +4 + − 10 = 18 3 − 4π . 2 3 2 2 3 The procedure for equating the equations of the curves can fail to find some intersections. This is why it is always important to draw diagrams to ensure that you know how many intersection points to expect. 1.3 Conic Sections A conic section (or conic) is any curve that can be obtained as the set of points of intersection between a plane and a double-napped circular cone. These are the ellipse, parabola and hyperbola (note that a circle is a special case of an ellipse). The degenerate conic sections arise when the plane 19 passes through the vertex of the cone, and are the point, line and intersecting lines. The non-degenerate conic sections are shown in Figures 15 and the degenerate conic sections are shown in Figure 16. -1 -2 2 1 0 1 0 -1 -2 2 2 2 1 1 0 0 -1 -1 -2 -2 -2 -2 -1 -1 0 0 1 1 2 -2 0 -1 1 2 2 2 2 1 1 0 0 -1 2 -1 1 -2 0 -2 -1 -2 0 -1 -2 1 -1 2 Figure 15: Conic sections 20 0 1 -2 2 -2 0 -1 2 1 -2 2 2 1 1 0 0 -1 -1 -2 0 -1 2 1 -2 -2 -2 -1 -1 0 0 1 1 2 2 2 1 0 -1 2 -2 -2 1 -1 0 0 -1 1 2 -2 Figure 16: Degenerate conic sections 21 1.3.1 Types of Conic Section A parabola is the set of points in the plane equidistant from a fixed line called a directrix and a fixed point called the focus. A parabola is symmetric about the axis perpendicular to the directrix which runs through the focus, called the axis of symmetry. The parabola intersects this axis at its vertex. See Figure 17. 6 5 Axis 4 Focus3 2 Directrix 1 -2 -1 1 2 Figure 17: A parabola An ellipse is the set of all points in the plane that such that the sum of the distance from a point on the ellipse to two fixed points called foci is a constant. This constant must be greater than the distance between the foci. The midpoint of the line line joining the foci is called the centre. Note that a circle is a special case of an ellipse when the distance between the foci is 0, i.e. the foci coincide with the centre. The line segment across the ellipse through the foci is called the major axis, and the line segment across the ellipse perpendicular to this through the centre is called the minor axis. Finally, the endpoints of the major axis are called vertices. See Figure 18. -2 1.0 1.0 0.5 0.5 -1 Focus 1 Centre -0.5 -2 2 -1 Focus 1 -0.5 -1.0 -1.0 Figure 18: An ellipse 22 Major axis Minor axis 2 A hyperbola is the set of all points in the plane that such that the difference of the distance from a point on the hyperbola to two fixed points called foci is a positive constant. This constant must be less than the distance between the foci. This difference must be positive, and as a result there are two branches which curve around the different foci. The midpoint of the line segment between the foci is called the centre, and the line through the foci is called the focal axis. The line perpendicular to the focal axis and through the centre is called the conjugate axis. The intersections of the hyperbola with the focal axis are called vertices. A hyperbola also has a pair of associated lines called asymptotes, which are line such that as the hyperbola move towards ±∞ away from the centre, the distance between the asymptotes and the hyperbola approaches zero. See Figure 19. 4 4 2 2 -5 -5 5 5 -2 -2 -4 -4 Figure 19: A hyperbola. The points marked are the centre (middle) and foci (left and right). The horizontal red line is the focal axis and the vertical red line is the conjugate axis. 1.3.2 Equations of parabolas We denote the distance between the vertex and focus of a parameter as p. Then, the distance from the vertex to the directrix is also p. It is sometimes convenient to denote the focus as F (x, y). The standard positions of a parabola are those when the vertex coincides with the origin. Figure 20 sketches the standard parabolas. You should therefore recognise any equation of the form y 2 = ±4px , or x2 = ±4py , (28) as the standard equations of a parabola. To understand these equations, we return to the definition of a parabola. The distance from the directrix to 23 y y 2 2 1 1 FHp,0L 0 0.5 1.0 1.5 2.0 FH-p,0L x -2.0 -1.5 -1.0 -1 -1 2 2 y =-4px y =4px -2 -2 y y 4 -2 -1 1 3 -1 2 -2 1 -2 x -0.5 FH0,pL -1 x FH0,-pL -3 1 -1 2 2 x -4 x2 =-4py 2 x =4py -5 Figure 20: Standard equations of a parabola a point must be the same as the distance to the focus. Look at Figure 21. The distance to the directrix to Q is always simply y + p. The distance from p the focus, F , to Q is (y − p)2 + x2 . Therefore (y − p)2 + x2 = (y + p)2 ⇒y 2 − 2py + p2 + x2 = y 2 + 2py + p2 ⇒x2 = 4py , (29) as you would expect from the diagram. To sketch a parabola, determine which of the four types in equation (28) it is, and then use Figure 20 to determine which way it curves. Determine p from the equation and plot a couple of convenient points to ensure you are close. Useful points are the intersections of the parabola with a line parallel to the directrix running through the focus, see Figure 22. Please note that this is basically the same method given in the textbook, but I think the explanation is less confusing. Also, notice that we are always assuming that the vertex of the parabola is at the origin. If this wasn’t the case, we would have to find it. Example: Find the focus of the parabola y 2 = −16x and sketch the 24 y 4 3 Q 2 FH0,pL 1 -2 -1 1 -1 2 x y=-p -2 Figure 21: Equations a parabola y 4 3 2 H-2p,pL -2 H0,pL H2p,pL 1 -1 1 -1 2 x y=-p -2 Figure 22: Sketching a parabola parabola, focus and directrix. Solution: We see that we have a parabola of type y 2 = −4px and so p = 4. Since we are looking at the second type in Figure 20, the focus is at (−4, 0), and the parabola should be to the left of the y-axis. It is shown in Figure 23. More generally, we can have a parabola which has its vertex at any point. They have the following equations: Vertex (h, k), parallel to x-axis (y − k)2 = 4p(x − h) , opens to the right. (y − k)2 = −4p(x − h) , opens to the left. 25 (30) y 15 H-4,8L 10 5 H-4,0L -8 -6 -4 x=4 -2 2 4 x -5 H-4,-8L -10 -15 Figure 23: Sketching the parabola y 2 = −16x Vertex (h, k), parallel to y-axis (x − h)2 = 4p(y − k) , opens above. (x − h)2 = −4p(y − k) , opens below. (31) You should be able to recognise these equations as parabolas and reduce appropriate quadratic equations to these forms. It is often necessary to complete the squares to find the correct form of the equation. Example: Determine if y 2 − 6y − 2x + 1 = 0 represents a parabola and if so find p and the vertex position. Solution: We complete the square to get y 2 − 6y − 2x + 1 = 0 ⇒(y − 3)2 − 9 − 2x + 1 = 0 ⇒(y − 3)2 − 2x − 8 = 0 ⇒(y − 3)2 = 2(x + 4) . Hence, p = 1.3.3 1 2 and the vertex is at (−4, 3). Equations of ellipses We denote the length of the major axis of an ellipse by 2a and the length of the minor axis by 2b, with the distance between the foci denoted by 2c. In this context, it is sometimes convenient to denote the foci as F1 (x1 , y1 ) and F2 (x2 , y2 ). a is referred to as the semi-major axis and b as the semi-minor 26 axis. This is just terminology: obviously a number is not the same as a line! These numbers uniquely define the ellipse. Their ratio give the shape, and the magnitude gives the size of the ellipse. This is shown in the first diagram in Figure 24. If we consider the red triangles in the second diagram, we see 2 c 2 c Q 1 1 b -3 -2 -1 1 2 b -3 3 b -1 a -2 -1 P 1 c c 2 3 -1 a -2 -2 Figure 24: The semi-major axes and semi-minor axes for ellipses √ that the distance from the two foci to Q is simply 2 b2 + c2 . Since the sum of the distances from the foci to any point is the same, this should be the same as the sum of the distances from the foci to P , which is (a + c) + (a − c) = 2a. We therefore find the relations √ √ (32) a = b2 + c2 , or c = a2 − b2 . As for a parabola, the easiest types of ellipse to imagine are those centred on the origin. These are the standard positions of the ellipse, which give rise to the standard equations. These standard ellipses are shown in Figure 25. You should recognise any equation of the forms x2 y 2 + 2 = 1, a2 b or x2 y 2 + 2 = 1, b2 a (33) as a standard ellipse. To see where these come from, consider Figure 26. The 27 y 3 a 2 F1 H0,cL 1 b y 2 1 -2 b -1 1 2 x b -1 a F1 H-c,0L -3 -2 F2 H0,-cL F2 Hc,0L a -1 1 2 3 x -2 a -1 b -2 x2 y2 + =1 a2 b2 -3 x2 y2 + =1 b2 a2 Figure 25: Standard ellipses sum of the distances from the foci to Q should be 2a. We therefore have p p (x + c)2 + y 2 + (x − c)2 + y 2 = 2a p p ⇒ (x + c)2 + y 2 = 2a − (x − c)2 + y 2 p ⇒(x + c)2 + y 2 = (x − c)2 + y 2 − 4a (x − c)2 + y 2 + 4a2 (squaring) p ⇒4cx = −4a (x − c)2 + y 2 + 4a2 cx p = (x − c)2 + y 2 ⇒a − a cx 2 ⇒ a− = (x − c)2 + y 2 (squaring) (34) a 2 c ⇒a2 − 2cx + 2 x2 = x2 − 2cx + c2 + y 2 a 1 ⇒a2 − c2 = 2 x2 a2 − c2 + y 2 a x2 y2 ⇒1 = 2 + 2 a a − c2 x2 y 2 ⇒1 = 2 + 2 , a b 28 1.0 Q 0.5 -2 -1 1 H-c,0L 2 Hc,0L -0.5 -1.0 Figure 26: Equations of ellipses due to the relations (32). This is clearly the correct equation for the ellipse in Figure 26. To sketch an ellipse, you will want to determine whether the major axis is the x or y axis, i.e. which type of parabola in Figure 25 we are dealing with. Determine a and b, and plot the points where the ellipse intersects the x- and y-axes. More generally, we want to consider ellipses with centres that are not the origin. In this case, the equations look like (assuming b < a) Ellipse with centre (h, k), and major axis parallel to x-axis (35) (x − h)2 (y − k)2 + = 1 , a2 b2 Ellipse with centre (h, k), and major axis parallel to y-axis (x − h)2 (y − k)2 + = 1, b2 a2 (36) and to sketch the ellipse we follow more or less the same procedure with the new centre and using the points where the ellipse meets its major and minor axes for reference. Example: Determine if 9x2 + 4y 2 − 18x + 24y + 9 = 0 represents an ellipse. Sketch the resulting curve. 29 Solution: Completing the squares, we get 9x2 + 4y 2 − 18x + 24y + 9 = 0 ⇒(3x − 3)2 − 9 + (2y + 6)2 − 36 + 9 = 0 ⇒9(x − 1)2 + 4(y + 3)2 = 36 (x − 1)2 (y + 3)2 ⇒ + = 1, 4 9 (37) which is an ellipse centred on (1, −3), √ with the√major axis parallel to the y-axis, and a = 3, b = 2. Also, c = 32 − 22 = 5. It is sketched in Figure 27. y -1 1 -1 2 3 x F1 -2 -3 Centre -4 -5 F2 -6 Figure 27: The ellipse 30 (x−1)2 + 4 + (y+3)2 9 =1 1.3.4 Equations of hyperbolas When defining hyperbolas, we denote the distance between the vertices by 2a, the distance between the foci by 2c, and in analogy to the ellipse, we define a constant b via the formulae √ √ (38) b = c2 − a2 , or c = a2 + b2 . These quantities are drawn in Figure 28. We see that b is defined in such a 4 c 2 4 c -5 b -5 5 a -2 a 2 a V1 c V2 5 -2 -4 -4 Figure 28: Defining a hyperbola way that the asymptotes intersect the circle with diameter 2c. We sometimes find it convenient to denote the foci as F1 (x1 , y1 ) and F2 (x2 , y2 ), and the vertices as V1 (x1 , y1 ) and V2 (x2 , y2 ). If we look at one of these vertices, say V2 , we see that the difference of the distances from V2 to the foci is (c − a) + 2a − (c − a) = 2a . (39) Since the difference of the distances from the foci to every point is the same, it is always 2a. If the centre of the hyperbola is at the origin and the foci are on either the x- or y-axis, the hyperbola is in one of the standard positions, with the resulting standard equations. The parabola are shown in Figure 29 and the equations that you should recognise as standard hyperbola at the origin are x2 y 2 − 2 = 1, a2 b b with asymptotes y = ± x , a 31 y 2 x2 or − 2 = 1, a2 b a or y = ± x . b (40) y b y=- x a b y= x a y 5 F2 H0,cL b F1 H-c,0L -5 a y= x b 5 -a a F2 Hc,0L 5 a y=- x b -b -5 x -b a b 5 x -a -5 x2 y2 - =1 a2 b2 -5 F1 H0,-cL y2 x2 - =1 a2 b2 Figure 29: Standard hyperbolas The asymptotes are given the limit x, y → ∞ of y2 x2 = − 1, b2 a2 or y2 x2 = 1 + , a2 b2 (41) or y2 a2 = , x2 b2 (42) which gives the ratios y2 b2 = , x2 a2 or a b y = ± x , or y = ± x . (43) a b To draw a hyperbola from a standard equation, locate the foci and thus the focal axis. Next, locate the vertices and the asymptotes. If necessary, plot some extra points and then draw the hyperbola passing through the vertices, any extra points and approaching the asymptotes. More generally, we want to consider hyperbolas with centres that are not the origin. In this case, the equations look like Hyperbola with centre (h, k), and focal axis parallel to x-axis (44) (x − h)2 (y − k)2 − = 1 , a2 b2 32 Hyperbola with centre (h, k), and focal axis parallel to y-axis (45) (y − k)2 (x − h)2 − = 1 , a2 b2 Please note that when we write the hyperbolas this way, the asymptotes for the first case are the lines with slope ± ab passing through (h, k) and for the second first case the asymptotes are the lines with slope ± ab passing through (h, k). Example: Determine if x2 − 4y 2 + 2x + 8y − 7 = 0 represents a hyperbola, and if show, sketch the resulting curve. Solution: We have x2 − 4y 2 + 2x + 8y − 7 = 0 ⇒(x + 1)2 − 1 − 4(y − 1)2 + 4 − 7 = 0 ⇒(x + 1)2 − 4(y − 1)2 = 4 (x + 1)2 (y − 1)2 − = 1. ⇒ 4 1 This is clearly a hyperbola centred on (−1, 1). Its focal axis is parallel to the x-axis. The vertices are length a way from the centre, and since a = 2 by comparing with equation (44), the √ vertices are at (−3, 1) and (1, 1). To find the foci, recall that they are c = a2 + b2 away from √ √ the centre. Here √ 2 2 c = 2 + 1 = 5, and so the foci are at (−1 ± 5, 1). Finally, the asymptotes can be found by finding the lines that have slope ± 12 that pass through (−1, 1). Using 3 2 F 1 H-1- H-1,1L 5 ,1L F 2 H-1+ 5 ,1L 1 -6 -4 -2 2 4 -1 Figure 30: The hyperbola (x+1)2 4 y = mx + b , 33 − (y−1)2 1 =1 we find that for m = asymptotes are 1 , 2 b = 3 2 and for m = − 12 , b = 1 . 2 Therefore the x 3 x 1 + , y=− + . 2 2 2 2 This parabola is shown in Figure 30. y= 1.3.5 Reflection properties for conic sections Reflection property of parabolas: Consider a tangent line at a point P on the parabola. The angles between the tangent line and the line through P parallel to the focal axis and between the tangent line and the line connecting P to the focus are equal. Reflection property of ellipses: Consider a tangent line at a point P on the ellipse. The angles between the tangent line and the lines joining P to the foci are equal. Reflection property of hyperbolas: Consider a tangent line at a point P on the hyperabola. The angles between the tangent line and the lines joining P to the foci are equal. These reflection properties are drawn in Figure 31. 1.3.6 Applications of conic sections Some applications of conic sections include: Parabola: Parabolic reflector - used in headlights on cars, etc. to project a focussed beam of light. Ellipse: Planetary orbits - Kepler’s problem. More on this later. Hyperbola: Scattering of atomic particles - Rutherford’s experiment. 1.4 1.4.1 Rotated conic sections Quadratic equations All conic sections are special cases of the general quadratic equation Ax2 + Bxy + Cy 2 + Dx + Ey + F = 0 . (46) If A, B, C 6= 0 it is quadratic in x and y, and if they vanish, it reduces to a linear equation in x and y. Obviously B = 0 gives rise to the conic sections 34 y 4 1.0 3 Α P Α 0.5 Α P -2 2 Α 1 F -2 -1 1 2 -1 1 F1 x -0.5 2 F2 -1 -1.0 -2 P 1.5 1.0 F1 -4 0.5 -2 Α Α 2 F2 4 -0.5 -1.0 -1.5 Figure 31: Reflection properties we have already met when appropriate constants are chosen. However, if B 6= 0, then we have a cross-product term, which is new. The effect of this term is to rotate the axes of the conic section so that they no longer align with the x and y axes. To understand why this is true, consider the ellipse described by the foci F1 (1, 2), F2 (−1, −2) and a = 3, which is drawn in Figure 32. Based on the information we have, the point in this figure should satisfy p p (x − 1)2 + (y − 2)2 + (x + 1)2 + (y + 2)2 = 2a = 6 , (47) which we square to give p p 2x2 + 2 (x − 1)2 + (y − 2)2 (x + 1)2 + (y + 2)2 + 2y 2 + 10 = 36 . (48) If we now write p 2 p 2 (x − 1)2 + (y − 2)2 (x + 1)2 + (y + 2)2 = (26 − 2y 2 − 2x2 )2 , (49) when we simply we get 8x2 − 4xy + 5y 2 = 36 , 35 (50) y PHx,yL 2 H1,2L 1 -3 -2 -1 1 2 3 x -1 H-1,-2L -2 Figure 32: Rotated ellipse which has the xy term we expected. A useful tool to help identify conic sections is the discriminant, δ = B 2 − 4AC . δ < 0 represents an ellipse, with A = C and B = 0 being the special case of a circle. δ = 0 represents a parabola. Finally, δ > 0 represents a hyperbola. 1.5 Rotation of the x and y axes How do we relate these rotated conic sections to those we are familiar with? To do this, we must understand how to perform a rotation of the plane. If we begin with a coordinate system in polar coordinates x = r cos(θ + α) , y = r sin(θ + α) , (51) for θ constant such that the polar axis at θ + α = 0 coincides with the x-axis, we can easily imagine that we could equally construct a coordinate system which has it’s polar axis at α = 0, which would be x0 = r cos α , y 0 = r sin α . 36 (52) y 1.2 y' 1.0 P 0.8 0.6 r x' 0.4 0.2 0.5 1.0 x -0.2 Figure 33: Rotated coordinate axes These two systems are shown in Figure 33. We can use trigonometric identities to rewrite the unprimed coordinates as x = r cos θ cos α − r sin θ sin α , y = r sin θ cos α + r cos θ sin α , (53) which means that we can relate the coordinate systems by x = x0 cos θ − y 0 sin θ , y = x0 sin θ + y 0 cos θ . (54) These are known as the rotation equations. We can also invert these to get x0 = x cos θ + y sin θ , y 0 = −x sin θ + y cos θ . (55) We can eliminate a cross-product term by choosing a convenient coordinate system x0 y 0 . The trick is to find the correct angle to rotate by. Theorem: For the equation Ax2 + Bxy + Cy 2 + Dx + Ey + F = 0 , 37 (56) such that B 6= 0, the x0 y 0 -coordinate system obtained by rotating the xy-axes through an angle θ satisfying cot 2θ = A−C , B (57) then the equation in x0 y 0 -coordinates becomes A0 x2 + C 0 y 2 + D0 x + E 0 y + F 0 = 0 . (58) Proof: If we substitute equation (55) into equation (56), we get A(x0 cos θ − y 0 sin θ)2 + B(x0 cos θ − y 0 sin θ)(x0 sin θ + y 0 cos θ) + C(x0 sin θ + y 0 cos θ)2 + D(x0 cos θ − y 0 sin θ) + E(x0 sin θ + y 0 cos θ) + F = 0 , (59) which means A0 B0 C0 D0 E0 F0 = A cos2 θ + B cos θ sin θ + C sin2 θ = B(cos2 θ − sin2 θ) + 2(C − A) sin θ cos θ = A sin2 θ − B sin θ cos θ + C cos2 θ = D cos θ + E sin θ = −D sin θ + E cos θ =F. (60) A−C cos 2θ = cot 2θ = , sin 2θ B (61) B 0 = 2B cos 2θ + 2(C − A) sin θ cos θ = 0 (62) If we have then as required. Example: Rotate to the coordinates that coincide with the axes of the conic section x2 + 4xy − 2y 2 − 6 = 0. What type of conic section is it? Solution: We need to rotate by θ, where 1 − (−2) 3 = 4 4 ⇒ θ = 0.463648 rad = 26.5651o ⇒ sin θ = 0.447214 , cos θ = 0.894427 . cot 2θ = 38 Using these, we can substitute into equation (60) to get A0 = 2 , B0 = 0 , C 0 = −3 , F 0 = −6 , where of course, D0 = E 0 = 0 since there are no linear terms initially. The equation in the new variables is therefore 2x02 − 3y 02 = 6 x02 y 02 ⇒ − = 1, 3 2 which is a hyperbola. 1.6 Conic Sections in Polar Coordinates To find how to represent conic sections in polar coordinates, first we must state a theorem. Focus-Directrix Property of Conics: Suppose that a point P moves in the plane, with the motion determined by a fixed point (the focus) and a fixed line (the directrix) such that the focus does not lie on the directrix. Then, if the point moves in such a way that its distance to the focus divided by its distance to the directrix is a constant e, called the eccentricity, then the curve the point traces is a conic section. The eccentricity determines the conic section as follows: (a) a parabola if e = 1 , (b) an ellipse if 0 < e < 1 , (c) a hyperbola if e > 1 . (63) In the case of a parabola, this is fairly easy to see. For a parabola y = 4px curving to the right, we take the usual focus and directrix x = −p. However, for an ellipse or a hyperbola we must define the directrix. We will take the directrix to be the line x = a2 /c if we consider the standard shapes in Figure 34. Below we will call the point P , the focus F and the directrix D. In this notation, we require PF = e. (64) DP 39 y y 2 D FHx,yL D 0.5 -1 1 2 FHc,0L 3 x -2 -1 1 FHc,0L -0.5 -1 e=1 x=-p PHx,yL 1.0 1 2 x -1.0 0<e<1 -2 x=a2 c -1.5 y PHx,yL 1.5 D 1.0 FHc,0L 0.5 -4 -2 2 4 x -0.5 -1.0 e>1 x=a2 c -1.5 Figure 34: Conic sections defined and eccentricities For a parabola, we already know that the distance P F to the focus is equal to the distance to the directrix P D, so e = 1. For the ellipse, Recalling the fifth line of equation 34, we have 2 p c a (x − c)2 + y 2 = −x , (65) a c where the left-hand side is P F , and the bracketed term on the right is P D. Therefore, for an ellipse, c e= , (66) a which given 0 < c < a gives 0 < e < 1, unless a = 0 or c = 0. What would the conic section be in these situations? Finally, for the hyperbola, we know that the distance to the further focus minus the distance to the nearer focus is 2a. Therefore, in analogy with equation (34), we get p p (x + c)2 + y 2 − (x − c)2 + y 2 = 2a , (67) so performing the same procedures, we arrive at p c a2 2 2 (x − c) + y = x− , a c 40 (68) where the left-hand side is again P F and the right-hand side is P D since the directrix is the opposite side to the point compared with the case of the ellipse. Therefore, we also have e= c , a (69) for a hyperbola. Also note that for an ellipse, eccentricity can be regarded as a measure of the flatness of the ellipse: as e approaches 0, the ellipse becomes circular; as e approaches 1 the ellipse flattens out. Earth’s orbit has e = 0.017. See Figure 35 for plot of ellipses of different eccentricites. 1.0 0.5 -1.0 F -0.5 0.5 1.0 1.5 2.0 e=0 e=0.2 e=0.4 e=0.6 e=0.8 -0.5 -1.0 Figure 35: Ellipses with same focus and same semi-major axis, but different eccentricities 1.6.1 Polar equations of conics In order to derive the polar equations of conics, it is convenient to chose the pole to coincide with the focus, and the polar axis to be either parallel or perpendicular to the directrix. In addition, there are two choices for the direction the polar axis points. As a result there are four possibilities that one might consider. We will choose the directrix to be perpendicular to the polar axis, which will point towards the directrix. Figure 36 shows the construction. The distance between the pole and directrix is d, see Figure 36. d is constant is set by our choice of pole. We know that P F = eP D . 41 (70) PHx,yL D r Θ F r cos Θ Pole d Figure 36: Polar coordinates defined for conics For the situation in the diagram, we have P F = r and P D = d − r cos θ, and therefore ed . (71) r= 1 + e cos θ This can be used to represent a parabola, ellipse or hyperbola. In summary, for the different cases, we have (a) Directrix right of pole ed r= 1 + e cos θ (b) Directrix left of pole ed r= 1 − e cos θ (72) (c) Directrix above pole ed r= 1 + e sin θ (d) Directrix below pole ed r= 1 − e sin θ These are sketched in Figure 37. To sketch conic sections from their polar coordinates, we need to identify the relevant parameters. For a parabola, d tell us the distance from the directrix to the focus, and so we can immediately sketch it. Example: Sketch r = 2 . 1−cos θ Solution: This looks like (b) of (72) with d = 2 and e = 1. 42 HaL HbL D D F F HcL HdL D F F D Figure 37: Choices of polar coordinates for conics For an ellipse, if we consider the first diagram in Figure 38, we see that r0 = a − c , and so 1 a = (r1 + r0 ) , 2 r1 = a + c , (73) 1 c = (r1 − r0 ) . 2 (74) We also find r0 r1 = a2 − c2 = b2 ⇒ b = √ r0 r1 . (75) For an hyperbola, if we consider the second diagram in Figure 38, we see that r0 = c − a , r1 = c + a , (76) and so 1 c = (r1 + r0 ) . 2 1 a = (r1 − r0 ) , 2 We also find r0 r1 = c2 − a2 = b2 ⇒ b = Finally let’s do a quick example. Example: Sketch r = 2 . 1+2 sin θ 43 √ r0 r1 . (77) (78) 5 a b 2 r1 r0 c -5 5 1 a b -3 -2 -1 1 c 2 3 -5 -1 a a r1 -2 r0 Figure 38: Ellipses and hyperbolas in polar coordinates Solution: This looks like (c) of (72). We have d = 1 and e = 2. This is a hyperbola with directrix length 1 above the pole. Because e > 1, the denominator causes complicated behavior. At θ = 7π/6, sin θ = −1/2, causing the denominator to be 0, and a switch from r being positive to r being negative. At θ = 3π/2, sin θ = −1 and is the turning point of the upper branch of the curve. At θ = 11π/6, r returns to being positive. In 3 11Π6 5 y=2 y=43 y=23 2 3Π2 1 Π2 2 3 4 5 1 6 -5 -3 Π6 -2 -1 1 2 3 -1 Figure 39: Rough sketch of a hyperbola in polar coordinates short, we get r0 by setting θ = π/2 and r1 from θ = 3π as these represent 2 the vertices of the hyperbola, i.e. the turning points of the branches. Note that the upper branch appears for r < 0 because in the plot of the function 44 it is below the axis. The function and the hyperbola are drawn in Figure 39. Then, 2 2 2 2 r0 = = , r1 = = 2. (79) = 1 + 2 sin 3π/2 | − 1| 1 + 2 sin π/2 3 Therefore 2 1 a = (r1 − r0 ) = , 2 3 1.6.2 √ √ 2 3 b = r0 r1 = , 3 1 4 c = (r1 + r0 ) = . 2 3 (80) Planetary motion While you might see more of this in a mechanics course, we will here state Kepler’s Laws of Planetary Motion: 1. Law of Orbits: The motion of each planet traces an ellipse with the Sun at one of the foci. 2. Law of Areas: The line joining the Sun to the centre of the planet sketches out equal areas in equal times. 3. Law of Periods: The square of the orbital period of a planet is proportional to the cube of the semi-major axis of its orbit. Kepler’s second law is shown graphically in Figure 1.6.2. These laws are valid 1.0 0.5 -2 -1 Earth 1 -0.5 2 Sun -1.0 Figure 40: The Law of Areas. The two red areas represent two areas traced out during equal time periods. Of course, this is not to scale. for any celestial body under the influence of a single gravitational force. As 45 a result, these are only ever approximations as all the planets also exert gravitational forces on each other, but it is a very good approximation. The closest point between the bodies during the orbit is called the perigee (i.e. r0 ), and the farthest point is called the apogee (i.e. r1 ), which are also known as the perihelion and aphelion when the Sun is orbited, from the Greek word for the Sun, Helios. Kepler’s third law may be denoted 3 T = a2 , (81) provided a is measured in astronomical units (1AU=1.5 × 108 km), and T is 1.0 Apogee -2 D 0.5 Perigee -1 1 2 -0.5 -1.0 -1.5 ae a ae Figure 41: The length parameters for an orbit measured in Earth years so that T = 1 is the period of the Earth’s orbit and a = 1 is the distance to the sun. Normally, orbits are specified by eccentricity and the semi-major axis, so using c = ea and d= a a(1 − e2 ) a − c = − ea = , e e e (82) we get (a) r = (1 − e2 ) 1 + e cos θ (b) r = (1 − e2 ) 1 − e cos θ (83) 2 (c) r = (1 − e ) 1 + e sin θ 2 (d) r = (1 − e ) , 1 − e sin θ and r0 = a(1 − e) , r1 = a(1 + e) . (84) The various length parameters are shown in Figure 1.6.2. If there is time, we will consider Kepler’s laws in greater detail later. 46