Introducing Functions of Several Variables into Lower Division Mathematics Sheldon P. Gordon The MAA’s committee on Curriculum Renewal Across the First Two Years (CRAFTY) has been conducting a project to determine the current mathematical needs of the partner disciplines. The first round of this Curriculum Foundations project [1] involved having leading educators from 17 math intensive disciplines participate in workshops to discuss and formulate recommendations to the mathematics community on what their students need to bring out of lower division mathematics classes. A second round of workshops involving additional disciplines was held recently and their reports will appear shortly [2]. Overall, there was an amazing degree of unanimity in the recommendations across virtually all disciplines, including the need 1. to emphasis conceptual understanding over rote manipulation, 2. to emphasize realistic problem solving via mathematical modeling, 3. for the use of data, particularly statistics, which is the most important mathematical area for virtually every discipline, 4. for the routine use of technology. Many of the disciplines also recommended an emphasis on functions of several variables. To a mathematician, that conjures up a request to do far more with partial derivatives, gradients, and multiple integrals. It also likely makes us wonder what those people really have in mind – how can we possibly include these Calculus III topics in lower division courses? It’s just not possible or reasonable! However, when one actually talks to people in disciplines such as physics, economics, chemistry, and so forth, a very different picture emerges. They intend essentially three basic themes that are quite appropriate for students at the college algebra level and above that have never been part of our traditional curricula. In fact, these are the same three multiple representation themes – symbolic, numerical, and graphical – that that have become the mainstays for looking at functions in all courses from college algebra up through calculus. In this article, we will discuss how these themes also apply to functions of two variables and provide a variety of illustrative examples to show precisely the kinds of mathematics that is being requested by the partner disciplines. Numerical Representations of Functions of Two Variables In most fields of endeavor, one routinely encounters information presented via tables and often those tables are sets of values that depend on two distinct quantities – that is, are functions of two independent variables. For instance, the wind-chill factor is an adjustment made to temperature readings T that takes into account the effects of the wind speed W and so provides a way of measuring how cold it feels. Thus, C is a function of both temperature T and wind speed W. Students in courses in many different fields are expected to read and interpret such tables and perhaps even create them. This involves a considerable amount of mathematical understanding and reasoning. We start with a simpler, though more artificial, example than the wind-chill factor. Consider the following table, where the independent values are x and y. If you carefully examine the values for the dependent variable z in the table, you will notice that, along every row, each successive value of z is 5 more than the previous value. That is, for each value of y, the row is a linear function of x and the slope of that line is precisely 5. We denote this slope in the x direction by mx. In the same way, along every column, each successive value is 30 more than the previous value. That is, for each value of x, every column is a linear function of y and the slope of that line is precisely 30/5 = 6. We write this slope in the y direction as my. As a result, z is a linear function of the two variables x and y, and the resulting equation is z = 5x + 6y – 11. The constant, -11, is the value of z that corresponds to x = y = 0 and is the vertical intercept. x y 0 5 10 15 20 0 -11 19 49 79 109 1 -8 22 52 82 112 2 -5 25 55 85 115 3 -2 28 58 88 118 4 1 31 61 91 121 5 4 34 64 94 124 6 7 37 67 97 127 The parallels (puns fully intended) with standard ideas about linear functions are very clear. This is not just a matter of adding a new dimension to a college algebra or precalculus class, but also of providing an opportunity to reinforce some of the fundamental concepts about linearity while introducing students to notions that are important in other subjects. In practice, very few functions that arise via tables are linear. The following table shows some of the values for the wind-chill factor C, with the temperature T in degrees Fahrenheit and the wind speed W in miles per hour. (We could write C = f(T, W), but at the level of these courses, this might not be the most effective notation for the students.) Thus, if T = 20 and W = 25 mph, the associated wind-chill factor is C = -15. That is, the combination of a temperature of 20F with a wind blowing at 25 mph makes it feel 15F. In comparison, the wind-chill factor associated with T = 25 and W = 20 is C = -3. However, if you examine the rows and columns, it is clear that this function is non-linear in both directions. Wind speed W Temperature T 5 35 33 30 27 25 21 20 16 15 12 10 7 5 0 0 -5 10 22 16 10 3 -3 -9 -15 -22 15 16 9 2 -5 -11 -18 -25 -31 20 12 4 -3 -10 -17 -24 -31 -39 25 8 1 -7 -15 -22 -29 -36 -44 Because this function isn’t linear, trying to predict the wind-chill factor when values for T and W are not listed can be challenging for many students. While none of the other disciplines mentioned interpolation, many did talk about the need for students to know how to work with proportions and there is much discussion in mathematics education circles about the importance of students developing the ability to do proportional reasoning. Thus, introducing some problems of this form may be very valuable. In particular, suppose that one wants to estimate the wind-chill factor corresponding to T = 18 and W = 23 mph. We use the wind-chill factor value of -17 corresponding to T = 15 and W = 20 mph. In the direction of increasing T (right to left in the table), the slope of the function is approximately 7/5 = 1.4; that is, the wind-chill factor increases by approximately 1.4 for every 1 increase in temperature at that point. Therefore, corresponding to an increase of 3, the change in the wind-chill factor will be roughly 3 × 1.4 = 4.2, assuming the wind speed is constant. Similarly, in the W direction, the slope when T = 15 and W = 20 mph is roughly -5/5 = -1; that is, the windchill factor decreases by approximately 1 for every 1 mph increase in wind speed at that point. Therefore, corresponding to an increase of 3 mph in the wind speed, the change in the wind-chill factor will be roughly 3 × -1 = -3, assuming the temperature is constant. Consequently, the total change in the wind-chill factor is approximately 4.2 - 1 = 3.2, and so we estimate that when T = 18 and W = 23 mph, C = -17 + 3.2 = 13.8. Most of the fundamental concepts of functions of a single variable, y = f(x), extend in a very natural way to functions of two (or more) variables. For instance, the formal definition of a function of two independent variables, z = f(x, y) says that for every possible pair of values for x and y, there corresponds precisely one value of the dependent variable z. Think about this in terms of the wind-chill factor. If two different values for C corresponded to a value for T and a value for W, then the table would not satisfy the definition of a function. Furthermore, the notions of domain and range similarly extend to functions of two (or more) variables. The domain consists of all possible pairs of values (x, y) for the independent variables x and y, and the range consists of all the corresponding values of the dependent variable z that arise. For instance, again think about the wind-chill factor function. What are the possible values for the independent variables? The table shows values of the temperature extending from 0 to 35 and wind speeds from 5 to 25 mph. If we accept these as the only possible values for T and W, then we have the domain of this function. The table shows that the corresponding values for the wind-chill factor C extend from -44 to 33 and this would be the range. Presumably, we could probably extend the values for T somewhat, say from a minimum of -30 to a maximum of 45 (wind-chill likely wouldn’t make sense at any warmer temperatures). Wind speeds could be extended to a minimum of 0 mph (we certainly can’t have a negative wind-speed) and might be considered as high as 50 mph. If that is the case, the domain would be T between -30 and 45 and W between 0 and 50 mph. The range would then consist of all the associated values for the wind-chill factor, but we cannot specify those additional values from the table. Of course, one could ask the students to use the web to search out the values of C for the extended range based on T between -30 and 45 and W between 0 and 50 mph. We can also extend some of the fundamental notions about the behavior of functions of a single variable to talk about the behavior of a function of several variables. With a function of one variable, it is simple to decide whether the function is increasing or decreasing –simply look at the associated table (or graph) and decide whether the values of the function get larger or smaller as the independent variable increases. But, with a function of two variables, it is not that simple because there are a number of different possibilities. A function of two variables could increase as both independent variables increase. The function could decrease as both variables increase. However, the function could increase as one variable increases, but decrease as the other variable increases. For instance, consider the table for the wind-chill factor. As the temperature rises and the wind speed remains the same, so that we look horizontally from right to left along each line of the table, we see that the values for C increase. Thus, C is an increasing function of T. Alternatively, as the wind speed rises and the temperature remains the same, we look vertically down each column and see that C gets smaller or eventually more and more negative. Thus, C is a decreasing function of W. Moreover, since C does not change in a linear manner in either direction, issues of concavity come into play. When the wind-chill factor increases, does it increase at an increasing or decreasing rate? When it decreases, does it decrease at an increasing or decreasing rate? Alternatively, one could raise the questions in a somewhat more dynamic form as: Does the wind-chill factor increase faster at low temperatures or at high temperatures? Does the wind-chill factor decrease faster at low wind speeds or at high wind speeds? Questions such as these make for good classroom discussions that engage students or for homework problems that require students to think carefully about the mathematical issues. Graphical Representations of Functions of Two Variables When most mathematicians hear the words “graph of a function of two variables”, what invariably comes to mind is a surface plot of the function. In almost all other disciplines, however, the standard graphical depiction is a contour diagram or contour map. Figure 1 shows the contour diagram for the heights (in feet) of the terrain on a hypothetical island. Each contour, or level curve, connects points that are all at the same height. The shore around the island is at sea level, so it is represented as the closed contour with an associated height of 0 feet. The other contours shown are all multiples of 500 feet. For instance, the 1000 contour represents all heights on the island that are 1000 feet about sea level. As such, these contour curves will be a series of closed loops. Moreover, when you look closely at the contour map in Figure 1, you will notice three distinct groups of closed loops, with each successive contour being smaller in size and larger in value. Each of these nested loops represents a mountain on the island. The peak toward the left is slightly more than 3000 feet high, the one to the right is somewhat more than 2000 feet high, and the one in the upper center is more than 3500 feet high. Clearly, the level curves represent a sample of heights, since it is not possible to give the height of every point. However, one of the “rules” for creating a contour plot is that there must be a uniform interval between contours values; this is known as the contour interval. In Figure 1, the contour interval is 500 feet. The minimum height displayed is 0 (sea level) and the maximum height shown is 3500 feet. The absence of a contour can also provide useful information. In Figure 1, the highest contour is 3500 feet. If there were any points on this island where the height reached 4000 feet, then we would see another contour corresponding to that height. Therefore we conclude that the maximum height must be more than 3500 and less than 4000 feet. Figure 2 shows the contour diagram (also known as a bathymetric map in oceanography) for the depths of the ocean bottom around the western end of Santa Catalina Island off the coast of southern California. The depths are measured in fathoms, which are equal to 6 feet. Think about the 50 fathom contour that represents points whose depth is 50 fathoms below sea level. If you were to extend this contour as far as possible, you should expect that it continues all the way around the island until it comes back to complete the loop, unless it completes a full loop around some local undersea depression or around some local bulge of the sea bottom or if there is some sharp discontinuity under the water, like a sudden chasm or a vertical wall. We can extract considerably more information from a contour diagram than just the height (or depth or size of a quantity) at the points along the different contours. The relative distances between adjacent contours also tell us a lot about the behavior of the quantity being investigated. In particular, regions where the successive contours are far apart indicate that the function is relatively constant there, so that any changes are slow and therefore the slope is quite small. On the other hand, regions where successive contours are very close together, as at the lower left of Figure 2, indicate that the function changes very rapidly, so that the slope is very large. Given the scale for a contour diagram, students can be asked to apply some simple ideas on the slope of a line to estimate the slope of the bottom of the ocean or of a terrain or the rate at which some other quantity is changing. All that is required is to select points on two different contours, so that one knows the change in “height”, measure the distance between the two points, and use the usual formula for the slope to estimate its value. As such, this is another nice way to reinforce the early ideas on linear functions. (Incidentally, from a more sophisticated point of view, one is actually estimating the value of the directional derivative at the initial point.) There are other, more exploratory, activities that one can do with contour diagrams. For instance, it is known that sediment from Catalina Island has created an underwater trench, called the Catalina Canyon, along which silt flows to the sea floor. Students, either individually or in small groups, can be asked to locate the head of this “canyon” and to draw a curve tracing the path of the silt flowing down the trench. Or, they can be asked to identify areas that might be conducive to having a beach (shallow waters that don’t drop too suddenly) or for snorkeling (waters where there may be very sharp drops). Algebraic Representations of Functions of Two Variables When faculty from other disciplines talk about students needing to know about functions of several variables, they often mean that students need to be able to work with certain basic formulas that arise in their discipline that represents important relationships between three (or perhaps more) quantities. For instance, in chemistry, one of the most important principles is the Ideal Gas Law, PV = nRT, that relates the pressure P of a gas, its volume V, and the temperature T. In the formula, n and R are two constants. Chemists love to assign pages of problems in which two of the three variables are given and students have to solve for the remaining variable. Similarly, one of the most important principles in economics is the Cobb-Douglas model that relates production P (the monetary value of all goods produced) to the labor input L and the capital input K via the formula P = AL K1-. Economists then expect their students to be able to solve for one of the three variables given values of the other two. In biology, a famous model relates the surface area S of a human being to a person’s height H and weight W via the formula S = 15.64W 0.425H 0.725, where W is measured in pounds and H in inches. The surface area is in square inches. Typical questions students are expected to answer are things like: Estimate the surface area of a person who is 70 inches tall and who weighs 180 pounds or Estimate the weight of a person whose surface area is 3500 square inches and who is 65 inches tall. These questions may not seem like very deep mathematics to us, but they are seemingly challenging to students in those other courses, possibly because the variables are not x’s and y’s. However, students in those other fields are also expected to interpret and use contour plots for functions such as these. While we addressed the major issues involved in interpreting contour maps earlier, we did not indicate how one actually constructs them based on a formula. This can actually be a lovely classroom activity at the college algebra or precalculus level that, once more, provides the opportunity to remind students about the properties, both behavioral and algebraic, of various important families of functions. Suppose that the contour diagram for the human surface area function S= 15.64W 0.425H 0.725 is to be plotted with H horizontally and W vertically. a. First, one must determine the shape of the contours. Typically, the level curves associated with functions such as these might be increasing and concave up, increasing and concave down, decreasing and concave up, or decreasing and concave down. Students can be asked to decide which of the four possibilities is correct. As a hint to get them started: Pick a value for S, say 3400 square inches, substitute this into the formula, and solve algebraically for W as a function of H. Then ask them to use their knowledge of the behavior of power functions to answer the question. b. Second, do the successive contours get larger or smaller in a certain direction? That is, as you move upward and toward the right, say, do the successive contours correspond to larger or smaller values for S? A hint here is to suggest that students use the contour corresponding to S = 3400 square inches from part (a) as a baseline and decide whether the “next” contour in that direction is S = 3300 or S = 3500. Alternatively, one can argue in a more dynamic, though less precise, way. What should happen to the surface area if a person’s height and weight both increase? While not mathematically conclusive, a good model should reflect the underlying process and students should learn to develop their mathematical intuition to understand and predict what will likely happen with a model. c. Third, what is the spacing between the successive contours – do they get closer together or further apart? Students can use the two contours from parts (a) and (b) along with a third contour with a comparable change in S, say either S = 3200 or S =3600, to decide whether the distances between the successive contours get smaller or larger using a graphing calculator or a spreadsheet. A little effort to generate an appropriate window in which all three curves can be seen clearly may be required and sometimes it can be rather hard to decide whether the spacing is increasing or decreasing. This is the case here, as seen in Figure 3, though the spacing appears to be increasing, albeit very slowly, as S increases from 3200 to 3600. Figure 3 Contour Plots of Linear Functions What does the contour plot associated with a linear function, z = ax + by + c looks like? There are many ways to proceed. One is to use some algebra. Let’s start with a simple application – the number of calories C in a peanut butter and jelly sandwich. Each gram of peanut butter, P, has 6 calories and each gram of jelly J has 2.5 calories. So, with white bread, the total number of calories is C = 150 + 6P + 2.5J. Students can then be asked to consider the contours corresponding to C = 100, 150, 200, …, say, to produce a series of normal equations for a set of lines and observe that they are all parallel (the same slope) and appear to be equally spaced, as seen in Figure 4. Figure 4 Alternatively, from a graphical perspective, each of the level curves is clearly a line. The slope in the P- (horizontal) direction is 6 and that in the J (vertical) direction is 2.5. Suppose you pick any point Q on some particular level curve. If you move one unit to the right, the value of C increases by 6 calories and if you also move one unit vertically upward, the value of C increases by 2.5 calories. See Figure 5. So, for every possible increase of 1 gram of peanut butter (move one unit over) and 1 gram of jelly (move one unit up), the effective change is to move the point and the vertical and horizontal lines in the figure as a group along the lower line, and the function increases by a total of 8.5 calories. The result is a new line that is parallel to the original line, but with the constant term 8.5 calories larger. That is, the contours consist of a collection of parallel and uniformly spaced lines. A third approach might be to work with a considerably simper equation, say z = 2x + 3y + 1. Students can be asked to create a table such as the first one in this article, (although a slightly larger table might actually be more effective), plot all the points on a sheet of graph paper along with the associated values of the function z, and then connect the points corresponding to the same values of z. (Hopefully, if the table is extended far enough, most of the values for z will occur at least three times to give a much more dramatic impact.) The results will again be a series of parallel, equally spaced lines. Multivariate Linear Regression There is one other aspect of functions of several variables that came up in many of the Curriculum Foundations reports, particularly those from the social sciences. Many of them mentioned the importance of multivariate linear regression. Just as linear regression involves finding the line that comes closest to all the points in a set of (x, y) data in the least squares sense, one can also find the plane that comes closest to all the points in a set of (x, y, z) data in the least squares sense; if there are more than two independent variables, then it is the hyperplane that comes closest to all the data points. While not a routine built into graphing calculators, this can be done quite easily using Excel. To perform multivariate regression, Excel requires that the dependent variable be denoted by Y and its values listed in a single column or a single row. The various independent variables are denoted by X1, X2, … and their values are also listed in separate columns or rows. That is fine is the data values are given in such a format. However, suppose we want to find the linear function that is the best fit to our data on wind-chill factors. The above table has 40 entries in a 5 × 8 array, but the format is inappropriate for performing the regression computations. Instead, it is necessary to re-write the values in a 40 × 3 array, if we choose to work with columns, in which all the wind-chill factor values are the first column (Y), all the temperature readings are in the second column (X1) and all the wind speed values are in the third column (X2). When multivariate linear regression is performed, the equation of the resulting plane is Y=1.315X1 -1.55X2 - 4.817, which is equivalent to WC=1.315T -1.55W - 4.817. Notice that the slope of this plane in the increasing T direction is 1.315, so that the windchill factor, on averages, increases by about 1.3 for each 1 increase in temperature. Similarly, the slope of the plane in the W direction is -1.55, so that the wind-chill factor, on average, decreases by 1.55 for each additional 1 mph that the wind is blowing. Another, more mathematical, application of these ideas that can be introduced into courses at the college algebra or precalculus level has to do with the way that polynomial regression may be performed on a calculator or in Excel. Suppose we have a set of (x, y) data and wish to fit a cubic polynomial, say, to the values. That is, we wish to find the function c0 + c1 x + c2 x2 + c3 x3 that is the best fit to the data. Instead of thinking of this as a polynomial, the usual computational approach is to think of it as a linear function of the three variables x, x2, and x3 and to perform multivariate linear regression on the associated values. That is, there is one column for the given y-values, another for the given x-values, a third for the squares of each of the x-values, and a fourth for the cubes of the x-values. The resulting linear function is the desired cubic polynomial. Conclusions The overwhelming majority of the students who take our introductory courses do so to fulfill requirements from the partner disciplines. As such, it is in all of our interests to offer courses that meet the needs of both the students and our colleagues. This is particularly helpful if we can introduce such topics in ways that allow us to revisit fundamental mathematical topics to reinforce them in our students’ minds in the process of extending them in new directions. References 1. Ganter, Susan and William Barker, Eds., The Curriculum Foundations Project: Voices of the Partner Disciplines, MAA Reports, Mathematical Association of America, 2004. 2. Ganter, Susan L. and W.E. Haver (Eds.), Partner Discipline Recommendations for Introductory College Mathematics and the Implications for College Algebra, MAA Reports, Mathematical Association of America, 2011 (to appear).