Comparing the Discrete and Continuous Logistic Models Sheldon P. Gordon Department of Mathematics Farmingdale State College Farmingdale, NY 11735 gordonsp@farmingdale.edu Mailing Address: 61 Cedar Road, E. Northport, NY 11731 Abstract The solutions of the discrete logistic growth model based on a difference equation and the continuous logistic growth model based on a differential equation are compared and contrasted. The investigation is conducted using a dynamic interactive spreadsheet. Keywords logistic growth, discrete logistic model, continuous logistic model, comparison of solutions One of the more “fun” topics that have entered the mathematics curriculum in the last few decades is the logistic growth model. Originally developed by the Belgian biologist Verhulst to model the growth of a population subject to an inhibiting effect as the population size increases toward a maximum sustainable level, the model has since been applied to a host of other applications in many different fields. For instance, it is used to model the spread of technological innovations and the sales of most products. There are actually two different logistic models that are in wide use. The original model is a continuous one based on the differential equation P’ = aP – bP2 and a subsequent discrete model based on the difference equation ∆Pn = aPn – bPn2. (1) In both models, a represents the initial growth rate of the population P or other phenomenon and b represents the inhibiting coefficient that eventually comes into play to slow the increase in the population growth when the population size becomes large relative to the maximum sustainable level (or product saturation or whatever else corresponds to an eventual horizontal asymptote as time passes). (2) In both models, the solution has the same logistic Figure 1: Graph of a Logistic shape shown in Figure 1. Curve (3) In both models, the limiting value L turns out to be equal to a/b. The horizontal asymptote corresponds to there being no further change in the population, so either P’ = 0 or ∆P = 0 and therefore P = L = a/b. (4) In both models, the solution achieves its inflection point at a height of ½a/b, or midway to the limiting value. The inflection point represents the point where the function is growing most rapidly, so that either P’ or ∆P must achieve its maximum and both are quadratic functions of P with negative leading coefficients, so the maximum occurs midway from P = 0 to P = L. The solution of the continuous model can be found by integrating the differential equation, using either partial fractions (perhaps the only reason for including partial fractions in calculus) or by the simple substitution P = 1/x (which totally avoids the need to do partial fractions). The solution is P(t ) P0 L P0 ( L P0 )e at , where P0 is the initial population and L = a/b is the limit to growth. There is no known closed form solution to the discrete logistic model; however, the defining difference equation can be rewritten as ∆Pn = Pn+1 - Pn = aPn – bPn2, so that Pn+1 = (1 + a)Pn – bPn2, and all subsequent terms after P0 can be readily calculated. That brings us to the subject of this article: Just how do the two solutions compare to one another? Or perhaps more importantly: How do the two solutions differ from one another? To begin, let’s define what we mean by compare, since one solution is continuous while the other is discrete. We will consider the values of the continuous solution only for the points t = 0, 1, 2, …, so that we can have a direct comparison between the values predicted by the two models. Moreover, consider the differential equation for the continuous model. We can approximate the derivative P’ using a Newton difference-quotient, so that P ( t h ) P (t ) h P’(t) = aP(t) – bP(t)2 . If we take the step size h = 1, we get the approximation P(t + 1) – P(t) aP(t) – bP(t)2 and this is effectively equivalent to the difference equation model. However, a step-size of h = 1 is rather large, which suggests that there should be major differences in the values predicted by the two models. To answer the two questions posed above, the author has developed a Dynamic Interactive Graphical (DIGMath) spreadsheet in Excel investigate precisely how the two solutions compare to 300 P one another. It is available for downloading from the 250 author’s website [1], along with a large variety of other 200 DIGMath modules for College Algebra/ Precalculus, Calculus, and Statistics. This spreadsheet allows the user 150 to change both of the parameter values a and b using 100 sliders, and also to change the initial population value P0, 50 with a slider. As a result, one has the ability to see the t, n 0 resulting changes in the two populations on a continuous 0 20 40 60 80 100 basis for ease of comparison. For instance, in Figure 2, Figure 2: Comparing the continuous (solid) and the discrete (dotted) logistic curves we show the results of this spreadsheet based on a = 0.50, b = 0.0018, and P0 = 19. With these parameter values, the maximum sustainable population is L = a/b = 284, which is represented in the figure by the upper horizontal line; the respective inflection points occur at a height of ½L = 142 and this is represented in the figure by the lower horizontal line. Clearly, the two curves are very similar, but one can see that there are some significant differences between them in the portion where both are increasing rapidly. On the other hand, if we change the parameters to a = 0.30, b = 0.0017, and P0 = 100, we get the solution curves shown in Figure 3, and the two curves are virtually indistinguishable from one another, even though the window used is somewhat smaller that that in Figure 2 to make the difference more apparent.. 200 P To get a better feel for how the two solutions 180 compare, the spreadsheet also produces a second graph, 160 140 one showing the difference in the two solutions as a 120 function of time. The result corresponding to the 100 parameters used for Figure 2 are shown in Figure 4 and 80 those corresponding to Figure 3 are shown in Figure 5. 6040 One of the striking features of the graph in Figure 4 is the 20 t, n 0 presence of a pair of spikes, or surges, in the values of the 0 20 40 60 80 100 difference function, but only a single spike in Figure 5. (We note that it is also possible to get situations where the differences are essentially identically zero with no Figure 3: Continuous (solid) vs. discrete (dotted) logistic curves for other indication of such a spike.) parameter values Difference of Populations vs. Time Difference of Populations vs. Time 30 0 0 20 40 60 80 100 25 20 -1 15 10 -2 5 0 0 20 40 60 80 100 -5 -3 Figure 4: Graph of the difference between the continuous and discrete logistic curves. Figure 5: Graph of the difference between a different set of continuous and discrete logistic curves To understand what is happening here, think about the situation. Both solutions start with the same initial population value P0, so we can think of them as being pinned to the same starting point and so the initial difference must be zero. Moreover, both tend to the same horizontal asymptote and so, eventually, the difference between the two solutions must become virtually zero. Furthermore, as mentioned previously, the point of inflection for both solutions corresponds to the level where each function is increasing most rapidly. Therefore, it makes sense that the greatest discrepancy between the two functions would occur at about the time that each is passing its point of inflection. Think of the curve that reaches this level first (to the left) as the “leading” function and the other as the “trailing” function. If you compare the graphs in Figures 2 and 4 and in Figures 3 and 5, it is evident that the principal spike in each occurs at the same time as the respective pair of functions pass the horizontal line representing the height of the inflection point. In fact, the greater the horizontal separation between the leading and trailing functions at this level, the more time that the leading function has to grow at its greatest rates, and so the taller the spike is likely to be. As for the second, minor spike in Figure 4, we speculate that this may also be related to the horizontal separation between the two curves at the inflection point level. Again, the larger the separation, the further ahead the leading function gets, and so the closer to the horizontal asymptote it gets before the trailing function reaches its maximum rate of growth. However, that means that the leading function would then be growing at a relatively much slower rate and so the trailing function will be growing considerably faster. In turn, it is reasonable to expect that the trailing function, which has a lot of ground to make up, will be growing so rapidly that it sometimes overshoots the leading function before it subsequently slows relatively quickly as it too begins to approach the horizontal asymptote. This would account for the second spike. Incidentally, if the two functions have minimal separation when they cross the inflection point level, it is reasonable to expect that there will not be a significant spike. In conclusion, we note that, based on the above two examples and many others that the author has examined, the two solutions are surprisingly close to each other. Consequently, it does not seem to make a significant difference which model we opt to work with, and so the choice may be based either on whether it is more convenient or desirable to have a closed form expression in terms of t or a recursion relation in terms of n from which all subsequent terms in the solution sequence can be calculated easily. The former approach can certainly be desirable, if not necessary, in performing various algebraic analyses, as in [2] where the author uses the closed form expression for a logistic function to calculate the doubling time for such a function as a function of time t. On the other hand, most practical applications are likely better served by the recursive approach. Finally, the author is reminded of Richard Hamming’s famous quote: “The purpose of computing is insight, not numbers.” Certainly, the use of Dynamic Interactive Graphical spreadsheets such as the one described here supports that perspective solidly! We encourage interested readers to experiment with these ideas personally and with their students to gain those deeper insights personally. Acknowledgement The work described in this article was supported by the Division of Undergraduate Education of the National Science Foundation under grant DUE-0442160. However, the views expressed are those of the authors and do not necessarily reflect those of the Foundation. References 1. The DIGMath Excel spreadsheet Comparing Discrete and Continuous Logistic Models can be found at and downloaded from farmingdale.edu/~gordonsp. 2. Gordon, Sheldon P., Doubling Times for Non-Exponential Families of Functions, (submitted). Biographical Sketch Dr. Sheldon Gordon is SUNY Distinguished Teaching Professor of Mathematics at Farmingdale State College. He is a member of a number of national committees involved in undergraduate mathematics education. He is a co-author of Functioning in the Real World and Contemporary Statistics: A Computer Approach and co-editor of the MAA volumes, Statistics for the Twenty First Century and A Fresh Start for Collegiate Mathematics: Rethinking the Courses Below Calculus. He is also a co-author of the texts developed under the Calculus Consortium based at Harvard.