1 Taking Place Value Seriously: Arithmetic, Estimation and Algebra

advertisement
Taking Place Value Seriously: Arithmetic, Estimation and Algebra
Roger Howe, Yale University
0. Introduction and Summary
Arithmetic, first of whole numbers, then of decimal and common fractions, and later of rational
expressions and functions, is a central theme in school mathematics. This essay attempts to
point out ways to make the study of arithmetic more unified and more conceptual through
systematic emphasis of place value structure in the decimal number system. The essay is
divided into six sections beyond this introduction. A supplemental set of exercises is planned
for a revised version.
Section one reviews the basic principles of decimal (base ten place value) notation. It points out
the sophisticated structure that underlies the amazing efficiency of decimal notation, and it
introduces the terminology of digit, denomination, and place value number, for the basic
constituents out of which base ten numbers are formed. Standard decimal notation is an extremely
compact way of expressing any whole number as a sum of place value numbers. To enable
discussion of the sizes of base ten numbers, the idea of order of magnitude of place value
numbers, and of general numbers, is introduced.
Section two is in four parts, which discuss how decimal notation permits efficient algorithms for
the four basic operations, of addition and subtraction, multiplication and division, for whole
numbers. The main theme is that the form of decimal numbers, as sums of the very special
numbers called place value numbers, together with the Rules of Arithmetic1, determines the main
outlines of the procedures for all the operations.
The discussion of division plays a pivotal role in the essay. There are two main conceptual
interpretations of the base ten expansion of a whole number, and both are tied up with division.
One interpretation of division involves successive division-with-remainder by 10. The other
interpretation involves approximation by place value numbers. This approximation interpretation
provides the basis for the long division algorithm. Long division is a capstone arithmetic skill: it
requires fluency with multiplication, subtraction, and estimation.
Section three discusses ordering, estimation and approximation of numbers. For comparing
numbers, a crucial idea is that of relative place value. The corresponding idea for approximation is
relative error. Both relative and absolute error can be controlled in terms of decimal expansions.
A key concept is significant digit. It is pointed out that relative accuracy of approximation
improves rapidly with the number of significant digits. It is usually unreasonable to expect to
know a “real-life" number (meaning the result of a measurement) to more than three or four
significant digits, and often one must settle for, and can live with, much less. Failure to appreciate
the limits of accuracy seems to be one of the most pervasive forms of innumeracy: it affects many
people who are for the most part quite comfortable with numbers. Scientific notation, which
focuses attention on the size of numbers and the accuracy to which they are known, is also
discussed.
1
By the “Rules of Arithmetic,” we mean what mathematicians refer to as the Field Axioms, and what are often mentioned in
the mathematics education literature as “number properties,” or just “properties.” (They are, in fact, not properties of numbers
but properties of the operations.) They are nine in number: four (Commutative, Associative, Identity and Inverse Rules) for
addition, four parallel ones for multiplication, and the Distributive Rule to connect addition and multiplication. Other rules
such as “Invert and Multiply,” or the formula for adding fractions, or the rules of signs for dealing with negative numbers, can
be deduced from these nine basic rules.
1
The fourth section discusses how place value notation can be extended from whole numbers to
decimal fractions – that is, fractions whose denominators are powers of 10. The procedures for the
arithmetic operations also extend seamlessly. This is a practical reflection of the fact that the Law
of Exponents can be extended to hold for all integers, not just whole numbers.
The ease of computation with decimals together with the efficient approximation properties
discussed in sections 2D and 3, make decimal fractions an efficient tool for practical computation,
especially since numbers coming from measurements are known only approximately. The key
features of place value notation have led to its very heavy exploitation in machine computation.
The fifth section combines the ideas from sections 2 and 3 in a short discussion of accuracy and
estimation in arithmetic computation. Focusing on decimal components permits a highly effective
treatment of this issue.
The last section explores the connections of decimal notation with algebra. It is suggested that
decimal numbers can be profitably thought of as “polynomials in 10," and the parallels between
decimal computation and computation with polynomials are illustrated. Moreover, the structural
understanding that would come with a study of arithmetic based on decimal components should
promote comfort with algebraic manipulation. Further parallels between decimal arithmetic and
algebra are explored in the accompanying problems.
The reader may wonder, how might the ideas in this essay be implemented? What would a
curriculum that emphasized place value look like? This essay will not have much to say on this
point, because there is not much experience, at least in the US, with such a curriculum. However,
between drafts of the essay, the author was looking at the US edition of the Singapore elementary
textbooks. They pay close attention to many of the points discussed here. Their treatment of the
Rules of Arithmetic is much less explicit than the present discussion, but the basic principles of
place value, and their implications for arithmetic, are developed at considerable length. Thus, the
Singapore curriculum can be said to be aligned with the ideas in this essay.
This essay addresses an issue which the author hopes will be of interest to a broad spectrum of
mathematics educators (a term we use inclusively, to comprise mathematics teachers of all levels,
as well as educators with professional interest in mathematics). It probably will meet with the
same problem which bedevils every teacher of mathematics – variations in sophistication and
background among its intended audience. Substantial effort has been made to make the contents
accessible to a fairly wide audience, but what one reader may find obvious, another may find
obscure, and vice versa. It is hoped that many readers can appreciate the broad message, that place
value can serve as an organizing and unifying principle across a surprising span of the elementary
(and beyond!) mathematics curriculum; and that individual readers will be patient with particular
parts which may seem either over- or under-elaborated, keeping in mind that other readers may
have the opposite view.
Acknowledgements: The author is grateful for comments on earlier drafts of this essay from the
MSSG committee members, from Johnny Lott and his associates in NCTM and ASSM, especially
Gail Englert, Bonny Hagelberger, and Mari Muri, and from Scott Baldridge, Thomas Roby, and
Kristin Ulmann. Thanks to Mel Delvecchio and W. Barker for vital production help.
2
1. The Decimal System: Place Value Numbers, Expanded Form and Order of Magnitude.
Our common decimal system is a highly sophisticated method for writing whole numbers
efficiently. It uses only 10 symbols (the digits: 0,1,2,3,4,5,6,7,8,9), arranged in carefully structured
groups, to express any whole number. Further, it does so with impressive economy. To express the
total human population of the world would require only a ten-digit number, needing just a few
seconds to write.
The efficiency of the decimal system is possible because of its systematic use of mathematical
structure – essentially, it enlists algebra in support of counting. Making this structure explicit may
help to make mathematics instruction more effective, by increasing conceptual understanding and
computational flexibility. It should promote numeracy by making students more sensitive to order
of magnitude, and to the estimation capabilities of place value notation. It should also bring out
the parallels between arithmetic and algebra, thus making arithmetic a preparation for algebra,
rather than the impediment that it now sometimes is [KSF, Ch.8].
We will call a whole number expressed in standard place value form a base 10 number. The first
thing to understand about decimal notation is that the base ten expression of a number implicitly
breaks the number up into a sum of numbers of a very special type. Thus
7,452 = 7,000 + 400 + 50 + 2
This is usually called expanded form. It is mentioned in many state standards documents. This
essay explores what would be entailed in making it central to arithmetic instruction.
We will call the summands in the expanded form of a number the place value components of the
number. Each place value component is a digit (possibly 0) times a power of 10:
7,000 = 7 × 1000 = 7 × (10 × 10 × 10) = 7 × 103
400 = 4 × 100 = 4 × (10 × 10)
50 = 5 × 10
= 5 × 101
2= 2×1
= 2 × 100
= 4 × 102
We will refer to any number of this form, regardless of whether it is being thought of as a place
value component of some particular number, as a place value number2. Also, we will call the
powers of 10 denominations. (In this, we follow [BP].) Thus, a place value number is a digit times
a denomination.
We will call the exponent involved in a denomination the order of magnitude of the denomination.
The order of magnitude also may be described as the number of zero digits used to write the
denomination, or one less than the total number of digits used. By the order of magnitude of a
(non-zero) place value number, we mean the order of magnitude of its denomination. Finally, the
order of magnitude of a base ten number is defined to be the order of magnitude of its largest
2
This term is taken from [Ock]. Remarkably, there does not appear to be a standard term in the literature for these numbers.
The author’s first ideas for a name were decimal monomial (rejected as too clinical) and very round number. This latter term
is still the author’s favorite, but it produced strong negative reactions among some mathematicians. However, it might be the
best term for potential classroom use.
3
(non-zero) place value component. Thus, 7,452 has order of magnitude 3, the same as its largest
decimal component, 7000; and the 400, the 50 and the 2 have orders of magnitude 2, 1 and 0,
respectively.
Note that the smallest number of a given order of magnitude is the denomination of that
magnitude, and the largest is one less than the next larger denomination. For example, the
numbers with order of magnitude 3 start at 1000, and continue consecutively until 9999, with
10,000 being the smallest number with order of magnitude 4. Note that numbers are sorted by
order of magnitude: any number of a given order of magnitude is larger than every number of a
smaller magnitude, and is smaller than every number of a larger magnitude. Thus 1000 > 999 and
9999< 10,000.
For this essay, “order of magnitude" replaces the usual naming of places. We want to replace the
standard terminology involving specific place names with this order-of-magnitude terminology in
order to be able to better discuss the relationship between different orders of magnitude. Often it
will be convenient to abbreviate “order of magnitude" to just “magnitude.”
A key point about the expanded form of a number is that no two of the place value components
have the same order of magnitude. This condition in fact characterizes the expanded forms: a sum
of place value numbers is the expanded form of a base ten number exactly when it involves at
most one summand of any given order of magnitude. The non-zero digits of the decimal
expression of the number are then just the digits coming from the place value number summands,
each digit in its corresponding place. Besides these, for orders of magnitude smaller than the
magnitude of the number, if the place value number of that magnitude is “not there,” i.e., is zero,
then one puts a zero in the corresponding place of the number, to indicate the absence of any
multiple of that power of 10, so that the orders of magnitude corresponding to the non-zero digits
can be read correctly. This is the principle of place value. Thus
82 = 80 + 2
802 = 800 + 2
80,020 = 80,000 + 20
= 8 × 10 + 2 × 1
= 8 × 100 + 0 × 10 + 2 × 1
= 8 × 10,000 + 0 × 1000 + 0 × 100 + 2 × 10 + 0 × 1
There is nothing to prevent writing zeroes in places to the left of the largest non-zero digit. Doing
so does not change the value of the number. Thus
0082 = + 0 × 1000 + 0 × 100 + 8 × 10 + 2 × 1 = 8 × 10 + 2 × 1
On the other hand, there is usually no reason to do so, and it takes extra work. Accordingly, it is
very rarely done.
Although the above describes the base ten expansion of a whole number, the mathematician would
like something more: an interpretation of the base ten expansion, in terms of the structure of
arithmetic. There are (at least) two such interpretations, quite different from each other. The more
obvious one is in terms of division by 10; the other appeals to ideas of approximation. It is quite
appropriate that there should be two such distinct interpretations, reflecting two distinct but
important aspects of base ten notation. It is because both interpretations involve considerations
that are important in long division, that we do not attempt to develop them here, but instead
discuss them in section 2D on long division.
4
2. Decimal Components and the Algorithms of Arithmetic.
The fact that base ten numbers are sums has a pervasive influence on the methods for computing
with them. In fact, the procedures for carrying out the four arithmetic operations with base ten
numbers are largely determined by the fact that base ten numbers are sums of their place value
components, together with the Rules of Arithmetic (see footnote, page 1). The special form of
place value numbers has the consequence that adding or multiplying two of them substantially
reduces to the basic “addition facts" or “multiplication facts" for adding or multiplying one-digit
numbers. This is the reason for the key role those facts play in doing decimal arithmetic.
2A. Addition
The basic strategy for adding two base ten numbers is to break each number (summand) into its
place value components, and, for each order of magnitude, to add the components of that
magnitude. These sums are then recombined into a decimal number. The details of this last step
give rise to the trickier parts of the addition algorithm.
Basic Rules for Addition
One can justify this process of decomposing, adding components, and then recombining, by means
of the two most basic rules of addition: the Commutative Rule and the Associative Rule. We recall
them.
The Commutative Rule: The value of a sum does not depend on the order of the summands: for
any two whole numbers, a and b
a+b =b+a
The Associative Rule: When adding three numbers, the value of the sum does not depend on the
way the numbers are combined into pairwise sums: more precisely, for any three whole numbers
a, b, and c,
(a + b) + c = a + (b + c)
These rules are easy to justify in terms of common models of addition. If we model addition by,
say, concatenation of lengths (sticking bars end to end), the Commutative Rule can be
demonstrated by combining bars of lengths a and b into a bar of length a + b , and then rotating
the bar around its midpoint to reverse the order of the summands. Associativity is even simpler:
one just takes a bar composed of subbars of lengths a, b, and c, and observes that it can be thought
of as being made of subbars of length a + b and c, or, equally easily, of subbars of lengths a and b
+ c.
Figure 1: The Commutative Rule for Addition
a
b
a
b
a+b=b+a
5
a
Figure 2: The Associative Rule for Addition
b+c
a
b
a+b
(a + b) + c = a + (b + c)
c
c
Although the Commutative and Associative Rules are basic principles of arithmetic, one rarely
uses them in isolation. Most practical manipulations call for more or less complicated
combinations of both the Commutative and Associative Rules. Therefore, it is advisable to discuss
explicitly the logical consequence of these rules, which is that, when a list of numbers is to be
added, it does not matter what sequence of pairwise sums we do, or the order of the summands in
any of these intermediate sums: the final result will always be the same. We call this the Any
Which Way Rule, a name which emphasizes the freedom we have in performing additions. (In
[BP], it is called the Any Order Property.) Although a formal demonstration of the Any Which
Way Rule is probably not appropriate when multidigit addition is first discussed, nevertheless an
explicit discussion of the rule, with examples, particularly examples showing how it can simplify
calculations and aid mental math, does seem feasible and advisable.
In any case, the Any Which Way Rule justifies the strategy of adding two base ten numbers by
breaking each summand into its place value components, adding pairs of components of the same
order of magnitude, then adding all the results. The next question is, what does this do for us? That
is, why is it an effective method?
Sum of Two Place Value Numbers
The first part of the answer to this question comes from looking at the sum of two place value
numbers of the same order of magnitude. These addition problems simply amount to applying the
“basic addition facts" (i.e., how to add two single digit numbers), plus keeping track of the order
of magnitude. Thus, for example,
7,000 + 2,000 = 7 × 1,000 + 2 × 1,000 = (7 + 2) × 1,000 = 9 × 1,000 = 9,000
It is always the same: to add two place value numbers of the same order of magnitude, simply add
their digits and then append the order of magnitude number of zeros on the right. The formal
justification for this uses the Distributive Rule, which we will discuss later on. It can also be
thought of as simply keeping track of units: in the sum above, we are adding 7 thousands to 2
thousands, and the result is (7 + 2) thousands, or 9 thousands.
Adding Base Ten Numbers Component-wise
This observation is sufficient by itself for adding many pairs of numbers. For example:
7,452 + 1,326
= (7,000 + 1,000) + (400 + 300) + (50 + 20) + (2 + 6)
by definition of base 10 notation and the Any Which Way Rule
= 8,000 + 700 + 70 + 8
by the procedure for adding numbers of the same magnitude
= 8,778
by definition of base 10 notation
6
However, addition will be this simple only when, for every order of magnitude, the two place
value components of that magnitude have digits summing to less than 10. When this happens, each
sum of components will again be a place value number, of the same magnitude, and these may be
put directly back together to form the sum of the original numbers, as in the example above.
The Need to Regroup
Whenever a digit sum is more than 10, an extra complication arises. For example,
70,000 + 60,000 = 130,000 = 100,000 + 30,000
Again, this is a repetition on a larger scale of a basic addition fact, which is why those facts are so
important. The new problem presented by this sum is that it has two place value components, one
of the same magnitude as the place value numbers being added, and the other with magnitude one
larger.
In adding two numbers, if the sum of the two place value components of a given magnitude has a
component of larger order of magnitude, this must be added to the components of that magnitude.
This is “regrouping" or “renaming" or “carrying.” An example of addition with regrouping is
7,453 + 1,729
= (7,000 + 1,000) + (400 + 700) + (50 + 20) + (3 + 9)
by definition of base 10 notation and the Any Which Way Rule
= 8,000 + 1,100 + 70 + 12
by the procedure for adding numbers of the same magnitude
= 8,000 + (1,000 + 100) + 70 + (10 + 2)
by writing 1,100 and 12 in expanded form
= (8,000 + 1,000) + 100 + (70 + 10) + 2
by the Any Which Way rule
= 9,000 + 100 + 80 + 2
by the procedure for adding numbers of the same magnitude
= 9,182
by definition of base 10 notation
Since (when adding two numbers) the sum of components of a given magnitude contributes
directly at most to this magnitude and to the next larger one, an efficient way of finding any sum
would be to start with order of magnitude zero (the ones place), find the sum of those components,
combine the 10 (if it occurs) with the magnitude one components, find the sum of those
components, and so forth, progressing systematically to larger orders of magnitude, one at a time.
This is exactly what one does in the standard addition algorithm, although it may not be expressed
in these terms, and the relevance of the expanded form may be suppressed in a strictly procedural
approach. For example, students may not appreciate that the effect of putting one number under
the other with the ones digits aligned has the effect of putting the digits of same-magnitude
components under each other, so that adding the columns exactly accomplishes combining the
components of the same order of magnitude. If the role of magnitude is not mentioned, students
may feel that the procedure for adding decimal fractions (i.e., aligning the decimal points) is
different from the procedure for whole numbers (i.e., right justifying). If the process is understood
in terms of combining components of equal magnitude, they will see that addition of whole
numbers and of decimal fractions are exactly the same. (See section 4 for more discussion of
decimal fractions.) We note that the above procedure may also be adapted readily to addition of
many numbers (“adding the columns”).
7
Thinking explicitly about the component-by-component aspect of decimal addition can give
greater flexibility in doing addition. For example, in adding two-digit numbers mentally, it is
usually easier to think of adding the 10s and adding the 1s separately (which is exactly using
expanded form in this situation). Further, one usually adds the 10s first, since this gives the main
part of the sum, and then adds the sum of the 1s to that. If there is a carry from the 1s, it just means
increasing the result from the 10s addition by one more 10, which is easy to do when there are
only two decimal places to worry about.
We will see later on (section 5) that these same considerations lead to quick estimation procedures
for sums, easy enough to be carried out mentally in many cases.
Summary of Addition
In summary: if addition is thought of in terms of expanded form, then the basic strategy in doing
addition of two decimal numbers is this:
i a) For each order of magnitude, take the place value component of that order of magnitude
from each summand, and add them. The sum is the sum of the digits of the two components, times
their common denomination.
ii a) In cases when the digit sums are all less than 10, the component sums will be the place
value components of the sum to be found.
ii b) If, however, a digit sum is 10 or more, the 10 times the relevant denomination produces the
denomination of the next larger magnitude, which must be added to the components of that
magnitude.
These ideas need not be introduced in a single large lump. They can be taught as the decimal
system is now, in a gradual way, starting with two-digit numbers. This would simply involve
recalling that a two digit number means so many 10s and so many 1s, e.g., 47 is four 10s and
seven 1s. (Probably explicit attention to this would be helpful to many students in any case, since
the irregularities of English two-digit number words somewhat impede children to think in term of
place value [KSF, p.167 ].) Then the strategy of “adding the 10s and adding the 1s" should appear
more or less natural, perhaps after discussion. First problems with no carries could be done to
establish the basic principle, then there should be consideration of what to do when one or the
other of the digit sums is greater than 10. This approach should extend easily to numbers with
more digits. This approach is followed in the Singapore textbooks [Sing 1B].
In fact, the basic grouping principle of base ten notation is not something that all students take to
readily. Even before general two-digit addition is considered, it can be valuable to emphasize the
grouping principle involved in base ten notation by using the “make a ten" idea [KSF, p.189] in
learning the addition facts with sums above ten. As explained in [Ma], Chinese teachers think of
the addition facts as not simply as a list to be memorized, but as a place to begin teaching the
structure of the decimal system, and they emphasize the process of forming ten ones into a 10
when a sum of digits is greater than 10, and of decomposing a 10 back into ones in the
corresponding subtraction problems. The combination of this approach to the addition facts with
more discussion of number names that better reflect base ten structure may be a way to get
students off to a better start in mathematics.
2B. Subtraction
Thinking in terms of place value components can also help with subtraction.
Connecting Subtraction with Addition
8
A key principle in dealing with subtraction is to link it with addition at every opportunity, until
students think of it as the undoing of addition. This is important conceptually, to prepare for the
submersion of subtraction into addition (i.e., as addition of the additive inverse) when negative
numbers are introduced. However, it also has practical value for making sense of subtraction
procedures, by linking them to the corresponding addition procedures. This linkage also brings
more insight into addition.
Fact families should not be thought of as applying only to addition of one-digit numbers. Rather,
every subtraction problem c – b = a should be linked to the corresponding addition problem a + b
= c (and the family should be understood also to include c – a = b and b + a = c).
In fact, the details of the computations involved in the two problems c – b = a and a + b = c
correspond closely. The subtraction c – b = a will involve regrouping (borrowing) exactly when
the addition a + b = c involves regrouping (carrying). Indeed, the same orders of magnitude will
be affected in both computations.
For example, in the problems
970,957
+ 50,829
1,021,786
1,021,786
– 50,829
970,957
we see that the addition involves carrying from the 1s to the 10s, and from the 100s to the 1000s,
and that a carry from the 10,000s to the 100,000s causes a further carry to the 1,000,000s.
Correspondingly, in the subtraction, there is a borrowing from the 10s to the 1s, another borrowing
from the 1,000s to the 100s, and a “borrowing across zero,” from the 1,000,000s to the 100,000s
and 10,000s.
Borrowing Across a Zero and Rollover
This example includes one of the most troublesome aspects of subtraction, namely “borrowing
across a zero.” Study of this phenomenon in the context of the parallel addition problem reveals an
interesting feature of addition that may go unnoticed in a purely mechanical approach.
In adding two base ten numbers, when the sum of the digits of a given order of magnitude add to
9, that place would not produce a carry. However, if the next smaller place does produce a carry,
then the 1 from that, added to the 9 already there, will produce a carry. (Also, the digit left in that
place will be zero.) If the digit sum for the next larger order of magnitude is also 9, then the carry
caused by the carry from the lower place will in turn cause a carry at the next larger place. This
could continue for several places, producing a sum in which there are one or more zeroes in a row.
We call this phenomenon rollover. Its extreme form occurs in adding 1 to a number like 99 or 999
or 9999, with all digits equal to 9. Then the result, of course, is the next larger denomination. This
is what happened on car odometers when they were mechanical, and one could see the change of
the 1s place producing a change in the 10s place, which produced a change in the 100s place, and
so forth, until many or even all the digits on the odometer had rolled over.
Borrowing past a zero is the parallel for subtraction to rollover in addition: when an addition a + b
= c involves rollover, the corresponding subtraction c – b = a will require borrowing past a zero,
and vice versa. If rollover is explicitly studied as an interesting and exceptional situation in doing
addition, and if subtraction is consistently connected to addition, then borrowing past a zero may
seem less mysterious to students. Here is an example of rollover in an addition, and the
corresponding borrowing across zeroes in the associated subtraction:
9
35, 375
+24, 648
60, 023
60,023
–24,648
35,375
We see that in the addition, the carry from 10s to 100s in the addition causes two places to roll
over: it causes the 100s to carry to the 1000s, and that makes the 1000s carry to the 10,000s. In the
subtraction, in order to do the subtraction at the 10s place, one must borrow from the 10,000s
place, across both the 1000s and 100s.
Methods of Compensation
The discussion above focuses on the standard algorithm for subtraction. However, thinking in
terms of decimal components also suggests alternative procedures to the standard one for
subtraction. As is well-known, [CRL] and [KSF], there are multiple interpretations for subtraction;
take away, difference, comparison and missing addend. Comparison involves a process of
matching elements in two sets, followed by counting of the unmatched elements of the larger set
after all elements of the smaller set have been used up. Thinking in this way, one can see that, if
the same amount is subtracted from two numbers, the difference of the results is the same as the
difference of the original numbers. In algebraic terms, this is expressed by the identity
a – b = (a – c) – (b – c)
This formula may be illustrated as follows:
Figure 3: Compensation Principle for Subtraction
a
c
c
a-c
a+b
c
b-c
b
a-b
a – b = (a – c) – (b – c)
This principle may be applied in various ways to simplify subtraction problems. Methods using
this principle are sometimes called methods of compensation [ BP ].
Making Change Method
First, let us consider the extreme case of borrowing across a zero, namely, the problem of
subtracting a number from a decimal denomination. We call this a making change problem, since
it occurs when a customer hands a large denomination bill to a cashier, and the cashier must
subtract the amount of the customer’s purchase from the bill to give correct change. For example,
consider
10,000
- 964
9036
10
One way of dealing with this kind of computation is to “count up,” successively producing
numbers divisible by higher and higher powers of 10. Starting from 964, one adds 6 to get 970,
then adds 30 to make 1000, then (noting that one does not need to add any 100s to make a whole
number of thousands), adds 9000 to make the goal of 10,000. Thus all together one has added
9000 + 30 + 6 = 9036. This is sometimes called the making change algorithm, since it is what
cashiers often do (or used to do) to produce the correct change when an amount due is paid by a
large denomination bill. It amounts to a place by place enactment of the missing addend
interpretation of subtraction.
If, instead of a denomination, we are subtracting from a place value number, essentially the same
procedure works. Hence, we will include this slightly more general problem under the rubric of
making change problems.
The making change problem can also be approached by a simple method of compensation3. We
subtract 1 from both the subtrahend and the minuend to get
10,000 →
- 964 →
9036 →
9,999
- 963
9036
The point here is that all the necessary regrouping has been accomplished in the subtraction of 1,
so that in the second problem, no further regrouping (borrowing) is needed.
Another Subtraction Method
Here is another procedure for subtraction that utilizes systematically a place-by-place method of
compensation to reduce a subtraction problem to a simpler problem in which, for each order of
magnitude, the digit of either the subtrahend or the minuend is zero. It turns out that such a
problem decomposes into a sum of making change problems.
In more detail, for this method, given a subtraction a – b of decimal numbers, for each order of
magnitude, one subtracts from both numbers the smaller of the components of a and b of that
magnitude, leaving only one of the numbers with a non-zero component of that magnitude. Doing
this systematically leaves one with an equivalent but simpler problem.
Let us see how this works in two-digit subtraction. First take an example without regrouping, say
89 - 63. If we write this in the usual form:
89
- 63
The procedure says to subtract the smaller of the two digits in each column from both digits in the
column. This gives a simpler problem with the same answer. In this case
89 → 26
- 63 → - 00
Here we can read off the answer as 26. Thus, when there is no regrouping, this method is
essentially the same as the standard one.
Next, take an example with regrouping, say 83 - 26. Then the procedure subtracts 20 from both
10s components, and 3 from both ones components:
3
Thanks to Mari Muri for bringing this to our attention. It has also been taught for many years by Herb Gross.
11
83 → 60
- 26 → - 03
57
We recognize the simplified problem as a making change problem. Assuming that we know how
to do these, we will find that the answer is 57, as indicated.
In two-digit subtraction, we can not get problems which require borrowing past a zero. Here is a
three digit problem where this happens.
204 → 200
- 9 → -5
195
Again, we see that we have turned the original problem into a making change problem. This is
something that one might do when solving this problem mentally.
In general, the simplified problem will not be a single making change problem, but will break up
into a sum of making change problems. For example, in the four digit problem
2047 →
- 95 →
2002
- 50
1952
We see that the simplified problem 2002 - 50 may be broken into a sum (2000 - 50) + (2 - 0). Each
of these is a making change problem (although the second one, 2 - 0, is a rather trivial one). Note
that we can actually break up the problem by blocks, and insert the separate answers back into the
original at the appropriate places. Thus, we can say
2002 – 50 = (2000 – 50) + (2 – 0) = (200 – 5) × 10 + 2 = 195 × 10 + 2 + 1950 + 2 = 1952
This is the way this method always works. Here is the first example of this section from this point
of view:
1,021,786 → 1,001,060
- 50,829 →
- 30,103
Here the simplified problem decomposes into three disjoint collections of “making change"
problems. More precisely,
1,001,060 – 30,103 = (1,000,000 – 30,000) + (1,000 – 100) + (60 – 3)
= (100 – 3) × 10,000 + (10 – 1) × 100 + (60 – 3) = 97 × 10,000 + 9 × 100 + 57
= 970,000 + 900 + 57 = 970,957
From a mechanical point of view, we can just operate on the leftmost three digits, the next two
digits, and the rightmost two digits as if we were doing the three separate making change
problems. If we write the answers again in the same positions, they will combine to give us the
answer to the whole problem.
12
This suggests the possibility of an approach to subtraction in which making change problems are a
subject of focused attention, and after they are well understood, they can be applied to the general
subtraction problem as illustrated above. Of course a flexible understanding of the role of decimal
components would be necessary for such an approach to make sense. It will be helpful whatever
method of subtraction is used.
2C. Multiplication
Thinking in terms of decimal components also sheds light on multiplication and division.
However, the development requires a firmer grasp of the rules of arithmetic than was needed for
addition. We begin with a discussion of the relevant rules.
Commutative Rule for Multiplication
Multiplication satisfies the Commutative Rule and the Associative Rule, just as addition does, but
they are much less transparent than in the case of addition, and justifying them takes more work.
Probably the best approach is through the Array Model, which is in any case an important
interpretation of multiplication. The Array Model also helps to prepare for understanding area.
The Commutative Rule for multiplication says that, for any two whole numbers n and m, the
products n × m and m × n are the same: n × m = m × n. If we model a whole number as a
horizontal row of some sort of object, say small circles, then multiplication involves replicating
the row a certain number of times. Thus, if 3 is pictured as
,
Then 4 × 3 can be represented
|
|
|
.
(The vertical lines are only for purposes of partitioning the collection of 12 circles into sets of size
three – they are not to be counted.)
For the Array Model, instead of arranging all the groups along the same line, we stack them one
above the other, to make a rectangular array:
This is the Array Model for 4 × 3: four horizontal rows of three items each, stacked vertically
above each other.
Using the Array Model, it is easy to justify the Commutative Rule for multiplication: one just flips
an m × n array over its diagonal, turning it into an n × m array:
Thus, n × m = m × n, which is the statement of the Commutative Rule for multiplication.
13
Associative Rule for Multiplication
A justification similar in spirit, although more complicated, may also be given for the Associative
Rule. The Associative Rule for multiplication is about forming products of several numbers. We
do this by multiplying one pair of numbers at a time, and to do this, we have to choose some order
to do it in.
For example, suppose we want to multiply 2, 3, 4 and 5 together. We can multiply the 5 by the 4
to form 4 × 5. Then we can multiply that result by 3, and then multiply the result of that by 2. We
indicate this procedure by using parentheses to show how the numbers were combined: 2 × (3 × (4
× 5)). Since the 4 × 5 is inside the inner parenthesis, we know we multiplied this first. Then the 3
multiplies the 4 × 5, so the 3 × (4 × 5) is contained in parentheses, and finally, this whole product
is multiplied by 2.
Another way to form a product from these factors is to multiply the 5 by 4, and to multiply the 3
by 2, and then to multiply these two products together. This is indicated by this parenthesization:
(2 × 3) × (4 × 5).
Notice that
2 × (3 × (4 × 5)) = 2 × (3 × 20) = 2 × 60 = 120
On the other hand,
(2 × 3) × (4 × 5) = 6 × 20 = 120
We see that the two ways of multiplying give us the same answer, and this is the eventual result of
the Associative Rule. However, formally, the Associative Rule for multiplication is only about the
simplest multiple factor situation: products of three numbers. With three factors, there are just two
ways to parenthesize.
The Associative Rule for multiplication says that for any whole numbers l, m and n,
l × (m × n) = (l × m) × n
The parentheses on the left hand side say that first n should be multiplied by m, and then the result
should be multiplied by l. On the right hand side, the parentheses indicate that m should be first
multiplied by l, and then n should be multiplied by the result. The Associative Rule asserts that
these two different ways to form the three-fold product give the same result.
Building on the Array Model, one thinks of l × (m × n) as a rectangular 3-dimensional array of l
layers of planar arrays of size m × n. One can then re-slice this array by planes perpendicular to the
rows of length n, and exhibit the same array as an assemblage of n layers of l × m planar arrays.
Thus
l × (m × n) = (l × m) × n
This is the Associative Rule for multiplication. We do not attempt the illustration here. However,
students might well benefit from studying it carefully.
It is worthwhile to note that the Associative Rule for multiplication is much less transparent, both
in its justification and in its application, than the Associative Rule for addition. Indeed, if you
write both sides of the rule for almost any triple of numbers, it will produce an equation which
takes some thought to verify. For example, if we take n = 3, m = 4 and l = 5, we get (3 × 4) × 5, =
14
12 × 5 and 3 × (4 × 5) = 3 × 20. So the Associative Rule says 12 × 5 = 3 × 20. This is of course
true, but if we did not know where this equation came from, we probably would have to compute
both sides to be sure.
Law of Exponents
Before going on, we should discuss the Law of Exponents. This elegant and important formula
describes how to multiply powers of a number. Since place value numbers are just digits times
powers of 10, the Law of Exponents is relevant to multiplying base ten numbers.
The Law of Exponents depends only on the Associative Rule for multiplication. The standard
statement of the Associative Rule says that either of the two possible ways to group three numbers
in pairs to form products will yield the same result. Although it is stated only for three-fold
products, it in fact implies that any grouping of a product of any number of factors will give the
same result as any other grouping. For example, there are five ways to group four factors:
((a × b) × c) × d, (a × (b × c)) × d,
(a × b) × (c × d),
a × ((b × c) × d), a × (b × (c × d))
The Associative Rule allows us to connect these with a chain of equalities:
(a × (b × c)) × d = ((a × b) × c) × d = (a × b) × (c × d)) = a × (b × (c × d)) = a × ((b × c) × d)
(4PAR)
In the above chain, each expression is equal to the next one by a single application of the
Associative Rule. In the first and fourth equations, the rule is applied within a three-fold
subproduct of the four-fold product. In the second equality, the product ((a × b) × c) × d is
regarded as a product of the three factors (a × b), c, and d. In the third equality, it is similar: the
product a × (b × (c × d)) is regarded as the three-fold product of the factors a, b, and (c × d).
This chain of equalities shows that all parenthesizations of a four-fold product yield the same
result. It is the same with products involving any number of factors. This means that if we write a
multiple product without any parentheses, the result is unambiguous: any way of inserting the
parentheses will give the same result. For this reason, products may be, and usually are, written
without parentheses. This is part of the genius behind the logic of the Rules of Arithmetic.
Seemingly very simple rules involving only two or three numbers imply much more complex
manipulations involving many numbers.
When all the factors in a multiple product are the same number a, the result is called a power of a.
More precisely, if we multiply together m copies of a, the result is called am. Here m is called the
exponent of a. In this situation the Associative Rule guarantees that all possible ways of forming
the product of m factors equal to a give the same result, so all that matters is the number of factors,
which is the exponent m. Here are some examples:
22 = 2 × 2 = 4,
23 = 2 × 2 × 2 = 2 × 22 = 2 × 4 = 8
24 = 2 × 2 × 2 × 2 = 2 × 23 = 2 × 8 = 16,
32 = 3 × 3 = 9,
25 = 2 × 2 × 2 × 2 × 2 = 2 × 24 = 2 × 16 = 32
33 = 3 × 3 × 3 = 3 × 32 = 3 × 9 = 27
34 = 3 × 3 × 3 × 3 = 3 × 33 = 3 × 27 = 81,
35 = 3 × 3 × 3 × 3 × 3 = 3 × 34 = 3 × 81 = 243
15
From the point of view of decimal arithmetic, the most important powers, and also the simplest to
write, are the powers of 10:
101 = 10,
102 = 10 × 10 = 100,
103 = 10 × 10 × 10 = 10 × 100 = 1,000
and so forth. We can see that 10m is just the denomination of order of magnitude m. Thus, the
number of zeroes, the power of 10, and the order of magnitude of the denomination are all
different terms for the same number.
The Law of Exponents describes the product of two powers of a number a. It says:
Law of Exponents:
(am) × (an) = am+n
Examples: 22 × 23 = 4 × 8 = 32 =25; 102 × 103 = 100 × 1,000 = 100,000 = 105
Its proof is easy from what we have already said. The power am is the product of m factors, all
equal to a. Similarly, the power an is the product of n factors, all equal to a. If we multiply am and
an together, we will have a total of m + n factors, all equal to a, and this is just the power am+n.
The Law of Exponents is the key identity for dealing with expressions involving powers of
numbers. Besides its utility in computations, it reveals a beautiful parallel between multiplication
and addition. The Rules of Arithmetic emphasize the fact that both addition and multiplication are
governed by essentially the same four basic rules. This suggests that there might be some deep
relationship between them. The Law of Exponents is a concrete expression of this kinship.
Any Which Way Rule for Multiplication
Once students firmly believe the Commutative Rule and the Associative Rule for multiplication,
the same logic as used in the case of addition shows that the Any Which Way Rule holds also for
multiplication: not only can the factors in a multiple product be grouped according to any scheme
of parenthesization, they also can be reordered as desired. If this was not discussed very explicitly
when developing addition, it might be well to consider examples of how the two rules combine to
create more extensive rearrangements. Here is one. The equality
(a × b) × (c × d) = (a × c) × (b × d)
may be justified by the steps
(a × b) × (c × d) = a × (b × (c × d)) = a × ((b × c) × d)
= a × ((c × b) × d) = a × (c × (b × d)) = (a × c) × (b × d)
Here the first two equations are part of the chain of equations 4PAR above, as are the last two. The
third equation uses the Commutative Rule for b × c.
If the reader thinks that the above transformations amount to a lot of work to justify a simple
exchange of factors, s/he would be right; but the general symbolic formulation given above may
make the identity seem too transparent. Consider a numerical example:
24 × 56 = (3 × 8) × (7 × 8) = (3 × 7) × (8 × 8) = 21 × 64
The equality of the first and last products may not be immediately obvious to all.
16
Distributive Rule
In addition to the Any Which Way Rule, multidigit multiplication makes heavy use of the
Distributive Rule, which is the one rule of arithmetic involving both addition and multiplication.
More precisely, it tells us how to multiply sums of numbers. The standard formulation is:
Distributive Rule
For any whole numbers a, b, and c, the equation
a × (b + c) = a × c + a × c
holds.
The Distributive Rule may be also justified using the Array Model. The picture is as follows:
Just as the Commutative and Associative Rules combine many times to produce the practical Any
Which Way Rule, the Distributive Rule also has a more practical form, which we will designate
(rather prosaically) as the
Extended Distributive Rule
If A and B are sums of several numbers, then the product AB may be computed by multiplying
each summand of A by each summand of B, and adding all the resulting products.
For example, if A = a + b and B = c + d + e, then
AB = (a + b)(c + d + e) = ac + ad + ae + bc + bd + de
(Here, and in what follows, we use the standard convention to indicate multiplication simply by
juxtaposition, with no operation sign written explicitly.)
Figure 4: Distributive Rule
a) Example:
3
○
○
○
b) Schematic Representation:
6
○ ● ● ● ●
○ ● ● ● ●
○ ● ● ● ●
2
a
b
4
3 X (2+4) = 3 X 2 + 3 X 4
c
a X (b + c) = a X b + a X c
17
The Extended Distributive Rule can be pictured using arrays almost as easily as the basic version:
Figure 5: Extended Distributive Rule (Schematic Form)
a
b
c
e
d
(a + b) X (c + d + e) = a X c + a X d + a X e + b X c + b X d + b X e
Multiplication According to the Rules
These rules, plus, of course, the Any Which Way Rule for addition, provide excellent guidance for
multiplying base ten numbers together. Since each base ten number is the sum of its place value
components, the Extended Distributive Rule tells us that the product of two base ten numbers may
be found by multiplying each place value component of one factor by each place value component
of the other, and then summing all the products. The Any Which Way Rule for addition says that
we have great latitude how we sum them. Differences in multiplication algorithms come mainly
from choosing different summation procedures.
Multiplying Place Value Numbers and the Multiplication Facts
Before getting into details of summation schemes, we should have a good grasp of multiplying
place value numbers. Here we find a situation analogous to that of addition. When we multiply
two place value numbers, the result is equal to the product of the digits, times the appropriate
denomination, which has order of magnitude equal to the sum of the orders of magnitude of the
factors, since according to the Law of Exponents, 10a × 10b = 10a+b. Examples:
20 × 40 = 800
300 × 6,000 = 1,800,000
Thus, the basic multiplication facts, combined with a grasp of place value, allow us to multiply
any two place value numbers.
We should perhaps point out that the formal correctness of a formula like 300 × 6,000 = 1,800,000
depends on the Any Which Way Rule for multiplication. In carrying out the detailed
manipulations, we would write
300 × 6,000 = (3 × 100) × (6 × 1,000) = (3 × 6) × (100 × 1,000) = 18 × 100,000
= 1,800,000
(Note that the crucial middle equality is a case of the equation (a × b) × (c × d) = (a × c) × (b × d)
discussed above.) As we have already mentioned, the multiplication of the denominations is
described by the Law of Exponents: 100 × 1,000 = 102 × 103 = 105 = 100,000.
Just as in one-digit multiplication, the product of two place value numbers can itself have one or
two components in its expanded form. Their magnitudes will be the sums of the magnitudes of the
factors, and possibly one more. Perhaps the trickiest case is when one digit is even, and the other
is 5. Then the product of digits will be a multiple of 10, and so the full product will have only one
18
decimal component, but its order of magnitude is one larger than the sum of the orders of
magnitude of the factors. This case, however, follows the same rules as all other cases: the extra
order of magnitude is supplied by the product of the digits. An example:
50 × 800 = (5 × 10) × (8 × 100) = (5 × 8) × 1,000 = 40 × 1,000 = 40,000
Doing Multidigit Multiplication
Let us combine these observations with the Extended Distributive Rule to do multi-digit
multiplication. For example, consider
36 × 487
= (30 + 6) × (400 + 80 + 7)
= 30 × 400 + 30 × 80 + 30 × 7 + 6 × 400 + 6 × 80 + 6 × 7
= 12,000 + 2,400 + 210 + 2,400 + 480 + 42
= 17,532
In this computation, we have arranged the products of place value components in a 3 by 2 array.
We have given no clue as to how the addition of the array of six numbers was performed. One
method might be to sum the terms along each row of the array, and then sum the results. This
would yield the intermediate sum 14,610 + 2,922 + 17,532. We can recognize this sum as the one
which appears in the standard calculation
487
× 36
2,922
14,610
17,532
(Here we have filled in the often omitted zero in the 14610 in order to make clear the connection
with the row sums.)
On the other hand, if we sum down the columns of the array, then we obtain as an intermediate
step 14,400 + 2,880 + 252 = 17,532. We can recognize these summands as those appearing in the
standard procedure with the factors taken in the other order:
36
× 487
252
2,880
14,400
17,532
Thus from the array of products of place value components, we can produce the standard method
for computing products, in either order, by summing first along rows of the array, or along the
columns. By looking at the rows of the array, one sees that the sum along a row amounts to the
product of a decimal component of the first factor times the second factor. This is indeed what we
calculate to perform the standard algorithm. Interpreting the standard procedure for both orders of
the factors in terms of the array of products of components could help students to understand why
both give the same answer (as they must if multiplication is to be commutative), although the
intermediate steps are so different.
19
However, we have seen that in some sense the most natural way to do an addition is to break each
summand into its place value components, and to add the components of a given magnitude.
Neither of the standard algorithms does this. However, this procedure can be carried out in quite
an elegant way, giving an algorithm known as the lattice method, or array method, or Napier’s
Bones. It refines the observation that, if we write the products of components in an array, as in the
example above, then the order of magnitude of the products decreases from left to right along
rows, and from top to bottom along columns, and products of the same order of magnitude are
lined up along diagonals running from upper right to lower left.
To see how the lattice model works for the example above, we put the place value component
products in a 3 × 2 array of boxes, and we write each one as the product of the digit product times
the appropriate power of 10:
Figure 6: The Lattice Model
36 X 487 :
Column
Totals:
Row Totals :
30 X 400 =
12,000
30 X 80 =
2,400
30 X 7 =
210
14,610
6 X 400 =
2,400
6 X 80 =
480
6 X 7 =
42
2,922
14,400
17,532
Grand Total
252
2,880
We can see that the terms for which the power of 10 is constant are arranged along diagonals
going from upper right to lower left. This can be refined. Break up each digit product into its base
ten components. If we divide each box by a diagonal line, and put the tens digit of the digit
product (times the appropriate denomination) above the line, and the ones digit (times the
appropriate denomination) below, then terms with the same order of magnitude are again arranged
along the diagonals, which are now composed of equilateral isosceles triangles, facing up and
down alternately:
Figure 7: The Elaborated Lattice Model
10,000
2,000
2,000
400
200
2,000
40
400
400
80
20
10
2
Now summing along these diagonals gives allows us to sum numbers all of the same order of
magnitude, and the sum of these sums will give us the product:
Figure 8: The Elaborated Lattice Model,
Summing Along Diagonals
Diagonal Sums:
10,000
6,000
10,000
2,000
2,000
400
200
2,000
40
400
400
1,400
10
80
130
2
2
Grand Total: 10,000 + 6,000 + 1,400 + 130 + 2 = 17,532
This says that
36 × 487
= 10,000 + 6,000 + (1,000 + 400) + (100 + 30) + 2
= 10,000 + (6,000 + 1,000) + (400 + 100) + 30 + 2
= 10,000 + 7,000 + 500 + 30 + 2 = 17,532
We hope that it is clear from this example how to carry out the general case. Napier’s Bones is a
pleasant algorithm for multiplication which better displays the structure of the process than does
the standard algorithm. We remark that the process of summing along the diagonals in Napier’s
Bones is parallel to shifting the partial products to the left in the standard method.
2D. Division
The interactions between place value ideas and division are especially illuminating, both for
division and for place value. Indeed, study of division brings out two quite distinct interpretations
or properties of the base ten expansion of a number. In one direction, the idea of division-withremainder provides us with a systematic way to find the digits in the base ten expansion of any
whole number. In the opposite direction, the effort to find a method for computing the base ten
expansion of the quotient in a division problem forces us to find a quite different interpretation of
the base ten expansion. This second interpretation connects directly with the uses of base ten
expansions for approximation.
Division-with-Remainder
Division, in the strict sense of inverting a multiplication, cannot in general be carried out in the
whole numbers, or even in the system of decimal fractions (finite decimals). This is one of the
least satisfactory aspects of decimal arithmetic, and is a major reason for using fractions, i.e.,
general rational numbers.
Instead of division as the inverse of multiplication, one deals with a somewhat looser substitute,
division-with-remainder. This can always be carried out with any two whole numbers, but it does
not yield a single number as a result. It yields a pair of numbers: the quotient, and the remainder.
Division-with-remainder, both theoretically and practically, depends on the order properties of
21
whole numbers. These are discussed in some detail in section 3. Here we assume a basic
acquaintance.
The usual recipe for division-with-remainder (often called the division algorithm, although it gives
little guidance about how one would do division in practice) depends on the order structure of the
whole numbers. It is contained in the formula
b = qa + r
Here b is being divided by a. The number q is called the quotient of b by a. The number r is the
remainder. The essential property of r is that 0 ≤ r < a.
b
is also called the quotient of b by a. It can cause confusion to
a
use exactly the same term to refer to two different things by the same word. When we want to be
clear that we are talking about the integer r coming from division-with-remainder, we will talk
about the integer quotient, or the Iquotient for short.
Division-with-remainder can be thought of in terms of measurement. It is as if we were measuring
a ribbon of length b into pieces of length a, and we can fit q of them in, and then there is a shorter
piece of length r left over. Here is a picture of that process for b = 17 and a = 3. As can be seen, in
this case q = 5 and r = 2.
Of course, the rational number
Figure 9: Division-with-Remainder
17 = 5 X 3 + 2
17
0
20
10
3
3
3
3
3
2
Since r is non-negative, it follows that qa < b. Secondly, and this is the key property of q, it is also
true that q is the largest whole number such that qa ≤ b. For if we increase q by 1, we see that
(q+ 1)a = qa + a = b – r + a = b + (a – r)
and the inequality r < a means that a – r > 0. Hence (q + 1)a > b. For 17 ÷ 3 the figure above
illustrates this4.
Here are some numerical examples:
17 = 5 × 3 + 2.
Thus 15 ÷ 3 = 5, remainder 2, and 5 × 3 = 15 < 17 < 18 = 6 × 3.
200 = 11 × 17 + 13.
Thus 200 ÷ 17 = 11, remainder 13, and 11 × 17 = 187 < 200 < 204 = 12 × 17.
6484 = 72 × 89 + 76. Thus 6484 ÷ 89 = 72, remainder 76, and 72 × 79 < 6486 < 73 × 89.
Interpretations of the Base 10 Expansion
We will now discuss two important interpretations of the base ten expansion of a number. One of
them involves repeated division by 10. The other is less well known, but is crucial for
4
Just as there are two distinct operations, division and division-with-remainder, there should be two different symbols for
division, so that one could be sure which operation was meant. However, this essay has introduced enough new terminology
and declines the task of inventing this notation.
22
understanding long division, and also for the approximation properties of the base ten expansion to
be discussed in the next section.
The Base 10 Expansion and Successive Division by 10
The base ten expansion of a whole number can be interpreted in terms of division-with-remainder
by 10. This interpretation begins with the observation that the 1s digit of a number is the
remainder when the number is divided by 10. This is because all positive powers of 10 are clearly
divisible by 10. Precisely, we have the formula
10n = 10n-1 × 10
for any positive integer n. From this we can see that, if we take any place value number larger than
10, and divide it by 10, then the quotient is just the place value number with the same digit, but of
order of magnitude one less. For example,
7000 ÷ 10 = 700, and 400 ÷ 10 = 40, and 50 ÷ 10 = 5
We can extend this to a simple rule for dividing (with remainder) any base ten number by 10: to
get the Iquotient, we drop the rightmost digit, and this digit is the remainder. For example
7,452 = 745 × 10 + 2,
and
83,244 = 8,324 × 10 + 4,
7452 ÷ 10 = 745, remainder 2
or
or
83,244 ÷ 10 = 8324, remainder 4
Continuing to divide by 10 produces all the digits of the number in succession:
745 ÷ 10 = 74, remainder 5,
and 74 ÷ 10 = 7, remainder 4, and 7 ÷ 10 = 0, remainder 7
This scheme can be easily extended to dividing by higher powers of 10. For example, to divide by
100, we drop the rightmost two digits, and the dropped digits form the remainder; for dividing by
1000, we drop the three rightmost digits and they form the remainder; and so forth:
7452 ÷ 10 = 745, remainder 2,
7452 ÷ 100 = 74, remainder 52,
7452 ÷ 1000 = 7, remainder 452,
or
or
or
7452 = 745 × 10 + 2
7452 = 74 × 100 + 52
7452 = 7 × 1000 + 452
The Base 10 Expansion and Approximation by Place Value Numbers
There is a second, seemingly quite different, interpretation of the decimal expansion of a number
that connects with the idea of approximation. It is this interpretation that will allow us to find the
digits of the base ten expansion of an Iquotient, i.e., to perform long division.
Given any number, we can try to approximate it by sums of place value numbers. There are of
course many, many ways to do this, but there is one way to do it which deserves to be called
simplest: the greedy algorithm. Suppose we are given some collection of special numbers, and we
want to express an arbitrary number as a sum of the special numbers. This can be thought of in
terms of placing orders at a factory. Imagine that the factory makes widgets, and it packages them
in boxes of certain sizes. If you want to place an order for a certain number of widgets, the greedy
algorithm for doing this would say to do the following: take the largest package smaller than your
order; then subtract it, and repeat, with the difference taking the place of your original order.
Continue as long as you can: either until you have filled your whole order, or until all available
packages are larger than the amount you still want.
23
This will not always succeed. For example, if the packages have sizes 2 or 3, and you want to
order 4, then the greedy algorithm would say, first order a package of 3. This leaves you wanting
one more; but there is no package of size 1. However, if you had been more moderate, you could
have ordered two packages of size 2 to get exactly 4.
If the collection of package sizes is the collection of place value numbers, however, then the
greedy algorithm for approximation will work, and it exactly produces the decimal expansion of
the number. Here is an example. Take the number 7,452. Since
7,000 ≤ 7,452 < 8,000
the greedy approximation algorithm would say to write 7, 452 = 7,000 + (7,452 - 7,000) = 7,000 +
452. The next step in the approximation process is to do the same for the “error term" 452. Since
400 < 452 < 500
we see the greedy algorithm says to write 452 = 400 + (452 - 400) = 400 + 52. Similarly, the next
step is to write 52 = 50 + (52 – 50) = 50 + 2. Finally, the remainder term of this stage, namely 2, is
itself a place value number, so our approximation is exact, we have zero remainder, and the
process stops. We have found successively that
7,452 = 7,000 + 452 = 7,000 + 400 + 52 + 7,000 + 400 + 50 + 2
The end result of this approximation process is exactly the base ten expansion of 7,452.
The reason behind the success of the greedy algorithm in producing the base ten expansion is the
following property of the collection of all place value numbers:
For any place value number, of given order of
magnitude, the next largest place value number is
obtained by adding the denomination of that same
order of magnitude.
Thus, in our example above we saw that for 7000, the next largest place value number is 8000 =
7000 + 1000, and for 400, the next largest place value number is 500 = 400 + 100. This property is
perhaps quite obvious when the digit of the place value number is 1 through 8. Then we get the
next largest place value number by just increasing the digit by 1. The most interesting case is
when the digit is 9. Then the next largest place value number is the denomination of order of
magnitude one more. However, the rule holds in this case also: we get the next place value number
by adding the denomination of the order of magnitude of the starting number. Examples of this are
9 + 1 = 10, and 90 + 10 = 100, and 900 + 100 = 1000, and so forth.
From this key fact, we can give an argument for any number, as follows. Let n be any whole
number, and let m1 be the largest place value number not larger than n. Then the next largest place
value number is m1 + d1, where d1 is the denomination of the same order of magnitude as m1. Since
m1 is chosen to be as large as possible, we know that m1≤ n < m1 + d1. By subtracting m1 from all
three numbers, we can conclude that 0 ≤ n - m1 < d1. Since the next stage of the approximation
process is to approximate the remainder n - m1 with a place value number, we see that the next
stage will use a number of smaller order of magnitude. Hence, in the final sum, there will be at
most one term of any given order of magnitude, so (according to the characterization of base ten
expansions noted in section 1) the sum will be exactly the expanded form of n.
24
An interesting observation comes out of this point of view on the base ten expansion. Since the
difference between a number and its largest place value component has smaller order of
magnitude, it is less than the largest place value component. In other words, in the equation n = c + r,
where c is the leading place value component of n, and r = n - c is the rest of n, we have c > r, and
n
therefore 2c > c + r = n, or c >
> r. We may state this in words: For any whole number n, the
2
largest place value component of n accounts for more than half of n.
We will give a more refined version of this statement in the Decimal Estimation Theorem of
section 3.
Approximation and Long Division
Using the ideas above, we can give a brief description of the basic ingredients in long division.
Our treatment uses symbolism which might not be appropriate for students being introduced to
long division. However, the principle governing the procedure should be both teachable and
plausible, and examples should make it convincing. The theoretical justification might be delayed
until an algebra course.
We remind ourselves that, in dividing-with-remainder b by a, we are seeking the largest whole
number q such that aq ≤ b. Because of the approximation interpretation of the base ten expansion
of a number, this rather bare-bones description actually adapts rather directly to what one does to
find the base ten decomposition of q, when a and b are given as base ten numbers. That is, it tells
us how to do long division.
Basic Fact of Long Division (BFLD): Suppose that q is the Iquotient of b by a, so that b = qa + r,
with 0 ≤ r < a. Then the largest place value component of q is simply the largest place value
number q1 such that q1a ≤ b.
The problem of long division is to find the place value components of q. The BFLD says, simply
look for the largest place value number q1 such that q1a ≤ b, and this will turn out to be the largest
place value component of q. This effectively solves the problem: to find the rest of the place value
expansion of q, subtract to get b - q1a = b2, and repeat the process with b2. This is exactly what one
does in the long division algorithm.
The justification of the BFLD is parallel to the discussion of the approximation interpretation of
the base ten expansion. The argument uses the same fact that guaranteed success of the greedy algorithm
discussed above: that the next place value number larger than q1 is q1 + d1, where d1 is the denomination
with the same order of magnitude as q1. Because of this, if q1 is chosen to be the largest place value
number such that q1a ≤ b, we must have q1a ≤ b < (q1 + d1) a. It follows that (q1 + d1)a > b ≥ qa. Dividing
by a tells us that q1 + d1 > q. Keeping in mind that q1 + d1 is the next largest place value number after
q1 , we conclude that this says that q1 is the largest place value number less than or equal to q.
According to the approximation interpretation of the base ten expansion, this means that q1 is the
largest place value component of q, which is the claim of BFLD.
Using BFLD to do Long Division
Thanks to the Ordering Algorithm given in section 3, finding q1 is a fairly easy thing to do in
practice. This is illustrated in the example below.
The full process of long division proceeds by iteration, using the BFLD repeatedly. Having
determined the largest decimal component of q, we can now work with the difference b - aq1, and
25
look for the largest decimal component q2 such that aq2 ≤ b - aq1. Continuing in this fashion, we
will eventually find the complete decimal expansion of the quotient. This procedure is in fact
essentially the standard long division algorithm. Thanks to the structure of the family of decimal
components, the connection between the theoretical construct and the practical procedure is quite
close.
Here is an example. In it, we will try to emphasize the practical aspects of the determination of q1,
and to make clear the iterative aspect of the calculation.
Compute 93,482 ÷ 274.
First we determine the order of magnitude of the quotient. This is easy to do by looking at the
orders of magnitude of dividend and divisor, and comparing first digits (or occasionally more, if
the first digits are equal). In our case, 274 has magnitude 2 and first digit 2, while 93,482 has
magnitude 4, and first digit 9. We conclude that 274 × 100 < 93,482 < 274 × 1000. Hence 100 < q1
< 1000.
Next we want to find the first digit of q1. This requires estimating the largest multiple of 274 × 100
that does not exceed 93,482. We will see in section 3 that the first two digits of a base ten number
determine it with a fair amount of accuracy. Therefore, if we replace the dividend 93,482 with
93,000, and also replace 274 × 100 with 274 × 100 = 27,000 we will probably get the correct value
if we find the largest digit d such that d × 27,000 < 93,000. This is in fact a problem of estimating
with two-digit numbers; the final zeroes on both numbers do not affect the answer. Hence, the
issue of finding the digit of q1 can in the great majority of cases be reduced to a question of
dividing a two-digit number, or perhaps a three digit number, by a two digit number. If this way of
estimating the digit of q1 fails, then the true digit will be at most one larger or smaller, and is easy
to determine when the guess made by this method is wrong.
In our case, 3 × 27 = 81 < 93 < 4 × 27 = 108. Hence, our candidate for q1 is 300. We form
93,482 – 300 × 274 =93,482 – 82,200 = 11,282 < 27,400.
Hence q1 = 300 is the correct guess.
We next proceed to the second step, which is 11,282 ÷ 274. We already know 274 × 100 = 27,400 >11,282,
and clearly 274 × 10 = 2740 <11,282, so the next term in the base ten expansion of the Iquotient
will be a digit times 10, and out job now is to determine that digit.
In this step, the dividend 11,282 has magnitude 4, while 274 × 10 = 2740 has magnitude 3. We
again round both numbers to two places, and consider the approximate division problem 11,000 ÷
2700. We can forget about the last two zeroes here so effectively we are dealing with the problem
110 ÷ 27. Since 4 × 27 = 108 < 110 < 135 = 5 × 27, our guess for the digit is 4. Then forming the
difference,
11,282 – 40× 274 = 11,282 – 10,960 = 322
we see that it is positive, but less than 2740, so 4 is indeed the right guess.
We are left with the problem 322 ÷274. It should be clear that the quotient in this case is 1. The
remainder then is 322 - 274 = 48.
Summing up, we have found that
93,482 = (300 + 40 + 1) × 274 + 48
or 93,482 ÷ 274 = 341, remainder 48.
26
The reader can check that our steps are exactly the steps of the standard long division algorithm,
written out with commentary. Thus, the standard algorithm provides a compact format for carrying
out the iterative calculation of the successive digits of the Iquotient in division-with-remainder. In
addition, the discussion of estimation in the next session furnishes a rationale for guessing the
digits in the decimal expansion of the quotient by doing division of a two- or three-digit number
by a two-digit number. The guess that this gives will be correct most of the time.
Remark: Although it is beyond the scope of this essay, the approximation interpretation of the
base ten expansion extends directly to finding the decimal expansion of any real number, and this
algorithm extends likewise, allowing one to find the decimal expansion of any rational number
b/a.
In summary, the BFLD makes long division a practical and easy-to-use algorithm. To execute it
does require fluency with multiplication and subtraction. It also demands some skill in estimation,
but what is needed in practice boils down to division of two- or three- digit numbers by two-digit
numbers. Due to its recursive nature, it does have a step-by-step feel to it, and it can seem
somewhat lengthy. However, the overall understanding of what it is doing, as summarized by the
BFLD, can give students confidence in the process. Also, since long division starts out by finding
the largest digits of the result (rather than the smallest digits, as with the algorithms for the other
three operations), one can carry it out until one has a sufficiently accurate answer, and then stop.
Our discussion of the operations of arithmetic has tried to make the case that thinking explicitly in
terms of place value components yields benefits for developing understanding of arithmetic. We
will next attempt to show that doing decimal arithmetic with emphasis on the role of place value
components has advantages beyond computation. It also makes estimation a relatively simple
matter, and it brings out the connections between decimal arithmetic and algebra.
3. Decimal Components, Estimation and Error.
Order and Magnitude
Besides the arithmetic operations, the other main structure on the whole numbers is order: we
know when one whole number is larger than another. (In fact, order can be expressed in terms of
addition: whole number a is greater than whole number b if a = b + c, where c is another (nonzero) whole number. However, the role order plays is quite different from the role of the
operations.)
The decimal system is highly compatible with the ordering of the whole numbers, and makes it
easy to compare numbers. Recall that we defined the order of magnitude for any whole number in
section 1. It is the order of magnitude of the largest place value component of the number, or
equivalently, it is one less than the number of digits in the standard base ten expression of the
number. As we noted there, a basic fact is that order of magnitude sorts numbers according to size:
any number of a given order of magnitude is larger than any number of a smaller order of
magnitude. For example, the largest number of magnitude 2 is 999, and the smallest number of
magnitude 3 is 1000 = 999+1. This simple fact has several important consequences. The first is a
recipe for comparing any two base ten numbers.
Ordering Algorithm
To decide which of two base ten numbers a and b is larger, compare their place value components.
Find the largest order of magnitude for which the place value components of a and b are different.
Then the number with the larger component of that magnitude is larger.
27
In particular, if the order of magnitude of a is larger than the order of magnitude of b, then a is
larger than b. If they have the same order of magnitude, but the leading (i.e., largest) place value
component of a is larger than the leading component of b, then a is larger. If the leading
components agree, proceed downward in orders of magnitude, until you first find a magnitude at
which the components of a and b differ. Then the larger number is the one with the larger
component of that magnitude.
Relative Place Value
Often, we want to do more than say whether one number is larger than another, we want to say
how much larger it is, or that it is very much larger. We also want to say when two numbers are
close to one another. Paying attention to place value components also makes these things easy to
do. The main point here is that multiplying by 10 just increases order of magnitude by 1. This is
true no matter what decimal place we are talking about. Thus, given any decimal place, the place
just to the left represents numbers 10 times as large as the given place, and the place just to the
right represents numbers only 1/10 the size of the given place. Two places to the left, you find
numbers 100 times as large as where you are, and two places to the right, the numbers are only
1/100 of the place where you are. For facility with estimation, it is important that students
understand not only the values of each place, but also these relative values of the places. The grasp
of relative place value also can support the introduction of decimal fractions. We will discuss this
in section 4.
Absolute and Relative Error
Relative place value ideas help us understand that, in representing numbers, it is the largest few
decimal components (the leftmost few decimal places) that account for most of the size of a
number. This is true no matter what the magnitude of the number. These ideas are particularly
important in dealing with error, which is unavoidable whenever we must deal with “real-life"
numbers, meaning numbers which come from measurement. These are always known only
approximately, and the question of how accurately they are known is fundamental. Here the key
idea is relative error: the size of the error involved in approximating a given number, in
comparison to the number itself. If you only have a few dollars, you care quite a bit whether it is
$4 or $6. However, if you have around a thousand dollars, probably you do not care too much if it
is $1004 or $1006. And if you have a million dollars, even a thousand dollars more or less, let
alone $2, probably will not keep you up at night.
Suppose you need to work with a number V, but you use a possibly different value v in its place5.
22
and the decimal fraction 3.14 are often used in place of the number
For example, the fraction
7
355
and 3.14159. Call v an approximation to V. The
π; more accurate approximations are
113
(absolute value of) the difference, | V - v | = e is the absolute error of approximating V by v. For
many purposes, the actual size of the error e is not as important as how it compares to the size of
V. To capture this idea, we define the relative error to be the ratio e÷V. The relative error can be
thought of as measuring the error in the most relevant units, namely units of the correct amount.
Both the absolute error and relative error are controlled easily in terms of decimal components.
5
This could be for a variety of reasons: you may not know the precise value of V, or it simply may be more convenient to use
v (or both). If V is irrational, like
numerical computation.
π
or 2 , we do not know the exact value of V, so we must use an approximation in any
28
The main observation is that, the more alike the decimal expressions (read from the left) of two
numbers are, the closer together the numbers are. To be precise about this, we will suppose that
we have two numbers V and v, of the same order of magnitude, and that their largest place value
components are equal, or the two largest ones, or perhaps more. There are two ways of
formulating this: we can specify the number of decimal components that agree, or we can specify
the order of magnitude of the largest decimal components that differ. It is easy to convert one type
of information into the other, but we distinguish them because they reflect the two points of view relative versus absolute error.
We can use the decompositions afforded by division-with-remainder by some power of 10, as
described in section 2D on division, to describe the relationship between V and v. We suppose that
their common order of magnitude is m, and that their place value components of magnitude greater
than ℓ agree. This means that, if we divide-with-remainder them both by 10ℓ+1, the Iquotients will
be the same. In other words, we can write
V = q10ℓ+1 + R
and v = q10ℓ+1 + r
((m,ℓ)Decomp)
The common Iquotient q will have order of magnitude m - ℓ - 1, which means that it has m - ℓ
decimal places. The remainders R and r have order of magnitude ℓ or less: they are less than
10ℓ+1.
In this situation, the following simple statement holds.
Decimal Estimation Theorem (DET): Suppose that V and v are two numbers of magnitude m,
and suppose their decimal components of magnitude ℓ or larger are equal. (That is, the largest
m - ℓ + 1 decimal components of V are equal to the corresponding components of v.) Let their
common Iquotient when divided-with-remainder by 10ℓ be q. Then:
i) The error e = | V - v | has order of magnitude at most ℓ - 1, and hence is less than 10ℓ.
e 1
1
ii) The relative error < . In particular, regardless of q, it is always less than m−l .
V q
10
An important case of the DET is rounding down, also sometimes called front end estimation.
This involves ignoring, or more precisely, replacing by zero, the smaller place value
components of a number. For example, we might approximate 7452 by 7450 or by 7400, or by
7000, depending on the relative importance of accuracy and simplicity of thought.
The relative error of approximating 7452 by 7450 is
7452 − 7450
2
1
1
=
<
<
7452
7452 3700 745
The last amount is the estimate given by the DET, with m = 3 and ℓ = 1. The reason the actual
relative error is so much smaller than the estimate provided by DET is because the actual ones
digit is 2, but it potentially could be as large as 9, and the DET must work for all possible ones
digits.
29
The relative error of approximating 7452 by 7400 is
7452 − 7400
7452
=
52
1
1
<
<
7452 140 74
The last amount is the estimate provided by DET when m = 3 and ℓ = 2. If we replace 7452 by
7000, the relative error is
7452 − 7000
452
1 1
=
< <
7452
7452 16 7
where the final amount is again provided by DET, with m = 3 and ℓ = 3. We note that the
estimates provided by part i) of DET for our example 7452 are better than the worst case
estimates provided by part ii). These would be 1/100, 1/10 and 1/1, rather than 1/745, 1/74 and
1/7 respectively.
We hope that these examples provide enough hints about how one might prove the DET in
general.
Limits of Accuracy in Practice
The DET expresses in a concise way the sense in which knowing the beginning (the largest
several decimal components) of the decimal expression of a number tells us most of what we
want to know about the number as a magnitude. If we are number theorists, it may be
important to know all the decimal components of a number, in order to tell, for example, if it is
a perfect square, or if it is divisible by 7, or if it is prime. But if it is a number coming from a
measurement, then all we need to know, or can expect to know, is an approximate value, and
the largest decimal components supply this with great efficiency. Significant figures is a
common term that refers to the number of digits we are sure are correct when giving the result
of a measurement. If we are sure that we know the largest k decimal components of a number,
we say that the number is known to k significant figures. The DET (part ii)) tells us that if we
know a number to k significant figures, then the relative error between the measured values and
1
the “true" value is at most k −1 . So, for example, if we know a number to 4 significant
10
1
figures, we know it with a relative error of less than
.
1000
In fact, it is rather rare to know a measured number with such accuracy. Take the example of
the “radius" of Earth. It is approximately 4,000 miles, but sometimes you will see figures such
as 3,928 miles. However, the last digit does not have a clear meaning. In speaking of the radius
of Earth, we are pretending that Earth is a perfect sphere. However, it is not. It deviates from
being a perfect sphere in three ways. First, because of its rotation, it is slightly oblate –
flattened at the poles, and thickened around the equator. Second, it has bumps and dimples Mount Everest and the Challenger Deep. Third, because of the motion of its liquid interior, it is
slightly deformed, with a bulge in the north Pacific. These imperfections mean that it does not
make sense to speak of a “radius" of Earth more accurately than about 10 miles. Thus, the
radius of the Earth is a number defined only to 3 significant figures. Most other “real-life"
numbers have similar limitations. The results of polls are typically accurate only to about +3%,
which is slightly better than one significant figure. Although the U.S. Census reports state
populations as exact numbers of people, in the millions (6 significant figures) or the 10s of
millions (7 significant figures), it is lucky if these numbers are accurate to 3 figures.
30
In summary, the notion of significant figures successfully captures many of the key notions of
error and approximation for decimal numbers. Especially, it provides an easy way to deal with
relative error. For many purposes, it suffices to know a number to one significant figure. For
most purposes, two significant figures are enough, and it is rare to know a “real-life" number to
more than 3 significant figures. (Financial transactions can be exceptions to these rules; but it
is also debatable how “real-life" the numbers involved in them are.)
Scientific Notation
Scientists and others who use numbers resulting from measurements are mainly interested,
first, in their size, and second, in the accuracy with which they are known. The way of writing
numbers known as scientific notation is designed to exhibit these two points very clearly.
Formally, scientific notation is simply a variant way of writing decimal fractions. Instead of
writing a decimal fraction d in the conventional way, with a certain number of places to the left
of the decimal point, and a certain number to the right, scientific notation rewrites it in the
form
d = (d÷10m) × 10m
where m is the order of magnitude of d, so that d÷10m is between 1 and 10. (A slight variant,
used in IBM computers, writes d = (d÷10m+1) × 10m+1.)
In this form, scientific notation separates out a key feature of a number, namely its magnitude,
and displays it prominently. A further convention frequently used by scientists (although it is at
odds with standard mathematical notation) is to report only significant digits when recording a
number in scientific notation, in order to be precise about the accuracy to which the number is
known. (Also, measurements are usually presented rounded to the nearest significant digit,
rather than rounded down, as assumed in the DET. This is a minor issue.) By this convention,
2.3 × 104 means some number known to be between 22,500 and 23,500, while 2.30 × 104
means some number known to be between 22950 and 23050. With this understanding,
scientific notation uses exponents and the idea of significant digits to express in a very direct
way the size and accuracy of a “real-life" number. The DET (modified slightly to account for
the rounding convention usually used in practice) tells us tells the maximum possible relative
error in a number with given significant digits.
4. Decimal Fractions.
If place value is firmly understood, and also the basic ideas of fractions are in place, the
extension of the decimal system from whole numbers to decimal fractions should be
straightforward. We will discuss some details here. In this treatment we will assume a mature
understanding of the nature of fractions and division. Thus, our formulation would probably
not be immediately usable with students first meeting decimal fractions. However, we hope it
provides some guidance for a discussion at that level. As with whole numbers, decimal
fractions are usually introduced a few decimal places at a time, and this makes it easier to be
concrete, discuss particular cases and do examples.
Basic Properties of Fractions
We will assume that the following basic understandings about fractions, multiplication and
division are in place.
1) Fraction notation expresses numbers as the result of a process of division. More precisely,
31
b
signifies “b divided by a,” or, the result of dividing b by a. (Technical
a
terminology: b is the numerator and a is the denominator.) This makes sense for any two
numbers b and a, not just whole numbers. In particular, a and b themselves could be fractions.
b
However, when b and a are whole numbers, this does denote the fraction
in the usual
a
colloquial sense. This is the mature understanding of fraction notation6. It is not clear how
many students achieve this final conceptualization of fractions. However, it seems unlikely that
one can advance very far in algebra without such an understanding, so if this essay can serve to
draw attention to this important stage in understanding fractions, it will be well pleased.
the symbol
2) Division by a number b is the same operation as multiplying by the reciprocal (or
1
multiplicative inverse) of b. The reciprocal is often written as , since this is indeed the result
b
a
1
of dividing 1 by b. Thus = a ⋅ . The exponential notation b-1 is also often used
b
b
1
instead of .
b
1 1 1
= ⋅ . That is, the reciprocal of a product is the product
bc b c
of reciprocals. We will need to use this often. (This is a consequence of the more fundamental
Rules of Arithmetic7, but we will not give the derivation here.)
3) We will also record the identity
Definition of Decimal Fractions
If we understand these points, decimal fractions (a term which unfortunately is usually
a
shortened to “decimals") are easy to define: a decimal fraction is a fraction of the form m
10
where a and m are whole numbers (or zero). That is, a decimal fraction is a fraction in the
colloquial sense, whose denominator is a power of 10.
Although decimal fractions include all whole numbers, there are many numbers which can not
be expressed as decimal fractions. Especially, general fractions are not decimal fractions. The
first example is 1÷3. It follows that the system of decimal fractions is not closed under division
(since it includes whole numbers, so if it were closed under division, it would include all
b
quotients
for any whole numbers b and a).
a
Since decimal fractions are just a restricted kind of fraction, and do not allow us to do
arithmetic freely (since they are not closed under division) why should we pay special attention
to them? There are two redeeming features of decimal fractions which have led to their
b
6
There seem to be at least two earlier conceptions of the fraction a . The first and least productive is that it means “b out of
a". That is, one selects b elements from a set of a elements. At this level, a fraction is not even a number, it is a relationship
between two (whole) numbers. In the next stage of understanding fractions, the fraction is related to a unit, and it is understood
that the a things are equal pieces of the unit. When the fraction is related to a unit, it can be conceived of as a quantity in itself,
that is, as a number, and it can be used in arithmetic. However, this usage may not be very flexible. The rules for operating
with fractions may be somewhat mysterious, and it may be hard to bring the rules of arithmetic to bear. In particular, the
Associative Rule for multiplication is not readily usable.
7
See footnote, page 1.
32
widespread (and since the advent of electronic calculators, more and more dominant) use. They
are
i) The computational algorithms developed for base ten whole numbers extend with almost no
change to allow us to compute with decimal fractions. Since arithmetic with general fractions
seems to cause trouble for large numbers of people, the relative simplicity of computing with
decimal fractions is very attractive.
ii) Any real number can be approximated as closely as one wishes by decimal fractions. This is
often expressed by saying that the decimal fractions are dense in the real numbers.
Furthermore, the approximation process is straightforward, in the sense that the same “greedy
algorithm" that works for whole numbers (see section 2D) applies in general.
This combination of familiar, effective computational methods with the ability to approximate
means that one can do approximate arithmetic to arbitrary accuracy with arbitrary numbers, so
for practical purposes, working with decimal fractions allows us to compute anything we want.
In fact, as we can see from the DET of §4, the accuracy of decimal approximations increases
rapidly with the number of digits of accuracy, so that we can usually get enough accuracy
without incurring an onerous computational load.
Computing with Decimal Fractions, I
Let us look at computing with decimal fractions. We will only discuss addition and
multiplication. Subtraction and division are similar.
c
d
and f = l . Multiplying e and f can be done using the Any Which
k
10
10
Way Rule for multiplication, combined with the properties 2) and 3) above. We see that
Suppose we have e =
e×f=
c
1
1
1
1
d
× l = (c × k ) × (d × l ) = (c × d ) × ( k × l )
k
10
10
10
10
10 10
1
1
c×d
= (c × d ) × ( k
) = (c × d ) × ( k +l ) =
l
10
10k + l
10 ×10
(MultDec)
Here we have inserted times signs to be very explicit that we are doing multiplication.
To add two decimal fractions, as for any two fractions, we need to put them over a common
denominator. This is easier than for two arbitrary fractions, since one of the powers 10k and 10ℓ
divides the other. More precisely, suppose that k ≤ ℓ. Then by the Law of Exponents, we know
c
c ×10l − k
c ×10l − k
that 10ℓ = 10k×10ℓ-k, so that we may write k = k
c÷10k = (c × 10ℓ-k ) ÷
=
l−k
l
10 10 ×10
10
(10k×10ℓ-k ) = (c × 10k )÷10ℓ . This allows us to add:
e+ f =
c
d
c ×10l − k
d
c ×10l − k
+
=
+
=
10k 10l
10l
10l
10l
(AddDec)
Since we know how to do the integer arithmetic to compute the numerators of the final
expressions in equations (MultDec) and (AddDec), these formulas do in fact provide practical
recipes to add and multiply decimal fractions. Here is an example of how they work:
33
take
e=
47
10
then
e× f =
and
e+ f =
and
f =
7452
100
47 × 7452 350244
=
1000
1000
47 ×10 + 7452 470 + 7452
.
=
100
100
Decimal Expansions
However, as they are stated above, the formulas MultDec and AddDec are still somewhat
abstract. They do not tell us what is special about decimal fractions. We could establish quite
similar formulas for fractions with denominators which are powers of any number.
The reason for singling out denominators which are powers of 10 is of course because we use
base ten expansions to write whole numbers, and dividing by powers of 10 is highly
compatible with this system. We know that the base ten system writes every number as a sum
of place value numbers – digits times powers of 10. When we divide by powers of ten, the
numbers we get will still be writable in this way, except that to express the fractional part of
the quotient, we will have to use digits divided by powers of 10. Thus for example, the
numbers used in the calculations above are
47 40 + 7 40 7
7
=
=
+ = 4+
10
10
10 10
10
and
7452 7000 + 400 + 50 + 2 7000 400 50
2
5 × 10
2
5
2
=
=
+
+
+
= 70 + 4 +
+
= 70 + 4 + +
100
100
100 100 100 100
10 ×10 100
10 100
d0
1
100
The base ten system for writing whole numbers is based on the denominations
d1
d2
d3
d4
d5
d6
10
100
1000
10,000
100,000
1,000,000
101
102
103
104
105
106
…
…
…
To express a given whole number, we multiply these denominations by appropriate digits, and
sum up.
As the above examples illustrate, we can extend this system to decimal fractions by extending
the system of denominations to include reciprocals of powers of 10:
…
d-3
d-2
d-1
d0
d1
d2
d3
…
…
1
10
100
1000
…
1
1
1
1000
100
10
1
100
101
102
103
…
…
1
1
3
10
2
10
10
34
Just as in the whole number context, to express an arbitrary decimal fraction, we multiply
certain of these denominations by appropriate digits, and sum up. We will call the numbers u × dk,
where u is a digit, place value numbers, as before, but now we will let k be negative as well.
(When k is negative, it would be appropriate to refer to “fractional place value numbers.”)
Then, as with whole numbers, every decimal fraction is a sum of place value numbers. This is
what allows us to extend our calculation procedures in a straightforward fashion from whole
numbers to decimal fractions.
We started this essay with place value notation for whole numbers. Since the base ten
expansion of a number expressed it as a sum of place value numbers, and the place value
numbers are digits times powers of 10, to know the number it is enough to know the digit for
each power of 10, and this is what the place value notation tells us: it writes the sequence of
digits, from right to left, beginning with the digit for 1 = 100. The location of a digit in this
sequence tells us the power of 10 by which it should be multiplied.
We now see that decimal fractions have a similar expansion: they can be written as sums of
digits times powers of 10, possibly using negative as well as positive powers. A given decimal
fraction is determined by the digits which multiply the powers of 10. (Of course, these digits
are non-zero only for finitely many powers.) These digits naturally make a sequence, with the
position of a digit in the sequence determined by which power of 10 it multiplies. When we
were dealing with integers, this sequence had a natural start, namely the ones place (order of
magnitude zero), and progressed to infinity to the left. When we deal with fractions, however,
the sequence can extend arbitrarily far in both directions. There is no longer any starting point.
So, to know which digits go with which powers of 10, we have to show explicitly where to
find the order of magnitude zero (the ones place). Our standard convention for doing this is to
1
place a mark, the decimal point, between the digit for 100 = 1 and the digit for 10−1 = . This
10
locates the digits. The digits to the left of the decimal place multiply non-negative powers of
10, as before, and the digits to the right multiply the negative powers. For our example
numbers from above, we have
47
7452
= 4.7 and
= 74.52
10
100
When we write a number like this, multiplying it by 10 is extremely easy to do. Multiplying by
10 turns each denomination into the next larger one, so each digit now should multiply the next
larger denomination, so it should get shifted one space to the left, relative to the decimal point.
This amounts just to shifting the decimal point one space to the right. Similarly, division by 10
can be recorded by shifting the decimal place one space to the left.
Decimal Expansions as an Address System on the Number Line
A good way to think about the decimal expansion is that it gives an address system for locating
numbers on the number line. This gives a geometric way to picture how a number is
approximated by the sum of its leading decimal components. Consider a number
4
5
2
f 7452
, which is 7.452 in conventional notation. Let’s
like f ′ = =
= 7+ +
+
10 1000
10 100 1000
locate f′ on the number line. Here is a piece of the number line that goes from 0 to 10:
35
7.452
Figure 10: The Number Line (from 0 to 10)
0
1
2
3
4
5
6
7
8
9
10
The 7 in 7.452 tell us that that f′ is between 7 and 8. Thus, it is somewhere in the 8th unit
interval in this piece of the number line.
To locate f′ more precisely, we look at the next place in its decimal expansion. We divide the
interval between 7 and 8 into ten equal subintervals.
Figure 11: The Number Line from 0 to 10,
showing 1/10ths between 7 and 8
0
1
2
3
4
7.4
5
6
7.452
7.5
7
8
9
10
The 4 in 7.452, representing 4÷10, tells us that f′ is located in the fifth of these subintervals,
between 7.4 and 7.5.
To locate f′ still more accurately, we should next divide up the interval between 7.4 and 7.5
into 10 equal subintervals.
Figure 12: The Number Line from 7 to 8, showing 1/10ths between 7 and 8,
and 1/100ths between 7.4 and 7.5
7.452
7
7.1
7.2
7.3
7.45
7.46
7.4
7.5
7.6
7.7
7.8
7.9
8
We see that the divisions are getting rather fine, almost too fine for us to see the subintervals!
This is graphic proof of the rapidity with which the decimal expansion approximates a number.
If, however, we magnify the interval between 7.4 and 7.5, then we can see what happens at the
next step.
Figure 13:
The Number Line from 7.4 to 7.5
7.4
7.41
7.42
7.43
7.452
7.44
7.45
36
7.46
7.47
7.48
7.49
7.5
Notice that when we expand the interval from 7.4 to 7.5 by a factor of 100, it looks just like the
interval from 0 to 10, except that the labels are changed. The division points 7.40, 7.41, 7.42,
etc., are seen to divide the interval into 10 equal subintervals. The next place value component,
that is the 5 in 7.45, tells us that f′ is in the subinterval between 7.45 and 7.46. Finally, the last
decimal component locates f′ in one of the 10 equal subintervals between 7.45 and 7.46. We
are now working at distances which we cannot distinguish visually. We would need a
microscope to really see what is going on. For practical purposes, knowing four terms of the
decimal expansion of a number locates it to a high degree of precision. However, in principle,
decimal fractions can be carried out to any number of places. The number π has been
computed to several billions of decimal places. This is a degree of accuracy that is
unimaginable. Because of the arbitrary fineness of the divisions produced by the decimal
fractions, any number can be located with arbitrary accuracy on the number line, by a decimal
address, using the procedure described above. More precisely, we can consider that the decimal
expansion 7.452 does not locate a given number, but serves as an approximate address for any
number in the interval between 7.452 and 7.453. This is very much the way the idea of
significant digits is used for recording “real life" numbers.
Computing with Decimal Fractions, II
Now let’s discuss some more the methods for calculation with decimal fractions. The key facts
which shape decimal arithmetic are the formulas for addition of two place value numbers of
the same order of magnitude, and for the product of two place value numbers. We may
summarize them as follows:
u × 10k + v × 10k = (u + v) × 10k ,
and
(u × 10k ) × (v × 10ℓ ) = (u × v) × 10k+ ℓ
In these formulas, u and v are digits, and k and ℓ are whole numbers. In performing addition,
the next step is to decompose u + v into its place value components, and use this to rewrite
(u + v) × 10k as a sum of place value numbers. Similarly, in performing multiplication, the next
task is to decompose the product u × v into its place value components, and use this to rewrite
(u × v) × 10k+ ℓ in terms of its place value components. These procedures both look exactly
the same when we allow negative exponents for the powers of 10. As a consequence, the
arithmetic of decimal fractions is essentially the same as whole number decimal arithmetic.
We will look at each operation in a little more detail. We start with addition. The key fact
about the denominations that gives shape to addition of base ten numbers is that each
1
successive denomination is 10 times the previous one, and, reciprocally, is
of the next one:
10
1
dk+1 = 10dk and dk-1 = d k , for all k ≥ 0.
10
This basic relationship between successive denominations persists when we enlarge the list of
denominations to include the dk for k < 0. This is what guarantees that the procedure for
addition of decimal fractions will be essentially the same as for whole numbers. To be able to
formulate the recipe in general, we should extend the terminology of order of magnitude to the
enlarged system. Thus, we will say that any place value number a × dk has order of magnitude
k, for k positive or negative (or zero). With this agreement about language, we can give exactly
the same recipe for adding decimal fractions as for adding base ten whole numbers, except that
we can no longer assume that magnitude zero is the smallest order of magnitude.
37
1. For each order of magnitude, add the place value numbers of
that order of magnitude from each summand.
2. If the sum of the digits for a given order k of magnitude is 10
or more, break the sum into its place value components:
(u + v) × d k = (10 + w) × d k = d k +1 + g × d k .
Combine the dk+1 with the components of magnitude k + 1.
If one wishes to minimize the amount of rewriting, one would start with the smallest order of
magnitude for which there are non-zero components, and proceed upwards one order of
magnitude at a time. This gives the standard procedure. However, in mental math, dealing with
numbers with only a few non-zero place value components, the additions might be done in a
different order. If one only wants an approximate answer, one would start at the other end, with
the largest components, and stop after a few places.
For multiplication, the basic facts governing the form of the computational algorithm were the
(extended) Distributive Rule and the formula for multiplying two place value numbers, as
given above. That formula has two basic ingredients. First we need to know how to compute
the product u × v of digits. These are the multiplication facts. The other part of the product
involves multiplying two denominations. This is governed by the Law of Exponents:
dk d ℓ = d k+ℓ , or 10k10ℓ = 10k+ℓ for all k ≥ 0 and ℓ ≥ 0
In fact, this formula describing the behavior of the dk under multiplication extends to all k and
ℓ, positive or negative. For example,
d-2d4 =
1
10, 000
× 10,000 =
= 100 = d2 = d −2+ 4 .
100
100
A very elegant way of saying this is to extend the exponential notation to all integers, negative
as well as non-negative. Assuming that we already understand 10k as the product of k tens for k > 0,
we also define
1
100 = 1 and 10-k = k for k > 0.
10
With these definitions, we can check that
The Law of Exponents is valid for all integers.
Thus, the analogy between addition and multiplication displayed by exponential notation
extends to include a parallel between subtraction and division.
With these observations, we have essentially arrived at the usual recipes for decimal
arithmetic. For addition (or subtraction) do the usual thing of placing one number under the
other, making sure to line up the decimal points. This will ensure that you are adding like
powers of 10, and then you can proceed as if dealing with whole numbers. The factor of 10ℓ - k
appearing in the final formula of (AddDec) is what is accomplishing the lining up of decimal
points in that computation.
38
For multiplication, shift the decimal place to the far right of each number. This amounts to
turning the decimal fractions into whole numbers by multiplying by a suitable power of 10.
Then do your computation using the normal procedure for whole numbers. Finally, divide by
the product of the powers of 10 you previously multiplied by. The Law of Exponents says that
this amounts to shifting the decimal place in the product by the sum of the number of places
you shifted each factor. This is exactly what is reflected in the formula (MultDec). The
denominators c and d are the original two numbers with their decimal points shifted to the far
right; dividing their product by 10k+ℓ then shifts the decimal point back to where it should be.
Let’s illustrate these ideas by doing the example calculations above using place value
components rather than converting to whole number computations, as in (AddDec) and
(MultDec). We can do addition by combining components of the same order of magnitude and
regrouping/carrying just as when dealing with whole numbers:
47 7452
7
5
2
7 5
2
) = 70 + (4 + 4) + ( + ) +
+
= (4 + ) + (70 + 4 + +
10 100
10
10 100
10 10 100
12 2
10 + 2 2
2
2
= 70 + 8 + +
= 70 + 8 +
+
= 70 + 8 + (1 + ) +
10 100
10
100
10 100
2
2
2
2
7000 900 20
2
= 70 + 8 + 1 + +
= 70 + 9 + +
=
+
+
+
10 100
10 100 100 100 100 100
7000 + 900 + 20 + 2 7922
.
=
=
100
100
First, notice that the answer is the same as we got using the general formula of (AddDec). What
is more important, however, is that the details of the computation are essentially the same as in
the general formula; for to compute the numerator of the formula in (AddDec) by the usual
rules, we proceed as follows:
47 × 10 + 7452 = 470 + 7452 = (400 + 70) + (7000 + 400 + 50 + 2) = 7000 + (400 + 400) + (70 + 50) + 2
= 7000 + 800 + 120 + 2 = 7000 + 800 + (100 + 20) + 2 = 7000 + (800 + 100) + 20 + 2
= 7000 + 900 + 20 + 2 = 7922.
If we compare this computation with the previous one, we see that the steps of one correspond
closely to the steps of the other. The only differences are those used to change the form of
some power of 10. Thus, in the second calculation, we take a step at the beginning to convert
2
as
47×10 to 470, and in the first calculation, we take a step to rewrite the terms 70, 9 and
10
fractions with denominator 100, and another to combine them into one sum. However,
additions of the terms of the same order of magnitude, and the regroupings of the sums which
add to 10 or more (there is only one such sum in this example) correspond precisely.
39
Here is the multiplication:
47 7452
7
5
2
)
×
= (4 + ) × (70 + 4 + +
10 100
10
10 100
5
2
= 4 × 70 + 4 × 4 + 4 × + 4 ×
10
100
7
7
7 5 7
2
+ × 70 + × 4 + × + ×
10
10
10 10 10 100
20 8
= 280 + 16 + +
10 100
28 35
14
+49 + +
+
10 100 1000
48 43
14
= 280 + 65 + +
+
10 100 1000
280, 000 + 65, 000 + 4,800 + 430 + 14 350, 244
.
=
=
1000
1000
In this computation, the second and third lines are the expansion of the product into an array
according to the Extended Distributive Rule. The fourth and fifth lines compute the indicated
products of place value numbers from lines two and three. Note that the powers of 10 in this
array are constant along the upward slanting diagonals. The sixth line displays the sums along
these diagonals. Finally, the seventh line puts all terms over the common denominator of 1000,
and adds all the summands. Note that this calculation is completely parallel to the
multiplication of the numerators (i.e., 47×7452), but is slightly more complicated due to the
appearance of decimal components for negative powers of 10. The alternative computation, to
just compute the product of the numerators, then divide by the product of the denominators
(i.e., by 1000), would have been simpler. This latter computation is the usual recipe. The final
step, of dividing by 1000, corresponds to the recipe of where to put the decimal point.
5. Decimal components and estimation in arithmetic.
Thinking in terms of relative place value also provides efficient ways of estimating sums and
products. We will avoid exact details here, to keep the discussion short, but the principle is
clear: when estimating, focus on the largest decimal components. With our understanding of
how decimal components enter into arithmetic, we can describe what to do to get an estimate
with desired accuracy for a sum or product.
Estimating Sums
For a sum, simply retain the sums of the largest decimal components. The three largest
components of the larger summand, and the components of the same magnitudes for the other
summand, will give an approximation to the sum with relative error small enough for most
purposes. For example,
83, 244 + 5, 293 83, 200 + 5, 200 88, 200
Here the exact sum is 88, 537, so the relative error is
40
88,537 − 88, 400
88,537
=
137
1
<
.
88,537 600
Error Propagation in Addition
Absolute error at worst adds under addition. That is, if u is an approximation to U, with
absolute error e1=|U-u|, and v is an approximation to V, with absolute error e2=|V-v|, then the
absolute error of u+v as an approximation to U+V is at most e1+ e2. This follows from the
Triangle Inequality for absolute value:
|(U+V) – (u+v)| = |(U-u) + (V-v)| ≤ |U-u| + |V-v| = e1+ e2.
A very pleasant fact is that relative error does not increase under addition, provided that all
summands are positive. We assume that we are adding positive numbers U and V. Suppose that
u approximates U with absolute error e and relative error δ. Similarly, suppose that v
approximates V with (at most) the same relative error. One can show that relative error is
preserved in addition. That is, the relative error with which u+v approximates U+V is also at
most δ. We will leave the verification of this as an exercise.
Error Growth in Subtraction
When working with positive numbers, the operation in which relative error may increase is
subtraction. In fact, in subtraction, one may completely lose control over relative error: it can
become arbitrarily large. This is because the difference of two large numbers may be small,
much smaller than either of the numbers; but when this happens, there is no reason for the
error of approximation to have shrunk. Since to compute relative error, we divide by the
number being approximated, the quotient defining relative error may be quite large. In fact, if
we take two different approximations u1 and u2 to the same number U, then the difference u1 u2 is supposed to be an approximation to U - U = 0. Clearly, if u ≠ u2, the relative error is
infinite. More generally, if u and v are approximations of U and V, each with absolute error e
or less, and if the difference between U and V is less than e, then the number u - v can be any
sufficiently small number. In particular, it could be any number of absolute value not more
than e. Thus, one has essentially no control over U - V, except for a bound on how large it can
be. For this reason, it would be a very dicey proposition to use u - v in any further calculation
involving U - V, particularly one that involved dividing by U - V. In less extreme situations,
one may retain some information about U - V, but with a substantial loss of significant digits.
Estimating Products
To discuss estimation of products, we recall that a product of decimal numbers is the sum of all
the products, of any place value component of one factor times any component of the other
factor. Of these summands, the largest will be the product of the largest decimal components of
each factor. The next largest will be the two terms gotten by taking the product of the second
largest component of one factor, with the largest component of the other. To illustrate this,
look again at the example from section 2C, the product 36×487. The largest product of place
value components is 12,000 = 30×400. The two next largest summands are 30×80 = 2400 and
6×400 = 2400. The sum of these three terms is 16,800. If we use this as an estimate for the
17,532 − 16,800 1
product, which is 17, 532 the relative error is
<
, or less than 5%. If we want
17,532
20
more accuracy, we would take the next largest summands, namely 30 × 7 = 210 and 6 × 80 =
480. Adding these to the 16,800 gives 17,490. The relative error is now
17,532 − 17, 490
1
<
.
17,532
400
41
The situation can be understood nicely by using the lattice model for multiplication. Imagine
the products of decimal components arranged in an array of boxes, as in section 2C. The
largest product is in the upper left hand box. The next largest products are on the diagonal
adjacent to this box. The products next again in size are located on the next diagonal down or
to the right, and so forth. For many purposes, taking the largest six terms, on the three upper
left diagonals, will provide a sufficiently accurate approximation to the exact product.
Figure 14: Approximating Products Using Lattice Multiplication
(Example)
36 X 487 :
12,000
2,400
210
2,400
480
42
First Cut: 12,000
Second Cut: 12,000 + 4,800 = 16,800
Third Cut: 12,000 + 4,800 + 690 = 17, 490
When more accuracy is needed, a straightforward continuation of the above procedure
prescribes the next terms to add. The upshot is a method to get a (relatively) very accurate
approximation to a multidigit multiplication with fairly little work.
Figure 15: Approximating Products Using Lattice Multiplication
(General Scheme)
First
Cut
2nd
Cut
3rd
Cut
2nd
Cut
3rd
Cut
4th
Cut
3rd
Cut
4th
Cut
4th
Cut
4th
Cut
Caveat: because of the rollover phenomenon (see section 2B), we cannot guarantee that the
approximate answers we compute by the above methods will agree in any decimal places with
the true answer. However, we can show that they do approximate well the true answer, in the
sense that they produce a small relative error. Usually, in a sense that can be made precise, they
will have the same leading decimal components as the exact answer.
42
6. Decimal Components and Algebra.
The Rules of Arithmetic Rule in Algebra
Discussing arithmetic in terms of decimal components should help prepare students for algebra
in several ways. One way is that it would create increased awareness of and facility with the
Rules of Arithmetic, which are also the rules for manipulating algebraic expressions. (And
essentially, they are the only rules, so they provide a remarkably compact summary of what
constitutes legitimate algebraic manipulation.) Also, the Rules themselves are most succinctly
expressed via symbolic equations, so they would give students exposure to the ideas of literal
symbolism and variables. In these ways, this approach should promote increased confidence
and facility with algebraic manipulation.
Base Ten Numbers as Polynomials in 10
Another way in which working with decimal components would prepare students for algebra is
that it would bring out the deep analogy between decimal arithmetic and algebra. As we
remarked at the outset, decimal notation exploits algebra in the service of arithmetic. Decimal
numbers can be usefully thought of as “polynomials in 10.” Emphasizing this connection
should shed light on arithmetic, and also make algebra more familiar and learnable.
Here are some examples of the analogy. Consider the numbers 21 and 13. We can compute
21 + 13 = 34, and 21 × 13 = 273.
Now consider the expressions 2x + 1 and x + 3. We can also compute, using the same rules as
we did for regular arithmetic, that
(2x + 1) + ( x + 3) = 3x + 4 and (2x + 1) × ( x + 3) = 2x2 + 7x + 3.
In fact, if we do these computations step-by-step, using the Rules of Arithmetic carefully, we
will see that they are completely parallel:
21×13 = (20 + 1 )× (10 + 3) = (2 × 10 + 1) × (10 + 3)
= (2×10) × 10 + (2 × 10) × 3 + 1 × 10 + 1 × 3
= 2 × (10 × 10) + (2 × 3) × 10 + 10 + 3
= 2 × 100 + 6 × 10 + 10 + 3 = 200 + (6 + 1) × 10 + 3 = 200 + 7 × 10 + 3 = 200 + 70 + 3 = 273.
(2x + 1) × ( x + 3) = (2 × x + 1) × ( x + 3)
= (2 × x) × x + (2 × x) × 3 + 1 × x + 1 × 3
= 2 × (x × ) + (2 × 3) × x + x + 3
= 2 × x2 + 6 × x + x +3 = 2x2 + (6 + 1) × x +3 = 2x2 + 7 × x + 3 = 2x2 + 7x + 3.
In these computations, the first lines translate the expression into less compact form, making
the implicit multiplications and additions explicit. There is one more step in the case of the
base ten numbers, since the addition of terms is already explicit in the polynomial, but not in
43
the base ten numbers. The second line expands the product using the Extended Distributive
Rule. The third line simplifies the products of individual terms, using the Any Which Way
Rule for multiplication. The last line is then devoted to recompactifying the expressions. In the
middle step, the terms involving 10 or x are combined using the Distributive Rule. Again, there
is one extra step for the base ten number.
If students do some calculations like these, they should see parallels between the calculations
with decimal numbers and the calculations with polynomials. Discussion could bring out the
key to the analogy: that if we set x = 10, then the polynomial computations become the
arithmetic computations. Further investigation will show that the analogy is imperfect. For
example (2x + 4) + (3x + 7) = 5x + 11, whereas 24 + 37 = 61. Nevertheless, if we plug x = 10
into 5x + 11, we do get 50 + 11 = 61 as the value. Students who are used to thinking in terms
of decimal components should recognize 50 + 11 as the intermediate result we obtain in adding
24 + 37, before we combine ten 1s into a 10 to get the standard decimal form of 50 + 11. In
other words, polynomial arithmetic is decimal arithmetic without the regrouping process (and
hence, with arbitrarily large coefficients).
In this analogy, the order of magnitude of decimal numbers translates into the degree of a
polynomial: the largest power of x which appears in the polynomial with a non-zero
coefficient. Because of the absence of carrying in polynomial arithmetic, degree is in several
ways better behaved than order of magnitude. For example, the degree of the sum of two
polynomials is always less than or equal to the larger of their degrees. The degree of a product
of two non-zero polynomials is (exactly) the sum of the degrees of the two factors.
Just as order of magnitude guides us in dividing decimal numbers, degree is what we use to
find the quotient of two polynomials. The recursive procedure for finding the quotient of two
polynomials is quite analogous to the one described above for long division of decimals. Given
a ( x)
two polynomials a(x) and b(x), the highest degree term of the quotient
is the monomial
b( x )
c1xm such that a(x) - c1xm b(x) = a1x has degree lower than a(x). This term is quite easy to find:
it is exactly the quotient of the highest degree terms of a(x) and b(x). To find the next term in
the quotient, one uses the same procedure for a1x and b(x). This continues until one gets a
difference of degree less than the degree of b(x); this is then the remainder.
Identities in Arithmetic
The connection between polynomial arithmetic and ordinary arithmetic can be developed and
exploited in other ways. For example, the famous identity
x2 – y2 = (x + y)(x – y)
has many applications in arithmetic. One very serious contemporary application is its use as a
basis for algorithms to factor large numbers. On a more mundane level, it can be used to do
mental math. Take the product 43 × 7. This can be computed mentally by thinking of 43 = 45 - 2 and
47 = 45 + 2. Hence 43 × 47 = (45 + 2) × (45 - 2) = 452 - 22. To compute 452, we can use the
same trick: Write 40 = 45 - 5 and 50 = 45 + 5. Therefore, 452 = 40 × 50 + 52 = 2000 + 25, and
so, 43 × 47 = 40 × 50 + 25 – 4 = 2021. Similar tricks can be used to develop mental math skills,
and to find factorizations of fairly large numbers whose prime factors also are fairly large (i.e.,
no factors less than 20). Since this essay is already rather long, we will curtail further
examples. However, a variety of calculations revealing parallels between decimal arithmetic
and algebra are given in the Exercises.
44
On Beyond . . .
Regarding decimal numbers as “polynomials in 10” amounts to giving the variable x the value
10. This is called specialization. Specialization can be used to interpret other number systems
in terms of polynomials. The specialization which turns polynomial arithmetic into decimal
arithmetic amounts to requiring x to satisfy the equation x – 10 = 0. Polynomials can be made
to mirror other number systems by positing that x satisfies other equations. For example,
requiring x2 – 2 = 0 would make polynomials act like numbers of the form a + b 2 ; in other
words, x is imitating the irrational number 2 . Requiring x to satisfy x2 + 1 = 0 will essentially
create a copy of the complex numbers.
Here we are getting into fairly advanced ideas, drawing close to abstract algebra. It is
questionable whether the constructions of the last paragraph could be productively inserted
into the K-12 curriculum. However, it does seem desirable that high school teachers should be
conversant with these issues. Whether or not they get used in school mathematics, these
possibilities demonstrate the influence that place value has throughout the K-12 curriculum,
and point to the desirability of treating it more explicitly, and making it one of the “big ideas"
that help students arrange the mathematics they learn into a coherent whole.
References
[BP] S. Baldridge and T. Parker, Elementary Mathematics for Teachers, Septon Ash
Publishers, 2004.
[CFL] T. Carpenter, M. Franke and L. Levi, Thinking Mathematically: Integrating
Arithmetic and Algebra in Elementary School, Heinemann, Portsmouth NH 2003.
[KSF] J. Kilpatrick, J.Swafford and B.Findell, eds., Adding It Up: Helping Children
Learn Mathematics, National Academies Press, Washington, DC, 2001.
[Ma] L.Ma, Knowing and Teaching Elementary Mathematics, Lawrence Erlbaum Associates,
Mahwah, NJ, 1999.
[Ock] S. Ocken, Algorithms, Algebra and Access, www.nychold.com/ocken-aaa01.pdf
[Sing] Curriculum Planning and Development Division, Ministry of Education, Singapore,
Primary Mathematics 1A, 1B, US Edition, Federal Publications, Singapore.
45
Download