STUDENTS` UNDERSTANDING OF THE CONCEPT OF CHAIN

advertisement
STUDENTS’ UNDERSTANDING OF THE CONCEPT OF CHAIN RULE IN
FIRST YEAR CALCULUS AND THE RELATION TO THEIR
UNDERSTANDING OF COMPOSITION OF FUNCTIONS
A Thesis
Submitted to the Faculty
of
Purdue University
by
James Franklin Cottrill
In Partial Fulfillment of the
Requirements for the Degree
of
Doctor of Philosophy
May 1999
ii
To Cecelia, who arrived when this work was to begin;
To Stephen, whose arrival delayed its completion;
and
To Julie, my partner in love, life and faith.
iii
ACKNOWLEDGMENTS
I wish to thank my colleagues in RUMEC for their support, encouragement, and
assistance as I learned what it means to conduct research in this field. My special
thanks to Julie Clark, Dave DeVries, George Litman, Karen Thomas, and Draga
Vidaković for their friendship and insights.
I would like to thank Professors Keith Schwingendorf, David Mathews, and Guershon Harel for their willingness to serve on my committee.
I have deep gratitude for Professor Ed Dubinsky and his role in my life. As my
thesis advisor, he helped clarify my thoughts and patiently helped me through the
process. As my colleague, he was a brilliant example of how one can be a great teacher
of mathematics and researcher in education. Also, I thank him for his friendship and
support in my family’s life.
My children’s grandparents have endured years of separation with great patience
and love. I thank them for their support.
Finally, my special thanks to my wife, Julie Cottrill, for her patience and support in a project that involved relocating, transcribing, and months without visible
progress. Thank you.
iv
TABLE OF CONTENTS
Page
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
vii
ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
viii
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1.1
Purpose . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
1.2
Historical background . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
1.2.1
An overview of the development of calculus . . . . . . . . . . .
3
1.2.2
The Chain Rule . . . . . . . . . . . . . . . . . . . . . . . . . .
4
1.3
The RUMEC study . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6
1.4
An overview of this study . . . . . . . . . . . . . . . . . . . . . . . .
7
2 Theoretical Perspective and Research Paradigm . . . . . . . . . . . . . . .
9
2.1
Theoretical perspective . . . . . . . . . . . . . . . . . . . . . . . . . .
9
2.2
Schema development . . . . . . . . . . . . . . . . . . . . . . . . . . .
10
2.3
Paradigm for research . . . . . . . . . . . . . . . . . . . . . . . . . . .
11
2.4
Initial genetic decomposition . . . . . . . . . . . . . . . . . . . . . . .
11
3 Literature Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
13
3.1
On functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
13
3.2
Students’ understanding of function . . . . . . . . . . . . . . . . . . .
13
3.3
Composition of functions . . . . . . . . . . . . . . . . . . . . . . . . .
15
3.4
Difficulties with differentiation . . . . . . . . . . . . . . . . . . . . . .
15
4 Phase 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
17
4.1
Methods: Subjects . . . . . . . . . . . . . . . . . . . . . . . . . . . .
17
4.2
Instrument . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
18
4.3
Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
18
4.3.1
19
Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
v
4.3.2
Composition . . . . . . . . . . . . . . . . . . . . . . . . . . . .
22
4.3.3
Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . .
24
4.3.4
Chain rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
25
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
25
5 Results: Phase 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
26
4.4
5.1
Analysis 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
26
5.2
Analysis 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
27
5.3
An alternate analysis . . . . . . . . . . . . . . . . . . . . . . . . . . .
28
6 Phase 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
32
6.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
32
6.2
Subjects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
32
6.2.1
Procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
33
6.3
Instrument . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
33
6.4
Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
33
6.5
Where the RUMEC study left us . . . . . . . . . . . . . . . . . . . .
34
7 Results: Phase 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
36
7.1
The triad description revisited . . . . . . . . . . . . . . . . . . . . . .
36
7.1.1
Eli [A125] . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
36
7.1.2
Tim [A134] . . . . . . . . . . . . . . . . . . . . . . . . . . . .
39
7.1.3
Al [A137] . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
41
7.1.4
Ray [A123] . . . . . . . . . . . . . . . . . . . . . . . . . . . .
44
7.1.5
Peg [A132] . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
46
7.1.6
Jack [A113] . . . . . . . . . . . . . . . . . . . . . . . . . . . .
49
Using versus understanding . . . . . . . . . . . . . . . . . . . . . . .
50
7.2.1
Coding the aspects of understanding . . . . . . . . . . . . . .
50
7.2.2
Comparing the codes . . . . . . . . . . . . . . . . . . . . . . .
54
8 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
56
7.2
8.1
Revised genetic decomposition . . . . . . . . . . . . . . . . . . . . . .
56
8.2
Understanding of composition and understanding of the chain rule . .
58
vi
8.3
Comparing instructional methods . . . . . . . . . . . . . . . . . . . .
58
8.3.1
Understanding the chain rule . . . . . . . . . . . . . . . . . .
58
8.3.2
Using the chain rule . . . . . . . . . . . . . . . . . . . . . . .
59
8.3.3
Finding a balance . . . . . . . . . . . . . . . . . . . . . . . . .
60
8.4
Quantitative versus qualitative data . . . . . . . . . . . . . . . . . . .
60
8.5
Limitations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
61
8.6
Instructional strategies . . . . . . . . . . . . . . . . . . . . . . . . . .
61
LIST OF REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
63
APPENDICES
A Questionnaire from Phase 1 . . . . . . . . . . . . . . . . . . . . . . . . . .
69
B Phase 1 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
80
C Interview Guide for Phase 2 . . . . . . . . . . . . . . . . . . . . . . . . . .
84
C.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
84
C.2 Review of function and composition . . . . . . . . . . . . . . . . . . .
84
C.3 Classifying the 10 derivative problems . . . . . . . . . . . . . . . . . .
85
C.4 Establishing the notion of chain rule . . . . . . . . . . . . . . . . . .
85
C.5 Extending the notion . . . . . . . . . . . . . . . . . . . . . . . . . . .
86
VITA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
88
vii
LIST OF TABLES
Table
Page
5.1
Descriptive statistics for variables . . . . . . . . . . . . . . . . . . . .
28
5.2
Correlation statistics for variables . . . . . . . . . . . . . . . . . . . .
28
5.3
Regression model with PGPA . . . . . . . . . . . . . . . . . . . . . .
29
5.4
Regression model without PGPA . . . . . . . . . . . . . . . . . . . .
30
5.5
Comparison of courses . . . . . . . . . . . . . . . . . . . . . . . . . .
30
5.6
Comparison of Traditional and any C4 L . . . . . . . . . . . . . . . . .
31
5.7
Comparison of first quarter course . . . . . . . . . . . . . . . . . . . .
31
6.1
Matrix of interviews . . . . . . . . . . . . . . . . . . . . . . . . . . .
33
7.1
Aspects of understanding . . . . . . . . . . . . . . . . . . . . . . . . .
55
APPENDICES
Table
B.1 Distribution of codes by item . . . . . . . . . . . . . . . . . . . . . .
81
B.2 Student codes in tuples . . . . . . . . . . . . . . . . . . . . . . . . . .
82
B.3 Combined code data . . . . . . . . . . . . . . . . . . . . . . . . . . .
83
viii
ABSTRACT
Cottrill, James Franklin, Ph.D., Purdue University, May, 1999. Students’ Understanding of the Concept of Chain Rule in First Year Calculus and the Relation to
Their Understanding of Composition of Functions. Major Professor: Ed Dubinsky.
The present study is a follow-up to a study conducted by this author with seven
others. That study (Clark et al., 1997) proposed the use of Piaget and Garcia’s
triad mechanism to describe the development of a schema, specifically with respect
to students’ understanding of the chain rule. This study was designed to collect
data to test this theory. Both quantitative and qualitative methods were employed.
Data were collected via a questionnaire on a group of calculus students’ (n = 34)
knowledge of and skill with function, composition of functions, differentiation and
chain rule. The data were analyzed to investigate how their performance on the
composition of functions items related to that of the chain rule. No strong correlation
is reported. Some of the subjects had experienced a reform calculus course which used
computer experiences and cooperative learning. A comparison was made between the
two groups, although no overall differences are reported. Follow-up interviews based
on the questionnaire responses were conducted with six subjects. These results are
positive in the sense that the triad description fits with the data. We present detailed
descriptions of the Intra-, Inter-, and Trans- levels of the development of the chain
rule schema. Some evidence is presented to support the notion that understanding
of composition of functions is key to understanding the chain rule. The type of
instruction was a factor in how a student performed on these tasks. The differences
between the types seem to be related to the difference between using the chain rule
and understanding the chain rule, and an explanation of these differences is offered.
The need for collecting differing types of data is established. Two students scored
high quantitatively but did not perform well in the interview, while another student
failed the quantitative work but demonstrated a high level in the interview. Finally,
the use of writing as a pedagogical tool is recommended based on the results.
Chapter 1: Introduction
This study looks at a particular concept in calculus, the chain rule, in the first
year calculus course. The chain rule is the underlying concept in many applications
of calculus: implicit differentiation, solving related rate problems, and solving differential equations. The rule states that if g is differentiable at c and f is differentiable
at g(c), then the composite function f ◦ g given by f ◦ g(x) = f (g(x)) is differentiable
at c and that (f ◦ g)0 (c) = f 0 (g(c)) · g 0 (c). Conventional wisdom holds that students’
conception of the chain rule (as with other rules) is that of symbol manipulation.
This conception appears to be a straight-forward manipulation of symbols which can
easily be applied in problem situations. However such a situation carries a heavy
requirement for the function to be given by an expression, fostering students’ tendencies toward instrumental understanding. Even in such symbolic problem situations,
many students have difficulty in recognizing the need to apply the chain rule or in
applying it.
In discussing students’ understandings of the elements of the dihedral group,
Zazkis, Dubinsky, and Dautermann (1996) propose the VA model of mathematical
thinking in which visual and analytic modes of thought are mutually dependent in
problem-solving. The concept of derivative offers another example of a situation in
which a student might alternate between the visual (slope of a tangent to the curve)
and analytic (rate of change) interpretations in building her or his understanding
(Asiala, Cottrill, Dubinsky, & Schwingendorf, 1997).
From a visual perspective, the rule might be interpreted as the derivative of the
inner function compensating for, or scaling, the derivative of the outer. Such a conception lends itself to understanding the rule in relation to the applications mentioned
above. This was the understanding of both Newton and Leibniz as they developed
the calculus (Edwards, 1979). This visual perspective is not presented in current
pedagogical practices, perhaps due to the lack of a reasonable procedure for producing the graph of a composition given the graphs of the component functions. This
loss of a visual approach may explain the difficulty many students demonstrate when
working with the chain rule. The fundamental nature of the rule demands that the
beginning student must not only be capable of manipulating the symbols, but must
1
2
have a conceptual understanding of the chain rule. This project is an investigation
to describe how students come to understand the chain rule, and via this description,
propose activities which could bring students to the point of being better able to
build this understanding.
1.1 Purpose
The purpose of this study is to attempt to determine the correlation between
a student’s ability to deal with composition of functions and with using the chain
rule successfully. The data is also used to compare two instructional approaches to
the chain rule: a traditional lecture-recitation method and an innovative “reform”
method. Finally, an attempt is made to describe the understanding of the chain rule
concept on the part of the interviewed students.
It is supposed to answer the questions:
• How does a student come to understand the chain rule?
• In what way does the student’s understanding of (de)composition of functions
factor in the chain rule?
• With reference to the results of the RUMEC study (Clark et al., 1997), is
the chain rule concept described well via a schema, and do the current results
support the reported schema?
This study proposes to test the hypothesis: “The chain rule in and of itself is
not difficult for students to apply; rather it is the issue of composition of functions.”
Also, it will be the second attempt of the first cycle through the research paradigm,
which will result in a revised genetic decomposition.
Thus, the overarching purpose of this study is to test APOS Theory (described in
Chapter 2). The theory suggests that certain kinds of mental constructions should
be evident when students work and reflect on chain rule problems. It also states that
these constructions will be organized in a schema that should have certain characteristics at various levels of development. This study analyzes the data to determine if
the predictions of the theory hold up to scrutiny.
Finally, since APOS Theory is an evolving work, an opportunity to augment or
extend the theory may arise in this work, as it did in the RUMEC study.
1.2
Historical background
In attempting to gain an understanding of how students construct their concept of
the chain rule, it is beneficial to consider its historical development and its proof. The
3
chain rule arose quite naturally in the development of calculus by both Newton and
Leibniz as they worked with infinitesimals. The rule has evolved with the calculus
as Euler brought it to bear on functions and as Cauchy and Weierstrass developed
analysis. The proof has also evolved from a symbolic tautology to an argument based
on the limit of a difference quotient. The historical perspective presented here may
offer an explanation of students’ difficulty with the chain rule.
1.2.1
An overview of the development of calculus
Two types of problems from antiquity served to instigate the development of
calculus. One problem was to rectify a curve, that is, to construct a line with the
same length as that of a given curve. The other was the quadrature of a curve, or
the area beneath it. It was not the case that such problems were unsolved; actually
many specific curves had solutions for both types of problems. The ancient Greeks
used the method of exhaustion and the method of Archimedes (c. 200 b.c.) to find
many volumes and surfaces. The Greeks made use of indivisibles as they dealt with
concepts of infinite processes. What was lacking was a general method for any curve.
(Boyer, 1969)
The work of Oresme, Galilei, and Cavalieri (c. 1635) extended the use of indivisibles as the quadratures of families of curves were obtained. These mathematicians
approached the problems from a geometric point of view. During roughly the same
period, mathematicians such as Descartes, Wallis, and Fermat (c. 1655) were leading
an analytical revolution. The birth of coordinate geometry allowed the creation of
an arithmetic of infinitesimals (indivisibles). Wallis reset the work of Cavalieri in infinitesimal analysis. Fermat extended this and invented the method of differentiating
and integrating monomial expressions which are in use today. (Boyer, 1959)
Fermat did not recognize the inverse relation between these processes, nor did
those who followed his work. It was the genius insight of Newton and Leibniz to
deduce from this work (and the work on infinite sums of Gregory) the Fundamental
Theorem of Calculus. They saw and explicitly stated this inverse relation between
differentiating and integrating. Moreover, they showed that area, volume, and related
problems which had been solved by summation were reduced to anti-differentiation.
The distinction between these insights is indicated by the fact that the Fundamental
Theorem is stated in two parts. (Kline, 1972)
Newton (c. 1668) perceived a curve as the path of a moving point and spoke of
fluents x and y as the horizontal and vertical components of this motion. He defined
the rate of change of a fluent to be a fluxion, with the notation ẋ and ẏ. By interpreting
4
increasingly smaller increments in x and y, he found the derivative to be the ratio
of the fluxions which is also the slope of the tangent to the curve. It is interesting
to note that although Newton developed his calculus by working analytically, his
first published account of it (the Principia Mathematica) is presented in a classical
geometric setting. (Rickey, 1987)
The approach to calculus of Leibniz (c. 1673) was to view these infinitesimals
as mathematical objects, which he called differentials. In working on quadratures,
he noted the summation of the area under a curve is related to its anti-derivative.
Leibniz developed a careful notation for the calculus, much of which persists today.
He also derived many of the rules for computing derivatives and integrals, notably
the chain rule. (Kline, 1972)
Following this seminal work in the calculus, the next major advance was that of
Euler (c. 1748) and his incorporation of the concept of function. This was followed
by a long period in which the methods of the calculus were used to solve problems
and were refined. However, the notion of the infinitesimal had not been satisfactorily
defined by Newton or Leibniz. This lack incited Berkeley and Lagrange to call for a
rigorous formulation of calculus in the late 18th century. The issue was settled by
Cauchy and Weierstrass (c. 1821) with the development of analysis as a mathematical
field. (Edwards, 1979)
1.2.2
The Chain Rule
The current formulation of the chain rule, as stated above, is a result of this
development of real analysis. The concept itself—the derivative of a composition
of functions—was intuitively obvious to both Newton and Leibniz. As C. Edwards
(1979) points out discussing the work of Newton:
The tangent and area problems emphasize the importance of systematic
procedures for differentiation (the calculus of ẏ/ẋ, given f (x, y) = 0) and
anti-differentiation (the converse). Newton exploited the facility for differentiation and anti-differentiation by substitution methods—equivalent
to what we call the chain rule and integration by substitution —that is
essentially “built into” the calculus of fluxions. (p. 196)
Newton did not set the chain rule out as a theorem or procedure. It was a natural
algorithm employed in many of his examples. In a similar manner, the notation of
Leibniz led him to consider the chain rule as intuitive as simplifying a product of
5
fractions. If z = z(u) and u = u(x), then
dz
dz du
=
· .
dx
du dx
Again this process was not even given a name. In Leibniz’ union of geometric and
analytical thinking using infinitesimals, the chain rule is implicitly defined.
The work of Euler and Cauchy precipitated the chain rule in its present form.
By changing the objects of calculus from curves in space to arbitrary functions, the
intuitive nature of the chain rule was lost. It became a theorem to be proved. Cauchy’s
proof of the rule follows:
Now let z be a second function of x, bound to the first y = f (x) by the
formula
z = F (y).
z or F [f (x)] will be that which one calls a function of a function of a
variable x; and, if one designates the infinitely small and simultaneous
increments of x, y, and z by ∆x, ∆y, ∆z, one will find
F (y + ∆y) − F (y)
F (y + ∆y) − F (y) ∆y
∆z
=
=
·
,
∆x
∆x
∆y
∆x
then, on passing to the limits,
z 0 = y 0 F 0 (y) = f 0 (x)F 0 [f (x)].
(Cauchy, cited in Edwards, 1979, p. 313)
Edwards notes that Cauchy overlooks the possibility that ∆y = 0 in the work cited.
In fact, Cauchy does not treat the functional relationship of ∆y and ∆x, and so the
proof is false.
Often in mathematics, the proof of a theorem helps to convey the fundamental
concepts on which it is built. That is, by experiencing the proof, a student will
construct a relational understanding of the theorem. Unfortunately, the method of
proof for the chain rule is generally interpreted as an algebraic trick—multiply by 1
(∆y/∆y). Even though
F (y + ∆y) − F (y) ∆y
·
∆y
∆x
is simply Leibniz’s dz/du · du/dx written in terms of difference quotients, the geometric interpretation is lacking. This may be related to our inability to graphically
6
demonstrate the composition of two functions given by their graphs. The difficulties
that students have with the chain rule may be explained by its history, that is, the
function and composition ideas came later and clouded the intuitive idea of Newton
and Leibniz.
1.3
The RUMEC study
In a previous study (Clark et al., 1997), eight members (including this author) of
an academic community of researchers1 working with a common theoretical framework attempted to explore calculus students’ understanding of the chain rule and its
applications. Specifically, the students were asked, as part of a clinical, task-based
interview situation, to solve the following four problems and to explain what they
were doing. The students were prompted to use the chain rule explicitly in the first
problem if it was not chosen as the method of solution. Prompts were given in the
second problem relating to the chain rule if the student indicated difficulty. The
problems were:
2
1. Let f be the function given by f (x) = (1 − 4x3 ) . Compute f 0 .
2. Let F be the function given by F (x) =
you did.
Z
sin x
2
et dt. Compute F 0 . Explain what
0
3. Let A be a real number. Given that the following relation defines a function,
√
√
x y + y x = A, find its derivative.
4. A ladder A feet long is leaning against a wall, but sliding away from the wall at
the rate of 4 ft/sec. Find a formula for the rate at which the top of the ladder
is moving down the wall.
A preliminary analysis of the responses to these questions indicated that although
almost all of the participants were able to use the chain rule to correctly solve the
first problem, those who had difficulty with the other problems were unable to apply
the chain rule, even when prompted.
Clark et al. (1997)2 attempted to analyze the concept of chain rule based on
an Action-Process-Object-Schema (APOS) theoretical framework which is described
in the next chapter. Using the framework to analyze the mental constructions of
1
The group is known as the Research in Undergraduate Mathematics Education Community or
RUMEC.
2
For the rest of the paper, I will refer to this study as the RUMEC study.
7
the students, the RUMEC study set out to describe each of the 41 participants as
working with an action, process, or object conception of the chain rule (at the time
of the interview). In the negotiation of differences between researchers’ analyses, an
issue arose regarding whether a student’s general power rule (as used in Problem 1,
above) was her or his chain rule. The issue could not be resolved within the given
theoretical framework, and so the RUMEC study failed to meet its objectives.
An analysis of the problem situation concluded that the concept of the chain rule
is best described by a schema. Although the framework treated the mental construction of a schema, it was found to be insufficient to account for the data collected.
Specifically, this insufficiency was that a schema was defined but its development was
not described. The RUMEC study provided the description of schema development
in terms of the chain rule. In this way, it extended the previous APOS framework
(Clark et al., 1997).
The RUMEC study used the interview data to illustrate schema development and
attempted to return to the original objective of the project (describing each of the
41 participants). However, since the data were collected via interviews based on
the previous genetic decomposition, the “right” questions had not been asked. In
particular, no attempt was made to have students discuss the similarity/difference
of Problems 1 and 2. Also, these chain rule problems were at opposite ends of the
spectrum in terms of complexity. The current work is a follow up to the RUMEC
study. Incorporated in its design are a number of examples of problem situations
involving the chain rule, as well as opportunities to explore the students’ perceived
relationships between the situations.
1.4
An overview of this study
This study consists of two phases, one quantitative and one qualitative. Phase 1
involved the collection of data via questionnaires administered to 34 students. Phase 2
used in-depth, task-based interviews with six of the students from Phase 1.
The purpose of Phase 1 is an attempt to determine the correlation between a
student’s ability to deal with composition of functions and with using the chain rule
successfully. Data were collected to establish an individual’s baseline understanding
of function and composition, as well as her or his relative proficiency in mathematics.
Integrated in this phase was a comparison between two methods of teaching calculus
regarding the effectiveness of the instructional treatment of the chain rule.
Students were selected from second quarter calculus by requesting volunteers.
Data were collected on an instrument constructed for this study which assessed the
8
participant’s facility with (a) functions, (b) composition of functions, (c) differentiation, and (d) the chain rule. Each participant was asked to authorize access to her
or his academic record for the researcher to gather data, such as SAT scores and
predicted indices.
The questionnaire items were scored using a 5 point rubric based on general guidelines adapted from Carlson (1998). These codes along with the predictor data were
tested for correlational significance. A second analysis explored any differences between the two groups (traditional versus C4 L). Analysis of variance was used to
determine if there exists a significant difference in sample means of the chain rule
scores.
The second phase of the study involves interviews with participants from the first
phase to attempt to describe how these students construct the concept of the chain
rule. The six participants in the interviews were chosen based on their scores in
Phase 1, covering a range of chain rule scores and a range of overall scores. The
interview follows a guide designed to elicit the student’s understanding of the rule,
based on tasks from the previous instrument. This aspect of the study is based on
the following theoretical perspective and research paradigm.
Chapter 2: Theoretical Perspective and Research Paradigm
As Phase 2 of the study is of a qualitative nature, it is necessary to discuss our
theoretical perspective and research paradigm. This discussion is a brief overview; a
complete description can be found in Asiala et al. (1996).
2.1 Theoretical perspective
The APOS theory is based on the following statement:
An individual’s mathematical knowledge is her or his tendency to respond
to perceived mathematical problem situations by reflecting on problems
and their solutions in a social context and by constructing or reconstructing mathematical actions, processes and objects and organizing these in
schemas to use in dealing with the situations. (Asiala et al., 1996, p. 7)
Three basic types of knowledge—actions, processes, and objects—are observed,
which are organized into structures called schemas. An action is any repeatable physical or mental manipulation of objects to obtain other objects. It is a transformation
that is a reaction to stimuli that the individual perceives as external. In the case of
function, for example, a student indicates an action conception if he or she is limited
to requiring an explicit formula in order to interpret a situation as a function. A
student in such a case would be unable to do much with the function, apart from
evaluating it at a point. This student would have great difficulty dealing with situations involving composition of functions (Breidenbach, Dubinsky, Hawks, & Nichols,
1992).
As a student reflects on an action, it is interiorized and becomes a process. The
student perceives the action as part of her or him and has control over it. For functions, this means the student can think of a function as receiving input, performing
some operation, and returning output. This can be done mentally, without actually
performing the operations on the input (Breidenbach et al., 1992).
As a student realizes that an action can be brought to operate on a process,
the process is encapsulated to become an object. The object conception can be deencapsulated back to the process as needed. In the case of functions, this encapsulation/de-encapsulation arises as one considers manipulations of functions such as
adding, multiplying or forming sets of functions.
9
10
A schema is a coherent collection of actions, processes, objects and other schemas
that is invoked to deal with a new mathematical problem situation. A schema can be
thematized to become another kind of cognitive object to which actions and processes
can be applied. By consciously unpacking a schema it is possible to obtain the original
processes, objects and other schemas from which the schema was constructed. (Note
that this is the description of schema prior to the RUMEC study which extended
it as described in the next section.) A genetic decomposition is an attempt by the
researcher to describe the objects and processes in some set of students’ schemas.
While this description is not unique for any given concept, it is useful in guiding
instructional design and investigations (Asiala et al., 1996).
2.2
Schema development
The triad mechanism, introduced by Piaget and Garcia (1989), distinguishes three
stages in the development of a concept: Intra, Inter and Trans. The Intra stage is
characterized by the focus on a single object in isolation from any other actions,
processes, or objects. With the chain rule, for instance, the student may have a
collection of rules for finding derivatives including some special cases of the chain rule
such as the general power rule and perhaps even the general formula, but might not
recognize the relationships between them.
The Inter stage is characterized by recognizing relationships between different
actions, processes, objects and/or schemas. It is useful to call a collection at the
Inter stage of development a pre-schema. In the case of the chain rule, the student
may begin to collect these special cases and recognize that in some way they are
related.
Finally the Trans stage is characterized by the construction of a coherent structure
underlying some of the relationships discovered in the Inter stage of development. At
the Trans stage of the chain rule, the student would link function composition to
differentiation, and recognize various instantiations of the chain rule as linked in that
they follow from the same general rule through function composition.
It is worth noting that it is only when a schema reaches the Trans stage of development that it can properly be referred to as a schema. The reason is that at the
Trans stage the underlying structure is constructed through reflecting on the relationships between the various objects from the earlier stages. This structure provides
the necessary coherence in order to identify the collection as a schema, that is, it is
a coherent whole. This coherence is deciding what is in the scope of the schema and
what is not.
11
2.3
Paradigm for research
This research in mathematics education is based on Dubinsky’s method (Asiala
et al., 1996) consisting of three components in a cycle: theoretical analysis; design
and implementation of instruction; and data collection and analysis. The theoretical
analysis component influences the instruction component in that the activities and
tasks used in instruction are based on a genetic decomposition. In turn, the implementation of the instruction influences the empirical data component as those who
have been instructed are studied. Data are collected in as many forms as reasonable
in order to gain insight. Tests and homework, direct observation, questionnaires and
clinical interviews are possible sources for data collection. The data analysis leads
to a revision of the current genetic decomposition; thus the empirical data affect the
theoretical analysis.
On the other hand, the theory influences the empirical data since the data collection instruments are constructed based on the genetic decomposition. These instruments are designed to also get at issues involving the “goodness” of the genetic
decomposition. Typically, two genetic decompositions will be presented in a study;
the preliminary one used to design the instruction and the data collection, and the
revised genetic decomposition based on the observations and analysis.
2.4
Initial genetic decomposition
The results of the RUMEC study provide evidence that understanding the chain
rule involves the building of a schema. The schema consists of a function schema
coordinated with a differentiation schema. The student’s function schema would have
at least a process conception of function, function composition and decomposition
(breaking a composite function into two or more component functions). This is linked
to her or his differentiation schema which includes the rules of differentiation at least
at the process level (Clark et al., 1997). Note that a distinction is made between
a differentiation schema and the derivative schema of a student. The former, which
deals with taking derivatives, is a part of the latter which also contains notions of
interpretations and representations of the derivative (Asiala, Cottrill, Dubinsky, &
Schwingendorf, 1997).
The chain rule schema develops through the levels of the triad; Intra, Inter and
Trans. In the first level, the Intra- level, the student has a collection of rules for
finding derivatives in various situations, but has no recognition of the relationships
between them. This collection may include some special cases of the chain rule, and
perhaps even the general formula which is perceived as a separate rule rather than a
12
generalization of the others. The Inter- level, is characterized by the student’s ability
to begin to (mentally) collect all different cases and to recognize that these are related.
At this stage the collection of elements in the chain rule schema is being formed, and
the collection is called a pre-schema. At the Trans- level a student has constructed the
underlying structure of the chain rule. He or she must link function composition and
decomposition to differentiation, and recognize various instantiations of the chain rule
as linked in that they follow from the same general rule through function composition.
The elements in the schema must move from being described essentially by a list to
being described by a single rule (Clark et al., 1997).
Chapter 3: Literature Review
3.1
On functions
Functions are a key component in secondary and undergraduate curricula (see,
for instance, the foreword of Harel & Dubinsky, 1992) and much has been written
on the learning and teaching of the concept. Leinhardt, Zaslavsky and Stein (1990)
present an exhaustive review of the research papers which focus on functions, graphs
and graphing. The studies are organized by three issues: task, student learning,
and teaching techniques. The issue of task is discussed in terms of the action of
the learner, the situation in which the task is set, the variables involved and their
nature, and the focus of attention in the task. The issues of student learning are
presented as intuitions and as misconceptions which the students bring to a task.
Finally, the teaching techniques discussed are entry to the topic, sequencing the tasks,
explanations in the learning, and examples and representations.
Ferrini-Mundy and Graham (1991), Tall (1992), Even (1993), Thompson (1994b),
and Wilson and Krapfl (1994) offer more recent reviews of research on functions.
These note that function is a dichotomous concept—perceived by the student as a
process or as an object (see also Sfard, 1992). Harel and Dubinsky (1992) add that
students start with an action conception of function prior to moving to process and
object. These reviews agree that the concept of function is important to mathematics
and not well understood by our students. These studies and those about which they
report were searched for evidence of explorations of composition of functions and for
discussions of the chain rule. While many of the studies of students’ understanding
of function relate to this study, little was found regarding students’ understanding of
composition of functions explicitly, and no studies were found on their understanding
of the chain rule.
3.2
Students’ understanding of function
Many studies observed students’ understanding of function by presenting questionnaires of various function situations for the student to interpret (Even, 1988,
1993; Vinner & Dreyfus, 1989; Breidenbach et al., 1992; Dubinsky & Harel, 1992;
Schwingendorf, Hawks, & Beineke, 1992; Hitt, 1993; Ferrini-Mundy & Graham, 1994;
13
14
Lauten, Graham, & Ferrini-Mundy, 1994; Wilson, 1994; and Carlson, 1998). Typically these studies ask the student to give her or his definition of function, and then
they present many examples of situations to the student. The situations generally
range over graphs, tables, expressions, equations, sets of ordered pairs, parametric
equations, and finite sequences. In some cases, only verbal descriptions of the situation are provided. The student is then to decide in each case if a function is present
in the situation. Often, the student is asked to relate the answer back to the definition of function previously given, either at the time of the questionnaire or in a later
interview. These studies conclude that the notions students hold about function are
often different from what they give as a definition. Also demonstrated are the specific
types of functions which present difficulty to students.
Breidenbach et al. (1992), Dubinsky and Harel (1992), and Schwingendorf et
al. (1992) suggest that these difficulties indicate the lack of a process conception
of function. They offer evidence of an instructional approach which seems to help the
students construct this process. Even (1988, 1993), Hitt (1993), and Wilson (1994)
focus on the concept of function found in mathematics teachers. They offer rather
disturbing results as the teachers exhibited very limited conceptions of functions (see
also Vinner & Dreyfus, 1989). Ferrini-Mundy and Graham (1994) and Lauten et
al. (1994) provide descriptions of the students’ function concept and suggest methods
of employing graphing calculators as aids in learning. Carlson (1998) compares three
groups of students at different stages of their collegiate careers. She finds a very
narrow view of function is held by high-achieving second semester calculus students.
She also concludes that the concept of function develops slowly, and this development
is facilitated by reflection and constructive activities.
Other researchers investigated the concept from a problem-solving and graphical point of view. They describe methods of overcoming difficulties with functions
by using computer software and problem-solving techniques. These reports generally offer anecdotal evidence as support for their views. Confrey (Confrey & Smith,
1994; Confrey, Smith, Piliero, & Rizzuti, 1991) offers a computer environment called
Function Probe for student use in problem-solving situations. In a similar manner,
Schwarz and Bruckheimer (1990) use an environment Triple Representation Model,
Cuoco (1994) proposes the use of Logo and ISETL, and Goldenberg (1988) employs
the Function Analyzer. These environments allow the student to have control over a
function by switching between representations, exploring covariation, and changing
individual parameters.
15
3.3
Composition of functions
One of the very few papers that discuss the composition of functions is a teaching
methods paper (one which does not present any supporting observations of students)
in which Alson (1992) discussed a qualitative approach to graphing functions. He
defines two operators on the graphs which are essentially composition operators, one
for the domain and one for the range of the graph. His treatment offers insight into
the graphical interpretation of the composition of elementary functions. However, as
an instructional approach, the students must start with a rich concept of function.
Other methods may allow the student to develop this richness while constructing the
idea of composition.
Ayers, Davis, Dubinsky, and Lewin (1988) present evidence that computer experiences induce reflective abstraction as students learn about functions and composition.
Students worked in the microworld of the Unix operating system, using shell scripts
and pipes to represent functions. The authors present a genetic decomposition of
composition and probe the process conception of composition in their investigation.
They indicate that in order for a student to succeed in the task of de-composing a
function—given descriptions of two functions, H, G, find F such that H = F ◦ G—he
or she must have a process conception of function and of composition.
Dubinsky (1991) further discussed the issue of composition and reflective abstraction. Two aspects of reflective abstraction, coordination and reversal, are illustrated
by excerpts of interviews on composition. According to Dubinsky, an understanding
of functions as objects and as processes is necessary for understanding composition
(beyond evaluating each function at specific points with a formula). However, in
her work investigating students’ understanding of inverse functions, Vidaković (1996)
found students whose concept of composition of functions was based on a process
conception of function but not necessarily on an object conception.
3.4
Difficulties with differentiation
While the search for investigations on learning the chain rule gave no results, two
teaching methods papers on the chain rule are available. Mathews (1989) discusses
the use of a computer algebra system, muMATH, to verify the chain rule. Using
the computer to produce the symbols is expected to help the students understand
the truth value of the rule. Thoo (1995) proposes using arrow (or tree) diagrams as
a mnemonic device for the chain rule. He finds the diagrams especially useful with
functions of several variables. Thoo also believes that these will help internalize the
composition of functions.
16
Studies of students’ difficulties with calculus topics (Orton, 1983; Frid, 1992;
White & Mitchelmore, 1992; Thompson, 1994a; Thomas, 1995; Cottrill et al., 1996;
Vidaković, 1996, 1997; Asiala et al., 1997) offer insights to misconceptions and obstacles to understanding of calculus concepts. Specific concepts such as graphical
interpretation of the derivative, limit, rate of change, differentiation, inverse functions and the Fundamental Theorem of Calculus (FTC) have been explored. These
reports offer descriptions of the students’ possible mental constructions and pedagogical suggestions. Thomas (1995) also used Problem 2 of the RUMEC study
(Section 1.3) in her exploration of students’ understanding of the FTC. She found
that those students who were unsuccessful with the problem were unable to deal with
the area-accumulating function (a function defined through a definite integral with a
variable endpoint). This result is consistent with those of the RUMEC study on the
problem. Selden, Selden and Mason (1989, 1994) approached student understanding
with a problem-solving framework. They found students fail to use calculus strategies
when dealing with nonroutine problems.
Chapter 4: Phase 1
The purpose of this portion of the study is to attempt to determine the correlation
between a student’s understanding of composition of functions and understanding the
chain rule. The data will also be used to compare two instructional approaches to
the chain rule: a traditional lecture-recitation method and an innovative “reform”
method. The comparison is an attempt to establish whether there is a difference in
performance which might be attributed to a more conceptual approach to the topic
(the reform course) versus an instrumental approach. This study proposes to test the
hypothesis: The chain rule in and of itself is not difficult for students to apply; rather
it is the issue of composition of functions.
4.1 Methods: Subjects
The subjects for this study were students at a large, southeast, urban university
who had completed at least two quarters of calculus. A total of 34 students volunteered to participate in the study. They were solicited from second and third quarter
calculus classes near the end of the term. Some of the students had been through a
calculus course using cooperative learning, computers and alternatives to lecturing.
The reformed calculus course is known as C4 L (for Calculus, Concepts & Computers
and cooperative learning)1 . The others had completed a traditional, lecture-based
course. Seven students took the C4 L course both quarters, three took C4 L in the first
quarter only (then took the traditional course for the second quarter), seven took
a traditional first quarter then C4 L for the second quarter, and 17 students took a
traditionally taught course for both quarters. This data is recorded with the variable
C4L, defined in Appendix B.
All of the students were majoring in Arts & Sciences or Education programs; five
were graduate students in Education taking undergraduate mathematics courses for
certification. The students volunteered their time (up to 2 hours) to complete the
questionnaire. In order to make some sort of comparison between the volunteers, an
additional data point was sought. The University computes a predicted grade point
index (PGPA) at the time of admission. The PGPA was collected to compare the
academic strength of each student prior to her or his calculus instruction.
1
See Schwingendorf, Mathews, & Dubinsky (1996) for a detailed description of the project.
17
18
An alpha-numeric code assigned to each subject was used to identify all of the
written work collected as data. This code was also used on all audio and video tapes
made in the second phase of the study. This code is used in this report to present
data without compromising confidentiality.
4.2
Instrument
Data were collected on an instrument constructed for this study which tries to
assess the participant’s facility with (a) functions, (b) composition of functions, (c)
differentiation, and (d) the chain rule. The items on the written instrument (questionnaire) are given in the following section along with the coding scheme for each
item. The questionnaire is included as Appendix A.
The questionnaire was piloted with 4 students of known ability (having recently
completed the researcher’s final exam in second quarter calculus) to check for errors
and reliability. As there was no need for major revisions to the questionnaire, the
responses of the pilot study are included in this study.
The data was collected in a piece-wise fashion with individuals and small groups
over the course of a summer session. In each instance, the informed consent form was
read and explained. After the consent forms were signed, the following instructions
were given:
• These items are designed to explore your range of understanding on a number
of topics. Some will be very easy to do, some will be harder. Please do the best
you can.
• Please answer each question as completely as you can. Please do not leave any
answer blank. If you would like to write additional notes with an answer, feel
free to use the back of the page.
• Please do not write your name on any page of this questionnaire. The code
number at the top of this page will serve to keep your identity confidential.
The questionnaires were completed under ordinary test conditions with little interaction with the researcher.
4.3
Analysis
The 20 problems were coded (scored) using a 5 point rubric based on the following
general guidelines adapted from Carlson (1998):
5
Complete response to all aspects of the problem. Indicates complete
mathematical understanding of the problem’s concept. Includes only
minor computational errors, if any.
19
4
3
2
1
0
Responses falling between 5 and 3.
Demonstrates understanding of main idea of the problem. Not totally complete in response to all aspects. Shows some deficiencies in
understanding aspects of the problem. Incomplete reasoning.
Responses falling between 3 and 1.
Attempts, but fails to answer or complete problem. Very limited
or no understanding of problem. Contains words, examples, or diagrams that do not reflect the problem.
No answer. Written information made no attempt to respond to the
problem. Written information was insufficient to allow judgment.
These guidelines were used to construct specific rubrics for each item—a first estimation of each rubric appeared in the proposal for study. Each item was analyzed
using a constant-comparison method (Krathwohl, 1993). In this method, responses
were sorted based on the type of response found in the written work. The resulting
categories were then compared to the general rubric and the initial estimation, and
were given an initial code. More than one category resulting from the sorting operation would receive the same code occasionally, if the two types of responses both
indicated the same level of understanding. This scheme was repeated as new sets of
data became available. The specific responses were then used as descriptors in the
rubric for the item.
Finally, the entire set of data was re-analyzed some weeks after the last questionnaire was completed. This last iteration checked the reliability of the generated
rubric. The refined codes were used in the statistical analysis reported in the next
chapter.
In the following discussion, each item from the questionnaire is given. When
applicable, notes on the item follow, stating such things as where it was originally
published, comments on why the item was chosen, and/or issues that arose in the
coding. The rubric for coding that item is described by giving a description of the
typical responses for each level that was observed in the data. If a code was not given
to any response for the item, it is not listed in the rubric. Likewise, no rubric has a
description for the code 0 since that description did not vary between items.
4.3.1
Functions
Item 1
Express the diameter of a circle as a function of its area and sketch its graph.
20
(Carlson, 1998) This item involved covariation in functions. Many students did
not give a graph as part of their answer. Perhaps this was due to misreading the
problem, rather than a misunderstanding.
5
4
3
2
1
The correct formula was computed and a graph was sketched which
was reasonably similar to that of the function.
The graph was missing; or a circle was sketched in addition to the
correct formula.
Used correct area formula in terms of d, but does not solve for d;
used an incorrect formula (such as circumference) and solves for d;
or, indicated thinking about a function by use of function notation
(with otherwise incorrect answer).
The graph was missing; or a circle was sketched in addition to a
response which was otherwise a 3.
Used some circle formulae without function notation.
Item 2
A student has marked the following as a non-function. State whether this student is
correct and why.
y
x
(Even, 1988, 1993) Items 2 and 3 raised the issue of univalence in functions.
5
4
3
2
1
Answered “No”. Used a definition of function in the explanation.
Answered “No”. Gave a statement about the vertical line test (without elaboration).
Answered “No”. Restated the situation; or stated that discontinuous
functions are allowed.
Answered “Yes”. Indicated a misunderstanding of the open circle on
the graph; or there was an indication of using an erroneous definition.
Answered “Yes”. Any other non-empty response.
21
Item 3
A student has marked the following as a non-function. State whether this student is
correct and why.
A correspondence that associates 1 with each positive number, −1 with
each negative number, and 3 with zero.
(Even, 1988, 1993) It was interesting to note that 8 out of the 34 students literally
read the sentence from left to right, that is, they saw the domain as {−1, 1, 3}.
5
4
3
1
Answered “No”. Used a definition of function in the explanation.
Answered “No”. Drew graph and gave a statement about the vertical
line test (without elaboration).
Answered “No”. Restated the situation; or stated that discontinuous
functions are allowed.
Answered “Yes” due to interpreting the statement literally left to
right.
Answered “Yes”. Any other non-empty response.
Item 4
Decide if it is possible to use one or more functions to describe this situation. If yes,
then describe your function(s) briefly. If no, then explain.


x = t3 + t
t is a real number.
y = 1 − 3t + 2t4 
(Breidenbach et al., 1992) Item 4 was intended to demonstrate whether the student
perceived a process in the situation.
5
3
2
1
Fairly clear description of either a function R → R × R or functions
x(t), y(t).
Indicated desire to solve for t.
Work for linear equation; substitutes expression for x into y; or
answered “Yes” with no reason given or with disconnected reason.
Answered “No” with reason stated.
Item 5
Tim and Donna live 1 km from their school. Usually, they walk to school together.
Yesterday, they both left their houses 10 minutes before school started. Tim started
to walk, but Donna was afraid to be late and started to run. After a while, Tim
22
time
time
distance
time
distance
distance
time
distance
distance
distance
realized that although he tried to walk faster and faster, he had to run if he did not
want to be late, and started to run. At about the same time, Donna became tired
and had to walk instead of run. They both reached school exactly on time. Which
of the following graphs is Tim’s and which one is Donna’s?
time
time
(Even, 1988, 1993) This item involves covariation in functions.
5
2
1
4.3.2
Correct choices.
Correct choice for Tim but chose parabola for Donna
Indicated need for an expression; or chose two parabolas.
Composition
Item 6
t
Given two functions v, w such that v(t) = 5t − 6 and w(t) = + 4, find (w ◦ v)(9).
3
Explain.
Item 6 was a composition problem to determine understanding of “◦” notation.
5
4
3
2
1
Answered 17.
Found (w ◦ v)(t) correctly, but did not evaluate.
Reversed composition; or incorrectly evaluated expression.
Subtracted functions then evaluated at 9.
Evaluated w(9), v(9) and stopped; multiplied functions; found w(9) ·
v(9); solved system of equations.
Item 7
√
Given that (f ◦ g)(x) = 5 2x + 3.
23
a. Find f and g that satisfy this condition.
b. Are there more than one answer to part (a)? Explain.
(Even, 1988, 1993) This item required decomposition of a composite function.
5
4
3
2
1
Two correctly labeled pairs.
√
Misunderstood 5 notation, but indicated understanding of composition.
“No” for part (b); reversed labels (process); or incorrect answer for
(a) and “Yes” for (b).
correct expressions unlabeled; or wrote identity as g = 1.
Left (a) blank; took a derivative; f = 0, g = 0; or g = 1/x.
Item 8
Find k so that g(x + 1) = g(x) + k, given that g(x) = 3x + 5. Explain.
(Carlson, 1998) This item invoked the student’s composition concept and compared it with a functional shift up.
5
4
3
2
1
Correctly finds k = 3 and explains answer in terms of slope.
Pointwise evaluation to find constant without explanation.
No explanation or work shown.
Error working with g(x + 1); used expression g(x + 1) + k; or divided
to solve for k.
Unable to evaluate g(x + 1); or did not evaluate g(x + 1).
In each of the following two questions, f, g, h are functions whose domains and ranges
are the set of all real numbers, and such that h = f ◦ g.
Item 9
If only the information in the following table were known, would it be possible to find
f (2)? If so, find it and if not explain why not.
x h(x) g(x)
−1
1
−3
4
π
1
π
0
2
(Breidenbach et al., 1992) The last two items of this section required students to
compose and decompose functions given without expressions.
5
4
Correct answer (f (2) = h(π) = 0) with work.
Correct answer with inconsistent or no work.
24
3
2
1
Indicated understanding of composition and table but no answer; or
wanted line for x = 2 in table.
Asked for function/equation/expression.
Interpreted as multiplication.
Item 10
If only the information in the following table were known, would it be possible to find
g(4)? If so, find it and if not explain why not.
x h(x) f (x)
−1
1
−2
2
3
1
4
−2
π
(Breidenbach et al., 1992) This problem cannot be solved as stated without assuming that f is one-to-one, since the domain of f is given as the set of real numbers,
rather than just the set of x-values in the table. However, this issue did not confound
the subjects in Breidenbach et al. (1992) as the students easily made what was to
them a natural assumption. Likewise, there were no indications that any of these
subjects saw the need for the one-to-one property.
5
4
3
2
1
4.3.3
“Correct” answer (g(4) = f −1 (−2) = −1) with work.
Work showed strong attempt but cannot see decomposition.
Indicated understanding of composition and table but no answer; or
wanted line for x = −2 in table.
Asked for function/equation/expression.
Interpreted as multiplication.
Differentiation
Compute the derivative of each of the following functions. Show all of your work.
Item 11
f (x) = 11x5 − 6x3 + 8
Item 12
3
g(x) = 2
x
Item 13
h(x) = x2 − 3 5x − x3
Item 14
y = 3ex − 4 tan(x)
Item 15
y = x2 sin(x)
25
5
4
3
2
1
4.3.4
Correct answer.
Minor error such as dropping (−) sign or arithmetic errors; error
with derivative of trig functions.
Errors involving elementary differentiation rules.
Did not use appropriate rule; or compound errors with rules.
Integrated; or guessed.
Chain rule
Compute the derivative of each of the following functions. Show all of your work.
Item 16
2
F (x) = 1 − 4x3
Item 17
4
4
G(x) = 2 5x2 + 1 − 4x 5x2 + 1
Item 18
H(x) = sin 5x4
Item 19
y = cos3 (t)
Item 20
2
y = e−t
5
4
3
2
1
4.4
Correct answer.
Minor error such as dropping (−) sign or arithmetic errors; or applied
chain rule but error with derivative rule.
Computed f 0 (g 0 (x)) or f 0 (g(x))
Applied chain rule indiscriminately; or attempted to avoid chain rule
by expanding/rearranging terms (Item 19).
No evidence of considering chain rule.
Data
The data for Phase 1 are presented in tabular format in Appendix B. The codes
are presented by item, by subject as ordered tuples, and by subject as combined
scores.
Chapter 5: Results: Phase 1
These data were analyzed in two ways. First, as one sample from a single population, a regression model was used to determine which rating (after PGPA) has the
highest correlation with the chain rule rating. It was expected that the composition
rating would have the most significant coefficient in the model after PGPA. This was
not the case however, as is detailed below.
In the second analysis, the data were split by course (C4 L vs. Traditional) and
compared in each category. The comparison between treatments was an ex post facto
study and, since the members of each sample were self-selected as volunteers for the
study, an attempt was made to confirm the similarity of the groups using PGPAs
(Schwingendorf, McCabe, & Kuhn, 1998).
5.1 Analysis 1
The predicted grade point average (PGPA) is computed as an attempt to state the
readiness for a student to attend the university. It was used here to give a measure
of the student’s ability prior to her or his instruction in calculus. The PGPA is
not computed for transfer students or for graduate students. In the case of transfer
students, the cumulative grade point average from the previous institution is used.
There was no PGPA score reported for the five graduate students in the study.
The codes for the five items in each category were summed to create a rating
in that category. Prior to summing, the subsets of codes were inspected as ordered
tuples (Table B.2) in order to decide if such a method was reasonable. This question
arose in observing that ratings in the center of the range for a category might come
about differently and thus tell different stories. For example, a rating of 15 could
come from [3, 3, 3, 3, 3] or from [5, 5, 5, 0, 0]. Should both ratings of 15 be treated
the same way? By returning to the written responses to compare such cases, it was
deemed reasonable to sum the codes. In the example stated, the rating of 15 gave the
same “global” information for the ability of each of the two students in that category.
Also at issue was how to deal with questionnaires with 20–25% of the answers
blank. This latter issue was more easily decided since it involved only three questionnaires. These were excluded from the analysis as being incomplete. Note that the
26
27
directions for the questionnaire specifically stated to leave no item blank. The IDs
for the deleted data sets are marked with an asterisk (*) in Table B.2 and Table B.3.
The descriptive statistics for the five variables used in the regression model are
given in Table 5.1 and the correlation data are in Table 5.2. The means indicate two
pairs of ratings. Function and Composition each have a representative code of a little
more than 3, which would be interpreted as demonstrating understanding of the main
idea of the problem, but showing some deficiencies in understanding aspects of the
situation. The representative rating for Differentiation and Chain Rule is slightly over
4, showing more complete ability in the situation. This difference appears to coincide
with the types of questions in the sections of the questionnaire. Items 1–10 varied
widely in difficulty and in coverage of topics within the headings of function and
composition of function. This variety is a by-product of the existing body of research
in these areas and correct responses might indicate relational understandings of the
situations. The remaining items are of a more instrumental nature, as indicated by
the directions “Compute the derivative of each of the following functions.”
In order to include PGPA in the regression model, only 27 observations are able
to be included. That model is presented in Table 5.3. We note that only Composition
and Differentiation have coefficients significantly different from zero. This result was
predicted by the theory—the genetic decomposition for the chain rule is composed
of the schemas for composition and differentiation. Table 5.4 gives the model using
31 observations without the PGPA score. The difference between models is slight,
although without the PGPA, the Composition rating is no longer significantly nonzero. Both models account for the variance with very low P-values.
The unexpected result is that the coefficient for Differentiation is larger than that
of Composition. It was hypothesized that the Composition rating would predominate.
Again, this could be explained by the different types of items in the two halves of the
questionnaire.
5.2
Analysis 2
The C4L variable in Tables B.2 and B.3 is a binary code for the first two quarters
of calculus. An “11” represents the student took the C4 L version of calculus both
terms. A “10” indicates taking the C4 L version first quarter and a traditional course
the second quarter, while a “01” indicates the opposite situation. A “00” indicates
being in a traditional course both terms. The means and standard deviations for
each variable in the regression model is given for each of the four values of the C4L
variable in Table 5.5.
28
Table 5.1
Descriptive statistics for variables
Mean
Median
Mode
Standard Deviation
PGPA
Function
2.73703
2.8
3.1
0.62212
14.6129
14
14
5.13599
Composition
15.9677
15
24
5.7531
Differentiation
22.6774
24
24
3.07014
Chain
Rule
21.0967
21
24
3.36010
Table 5.2
Correlation statistics for variables
PGPA
Function
Composition
Differentiation
Chain Rule
PGPA
Function
1
0.0089
0.20391
-0.022
-0.1579
1
0.33574
0.10597
0.13552
Composition
Differentiation
1
0.26548
0.40021
1
0.671989
Chain Rule
1
The data were compared by grouping those students with C4L values greater than
zero, that is, those who had at least one non-traditional course. This was done to
compare groups of almost equal size. Also, this grouping has historical precedence.
In previous studies using two groups, the influence of a non-traditional approach was
seen to show up when investigating topics taught traditionally (see Asiala et al., 1997;
Clark et al., 1997). This comparison is presented in Table 5.6.
Finally, a comparison was made based on the method of their first quarter course,
when the chain rule was introduced. These data are given in Table 5.7. There was
a significant difference for the Differentiation and Chain Rule ratings in favor of the
group who had a Traditional first quarter of calculus. However, these differences on
average are three points out of 25—which can be interpreted as making minor errors
on three problems. Since those items were of a more instrumental nature, this result
merely indicates that those students are less likely to make computational errors in
the test setting.
5.3
An alternate analysis
In order to address the difference found in the types of items on the questionnaire,
two new variables were computed. The Function rating was changed to include only
29
Table 5.3
Regression model with PGPA
Regression statistics
0.7978
Multiple R
0.6365
R2
0.5704
Adjusted R2
2.1882
Standard Error
27
Observations
Analysis of Variance
4
22
26
Sum of
Squares
184.502
105.3491
289.851
Mean
Square
46.125
4.7886
Coefficients
6.5755
-1.133
0.0154
0.1967
0.6163
Standard
Error
3.7905
0.7109
0.0965
0.0927
0.1426
t
Statistic
1.734
-1.594
0.1604
2.1217
4.320
df
Regression
Residual
Total
Intercept
PGPA
Function
Composition
Differentiation
F
Significance F
9.6323
0.00011
P-value
0.0946
0.122
0.8737
0.0435
0.000201
Lower
95%
-1.285
-2.608
-0.184
0.00444
0.3205
Upper
95%
14.43
0.3408
0.2158
0.3890
0.9121
Items 1 and 2, and a new Composition rating included Items 6–8. Those items were
selected as being the most like the last ten items. That is, they could be considered to
be straight-forward, computational situations dealing with functions and composition
of functions. We note that the original coding of these items was based on assessing
the depth of understanding in the situation, rather than on an instrumental, “right
or wrong” scheme. The items were not re-coded for this analysis.
The use of the new variables did not change the regression model or the results
between groups. This seemed to indicate that there would be no change in results if
one were to construct a new questionnaire using items of the same type for all four
categories to collect new data.
30
Table 5.4
Regression model without PGPA
Regression statistics
0.7103
Multiple R
0.5045
R2
0.4495
Adjusted R2
2.4929
Standard Error
31
Observations
Analysis of Variance
3
27
30
Sum of
Squares
170.910
167.799
338.709
Mean
Square
56.970
6.2147
Coefficients
3.8272
-0.006
0.1413
0.6663
Standard
Error
3.52159
0.09409
0.08663
0.15379
t
Statistic
1.08679
-0.0712
1.63150
4.33257
df
Regression
Residual
Total
Intercept
Function
Composition
Differentiation
F
Significance F
9.16684
0.00023
P-value
0.28578
0.94363
0.11323
0.000152
Lower
95%
-3.3984
-0.1997
-0.0364
0.3507
Table 5.5
Comparison of courses
PGPA
Function
Composition
Differentiation
Chain Rule
Overall
mean
SD
mean
SD
mean
SD
mean
SD
mean
SD
mean
SD
00
2.62
0.70
15.31
6.13
15.31
5.16
23.63
1.26
22.00
2.48
76.25
10.80
01
2.91
0.35
13.86
3.98
19.43
7.35
23.00
2.83
21.71
2.69
78.00
12.74
10
2.67
0.40
16.33
2.08
13.00
3.46
21.67
4.93
20.00
2.65
72.20
6.24
11
2.82
0.88
12.40
4.45
15.00
5.57
19.80
5.07
18.00
5.52
65.20
15.80
Upper
95%
11.0529
0.18636
0.3191
0.9818
31
Table 5.6
Comparison of Traditional and any C4 L
PGPA
Function
Composition
Differentiation
Chain Rule
Overall
mean
SD
mean
SD
mean
SD
mean
SD
mean
SD
mean
SD
Trad (n = 16)
2.62
0.70
15.31
6.13
15.31
5.16
23.63
1.26
22.00
2.48
76.25
10.80
C4 L (n = 15)
2.83
0.56
13.87
3.89
16.67
6.43
21.67
4.05
20.13
3.96
72.33
13.46
t
-0.90
P -value
0.19
0.78
0.22
-0.65
0.26
1.84
0.04 *
1.58
0.06
0.90
1.70
t
-0.14
P -value
0.45
0.47
0.32
0.98
0.17
2.53
0.01 *
2.48
0.01 *
1.98
0.03 *
* significant difference between groups
Table 5.7
Comparison of first quarter course
PGPA
Function
Composition
Differentiation
Chain Rule
Overall
mean
SD
mean
SD
mean
SD
mean
SD
mean
SD
mean
SD
Trad (n = 23)
2.73
0.60
14.87
5.51
16.57
6.05
23.43
1.83
21.91
2.48
76.78
11.16
C4 L (n = 8)
2.76
0.71
13.88
4.09
14.25
4.71
20.50
4.75
18.75
4.53
67.38
12.76
* significant difference between groups
Chapter 6: Phase 2
6.1
Introduction
The purpose of this portion of the study is to attempt to describe the understanding of the chain rule concept on the part of the interviewed students. Also, it will
be the second attempt of the first cycle through the research paradigm, which may
result in a revised genetic decomposition.
The discussion and conclusions based on the Phase 1 results will be treated fully
in combination with the results from this interview data. The questions to be investigated here are:
• In spite of the lack of support for the hypothesis in the quantitative data,
is there evidence that students can be successful with the chain rule without
understanding composition of functions, and to what extent can this success
move toward understanding?
• With reference to the results of the RUMEC study, is the chain rule concept
well described via a schema, and do the current results support the reported
schema?
6.2
Subjects
Students who were interviewed were drawn from the 34 students who had participated in Phase 1. Each student was placed in a 3 × 3 matrix which was formed by
partitioning the ratings for the Chain Rule and the Overall ratings into three subintervals (high, middle, low). An attempt was made to interview one subject from each
of the nine subsets. However, only six interviews were conducted. Volunteers could
not be found to represent the cells left empty in Table 6.1. Of the two or three subjects in the three categories, either the student had requested to not be interviewed
on the informed consent form in Phase 1, or he or she declined the invitation during
this phase. Table 6.1 indicates the code number for the student interviewed as an
entry in the matrix.
32
33
Table 6.1
Matrix of interviews
Overall
Chain Rule
High (25–23)
High
Middle
Low
(96–81)
(78–66)
(63–45)
A113
Middle (22–20)
Low (19–1)
6.2.1
A134
A132
A125
A137
A123
Procedures
The interviews were audio and video taped with the camera positioned to record
the subject’s actions and written work. The audio tapes were transcribed to written
form and then coded for key events. These were then analyzed according to the
research paradigm described above. The video tapes were logged to ease reference to
key events found in the audio transcripts.
6.3
Instrument
The interview followed a guide designed to elicit the student’s understanding of
the rule, based on tasks from the quantitative instrument. New tasks were presented
to the student to probe the limits of understanding. The students were asked to reflect
on the questionnaire tasks; by asking them for definitions of function and derivative,
domain-range issues in composition, the statement of the chain rule, and a discussion
of any relationships between the 20 problems that the student perceived. The guide
was developed following the initial analysis of the quantitative data.
Then the interview addressed Leibniz’ rule as a way of gauging the robustness
of the student’s concept. A situation with no simple
anti-derivative was presented;
Z
sin x
2
et dt.” If the need to compute
“compute F 0 if F is the function given by F (x) =
0
an anti-derivative was too great, a situation where the integrand has an elementary
anti-derivative was presented to determine if the student can derive Leibniz’ rule from
the result. The text of the interview guide can be found in Appendix C.
6.4
Analysis
The interviews were transcribed from the audio tape recordings by the author and
an assistant. These transcripts were then proofread by the author, who had conducted
the interviews. Each transcript was then coded for key events in the interview and
those events were searched for common issues relating to the topic. Not surprisingly,
34
the events followed the interview questions rather closely, and so most of the issues
found were predicted by the interview guide.
Two new issues arose in the data. We found data that better assessed the interviewee’s understanding of the chain rule than was available in the questionnaire data.
Secondly, by interviewing two students whose data were excluded from the analysis
in Phase 1, we found limitations in employing only one means of data collection in a
study of mathematical knowledge and understanding.
6.5
Where the RUMEC study left us
Clark et al. (1997) introduced the notion of using the triad mechanism (Piaget
& Garcia, 1989) to describe the development of a schema in the RUMEC study
(see Section 2.2, page 10). This mechanism focuses on the relationships between
problem situations and/or between mental constructs that a student brings to bear
on a problem. The relationships may or may not be explicitly perceived by the student
as they are used. The authors found the triad useful, having exhausted themselves in
the attempt to describe the schema in terms of actions, processes and objects alone.
The RUMEC study was able to map the triad levels to the observed data in the
study and gave a genetic decomposition for the chain rule concept based on the triad
(see Section 2.4, page 11). However, it was unable to map the overall response of each
interviewed subject to the triad. The data collected did not indicate clearly if and
how each subject related various problem situations together. Simply put, the data
were collected prior to the researchers’ understanding of the complexity of schema
development. This present study was designed to fill that gap.
The following excerpts from Clark et al. (1997) describe how they interpreted
student responses with respect to the triad in slightly more specific terms than found
in the genetic decomposition.
Students who are at the Intra stage of schema development with respect
to the chain rule see the various rules for differentiation as unrelated. The
students are able to solve some of the problems by simply applying rules
which have been memorized and in some cases, not remembered correctly.
(p. 354)
Many of the students who participated in our study showed evidence of
being at the Inter level of development of the chain rule. They had collected some or all of the various derivative rules in a group and perhaps
35
could provide the general statement of the chain rule, but had not yet
constructed the underlying structure of the relationships. (p. 356)
Once a student’s collection of derivative rules and understanding of function composition attains a coherence as a schema the student has moved
to the Trans stage of development. That is, he or she is capable of operating on the mental constructions which make up her or his collection.
He or she is now able to reflect on the explicit structure of the chain rule
which these constructions were implicitly containing. (p. 358)
The following chapter reports the analysis of the six interviews based on these descriptions and on the initial genetic decomposition. The triad mechanism is seen in
the observations and the data for each subject is described by a level in the triad.
Chapter 7: Results: Phase 2
7.1
The triad description revisited
Three questions from the interview were written to evoke the students’ relationships between situations involving the chain rule. In Question 6, the student is to
sort the differentiation problems from the previous phase. Later, he or she is asked
to give her or his definition of the chain rule, and in Question 11, relate it back to
the list of problems just sorted. Finally, in Question 15 four problem situations are
presented to find if the student sees the involvement of the chain rule.
While these questions form the basis for the following discussion, the rest of the
last half of the interview is also taken into account when applicable. Certainly, the
success or lack of success on the Leibniz rule question indicates how robust a student’s
schema has become.
In the following discussion pseudonyms are used for ease of readability rather
than the alpha-numeric codes and to protect the identities of those interviewed. The
interviewer is indicated throughout by “I”. The code will be given—in the section
headings with the pseudonym—for cross reference with the data tables.
7.1.1
Eli [A125]
Eli may have been working at the Intra- level. In the grouping questions and in
his definition of the chain rule, he was working on the surface of the situation only.
Eli: Can I have like one question that is in two groups?
I: If you want.
Eli: Then probably I would, first I would group 4, 5, 8, 9 to one since they are
all trigonometric.
I: Uh-huh.
Eli: Then probably group #4 and #10 together because they have e in it, and
#2 just one group cause it has fraction, then #3, 6, & 7 that’s one group,
cause you have powers. . .
I: Uh-huh. What about the powers in 1?
Eli: Powers in 1? [Mumbling]
I: You said 3, 6, &7 grouped together because they have powers.
Eli: Yeah.
I: Right exponents. Do you have exponents in #1?
36
37
Eli: Oh, yeah.
I: You have 11x to the fifth. So would that go in there or were you thinking
that or was there a reason that doesn’t go in there?
Eli: Um, it’s because 3, 6, &7 all have [pause]
I: Expressions? Which things were you. . . ?
Eli: Well, I think [mumbling] they have [mumbling]
I: So you were pointing at #6 when you said multiply it out. . . what would you
be multiplying out?
Eli: Well, like 1 − 4x3 times 1 − 4x3 I think [mumbling]
I: Are you thinking of FOIL?
Eli: Yeah. [Mumbling]
I: OK.
Eli: Actually I might need 3 [mumbling] involved with the FOILs. So I would
probably put 1, 3, and 6 together.
I: Uh-huh
Eli: [mumbling]
I: OK, you asked me and I said sure you could. You have some of your groups
overlapped a little bit.
Eli: Uh-huh
I: What if I changed my mind and said, Um, no they can’t overlap. They have
to be disjoint. Can you still group them?
Eli: Um. . . yeah.
I: How would you do that?
Eli: I would put. . . I’d probably put #4 and #10 as one group and then 5, 8, and
9 as the other group. So a trigonometric group and the e group.
I: OK. What about the leftovers then? 1, 2, 3, 6, and 7? Would they be in a
group together or would they each be an individual group? How would they
be treated?
Eli: 1 and 3 as one group and 2 as another group. 6, 7 as another . . .
Eli gave a description of the chain rule but seemed unable to formalize it. This
was consistent with the trouble he had dealing with functions on the questionnaire,
where his combined code was 9/25 with 3 blank responses.
I: OK. Alright. Um the point of the questionnaire was to get you to think about
and use the chain rule in taking derivatives. OK. Uh, do you remember the
chain rule?
Eli: Uh, derivative of inside times derivative of outside. Not really.
I: It had to do with insides and outsides.
Eli: No, not really.
38
I: So for instance, #8 is a problem that involves the chain rule. You’d just take
the derivative. Here’s what you wrote down for, well it’s 18 on this paper.
The original function was sin(5x4 ). You wrote down cos(5x4 ) times 20x3 .
Eli: Uh-huh. Cause um the derivative outside times the derivative of inside.
I: OK. Why don’t you write that down? Um underneath the blue line that I’ve
drawn on that page. Write down, you can use words or symbols or whatever
you like as best as you can recall what the chain rule says. [pause] So what
did you write down?
Eli: Uh, I put symbol like A and B then I put um A inside, or B inside of A,
then I said derivative of A times derivative of B.
I: So the derivative of A, then what happens?
Eli: I just put derivative of A then write the B part down.
I: OK. Um in fact, the last five questions on this list were intended to be chain
rule questions and use the chain rule to complete the derivative. OK. Does
your rule there work for all five of those?
Eli: Uh, I don’t think so.
I: What’s wrong with your rule?
Eli: Because it doesn’t deal with the powers. Here it had A and B but, the ten
only had one power.
Finally, the four problem situations also were dealt with superficially. Eli decided
that the first two did not require the chain rule and the last two did. However, he
did not offer any explanation and ended by stating that he did not see it in the last
problem. The interviewer wrapped things up in an attempt to get some response
from him, but Eli was not able to add anything.
I: OK. Good. Um, here’s another sheet of paper. Read each of the following differentiation situations carefully. Without actually trying to solve the
problem, determine if the solution will involve the use of the chain rule in
some way. If so, describe how the chain rule is used.
Eli: [pause] Uh, number 1. Uh, no.
I: No. OK.
Eli: [long pause] Uh, I think that the first two, #1 and #2 are. . . have nothing
to do with the chain rule, 3 and 4 um is gonna have to use the chain rule.
I: Uh-huh, how does the chain rule get used in part c there, excuse me I meant
number 3?
Eli: Uh, 3? I have to take the derivative uh, 36πV 2 .
I: Um-huh
Eli: Then, um, . . . I don’t know. It just seem like I have to use the chain rule,
uh, I don’t know.
I: OK, and for #4 does that seem any more clear?
39
On #4 it says that you have formula of C and it says that F = −32t2 . So I
plug in F into the formula of C and I will have to use the chain rule there,
uh,. . . Well I can’t see the relationship of the inside and outside function.
I: Well, what variable are we using in C?
Eli: What variable? F .
I: I, right. Capital F . So then we’re saying OK, yeah but then you can even
find capital F in terms of time, in terms of little t. So that’s what is going
on there. So the inside function is all of that, −32t2 , that’s going to go in
for all of those F ’s.
Eli:
7.1.2
Tim [A134]
Another student working at the Intra- level was Tim. Like Eli, he grouped based
on the surface characteristics and did little in his second attempt.
I: Yeah I just want you to group these, like if you decide that number 3 and
number 6 go together, then they are in a group. Read through these and
decide . . .
Tim: Where are the groups?
I: Make up groups, and then put them in the different groups.
Tim: Oh, OK. This one [works for 2 minutes] So.
I: OK. So you put
Tim: 4, 5
I: 4, 5, 8, and 9 in one group. What do they have in common?
Tim: Sine, cosine and tangent.
I: Right, OK. And then we have 1, 3, 6, and 7.
Tim: This one is powers, power of 5, power of 6, so I don’t know how to put these
together.
I: OK, they are just the leftovers.
Tim: Yeah
I: Is there any other way that you could group these?
Tim: Ah, yeah. [pause] I could group the e functions together.
Tim did not recall the chain rule by name. In fact, it took much prompting using
an example for him to see what he had done.
I: OK, good. Let’s use the bottom of this page here, I will draw a line. Could
you write down the chain rule in whatever words or symbols you remember?
Tim: The chain. . . ?
I: The chain rule, for taking derivatives.
Tim: You mean y = x2 − 7x + 5 and you take the chain rule for this one?
I: No, the chain rule is just a rule we have for taking certain kinds of derivatives.
Do you remember using the chain rule?
40
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
Um, y, it was something, um. . . Can I do an example out of the book? The
chain rule for this one? [writes out f (x) = y n , (y n )0 = ny (n−1) ]
That is actually called the power rule.
The power rule.
You have got something raised to a power so you bring the exponent down.
You would use that in a lot of these problems here, the power rule. The
chain rule is talking about something else. Let me show you an example
of a problem where you would use the chain rule, for instance #18 here.
The original problem was sin(5x4 ). [Student is shown written work from the
questionnaire, which has the correct solution to the problem.]
You mean give the rules of this one, how I worked out this one?
Yes, how did you find the derivative there?
Oh, [writes and crosses out f (x) = u(x), f 0 (x) = u0 (x)] [mumbles] [writes
f = (y · x), (y · x)0 = y 0 · x + x0 y] I don’t know, you know, if you give me an
example of how to do the chain rule, I know how to do products.
That is what I am saying, this solution you have right here. . .
Mm-hmm
you used the chain rule
yeah
to get that solution, which is correct. OK? Does this remind you of the chain
rule, then?
Uh-huh, so what you want me to . . .
We are starting with sin(5x4 ) and look at what you wrote down for your
answer, and try and remember how you came up with that idea.
Oh! So first I take derivative of outside, derivative of sine is cosine,
Right
so then I take derivative of inside, so inside is 5x4 , so I write down 20x3 .
OK
That is so easy, you know, I don’t know how to get the something that you
asked me to do.
You don’t. . . So the question was, what does the chain rule say?
If there is a function of u, something like that, and take the derivative of
that, right, first you take the derivative of outside first, then take derivative
of inside.
Tim was able to see the chain rule in the implicit situation, but could not see it
in any of the other three.
Tim: If I do. . . OK, OK. So if I pick one?
I: Uh-huh
Tim: This one not chain rule. Hold on, I use the chain rule, yes. This one use
chain rule.
I: How would you use the chain rule?
41
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
Tim:
I:
√
That one has y and when I take derivative, I write it as y 1/2 and when I take
derivative of that one, I get that and that and [writes (y 1/2 )0 = 21 y 1/2−1 =
1 −1/2
] So if y is something else, I have to take derivative of y.
2y
OK
Not really, y, there is something inside y. I have to use the chain rule for
that.
OK. Right.
I’m not sure about the next problem. I forgot.
The name of the next one is a related rate problem. Do you remember that
name?
That has to do with speed? What is the formula? I not clear on the question
you know, ‘find the formula for the rate.’ [pause] This one doesn’t have
anything to do with it. Do you think?
OK.
So I don’t know if it has derivative. The next one is . . . [reads problem to
self] . . . power rule. . . No, I not use chain rule.
OK
[pause] I don’t think I would use the chain rule for the last one.
OK. Good.
While it is the case that Tim correctly solved all of the chain rule problems on the
questionnaire, it seems clear that he is not working with a general formula. Rather
he is taking each problem as it comes to him and dealing with it individually.
7.1.3
Al [A137]
Al seemed to be at the Inter- level. While his grouping of the ten derivative
problems did not use the chain rule, as was the case for Eli and Tim, he was able to
connect his notion of the chain rule with the items 16–20 (which are numbered 6–10
on the interview sheet).
I: What I would like you to do is to write down the chain rule, using whatever
words or symbols that you like, and can remember.
Al: OK.
I: Feel free to talk out loud.
Al: The chain rule, I know that you have something, you have a function f (x)
[mumbling] , so you’re gonna have f (x) [mumbling —pause] that’s 2, you
have something to the power of whatever and you take the derivative of
that, [has written (x3 + 2x)2 ] that would be 2x cubed plus 2x that the chain
rule would be if you take the derivative of that again and multiply it by that
of the inside.
I: OK, so is there an example of that in those problems there?
Al: [silence] Yeah, #6.
42
I: #6. Good. So you described it, you didn’t write anything down.
Al: No.
I: That’s OK. That’s fine, we got it on tape which is all I’m worried about
really. Um, so here’s a point, the last five questions 6–10 were written with
the intention that you would have to use the chain rule to solve them. So
I’m wondering, um, if we compare what you have written down as the chain
rule with each of those last five problems, does your statement take care of
it?
Al: Takes care of #7.
I: Uh-huh.
Al: Then it takes cares of that because you have cosine of that and the entire
derivative of that; which I said is that you have something on the outside
take the derivative and times the derivative on the inside.
I: OK. That’s not quite what you said before though, right?
Al: Right, I said that each have to raised to powers.
I: Right, so now you’re talking about inside and . . .
Al: Yeah, I know that you’d have to use the chain rule for that and . . . I think
that I would definitely use the chain rule for that.
I: OK.
Al: And, it is a prerequisite condition and I haven’t done chain rule for like a
year now.
I: That’s fine, that’s fine. I was just trying to get whatever you thought of first.
And now what you are thinking of second. This is fine.
Al: OK.
He went on to work through the remaining three problems. The main distinction
between his work and that of Eli and Tim was that he seemed to be in control of
making the connections once the chain rule was pointed out.
In classifying the four situations, Al saw the chain rule in the implicit differentiation problem and in the last problem. He was unfamiliar with the related rate
problems and decided that it would not be used. It seemed as though he might have
answered differently had he worked through the solution to the problems.
I: Yeah, alright. What I have is a set of four situations, that are differentiation
problems that you probably dealt with in the first calculus course that you
took. I don’t want you to solve these problems, but if you would read through
each of these situations and decide if the solution would use the chain rule
in some way and if so, then how would it be used.
Al: OK. [silence] For one you would use it. You have x times the derivative of
y times the. . . that would be you use the product rule to get the derivative
of that, that is y to the 1/2 times the derivative of y. . . −1/2y to the −1/2
times the derivative of y.
43
I: And that’s the chain rule?
Al: Yeah. You have uh, y in itself is a function and y can be x cubed or you
know t cubed. And the square root of it, and to get to that you have uh,
1/2(t3 )−1/2 times the derivative of that which we’d write 3t2 .
I: OK.
Al: For one you’d use it. On to two now?
I: Uh-huh.
Al: [pause] I don’t know how to do #2.
I: Um, would it help if I told you that it was a related rate type of problem?
Al: Um, no.
I: OK. Alright. What you have is two things that are varying in this situation,
the height on the wall and then the distance from the wall on the bottom.
And, I’m given information about the rate at which the bottom is changing
and I’m asked about the rate at which the top is changing. So what happens
is you look for a relationship between those two original quantities. The
height and the distance from the wall and then you try and go from there to
an expression that talks about their rates.
Al: OK, but how would we do that?
I: Oh, you want to see the work?
Al: No,
I: Well, particularly if you draw a picture um you have a right triangle and so
you might label the height h and the bottom could be x. . .
Al: Then you used the triangle 1/2. You use that 1/2, you use that that triangle
thing?
I: Uh, I use a triangle thing. Not the area though, I use the Pythagorean
theorem to talk about the sides. We said that the ladder is L feet long so it
would be L2 = h2 + x2 .
Al: OK.
I: That’s our relationship. That relates these things together and then I would
need to find the rates by taking a derivative.
Al: OK. You want me to do three now?
I: Well, in that description, I didn’t say the word chain rule or use the phrase
chain rule. Do you think that the chain rule might pop up there or not?
Al: Um, no. [pause] No.
I: Um, yeah, so number three.
Al: [student reads to himself—long pause] You just take the derivative of that
and that would put that; you wouldn’t use it there either, I don’t think.
I: OK.
Al: I don’t think so. But, I’m not very good at word problems.
I: [laughs] Very few of us are.
Al: [chuckles] Yeah.
I: Do you recognize that problem as something that you have done before?
44
Al:
I:
Al:
I:
Al:
I:
Al:
I:
Al:
I’ve probably done it countless times. More than likely.
OK, but it’s not familiar to you.
Yeah. I’ve done that like probably ten times but. . .
The second one. . .
Yeah.
Right.
Cause almost every calculus course has that.
It’s a classic. OK, how about four?
[pause—reading] This, I don’t think that I’ve ever seen before. I would use
it here.
I: You would.
Al: Uh-huh. Yeah, because you have −42t2 and put that into F and that’s
squared cubed and then that times that.
So Al had made some connections between situations involving the chain rule, but
is still working toward a cohesive whole.
7.1.4
Ray [A123]
Ray is the most interesting case (he is the topic of section 8.4 below) as he had a
rather highly developed sense of the chain rule but no facility with it generally. This
is due to his making the decision to not memorize the derivation formulas. In the
following excerpt, Ray was attempting to do item 17 from the questionnaire, which
he had left blank.
Ray: This one is messier. I would like to approach this one as a composition.
I: 17?
Ray: 17, yeah, I would like to approach this one as a composition of two functions
as two different things, but I don’t remember those rules either. As far as
like the chain rule and except for this one is like 2x4 is f (x) and g(x) should
be [mumbles] I would have to use the chain rule to find out what that was
the derivative was. Oh, boy, foggy memory of the chain rule, I really did rely
on the, that, that. I actually had memorized the page where the derivative
rules were and then I didn’t have to memorize the derivative rules. Except
right before a test I would memorize them temporarily, I would cram them
into my RAM space.
I: So, um, the situation right now is that you know that there is a chain rule,
and it would apply here for a composition, um, try to dig out as much as
you can.
Ray: It’s like f (g(x)) prime, is g(x) prime. Something like x has derivatives inside
there, different things. . . something along these lines. How ever did you grade
my tests? [sighs] I know that you take the derivative of each one separately
and compose them somehow, either added or multiply them, I think that they
45
are added and beyond that that’s all the fog that covers the skyscrapers of
my mind.
I: That’s called pollution. Um, what other rules do you remember by name?
Ray: Let’s see here, the chain rule, the um sum rule is pretty simple, you can take
them separately. Um, the product rule I remember, by name, um that being
the special case, once again. I don’t exactly know how to do it. There’s a
rule for division, the quotient rule. Those are the important ones—chain,
product and quotient. Then we have like power rule—the easy one it’s like
x2 ; 2x. I remember that one and that really took care of a lot of them. The
power rule. I used that for the first four of them.
I: Right. Right.
As the interview progressed Ray was able to describe which rules he would use to
solve the problems, provided he could see the formulas. With much interaction on
the part of the interviewer, he recalled the formula for the product rule and the chain
rule. He demonstrated the use of the chain rule on item 18.
Likewise, with the four situations, he was able to see the chain rule in most of the
problems. In both the second and third problems (related rates) he did not see the
need for a derivative; hence no need for the chain rule. Apparently, he had associated
acceleration with rate and with integration.
Ray:
OK. [pause] OK, I’m trying to remember . . . got a rate . . . this one doesn’t
even scream for derivation to me. Because you have one rate . . . L is going
to be a constant. Um, looking for a second. . . this doesn’t even really scream
for a derivation except there’s too many variables and to get rid of L that
we’d. . . but no I don’t think that I would use the chain rule on number two I
would set up some kind of trigonometric relationship to two different sides,
time being a constant thing, but distance, the trigonometric relationship
between the distances would dictate everything out.
I: How would they dictate them?
Ray: Well, in the same amount of time that it sits here, [mumbles to self] cosine. . . goes to. . . opposite [mumbles to self] opposite over adjacent is the
tangent. . .
I: Right.
Ray: um I would probably use the tangent, but L doesn’t help a lot and um, x
velocity over 4 feet per second, and I solve for x to get um, I’m not sure. . . I’d
try to solve for x through some large algebra and that would be the answer.
I: OK. OK.
Ray: As the rate, there’s not acceleration there. Which would scream for . . . no.
That would scream for integration, acceleration, but actually. Yeah, yeah.
There’s no change here, it is constant velocity so this doesn’t even ask for
derivation. Therefore I wouldn’t probably use the chain rule to solve this
problem. That was number 2. [Reads to self]. . . oh,oh, S cubed. . . solve for
46
V . . . it gives. . . I don’t think that I would use the chain rule here because I
don’t really compose V , actually yeah they are, S has V in it. [mumbles to
self] Yeah, I’d use the chain rule for finding d(S) and because I would have
to solve for V and S would move into the problem I would have to use the
chain rule for finding d(V ) as well, I would use the chain rule twice here.
That’s number three.
Um [reads #4 to self]. . . hey I remember this problem. . . [mumbles to self]
. . . find the expression . . . uh, that’s a composition—I’m talking about four
now—you would substitute −42t2 in for everywhere that you would have F
and that would be an expression for the rate of change of C. Um, [mumbles to self] well, that’s obviously the composition of functions. [mumbles
to self] As I’m trying to find the rate of change, it’s the equivalent of acceleration. . . [mumbles to self] yeah, I would I don’t think I would even have
to derive here. . . this is question number four. . . but both of these. . . I would
think that I would have to use the chain rule because I’m putting one function
into another.
Ray seems to be at the Inter- level. He made the connections between the various
situations, but he was doing so with what might be considered semi-objects. That
is, one might consider a complete idea of a rule as being the totality of knowing the
formula, how to apply it and when to apply it. Many students seem to focus on and
excel at the first two to the exclusion of recognizing the need in context. Ray, on
the other hand, had focused on the latter two, choosing to look up the formula when
needed.
7.1.5
Peg [A132]
Peg indicated a rather strong Inter- level understanding as she grouped differentiation problems. She used the chain rule as an initial criteria, selecting two of the five
chain rule problems. The remaining problems were classified as being trigonometric
or exponential. When asked for a second way to sort, she kept the chain rule as a
main criterion.
I:
Peg:
I:
Peg:
So is that five different groups?
[pause] Yes. Five different groups.
OK, what was the discriminating features?
OK, One is like the most straight forward, where it is just strictly using the
uh, what’s the name of that rule? The power rule.
I: Uh-huh.
Peg: It’s just straight out the power rule. Uh, two and three are pretty much
exactly the same thing except for two, the expression needs to be rewritten
to use that rule. 6 and 7 is also just the power rule except for you have uh,
uh, you have a power rule and then it’s a chain rule.Uh, 4, 5, 8, & 9 all have
47
trigonometric uh terms in them to where then you also have to know their
trig functions and then 10 is the e function which also has it’s separate rule.
I: OK. When you went through that. . .
Peg: And that’s pretty much from my point of view which was from the easiest
to the most difficult. Although, I guess that the e function is not really that
difficult; it’s just knowing that it, it’s just not as, it’s not difficult it’s just
different because it just doesn’t seem to follow the format as all of the others.
I: Uh-huh. Um, was there any other way that you considered grouping these?
Peg: [pause] Uh. . . in the time frame, I just went with the first thought that popped
into my mind. I’m sure that if I was to sit here longer, I could think of other
ways to do it.
I: OK. What might be other criteria?
Peg: [pause] Uh, the other ones would have been, may be, the ones that just had
single terms as opposed to having two terms that have to be differentiated.
Or anything that that has a chain, or pulling out anything that has the chain
rule and knowing that also have to add more terms to the final expression.
Peg gave a very clear statement of the chain rule, using function notation. She
augmented this with a comment about inside and outside things. But her notion of
chain rule and composition were apparently driven by parentheses.
I: So the last couple of questions that I have for you, write down the chain
rule using whatever words or symbols that you like. It’s as best as you can
remember it. You have here like examples of it. Number 6 and 7 which is
16 and 17 on our papers here. You used the chain rule in your answer. So if
you want to use those to help spark some ideas. . .
Peg: [pause] [mumbles to self] [pause] OK.
I: Read that for the tape.
Peg: OK. Uh, when you have the derivative of f (g(x)) it’s f 0 g(x) times g 0 (x). It’s
basically you take a derivative of the outer term and I’m using outer term
because it’s just the way that I look at it, composed with the inner term and
then multiply it by the derivative of the inner term.
I: OK. Great. Um, actually the last five statements here on the list, 6 through
10,
Peg: Uh-huh.
I: were intended to be, to use the chain rule and you named 6, 7, & 8 as using
the chain rule. Do you see the chain rule being used in 9 and 10?
Peg: Yes, I do, but I had pulled them out separately because they had the trigonometric functions also.
I: OK, no that’s fine. Does your rule that you have written there at the bottom,
does it apply or does it take care of all six of those or uh, all five of those
cases?
Peg: [pause] Uh. . . in the way that I would look at it, the way that I look at the
problem, it does. I look at the outer term and the inner term which would,
48
the outer term that I’m looking at is being either how you solve strictly for
the power rule or for the trigonometric functions and then going inside and
actually you know. . .
She hesitated to include cos3 (t) in the chain rule group since t was in the parentheses
2
alone (or e−t due to lack of parentheses). She had attempted to solve it by rewriting
it as a triple product. This is consistent with the codes she received in Phase 1 on
composition and her lack of success with the final interview problem (discussed in the
next section).
I: OK. [pause] You did a very interesting thing when you solved 19.
Peg: I don’t think that’s correct.
I: I don’t think that it’s very far wrong actually. But what you did is rather
than use the chain rule, you expanded the expression. The expression was
cosine cubed of t. And so you wrote down three products of cosine t which
is cosine cubed.
Peg: Uh-huh.
I: And then, you took a derivative from there.
Peg: Even with that, I didn’t finish it did I?
I: No, you needed to do the other product.
Peg: Yeah, but, yeah, I made it to that.
I: But, my question is, if you can recall, this was a long time ago, um, was
there something that didn’t say chain rule to you in that problem?
Peg: That would be the one that I’m still the least clear on still today because it’s
not as straight as the chain rule because the uh, the variable t doesn’t have
anything associated with it like all of the other ones, that there’s something
inside the parenthesis to differentiate. Where this one. . . so would it be 3
cosine squared t uh times minus sin? Is that the correct answer.
I: Uh-huh.
Peg: Oh.
I: That’s exactly it.
Peg: But I, it’s just not as obvious of the chain rule as all of the others.
Peg was no longer familiar with implicit differentiation or related rate problems.
After being reminded, she does not see the chain rule since there is nothing inside the
other. She did see the chain rule in the last problem although the interviewer had to
point out the expression for F in the paragraph.
Overall we see Peg making connections based on a structure, the use of parentheses. She gave a general rule (again written with parentheses) but had some difficulty
recognizing problem situations which fit her rule. Peg may be working at the Trans-
49
level, that has been built on a weak structure. Her understanding of the chain rule—
discussed in the next section—is rather strong, but the data is equivocal regarding
the triad level.
7.1.6
Jack [A113]
The one example of the Trans- level is Jack, who not surprisingly had the highest
overall score of those interviewed. Jack grouped (and then partitioned) the ten differentiation problems based on levels of difficulty. He clearly saw the chain rule as one
of the criteria along with the power rule, product rule and trigonometric rules. He
stated that the trig rules are the most difficult, indicating that the other rules have
reasons behind them, but the trig rules must be memorized.
Jack:
10, I would probably put, I don’t know, I would be tempted to put 10 in a
group by itself just because with it just being e it’s basically the chain rule,
and the chain rule and the power rule together. That’s it. That one is done.
Um, 7 and, let’s see 7 requires it, 7 requires product rule and chain rule, and
the all powerful power rule. . .
8 just requires product rule. I know, no I probably would just go ahead and
group 4, 5, 8, & 9 all together because they have the trig function and because
I mean, the trig functions are the only derivatives that can throw you off real
easy if you don’t know them, because chain rule, if you understand chain rule
and you understand the use of things like exponentials and logarithms then
you aren’t gonna get messed up bad on chain rule. You aren’t gonna get
messed up bad on the power rule if you know simple mathematics. You’re
not gonna get messed up too bad on product rule as long as you remember
to keep everything straight. But, the trig functions, you know, you got, I
can’t even think of them at the moment. I haven’t used them in a while, but
it’s like you know one it’s the other and it’s just negative and one it’s just
the other, period. And it’s like if you forget that sine, the derivative of sine
just by whether or not it’s got a sign change in it, you just messed yourself
up big time and you’re gonna get a wrong answer because they cycle; and if
you start off on, off with the wrong derivative of it then you have messed up
the cycle already. Then, no matter, if you know all the others you are gonna
be messed up anyway. So. . .
In the next questions, Jack gave the textbook answer for the chain rule. He
elaborates on his answer describing inside and outside functions, as opposed to Al
and Peg who spoke of inside and outside terms. And finally he outlined a proof of
the rule.
Jack:
Um, huh, [pause] let’s see how would I write that down? OK. Let’s see.
[pause—writing] Um, that’s just the way that it runs through in my head.
[had written f (g(x))0 = f 0 (g(x)) · (g 0 (x)) · x0 ] Without any words.
50
I:
Jack:
I:
Jack:
OK.
That’s just the way that I think of it.
OK. Read that for the tape.
Um, you take the derivative, essentially, whenever you use the chain rule you
are essentially looking at a function that has got another function within it
um, I don’t know, it’s sort of like doing a, handling a composite, um and
in order to take the derivative of that composite you have to first take the
derivative of the outside function and not even do anything with what’s the
inside of it, the function that’s on the inside of it, you take the outside, it’s
derivative first and leave the inside function alone and then multiply that by
the derivative of the inside function, and then multiply it by the derivative
of the variable, or however many times you have to break it down. Because
you can have a huge function that’s got a lot of stuff inside of it and you’d
have to do the chain rule several times to get the x variable.
I: OK.
Jack: So, I mean you could have, that’s just like a simple composite f of g, but
you could have, if you have like h(f (g)) then you’d have to do the derivative
h with f of g inside of it and then the derivative of f with g inside it and
then the derivative of g and then derivative of x.
I: OK.
In the four problem situations, Jack saw the chain rule being used in all of them.
His explanation for each indicated thinking about the functions in the situation.
Overall, he had as well established an understanding of and facility with the chain
rule as one can hope for in a first year calculus student. The Trans- level is not so
evident by the preceding quotes, but is more clear in his work on the Leibniz rule
problem which is discussed in the next section.
7.2
Using versus understanding
A working hypothesis of this study is the need to understand composition of
functions as a condition for understanding the chain rule. While the first phase results
did not add evidence to this claim, the claim was not refuted since the questionnaire
items did not assess understanding of the chain rule, rather they could only indicate
success (or lack of success) with the rule.
7.2.1
Coding the aspects of understanding
The interview data provide us with sufficient information to assess the student’s
understanding of the rule. In order to discuss understanding, we have noted three
aspects from the interview:
1. Did the student mention the chain rule prior to question 10 of the interview?
51
2. Did the student arrive at a coherent statement of the chain rule? With or
without a prompt?
3. Was the student successful with the Leibniz rule problem? With or without a
prompt?
These questions were coded as an ordered triple with codes Yes or No for the first
aspect and codes Unsuccessful, Prompted, or Successful for the second and third
aspects. The following excerpts illustrate the codes for each subject which are summarized in Table 7.1 below.
Eli received the code [N, U, P]. As we observed in the excerpts given in the previous
section, Eli’s statement of the chain rule never indicated that the derivative of the
outside function was then composed with the inside function. When working on the
Leibniz problem, he insisted on finding an antiderivative. The interviewer provided
him with the simpler Leibniz problem (H(x)) and had done most of the talking up
to this point, getting Eli to work out the closed form of H:
I: Alright. OK, so now that’s H(x). That’s just computing it all out. Right?
You just did this side, it’s still H(x). So I could put an x in and take a sin
of it and do that other stuff. What’s H 0 ?
Eli: [pause] So you take derivative of this?
I: Uh-huh
Eli: So this is the chain rule again.
I: Always. [pause] OK, so you have. . . Oh, you didn’t label it H 0 . But H 0 (x) is
4 times sin(x) squared times the cos(x). Right? Now we started off with one
of these deals where it’s defined in terms of its definite integral and you were
able to take the antiderivative and evaluate it at the endpoints and come up
with that, and then take the derivative and you get this. Now back to the
original expression up here, do you see a way to jump straight to that answer
without finding an antiderivative?
Eli: Yeah.
I: What is it?
Eli: You just um, replace t by sin(x) then just multiply the derivative of sin(x).
I: OK, then let’s go back to the first one that I asked you. F (x) is the definite
2
integral from zero to the sin(x), et dt. What’s the derivative of F ?
Eli: The derivative? The derivative of. . .
I: The derivative of F at the very bottom here like you found H 0 here.
2
Eli: [mumbling] [writes F 0 (x) = e(sin x) · (cos x)] OK.
Tim’s code is [N, P, U] where the first two are seen in the excerpts above. In the
last problem, he recognized the fundamental theorem of calculus (FTC) version of
the situation (G(x)), but could not make a connection back to the Leibniz version.
52
I: I am asking you to find F 0 where F is the function given by F (x) = the
2
integral from 0 to the sin(x), et dt.
2
Tim: It is the inside, the anti-derivative [writes F 0 = et ].
I: Let me show you one other one, here is a function G(x) which is defined
2
almost the same way, zero to x of et dt. What’s G0 ?
Tim: Can I do this? [works and mumbles]
I: Whoa, whoa. You have taken a derivative there, not an anti-derivative.
Right?
Tim: Oh! Sorry, I’m sorry.
2
I: Let me just help you out here. There is no anti-derivative of et .
Tim: Oh? It is a trick.
I: There is no simple anti-derivative that we can just write down on paper.
So your method there, while it would probably work, if you could take the
anti-derivative of that, it won’t work here.
Tim: [pause—working] [attempts to find an anti-derivative] Yeah, it does not work.
I: But can you answer the question? Can you figure out what G0 is if you know
this is how G is defined?
Tim: [1.5 minute pause] I’m lost!
I: Really?
2
Tim: Yeah, I think G0 (e) = et .
I: OK. G0 of . . .
2
Tim: G0 (e) = et
2
I: G0 (e)? is et .
Tim: Uh-huh
I: OK
Tim: Is that right?
I: Let me ask you this, when you answered this first one, you jumped right to
it and said ‘oh, F 0 is the insides’.
Tim: Same thing here.
I: They are the same. So it doesn’t matter that there is a sin(x) up here?
Tim: Oh?
I: This one goes 0 to x, this one goes 0 to sin(x). So I am wondering if that
makes a difference in the answer?
Tim: So, yeah, yeah, hold on. Sine . . . [works and mumbles] [writes −1 ≤ sin x ≤ 1,
0 ≤ sin x ≤ 1] Yeah, I think it doesn’t matter for that.
I: It doesn’t matter.
Tim: No.
Al got [N, P, U]. In his solution to the Leibniz problem, he quickly stated that the
derivative is found by substituting sin(x) in for t. Then when working through the
53
FTC problem, he wanted to use the chain rule and multiply by 2x. After the prompt
to correct G0 , he did not go back to modify his original answer.
I: OK. Great. What I’m asking you for here is compute F 0 if F is the function
2
given by F of x = the definite integral from zero to the sin(x), et dt.
2
Al: I’d take et and just put the sin(x) squared in there, and then you’d pretty
much have zero and it would be this right here.
I: That’s the derivative? That’s F 0 ?
Al: Yeah.
I: OK, just clarifying.
Al: OK.
I: Let me ask you a similar question. This one is called G, it’s basically the
same except um, the definite integral goes from zero to x. What would um. . .
Al: Just change the t to the. . . x squared.
I: OK, so G0 .
Al: times 2x, and over here you’d take derivative too. I don’t know . . . what I’m
saying, you’d take the derivative, uh . . .
[end of side one]
I: So you’re wondering do you take the derivative. . .
Al: Take the derivative of x2 or not.
I: [talking together] Take the derivative of x2 or not.
Al: Yeah.
I: Alright. Do you remember seeing situations like this where you have the
functions defined through definite integrals?
Al: Uh-huh.
I: OK, it was part of the fundamental theorem of calculus.
Al: Oh, [mumbling] I never. OK, I know this part was. . . right and this part was
right.
2
I: OK, ex you’re sure about, and you’re wondering about, um, the 2x part.
OK. [pause] So, suppose I tell you that you don’t do 2x in this problem. The
2
answer is just ex . That comes from the fundamental theorem of calculus,
which essentially says that if you have a function defined in this manner the
derivative of that function in this case G0 is just the inside written in terms
of x.
Al: Uh-huh. Yeah. And that would be the answer.
I: Without the 2x?
Al: Yeah.
I: Right. Now what I want you to do is to go back and look at this other
problem.
Al: Uh-huh.
I: And see, you have e to the sin(x) squared, and the question is, are you still
comfortable with that?
54
Al: [pause] Yeah.
I: OK.
Al: Yeah.
Ray and Peg received codes of [Y, S, P]. Ray’s first thought on the Leibniz problem
was, since a definite integral is a number, then the answer is 0. It only took a small
prompt for him to see the function involved. When he saw the FTC version of the
question, he recalled the rule and quickly gave the answer. He then saw that the
original question involved a composition and struggles a bit before coming to the
answer for F 0 .
Peg mentioned the chain rule when confronted with a composition problem (that
is not a derivative problem). So the connection between the two is strong. Peg
recognized that the situation involved a theorem that she once knew, but denied
the FTC. The interviewer prompted her through the FTC and also pointed out the
composition.
Jack got a [Y, S, S]. Jack indicated a cohesive schema for the chain rule as he
attempted to deal with the Leibniz problem, which he did not recognize. To deal
with the unfamiliar, he wanted to compute something, so the interviewer provided
him with a simpler Leibniz rule problem with a monomial integrand. Jack worked
through the integration, then needed a little prompt to label his answer (the function
H). When asked to go back to the original problem, he was able to generalize the
use of the chain rule and to arrive at the correct solution.
7.2.2
Comparing the codes
The results of the codes for understanding are presented in Table 7.1. It is sorted
according to a lexicon order on the codes. Also presented are the combined codes
from Phase 1.
Two patterns in this table are of particular note. First is the support for our
working hypothesis. With the exception of Peg, the scores increase for the Composition codes as one is more successful at understanding. We note that Peg left items
9 and 10 blank on the questionnaire because she did not want to deal with the tables. If we exclude the codes for items 9 and 10, the Composition column becomes
[6, 9, 9, 9, 15, 15]T which also lends support to our hypothesis.
The second pattern is the third aspect code and the C4L variable. Those who were
able to deal with the Leibniz rule problem were all in a C4 L section at some point.
Both of these points seem to say something, but must be understood in context.
These patterns exist in these six students’ results only and do not generalize to any
55
Table 7.1
Aspects of understanding
Eli
Tim
Al
Peg
Ray
Jack
Code
C4L
Function
Composition
Differentiation
Chain Rule
[N, U, P]
[N, P, U]
[N, P, U]
[Y, S, P]
[Y, S, P]
[Y, S, S]
11
00
00
10
11
01
9
4
19
14
18
15
7
11
12
9
21
24
16
22
24
25
19
25
20
25
19
21
1
24
other population, not even the Phase 1 group. It would be unwise to attempt to draw
any strong conclusions from this data.
Chapter 8: Discussion
The analysis of the data is consistent with the genetic decomposition for the
concept of the chain rule based on the triad, which was presented at the start of
this work. Some evidence is presented to support the notion that understanding of
composition of functions is key to understanding the chain rule.
The type of instruction was a factor in how a student performed on these tasks.
The differences between the types seem to be related to the difference between using the chain rule and understanding the chain rule, and an explanation of these
differences is offered.
The need for collecting differing types of data is established. Two students scored
high quantitatively but did not perform well in the interview, while another student
failed the quantitative work but demonstrated a high level in the interview. Finally,
the use of writing as a pedagogical tool is recommended based on the results.
8.1 Revised genetic decomposition
The overarching purpose of this study was to test APOS Theory, specifically
the proposed use of the triad mechanism to describe the development of a student’s
schema. For the six participants in the interviews, we claim that it does. The interview data show students working at all three levels by discussing how they appear
to be making connections as they approach problem situations which involve the
chain rule in the solution. After presenting evidence of perceiving relationships in the
situation, a working definition of understanding the chain rule as it appears in the
data was constructed. This construct presents a parallel story as the six subjects are
ranked in understanding along the same lines as the triad levels they demonstrate.
The students who have moved into the Inter- level demonstrate some ability to extend their understanding of the chain rule, as well as the ability to discuss the rule
coherently.
The interview data did not provide evidence which required the modification of
the genetic decomposition given in Section 2.4. So the description of the schema for
the concept of chain rule developed by Clark et al. (1997), including this author, has
the support of this study with the six interviewed students. Clark et al. concluded
that the chain rule schema
56
57
must contain a function schema which includes at least a process conception of function, function composition and decomposition. The function
schema is linked to a differentiation schema which includes the rules of
differentiation at least at the process level. (p. 359)
Clark et al. (1997) further stated that this schema is developed through the three
levels of the triad. In the Intra- level, the student is able to deal with many types of
problems as distinct cases. Some students received high codes for their work on the
chain rule section of the questionnaire. But their concept of the rule is insufficient to
discuss the chain rule in any detail or to extend it beyond their familiar collection of
rules.
At the Inter- level, the student begins to reflect on these various cases and builds
connections relating them. Students in this study not only used phrases such as
“inside function/term/stuff” and “outside function/term/stuff”, but also used them
as the criteria when presented with new situations. In other words, they were using
their perceived relationship (or more general idea) for the chain rule when asked if
the chain rule would appear in the solution to a problem situation.
A student working at the Trans- level has acted on the relationships from the previous level to construct a coherent whole, a system by which he or she can determine
what situations are part of the schema and what are not. This investigation used the
Leibniz rule problem as a way to test the robustness of the subjects’ schema (or preschema, if at the Inter- level). This problem features a composition with a function
defined through an integral, which pushes most students’ function schema to the limit
(Thomas, 1995). The problem was presented with simpler cases on hand in order to
help the students progress to the point of needing the chain rule. The simpler cases
were intended to help the student deal with a function defined through an integral
or to give the student a chance to compute something if the need to do so was great.
The analysis of the responses to this problem was focused on how the chain rule was
incorporated into the problem, once the composition in F was established.
This genetic decomposition is not “revised” in the sense that it has changed as a
result of the data analysis. It has however been revisited with new data and found
to be consistent with that data. The triad mechanism is useful in describing the
development of this schema, and will likely be as useful with other schemas. This
mechanism extends APOS Theory by augmenting our understanding of how schemas
might come to be built in the minds of our students. In particular, this genetic
decomposition will be used as a basis for the design of future studies involving the
58
chain rule and for the development of instructional strategies, such as those presented
below.
8.2
Understanding of composition and understanding of the chain rule
Can we conclude that understanding of composition of functions is key to understanding the chain rule? The results from the first part of this study were largely
inconclusive. The students performed well on the differentiation problems and the
chain rule problems. There was not the same degree of variation as in the function
and composition sections, and this did not allow significant correlations to be found.
This is a problem of assessment; the difficulty in measuring (depth of) understanding
in a written instrument. Since the questionnaire data did not measure understanding
with respect to the chain rule, the evidence does not contradict the hypothesis.
There is a small amount of supporting evidence in the Phase 2 data. As seen
in Section 7.2, for the six subjects who were interviewed, there exists a correlation
between understanding the chain rule and the codes for (three of the five) composition
items. A new study is needed to address this hypothesis. That study might employ
interviews based on the interview guide for this study with a larger sized sample. Since
the focus would be on understanding, collecting data with a questionnaire alone is not
advised (see Section 8.4). A questionnaire could be useful for screening participants
for interviews and for setting the stage for the interview tasks, as it was for this study.
8.3
Comparing instructional methods
The comparison between the traditional and reform methods is an account of using
versus understanding, of instrumental versus relational. In the interview setting,
the students who had experienced some reform methods were more successful in
extending and discussing the chain rule. In the quantitative data, the traditional
group performed better in the differentiation and chain rule sections, in the sense of
statistical significance.
8.3.1
Understanding the chain rule
In Table 7.1 some evidence is found that the students who had experienced C4 L
instruction demonstrated better understanding than the others. Eli is the exception
to this statement, but it fits with his role as an outlier in the quantitative phase and
with his lack of cooperation in the qualitative one. This comparison can only be made
with these six (or five) students and so must be seen as an example, rather than as
a trend. On the other hand, a current review of a series of studies in which student
performance data is available indicates that, on average, students who had instruction
59
based on APOS Theory performed at the same level or above those students who had
traditional methods of instruction. (Clark, Dubinsky, Loch, McDonald, Merkovsky,
& Weller, 1999). The interview evidence of this study is also consistent with that of
Meel (1998).
8.3.2
Using the chain rule
The questionnaire data did give us information on how well the students were able
to use the chain rule. Tables 5.6 and 5.7 both indicate the students in the traditional
course scored higher in the chain rule problems. In the first table, the C4 L group
was taken as C4L > 0, which cut the participants into almost equal sized samples.
In that case, the mean Chain Rule score for the Traditional group was 22.00, a 9.3%
increase over the C4 L group. In the second table, the groups were formed based on
the type of instruction for the first semester. In that case, the mean of the Traditional
group (21.91) was statistically significantly higher than that of the C4 L group (18.75),
16.9% higher.
These results are also consistent with previous studies. Heid (1988) found that
students from traditional calculus courses performed slightly better on skills based
problem situations than students from reformed courses. Dubinsky and Schwingendorf (1991) found the slight differences between groups favored the reform group.
Palmiter (1991) found that the reform students performed better than the control
on both conceptual as well as computational tests. However, her results may have
been skewed since the treatment group was aware of the study and the control group
was not. For our results, there are two global factors and one local factor which may
explain the difference. Globally, a traditional course by its nature emphasizes skill
acquisition. Typically, a student in a traditional course will complete 60–75 problems
in a week, with most of the problems being computational over a small range of topics. Another global factor concerns C4 L in particular and reformed calculus programs
in general. In order to utilize technology as a tool for calculus, time has to be spent
getting familiar with it. C4 L pays a price in its methodology, and part of that price
is less time for practicing skills.
In addition, the implementation of the C4 L course that these students experienced
was new to the institution, as the instructors has just joined the faculty. This local
factor involved changing student expectations for the mechanics of a course, dealing
with a commuter student body (having come from a residential situation), and networking problems inherent in a new computer laboratory. There is no practical way
60
to measure the effect of this factor on the difference in the Chain Rule scores between
courses1 .
8.3.3
Finding a balance
These results are not surprising but do present the question of our teaching objectives. If a conceptual understanding is the goal (and how can it not be?), then
this evidence indicates a focus on developing a schema is necessary, as was done in
the reform method seen here. Of course, Ray is an example of going too far away
from the instrumental knowledge to the conceptual. However, it is certainly easier
to fix this than the reverse. Once a conceptual basis has been established—once a
schema has been constructed—it is merely a matter of practice and drill to augment
the understanding with skills. The use of quizzes on computational items or the use
of gateway examinations might help balance any over-emphasis on the conceptual
understanding by a particular methodology.
8.4
Quantitative versus qualitative data
In the quantitative phase, three sets of data from the questionnaires were excluded
from the analysis because there were too many zeros in the code data—which was
interpreted as incomplete questionnaires. Two of these students were interviewed in
the second phase, Eli and Ray.
In the previous chapter, the case was made that Eli should have been excluded
from the quantitative analysis for non-participation. In fact, he was fairly nonparticipatory in the interview, too. The decision to exclude Ray from the quantitative
analysis was justified. However, it was not the case that he was not participating—he
was just caught off guard. Had he been told what to expect when he came to fill out
the questionnaire, he may have prepared and done very well as he had in his calculus
classes.
In this set of interview participants, two individuals with high scores in the Chain
Rule codes are found: Tim got 25 and Al got 19. Based on this data alone, we might
have concluded that these two “know” the chain rule or even “understand” it. Yet
both have some difficulty extending the concept. Tim even had difficulty expressing
the rule with prompting. At the same time, Ray got a 1 for the Chain Rule in the
written work. This would have to be interpreted as a failure to understand the rule
in a test situation.
1
Over the intervening couple of years, this factor has diminished—especially in the perspective of
the students.
61
The qualitative data paint almost the opposite picture for these three students.
Which is the accurate representation of reality? One conclusion of this study is the
necessity for collecting multiple data types when attempting to describe a student’s
understanding of a concept. The question raised is, how can we as teachers assign
grades based on written exams and homework only?
8.5
Limitations
The results of the study may be limited in their potential to be generalized in two
ways. First, since the subjects were volunteers, there was no control over the number
of subjects for the study nor over the range of abilities of students. The issue might
be addressed by trying to get n ≥ 50, a goal which was unattainable in this study.
With such a number, the spectrum of abilities should be filled out. The statistical
methods used in Phase 1 might have been extended by deleting variables from the
model or throwing more powerful models at the situation. That this was not done
came from a certain sense of purity; the bad data is just accepted as is. The model
was based on the genetic decomposition of understanding a concept. However, the
data which was collected reported a measure of using the concept. It also came from
the fact that only this set of students was being described, rather than an attempt
to describe the entire population of first year calculus students.
The second issue is also about self-selection. Since the comparison of the two
methods took place after the term, no pairing of students across treatments was
possible. Students at the host institution are free to choose whatever section of a
course they desire. Thus other factors (or student characteristics) may be involved
in whether the student chose to take a Traditional or a C4 L course. Schwingendorf,
McCabe and Kuhn (1998) discuss statistical methods that have been developed to
treat this issue, which arises often in health and industrial studies. One such method
is factoring in the students’ PGPA scores, which seems to have accounted for any
natural differences between the classes in this study.
8.6
Instructional strategies
A student’s concept of the chain rule appears to be a schema which is developed through the triad mechanism. The components of this schema are function
de-composition and differentiation. While the students of this study seem to have
62
the differentiation well in hand, they appear to need to move from a focus on “inside/outside” structure based on parentheses to a perception of composition of functions. To aid in the progression through the triad, we should give students opportunities to reflect on a variety of situations in order to build some connections and then
to reflect on those connections.
Short writing assignments may be the tool to take care of both of these needs.
By requesting an explanation which identifies the functions involved in a chain rule
problem, along with the answer, the student will have to approach the situation functionally, rather than symbolically. Likewise, short essays that compare and contrast
banks of problem situations will foster the reflections necessary.
Regular writing assignments will also address the assessment issue raised above.
The written discussion can allow the instructor to see the student extending the
concept to new situations. Wahlberg (1998) found evidence of both learning on the
part of the student and enriched assessment on the part of the teacher using writing
assignments on the topic of limits. While writing assignments take time to grade,
they are a reasonable alternative to scheduling, conducting and transcribing personal
interviews with your students.
LIST OF REFERENCES
63
LIST OF REFERENCES
Alson, P. (1992). A qualitative approach to sketch the graph of a function.
School Science and Mathematics, 92, 182–187.
Asiala, M., Brown, A., DeVries, D. J., Dubinsky, E., Mathews, D., & Thomas,
K. (1996). A framework for research and development in undergraduate mathematics
education. Research in Collegiate Mathematics Education, 2, 1–32.
Asiala, M., Cottrill, J., Dubinsky, E., & Schwingendorf, K. E. (1997). The
development of students’ graphical understanding of the derivative. Journal of Mathematical Behavior, 16, 399–431.
Ayers, T., Davis, G., Dubinsky, E., & Lewin, P. (1988). Computer experiences in learning composition of functions. Journal for Research in Mathematics
Education, 19, 246–259.
Boyer, C. B. (1959). The history of the calculus and its conceptual development.
New York: Dover. (Originally published in 1949.)
Boyer, C. B. (1969). The history of the calculus. In Historical topics for the
mathematics classroom, NCTM Thirty-first Yearbook (pp. 376–402). Washington,
DC: National Council of Teachers of Mathematics.
Breidenbach, D., Dubinsky, E., Hawks, J., & Nichols, D. (1992). Development
of the process conception of function.
Educational Studies in Mathematics, 23,
247–285.
Carlson, M. P. (1998). A cross-sectional investigation of the development of the
function concept.Research in Collegiate Mathematics Education, 3, 114–162.
Clark, J. M., Cordero, F., Cottrill, J., Czarnocha, B., DeVries, D. J., St. John,
D., Tolias, T., & Vidaković, D. (1997). Constructing a schema: The case of the chain
rule. Journal of Mathematical Behavior, 16, 345–364.
64
Clark, J. M., Dubinsky, E., Loch, S., McDonald, M., Merkovsky, R., & Weller,
K. (1999). Performance paper. Manuscript in preparation.
Confrey, J., & Smith, E. (1994). Exponential functions, rates of change, and
the multiplicative unit. Educational Studies in Mathematics, 26, 135–164.
Confrey, J., Smith, E., Piliero, S., & Rizzuti, J. (1991). The use of contextual
problems and multi-representational software to teach the concept of functions (Final
project report). Ithaca, NY: Cornell University.
Cottrill, J., Dubinsky, E., Nichols, D., Schwingendorf, K., Thomas, K., & Vidaković, D. (1996). Understanding the limit concept: Beginning with a coordinated
process schema. Journal of Mathematical Behavior, 15, 167–192.
Cuoco, A. (1994). Multiple representations for functions. In J. J. Kaput &
E. Dubinsky (Eds.), Research issues in undergraduate mathematics learning, MAA
Notes 33 (pp. 121–140). Washington, DC: Mathematical Association of America.
Dubinsky, E. (1991). Constructive aspects of reflective abstraction in advanced
mathematics. In L. P. Steffe (Ed.), Epistemological foundations of mathematical
experience (pp. 160–202). New York: Springer-Verlag.
Dubinsky, E., & Harel, G. (1992). The nature of the process conception of
function. In G. Harel & E. Dubinsky (Eds.), The concept of function: Aspects of
epistemology and pedagogy, MAA Notes 25 (pp. 85–106). Washington, DC: Mathematical Association of America.
Dubinsky, E., & Schwingendorf, K. (1991). Constructing calculus concepts:
Cooperation in a computer laboratory. In C. Leinbach (Ed.), The laboratory approach
to teaching calculus, MAA Notes 20 (pp. 47–70). Washington, DC: Mathematical
Association of America.
Edwards, C. H., Jr. (1979). The historical development of the calculus. New
York: Springer-Verlag.
Even, R. (1988). Pre-service teacher’s conceptions of the relationships between
functions and equations. In A. Borbàs (Ed.), Proceedings of the 12th International
Conference for the Psychology of Mathematics Education (pp. 304-311). Veszprém,
Hungary: Ferenc Genzwein, OOK.
65
Even, R. (1993). Subject-matter knowledge and pedagogical content knowledge:
Prospective secondary teachers and the function concept. Journal for Research in
Mathematics Education, 24, 94–116.
Ferrini-Mundy, J., & Graham, K. G. (1991). An overview of the calculus curriculum reform effort: Issues for learning, teaching, and curriculum development.
American Mathematical Monthly, 98, 627–635.
Ferrini-Mundy, J., & Graham, K. G. (1994). Research in calculus learning:
Understanding limits, derivatives, and integrals. In J. J. Kaput & E. Dubinsky (Eds.),
Research issues in undergraduate mathematics learning, MAA Notes 33 (pp. 31–45).
Washington, DC: Mathematical Association of America.
Frid, S. D. (1992). Calculus students’ sources of conviction. In B. Southwell,
B. Perry, & K. Owens (Eds.), Space—The first and final frontier. Conference proceedings of the annual conference of the Mathematics Education Research Group of
Australasia (15th , Richmond, New South Wales, Australia, July 4–8, 1992).
Goldenberg, E. P. (1988). Mathematics, metaphors, and human factors: Mathematical, technical, and pedagogical challenges in the educational use of graphical
representation of functions. Journal of Mathematical Behavior, 7, 135–173.
Harel, G., & Dubinsky, E. (Eds.). (1992). The concept of function: Aspects of
epistemology and pedagogy, MAA Notes 25. Washington, DC: Mathematical Association of America.
Heid, M. K. (1988). Resequencing skills and concepts in applied calculus using
the computer as a tool. Journal for Research in Mathematics Education, 19, 3–25.
Hitt, F. (1993). Internal and external representations related to the function
concept. In J. R. Becker & B. J. Pence (Eds.), Proceedings of the Fifteenth Annual
Meeting of the North American Chapter of the International Group for the Psychology
of Mathematics Education (Vol. 1, pp. 177–183). Pacific Grove, CA: San José State
University.
Kline, M. (1972). Mathematical thought: From ancient to modern times. New
York: Oxford University Press.
Krathwohl, D. R. (1993). Methods of educational and social science research:
An integrated approach. White Plains, NY: Longman.
66
Lauten, A. D., Graham, K., & Ferrini-Mundy, J. (1994). Student understanding of basic calculus concepts: Interaction with the graphics calculator. Journal of
Mathematical Behavior, 13, 225–237.
Leinhardt, G., Zaslavsky, O., & Stein, M. K. (1990). Functions, graphs, and
graphing: Tasks, learning and teaching. Review of Educational Research, 60, 1–64.
Mathews, J. H. (1989). Teaching differentiation with muMATH. Journal of
Computers in Mathematics and Science Teaching, 8 (3), 50–53.
Meel, D. E. (1998). Honors students’ calculus understandings: Comparing Calculus&Mathematica and traditional calculus students. Research in Collegiate Mathematics Education, 3, 163–215.
Orton, A. (1983). Students’ understanding of differentiation.
Studies in Mathematics, 14, 235–250.
Educational
Palmiter, J. R. (1991). Effects of computer algebra systems on concept and skill
acquisition in calculus. Journal for Research in Mathematics Education, 22, 151–156.
Piaget, J., & Garcia, R. (1989). Psychogenesis and the history of science (H. Feider, Trans.). New York: Columbia University Press. (Original work published 1983).
Rickey, V. F. (1987). Isaac Newton: Man, myth, and mathematics.
Mathematics Journal, 18, 362–389.
College
Schwarz, B., & Bruckheimer, M. (1990). The function concept with microcomputers: Multiple strategies in problem solving. School Science and Mathematics, 90,
597– 614.
Schwingendorf, K., Hawks, J., & Beineke, J. (1992). Horizontal and vertical
growth of the student’s conception of function. In G. Harel & E. Dubinsky (Eds.), The
concept of function: Aspects of epistemology and pedagogy, MAA Notes 25 (pp. 133–
149). Washington, DC: Mathematical Association of America.
Schwingendorf, K. E., Mathews, D. M. & Dubinsky, E. (1996). Calculus, concepts, computers and cooperative learning, C4 L: The Purdue calculus reform project.
Proceedings of the Seventh Annual International Conference on Technology in Collegiate Mathematics (pp. 402–406). Reading, MA: Addison-Wesley.
67
Schwingendorf, K. E., McCabe, G. P., & Kuhn, J. (1998). A longitudinal study
of the C 4 L calculus reform program: Comparisons of C 4 L and traditional students.
Manuscript submitted for publication.
Selden, J., Mason, A., & Selden, A. (1989). Can average calculus students solve
nonroutine problems? Journal of Mathematical Behavior, 8, 45–50.
Selden, J., Selden, A., & Mason, A. (1994). Even good calculus students can’t
solve nonroutine problems. In J. J. Kaput & E. Dubinsky (Eds.), Research issues in
undergraduate mathematics learning, MAA Notes 33 (pp. 19–26). Washington, DC:
Mathematical Association of America.
Sfard, A. (1992). Operational origins of mathematical objects and the quandary of reification—The case of function. In G. Harel & E. Dubinsky (Eds.), The
concept of function: Aspects of epistemology and pedagogy, MAA Notes 25 (pp. 59–
84). Washington, DC: Mathematical Association of America.
Tall, D. (1992). The transition to advanced mathematical thinking: Functions,
limits, infinity, and proof. In D. A. Grouws (Ed.), Handbook of research on mathematics teaching and learning (pp. 495–511). New York: Macmillan.
Thomas, K. (1995). The fundamental theorem of calculus: An investigation into
students’ constructions. Unpublished doctoral dissertation, Purdue University, West
Lafayette, IN.
Thompson, P. W. (1994a). Images of rate and operational understanding of the
fundamental theorem of calculus. Educational Studies in Mathematics, 26, 229–274.
Thompson, P. W. (1994b). Students, functions, and the undergraduate curriculum. Research in Collegiate Mathematics Education, 1, 21–44.
Thoo, J. B. (1995). Compositions and the chain rule using arrow diagrams.
PRIMUS, 5, 291–295.
Vidaković, D. (1996). Learning the concept of inverse function. Journal of
Computers in Mathematics and Science Teaching, 15, 295–318.
Vidaković, D. (1997). Learning the concept of inverse function in a group versus
individual environment. In E. Dubinsky, D. Mathews & B. E. Reynolds, (Eds.), Readings in cooperative learning for undergraduate mathematics, (pp. 175–195). Washington, DC: Mathematical Association of America.
68
Vinner, S., & Dreyfus, T. (1989). Images and definitions for the concept of
function. Journal for Research in Mathematics Education, 20, 356–366.
Wahlberg, M. (1998). The effects of writing assignments on second-semester
calculus students’ understanding of the limit concept. Manuscript submitted for publication.
White, P., & Mitchelmore, M. (1992). Abstract thinking in rates of change and
derivative. In B. Southwell, B. Perry, & K. Owens (Eds.), Space—The first and final
frontier. Conference proceedings of the annual conference of the Mathematics Education Research Group of Australasia (15th , Richmond, New South Wales, Australia,
July 4–8, 1992).
Wilson, M. R. (1994). One preservice secondary teacher’s understanding of
function: The impact of a course integrating mathematical content and pedagogy.
Journal for Research in Mathematics Education, 25, 346–370.
Wilson, M. R., & Krapfl, C. M. (1994). The impact of graphics calculators
on students’ understanding of function. Journal of Computers in Mathematics and
Science Teaching, 13, 252–264.
Zazkis, R., Dubinsky, E., & Dautermann, J. (1996). Coordinating visual and
analytic strategies: A study of students’ understanding of the group D4 . Journal for
Research in Mathematics Education, 27, 435–457.
APPENDICES
Appendix A: Questionnaire from Phase 1
Code:
Questionnaire for Calculus Topics Study
Jim Cottrill
Directions:
• These items are designed to explore your range of understanding on a number
of topics. Some will be very easy to do, some will be harder. Please do the best
you can.
• Please answer each question as completely as you can. Please do not leave any
answer blank. If you would like to write additional notes with an answer, feel
free to use the back of the page.
• Please do not write your name on any page of this questionnaire. The code
number at the top of this page will serve to keep your identity confidential.
69
70
1. Express the diameter of a circle as a function of its area and sketch its graph.
2. A student has marked the following as a non-function. State whether this
student is correct and why.
y
x
71
3. A student has marked the following as a non-function. State whether this
student is correct and why.
A correspondence that associates 1 with each positive number, −1
with each negative number, and 3 with zero.
4. Decide if it is possible to use one or more functions to describe this situation.
If yes, then describe your function(s) briefly. If no, then explain.


x = t3 + t
t is a real number.
y = 1 − 3t + 2t4 
72
time
time
distance
time
distance
distance
time
distance
distance
distance
5. Tim and Donna live 1 km from their school. Usually, they walk to school
together. Yesterday, they both left their houses 10 minutes before school started.
Tim started to walk, but Donna was afraid to be late and started to run. After
a while, Tim realized that although he tried to walk faster and faster, he had
to run if he did not want to be late, and started to run. At about the same
time, Donna became tired and had to walk instead of run. They both reached
school exactly on time. Which of the following graphs is Tim’s and which one
is Donna’s?
time
time
t
6. Given two functions v, w such that v(t) = 5t−6 and w(t) = +4, find (w◦v)(9).
3
Explain.
73
7. Given that (f ◦ g)(x) =
√
5
2x + 3.
(a) Find f and g that satisfy this condition.
(b) Are there more than one answer to part (a)? Explain.
8. Find k so that g(x + 1) = g(x) + k, given that g(x) = 3x + 5. Explain.
74
In each of the following two questions, f, g, h are functions whose domains and ranges
are the set of all real numbers, and such that h = f ◦ g.
9. If only the information in the following table were known, would it be possible
to find f (2)? If so, find it and if not explain why not.
x
−1
4
π
h(x) g(x)
1
π
0
−3
1
2
10. If only the information in the following table were known, would it be possible
to find g(4)? If so, find it and if not explain why not.
x
−1
2
4
h(x) f (x)
1
3
−2
−2
1
π
75
Compute the derivative of each of the following functions. Show all of your work.
11. f (x) = 11x5 − 6x3 + 8
12. g(x) =
3
x2
76
Compute the derivative of each of the following functions. Show all of your work.
13. h(x) = x2 − 3
5x − x3
14. y = 3ex − 4 tan(x)
77
Compute the derivative of each of the following functions. Show all of your work.
15. y = x2 sin(x)
16. F (x) = 1 − 4x3
2
78
Compute the derivative of each of the following functions. Show all of your work.
4
17. G(x) = 2 5x2 + 1
18. H(x) = sin 5x4
4
− 4x 5x2 + 1
79
Compute the derivative of each of the following functions. Show all of your work.
19. y = cos3 (t)
2
20. y = e−t
Appendix B: Phase 1 Data
The following tables summarize the distribution of codes for each item in the
questionnaire. In Table B.1, the data is presented in the form n(a, b) where n is the
number of responses receiving that code, a is the number in the C4 L group (C4L codes
11, 10, 01) and b the number of traditional students (C4L code 00). The maximum
value for a or b is then 17. The other tables give the codes by each student’s ID,
which was used to protect the identities during the analysis. The (*) next to an ID
indicates a questionnaire which was considered to be incomplete.
The C4L variable used in Tables B.2 and B.3 (and in Table 5.5) is a binary code for
the first two quarters of calculus. An “11” represents the student took the C4 L version
of calculus both terms. A “10” indicates taking the C4 L version first quarter and a
traditional course the second quarter, while a “01” indicates the opposite situation.
A “00” indicates being in a traditional course both terms.
80
81
Table B.1
Distribution of codes by item
5
4
3
2
1
0
1
4 (1,3)
8 (3,5)
1 (0,1)
10 (7,3)
4 (3,1)
7 (3,4)
2
12 (4,8)
3 (3,0)
8 (5,3)
3 (1,2)
8 (4,4)
3
7 (3,4)
3 (2,1)
15 (8,7)
4
2 (0,2)
5
27 (14,13)
6
20 (9,11)
3 (2,1)
7
17 (11,6)
8
3 (1,2)
6 (3,3)
9 (4,5)
9 (5,4)
6 (4,2)
4 (2,2)
2 (1,1)
1 (0,1)
2 (2,0)
1 (0,1)
7 (4,3)
1 (0,1)
3 (1,2)
7 (2,5)
1 (1,0)
4 (2,2)
2 (0,2)
15 (8,7)
3 (1,2)
3 (1,2)
3 (1,2)
8 (5,3)
2 (1,1)
9
8 (6,2)
3 (1,2)
4 (3,1)
13 (3,10)
3 (1,2)
3 (2,1)
10
2 (2,0)
6 (3,3)
5 (4,1)
11 (4,7)
3 (0,3)
7 (4,3)
11
34 (17,17)
12
23 (12,11)
5 (2,3)
3 (1,2)
2 (1,1)
1 (1,0)
13
22 (11,11)
6 (2,4)
3 (1,2)
1 (1,0)
1 (1,0)
14
21 (7,14)
8 (6,2)
4 (3,1)
15
25 (9,16)
2 (1,1)
2 (2,0)
16
28 (12,16)
4 (3,1)
17
12 (5,7)
14 (7,7)
18
23 (10,13)
19
20
8 (4,4)
1 (1,0)
1 (1,0)
3 (3,0)
2 (2,0)
1 (1,0)
1 (1,0)
3 (2,1)
2 (1,1)
2 (1,1)
1 (1,0)
6 (3,3)
2 (1,1)
1 (1,0)
1 (1,0)
1 (1,0)
14 (5,9)
4 (3,1)
1 (1,0)
13 (6,7)
21 (9,12)
5 (3,2)
3 (2,1)
1 (1,0)
2 (2,0)
3 (1,2)
1 (1,0)
82
Table B.2
Student codes in tuples
ID
C4L
[Items 1–5]
[Items 6–10]
[Items 11–15]
[Items 16–20]
Overall
A111
01
[4, 5, 5, 2, 5]
[5, 5, 1, 5, 4]
[5, 5, 5, 1, 2]
[5, 3, 5, 5, 5]
82
A112
01
[4, 3, 3, 1, 5]
[4, 5, 5, 5, 5]
[5, 5, 5, 5, 5]
[5, 5, 5, 5, 5]
90
A113
01
[1, 5, 4, 0, 5]
[5, 5, 5, 5, 4]
[5, 5, 5, 5, 5]
[4, 5, 5, 5, 5]
88
A115
01
[2, 1, 1, 1, 5]
[5, 5, 5, 5, 5]
[5, 5, 4, 5, 5]
[5, 4, 5, 4, 5]
82
A121
11
[2, 4, 0, 2, 5]
[5, 5, 5, 5, 4]
[5, 5, 5, 5, 5]
[5, 5, 4, 5, 5]
86
A122
01
[5, 1, 3, 3, 2]
[5, 5, 5, 5, 2]
[5, 5, 5, 5, 5]
[5, 5, 5, 0, 5]
81
A123*
11
[4, 3, 3, 3, 5]
[5, 5, 5, 3, 3]
[5, 5, 5, 3, 1]
[1, 0, 0, 0, 0]
59
A124
11
[1, 1, 0, 0, 5]
[3, 1, 1, 3, 3]
[5, 5, 5, 4, 4]
[5, 4, 5, 4, 3]
62
A125*
11
[0, 4, 0, 0, 5]
[1, 1, 4, 1, 0]
[5, 3, 2, 4, 2]
[5, 4, 5, 3, 3]
52
A126
10
[2, 5, 3, 3, 5]
[5, 5, 5, 0, 0]
[5, 4, 1, 3, 3]
[5, 4, 2, 2, 4]
66
A127
00
[5, 5, 3, 5, 5]
[5, 5, 5, 5, 4]
[5, 5, 5, 5, 5]
[5, 5, 4, 5, 5]
96
A128
00
[5, 5, 5, 3, 5]
[5, 3, 1, 2, 2]
[5, 4, 5, 5, 5]
[5, 5, 5, 5, 5]
85
A129
10
[2, 4, 4, 2, 5]
[5, 3, 1, 3, 3]
[5, 5, 5, 4, 5]
[5, 5, 5, 2, 5]
78
A130*
00
[0, 5, 5, 0, 0]
[0, 1, 0, 2, 2]
[5, 5, 4, 5, 5]
[5, 1, 5, 2, 4]
56
A131
00
[0, 3, 0, 1, 1]
[5, 5, 1, 2, 2]
[5, 5, 5, 5, 5]
[5, 5, 5, 4, 5]
69
A132
10
[2, 3, 3, 1, 5]
[1, 3, 5, 0, 0]
[5, 5, 5, 5, 5]
[5, 4, 5, 2, 5]
69
A133
00
[4, 1, 3, 1, 5]
[1, 3, 4, 2, 2]
[5, 5, 4, 3, 5]
[5, 3, 5, 5, 3]
69
A134
00
[0, 1, 0, 1, 2]
[1, 3, 5, 1, 1]
[5, 3, 5, 4, 5]
[5, 5, 5, 5, 5]
62
A135
00
[4, 2, 1, 3, 5]
[2, 1, 3, 2, 0]
[5, 5, 3, 5, 5]
[5, 4, 5, 2, 1]
63
A136
00
[1, 3, 5, 0, 5]
[1, 0, 2, 4, 0]
[5, 5, 5, 5, 5]
[5, 4, 4, 5, 4]
68
A137
00
[4, 5, 3, 2, 5]
[5, 0, 4, 2, 1]
[5, 4, 5, 5, 5]
[5, 2, 5, 2, 5]
74
A138
00
[4, 1, 1, 3, 5]
[4, 3, 2, 1, 1]
[5, 4, 5, 5, 5]
[5, 4, 5, 2, 5]
70
A139
00
[2, 5, 3, 2, 5]
[5, 5, 5, 2, 2]
[5, 5, 5, 4, 5]
[5, 4, 5, 5, 5]
84
A140
01
[0, 1, 3, 1, 5]
[1, 2, 1, 0, 0]
[5, 5, 4, 4, 2]
[5, 2, 3, 5, 4]
53
A141
01
[2, 3, 3, 1, 2]
[5, 5, 0, 4, 3]
[5, 5, 5, 4, 5]
[5, 4, 4, 4, 1]
70
A142
00
[5, 5, 5, 5, 5]
[5, 5, 5, 4, 3]
[5, 5, 5, 5, 4]
[5, 4, 4, 2, 5]
91
A143
11
[0, 5, 5, 3, 5]
[4, 5, 3, 2, 2]
[5, 1, 5, 5, 5]
[4, 4, 5, 2, 5]
75
A144
00
[2, 5, 3, 2, 5]
[5, 5, 3, 3, 4]
[5, 5, 4, 5, 5]
[4, 5, 5, 2, 5]
82
A145
11
[2, 3, 3, 0, 1]
[3, 5, 2, 2, 2]
[5, 2, 0, 4, 1]
[2, 1, 1, 2, 4]
45
A146
00
[2, 3, 3, 1, 5]
[5, 4, 5, 2, 2]
[5, 3, 3, 5, 5]
[5, 5, 5, 5, 5]
78
A147
00
[3, 1, 0, 2, 5]
[5, 5, 5, 2, 0]
[5, 2, 5, 5, 5]
[5, 5, 3, 2, 5]
70
A155
00
[4, 5, 4, 3, 5]
[5, 3, 5, 5, 4]
[5, 5, 5, 5, 5]
[5, 4, 5, 5, 5]
92
A156
00
[0, 2, 3, 2, 2]
[5, 4, 1, 2, 2]
[5, 5, 4, 5, 5]
[5, 4, 5, 5, 1]
67
A157
11
[1, 2, 5, 2, 5]
[1, 4, 1, 2, 2]
[5, 4, 3, 3, 3]
[4, 3, 4, 2, 2]
58
83
Table B.3
Combined code data
ID
C4L
PGPA
Function
Composition
Differentiation
Chain
Rule
Overall
A111
01
2.70
21
20
18
23
82
A112
01
2.30
16
24
25
25
90
A113
01
3.40
15
24
25
24
88
A115
01
3.10
10
25
24
23
82
A121
11
2.40
13
24
25
24
86
A122
01
3.00
14
22
25
20
81
A123*
11
3.00
18
21
19
1
59
A124
11
2.40
7
11
23
21
62
A125*
11
2.80
9
7
16
20
52
A126
10
2.60
18
15
16
17
66
A127
00
3.90
23
24
25
24
96
A128
00
23
13
24
25
85
A129
10
17
15
24
22
78
A130*
00
10
5
24
17
56
A131
00
2.90
5
15
25
24
69
A132
10
3.10
14
9
25
21
69
A133
00
1.20
14
12
22
21
69
A134
00
4
11
22
25
62
A135
00
15
8
23
17
63
A136
00
14
7
25
22
68
A137
00
3.00
19
12
24
19
74
A138
00
2.50
14
11
24
21
70
A139
00
2.80
17
19
24
24
84
A140
01
2.80
10
4
20
19
53
A141
01
3.10
11
17
24
18
70
A142
00
2.30
25
22
24
20
91
A143
11
3.80
18
16
21
20
75
A144
00
3.20
17
20
24
21
82
A145
11
3.70
9
14
12
10
45
A146
00
1.80
14
18
21
25
78
A147
00
3.10
11
17
22
20
70
A155
00
2.30
21
22
25
24
92
A156
00
2.40
9
14
24
20
67
A157
11
1.80
15
10
18
15
58
2.30
Appendix C: Interview Guide for Phase 2
C.1
Overview
The interview will follow a guide designed to elicit the student’s understanding of
the rule, based on tasks from the previous instrument. New tasks will be presented to
the student to probe the limits of understanding. The students will be asked to reflect
on the questionnaire tasks, possibly by asking them for definitions of function and
derivative, domain-range issues in composition, the statement of the chain rule, and
a discussion of any relationships between the 20 problems that the student perceives.
The guide will be written following the piloting of the written part. This will also
then be piloted with the pilot subjects.
The interview will also address Leibniz’ rule as a way of gauging the robustness of
the student’s concept. A situation with no simple anti-derivative
will be presented,
Z
sin x
2
such as “compute F 0 if F is the function given by F (x) =
et dt.” If the need
0
to compute an anti-derivative is too great, a situation where the integrand has an
elementary anti-derivative will be presented to determine if the student can derive
Leibniz’ rule from the result. The data from these situations will be analyzed and
compared to that of Thomas (1995).
The interviews will be structured with four basic components, although each component will be modified to fit the student.
C.2
Review of function and composition
The student will be presented with a copy of her or his questionnaire from Phase
I and will be asked to discuss two function situations; one which he or she dealt with
successfully and one less successfully. The same will be done for the composition
questions. The goal will be to get a definition of function from the student. The
student will be allowed to ask about any of the first ten problems.
1. Here is a copy of your work from the questionnaire for question ??. Look it
over and describe how you solved the problem.
2. Would you change anything in that answer?
3. Repeat (1) and (2) for another function and two composition problems.
84
85
4. Questions 2–3 dealt with whether a situation can be a function. What is your
definition of function? Omit if the definition has been established.
5. Do you have any questions about the first 10 questions?
C.3
Classifying the 10 derivative problems
The student will be asked to group in any way that makes sense the ten derivative
problems. It is expected that he or she will use the elementary rules as a guide. This
may help determine if the student sees all five of the chain rule problems as a single
class.
6. Here is a copy of the last 10 problems on one sheet. The directions for all of
these were to compute the derivative of each of the following functions and to
show all of your work. I would like you to group these 10 problems in whatever
manner makes sense to you. You can have as many of them in a group as you
wish and you can form as many groups as you want.
7. Describe how each of the groups were determined.
8. Is there any other way that you considered grouping these? If so, describe it as
well.
9. If no answer given has disjoint sets: What if I required that your groups are
disjoint? Can you still group them?
C.4
Establishing the notion of chain rule
The student will be asked to state the chain rule. The questionnaire problems
will be used to get as general a statement as the student can agree with. The student
will be asked to discuss the proof (or need for proof) of the chain rule as well as any
intuitive notions of the rule that they have.
10. Write down the chain rule using whatever words or symbols that you like.
11. The last five questions were intended to be solved using the chain rule. Compare
each of those situations with what you have written. Does your statement deal
with each or can you modify it so that it does?
12. What is the key idea in the chain rule?
13. Do you recall how you learned the chain rule for yourself? Did you use mnemonic
devices or any sayings or tricks to keep the rule straight?
86
14. In mathematics, theorems need to be proved using more elementary definitions
and theorems. Does the chain rule need a proof? If so, what can you say about
the proof; if not, why not?
C.5
Extending the notion
The student will be asked to solve the Leibniz Rule problem from the RUMEC
study. A simplified version of the problem will be offered if needed to find a solution.
Finally, a set of word problems from a first semester calculus course will be presented.
The set will include a related rate problem and an implicitly defined function problem.
The task will be to identify any situations that require the use of the chain rule in
the solution, discussing in detail how the rule might be used. The student will not
be expected to solve the problems; however if a solution is required to complete the
task, one will be provided to the student.
15. Read each of the following differentiation situations carefully. Without actually
solving the problem, determine if the solution will involve the use of the chain
rule in some way. Describe how the chain rule is used.
(a) Given that the following relation defines y as a function of x,
√
√
x y + y x = 42
find its derivative.
(b) A ladder L feet long is leaning against a wall, but sliding away from the
wall at the rate of 4 ft/sec. Find a formula for the rate at which the top
of the ladder is moving down the wall.
(c) The volume V and the surface area of a sphere are related by the equation
36πV 2 = S 3 . Find dS/dV , the rate of change of the surface area with
√
respect to the volume when the volume is π 6 ft3 .
(d) The following formula give a reasonable estimate of the Heat Capacity C
of a certain element at very low Fahrenheit temperatures F :
C = 0.171F 3 + 236F 2 + 10, 800F + 16, 600, 000
Suppose that the temperature F is found at time t according to the equation F = −42t2 . Find an expression for rate of change of the Heat Capacity
with respect to time.
87
Z
sin x
2
16. Compute F if F is the function given by F (x) =
et dt. Freely offer the
0
fact that there is no Zelementary anti-derivative of the
integrand. One simpler
Z
x 2
sin x
situation is G(x) =
et dt. Another is H(x) =
4t2 dt. Have all three
0
0
on separate pages.
0
17. Finally, do you have any questions?
VITA
88
VITA
Jim Cottrill was born in Akron, Ohio on 1 February 1963, the son of Wayne and
Paulette. He was married to Julie Ann Vetter, daughter of Ron and Judy, on 15 June
1991 in Chillicothe, Ohio. They have a daughter Cecelia, born in 1996, and a son
Stephen, born in 1998.
Jim attended the University of Akron where he earned a Bachelor of Arts degree
in Secondary Education, in the fields of Mathematics and Chemistry, in 1986. He
taught mathematics, computer science and chemistry for the Akron Public Schools.
In 1992, he was accepted into the doctoral program at Purdue University. After
teaching at Georgia State University, Jim received his Doctor of Philosophy degree
in Mathematics Education from Purdue University in 1999.
He has co-authored three articles in the Journal of Mathematical Behavior : “Understanding the Limit Concept: Beginning with a Coordinated Process Schema”
(1996); “The Development of Students’ Graphical Understanding of the Derivative”
(1997); and “Constructing a Schema: The Case of the Chain Rule” (1997). His research interests are in undergraduate mathematics education and in the training of
pre-service teachers.
Jim has accepted a position as Assistant Professor in the Department of Mathematics at Illinois State University in Normal, Illinois beginning August 1999.
Download