Neither Div nor Curl nor Both Constitute the Derivative Charles S. Barnett Adjunct Mathematics Instructor Las Positas College Presented at the 41st annual fall conference California Mathematics Council, Community Colleges Monterey, California December 13 and 14, 2013 1 2 Abstract The Div and Curl operators are derivative-like but are only aspects of the derivative of a map from 3-space to 3-space. The derivative of such a given map is a linear map from 3-space to 3-space. By employing the Frechet difference quotient at the outset, we can construct that derivative without use of the heavy machinery of advanced analysis. The process, results, and properties parallel those of the one-dimensional case. Attribution 1. H. K. Nickerson, D. C. Spencer, and N. E. Steenrod, Advanced Calculus, Dover Publications, 2011. Unabridged republication of the work originally published in 1959 by the D. Van Nostrand Company. 2. Spivak, Michael, Calculus on Manifolds: A Modern Approach to Classical Theorems of Advanced Calculus,” W. A. Benjamin, Inc., New York, Amsterdam, 1965. 3 3. H. M. Schey, Div, Grad, Curl, and All That: An Informal Text on Vector Calculus, W.W. Norton & Company, Inc., New York, 1973. 4 Outline ο· Alternate definition of the derivative of a real-valued function of a real variable (the π: β → β case) ο· The Frechet derivative of a scalar-valued function of a vector (the π: β3 → β case) ο· The Frechet derivative of a vector-valued function of a vector (the πΉ: β3 → β3 case) ο· The divergence and curl of a vector field ο· An application: the divergence and curl of a steady flow ο· Generalizations and the chain rule 5 Scalar-valued Functions of a Scalar (1) Situation: β represents the real numbers. π: β → β . Goal: Define the derivative of π at π₯, π ′ (π₯ ) . For (π₯, π¦) ∈ β2 consider the following limit: 1 [π (π₯ + βπ¦) − π (π₯ )] ββΆ0 β π» ′ (π₯, π¦) = lim If this limit exists, then π» ′ (π₯, π¦) is called “the derivative of π at π₯ with respect to π¦.” 6 Scalar-valued Functions of a Scalar (2) Claim: π» ′ (π₯, π¦) is linear in the variable π¦. Pick and fix πΆ in β and consider π» ′ (π₯, πΆπ¦). Show first that π» ′ (π₯, πΆπ¦) = πΆπ» ′ (π₯, π¦). Well, 1 ) π» π₯, πΆπ¦ = lim [π (π₯ + βππ¦) − π (π₯ )] β→0 β ′( π [π(π₯ + βππ¦) − π (π₯ )] β→0 βπ = lim 7 1 = π lim [π(π₯ + π‘π¦) − π (π₯ )] π‘→0 π‘ (π‘ = βπ) = ππ» ′ (π₯, π¦) . Therefore, π» ′ (π₯, ππ¦) = ππ» ′ (π₯, π¦) Scalar-valued Functions of a Scalar (3) Now let π¦ and π§ denote two non-zero elements of β and consider π» ′ (π₯, π¦ + π§). 1 π» ′ (π₯, π¦ + π§) = lim [π(π₯ + βπ¦ + βπ§) − π (π₯ )] β→0 β 1 = lim [π(π₯ + βπ¦ + βππ¦) − π (π₯ )] β→0 β for some π ∈ β 1 = lim [π(π₯ + β(1 + π)π¦) − π(π₯ )] β→0 β 8 = (1 + π)π» ′ (π₯, π¦) = π» ′ (π₯, π¦) + ππ» ′ (π₯, π¦) = π» ′ (π₯, π¦) + π» ′ (π₯, ππ¦) = π» ′ (π₯, π¦) + π» ′ (π₯, π§). So, π» ′ (π₯, π¦) is linear in the second variable. Scalar-valued Functions of a Scalar (4) Definition: The derivative of π at π₯, denoted by π ′ (π₯ ), is the linear map from β → β defined by (π ′ (π₯ ))(π¦) = π» ′ (π₯, π¦). Examples (old chestnuts in new shells): 9 1. π is a constant function, π (π₯ ) = π, where π is a constant. Then (π ′ (π₯ ))(π¦) = 0, i.e. π ′ (π₯ ), sends all π¦ to zero for every π₯ ∈ β. Argument: π(π₯ + βπ¦) − π (π₯ ) = π − π = 0. 2. π is a linear map, i.e., π (π₯ ) = ππ₯, where π ∈ β is an arbitrary but fixed number. Then π ′ (π₯ ) is a constant map, i.e., (π ′ (π₯ ))(π¦) = ππ¦ for every π₯ ∈ β, π¦ ∈ β. Argument: π(π₯ + βπ¦) − π (π₯ ) = π(π₯ + βπ¦) − ππ₯ = βππ¦. Scalar-valued Functions of a Scalar (5) Examples (old chestnuts in new shells), cont’d: 3. π (π₯ ) = π₯ 2 implies that π ′ (π₯ ) = (2π₯ ), i.e., π ′ (π₯ )(π¦) = (2π₯ )π¦ all (π₯, π¦) ∈ β2 10 Argument: π(π₯ + βπ¦) − π (π₯ ) = (π₯ + βπ¦)2 − π₯ 2 = π₯ 2 + 2βπ₯π¦ + β2 π¦ 2 − π₯ 2 = 2βπ₯π¦ + β2 π¦ 2 Now, divide by β and pass to the limit to see that π» ′ (π₯, π¦) = (2π₯ )π¦ βΉ π ′ (π₯ ) = 2π₯. Scalar-valued Functions of a Scalar (6) Best affine approximation to π Pick and fix π₯0 ∈ β. Then the mean value theorem says that 11 π (π₯ ) = π(π₯0 ) + π ′ (π₯0 )(π₯ − π₯0 ) + πππππ . You will see direct parallels to this equation as we work our way up to functions from β3 βΆ β3 . Scalar-valued Functions of a Vector (1) Situation: π: β3 → β 12 Goal: Define the derivative of π at π, π ′ (π). For (π, π) ∈ β3 × β3 , consider the following limit that defines function π» ′ : β3 × β3 → β. ′( 1 [π(π + βπ) − π(π)] ββΆ0 β π» π, π) = lim If this limit exists, then π» ′ (π, π) is called “the derivative of π at π with respect to π”. Scalar-valued Functions of a Vector (2) 13 Claim: π» ′ (π, π) is linear in the variable π. First, show that π» ′ (π, ππ) = ππ» ′ (π, π) for π ∈ β. Pick and fix π in β. If π = 0, the difference quotient is 0. If π ≠ 0, then 1 π» π, ππ) = lim [π(π + βππ) − π (π)] β→0 β π [π(π + βππ) − π (π)] = lim β→0 βπ ′( 1 = π lim [π(π + π‘π) − π (π)] π‘→0 π‘ = ππ» ′ (π, π) 14 Scalar-valued Functions of a Vector (3) Now, consider π» ′ (π, π + π). We must show that π» ′ (π, π + π) = π» ′ (π, π) + π» ′ (π, π), where 1 π» ′ (π, π + π) = lim β [π(π + βπ + βπ) − π(π)]. β→0 There exists a version of the mean value theorem that we will need: If π» ′ (π + π‘π, π) exists for 0 ≤ π‘ ≤ 1, then there exists π, 0 < π < 1 such that π(π + π) − π (π) = π» ′ (π + ππ, π) (∗) This version can be established fairly easily from the standard (β → β) version. In the proof of linearity for the (β → β) case I used the fact that for any two nonzero real numbers π₯, π¦ there exists a third, π, say, such that π₯ + π¦ = π₯ + ππ₯. The above (β3 → β) version (∗) of the mean value theorem is used similarly to complete the proof of linearity for the present case. The details, somewhat involved, appear on page 5 of the handout. 15 So, π» ′ (π, π) is linear in π for every π. Scalar-valued Functions of a Vector (4) Definition: The derivative π ′ (π) of π at π is the linear map π ′ (π): β3 → β defined by (π ′ (π))(π) = π» ′ (π, π). 16 Scalar-valued Functions of a Vector (5) The Gradient Let π ∈ β3 Expand π in the standard basis. Invoke the fact that π» ′ (π, π) is linear in π and expand π» ′ (π, π) via π’s components π¦1 , π¦2 , π¦3 . Arrive at 3 ′( π» π, π) = ∑ π¦π π=1 ππ . ππ₯π A theorem from linear algebra says that if π: β3 → β is linear, then there exists a unique vector π΅ such that π(π) = π΅ β π. 17 So π» ′ (π, π) = π΅ β π, which implies that π΅ = ∇π (π). Hence, π» ′ (π, π) = ∇π(π ) β π. Scalar-valued Functions of a Vector (6) Observe that knowing ∇π (π) is equivalent to knowing π ′ (π), but ∇π (π) ∈ β3 and π ′ (π): β3 → β. Best affine approximation to π Pick and fix π0 ∈ β3 . Then π(π) = π(π0 ) + (π ′ (π0 ))(π − π0 ) + πππππ. Or, in terms of the gradient, π(π) = π(π0 ) + ∇π(π0 ) β (π − π0 ) + πππππ. 18 Vector-valued Functions of a Vector (1) Situation: πΉ: β3 → β3 For (π, π) ∈ β3 × β3 , consider the following limit that defines β± ′ (π, π): 1 β± ′ (π, π) = lim [πΉ (π + βπ) − πΉ (π)] β→0 β if the limit exists. Examples: 1. πΉ: β3 → β3 is a constant function. πΉ (π) = π΅ ∈ β3 for every π. π΅ is a fixed vector. 19 Then πΉ (π + βπ) − πΉ (π) = π΅ − π΅ = 0 ∈ β3 . So, β± ′ (π, π) = 0 ∈ β3 . 20 Vector-valued Functions of a Vector (2) 2. πΉ: β3 → β3 is a linear map. πΉ (π) = π(π), where π is linear. Then πΉ (π + βπ) − πΉ (π) = π(π + βπ) − π(π). = π(π) + βπ(π) − π(π) = βπ(π). Now, divide by β and pass to the limit to see that β± ′ (π, π) = π(π) for all (π, π) ∈ β3 × β3 21 Vector-valued Functions of a Vector (3) Claim: β± ′ (π, π) is linear in the variable π. Let {π΄1 , π΄2 , π΄3 } represent the usual basis in β3 . Then πΉ (π) = ∑3π=1 ππ (π)π΄π , where the ππ represent the component functions. Let π»π′ (π, π) represent the derivative with respect to π of the component functions. Earlier I sketched an argument that showed that the π»π′ are linear in the variable π. It can be shown that β± ′ (π, π) inherits linearity in π from the π»π′ . So, 1 β± π, π) = lim [πΉ (π + βπ) − πΉ (π)] β→0 β ′( is linear in the variable π. Definition: The derivative πΉ ′ (π) of πΉ at π is the linear transformation πΉ ′ (π): β3 → β3 defined by 22 (πΉ ′ (π))(π) = β± ′ (π, π) . Vector-valued Functions of a Vector (4) Examples (restatement in terms of πΉ ′ rather than β± ′ ): 1. The derivative of a constant function is the zero vector. So πΉ ′ (π) transforms all of β3 into the zero vector. 2. The derivative of a linear function is a constant. πΉ (π) = π(π), π linear, implies that πΉ ′ (π) = π for every π. Now we can get back to more familiar, classical ground. πΉ ′ (π) is a linear map. Given a basis, a linear map can be represented by a matrix relative to that basis. So, as above, let {π΄1 , π΄2 , π΄3 } represent the standard basis. Then π = ∑3π=1 ππ π΄π , π = ∑3π=1 π¦π π΄π , πΉ (π) = ∑3π=1 ππ (π)π΄π , where the scalar-valued ππ are the component functions of πΉ. 23 Then ππ (πΉ ′ (π))(π) = ∑3π=1 ∑3π=1 π¦π ππ₯ π π΄π . (∗) π Vector-valued Functions of a Vector (5) I have here invoked both the fact (shown earlier) that, for functions π: β3 → β, ππ ′ π» ′ (π, π) = ∑3π=1 π¦π , and the linearity of πΉ (π ) to arrive at Eqn.(∗). ππ₯π Observe from Eqn. (∗) that, relative to the standard basis, (πΉ ′ (π))(π) is represented by ππ1 ππ₯1 ππ2 [πΉ ′ (π)][π] = ππ₯1 ππ3 [ππ₯1 ππ1 ππ₯2 ππ2 ππ₯2 ππ3 ππ₯2 ππ1 ππ₯3 π¦1 ππ2 [π¦2 ] , ππ₯3 π¦ 3 ππ3 ππ₯3 ] where here and below I use [β] to indicate the matrix of β . Best affine approximation to F 24 As in the previous (β → β) and (β3 → β) cases, we have a best approximation via Taylor’s formula: πΉ (π) = πΉ (π0 ) + (πΉ ′ (π0 ))(π − π0 ) + πππππ. Divergence and Curl of a Vector Field (1) Given πΉ: β3 → β3 , then πΉ ′ (π) is a linear map from β3 → β3 . Linear maps may be broken into the sum of their symmetric and skew-symmetric parts. Let πΉ ′ (π)∗ represent the adjoint of πΉ ′ (π). Then 1 ′ ′( ) πΉ+ π = [πΉ (π) + πΉ ′ (π)∗ ] = π π¦ππππ‘πππ ππππ‘ 2 and 1 πΉ−′ (π) = 2 [πΉ ′ (π) − πΉ ′ (π )∗ ] = skew-symmetric part. Observe that πΉ ′ (π) = sum of the two parts. 25 Divergence and Curl of a Vector Field (2) To simplify typography, until further notice let πππ ≡ πππ . ππ₯π Then the matrix representation of πΉ ′ (π) is π11 [πΉ ′ (π)] = [π21 π31 π12 π22 π32 π13 π23 ] , π33 and the matrix of πΉ ′ (π)∗ is 26 π11 [π12 π13 π21 π22 π23 π31 π32 ] . π33 Divergence and Curl of a Vector Field (3) Then a bit of matrix algebra leads to the matrix representations of 2πΉ+′ (π) and 2πΉ−′ (π) They are 2π11 [2πΉ+′ (π)] = [π12 + π21 π13 + π31 π12 + π21 2π22 π23 + π32 π13 + π31 π23 + π32 ] 2π33 27 0 [2πΉ−′ (π)] = [π21 − π12 π31 − π13 π12 − π21 0 π32 − π23 π13 − π31 π23 − π32 ] . 0 Divergence and Curl of a Vector Field (4) Definition: The Divergence of πΉ is a map π·ππ£ πΉ: β3 → β, defined by π·ππ£ πΉ (π) = πππππ πΉ ′ (π). The Trace of πΉ ′ (π) is equal to the sum of the roots of the characteristic polynomial of πΉ ′ (π), a property of the linear map that is invariant to matrix representation. The trace of any linear map from β3 → β3 is equal to the sum of 28 the diagonal elements of any matrix representation. Hence, the familiar version of the Divergence: π·ππ£ πΉ (π) = π11 + π22 + π33 = ππ1 ππ2 ππ3 + + . ππ₯1 ππ₯2 ππ₯3 Observe also that π·ππ£ πΉ (π) = πππππ πΉ+′ (π) . Divergence and Curl of a Vector Field (5) Definition: The Curl of πΉ at π is defined by 2πΉ−′ (π)(π) = [πΆπ’ππ πΉ (π)] × π (∗) 29 How do we know that a vector that makes the right side of Eqn. (∗) equal to the left-hand side exists? Well, 2πΉ−′ (π) is a skew-symmetric transformation and item A5 of the Appendix of the handout asserts that such a vector exists for such maps; we name it πΆπ’ππ πΉ (π). Also, according to A5, if the matrix of a skewsymmetric map looks like 0 [−π −π π 0 −π π π] , 0 (∗∗) then the vector at issue (πΆπ’ππ πΉ (π) here) is −ππ + ππ − ππ . (∗∗∗) Divergence and Curl of a Vector Field (6) Compare the matrix representation of 2πΉ−′ (π) and you see that πΆπ’ππ πΉ (π) = (π32 − π23 )π + (π13 − π31 )π + (π21 − π12 )π 30 or, in the usual notation, ππ3 ππ2 ππ1 ππ3 ππ2 ππ1 πΆπ’ππ πΉ (π) = ( − )π + ( − )π + ( − )π ππ₯2 ππ₯3 ππ₯3 ππ₯1 ππ₯1 ππ₯2 Divergence and Curl of a Steady Flow: an application (1) Consider a vector field πΉ: β3 → β3 and the flow induced by the differential equation 31 ππ = πΉ (π) . ππ‘ (β) Note that time does not appear on the right hand side of Eqn. (β), hence “steady flow.” Pick and fix π0 ∈ β3 . Expand πΉ around π0 via Taylor’s formula; substitute the result into Eqn. (β) to yield π (π − π0 ) = πΉ (π0 ) + πΉ ′ (π0 )(π − π0 ) + πππππ . ππ‘ (ββ) Split πΉ ′ (π0 ) into the sum of its symmetric and skew-symmetric components πΉ ′ (π0 ) = πΉ+′ (π0 ) + πΉ−′ (π0 ) (βββ) and substitute into Eqn. (ββ) to obtain π (π − π0 ) = πΉ (π0 ) + πΉ+′ (π0 )(π − π0 ) + πΉ−′ (π0 )(π − π0 ) + πππππ ππ‘ (ββββ) Divergence and Curl of a Steady Flow: an application (2) 32 Now, consider the motion of the points within a small sphere centered at π0 at time zero, say. How has that spherical test ball changed in a small time π‘? The terms on the right-hand side yield an approximate answer. Consider first the motion induced by π (π − π0 ) = πΉ−′ (π0 )(π − π0 ) ππ‘ 1 = πΆπ’ππ πΉ (π0 ) × (π − π0 ) 2 (β) Equation (β) says that the test ball rotates about an axis through π0 parallel to 1 πΆπ’πππΉ (π0 ) with angular velocity 2 |πΆπ’πππΉ (π0 )|. That motion is volumepreserving. 33 Divergence and Curl of a Steady Flow: an application (3) Consider next the motion described by π (π − π0 ) = πΉ+′ (π0 )(π − π0 ) . ππ‘ (ββ) πΉ+′ (π0 ) is a symmetric map centered at π0 . Let π1 , π2 , π3 represent its eigenvalues and π΅1 , π΅2 , π΅3 denote the corresponding orthonormal set of eigenvectors. Then the corresponding eigenvalues of the flow described by Eqn. (ββ) are π π1 π‘ , π π2 π‘ , π π3 π‘ . That flow distorts the spherical ball slightly into an ellipsoid. The volume of the ball changes by a factor that is the determinant of the transformation. That determinant is ππ₯π(π‘ π‘ππππ πΉ+′ π0 ) and its time rate of change when π‘ = 0 is π‘ππππ πΉ+′ (π0 ) = π‘ππππ πΉ ′ (π0 ) = π·ππ£ πΉ (π0 ). Finally, consider the motion induced by π (π − π0 ) = πΉ (π0 ) . ππ‘ (βββ) Equation (βββ) says that the ellipsoid experiences a small translation with velocity vector πΉ (ππ ). 34 35 The General π: βπ → βπ Case and the Chain Rule (1) The chain rule is so important in differential calculus that its multidimensional version deserves a brief discussion. Before doing so, I point out that our restriction to the 1 ≤ π ≤ 3 case above was not necessary; I invoked the restriction because of the emphasis on the undergraduate calculus sequence. Let π and π represent natural numbers and consider πΉ: βπ → βπ . If πΉ is differentiable at π, then its derivative can be defined via the Frechet difference quotient just as we did in the earlier cases. The process is identical to the one used for the 1 ≤ π, π ≤ 3 case. The derivative is a linear transformation as before, and all goes through except the discussion of the Curl, which is unique to the three-dimensional case. With this generalization all cases are subsumed, again except for the Curl, by this general case. 36 The General π: βπ → βπ Case and the Chain Rule (2) Consider now the chain rule in the general case. Let π, π, and π represent natural numbers. Situation: πΉ: βπ → βπ is differentiable at π, and πΊ: βπ → βπ is differentiable at πΉ (π), then πΊ β πΉ is differentiable at π and (πΊ β πΉ )′ (π) = πΊ ′ (πΉ (π)) β πΉ ′ (π), (∗) where β represents function composition. Observe that in Eqn. (∗) πΊ ′ (πΉ (π)) is a linear map and πΉ ′ (π) is a linear map; hence the composition of the two maps is a linear map, and all is well. Use brackets to denote matrix representation, and the matrix version of Eqn. (∗) becomes [(πΊ β πΉ )′ (π)] = [πΊ ′ (πΉ (π))][πΉ ′ (π)]. (∗∗) 37 The General π: βπ → βπ Case and the Chain Rule (3) Recall the one-variable chain rule. π: β → β, π: β → β (π β π)′ (π₯ ) = π′ (π(π₯ ))π ′ (π₯ ) (∗∗∗) where the juxtaposition on the right-hand side of Eqn (∗∗∗) denotes multiplication, but could just as well have been written (π β π)′(π₯ ) = π′ (π(π₯ )) β π ′ (π₯ ) (∗∗∗′ ) because, in the one-dimensional case, composition of linear functions is multiplication. Compare Eqn. (∗) and Eqn. (∗∗∗′ ) and you see that Eqn. (∗∗∗′ ) is just a special case of Eqn. (∗). 38 Nerd’s Bumper Sticker And God said ∇×E+ ππ΅ =0 ππ‘ ∇×H− ππ· =π½ ππ‘ ∇βB=0 ∇βD=ρ and there was light. 39 Closure ο· The derivative is a linear map. ο· The Divergence and Curl are aspects of that map that account for about 4/9 of its properties, an especially important fraction in applications. ________________________________________________________________________ “Some people can stay longer in an hour than others can in a week.” 40 William Dean Howells, American novelist, 1837-1920 41