Uploaded by Alexa Nebula

Mathematics for physicists

advertisement
is written in this grand book, the universe, which stands
open to our gaze. But the book cannot be understood unless one
leams to comprehend the language and read the letters in which it
composed. It is written in the language of mathematics, and its
are triangles, circles and other geometric figures without which
is humanly impossible to understand a word of it; without these one
wanders nbout in a dark labyrinth.
GeI-neo Gnr-rnr
-
words of Galileo are just as true today as when he wrote them in the lTth century.
serious student of physics must first learn the language of mathematics. This is a more
task for today's student than it was for Galileo. The characters include integrals,
goups, tensors, and other concepts unheard of in Galileo's time. A physics major
also faced with a list of physics courses that may include not only the mechanics that
studied, but also electromagnetic theory, quantum mechanics, and relativity. With
push to include more physics in the undergraduate major, there is even less time to study
We usually attempt to remedy this by putting a lot of mathematics instruction
physics
the
classes, introducing the mathematical tools as they are needed.
graduate
A beginning
student usually needs to gather together tools that have been accrued
and develop a deeper understanding of their use. She or he also needs to review
reinforce material learned in undergraduate mathematics classes and become more
in using mathematical tools. That is the purpose of this book. A student who has
the material in this book should be prepared for graduate classes in mechanics,
mechanics, or electricity and magnetism.
The text is an outgrowth of a course at San Francisco State University that is taken by
in their final year of a physics BS or, even more commonly, by incoming graduate
These students have taken the usual undergraduate courses in mathematics: calcuvector calculus, differential equations, and linear algebra. They have also already taken
courses in mechanics and electricity and magnetism. While these students have
some of the material in the first few chapters of this book, they are rarely comfortable,
and accurate using these tools. At San Francisco State, incoming graduate stuwho already have a degree in physics take a placement test, which shows that most
them cannot perform vector calculations correctly, especially when asked to abandon
ian components! Thus, the early chapters include some basic material as a review
vtl
vilt
PREFACE
and for completeness. At the same time, this material can now be approached in a more
sophisticated manner.
There are many books that discuss the mathematics that graduate students in physics need
to know. Some of them are extensive reference works in two volumes. Every student needs
one of these books, but not as a textbook. Most textbooks cover too much material for a
one-semester course, and most of them are too mathematical in their approach for today's
students, who usually do not have an extensive undergraduate preparation in mathematics.
The goal of this book is to offer a one-semester course umed at physics students-a book
that provides students with the tools that they need and shows them how to use those tools
in physics problems. Because these students have some experience in physics, I draw on
that knowledge in examples and exercises.
The book is organized as follows. The eight chapters cover the basic material that I
have taught every year. Then five optional topics provide additional material, from which
instructors may select to round out the semester or which students may read on their own.
Some of the chapters include material normally covered somewhere in the undergraduate
curriculum, but pushed to more depth and/or breadth. Instructors may wish to skip some
of this material or assign it as required reading. Chapter 1 (on vectors and matrices) and
Chapter 3 (Differential Equations) fall into this class. Some chapters cover material not
normally explored in detail until the graduate program. Chapter 6 (on generalized functions)
and Chapter 7 (Fourier Transforms) are in this category. The optional topics are at a more
advanced level than the material in the chapters. Within each chapter, the material becomes
progressively more difficult. Instructors who want a lower-level presentation can simply
omit the final sections of each chapter and the optional topics. Instructors with more wellprepared students may want to use the early sections of each chapter as assigned reading
and concentrate on the more difficult material in the later sections.
Material introduced in one chapter is reinforced in later chapters. Thus, Chapter 2 (Complex Variables) uses material on vectors (Chapter 1); the discussion of inverse transforms
(Chapters 5 and 7) makes extensive use of contour integration (Chapter 2); and Chapter
7 (Fourier Transforms) uses material on delta functions (Chapter 6). Material on special
functions (Chapter 8) draws on series solutions of differential equations (Chapter 3) and
prior experience with Fourier series (Chapter 4) and Fourier transforms (Chapter 7). After
a tool such as the Fourier transform has been introduced, I immediately give examples of
how that tool is used in the solution of physics problems. Even though there is no single
chapter labeled "Partial differential equations," many techniques and examples involving
the solution of PDEs are presented throughout the text.
Mathematical proofs are kept to a minimum in the text, but many have been placed in the
appendices. Students who plan a career in theoretical work will need to study the proofs
in more detail at some time. My goal here is to show students how to use the mathematics
to get results (see, for example, the sections on using contour integration in Chapter 2).
Similarly, many of the examples are applications in physics.
I debated whether to include numerical methods in this text. I finally decided to omit
computer applications almost entirely. (There is a brief discussion of numerical solution of
differential equations in Chapter 3 and some discussion of computer methods in solving
matrix equations in Chapter l.) Of course, in the 21st century every physics student needs
competence in using a computer, and many of the problems suggest using a computer for
PREFACE
IX
graphics or numerical answers. But the student also needs to understand what the computer
is doing, and why. And that same student still needs to have the basic mathematical tools
and to understand why and how they work. The computer material belongs in a different
course, and in a different book.
A Student Solutions Manual, containing solutions to about 25 percent of the problems in
the book, is available for sale to students. A box around the problem number in the text indicates a problem solved inthe Student Solutions Manual. An Instructor's Solutions Manual,
containing solutions to all of the problems, is available to instructors. For further information about either of these ancillaries, instructors should contact their Cengage Learning
sales representative.
ACKNOWLEDGMENTS
The material has been extensively class
tested-with
a
favorable response-at San Francisco
University, and I thank the many graduate students who have taken this class and have
shaped my vision of what this book should be. I am also grateful to my colleagues at SFSU
who read the numerous drafts of the text. Special thanks go to Dr. John Burke (Physics and
Astronomy Department) and Dr. David Ellis (Department of Mathematics). My department
chair, Dr. James Lockhart, also provided useful criticism of the manuscript and supported
my efforts. I would also like to thank the following reviewers for their helpful comments
on various drafts of the manuscript:
State
Albert Altman,
U niv
e
r s ity
of M as s achu s ett s, Low
e
ll
Giles Auchmtty, University of Houston
Thomas Beatty, Florida Gulf Coast University
Paul L. DeVries, Miami University
Nevin Daniel Gibson, Denison University
Porter W. Johnson, Illinois Institute of Technology
David Kastor, University of Massachusetts, Amherst
Igor Ko goutio uk, M inne s o t a S tat e U niv e rs ity -M ankat o
Daniel
P.
Lathrop, University of Maryland
Romulo Ochoa, The College of New Jersey
Daniel Phillips, Ohio University
George R. Plitnik, Frostburg State Universiry
Asok K. Ray, University of Texas at Arlington
John F. Reading, Texas
A&M University
Peter S. Riseborough, Temple University
Sergei Shandarin, University of Kansas
Charles Stanton, CSU San Bemardino
Krzysztof Szalewicz, University of Delaware
Robert L.Zimmerman, University of Oregon
his love and suPPort'
Contents
Preface
1
vii
Describing the Universe
I
1.1 A Universal Language
1.2 Scalar and Vector Fields 17
1.3 Curvilinear Coordinates 28
1
'1.4 The Helmholtz Theorem
1.5 Vector Spaces 37
1.6 Matrices 41
Problems
2
37
65
Complex Variables 75
2.1 AllAbout Numbers 75
2.2 Functions of Complex Variables 89
2.3 Complex Series 101
2.4 Complex Numbers and Laplace's Equation 112
2.5 Poles and Zeros 117
2.6 The Residue Theorem 122
2.7 Using the Residue Theorem 127
2.8 Conformal Mapping 150
2.9 The Gamma Function 156
Problems
3
160
Differential Equations 169
3.1 Some Definitions 169
3.2 Common Differential Equations Arising in Physics 170
3.3 Solution of Linear, Ordinary Differential Equations 174
3.4 NumericalMethods 197
3.5 Partial Differential Equations: Separation of Variables 208
Problems
4
Fourier
4.1
211
Series
Fourier's
219
Theorem
219
XI
xll
CONTENTS
4.2
4.9
Finding the Coefficients 22o
Fourier Sine and Cosine Series 227
4'4UseofFourierSeriestoSolveDifferentialEquations234
4.5
Convergence of Fourier
Problems
Laplace
Series
24o
243
T[ansforms
251
5.1 Definition of the Laplace Transform 251
5.2 Some Basic Properties of the Transform 253
5.3 Use of the Laplace Transform to solve a Differential
5.4 Some Additional UsefulTricks 261
5.5 Convolution 265
5.6 The General lnversion Procedure 269
5.7 Some More PhYsics 273
Problems
Equation
279
Generalized Functions in
Physics
287
6.1 The Delta Function 287
6.2 Developing a Theory of Distributions
6.3 Properties of Distributions 307
6.4 Sequences and Series 310
6.5 Distributions in N Dimensions 313
305
6'6DescribingPhysicalQuantitiesUsingDeltaFunctions315
6.7
The Green's
Function
Problems
318
317
Fourier Tbansforms 323
7,1 Definition of the FourierTransform 323
7.2 Some ExamPles 325
7.3 Properties of the Fourier Transform 329
7.4
7.5
7.6
7.7
Causality
Problems
Sturm-Liouville
8.1
8.2
334
Use of FourierTransforms in the Solution
of Partial Differential Equations 336
Fourier Transforms and Power Spectra 341
Sine and Cosine Transforms 343
351
TheorY
The Sturm-Liouville
3s7
Problem
357
Use of Sturm-Liouville Theory in
Physics
363
E.3ProblemswithSphericalsymmetry:SphericalHarmonics36T
8'4ProblemswithCylindricalsymmetry:BesselFunctions39T
CONTENTS XIII
8.5 Spherical Bessel Functions 419
8.6 The ClassicalOrthogonal Polynomials 425
Problems
428
Topics 439
Tensors 439
.
Optional
A
A.1 Cartesian Tensors 439
A.2 lnner and Outer Products 444
A.3 Pseudo-tensors and Cross Products
A.4 GeneralTensor Calculus 447
A.5 The Metric Tensor 451
A.6 Contraction 452
A.7 Basis Vectors and Basis Forms 453
A.8 Derivatives 455
Problems
B
Group
B.1
445
460
Theory
465
Definition of a Group 465
Examples of Groups 466
8.2
8.3 Classes 470
8.4 Subgroups 471
8.5 Cyclic Groups 472
8.6 Factor Groups and Direct Product Groups
8.7 lsomorphism 473
8.8 Representations 474
8.9 Generators of Groups 484
8.10 Lie Algebras
Problems
C
Green's
487
490
Functions
495
C.1 Division-of-Region Method 496
C.2 Expansion in Eigenfunctions 499
C.3 Transform Methods 503
C.4 Extension to N Dimensions 504
C.5 lnhomogeneous Boundary Conditions 510
C.6 Green's Theorem 512
C.7 The Green's Function for Poisson's Equation
in a Bounded Region 514
Problems
D
Approximate Evaluation of
D.1
D.2
Integrals
The Method of Steepest Descent 531
The Method of Stationary Phase 535
Problems
L-_
526
537
s31
472
CONTENTS
E
Calculus of
Variations
541
E.1 lntegral Principles in Physics 541
E.2 The Euler Equation 544
E.3 Variation Subject to Constraints 548
E.4 Extension to Functions of More Than One Variable
Problems
551
552
Appendices
I
ll
lll
lV
V
sss
Transformation Properties of the Vector Cross Product 555
Proof of the Helmholtz Theorem 556
Proof by lnduction: The Cauchy Formula 559
The Mean Value Theorem for lntegrals 560
The Gibbs Phenomenon 562
The Laplace Transform and Convolution 565
vl
Proof That
Vl
ri@):
(-1)m(1
- *zynffiPrfu)
Vlll Proof of the Relation [f, oJ^(kdJ^(k'p)
lX The Error Function 571
X
Xl
:
1
;6(k
- k)
56e
Classification of Partial Differential Equations 574
TheTangent Function:A Detailed lnvestigation of Series Expansions
Bibliography
Index
dp
568
s8s
Mathematics for Physicists
i
t
i
t
CHAPTER
1
Describing the Universe
1.1. A UNIVERSAL LANGUAGE
The wonder of physics is that its laws are universal. So far as we can tell, the same laws
describe the behavior of things everywhere in the universe. We need a language for these
laws that is equally universal; that language is mathematics. The laws of physics are written
using mathematical terms that are independent of the reference frame that we use; that is, the
laws shouldbe independentof the coordinate systemthatwe choose, and also of any uniform
motion of the reference frame. Ensuring that all physical laws obey the velocity rule was
Einstein's inspiration for the theory of special relativity. Here, we'll begin by considering
the first constraint-that the physical laws be stated in a form that is independent of the
coordinate system used.
1
1.1.1. Common Coordinate Systems
Coordinate systems are used to describe the position and orientation ofobjects in space. The
three coordinate systems in most common use are the Cartesian, cylindrical, and spherical
coordinate systems.
Cartesian Coordinates
The Cartesian coordinate system is the familiar x, ), z system (Figure 1.1a). The coordinate
axes are mutually perpendicular and are generally chosen to be right-handed. That is, ifyou
pick a point to be the origin and then choose the orientation of two of the axes, the third
axis is determined by the right-handed convention.
I
L-_
See
Optional Topic A for more on the mathematics of special relativity.
CHAPTEF 1 DESCRIBING THE UNIVERSE
a point is described by its perpendicular distances from three mutually orthogonal planes, as shown here'
FIGURE 1.1a. Cartesian coordinates: the position of
The distance ds between two neighboring points with coordinates
y I dy, z + dz is given by Pythagoras'theorem (Figure 1'1b):
ds2:dx2+dy2+dz2
I,
y, z and x
I
dx,
(1.1)
using the
FIGURE 1..1b. The distance between two neighboring points (the line element) is found
PYthagorean theorem.
system'
The absence of cross terms like dx dy is characteristic of an orthogonal coordinate
give
We'll
perpendicular'
mutually
are
axes
the
because
The Cartesian system is orthogonal
1'3'1'
in
Section
a more careful definition of orthogonality
Cylindrical Coordinates
Thecylindricalcoordinatesarep,thedistancefromthez-axis; Q,ananglemeasuredcoun(Figure 1.2a).
terclockwise from a reference line (usually the positive x-axis); and z
1.1 A UNIVERSAL LANGUAGE
FIGURE 1.2a. Cylindrical coordinates:
the position
of a point is described by its perpendicular distance p from one coordinate axis, its perpendicular distance
z from a plane perpendicular to that
axis, and an angle. To obtain tbe an-
gle
reference
line
{,
we drop a perpendicular from
: 0, draw a line
from the origin to the foot of the perpendicular, and measure the angle @
between that line and a reference line
in the plane. Traditionally angles are
measured counterclockwise from the
reference line.
P to the plane z
Zp
........-...-'....-y
x-pcosQ, y-psinQ
(r.2)
p: \F+ y', Q:tan-ryx
(1.3)
and, conversely,
This is an orthogonal coordinate system because the unit vectors p (pointing outward from
the e-axis, parallel to the x-y plane), $ (perpendicular to p, pointing in the direction in which
{ increases), andh are mutually perpendicular. Most of the coordinate systems commonly
used in physics are orthogonal coordinate systems.
The distance ds between two neighboring points with coordinates p , Q , z afi p -f dp , Q *
dQ, z* dz may be found by recognizing that the distance along an arc of a circle with radius
p and angle dQ is p dQ (Figure 1.2b). Then, from Pythagoras'theorem,
h2
:
dp2
+
p2 dOz
+
dz2
(1.4)
FIGURE 1.2b. The distance between two neighboring points is found using the Pythagorean theorem.
The distance corresponding to the coordinate difference dQ is p dQ.
CHAPTER
1 DESCRIBING THE UNIVERSE
Spherical Coordinates
the origin; g' the angle
a point P arc r,the distance from
an angle measured counterusually the positive z-axis); and @'
1'3a)'
(Figure
Hne (fositive x-axis)
FIGURE1.3a.
polar axis
drical coordinates'The polar angle 0
to
is measured from the polar (z) axis
the Position vector of the Point'
reference
line
x
ln terms of Cartesian coordinates'
0
and
:
:
CoS
tZ
'-'
r
r sin0 sind,
Q:tarr-rl- z: r cos0
(1.s)
(1.6)
of radius r sin 0
dQ' the poinl P moves around a circle
When we increase the angle Q by
and so
*"o"gtt a distance r s1n0 dQ (Figure 1'3b)'
(1.7)
d'2 : dr2 + 12 doz + 12 sin2 e dO2
FIGUREl.3b.ThedistancebetweentwoneighboringpointsisfoundusingthePythagoreantheorem.
differences d0 and d'Q are r d0 and
co'ordinate
The distances conesponding to the
r sin9 dQ'
1.1 A UNIVERSAL LANGUAGT
1.1.2. Representing Physical Laws2
Physical laws are represented by equations containing mathematical quantities that are independent of the reference frame or coordinate choice we use to describe them. We can write
each physical law in a coordinate-independent way. For example, we may write Newton's
second law without reference to specific coordinates:
i : ^d.: di
dt
(
1.8)
The quantities appearing in this equation are scalars, vectors, and differential operators. No
coordinates appear explicitly in the equation. (In special relativity theory, we also demand
independence of inertial reference frame, and the time / becomes a fourth coordinate. Then
we have to write Newton's second law somewhat differently. For now we are considering
only coordinate independence and three spatial coordinates.)
Scalars
Scalars are mathematical quantities represented by a single number that is independent of
the coordinate system used. Examples are mass of a particle, electric charge of a particle,
and distance between two points. The quantity m in equation (1.8) is a scalan
Vectors
Vectors are geometrical objects (arrows) with magnitude and direction. Thus, equation (1.8)
is a relation between two arrows F and d. These two vectors must have the same direction,
and their magnitudes are related by the scalar ru. Vectors are added geometrically by placing
them head to tail. The sum has its tail at the tail of the first vector in the sum, and its head
at the head of the last vector in the sum.
Some problems can be solved by using the vector representation (1.8). But often we
find it easier to set up a coordinate system and work with the individual components of
the equation. In any particular coordinate system, a vector is represented by three numbersthe three components of the vector. The vector may be written
i : (ur, uy , ur)
or, using index notation, ut, where i : 1,2,3 represents the -r, y, or z component of the
vector. In this representation, we regard the vector i as the sum of three vectors, each
parallel to one of the coordinate axes. The magnitude of each of these vectors is given by
the magnitude of the corresponding component, while the sign of the component indicates
the direction (in the direction of increasing or decreasing the coordinate).
When we change the coordinate system, these three numbers (ur,uy, and u.) change.
But they must change in a specific way in order for equation (1.8) to remain true. In
order to see how the components change, let's restrict attention to Cartesian coordinate
2In this section
it is assumed that the reader has had some previous experience with matrices. Readers who have
not should read Sections L6.1-1.6.3 now.
L
CHAPTER
1 DESCRIBING THE UNIVERSE
systems.3 The two ways to change the coordinates and still maintain a rectangular system
are to (a) move the origin and (b) rotate the axes. Change of origin affects the position vector
but does not affect vectors like and d. Thr s, the important changes are rotations of the
i
i
coordinate axes.
Suppose that the system of coordinates xt , J' , z/ is obtained from the original system
x, y, zby rotating counterclockwise about the z-axis through an angle 0. Futher, suppose
that a vector i has components (ur, uy, uz) in the original coordinate system. Let's find the
components in the new system.
The
x- and y-components may be constructed geometrically by projecting the vector
onto the x-y plane and dropping perpendiculars from the end of the projected vector to the
coordinate axes, as shown in Figure 1.4. Thus,
Ur,:OAIAB
: -1D. * (uu cos d
: fa(r
cosd'
_
u, tand) sinO
sin2 o)
*
u, sin d
:urcos9*ursind
FIGURE 1,4. Rotation of coordinate
(1.e)
axes. Here the prime axes are obtained from the unprime axes
about the z-axis. OC is the projection
by rotating counterclockwise through an angle
of the vector onto the x-y Plane.
I
i
Similarly,
ur'
:
BC
(uy
-
u, tan0) cos0
--uxsing*urcos0
(1.10)
3In any coordinate system we need three numbers to completely describe a vector, but in a non-Cartesian system
the description is more complicated. See, for example, Problem 10 and Optional Topic A.
1.1 A UNIVERSAL LANGUAGE
uz'
Each new component is a linear
are
:
uz
combination of the old components. The two representations
related by a set of nine numbers Arj:
3
,!:DAijuj
(1.il)
i-l
where
:
cos
Aij
:
0,
At2
: sin9,
Azt
: - sin?,
A22
: cos9,
Azz
:
I
O.We may write these relations using matrix multiplicationa:
:(-r:i'"ifiJ;""13):(
,g il? lX,,
) ,,2,
i' :,Ai
(1.
l3)
(1.14)
the
rotation matrix that describes the coordinate transformation. This matrix allows us to
the components of any vector in the new coordinate system.
1,2,3. Similarly, the components
We write the vector components 4s ui,
ffix .A are the numbers A;7, where i labels the rows and labels the columns5 :
i :
j
Atz
Ars
Azz
Azz
x:1
of the
\
(1.1s)
)
Section 1.6.1. To perform the matrix multiplication, take the dot product of the nth row of the matrix with
vector expressed in the original components to obtain the nth new component.
is the most commonly used convention
for labeling the matrix components.
CHAPTER 1 DESCRIBING THE UNIVERSE
This matrix has several nice properties:
as a vector, is orthogonal to every other
1. It is orthogonal; that is, each row, considered
row:
t
A;iApi
I
1.i1'+;1,:
:lip
(1.16)
j
The same is true of the columns:
ty
(1.17)
T
In these expressions,
6;7.
is the Kronecker delta, which equals 1 if
I : k and zero
otherwise.
2. The determinant6 of the matrix is *1.
3. Because of properties (1) and (2),the inverseT of matrix.A equals its transpose:
/
cos?
A_,_n':(;"0
- sind
cos0
0
0
(
1.18)
01
Matrices that share the properties (1) and (3) but with determinant - I represent a transformation from a right-handed coordinate system to a left-handed system; that is, they
represent a combination of a rotation and a reflection.
The matrix multiplication may also be written using index notation, as in equation (1.11).
!.
Whenever
a commonly used shorthand calledthe summation convention, we drop the
an index is repeated once, we understand that we are to sum over that index. That is,
In
3
u'i: Aijuj
means
u'r:lli1u1
j:r
The index I in this expression is not summed; it is called a free index, and it must appear
once on each side of the equation. When using the summation convention, it is important to
remember that, in any one term, an index may not be wittenmore thantwice. For example,
A j ju j, with j repeated three times, is meaningless.
The identity matrix has components
':(i:l)
6See Section 1.6.2.
TSee Section 1.6.3.
(1.19)
1.1 A UNIVERSAL LANGUAGI
and is written in index form as E;7. Multiplication by the identity matrix leaves a vector
unchanged. In index notation, multiplication by the Kronecker delta 6;; simply replaces the
index label
I with l:
8;iu1
:
(1.20)
1t;
We may express the matrix ^A in terms of the angles between the old and new axes. Let
04 be the angle between the lth new axis and the 7th original axis. Then
Aij : cosoij
For the example above (rotation through 0 about the z-axis), 0n :
gn r 12 * 0, 022 : 0, and 0n : 0. The angles 9zj : n 12 fot j :
--
(r.2t)
e,0n : rl2 l, ).
0,
Invariance of Vector Equations
Now let's look at Newton's law again. In the first coordinate system, we may write each
component of the equation as
Fi
:
(1.22)
mai
Each of the vectors may now be transformed to the new system. The new components are
F!
: AijFj
(r.23)
and
a'i: Aiiai
of these relations to obtain
tion (1.23) on the left by the matrix .A-1:
We may invert each
(A-1)*i Fi
Then use the results
:
F; in terms of F/. First multiply
(1.24)
equa-
(A-1)p;A;; F;
: Ar and that A-1.4 is the unit matrix:
(Ar)rifi : (A-r)7.;A;1F1 :3p1F1
that,A-l
and so
A;pF{
: lp
(t.2s)
Equation (1.25) is the inverse of relation (1.23) and transforms the components in the
primed system to the original system. We can use this relation to transform both vectors in
equation (1.22):
AliFl
h--
:
mAiiati
10
cHAprER 1 DEScRIBING THE UNIvERSE
A1i(F1'-ma',):O
Multiply on the left
bY A:
Ap;A1i(F!
-ma',):O
Use equation (1.16):
Sri@i
-
ma',)
:0
Then use property (1.20):
Fi,
- ma|r:
O
if it is true in the original system.
(1.23) is a necessary consequence of the
law
transformation
the
that
see
we
Thus,
of coorgeometrical nature of vectors and the fact that vector relations are independent
a
vector'
of
dinate rotations. The transformation law (i.23) becomes part of the definition
So the equation is true in the primed system
by a set
A vector in three-dimensional space is represented in Cartesian coordinates
(1.23) when
to
equation
according
transform
that
(its
components)
of three numbers
the coordinate axes are rotated.
BA I,AB
and then
ComPare
t the new
Example
about the
it with th
order)'
z-axis (that is, the same rotations but performed in the reverse
: 033' )zt
Using rule (1.2I), we have for the first rotation 0n : 0
1ij
: r
l2'Thtts'
A_
(ril)
rder in which the matrices
the center: (B'N)kj : Bkt
Placement of the indices'
matrix multiPlication' we
repeated indices next to each
other
A
i
i Bp;
:
Bp; Ai
i = (lEA)1;'
: z' all other
1.1 A UNIVERSAL
For the second rotation about the new x-axis, 0zz
9ij
:
: n /2.5o
/t
u:lB
O
LANGUAGE
: 0t,0i2 : n, all other
o o\
-?
I)
The two successive rotations are represented by the matrix product:
u^:/l
o o\/ o
I o\ /o I o\
\o -? lJ(-; s ?J:l? s;J
A unit vector along the x-axis with components (1,0,0) in the original system
has
the following components in the rotated system:
(rii)(i):(l)
That is, it lies along the new z"-axis. Similarly, the y-axis has become the x"-axis,
and the z-axis has become the y//-axis (Figure 1.5).
FIGURE 1.5. The
set of rotations in'the first part of Example 1.1.
Doing the rotations in the opposite order, we get
*:(-; ii)(s
L--
?;):(l li)
11
12
cHAprER 1 DEScRtBtNGTHE UNIvERSE
This set of rotations sends .r to
-y't , y to -2"
, and z to
x" (Figure
1.6).
v
x'
FIGURE 1.6. In the second part of Example
1. 1,
the same rotations performed in the opposite order
give a different result.
However, two rotations about the same axis do commute. Thus, if we rotate about
the z-axis through an angle 01 and then again about the z-axis through dz, the result
is a rotation through 0t
]B.A:
-f
02.
cos91
sin91
sin
cos
91
(
00
(
I
r
(
sin(02 * 9r)
cos(dz * dr)
0
d1
cos 02 sin01
cos0z cos0r
t)
i)
*
sinOz
-
sin02sin91
cos9r 0
?
\
)
_AB
Multiplying Vectors
Product with a scalar A vector may be multiplied by a scalar. The result is another vector.
The product md in Newton's second law is an example of this kind of multiplication. The
new vector has the same direction as the original vector, but its magnitude is changed. The
ith component of the new vector equals the i th component of the original vector multiplied
by the scalar.
Scalar product of two
vectors Two vectors may be multiplied together to give either
scalar or a vector.9
9St
i.tly
speaking, this product is a pseudo-vector, as discussed below.
a
1.1 A UNIVERSAL
LANGUAGE
1
3
The scalar or dot producl oftwo vectors is
i.il:
(r.26)
uucos9
where u ardu arethe magnitudes of the two vectors
between them.
i
I
and fr, respectively, and
is the angle
Equivalently, we may write the product in terms of the Cartesian vector components:
v a:
(1.27)
uiui
where, as usual, the repeated index means that we sum over the index i. That is,
v. u :
s-3
Lr,r, :
Dlttl
I
u2u2
I utul
i:1
To show that this product is a scalar, let's start with the product in the prime frame and
transform to the unprime frame:
i.fr:
l)tpt;
:
AiiuiA;pu1,
Notice that we needed to be careful not to write the index -/ more than twice, so in transforming fr we called the summed index k. Now
AijAit : ,+!,Ait : A jrr Ai*:6 jr
and so
i.d:
Since the product
ulu.t;: A;1A;puiu1,: Sjrvjuk: t)kuk:
is the
same
in the two coordinate
?'d
systems,
it is a scalar. This
demonstration actually shows that the product is the same in any two coordinate systems, since the transformation represented by the matrix .A is an arbitrary rotation.
The dot product is also commutative:
i.il : i.i
The magnitude of a vector
i
may be written in terms of the dot product of
i
with itself:
u:lil :Jii
Examples of dot products in phy-sics include the definition of work
(dW
:
F ' aO' ttte
definitionofelectricflux(dO:fr,-ab,andtheexpressionsP:F'i:t'riforpower.
t--
14
CHAPTER
1 DESCRIBING THE UNIVERSE
or cross prodzct of two vectors is another vector
crossproduct or fuo vectors The vector
whose magnitude is given bY
l? x
il :
(1.28)
uu sin9
Thedirectionoftheproductisgivenbytheright-handrule,asshowninFigurel.Ta.
products. curl the flngers of the right hand from
FIGURE 1.7a. The right-hand rule for cross
towarJfr and the thumb gives the direction
i
i x i'
Its comPonents are given bY
(ixi)r:u2tt3-u3u2
(?xi)z:u3ttr-utu3
and
(ixfi)l:utuT-uzur
using the Levi-Civita symbol e;;1:
We may express the result in index form by
t-i
if i, j, k = an even Permutation of 1,2,3
it i, j, k= an odd Permutation of t, 2, 3
if any two of i, j, k are equal
(r.29)
sequence of numbers ijk --> kij ->
(To get an even pennutation' you may rotate the
--> ikj, obtained by
jki, but you may not interchange two of them. The pirmutation iik
Then
interchanging j and k'is an odd permutation')
(i x fr); :
Eijku juk
(1.30)
1.1 A UNIVERSAL LANGUAGI
15
Strictly speaking, the result is not a true vector. It transforms as a vector under coordinate
rotations but does not transform properly under reflections. (To see what this means, notice
that your right hand looks like a right hand however you turn it, but in a mirror it looks
like a left hand, as shown in Figure 1.7b.) The cross product is called a pseudo-vector. (See
Appendix I and Optional Topic A for the transformation law.)
FIGURE 1.7b. When viewed in a minor, the right hand becomes
a
left hand.
represents the parallelogram formed by the vectors i and fr. The
product
equals the area of the parallelogram, and the direction is
magnitude of the cross
parallelogram.
Examples of the cross product in physics include
plane
of the
normal to the
(t
:
i t Fl, angularrnot"nturn of a panicle (i : i x p)' and the Biot-Savart
torque
law for magnetic field (dB : /rgj x idV l4trr).
The cross product
i xi
vectors We may form a triple scalar product of three vectors by taking
the dot product of a third vector fr with the cross product fr x i. It is sometimes written
Prodacts of three
with square brackets as
td, n, fr'l
:
(fr x
i) 'fr': fr. (i x fr) :
(fr' x
i)
.i
(1.31)
why these three products are equal, note that each is the volume ofthe parallelepiped
with edges given by the three vectors fr, i, and fr lFigure 1.8). We may also derive the result
algebraically:
To see
tfr,n,*l
:
fr '
(i x fr) :
eijkuit)jwk
:
ekijuiujwk:
(fr x
i)' *
tt.32)
similarly for the third relation
For example, a particle in circular motion is acted on by a force F. The particle's angular
velocity is 6, and its linear velocity is i : 6 x i, where the vector i is the position vector
of the particle with respect to the center of the circle. The power delivered by the force F is
and
p
L-_
: F.i :F'
(dr
x i)
:
dr' (i
"
F;
:,;. t
CHAPTER
1 DESCRIBING THE UNIVERSE
of the parallelepiped with the three vectors
FIGURE 1,8. The triple scalar product equals the volume
along its edges.
of dot products:
The tripte vector producl may be expressed in terms
(i x i) x fr:
e;;7.(i x
i)itt':
eiikeihutumwk
(1.33)
To evaluate expression (1'33), we may use a nifty relation:
e
iti€ itm
:
6P13;a
- 5*^3it
(1.34)
index 7 in the same position in
To write relation (1.34) correctly, we first put the repeated
Then we pair up matching
both Levi-Civita symbols by a sequ"nc" oi "u"n permutations.
gives the positive term 6u3i^. Then we pair
indices in order (ft with I and i witt, -). This
negative term'
up the nonrepeated indices the other way to get the
all 3a possible combinations of
check
to
have
don't
(1.34).
We
equation
Let,s verify
k,i,l,andmseparate|ybecausewecanusethepropertiesoftheeand6symbolstoshow. if
First note that the e symbol is zero
that the relation is true for certain groups of values.
: i ot I : m' all three terms in the sum on
any two of its indices equal each oih"t, to \f k
: i or I : m,bothterms on the right-hand side are
the left-hand side are zero. Bot if k
identicalandtheirdifferenceiszero.Therelationistrueinthiscase.
Iftheindicesk,i'I,andruincludeallthevaluesl,2,and3,bothsidesalsovanish'side
k : I, i : 2, I : 2' and m : 3' Then on the left-hand
Suppose, for example, that
we have
eizeiZZ:0
: t323 :0' A quick check shows that in this
since e;rz : 0 unless i :3,but then ei23
on the right-hand side is also zero. So we
term
case one of the I(ronecker deltas in each
all cases except k -- I' i : m ot k : m'
zero-in
have verified the relation-both sides are
: The first of these possibilities (k : I, i : rn) gives
i
l.
e
jki€jrm:l elmeltm: *I
J
1.2 SCALARANDVECTOR
FIELDS 17
Note. In the middle term, we use the summation sign and do not use the summation
convention; that is, we do not sum over I and m.
Ontheright-handsideof equation(1.34),thefirstcombinationof
the second is zero. The other possibility is
e jki€ jtm
k
: s)
:
m,
i : l, which
eimteitm
6sis
1x 1: l,while
leads to
: -1
J
This time it is the second set of deltas on the right-hand side that equals 1, so the right-hand
side is also -1, as required, and relation (1.34) is proved.
Returning to the triple cross product (1.33), we have
(il x
i) x fr:
:
:
(i x i)
x fr
eijk€ jtmutl)muk
eikieilmulDmuk
(5u6i-
ukuiwk -
6p-6i1)u1uaw11
uiukwk
: ntd.m) - fr(n. n,)
(1.3s)
This result is sometimes known as the "bac-cab rule":
dxdxi;:6ta O-itd.6l
right, always start with the middte vector (i or 6 in the examples
above) times the dot product of the other two vectors. The second term has the other vector
To be sure you get the signs
inside the original parentheses (fr or i) times a dot product.
For example, the torque exerted by the magnetic force on a charged particle is
t: i
xF
: i,. q(i x E) : qli(i.il - iff ' tll
Tensors
Some physical laws cannot be represented using only vectors and scalars-they require an
object that is represented by more than three numbers. These objects are called tensors (see
Optional Topic A).
1.2. SCALAR ANDVECTOR FIELDS
in space. For example, a weather
map shows the air temperature in different cities. In principle, we could measure the
Some physical quantities are associated with points
temperature at each point
in space. Since temperature is a scalar quantity, the values of
18
cHAPTEB 1 DEScRIBING THE UNIvERSE
can represent the field visually by draw-
we shall make that assumption in
most cases, the derivatives are also continuous, and
what follows, except when explicitly noted otherwise'
we have a vector field'
when a vector quantity is associated with each point in space,
magnetic field'
m another' For example' the wind speed is
ity vector'
E
a
1.2.1. The Gradient
one vector field we can derive from a scalar field
coordinates, the gradient operator is written
o(i)
is the gradient
io.
tn Cartesian
/a a a\
u' ar)
v- = (a''
- /ao
vo=
(a"
The change in
o
ao
(1.36)
dy
when we move through an arbitrary displacement
4O
ao ao .
- -dx * no, aY4r:
a,
Vo.ad
4i : (dx, dy, d'z) is
(t.37)
: o. Thus, the gradient vector is perpendiculartant@ateachpoint.Inorderto-obtainthemaximum
changed@|d3|,dsmustbeparalleltoVo.Thus'thedirection
in which <D changes most rapidly at that point (Figof V<D at a
o is obtained by moving perpendicular to the lines of
ure 1.g). Th
If
d3 is tangent to the constant-(D contour, dQ
constant
(D.
Thed,irectionalderivativeofOinadirectiondescribedbyaunitvectoriisi'iO'
1.2 SCAI-AR AND VECTOR FIELDS
19
FIGURE 1.9. Contours of constant @. In this diagram, @ is increasing inward. The gradient of
a scalar field at a point is the steepest slope of the scalar field at that point. VO,
indicated by the arrows, is perpendicular to the constant-@ contours.
1.2.2. Properties of Vector Fields
The Divergence
In Cartesian coordinates, the divergence of a vector field is
i .t:
aux
0z
0y *9!u
0x *U"
(1.38)
In index notation, it is
V
.t: 9a
dxi
(1.3e)
The divergence is a scalar that indicates how much the vectors spread apart, or diverge,
from each other around a point. The electric field due to a positive point charge is a good
visual example of a vector field with positive divergence at the position of the charge.
A vector field whose divergence is zero is called solenoidal.
The Curl
In Cartesian coordinates, the curl of a vector field fr is given by
Du" 0u, 0r, au" \
i x n: (Yu
-Yt.
\ ay 0z' 0z 0x' Ar- Ay)
L-
(1.40)
20
cHAprER 1 DESoRIBING THE UNIvERSE
In index notation, it is given by
(ixil)i:€ijk#
(1.41)
The curl, like all cross products, is a pseudo-vector. The curl is nonzero at a point when the
vectors circulate around, or curl around, the point. The magnetic field due to a long straight
wire is a good visual example of a vector field with nonzero curl at the location of the wire.
A vector field whose curl is zero is called irrotational.
1.2.3. IntegralProperties of Vector Fields
Line Integrals and Circulation
A line integral of the form
t: ir.dl
(1.42)
occurs frequently in physics.l0 The integral is the sum of the contributions il ' dl for each
differentiai displacementll di ulong the path from A to B. For example, the work done by
a force F between A and B is
pB-
W(A
-->
B): I
Jt
F
'dl
In general, the result depends on the path taken between A and B (Figure 1.10).
FIGURE 1.10. Tlvo different paths between points A and B. Going from A to B along path
returning to A along path 2, we form a closed curve C.
10See,
II
for example, Lea and Burke, Section 7.1.4.
In Cartesian coordinates, di has components (dx, dy, dz).
1 and
1.2 SCALAR AND VECTOB
However, when
the path between
i : iO
A
arrd
I^
FIELDS
21
for some scalar field O, the integral (1.42) is independent of
B:
u
,r:
:
,".
l"^
dx
*
*
uy dy
u, dz)
/Aa * ao * ao \
(*"
Jo
,o' Eo')
rB
: ft ot:
Jn
o(B)
-
o(A)
(1.43)
When the integral is independent of path, the integral around the closed loop
A
-->
B
--> A
is zero:
f-18-1A
$n at: JA.pathl
1 ir.dt+ JB.path2
I n.dt
J
18-pB
- It. t i.dl- /t. patz n.dl:o
J
parn
J
:
The integral around the closed loop C
A --> B --> A is called the circulation of the vector
field d around the curve C. Thus, we have this result:
When a vector field fr is the gradient of a scalar field O in a region R, the circulation
ofil around any curve C in R is zero.
i
Conversely, if the circulation of fr around every crrrye C is zero, then is the gradient of
some scalar field O, and O may be determined (up to a constant) by equation (1.43).
The language that we use to describe vector fields comes from fluid theory, because
fluid velocity was one of the first vector fields studied. Water flowing out the drain in a sink
usually swirls around the drain. This is an example of a vector field with nonzero circulation
around any curve that surrounds the drain.
Flux
The flux of a vector field
i
through a surface S is
o,: JsI i.fiidA
where ff at any point on the surface is a unit vector normal to the surface at that point. For
fluids, with d equal to the fluid velocity, the flux describes the rate at which fluid flows
through the surface.
Although flux is a scalar, it has a sign that is associated with the direction of the vector
field d. Flux is positive if the field i points generally in the direction of ff and negative if fr
points generally opposite ff.
\-
22
CHAPTER 1 DESCRIBING THE UNIVEBSE
The Divergence Theorem
The divergence theoreml2 relates the flux through a closed surface S to the divergence
of the vector field in the enclosed volume. When integrating over a closed surface, it is
conventional to choose the normal vector ff to point outward' Then
du
o
oo:
l,i.itav
(r.44)
To prove this theorem, we cut the volume V up into a very large number of tiny cubes
(Figure 1.11).
FIGURE 1.11. Tlvo neighboring cubes inside volume V. On touching sides, the normals outward
from the two cubes are opposite each other.
The volume integral is the sum of the integrals over all the cubes. Notice that the normal
vectors on touching sides of two neighboring cubes are exactly opposite each other, and
so the contributions to the surface integrals f, n ' ff dA for the two cubes from these two
sides exactly cancel. Thus, the sum of the surface integrals for all the cubes reduces to
the integral over those surfaces that have no touching neighbors-that is, over the original
closed surface. So if we can prove the result for one cube (Figure l.l2),we will have proved
it for an arbitrary volume.
I'
differential cube with sides of length dx, dy, and' dz, respectively.
The normals to each side are in the directions +f,, * i, and * 2.
FIGURE 1.12. Integrating over
a
12This theorem is also called Gauss' theorem.
23
1.2 SCALAR AND VECTOR FIELDS
The volume integral on the right-hand side of equation (1.44) is
To evaluate the surface
sides at
x
and at
x
*
o. o' o'
(1.4s)
integral on the left-hand side of ( I .44), we first consider the pair of
dx. For the side at x, ff
: -i
I O OdA:
ls
where z,
.W.ff)
v idv : (Y
l,u**u"oo*
i
(x) is the r-component of
and
-ux(x\dy dz
evaluated at x, while for the side at x
*
dx, ff
: *i
and
/o o dA:ux(x-tdxldydz
Js
Using the definition of the partial derivative 0u, f 3 x , we find that the sum of the two terms is
[u,(x
*
dx)
-
u"(x)ldy dz
: !a*
dx
a, a,
y + dy arrd z, z + dz. Thus, the
left-hand side of equation (I.44) also equals the right-hand side of (1.45), and the theorem
We get similar contributions from the pairs of sides at y,
is proved.
To understand the meaning of the theorem, think of the electric field due to a positive
point charge. If we surround the charge with a volume V , then the electric field is outward
at each point of the surface S, and so the flux through the surface is positive. Similarly, the
divergence of the electric field is positive at the position of the charge, as required by the
theorem.
1.2. Compute the divergence of the vector field i : ki, where i is the
position vector and k is a constant. Find the flux of i through a cube of side a centered
at the origin and show that it equals / V i dV over the volume of the cube.
The divergence is
Exampte
i .i:
kV
.('l
+ yi
+
zi:
:1,
(
!.
\d"r
* !,
dy
: ,o
*3.)
dz /
Since the divergence is a constant, we can immediately compute the volume integral
as
3k times the volume, or 3kaj
The flux is
.
6
kk*-r yj + zit
.ffr a
e
Jsurface of cube
On each face of the cube, the normal is
atx:
Ai, *i,
or
*2,
so the flux through the face
a12is
t *!;t t;dA:klaz-oot
2
2
Jh."2
24
cHAprER 1 DESoRIBING THE UNIvERSE
while at x
: -a/2
we have
l,^""0
(-1)i
(-*) at
: kla2 :
+
The result is the same for each of the six faces, so the total flux is 3ka3 and equals
the volume integral of the divergence.
Stokes'Theorem
Stokes'theorem relates the line integral of a vector field il around a closed curve C to the
curl of the vector field. First we define a surface that spans the curve C to be any surface
whose edge is the curve C. Think of blowing bubbles, using the curve C as the wire. Dip
the wire in the soap solution. If curve C lies in a plane, you will get a planar film of soap
solution attached to the wire (the curve C) at its edges. This is the simplest surface that
spans C. Now as you blow the bubble, the surface expands but remains attached to the curve
C attheedges. You obtain a sequence of surfaces, each of which spans the curve C. Stokes'
theorem applies to all such surfaces.
We define the normal to the surface using a right-hand rule. Curl your fingers in the
direction of di and the thumb gives the direction of fi (Figure 1.13). Then
t';
ai:
l,ru
xi'y n ae
(1.46)
FIGURE1.13. DefinitionofthenormalvectorusedinStokes'theorem.Curlthefingersofyourright
hand in the direction
ofdi,
and your thumb points in the direction
offi'
To prove Stokes' theorem, we use a method similar to the one that we used for the
divergence theorem. Divide the surface up into a mesh of little rectangles. The surface
integral is just the sum of the integrals over all the rectangles. Summing the line integrals over all the rectangles, again we note that the contributions fr'dl from touching
sides cancel, since the vectors di have opposite directions. Thus, the sum of the line
integrals over all the rectangles reduces to the integral over the curve C, as shown in
Figure 1.14.
I.2
SCALAR AND VECTOB FIELDS
FIGURE 1.14. The surface divided into differential rectangles.
Let's look at one of the rectangles that is in a single plane, and choose that plane to be
the.r-y plane (Figure 1.15).
D(x,y + dy)
I
C(x
dx,y + dy)
oz
B(x + dx,y)
A(x, y)
FIGURE 1.f5. A single differential rectangle. We choose x- and y-axes parallel to the sides of this
rectangle.
The integral on the right-hand side of equation (1.46) is
f_
9J!\ a, a,
I-/differentialrecrangle
tvxi.l .ffdA:(Vxdl .zdxdy:(Y'
av)
\ 3x
while on the left we have
t/J AB+BC+CD+DA
il - dl
:
ux(x, y)dx
-
:
-l ur(x * dx, y)dy
u,(x, y * dy)dx
luy(x -f dx, y)
0u,, dv
- Ex'0y
'dx -
-
0u-
-
ur(x,
t)dl
ur(x, y)ldy
'dv
-
lu*(x, y
* dy) -
u*(x, y)ldx
dx
which equals the right-hand side. Thus, the theorem is proved.
For the special case of a curve lying entirely in the x-y plane, Stokes' theorem reduces to
f u, a* r u, dy : I,(* - Y) o. o,
26
cHAprER 1
DEScRTBTNG THE
Since any functions
f
(x ,
uNtvERsE
y) and g (x , y) may replace the x- and y-components of fr in this
proof
f ,r o.*sdy): l,(#-K)0,0,
(1.47)
In this form, the theorem is known as Green's theorem.
Example 1.3. Find the circulation of the vector field i : x2y3 (i *-i) around a
squareofside ainthex-yplane,centeredattheorigin.Alsocompute/(ixi;.fiaa
over the surface ofthe square, and show that the result is equal to the circulation.
We compute the circulation by calculating the line integral around each of the four
sides of the square, starting at x : a 12. We have
": I:;,(;)'"
We obtain the same result for the side at
o':
x
:
(;)'
'i1,"'-,,:o
-a12. At y
-
a/2,
,,: I"-r'' (;)' * o" : (;)' +,::,' : - (;)' ? (;)' : -? (;)'
whileaty:-a12,
,o:
a/2
lr"(_;),
*2 d*
rat.3 2 r
:
: _ (;), *!T _",,: - (;)'
; (;)' -? (;)'
Thus, the circulation is
:
f a ai:o-?(|)' *r-'r(;)' -tG)'
Now we calculate the curl. The normal to the square is in the 2 direction, so we only
need the z-component:
1i x i;.
:* -W - 2*y3 -3r'y'
1.2 SCALAR AND VECTOR FIELDS
27
Then the surface integral is
I (u "t)
naa
:
I::,11,,1"" -
:
l:;,(+
=,
Now if curl
i :
a'
- *"')1"'j.,,0.
1",(-* (;)') "
4
: -, (;)'
as we obtained
z'2v2) av
3
(;)'
from the circulation.
0 everywhere on S, then
thusl3 we can express
i : id
$"n'
ai:
0 for any curve C
lyingin S, and
on S. Thus,
ixi:o=+fi:id
Similarly, it is easy to showl4 from the expression for curl that
i:id+ixi:o
Notice that the divergence theorem and Stokes'theorem relate a locally defined property
ofthe vector field (divergence or curl) to a globally defined property (flux or circulation).
"Local" means that the property is defined at each point of the space, while "global" means
the property is defined over a region of the space (on a surface for the flux or around
complete curve for the circulation).
Numerous variants of these theorems exist, and they are all proved in a similar fashion.
example,l5
I,u'o':
frontdA
Repeated Operations with V
can define additional properties ofvector fields using second derivatives. For example,
have already observed that
curl(gradO):Vx(VO)=0
Section 1.2.3.
Problem 19.
Problem 30. Problem 31 gives an additional variant
(1.48)
28
cHAprERl
DEScRIBINGTHEUNIvERSE
Similarly, we can show that
div(curlil;:V'(Vxil):0
for any vector field
i.
(1.49)
Let's evaluate the expression in Cartesian coordinates:
*w-H.*(v-H.*(*-T)
E2u,
02ur: a*ay- ur*--
A2uy
+
A2uv
02u,
+
azar' ayaz-
U'u,"
ozoY
:O
-oxoz'
terms cancel in
Since the order in which we take the partial derivatives is irrelevant, the
parrs.
The second derivatives (1.48) and (1.49) are identically zero, but many others are not'
The combination
div(grado)
: i . tiol - v2o :# -#.#
(l.s0)
appears frequently in physics, as does the combination
curl(curlfr):Vx(ixfr)
We can evaluate this product using the bac-cab rule, being careful that each
i
operates on
everything to its right:
ix(ixi;:Vfi.tl-v2i
(1.s 1)
This expression effectively d'efines the quantity V2fr.
In Cartesian coordinates, the scalar operator V2, the lnplacian operator, is given by
^a2a2a2
v--a?*i7*rt
The same is not
and the same expression holds whether V2 operates on a scalar or a vector.
true in other coordinate systems, as we shall see below'
1.3. CURVILINEAR COORDINATES
is
When solving a physics problem, we always want to choose a coordinate system that
satela
orbit
of
the
well suited to the geometry of the problem. Thus, when studying
lite about the Earth, we would probably choose spherical coordinates with the origin at
1.3 CURVILINEAR COORDINATES
29
the center of the Earth. When studying the magnetic effects of a long current-carrying
wire, we would choose cylindrical coordinates with the z-axis along the wire. And for
studying the electric field due to a conducting disk, the best coordinates are spheroidal
coordinates. l6
1.3.1. UnitVectors
u , and u. The unit vector ff at any
point indicates the direction in which z increases while u and w are held fixed. The vectors
t and fr are defined similarly. We almost always luse an orthogonal coordinate system, in
which ff, i, and fr form a right-handed orthogonal set-that is, the triple scalar product
In general, we may call the three curvilinear coordin ates u ,
[fi,
i' ff1 :
11
We can relate these vectors to the Cartesian unit vectors i, i, and 2 geometrically, using
diagrams such as Figure 1.16, or we can find their components analytically. To see how,
first write the new coordinates u, u, and u in terms of x, y and z and, conversely, express
x, y, and z as functions of u, u, and w : x : x (u, u, w), y : y (u, u, w), and z : z(u, u, w).
Let point P be described by the position vector i and have coordinates uo, u0, and u.'s.
A neighboring point Q has coordinates zs * du, uo, and ru6. Then the vector from P to Q
is in the direction of ff:
di:dx?+dyy+aziq.dutr
(1.s2)
0z^
^ 0x^ 0y^+-z
uo(-x
0u 0u" 0u
and thus
FIGURE 1.16. Unit vectors in cylindrical coordinates. As the coordinate @ changes, so do the
directions of p and S.^We may express these vectors in Cartesian components:
0 : *cosd f isind, $ : -isin@ * $cos@. The third unit vector in this system
is 2, the same as in Cartesian coordinates. Note that the three vectors are mutually
perpendicular.
l6See Chapter
L-_
2, Problem 13, for a definition of these coordinates.
30
oHAPTER 1 DESoRIBING THE UNIvERSE
Since ff is a unit vector, the
ux:
l-,
y-, and z-components of fi are
0x
h1 0u
Jo*ta. T aylou)2 + (Ezlou)2
0y
:
"y-
:-- l0z
0z/0u
Jruta8i
(1.s3)
1ov
l0u
h1 0u
-
uz:
l0x
l0u
rWtor)2 + (ozlou)z
h1 0u
(1.54)
(1.s5)
where the constant of proportionality in equation (1'52) is
ht:
Similarly, we can find
i
and
(#)'. (#)' . (#)'
(1.s6)
fr:
^ I (0* 0y Az\
': h, \ar' ar' ar/
h2:
(#)'. (#)'. (#)'
(1.s7)
(1.s8)
and
^ I /0x
tn:
/r, \a,
(1.s9)
where
h3:
(x)'.
(#*)'
. (u*)'
(1.60)
1.3 CURVILINEAR COORDINATES
31
In a general non-Cartesian coordinate system, the directions of the unit vectors change
with position. This means that we have to be very careful when taking derivatives or integrals
of vectors expressed in curvilinear components.
1.3.2. Metric Coefficients
The distance between two
may be expressed in terms
we may
neighboring points described by the position vectors i andi * di
of differential changes in the coordin ates u , u, and u.r. In general,
write
di : h
and so
du tt + h2 du
i
-f ht dw itr
for an orthogonal system
dsz
: di
.
di : yl
au2 +
tfi av2 + nl awz
(1.61)
(compare with equations 1.1,1.4, and 1.7). The coefficients h; are called metric coefficients.
They relate the length of a geometrical line segment to the differential coordinate element
and can usually be obtained from geometry (for example, Figure l.2b), or they can be
calculated using equations (1.56), (1.58), and (1.60). We can see that such coefficients are
dimensional grounds. For example, in cylindrical coordinates, the coordinate
differential d@ is just a number and does not have the dimension of
length. The correspondinEhz : p (compare equations 1.4 and 1.61) does have the required
dimension of length.
Let's check the expressionfor h2 using equation (1.58). First, from equations (1.2) we
necessary on
@ is an angle, and so the
have
Ex
a0
and z is independent
-p
sin
@;
Y:
o"orr
,r:m
of@, so
:@:o
as
obtained from the geometry in Section
1.
1.1.
1.3.3. Div, Grad, and Curl in Curvilinear Coordinate Systems
Gradient
The
differential change in a scalar field O over a displacement
di
is (Section I.2.1, eqaa-
tion 1.37)
de
: i o . di : i o . (h du i + hz du i * fu dw fi)
32
CHAPTER
.I DESCRIBING THE UNIVERSE
But we can also derive the differential from the fact that O is a function of u, u, and w,
using the rules of calculus:
4q :
ao -f ao
ao
,du * *dw
-du
Comparing these two expressions, we find that the gradient is
tao. lao^ lao^
VO---riI--yI--w
h2 0u
ht 0u
Again we see that the
y'ls
hj
(1.62)
0w
are needed on dimensional grounds'
Divergence
In curvilinear coordinates, the divergenc e is definedby the divergence theorem ( I .44). Thus,
the divergence of a vector field f at a point P is
fv.r:,lryrvI fst-fidA
where the point P is inside the volume V. We begin by evaluating the integral on the righthand side. We can choose our volume V to be a curved "cube" with sides of length du, dv,
and dw and each face lying along a coordinate surface (Figure I .17 a). Then the normals
to the faces are *fi, ti, and *fr. From the pair of faces at u, u + du we get a contribution
- fufu,
where
u, w) h2h3 du dw
+ f"(u
I
du, u, w) hzhz du dw
fu is the u-component of the vector i.
FIGURE l.l7a. A differential volume used to calculate V . f in coordinates u, u, and u.r. The faces
ofthe volume lie along surfaces of constant u, u, arrd rl, respectively, and the edges
have lengths hy du, h2 du, and h3 dw.
33
1.3 CURVILINEAR COORDINATES
We get similar contributions from the other two pairs of faces. Dividing by the volume
hft2h3 du du dw and using the definition of each partial derivative, we have
v
.i:
lim
I- fu(u, u, w) h2h3 I fufu
I
du, u, w) hzhzl
hft2h3 du
du+O
I la(h2h3f,), oththtf,t,
;v'r-hthrkl
;a, - a, -
*
two similar terms
oth1h2f,1l
a,
(1.63)
I
Notice that the fts appear inside the derivatives, since they may be functions of the coordinates il, u, and u and so differ on opposite faces ofthe "cube."
In cylindrical coordinates (Figure l.l7b), with h1 : I, hz - p,andhz:1, we obtain
;, ;_ t la(ofil ,|fo , a(pil1 _1}(pfp) ,10f6 ,0f2
pl, Ep aQ 0z ) p 0p pA0 0z
(1.64)
Notice that the result is dimensionally correct.
FIGURE 1.17b. A differential volume in cylindrical coordinates.
1.4. Compute the divergence of the position vector, i . i, itr cylindrical
coordinates, and compare with the result in Cartesian coordinates (Example 1.2).
In cylindrical coordinates, the position vectorlT is i : p0 * zi. Thus, from equation (1.64), the divergence is
Exampte
:
,
v .i
.
I
I
II
tfrir
ly
: la?'zt dz: !po.
a 1 :z
p dp + pdQ +y
p
dz p Ap +y
!a(e,i
is the same result that we obtained in Example 1.2.
lTThere is no
@-component. Do you see why? See Problem 10.
i
I
t
E
I
II
:
t-
34
cHAprER 1 DESoRIBING THE UNIvERSE
Curl
In curvilinear coordinates, the curl of a vector
i
at a point P is defined via Stokes'theo-
rem (1.46):
6.i 'dl
tV x f) .ff: Iim '"
A+0
,
A
where A is the area of a surface S spanning the curve C, the point P lies on S, and i is the
normal to that surface.
Again we begin by evaluating the integral. We may choose a curve that lies in a constant
, ,rrrfu"" (Figure 1.18). Then the normal is fi : 0, and we will obtain the a-component of
the curl.
u,w
I
alda,wldw
dw
.P
u
a,w
*
du,w
FIGURE 1.18. Calculating the curl using a "rectangle" surrounding point P.
The line integral is
fxh2
dul. * f*hz
dwlu+au
-
frhz dulu+a.
- f*h3 dwl,
aa
dw
: -;(f,hz)du
dw +
*(f,h)du
while the arcais dA : hzhz du dw, and so the il-component of the curl is
1i
x
ir n: hlft,t,r,, - fi<t,r,tf
The other two components may be derived similarly:
a . l"
I ta
vxf : nrrln(f-hiar(f,h))n+
.#1ft,t.r,,- fr,r,',,f*
I
h,h,
t
l#,rt,r,) - *rr,r,r]
(1.6s)
1.3 CURVILINEAR COOROINATES
We may also express
this result using a determinant:
httr hzl
Vxf:
aaa
hft2h3
hsfi
(l.66)
0u 0u 0w
hf" hzfu htf,
When using this expression, take care to keep the terms in the correct order. The operators in the second row operate on the functions in the bottom row, including the metric
coefficients ft;.
In cylindrical coordinates, we obtain
i " i : il# - fr,,r,lu. l* - #16 * i 1ft,,n, - #1,
(t.67)
: (;# - *) ,. (* - #)o . i
lh*o, - *e!"lu
Example
1.5.
The magnetic field in a region of space is given by the function
fr: Bo@la3)(o-p)20,0 < p < a,andB:0forp >
a.Findthecurrent
density in the region.
From Maxwell's equations,
- lj:-VxB
Po
Since fr has only a @-component, we have
i,. n : (-#)o+
ror0
- Bo#lh@,"-
j:
"o
po
*
)1ft<,'alu
r,)],
l(a
i
pa' lrru - d2 -zp2 - D]
L
2Bo
: ffir"
- dtu - 2p)i
p
z-directionforal2<p<a
understand this result. While al
Notice that the curl is in the
L--
al2but
(Figure
ockwise
36
cHAprER 1 DEScRtBtNG THE UNIvERSE
gri
aiis positive when di
runs clockwise around a small curve C
ataf2 < p <
a
because the vectors get shorter as p increases in this region.
,
!\
)a r/ts
,
It
,
t/
/
/
tl
\
/j'
I /t
\
,
pla
a
i\
t\ t
1,
\
t
/t
/r
t/
I
\\
lt
I
I
J
--t./
i
in Example 1.5. The curl is in the positive z-direction ngar t!rc
negative
origin but is in the
z-direction for p > a /2.Note that the integrat fg ' ai ls
positive when di istaken cloclaMse around a small closed curve C placed at p > a /2,
FIGURE 1.19. The vector field
since the length of the vectors
i
i
is decreasing as we go away from tbe origin in this
reglon.
The Laplacian
With O a scalar function, the expressionl8 for V2O may be derived from the above expressions for gradient and divergence:
v2o:
V2O:
l8To evaluate
1
i.tior: u (*#u*
la
t^
hthzhz Ldu
V2i, with i
i
u,ryu.
*#,
ff#) .*(TT). a, (T#)l
a vector
a
function, you must use equation (1.51)
at the end
of Section 1.2.4.
(1.68)
37
1.5 VECTOR SPACES
In cylindrical coordinates, this is
"
V-o
a/lao\ * a/ao\l
: llalao\
; lu, \'a )* a (; uo ) u\'e ))
ralao\
ra2o
l^-lI--I- PoP\"aP)'
P2aO2'
a2o
(1.69)
az2
This operator occurs frequently in physics, and we shall need to refer to result (1.68).
1.4. THE HELMHOLTZ THEOREM
Any vector field
i
may be expressed as the sum of two terms:
F:d+i
fr: io
for some scalar field O and
i:Vx.4,
for some vector field A.
tne proof of this theoremle may
be found in Appendix IL Then
V.i:f.i:v2o
and
i"F:i*fixii
Inparticular,
andif V x F :
i :
i,
0, then we may choose O = constant and express F : i
"
0, then we may choose A : 0 and express F : VO, as we showed in
if i
.
Section 1.2.3.
1.5. VECTOR SPACES
1.5.1. Properties of Vector Spaces
In physics we regard vectors both as geometrical objects (anows) and also as algebraic
objects (an ordered set of numbers: the components of the vector). It is important to have
both descriptions, since each description proves most useful in certain classes ofproblems.
- Here we shall discuss aspects of the algebraic theory of vector spaces.
l
l9See also Morse and Feshbach, Methods of Theoretical Physlcs, Section 1.5.
38
cHAprER 1
DEScRTBTNG THE UNTvERSE
The number of components of a vector equals the dimension of the vector space. In
most applications in this book, we shall be concerned with three-dimensional vector spaces,
corresponding to the three dimensions ofphysical space. Infinite-dimensional vector spaces
importance-in quantum mechanics, for example-but we shall not discuss these
here.
spaces
We shall also restrict attention, for the most part, to spaces in which the vectors
purely
real.2o
are
A vector space consists ofa set ofobjects (the vectors) with the following properties2l:
are also of
1. The sum oftwo vectors is another vector in the
space:
d+6:i
2. Addition is commutative:
d+6:B+d
3. Addition is associative:
G+6)+i:a+6+i)
4. There is a zero vector, d, which
has the property that
d+d:d:d+d
5. Each vector d has a corresponding negative vector -d with the property that
d+(-a):d
6. Vectors
can be multiplied by scalars. The result is another vector:
td
:6
This product is also associative:
k(cd)
:
(kc)d
7. Multiplication by scalars satisfies two distributive laws:
(k
* c)d:
kd + cd
and
c(d+6):cd+ci
8. Multiplication by the scalar unity
20See Optional Topic B
leaves a vector unchanged:
for additional material.
21The first five properties establish the vectors as elements of a group; see Optional Topic C.
39
1.5 VECTOR SPACES
1.5.2. Basis Vectors
Properties (1) and (6) together imply that the sum
f,,0,
i:1
In particular, we can make this sum equal to the zero vector by appropriate choice
If the only possible choice of values that makes the sum equal to zero
:
is ci
0 for every number c;, then the set of vectors i;, i : 1 to n, is said to be linearly
independent. But if there is a set of numbers, at least two of which are not zero, such that
is a vector.
of the numbers c;.
f
,,i,:i
i:l
ofvectors i; is linearly dependent.
In each vector space, there is some number N with the following property: there
then the set
somevectorsi; suchthatDLr
cii; :0onlyif
eachc;
:0,butif
will
be
weaddonemorevector,
c;i1 - 0 with more than
any vector, then we can find a set of constants such that tllt
one of the ci nonzero. The number N is the dimension of the vector space. The original
N vectors are linearly independent and are said to span the space. They may be chosen
as basis vectors i; for the space. Then any vector in the space may be written as a linear
combination of the basis vectors:
N
f:!u;i;
t:l
The numbers ui a.re the components of the vector i. Thus, we may write the vector
algebraically as an ordered set ofnumbers, the components u;.
In physics applications, we define an additional operation called the inner product22 of
two vectors with respect to the basis23:
N
d*6:D",U,:6*d
(1.70)
i:l
Ifthe resulting value is zero, the two vectors are said to be orthogonal. The inner product
of two basis vectors is
N
i76
*i; : I6;1d;; :6;p
t:1
and so the basis vectors are
orthogonal by definition.
inner product may be defined in many different ways. This version is most useful for our purposes here
inner product becomes a more important and useful concept when it is independent ofthe basis chosen. This
to be the case when the operation that changes the basis is orthogonal, See Section 1.6.5 and Problem 50.
40
cHAprER 1 DEScRtBtNG THE UNIvERSE
It is most convenient to have the inner product defined by equation (1.70) correspond to
the dot product defined in Section 1 . 1.2 (equation I .26). We can do this if we choose a set of
basis vectors that are mutually orthogonal in the geometrical sense of being at right angles.
From any initial choice ofbasis vectors, we can construct a set such that the inner product
with respect to this basis is the geometrical dot product. Suppose we have one set of basis
vectors
i;
that are not mutually orthogonal; that is,
ii.6i 4o
+ j. Then we can use a procedure called Gram-Schmidt orthogonalization to
set f;. First we normalize each vector by dividing by the square root
orthogonal
an
create
product
itself:
with
its
dot
of
for some i
ei:
Then
6;'6;:1
These vectors are called unit vectors. Now we start with the vector
iz:62 -
ir :
6t
and define
(62'ir)Gr
so that
i2.ir:
(Gz
.or)
- Gz.6r)(6r .6r) :
o
and
iz .iz
:
:
- Gz. Gr)Grl . 16z - (62 ' 6r)Grl
62. 6z - Gz' 6)2 : | - (62. 6,)2
16z
Then we normalize to get
f2:
l=-"lfr.fo
We may continue in this manner to construct each succeeding f;:
i-r
i:oi-I,n.6)i*: i, :-L
k:t
The vectors
i;
/t, t
are both unit vectors and mutually orthogonal; they are called an ortho-
normal set.
With orthonormal basis vectors, the inner product defined by process (1.70) is identical
to the geometrical dot product defined in Section I.l.2.In what follows, we shall assume
1.6 MATRICES
that the basis vectors are
and the
41
orthonormal, and we shall not distinguish between the dot product
inner product.
In representing a vector algebraically by its components, we write the set of numbers
as
an array, either in a column such as
/o,
d: I
a2
\o,
called a column vector, or in a row such as
i:
(at, az, az)
called a row vector.24 Clearly both represent the same vector.
We may now write properties (1)-(8) of vectors in terms of the vector components in an
orthogonal basis. For example,
1. Two vectors are added by adding the corresponding components:
ci:ai*bi
4. The components ofthe zero vector are all zeros.
5. The components of B : -d are the negatives of the components of d:
bi
6. The components of f,
To
: ci
are
bi
:
: -ai
cai.
work efficiently with vectors written in terms of components, we shall need to develop
additional mathematical tools-the mathematics of matrices.
some
1.6. MATRICES
1.6.1. Basic Properties
ofproperties that defines a vector space includes several procedures for creating one
vector from another-for example, multiplication by a scalar. If the procedure is linear, each
component of the new vector i is a linear combination of the components of the original
vectol5 il:
The list
N
vi:lait<ut<
k:t
24sri.t1y speaking, ifthe column vectoris a, therow vectoris its transpose, ar. See Section 1.6.1.
25In this section we shall not use the summation convention introduced in Section l I.2. When summing over an
.index, we shall show the sum explicitly.
42
CHAPTER 1 DESCRIBINGTHE UNIVERSE
Thus, any linear operation that maps one vector in the space to another may be expressed
as an affay A of N x N numbers called a square matrix, whose elements are the set of
numbers ai1r.The simple operation of multiplying the components of a vector by a number
c is represented by the array
^:(:;l)
.TT
4tr
-
where II is the unit matrix
': (i :?)
This example illustrates the rule:
To multiply a matrix by a number c, multiply each element of the matrix by c.
The elements of the unit matrix may be represented by the Kronecker delta, d;y, whose
components arezerofor i 17 and I when i : ,/.
The usual convention26 for writing matrices with indices, as in a;p, is that the first index
(here l) labels the row while the second (here k) represents the column. Thus, a21 is the
element in the second row and the first column:
/ ott aD orr \
I@a22anl
oy ajz ott
\
/
An important example of a linear transformation is a rotation of the basis vectors (compare
with Section 1.1.2). Other examples are the angular momentum of a rigid body, L : lRri,
where the operator IR is represented by a matrix containing the moments of inertia of the
body and <i is the angular velocity vector.
A symmetric matrix is invariant under reflection about the diagonal that runs from upper
left to lower right; that is, aii : a;i. An antisymmetric matrix has the property that ai; :
-a;y. Thus, an antisymmetric matrix has zeros along the diagonal.
Two matrices are equal only if all of the corresponding elements are equal: .A : lE if and
only if a;i : bi j for each i and j. The elements of the sum of two matrices are the sums of
the corresponding elements: if C : ,A + ts, then c;i : aij * bij.
When two transformations are applied one after the other, we can describe the result with
matrix multiplication. If i : A* and il : lEi, then
i:
B(,Afr)
:
(lEA)fr
The operations in a matrix product are performed in order, right to left. ts.A means first do
A, then do lE.
26Any convention may be used, provided that you make it really clear.
1.6
i:
MATRICES 43
Let's investigate how to perform this matrix multiplication. The components of the vector
Afr
are
u;:l
N
j:r
a;;wi
Similarly,
un:f
b1,;u;
:f
E
Thus, the elements of the product
b1,;a;iwi:
C
:
(
E *rr,",,).,
:f
,0,r,
lB.A are computed as
N
'*i:lbrrioii
i:l
The element
ctj_ is the inner
product27
of
the row vector bp;
/ati\
column vector 4rl - I or', | . fnir row and column intersect
"
\"t"i /
indices
kj
:
(b6,bxz,brz) and the
at the
position labeled by the
(Figure 1.20).
/bn bn brs\
(
^l
,,: )
btz
atz
, \
/ ,tt
,):{t'
";;
a32
1/
Gi
\c:r
ct2 cl3 \
c22
El
ca2 q3 /
FIGURE 1.20. Matrix multiplication: C : lE,A. Element c;; is the dot product of the i th row of IB
with the jth column of A,. This row and column intersect at the position of element
c;y. In this diagram, i :2 and j :3.
Matrix multiplication is not, in general, commutative; that is, B^N + AB.
/r 2 3\
/o l l\
LetA:f I 0 I landB:l 2 I o l.co-putethe
\o t2)
\r 3t)
products .AIB and
Examplel.6.
lB.A,.
?\/g
l;\
^u:/r3
z/\
r
r
3
t)
\o
x2l3xI
x2*IxI
x2l2xl
x 1*2x 1*3x3 1x *2x0*3x
1x1*0x 1*1x3 1x .l0x0*1x
0x1*1x l]-2x3 0x *1x0*2x
1
I
,
s sense
iplied.
only
if
the row and column have the same length. Matrices that do not
44
CHAPTEH 1 DESCRIBINGTHE UNIVERSE
On the other hand,
]8.4,:
(iii)(i i)
:(
I l3\
i : z)
Clearly these products are not the same.
The transpose .Ar of a matrix A with elements a;7 is formed by interchanging rows for
columns. The components of the transpose are (Ar ),, : a j i. Inthe event that the elements
matrix are complex numbers, we can form the complex conjugate A* of the matrix A by
taking the complex conjugate28 of each element. The matrix formed by performing both of
these operations (complex conjugation and transposition) is called the hermitian conjugate
or adjoint matrix, A,t. A matrix is described as hermitian if it equals its adjoint: .A, : .At.
The trace of a matrix is the sum of the elements on the diagonal:
of
a
Tr(A):an+azzl"'*aNN
The trace is a linear operation: Tr (.4,
For a product, we find
Tr(A,JE):
ttIJ
*
lB)
:
Tr (A) + Tr (B).
a;ibii: tt
biiaij:Tr(lEA)
lr
This result is true even when the matrix product is nol commutative: .AlE + BA.
1.6.2. Determinants
A very important and useful property of a square matrix is its determinant. The determinant of a 2 x 2 matix is a number formed by multiplying along the diagonals and
subtracting:
otl an\ : orru,
\-.'./ A)) I
a"t (
\ A)t
ar2a21
The products a\pzz and a12a2r are called elementary products. In an N x N determinant,
each elementary product is a product of N elements, with no two being in the same row or
the same column. We attach a sign to each product of the form atja2k. . .dNm according
to whetherthe set tj,k, ...,ml is an even (*) or odd (-) permutation29 of 11,2,..., N\.
The determinant of an N x N matrix is the sum of all the signed elementary products. For
28See Chapter 2, Section 2.
l.
29To determine whether a permutation is even or odd, count the number of interchanges of two numbers that must
be performed to execute the permutation. An even permutation needs an even number of interchanges, and an odd
one needs an odd number.
45
1.6 MATRICES
a3 x 3 determinant, this result may be written using the Levi-Civita symbol (Section L 1.2,
equation 1.29):
3
A- t
rj
t
ijkali a2j
(1.71)
a3k
'k:1
The determinant of a 3 x 3 matrix .4, may also be formed as follows. Take any row of
the matrix-say, the top row. For each element a;i in that row, take the determinant of the
2 x 2 matix o;; obtained by removing the row and column that contain the chosen element
(Figure 1.21). Then the determinant is
det (A)
:
:
The quantity
an at2
ct2t 422
a3t 432
att(azza33
(-l)t+i
det
-
aI3
: !{-t)t+i
423
433
a;1det
(a;)
J
ana32)
-
ap(a21ay
-
ana3t)
*
an(azpzz
-
azzaz)
(o;;) -- A;1 is called the cofactor of the element a;;.
('"?dan\
( ozr
:
(
Z::)
",,
\i:', i:: i:: ) "''
FIGURE 1.21. Determining the elements of the submatrix
cv;y whose signed determinant
(- t;i+;
A;; -
det (a;; ) is the cofactor of the element a;; . This figure shows how to find
a12. Eliminate the row and column that contain the chosen element. The remaining
suhmatrix is a;7.
We can now write a rule for finding the determinant of any matrix:
The determinant of a matrix A is the sum of the elements of any row or column of the
matrix, each multiplied by its cofactor:
det (A)
:
lAl
: Do,iAti :Do,iA,i
0.72)
lt
Starting with the rule for a 2 x 2 determinant, we may use equation (1.72) to compute
thedeterminantofa3x3matrix,thena4x4matrix,andsoon.ThisiscalledtheLaplace
development.
These properties of determinants now follow30:
. If an N x N matrix is multiplied by a constant c, its determinant is multiplied
cN.
30See
Problems 37 through 39
and4l.
by
46
a
a
CHAPTER
1
DESCRIBING THE UNIVERSE
A matrix and its transpose have the same determinant.
If two rows or two columns of a maffix are identical, its determinant is zero.
The determinant proves very useful in understanding the properties of matrix products.
We will need the following theorem3l:
Product theorem for determinants. The determinant of the matrix C
:
AIE is rfte
product of the determinants of A andB.
Proof (for3x3matrices). MatrixChaselementsci;
is thus (equation 1.71)
-
D*aitbtj,andthedeterminant
3
det(C)
:
i,
D eijkcric2jcak
j,k:r
3
: D t,toDor*u^,Dornb,,\azrbpn
i,j,k:lmnP
: tt
\o1^o2,o3r(
m n p
f,
,,10u^,u4uro\
\,, j.k:l
/
if ffi,fl, p are an even permutation of 1,2,3, then the term in parentheses equals
det (JE), whereas if m,n, p aire al odd permutation of 1,2,3, then we get its negative. If
any two of m, n, p are equal, the term is zero, since it equals the determinant of a matrix
with two equal rows. Thus, we may write det (C) as
Now
det (C)
:
det (AlE)
: t
arma2na3pem,rp det (lB)
:
det (A) det (lB)
(r.73)
m,n,p
as asserted.
A matrix with determinant zero is said to be slngular Such matrices have some interesting
properties. For example, if such a matrix is multiplied by any other matrix, the determinant
of the resulting matrix is also zero. This fact allows us to understand why it is possible for
two matrices, each nonzero, to have a product that equals the zero matrix32:
,AIE
: 0
does not
imply A:0
or
IB
:0
As a consequence, in the mathematics of matrices, the square root of zero exists and is not
necessarily zero.
31
For additional proofs of this theorem, see, for example, Halmos, Section 53 or Tircker, p. 92.
42 and 43.
32See Problems
47
1.6 MATRICES
1.6.3. The Inverse of a Matrix
:
If the product of two matrices is the unit matrix, .A1B
ll, then lB is the inverse of .A, lE
:
.A.-
I
.
From the rule for computing the determinant of a matrix product (1.73), we conclude that
the determinant of A-l is l/det (A). Thus, the inverse exists only if the determinant of the
matrix is not zero (the matrix is nonsingular).
Using the basic rule for matrix multiplication, we have
N
\aiibir:64
j:r
This relation is actually N x N equations for the elements b1r.T\e solution may be found
by a process of successive elimination,33 with the result that the inverse matrix equals the
transpose of the matrix of cofactors of the elements of .4., divided by the determinant of .A:
u,r:K
(1.74)
Let's check this result. The product of a matrix and its inverse must be the identity matrix.
The product has elements
(AlE)4
:D,,,ff
:\a4bi*
l
l
flrst diagonal element is
D1ar1A11
tAt
=
lAl
1
tAt
we used equation (1.72) for the determinant with
2 and 3. The off-diagonal elements are of the form
i
:
1.
The result is the same for
t ot;ff
A't;
(1.7s)
j
numerator is the determinant of a matrix with two identical rows (here the first and
rows) and so is zero.
Example
See
L.7.
Find the inverse of the matrix
below for an example of how to do this.
(i:!)
48
cHAprER 1 DEScRtBtNG THE UNIvEFSE
The determinant of the matrix is D : 2, so the inverse exists. The cofactors are
At : 2, A12 : -4, An : 0, A2t : O, Azz :2, AZZ : 0, AZt : 0, AzZ: 0, and
Azl
:
L Thus, the inverse is
:(i: l):(-i :r)
We can check the result by multiplying the original matrix and the inverse we have
found:
(a: lX
i: i):(il
l)
as required.
A matrix whose transpose equals its inverse is called orthogonal.34 A matrix whose
adjoint equals its inverse is called unitary.
Computer mathematics packages such as Mathematica and Maple have matrix inversion
routines. Even spreadsheets can invert matrices. Formula (1.75) for the elements of the
inverse matrix is not usually the best computational tool.35 Procedures such as GaussJordan inversion are more efficient. To see how these methods work, first we note that
we may devise a set of matrices, called elementary matrices, whose effect is to perform
simplifying operations on a matrix. For example, a matrix with a 1 in each row and in each
column and zeros elsewhere permutes the entries in the matrix:
(r s;
:
X ;ii :,: z,) Qli
Multiplication by this matrix has moved row 2 to row
2.We can also add a multiple of one row to another:
( ; ; ? X ;ii xa: i:):
1,
row
1
z1:
ii)
to row 3, and row 3 to row
(",,7,':',, "-7,?"' "^"t",?',,)
Multiplication by this matrix subtracts two times row 1 from row 2. By putting the number
x on the diagonal of the elementary matrix, we can multiply a row by any number x:
(:
:
; ? X ;ii zi: z,) ftii'
.iai)
xa:
34You should convince yourself that this definition is equivalent to the one given in Section 1.1.2.
35see Press et a1., Chapter 2.
1.6 MATRICES
49
We can thus perform any of these operations on a matrix .A until we get the unit matrix.
Then the product of the elementary matrices that we used must be the inverse of .4.:
EnBn-r.
.
.
ts:EzErA
:
ll
* ErBr-t'
.' lE:lEzlEr
:
A,-l
We may evaluate this product of elementary matrices as a single matrix by applying the
same operations to the unit matrix:
IE,IE,-r
..
.lDrlEzErll
: .A-l[ : A-l
Once we have established this result, we do not actually need to compute the elementary
matrices; we can simply perform the operations on the matrix rows. The allowed operations
are
1. Interchange two rows.
2. Multiply a row by any number.
3. Subtract
a (positive or negative) number times one row from another row.
/z 3 4\
32I
{ I 2
3/
\0
First we check that the determinant is not zero, and thus that an inverse exists:
Example
1.8.
Invert the matrix
/z 3 4\
a"tl I 3 zl:s
\o 2 3)
Next we perform simplifying operations.
1. Subtract row 2 from row
2. Subtract row I from row
1:
(;
i)
/r
0 2\
2:
{o
\o ) \)
3.
Subtract 213 times row 2 from row 3:
(:
:i)
CHAPTER 1 DESCRIBINGTHE UNIVERSE
Subtract 213 times row 3 from row 1:
(r;l)
(r:l)
5. Divide rows 2 and 3 by 3:
Now we perform the exact same operations on the unit matrix.
1. Subtract row 2 from row l:
(i:i)
2.
Subtract row
I
from row 2:
/ r-t o
l-r 2 o
\o o
r
3.
Subtract 2/3 times row 2 from row 3:
/ t
l-r
-1
2
\ 3-t t)
4.
Subtract 2/3 times row 3 from row
l:
/ 5_L _2\
i
?-t
l_?
\
5. Divide rows 2 and 3 by 3:
/ t_t
I
t)
"
_2\
', I
\-i1-l +)
f
'+
Those 9s in the denominator are reassuring, since 9
result by multiplying:
as required.
:
det (,N). Finally, we check our
ei r)ti;i):(i,?)
1.6
MATRICES
51
1.6.4. Matrices and Linear Equations
A set of
N
equations
in N unknowns may be written in the language of matrices. The
equations
atxt+anx2+ ...lawxN aztxtlazzxzl "'*azNxN aNtxt + aNzxz* ':' + oyy*n1 :
bt
bz
6n1
may be written as
.N*:6
where the vector
i
contains the unknowns .rr as its components and the vector f, contains the
multiplying both sides by the inverse matrix .A,- I :
constants b; . The solution may be found by
(1.76)
where A.1i is the cofactor of element
aii. Ttrc numerator is the determinant of the maffix
formed by removing the ith column of .N and replacing it with the vector 6. Result (1.76) is
called Cramer's rule. It provides a formal solution to the equations, but it is rarely the best
computational procedure.
In the Gaussian elimination method,36 we use the first equation to eliminate 11 from
all of the remaining equations. Then we use the second equation to eliminate x2 from all
the equations below it, and so on. The final equation is then solved for x1y, and we then
substitute back up the sequence of equations to find all the unknowns. This algorithm is
easily programmed and is well suited to the characteristics of modern computers.
The homogeneous set of equations
Ai:0
a
nontrivial solution for the vector
are
i if and only if the determinant of .A, is zero. One
ion of the set of equations is that the row vectors with components a1i, a2i,
each perpendicular to the vector i. Thus, all three vectors must lie in the plane
to i. This means that the triple scalar product3T of the three vectors must be
or, equivalently, that the determinant lAl
0.
:
Change of Basis
Tfansformations
elements of a matrix that represents a linear operator in a vector space depend on the
vectors chosen for the space. If we change the basis vectors, then the elements of the
for example, Golub and van Loan, Chapter 4,
Section 1.1.2. Compare equation (1.32) with equation (1.71).
52
cHAprER 1
DEScRTBTNG THE UNTvERSE
matrix change, even though the operator is unchanged. The new basis vectors are linear
combinations ofthe old ones, so the operation ofchanging the basis is also represented by
a matrix. Let's call this matrix C. Thus, if a vector has components .{r in the original basis,
its components in the new basis will be x',, where
,';:l
N
j:r
r;1*1
0.77)
Now suppose an operator represented by A maps the vector
* to a new
vector
]:
N
ti:lai1x1
j=r
In the new basis, the operator is represented by .A/ with components at,, and
N
y',:\a'i1x'1
J:I
Thus,
/
"t-\-k:t
rt
,
:\-L
k:l
N
:\-Lxj
j:r
or, in matrix notation,38
i' :
Now the components
)c
j
CAi
are related to the components
xl
by the inverse of C,
x; :
Di(C-t)1^xl,so
i' : cAC-li' :.A,'i'
Thus,
,A':
C.A,C-l
(1.78)
38In this notation, i and i' represent the same vector, but the corresponding sets of components are expressed
with respect to different bases.
53
1.6 MATRICES
Equation (1.78) describes a similarity transformation. It is sometimes written in terms
of the matrix 'lf
: C-l:
.A.': T-1.47
(r.19)
The elements of the matrices C and lf are the components of the new basis vectors in the
original system. If a vector i with components x; is to be the first basis vector in the new
system, then its prime components are x\ : I, x'i : O if i I l. Thus,
rr:tt;ixt,-1;,
l
ri
Thus, the column vector
is the first column of the matrix
r, of the 7th basis vector form the jth column of 'lf
: C-l.
11.
Similarly, the components
We often want to find the basis that makes the matrix .A corresponding to a given operator
simple as possible. The simplest matrix is one with nonzero elements along the diagonal
and all other elements zero. Then if
61, the components of are simple multiples of
as
i: yi :
/:
Lixi.In particular, for
f
basis vector 6; that has only one nonzero
component, A'6i
),;6;. This relationship among vectors must remain true in any basis,
so there exist N vectors
such that,Ai;
l.iii. These N vectors are the eigenvectors of
the matrix A, and the constants ).; are the corresponding eigenvalues. The matrix.A/ has the
(suitably normalized) are chosen as the
values ),; along the diagonal when the vectors
the components of
-
a
:
i;
i;
basis vectors.
To diagonalize the matrix, we must solve the equation
(.4-).ll)i:0
where ll is the unit matrix. This equation has a nontrivial solution for the vector
det (.4
- lll) :
g
i
only
if9
(1.80)
This equation is called the characteristic equation. If the matrix is a2 x 2 matix, the characteristic equation is a quadratic equation; if the matrix is a 3 x 3 matrix, the characteristic
equation is a cubic; and so on. Thus, there are at most N real distinct eigenvalues. There
may be fewer than N if the eigenvalues are not distinct or if the roots are complex.
The eigenvalues of a real symmetric matrix are real. Let's see why. If l. is an eigenvalue,
then there exists a vector * such that
A*:
).i
both sides and take the complex conjugate4o:
i.A,*
: iA :.1,*i
See Section 1.6.4.
that we are working with vector spaces in which all the vectors are real.
54
cHAprER 1
DEScRTBTNG THE UNTvERSE
(The process of transposing does not change the right-hand side, except that the column
vector becomes a row vector.) Now take the inner product with i in both equations:
iAi : ),*'i: ).*i'i
Thus,
(),
i .i
Since
cannot be zero unless
i :
-
0,
:
),*)i .*
l :
o
).+ and the eigenvalues are real.
If
if
two eigenvalues are distinct, then the corresponding eigenvectors are orthogonal.
For
Ai1
:
'11frt
and
A*2:7'1'
then
*1A*2
: )'z*t'*'z
i2.Ai1
:
and
.l'riz
'ir
Transposing the latter relation and subtracting from the former, we have
0
:
(),2
Since the two eigenvalues are not equal,
Example
1.9.
-
).r)
i1 .*z
ir . iz
:
O
and the eigenvectors are orthogonal.
. (/t 2\
i -
Diagonalize the matrix
o)'
I
The characteristic equation is
Il-r-,r. 2
-'- - l:0
I
| 2 -)'l
-i.(3-r,)-4:0
L2 -?,).
- 4:o
So the eigenvalues are
,_3*JqT6_3+5_
:
^:
2
1A
2 --r'-
1.6 MATRICES
The diagonalized matrix is thus
, /-' n)
A:(
o 4/
We have solved the problem, but we can learn more from this example.
The two eigenvalues are real and distinct, so the eigenvectors are orthogonal. Their
components are found from the equation
2xy-)"x2:Q
Inserting the values of .],, we have
2x1
So,
with
- -x2
or
xr :2x2
xt: -1,
*t : (-1,2)
and, with
xz: ll,
*.2
:
(2,1)
These vectors may be normalized to provide unit vectors as a basis. Let's verify the
relation ,\*
i,i for these two vectors:
:
(1 3)(-i):(-\o):(_;
):-'(-;
)
()3)( ? ): ('|', ): ( I ): ^(?)
required.
We diagonalize the matrix by changing to a new basis with an operator (matrix)
C. Then ,\' : C.NC-I (equation 1.78), and the elements on the diagonal of .A/ are
the eigenvalues of .4. The eigenvectors of .4, are the new basis vectors. Then from
equation (I.77), the components of the kth eigenvector are related by
as
,,{k) _ \-1c-t),ir$t,:
--r
,,
!{c-r)i j3j*:
(c-r);r
J
Thus, the elements in the ftth column of C-l equal the components of the kth eigenvector. If the matrix is orthogonal, then these are the elements of the ftth row of C.
Here, the matrix that effects the transformation is orthogonal and has elements
':+(-" ?)
55
56
cHAprER 1 DEScRtBtNG THE UNIvERSE
Let's verify that this does the job:
cAC_
:+(_" ?)() 3)+c:
:i(-j ?)(-i t)
?)
o\
-(-r
4)
0
\
which is the matrix A/.
In this example, C turns out to be orthogonal. In fact, since the diagonal matrix A/ is
symmetric and an orthogonal transformation maps a symmetric matrix to another symmetric
matrix,4l any symmetric matrix ^A is diagonalized by an orthogonal transformation.
If a matrix has two or more equal eigenvalues, then we cannot prove that the eigenvectors
are orthogonal, but they may be.
Example
1.10.
Find the eigenvalues and eigenvectors of
tr,..utri* ( 3
2-)"
0
-1
-1
0
2-).
0 3-),
0 l:s-
Thus, the eigenvalues are I : 1 and tr
Next we seek the eigenvectors.
:
t5),.+7)"2-1.3:-(),-1)()'-3)2
3, the latter being a repeated root'
(ii sX,):^(l)
('-:ti,) :^(l
First we take the case of i.
:
1:
2x-z:x+x:z
3y:y*/:0
-xl2z:z+z:x
Thus, this vector
41See Problem 48.
i1
-;
)
\-t o 2)
First we find the eigenvalues:
0-
S
has components (x, 0,
x) for any x.
)
16 MATRICES 57
Then, with.l.
:
3, we have
2x-z:3xlx:-z
3Y :3Y t Y is arbitrary
-xl2z:3zlz:-x
Thus, these eigenvectors have components (x, y,-x) for any values of y and x.
Notice first that each of these vectors is orthogonal to i1. Second, we can find two
such vectors that are orthogonal to each other:
(x,y, -x)' (r,u, -u) : xu I yu I xu : O : 2xu -l yu
Thus, with x,u,and y chosen, we simply pick u : -2xuf y. We can always
this, provided that y is not zero. For example, we might pick
i3 : (1, -2, -l). Let's check that i2 is an eigenvector:
/zo
l-li
as
i)(_i
required. You should check that
):(
i2 :
(1, 1,
-1)
do
and
:,(_i)
i)
i3 is also an eigenvector.
If two matricas commute, then if their eigenvalues are all distinct, they have the same
eigenvectors. For example, suppose that AIE
lE,A and.Ai
l,*. Then
:
lB.A,i
:
: lB),i:
).lBi
But
BAi:
AlEi
AiBi:
l.lEi
Thus,
Thus, iEi is also an eigenvector of A with eigenvalue ).. If the eigenvectors of ,A are distinct,
then IB* must be a constant times i, and thus is also an eigenvector of IB.
i
Congruent Tbansformations
A
set of linear equations results when we consider small oscillations of a physical system about its equilibrium. We can often analyze such a system most efficiently using
Lagrangian mechanics. The Lagrangian of a system contains a kinetic energy term42 K
which is quadratic in the velocities and a potential energy term V which, in the case of
42In Lagrangian mechanics,
the symbol
the
transformation matrix, we'l1 use
? is often used for kinetic
K for the kinetic energy.
energy. But since we have already used
lf for
58
CHAPTER 1 DESCRIBING THE UNIVERSE
small displacements from equilibrium, may be expanded in a Taylor series with no linear
term. (This is the equilibrium condition.) Thus, the potential energy term is a quadratic
function of the coordinates. Each of these quadratic forms may be expressed in terms of
matrices:
r-dxT*dx:
v:xrvx
dt dt
where the vector xr on the left is a row vector and the vector x on the right is a column
vector. Furthermore, both matrices are symmetric. The motion of the system looks simplest
when we choose our coordinates to correspond to the normal modes of the system. But we
don't usually know what those modes are until we have solved the problem.
The normal modes of the system can be described by vectors x(N). Thus, to simplify
the system, we need to change our basis to make these vectors our basis Yectors. Let's see
what happens to a quadratic form when we change the basis. The energy is a scalar under
coordinate transformations, so
: x'zVx/: xrVx:V
V/
Butx:'lfx',so
x'zv'x'
:
(Tx')?Vllx'
:
x'z'llz\y'Tx'
and thus
V':
Tzv1f
(1.8 1)
This transformation is almost a similarity transformation (equation 1.79), but here the
transpose of 1f appears instead of the inverse. It is called a congruent transformation.
1l-1 and the two transformations are identical.
transformation is orthogonal, 'lfz
If the
:
Example 1.11. A system has two identical pendulums connected by a spring. Each
pendulum has a point object of mass rn on the end of a stiff but massless rod of length
l.Eachpendulum is attached to a pivot on the roof. The two pivots are a distance ss
apart. The two point objects are connected by a massless spring with spring constant
k and relaxed length s6. Find the possible motions of this system.
This system is most easily analyzed using Lagrangian methods.43 In equilibrium,
both rods hang straight down, and the spring exerts no horizontal force. Figure 1.22
shows the system when displaced from equilibrium. The kinetic energy is
K_
43See Optional Topic E.
:-rle)'.(#)')
1.6 MATRICES
FIGURE 1.22. A system of coupled pendulums (Example
l
I
l). The two objects are connected by
a spring.
The system has both gravitational potential energy and elastic potential energy. To
compute the gravitational potential energy, we put the reference level at the level of
the two pivots. Then
Vs
:
-mgl(cos 0t
*
cos 92)
The length of the spring in Figure 1.22 is a complicated function of the angles, but
we can obtain a much simpler expression if the two angles 01 and 02 remun small.
(This is the small oscillation approximation.) Then with d1 ( I and 0z 11 l, the
two objects are separated by a distance so * {(02 - dr) and the elastic potential
energy is
l"
V,:U**P2-0)2
Now applying the same constraint (0 < l) to the expression for the gravitational
potential energy, we obtain the total potential energy:
v
I
: -mB(I,/ - 70?- 7e"2\+ rke@z
- o)2
^
)
:
-2ms( +)telgt-t mg) +
)ulWt * ^s) -
kr?uzlt
Then the Lagrangian is
L:K-V-
:-el(#)'.(#)'l *2mgl
-)telo,t-t *d - )te:l.*t I
mil + kt20z0r
60
CHAPTER 1 DESCRIBINGTHE UNIVERSE
and Lagrange's equations are
d
aL
dtnF,d2o,
oL _n
^
mt"---J
-t(ktI*g)il*kt20z:0
(l.82)
dzg,
- t (kt * ^g) 0z * klzil :0
(1.83)
dtz
^
ml."---J
dtz
These coupled linear equations show that the motion of one pendulum influences the
motion of the other.
We shall now formulate this problem in the language of matrices. Note that both the
kinetic and potential energies may be expressed as products of a matrix and a vector
containing the velocities or the coordinates of the two pendulums.
| ^ / dor do'>'
y:)mtzt=:,=)(I
2
\dt'dt.
/
det
\
?)t l;,
(1.84)
I
\ dt /
and
V
-t mg) +
: -2mgl *
*
)telO,t
)tel<*t
: -2mgl I
)^t2re,", (
mg)
-
* *ox
:,i r)
kt20z0r
t;;
I
(1.8s)
Here the coupling is indicated by the off-diagonal terms in the matrix. Notice that we
have divided the symmetric termin9fi2 into two equal parts, making the off-diagonal
terms equal and the matrix symmetrical.
To find the normal modes of the system, we should change the basis to obtain
a diagonal matrix, indicating that the oscillations in the new coordinates are uncoupled.
Since the potential energy matrix is symmetric, we know it can be diagonalized by
an orthogonal transformation. A transformation of the type
V:lfrVlf
must also be applied to the matrix in the expression for kinetic energy, but since that
is the unit matrix, it remains unchanged if the transformation is orthogonal:
K'
:'lfrllr :
lfr1f
:'lf-l'll:
ll
1.6 MATRICES
The characteristic equation rs
kp
-+-ml
_L
m
-0
/k e
l-+:-t (.
\rn
:Q
__r k
m
and so the eigenvalues are
I *rL
x:1,
lLm
The eigenvectors are given by
rk c
(;*7
_.L\
Ir
\-;
o !,lG):^(":)
;-v/
(
",,):iG:)
In the first mode,
\t
tr):
,v/
Ltt,-gz):o or
m
iG)
ot:oz
In this mode, the two pendulums oscillate in phase, and the distance between them
stays fixed. The spring does not affect the system.
To find the other mode, we use the second eigenvalue:
(!:,: :l; . i;,,) : r' .,*) (z:)
k
--m
(0t
-10) :
Q
61
62
cHAprER 1 DEScRtBtNG THE UNIvERSE
Here 91 : -02,the two pendulums are 180' out of phase, and the spring is alternately
stretched and compressed.
We want to find a solution in which the system oscillates, so each angular displacement may be written in the form
xi : Ai cos(ait I
Qi)
:
-co! xi , and equations 1.82 and 1.83 take the following form:
Even mode: For 02 - 0y,
Then
d2
x; I dt2
-,?or
-(h*X)"+Ler:o
-1302
+Loz:o
-(L+9)e'
(/
m
\,fl
The equations are identical, as expected, and the solution forthe angular frequency is
0)e:
This is the oscillation frequency for a single pendulum oflength
Odd mode: For the second mode,02 : -01,
-azot
I, as expected'
* 9) o, Le, :o
- (!
\m t./ - m
This mode has a higher frequency because the spring provides an additional restoring
force.
If we initially displace the two pendulums by unequal amounts 0s and 9s, then we
must decompose the displacements into two parts, one corresponding to each normal
mode:
91
(o)
o2(o)
: to :
:
,u
:
ti#
to
*
;tu
to;lu
to
;to
The first term is the even mode, and the second is the odd mode. Thus, the system
evolves as
o1e)
: (r+)
"o,
| f,, .
(r+) "." l rL3,,
1.6 MATRICES
E /eL!!\ro,f;" *t,
oz(t):(eo+et\
2
\
)cos1/7t-l-z / u m t
We close by showing how the Lagrangian may be written in the new coordinates. Let
and
4
+ @t i- o).Then
: | {il - o)
L
:
)^
P
^h
:
lf**)'
-)tfO
+
,.lr)2(rrt
*
-t mg)
70,,
;,r,,)'l *,^,,
- )tfO - ,h)2(t t-t ^g) + trz1q + D@ - th)
: mt2lr*l' . (#)' *,x - o'x - r', (,*;)l
The absence of cross terms
nates. In matrix language,
L:m(2(#
Qlt shows that the motion is decoupled in these coordi-
/do
\
\dt
/
#)G Dll;l
* m(z1q.r,
(t,*o * fll
X)
: *t2(Q,,r, (X -r,,
,*
*oX _
.t2mgt
,,) (rr) * r*rn
The eigenvalues ,1. of the potential energy matrix are in fact the squares of the frequencies of the normal modes.
The transformation matrix has the components of the normalized eigenvectors as
its columns,
"-z\r _l)
rf
-
| (l
64
and
cHAprER 1 DEScRtBtNG THE UNIvERSE
it is orthogonal. Thus, the transformed potential energy matrix is
kgk
V':1f
?\
--l11
mLm
I -l )(
kk
mm -+
g
.(
g
I
11
:( 1 -l )(
2kl*gm
;)(1
_l)
mL
2k[.
*
Sm
m(.
)
0\
(:
2k(+
sm
m(.
/
tI
and it is diagonal, as expected. The diagonal elements are the eigenvalues of the
matrix.
The Lagrangian contains two matrices, and we have succeeded in making them both
diagonal simultaneously. The question then arises: Can we always do this? We can if both
matrices are rcal symmetric matrices and at least one of them is positive definite. A matrix
is positive definite if
x?.Ax
r0
(1.86)
for an arbitrary vector x. In the example above, one of our maffices was the unit matrix,
and the eigenvalues were found from the equation
det (V
- lll) :0
More generally, the equation for the eigenvalues in a Lagrangian oscillation problem is
det (V
where
K is the kinetic
- i.K) :
0
(1.87)
energy matrix. This matrix is always positive definite, and both
matrices are symmetric, so simultaneous diagonalization is possible in this case. (See Problem 53.) The resulting eigenvalues are called generalized eigerwalues. The eigenvectors are
solutions of the matrix equation
Vx:,i,Kx
The transformation in this case need not be, and usually is not, orthogonal.
PROBLEMS
65
1.6.6. Matrices and Quantum Mechanics
One very important application of matrices in physics is in quantum mechanics. The state
of a system is represented by a vector in a finite or infinite-dimensional vector space. Any
physically measurable property of the system may be obtained by allowing a matrix to
operate on the state. The matrix represents the physical operator. The eigenvalues of the
matrix represent the possible values of the physical measurement. The fact that matrices
do not commute, in general, leads to some interesting physical consequences, such as the
uncertainty principle.
PROBLEMS
: fl
i
,
I
I
|
Circular motion. Answer this question without using Cartesian components. A particle
it moving around a circle with angular velocity <ir. Wrlte its velocity vector i as a vector
product of 6 and the position vector i with respect to the center of the circle. Justify
yout expression using geometrical arguments. Differentiate your relation for i, and
hence derive the angular form of Newton's second law
(equation 1.8).
(i :
1d) from the standard form
the vector i : (1, 2,2) and perpendicular to
cts. Determine the transformation matrix .A
ordinate system with x'-axis along fr and yt-
: (-20, g, 12),zurd fr : (0, -4,3) are mutu-
ne the transformation matrix that transforms
stem to a system with x'-axis along il, y/-axis
ansformation to find components of the vectors
I, -2) in the prime system. Discuss the result
lectric and magnetic fields
i
and
i.
Show that
r\u,
)" "u
n
locity
i : io *
e6, where 6 is a unit vector
1)
and
fl : Bo(I, -2,1), set up a
{o(1,-t,
g E x B and y'-axis along E. Determine the
etermine the components of i(r) and i(r) in
e a criterion for "short time."
y <i. Using cylindrical coordinates with z-axis
of the velocity vector i at an arbitrary-point
in cylindrical coordinates to evaluate V x i.
66
CHAPTEB 1 DESCRIBINGTHE UNIVERSE
conservation of mass in a fixed volume V, use the divergence theorem to
derive the continuity equation for fluid flow:
6. Starting from
0p
At
7.
+i.(pi):o
where p is the fluid density and i its velocity.
Find the matrix that represents the transformation obtained by (a) rotating about the xaxis by 45o counterclockwise and then (b) rotating about the y'-axis by 30' clockwise.
What are the components of a unit vector along the original z-axis in the new (doubleprime) system?
[8] ooes the matrix
/ cosl
sind 0 \
I sin0 -cosd 0
o t)
\o
9.
I
represent a rotation of the coordinate axes? If not, what transformation does it represent?
Draw a diagram showing the old and new coordinate axes, and comment.
Represent the following transformation using matrices: (a) a rotation about the z-axis
through an angle z/3, followed by (b) a reflection in the line through the origin and in
the x-y plane, at an angle 2n 13 to the original x-axis, where both angles are measured
counterclockwise from the positive x-axis. Express your answer as a single matrix. You
should be able to recognize the matrix either as a rotation about the z-axis through an
angle a or as a reflection in a line through the origin at an angle cv to the -r-axis' Decide
whether this transformation is a reflection or a rotation, and give the value of a' (Note:
For the pulposes of this problem, reflection in a line in the .r-y plane leaves the z-axis
unchanged.)
10. Solve this problem without using Cartesian components. Using polar coordinates, write
the components of the position vectors of two points in a plane: P1, with coordinates
11 and 01, and P2, with coordin ales 12 and 02. (That is, write each vector in the form
i : uri + uo0.) What are the coordinates 13 and 93 of the point P3 whose position
vector is
iz:il
+iz?
Hint: Startby drawing the position vectors.
11. A skew (nonorthogonal) coordinate system in a plane has x'-axis along the x-axis and
y'-axis at an angle g to the x-axis, where 0 < ft 12.
(a) Write the transformation matrix that transforms vector components from the Carte-
x-y system to the skew system'
(b) Write an expression for the distance between two neighboring points in the skew
sian
system. Comment on the differences between your expression and the standard
Cartesian exPression.
(c) Write the equation for a circle of radius a, with center at the origin, in the skew
system.
PROBLEMS
67
E2l Prove the Jacobi identity:
dx
1.3.
$ ri)+6 x @ x d)+i x (d " B1 :o
Evaluate the vector product
(dxilx(ixd)
rn terms of triple scalar products. What is the result if all four vectors lie in a single
glane? What is the result if a, b, and i are mutually perpendicular? What is the result if
b:d?
14. Evaluate the product (d
x 6) . G x dl in terms of dot
products of d, 6,
i, and d.
15. Use the vector cross product to express the area of a triangle in three different ways.
Hence prove the sine rule (see figure):
sino_sinf_sinlz
ABC
C
PROBLEM
E6.lUse the dot product fd
15
- 6l . td - il to prove the cosine rule for a triangle:
,2 : o2 +bz -2ab cosy
17. A tetrahedron has its apex at the origin and its edges defined by the vectors d, f,, and
i, each of which has its tail at the origin (see the figure on the next page). Defining the
normal to each face to be outward from the interior of the tetrahedron, determine the total
vector area4 ofthe four faces ofthe tetrahedron. Find the volume ofthe tetrahedron.
4A vector area is represented by a vector
l-
cross product as in Section 1.1.2,
68
CHAPTER 1 DESCRIBINGTHE UNIVERSE
o
PROBLEM t7
18. A sphere ofunit radius is centered at the origin. Points U, V, and W on the surface of
the sphere have position vectors i, i, and fr. Show that points P and Q on the sphere,
located on a diameter perpendicular to the plane containing the points U, V, and W,
have position vectors given by
i:*
frxi+ixfr+*xd
li, i, fr'l
where I is the angle between the vectors i and i.
cos 0
L9. Show that
ix(io):o
for any scalar field O.
[zO.l rma an expression for
21. Prove the identity
ira.[t :
i x G x t)
(d.
in terms of derivatives of d and 6.
i)6+ 6. V)a+d x (i " B) +6 x (i
Hint: Start with the last two terms on the right-hand side.
22. Compfie i (a ,. 6) in t"r-, of curl d and curl 6.
23. Obtain an expression for i x (di), and hence show that i
@f,tne
x (Oi il
x
d)
:
O.
equation of motion for a fluid may be written
, (#+ 6 vrt) :-ip+pd
where
i
is the fluid velocity at a point, p its density, and P the pressure. The acceleration
due to gravity is E. Use the result of Problem 21 to show that for fluid flow that is
incompressible (p : constant) and steady (0 I 0t = 0), Bernoulli's law holds:
f + \Ou2 + pCh:
constant along a streamline
69
PROBLEMS
Hint: Express the statement "constant along a streamline" as a directional derivative
being equal to zero.
Under what conditions is P
throughout the fluid?
*
lOr2 + pgh equalto an absolute constant, the same
25. Evaluate the integral
$"
at
JC
where
(a) C is the unit circle in the x-y
plane and centered at the origin and
il:x2y* -*y25,
(b) C is a semicircle of radius a in the x-y plane with the flat side along the x-axis, the
center ofthe circle at the origin, and
i:
xy23 + y*2j,
(c) C is a 3-4-5 right-angled triangle with the sides of length 3 and 4 along the
),-axes, respectively, and
and
i^
:.x-x
+ xyy
u
@
x-
C is a semicircle of radius a inlhex-y plane with the flat side along the r-axis, the
center ofthe circle at the origin, and
i:
(2x
-
y3)i
-
(3y2 +
"')y
26, Evaluate the integral
Ir.aA
"/s
where
(a)
S
is a sphere ofradius 2 centered on the origin and
i: x3i +3yz2y -f3y2zi
(b)
S is a hemisphere of radius 1, with the center of the sphere at the origin, the flat side
in the x-y plane, and
i:
x2yz(! -12)
Show that the vector
i:xi-fyy-2zi
70
cHAprER 1
DESCRTBTNG THE
has zero divergence
function @ such that
uNrvEFsE
(it is solenoidal) and zero curl (it is inotational). Find a scalar
i:
and a vector
i.
Vd
such that
i:VxA
[2&l Strow that the vector
n:i
12
has zero divergence (it is solenoidal) and zero curl (it is irrotational) for
a scalar function @ such that
i:
rI
0. Find
Vd
and a vector d. such that
i:ixi
29. A surface S is bounded by a curve C. The solid angle subtended by the surface S at
a point P, where P is in the vicinity of but not on the curve, is given by
[4+
a:
""_ J,
R,
Here
*
i'
-
dal
is an element of area of the loop projected perpendicular to the vector f,. :
chosen origin O, and
*', * is the position vector of the point P with respect to some
is a vector that labels an arbitrary point on the surface or the curve. Now let the curve
rigidly displaced by a small amount d3. Express the resulting change in solid angle
dQ as an integral around the curve. Hence, show that
be
Vo:-i *{oi
JcR
30. Prove the theorems
(a)
l,u,
o,
:
f,ar
dA
(b)
it dv
IrU "
:
fla
'
i,t oo
71
PROBLEMS
31.
Prove
@
f-f-
6adt:/frxYedA
lc
Js
(b)
f
r/(fixvlxidA:$dtxi
Js
Jc
32. Derive the expressions for gradient, divergence, curl, and the Laplacian
coordinates.
in spherical
33. In polar coordinates in a plane, the unit vectors i and 6 are functions ofposition. Draw
a diagram showing the vectors i at two neighboring points with angular coordinates I
and 0 * d0.Use your diagram to find the difference Ai and hence find the derivative
ai I ae.
@
fne vector operator
I
L_ -FxV
i
appears in physics as the angular momentum operator. (Here
position vector.) Prove the identity
ifi' fl : i+ i(i
.
d) +
I : J=
and
i
is
the
;(i x i)
for an arbitrary vector d.
35. Can you express the vector d : (1, 2,3) as a linear combination of the v_ectors d1 :
(1, 1, 1), i2 : (1, 0,
-l), and i3 : Q,1,0)? Can you express the vector b : (1,3,2)
as a linear combination of the vectors il1 , iz , and d: ? Explain your answers geometrically.
36. Show that an antisymmetric 3 x 3 matrix has only three independent elements. How
many independent elements does a symmetric 3 x 3 matrix have? Extend these results
toanNxNmatrix.
37. Show that if any two rows of a matrix are equal, its determinant is zero.
b&lP.ove that a matrix with one row of zeros has a determinant equal to zero. Also show
that if an N x N matrix is multiplied by a constant c, its determinant is multiplied
by rN.
39. Prove that a matrix and its transpose have the same determinant.
i
40. Prove that the trace of a matrix is invariant under change of
basis-that is,
CAC-I) : Tr (A)
is invariant under change of basis-that is,
determinant of
a
real symmetric matrix equals
72
CHAPTER
1
DESCRIBING THE UNIVERSE
the product of two matrices is zero, it is not necessary that either one be zero. In
particular, show that a2 x 2 matrix whose square is zero may be written in terms of two
parameters a and b, and find the general form of the matrix. Verify that its determinant
@Jlt
1S ZerO.
43. If the product of the matrix ,\
: f:
\c
Or\
and another nonzero matrix lB is zero, find
d)
the elements of B. You may find it necessary to impose some conditions on matrix ,4.
so, state what they are.
If
44. Diagonalize the matrix
(iii)
45. Show that a real symmetric matrix with one or more eigenvalues equal to zero has no
inverse (it is singular).
i46. Diagonalize the
orthogonal?
(
? ) -O
^uti* \3! 4/
find the eigenvectors. Are the eigenvectors
If not, why not?
47. What condition must be imposed on the matrix ,A in order that.AlB
:
AC with B
If
B:(l s)
and
+
C?
a:(; l)
find a matrix ,A such that.AlE : AC.
48. Showthatif .AisarealsymmetricmatrixandCisorthogonal,then.A'/:
C.AC-I isalso
symmetric.
,A,lE : IEA if both A and lB are diagonal matrices.
that
the
inner product is invariant with respect to change ofbasis under orthogonal
Prove
[50.l
49. Show that
transformations.
51. A quadratic expression of the form
cv
x2 + 2frxy
I
yy2
:
1
represents a curve in the
x-y
plane.
(a) Write this expression in matrix form.
(b) Diagonalize
the matrix, and hence identify the form of the curve and find its symmetry axes. Determine how the shape of the curve depends on the values of a, p,
and y. Draw the curve in the case u : I :2,y :3.
52. Two small objects, each of mass m, are joined by a spring of relaxed Tength l.Identical
springs hold each mass to a wall. The walls are separated by a distance 31. Write the
Lagrangian for the system, and find the normal modes and the oscillation frequency for
each mode.
a jointed pendulum system. Two point objects, each of mass
linked by stiff but massless rods, each of length l. The upper rod is attached to
a pivot. The system is in equilibrium when both rods hang vertically below the pivot.
The figure shows the system when displaced from equilibrium.
53. Find the normal modes of
tn,
are
PROBLEMS 73
Write the Lagrangian for the system in matrix form. Simultaneously diagonalize the
kinetic energy and potential energy matrices, and find the generalized eigenvalues. Determine the eigenvectors. Find the matrix that effects the transformation, and verify that
both matrices are diagonalized.
PROBLEM
53
CHAPTER 2
Complex Variables
2.1. ALL ABOUT NUMBERS
Numbers are classified emotionally! The Greeks liked integers and rationals (a rational
number is the ratio of two integers).
Integers:
1.2,3.4,...
Rationals:
l3to7
2' 4' 436" "
But they did not like irrationals (irrationals are numbers that cannot be expressed as the
ratio of two integers). Yet irrationals show up in lots of common circumstances, like the
ratio of the circumference of a circle to its diameter.
Irrationals:
e, rr,
4,]rr-2,
..
.
In spite of the name, there is nothing wrong with these numbers, as we all know, and we
have no trouble computing with them.
What happens when we try to solve the following quadratic equation?
f(z):22+z+5=:g
z
It doesn't factor, so we use the formula:
1,J4
:_r=
:.: -r+/T-to2
2
This result does not exist in the list of numbers written above. We can see this geoqnetrically
by plotting the function f (z)
z2 + z +
{figo." 2.1) andnoticing that the function does
:
t
not cross the z-axis. The solution to the dilemma is to add a dimension to our number
system. We define the quantiryl^/J : i so that the solution to the equation is
I
--: -1*
2-ti:x*iy
i
lEngineers often use the notation
j = nE1.
75
'
L
76
CHAPTER
2
COMPLEX VARIABLES
f(x)
-2
FIGURE2.l.
-1
0
(x):
Thecurve f
12+r+512doesnotintersectthe.r-axis,sotheequation
has no real solutions.
f (x):O
We now need two real numbers, x and y, to describe the complex number z. The number
y multiplying the i is called the imaginary part,2 while.x is the real part. Now the original
expression becomes a function of the two real numbers x and y:
z2+z+
Since i2
5
5
t :(x*iy)2+x-liy1: , :
x2 -l2ixy
I i'y'
+x
I
iy
5
I ,
: -1, this expression becomes
f(z)
:
fi(x, y) r ifz@, !)
:
x2
-
y2
+*+
5
t -t iy(2x * t)
To plot this expression we would have to plot the two functions -fi :
and f2 - y(2x + 1). To obtain solutions of /(z) : 0, we require thar
*2 - y2 + x + ]
f1 and f2 be zero
simultaneously. By allowing the extra dimension, we find that both figures intersect the
plane f : 0 at two common points x : -+, y : ++..
By allowing ourselves to use complex numbers with both real and imaginary parts, we
are able to solve an enormous number of equations that do not have real solutions, notjust
quadratics of the type we considered here.
Complex numbers satisfy all the usual algebraic rules, so long as we remember that
i2
:
-1. Then, for example,
Addition:
u,,',i
2**'
: l' l,',1',,i ;] l',1 : ;,
5+t
2The name is unfortunate; these numbers are not "imaginary" in the normal sense of the word. In fact, they prove
extraordinarily useful in solving problems in the rea.l world of physics.
77
2.1 ALLABOUTNUMBERS
Subtraction:
(a'fib) -
(c
* id):
+ si) -
(3
- 4i) :
(2
(a
(2-
c) + i(b
3)
+
(s
-
d)
+ 4)i
: -l +9i
Multiplication:
: ac * i(bc 'f ad) + izaa
: ac - bd -f i(bc t ad)
(2+5i) x (3 - 4i):26*7i
(a
* ib) x (c *
id)
(2.1)
Division is trickier if we want to write the result in the standard x * iy form. As a preamble,
noticefromequation (2.1)that,ifwetakea - c,theimaginarypartoftheproductiszerofor
d: -b.Thus, theproductof anumberz : a *ib with its complex conjugate z* : a -ib
is purely real.
The complex conjugate z* of any complex number z is formed by changing the sign
of its imaginary part.
The product of a complex number and its complex conjugate is purely real and nonnegatrve:
zz*
: (a:i
ib)(a
-
ib)
:
a2
+
(2.2)
b2
We can make use of this result when dividing two complex numbers. We multiply the top
and bottom of the ratio by the complex conjugate of the denominator:
1!:#i: (T.*)(=#)
xtxz
I
yryz + i(y1x2
*l+fi
-
xrx2l yty2 . .(ytxz -
x1y2)
*]+f, "
xtYz)
*]+fi
So we may write the result of the division in the standard form:
(z+si) _ (2+si)
(3
- 4t)
(3
-
(3
4i1 G
+4i)
+
: _!25*T,
:,l!+T,i
2s
(3') + 4'z)
4i)
2.1.1. The Argand Diagram
The Argand diagram (or complex plane) is a diagram in which we plot complex numbers
as points with x and y coordinates equal to the real and imaginary parts of the number,
L-
78
CHAPTER
2
COMPLEX VARIABLES
respectively (Figure 2.2). Since we may equally well describe a point in the plane using
polar coordinates (r, 0), we can express the number as
z:x+iy:r(cos9*isin0)
(2.3)
where r : J*4 y'is the length of the line from the origin to P and is called the
amplitude (or absolute value) of z: r : lzl. The angle measured counterclockwise from the
x-axis, 0 : tan-r(ylx),is called the argument (or phase) of z:0 : arg (z). As x -+ 0,
y/x
-->
-|ooforpositivey,and9
-+ -n/2asx -+ 0.
< 2r or
--> +TTl2.Similarly,fornegativey,0
By convention, we choose a range of
-t <0 <n.
2r
for the angle 0. Usual choices are 0 < 0
v' Im (z)
-r, Re (z)
01/2
FIGURE 2.2. The Argand diagram allows
is described by Cartesian
usf
plot complex numbers as points in a plane. The point
or polar (z : ,"ie) coordinates.
Q,L
/ x + iy)
There is a nifty way to express z in terms of the polar coordinates
0z
ae
- r(-
sind
* i cos0) :
r
and 0 . Notice that
i7
We can integrate this differential equation to get
z
-
g(r)eio
at 0 : 0 and compare with the original expression (2.3) in terms
and sin 9, we see that we should take g(r) : r. Thus,
If we evaluate this result
of cos
I
z:
rei9
(2.4)
2.1 ALLABOUT
NUMBEHS 79
which is the polarform for z. In this form, multiplication and division are really easy:
:
ZIZ2
r1r2si(0t*02)
and
zl _ rt ^i(ot_oz)
z2 12
Comparing relations (2.4) and (2.3), we see that we may write
eio:cos1 f isin0
a
(2.s)
relation called Euler's formula. Changing the sign of the angle, we find
e-io
:
cosd
i sin0
-
Then, adding the two relations, we have
cos 0
eiq
+ e-iq
--
(2.6)
2
and subtracting, we have
sin0:
eiq _ e-iq
(2.7)
2i
These expressions are similar to the definitions of the hyperbolic functions;
coshx
: 'x + e-x and
22
sinhx
: e' - e-'
t
Thus, we have the relation
cosh
: cosd
(l0)
(2.8)
and, conver'bely,
cos
:
(l9)
cosh
d
(2.9)
Similarly,
sinh (i 9)
: I sinp
(2.10)
I sinhO
(2.1t)
and
sin (i 0)
:
80
CHAPTER
2
COMPLEX VARIABLES
a complex number is not necessarily a real
number between - 1 and I . When using complex numbers, we must recognize that functions
may behave differently than they do when we work with numbers that are purely real.
Using these relations, we note that the cosine of
Example
2.1.
sin (2i
*
Evaluate sin(2i
n
/4)
:
,i
-l r /4).
(2i -fn I 4)
_
2i
e-2(cosn f4
:
i
(2i+n I 4)
"-
*
rt (, sinh 2 *
;
I sin ir l4)
-
e2(cosn l4
-
i
sinn /4)
2i
cosh 2)
:2.6603 + 2.5646i
Notice that both the real part and the absolute value of this number are greater than
1.
Algebraic operations on complex numbers can be regarded as geometric operations in
the complex plane (Figure 2.3, a,b, c). For example,
Addition of complex numbers <----> Vector addition (Figure 2.3a)
Im (z)
Re (z)
012
FIGURE 2.3a. Addition of complex numbers is equivalent to addition of position vectors in the
Plane. Here
zZ:
zt
I
zZ.
Using this diagram, it is straightforward to prove the inequalities
lzr -f zzl
and
lzt-zzl
'
< lzrl * lzzl
(2.t2)
llzrl -lzzll
(2.r3)
2.1 ALLABOUTNUMBERS
81
Complex conjugate <---+ Reflection in the real axis (Figure 2.3b)
Im (z)
Re (z)
FIGURE 2.3b. The complex conjugate is formed by reflection in the real axis.
Multiplication of two numbers <----+ Magnification plus rotation
For example, in Figure 2.3c we show the product
zt x
z2
-
2"ir
14
t
3"in 16 - 6"i5tt lr2
Im (z)
Re (z)
FIGURE 2.3c. Multiplication is equivalent to magnification plus rotation. Here multiplication by
3ein /6 implies magnification times 3 and rotation through art angle r /6.
82
CHAPTER
2
COMPLEX VARIABLES
2.1.2. Roots
Since e2oi : cos (2n) * i sin (2n) : l, we can add2n to the argument of any complex
number without changing its value. That is,
z:
reio
:
rri(o+2n1
Now let's see what happens when we take the nth root of this number:
_ (reiolt/"
_ rl/nri9/n
zr/n
This is the primary root. Now add2tt to the aryument of z and take the root again:
zrln
- ,tl" e*p(ito +Znlln)
_
,1
In
,i0
/n
,2ri
In
This root is distinct from the primary root. We can continue to add
2n (n - l)
times
before we repeat a root. Thus, there are n distinct nth roots of any nonzero number. They
are given by
,1/n
<'
:-' rl/noi(0*2nm)/n
w
eJ4)
r
:rt/nl"o"(t *'"^) *,r,n (t *'"^\1,
n
n
L \
/
Example 2.2. Find the cube roots of
First write
1
:
lexp
\
m
: l:
l/3
eo
1t/3r2ti/3
o
<m
<n
(2.t5)
1.
(2rim), m:0,1,2
Then the roots are exp (2n i m I 3), with m
m:0:
/J
:
0, 7, and 2.
-,7
: cos(2r/3)*isin
(2n13)
| .Jj
: --+r22
and
m:2:
l/3"4ti/3 : cos(4r13)* j sin @n/3): -)-,f
Tbking m > 2 does not give any new roots-we simply repeat the values we have
already found. The roots fall at the vertices of an equilateral triangle in the complex
plane (Figure 2.4).
2.1
ALLABOUTNUMBERS 83
Im (z)
Z2
I
0.5
-0
Zl
0.
-t
Re (z)
05
-0.5.
I
-1
FIGURE 2.4. The cube roots of unity lie at the vertices of an equilateral triangle. One vertex lies on
the real axis at the point z
:
l.
The nth roots of a number always fall at the vertices of a symmetrical n-sided figure
the complex plane. For example, the fifth roots fall at the vertices of a pentagon.
in
2.1.3. Complex Functions
Mappings
A complex function u(z) takes the numb er z : x *iy and generates a new complex number
l.D -- u * iu. Since a plane is needed to plot a single qrmber, complex functions cannot
be represented in a single two- or three-dimensional dialram. We would need 2 x 2 : 4
dimensions. Thus, instead, we use two diagrams, and we think of a complex function as
a mapping of the complex plane onto another copy of itself.
Example
2.3.
Describe the mapping
W:In terms of the coordinates
w
.r ,
I
z
y or r, 0 , we have
:u +i, : -] . :'r].i!. :l
:
x+ty x'+y'
r "-,t
peiQ
We can map out in the u.,-plane the image of a curye in the z-plane. For example, the
unit circle in the z-plane (r : l) mapsto a circle (p : 1) in the u.r-plane, but traversed
in the opposite sense (Q : -0). The inside of the circle (r < 1) in the z-plane maps
to the outside of the circle (p > l) in the u.r-plane, and vice versa (Figure 2.5). This
function gives us the complete u-plane from the complete z-plane, one to one.
84
CHAPTER
2 COMPLEXVARIABLES
FIGURE2.5. Thefunction(mapping)
u:
l/zmapstheinteriorof thecircle lel
: l totheexterior
of the circle u I : 1. Traveling around the unit circle in the z-plane counterclockwise
corresponds to traveling around the unit circle in the u.,-plane clockwise.
I
Multivalued Functions and Branch Cuts
Some functions don't work that way. The log function exhibits some of the possible curious
behaviors:
w
:
lnz
:ln
(reie)
: lnr * i0 :
u
I
iu
If we take all possible points in the z-plane with 0 < 0 < 2n, we get only one strip of the
ur-plane,0<u<2n.Thisisthefirstbranch,orprincipalbranch,ofthefunction'Toget
the whole w-plane,we have to go around the z-plane more than once; that is, we have to add
l2rn
to the argument of z. (We can imagine the plane as a pad of paper containing many
I increases
also. (We move to the next sheet in the pad and to a
new branch of the function.) The positive x-axis forms the branch cut of the cornplex plane
for this function. As we cross the positive r-axis, we move from one sheet to the next, or
to a new copy of the plane. The branch cut-here, the positive x-axis-acts as a ramp that
allows us to move from one sheet to the next. With the log function, we have to go around
infinitely many times to generate the whole ra-plane. Thus, this funption has infinitely many
branches. (Our pad of paper has infinitely many sheeti.) With n : 0, we getthe principal
sheets, each labeled by the integer n.) Every time we cross the positive.r-axis,
by 2n, and thus u increases by
2t
branch of the function.
If we choose to define 0 such that -r < e < z, then the branch cut lies along the
negative x-axis at 0 : n. Thus, we can phoose the position ofthe branch cut by choosing
the range of values for 0. The choice tha[ we make affects the value of f (z) . For example,
consider the point zo that lies on the negative imaginary axis at a distance lzol : 2 from
the origin (Figure 2.6, a,b). With the branch cut along the positive real axis, and hence
0 < 0 < 2n, zohas argument 0 :3n 12, and so for the principal branch
ln z0
lbranch cut on positive real
axis: ln 2 *
3n
,.
t
However, with the branch cut along the negative real axis, and hence
-n < 0 < 1r, zohas
2.1 ALLABOUTNUMBERS
FIGURE 2.6a. Branch cut on the
positive
FIGURE
x-axis. The poirit z6 (x : 0,
| : -2) has argtmett3tr f 2.
argument 0
: -r
85
2.6b. Branch cut on the negative
-x-axis. The point zg has argu-
mert
-n /2.
12, and so for the principal branch
ln zo
luranctr cut on negative real
axis:
,r
h 2 - -t
2
Since the two values are different, so are the two functions.
The cube root function has three branches, since adding more multiples of 2n to the
argument simply reproduces the same roots that we already found with n : O, 1,2.
For both these functions, the branch cut ends at the origin, which is the branch point for
each function. Think of the branch cut as a tadpole with a fixed head (the branch point) and
a movable tail (the branch cut).
2.1.4. An Example from Physics
To demonstrate the use of complex variables in physics, let's look at an electromagnetic
wave. The general expression for a plane wave may be written in the form
y
:
yocos
(f. *. - at i
Q)
*uu"; f , the wave vector, gives the direction of propagation
and also the wavelength, ,1. : 2r /k| o is the angular frequency; and the wave speed is
u : @lk. The phase constant, /, is the wave phase at i : 0, / : 0. For an electromagnetic
wave, the electric wave amplitude is a vector:
where y6 is the amplitude of the
i:
We can
with d
i - c,.tt -f Q)
simplify the expression by choosing our r-axis along
kx - cot + d', we can write
:
E
where
frocos (R'
:
:
io :io"iQ
Eo Re (exp
Re
io
"iQ
(ilkx
e*p
-
(ilkx
f.
Then, using equation (2.5)
ot -t QD)
-
@tl)
:
Re
io
is the complex amplitude of the wave.
exp
(ilkx
-
\
cotl)
(2.16)
86
oHAPTER2ooMPLEXVARIABLES
This mathematical description makes it easier to investigate the propagation of a wave.
Let's start with Maxwell's equations:
Y
.D: pf
(2.17)
i.f,:o
(2.18)
ai
VxE
(2.19)
At
VxH:itt-aD
u
where
D : eE is the electric
displacement, the magnetic induction
(2.20)
i :-
pfr, and H is
the magnetic field. Now assume a solution of theform(2.16) forbothE andB, and substitute
into the equations. For waves in a region where the free charge and current densities are
pf : 0 and j/ : 0, Maxwell's equations take the following form:
Gauss' law (equation 2.17):
zero,
ikeE*
-
ikB"
-
O
Gauss'law for B (equation 2.18):
Q
: O;both i and B are perpendicular to the direction of propagation.
further
by choosing our y-axis along the direction of E. Then from
Now let's simplify
(equation
2.19)
we get
Faraday's law
Thus,
E, :
0 and Bx
aEy
0x
ikE,
_
_aBz
-
iaB,
0t
(2.21)
while from Ampere's law (equation 2.20) we get
(2.22)
- -iaeEy
-ikBJ
LL
Combining equations (2.21) and (2.22), we get
kEy:r\
Since we want a solution with
E, f
/aue \
Er)
*
0, we must have
a1
-:kJw
whichis the wavephase speed. Inavacuum, relation (2.23)becomesl^lf k
(2.23)
:
l/J1t'om = c'
87
2.1 ALLABOUTNUMBERS
Now let's see what happens if we allow the wave to propagate through a conducting
medium with conductivity o, but e and p, real. There will be a current given by
j:
"i,
which must be included in Ampere's law. Equation (2.22)becomes
-ikBJ
u
:
o Et.
-
nr9-
irotE,.* B. -
-i29!
-tK
Now when we combine this equation wirh (2.21), we get
kEr-iif"-iae)p.E,
Again we may cancel the factor En to get a relation between
k2
:
irott(o
a
and k:
- iae): ialto I ro2p,e
(2.24)
We expect the frequency to be a real number,3 but then the squarg of the wave number is
complex! This means that k itself must be complex, so let's write it as
k:rc*iy
(2.2s)
Before proceeding, we should ask whether this idea makes sense. If we put our expression
for k back into equation (2.16), we find
E
:
:
Re Eo exp
(t[(r * iy)x -
Re i, exp (i[rcx ' - io cos (rx ot I
-
rot])
rlrtD exp
(-yx)
(2.26)
Q)e-Y'
thus, the rlave amplitude decreases exponentially as the wave propagates. The wave energy
is converted to kinetic energy of electrons as the electric field drives an oscillating current
in the conducting medium.
OK; let's solve for this complex k. With
k2
:
rc2
-
y2 +2ircy
equation (2.24) becomes
*2
-
y2 -f 2ircy
:
iap,o
I
a2
pte
Equating real and imaginary parts, we get two equations for rc and o:
'2-Y2:'2
l"'
(2.27)
3Thi, ir th" usual convention, but it is sometimes useful to make the opposite assumption, choosing k rcal
complex.
and
a
88
CHAPTER
2
COMPLEX VARIABLES
and
2rcy
- olto
(2.28)
Then, from equation (2.28),
(Dl.LO
r-
(2.2e)
2r
Substituting this into equation (2.27) gives
*4-r21,e*2-(T)':o
We can solve this quadratic in rc2
to get
*r:t!9(r+
2\
Since
r
is real by definition, 12 must be positive, so we take the
*' :"{t
(, *
.\
*
sign:
(*)'
(2.30)
which reduces to our previous result (2.23) when o -+ 0, as required.
Now let's solve for y. We substitute our solution (2.30) for r into equation (2.29):
'2r
@l.Lo
2er+\n+@lr$
Now in the limit of low conductivity (o <<
ra.€), these
results become
which is only slightly different from the result for a nonconducting medium, and
p
^._o
, -ty;
which is small and directly proportional to o, as we might expect.
In the limit of high conductivity (o )) oe), we get
atJ@
T-
A Y*
'l-:
(pl.Ld
2
(2.3r)
89
2.2 FUNCTIONS OF COMPLEX VARIABLES
and
@polz
FO
Y: opo
2*: 2 Urw:Y ,. :*
Now what does this all mean? According to equation (2.26), when the conductivity is
low, the amplitude decreases slowly (y {1 rc), but when the conductivity is high, y : rc
and the wave travels only a short distance into the medium, as shown inFigure 2.7 .
Ey
Es
08
06
04
02
-02
FIGURE 2.7. The magnitude of the electric field vector decreases rapidly with distance inside
conducting medium. Here we show
(the high conductivity limit).
E,
as a
function of .x at a fixed time t , with K
:
a
y
2.2. FUNCTIONS OF COMPLEX VARIABLES
We have already discussed the notion of a complex function as a rnapping of the complex
plane onto itself. It is now time to explore some properties of these mappings.
2.2.1. Continuity
A real function f (x) is continuous at x : a if, for any positive e. we can choose
6 such that whenever lhl < 6, then also
a
positive
lf@+h)-f(a)l<e
/ (x) is close to / (a) whenever x is close to a (for example, see Stewart,
Section 2.5). We can extend this definition to complex functions simply by replacing x with
z and interpreting the absolute value as the absolute value of a complex number. Then
a function is continuous If f (z + ft) is close to f (z) in the mapped plane.
or, loosely speaking,
90
oHAPTER
Example
2 ooMPLEX vARIABLES
2.4.
Showthatthefunction
f (z) : l/z: (l/r)e-id
iscontinuousexcept
at the origin.
Leth
:
n
+
i6.Then, for z
I
0,
I
f(z+n): f@+iy+n+i6): x *rt * t(y +6)
x*n-i(y-1l)
(x -t D2 + (y
*6)2
Thus, neglecting squares of the small quantities 4 and 6, we have
f(z+
h1
i'=
x -f n - i(Y -f 6)
- -'=- fQ) - '(x-lD2+(y*6)2
x2+v2
(rl+,d)(-r -iy)2
(*2 + y2
*
(x2 +2xr1
2xr7 12yD@2
+ y2)
h(x - iy)2
+ y2 +2y6)(x2 + y2)
-+0
asft-+0
Thus, the function is continuous. But the function is not defined at the origin, where
x : ! :0, and so / cannotbe continuous there.
But look at the cube root function (Section 2.1.2)wirh branch cut along the positive
real axis (0 . I < 2n).If we consider only the first, or principal, branch, the function
is not continuous. Let zo : reis and the neighboring point zr - vsi(2r'6') (Figure 2.8a).
Then
: fr(zo) :
wt : h(z) :
wo
but
rr/3 exp (i6/3)
rt/3
exp
(ilZn
-
6]/3)
FIGURE 2.8a. The principal branch ofthe cube root function is not continuous across the branch
cut, here chosen to be along the positive x-axis. The two points zg and z1 are very
close together, but on opposite sides of the branch cut'
The point ur1 is not close to u.,s (Figure 2.8b).
However, there are two more branches of this function:
fz(z)
:
rrl3
exp
(i I2r + el
fi)
2.2 FUNCTIONS OF COMPLEX VARIABLES
91
and
fzk)
: ,i't p (i Vn + el F)
"
: f2(zt) : rr/3 expfi( n - S)/31 and w3 : ftk) :
:
rr/3expti(6n -6)/31
rr/3exp(-is/3) in the ur-plane also (Figure 2.8c). Then we
Let's plot the points w2
that /3(21) lies close to fi(zd. That is, the function is continuous if we are willing to
switch from one branch to another. Points that are on opposite sides of the branch cut are
not close for a single branch of the function. We have to change from one branch to another
see
as we cross the branch cut.
: fi(e6) and
ul : ft(zt) are not close
FIGURE 2.8b. The points u;6
FIGURE2.8c. Thepoint u3
to u;6.
-
-fl(zr)
is close
together.
Functions with More Than One Branch
Point
r
The function f (z) : .tnT? has two branch points, at z : *:i. Thus, there are also two
branch cuts, one starting at each ofthe branch points. We can understand the behavior of
-f i). Let's first choose each branch cut
this function if we factor it as f (z) :
^re= Dk
to run from its branch point upward along the imaginary axis (Figure 2.9a).Then, in polar
coordinates with origin at the point z : i, we write
Z-i:preiLt,
With origin at the point z
wrtn|>Qr
3tr
2
: -i,
: pz"ih,
f2 , Qz,
'- -34
2
Then, for points on the right side of the imaginary axis above z :
Qz: n12, pr: y - 1, and p2: y * l, so
Zl-i
f (z) : J npre@r+D
with
i
, we have @1
: J0 - D(y + t)"in/2 - i 17=
:
n 12,
92
CHAPTER
2
COMPLEX VARIABLES
FIGURE 2.9a. Here we have chosen both branch cuts to run upward. We describe
distances pr and p2 from the points
@2, as shown here.
I and -i
For points on the left side of the imaginary axis above z
:
a
point z by its
and the conesponding angles
i,
Qt
@1
and
: -3n /2 and Qz : -hu /2,
SO
- lxy + D r-i3tr/2 : i{y' - |
and the function is continuoo, u".orl the imaginary axis. Below z : -i, there a.re no branch
cuts and the function is continuous. However, for points between -i and *i, we have
f(z)
:/y
On the right:
tT
f k): t/ ptr-'"12Pzsi'r/2
_y2
- JPtn:
On the left:
f (z):
J;i-E7rer;t ;It : Je
Te
_
i0 + i"-i" _ _u[ _ yz
The function is not continuous across the imaginary axis. This choice for the two branch cuts
is equivalent to having a single branch cut that runs from one point to the other (Figure 2.9b).
i to run upward while that from
On the other hand, if we choose the branch cut from z
>
(Figure
>
then
r
2.9c),
runs downward
z
12 < Qz < 3n 12.
l2but
12 Qr
: -i
Above z
:
:
-3r
j, we have
-r
On the right:
f (z) :
J;A;n;;;n - "io JTn : i
/2
y2_
On the left:
.f (z)
: J;i=tf;/r;O;n -
,-itt /2 4Qy pz :
-i \f y2 -
and the function is discontinuous. A similar situation holds below
there are no branch cuts and the function is continuous.
-i.
1
Between
i
and
-i,
93
2.2 FUNCTIONS OF COMPLEX VARIABLES
FIGURE 2.9b. The situation in Figure 2.9a is
equivalent to having a single
branch cut that runs from one
FIGURE 2.9c, Here we have chosen
one
branch cut to run upward and
the other to run downward.
branch point to the other.
2.2.2. Di|erentiabi I ity
A real function
f (x) is differentiable
f(a*h\- "'f(al
lim"'
h+0
hi
at
x
:
a if, for h >
0,
existsandequals lim
'
i:0
(for example, see Stewart, Section 2.8). Thus, the function
differentiable everywhere, while the function
f
f(a)- f(a-h)
h
f(x) :
x2 (Figure 2.10a)
ifx<0
ifx>0
|. -.r
Q;: {'
f(x)
25
20
l5
10
5
FIGURE 2.10a.
24
The function f (x) :
x2 is differentiable.
ls
94
cHAPTER2ooMPLEXVARIABLES
(Figure 2.10b) is not differentiable at x
:
0 because
-h
lim_-
h+0
h
lim
h-->o
-l : -l
while
-/(o-r) :
i:ohh-ohn-o
l.m /(o)
tin]
o-,(-fr)
:
lim I
:
I
The two limits are not equal, and thus the derivative does not exist.
f(x)
FIGURE 2.10b. This function has an abrupt change of slope at the origin: limr-;g1 df /dx
but limr-6- df /dx : *1. The function is not differentiable at the origin.
: -l
Now with complex functions, we need to let one point approach the other not just along
the real axis, in either direction, but along any line in the complex plane (Figure 2.ll).ln
particular, we must get the same limit for the derivative if we approach z from a neighboring
point along either the real or the imaginary axisa:
.. .f(z+dz)-f(z)l
rrrrr
dz-0
That is, with
----------
/(z)
lim
:
u(x
.. f(z+dz)-f(dl
rrrrr
----l
laz:ax dz-O
dZ
clz
laz:idy
u(x, y) + iu(x, y),
*
dx, y) + iu(x
* dx, y) -
u(x, y)
-
dv)
u(x' v)
iu(x, y)
dx
dx--+O
:
lim
dy-->o
u(x, y
I
dy) + iu(x,
!*
idv
4This condition tums out to be not only necessary but also sufficient.
-
-
iu(x, v)
95
2.2 FUNCTIONS OF COMPLEX VARIABLES
FIGURE 2.11. For
a function to be differentiable at z, we must get the same
the point z along any line in the complex plane.
limit when we approach
Taking the limits, we obtain
0u
u+i
Du l/0u
u:
Eu\ : 0u Du
*'
;(u, u) -' ur* u,
Equating real and imaginary parts, we get the Cauchy-Riemann relations:
0u
0x
0u
0y
(2.32)
and
0u
6x
0u
0y
(2.33)
These relations must be satisfied by any function that is differentiable.s
We may now write the derivative as
df 0u .0u 0u .0u
dz 0x "ax- ay '0y
(2.34)
Conversely, if the Cauchy-Riemann conditions are satisfied and the partial derivatives are
continuous, then the function is differentiable. Consider a small displacement 6z : 6x*i 5y.
Then
ur:(#.,#)u,*(X+tfi)at
5See Problem 17
for an alternative statement ofthe differentiability condition.
96
oHAPTER2coMPLEXVARIABLES
Applying equations (2.32) and (2.33), we find
ur:
(y.,#)
(dx*i uD:
(#+tfr)u
and thus
d.f
-i: ,. 3f : (0, +i.0u\
ol!,* [* u )
for any 62.
Example
2.5'
Show that the function
First note that z2
of the function are
-
(x
*
iy)z
:
f
(z)
:
*2 + zixy
z2 is differentiable'
y2, so the real and imaginary parts
-
u(x, Y) : x2 -
Y2
and
u(x, y)
-
2xy
Then
0u^Eu
Ex
Dy
and
0u
0u
- -1r'
-0y-
0x
so the Cauchy-Riemann relations are satisfied and the partial derivatives are continuous. The derivative is
df
3u .0u I(0u
.0u\
;:i*';:;(ur*'u)
:2(x *
iy)
:22
which is exactly the expression we might expect. In fact, complex derivatives obey
all the usual rules of real derivatives, such as the product rule.
As is the case with real functions, complex functions that "blow up" are not differentiable,
and functions that are not continuous are not differentiable. Thus, functions that have branch
cuts are not differentiable at the cut. However, it is possible for a complex function to be
continuous everywhere but differentiable nowhere. The function f (z) : 7* : r - ly is
one such function. With 0u l0x -- -0u l0y : 1, the Cauchy-Riemann conditions are never
satisfied.
lI
2.2 FUNoToNS oF coMpLEX vARTABLES 97
r
2.2.3. Analyticity
A function / that is differentiable tt I : a andwithin aneighborhood6of z: a is said
to be analytic vt 7 : 6. Thus, a function analytic at a satisfies the Cauchy-Riemann relations within a small but finite region around z : a. The requirement that the function be
differentiable within a neighborhood ofa guarantees the existence ofhigher-order derivatives as well. A function that is analytic in the whole complex plane (except perhaps at
infinity) is called an entire function. Analyticity is a powerful constraint on a function, as
we shall see.
2.2.4. lntegrals
Since a complex function
integral
/(z)
represents a mapping of the z-plane onto the u-plane, any
fz2
I
Jzt
f(oa,
is an integralT along a path that must be specified between the points 7y and zz. Expanding
the integrand, we find
lr','
f {r) o,
:
f,'r'
@
*
iu)(dx
*
i dy)
Defining the vectors d, with components (u,
write the integral as
l,','
f k)or:
f,','
:
f,'r'
-u),
i
u
and
ai+ i
dx
i,
-
u
dy
*
i(u dy -f u dx)
with components (u, z), we may
l,','I
ai
(2.3s)
The Cauchy Theorem
Next we shall proves an extremely powerful theorem about integrals of analytic functions.
Consider the integral from 41 to z2 alonlgpath C1 and returning from zz to zr along path C2
(Figure 2.12).The combined path forms a closed curve C in the complex plane, and result
(2.35) applied to this path gives
f"rrtor:
fri.ai+
i
frt ai
6Aneighborhoodofz:aisasmallregioninthecomplexplanethatcontainsthepointz:astrictlywithin
example, the circle defined by lz - al < e for any e >
it-for
TSee Section 1.2.3 for information on path integrals.
SThe proof presented here has the advantage of being
relatively straightforward, but it requires continuity of the
partial derivatives. An altemative proof due to Goursat that does not require the use of Green's theorem (see, for
example, Jeffreys and Jeffreys, Section I L052) requires only that /(3) be differentiable and C have finite length.
98
CHAPTER
2
COMPLEX VARIABLES
Im (z)
Re (z)
f <O dzis apathintegral in the complex plane. Here the two paths
C1 and C2 canbe combined to form a closed curve C from 31 to z2 and returning
FIGURE 2.12. The integral
Ili
to {1.
Then we may use Green's theorem (equation I .47) to write each integral over C as a double
integral over the surface S spanning C:
t',
Now
if
.f (z) dz
:
l,(#
- x)
dx
dv,'
l,(# - #) o. no
the function is analytic everywhere on S, then both integrands are zero, by the
Cauchy-Riemann relations, and so we find
If f (z) is analytic in and on C, then
$ rrr;dz:o
Jc"
a
(2.36)
result known as the Cauchy theorem.
Cauchy's theorem applies to curves defined in a simply connected region. A curve in
simply connected region can be shrunk continuously to a point while remaining inside the
region. If the region has a hole in it (Figure 2.I3) and the curve surrounds the hole, then the
region is not simply connected and the theorem does not hold. We will see regions like this
in the following sections, and we will also find some ways to get around the restrictions.
An importantcorollary to Cauchy's theorem is thatthe integral of a function /(z) between
two points zr and 72 is path independent if the function / is analytic everywhere in the
region containing the paths of interest. This is important because it means you can often
use a path that hugely simplifies the integration.
a
2.2 FUNCTIONS OF COMPLEX VARIABLES
99
FIGURE 2.13. This white region has a hole (shaded circle), and so it is not simply connected. As
the curve C shrinks, it surrounds the hole and can never shrink to a point.
2.2.5. The Cauchy Formula
Careful application of the Cauchy theorem leads to many interesting results. For example,
consider the integralg
I:Q t'l
JcZ_a
-dz
where C is aclosed curve in the complexplane. The function f (z) : l/(z - a) is analytic
except at the single point z - a. Thus, by the Cauchy theorem, the integral is zero if the
curve C does not enclose the point z : a. If the curve includes z : e, we consider the
integral
I
,':fo _dz
z-a
where the curye C' : C + Br * f f 82 (Figure 2.14).The segments 81 and 82 are very
close together. Note that this new curve C/ does not enclose the point z : e, and thus the
function / is analytic everywhere in and on C'. Thus, I' is zero by the Cauchy theorem.
Since the function / is analytic everywhere except at z - a, it is continuous along 81 and
82 and so the contributions to the integral I' along .B1 and 82 cancel. (The integrands are
equal, but the path is traversed in opposite directions. Equivalently, the upper limit of the
9The symbol
$a as used here means an integral counterclockwise around the closed path C. When you traverse
such a curve, the interior of the curve is always to your left.
100
CHAPTER
2
COMPLEX VARIABLES
Im (z)
:
a from its interior. (When
as to exclude the point z
you travel counterclockwise around the curve, the interior, shown shaded, is on your
FIGURE 2.14. Thecurve C/ is constructed so
left.)
first integral is the lower limit of the second, and vice versa.)
| d,
I
or+6
o:{
'
- Jc+Br*riB2Z-adr:$
- Ic'z-a' or.:f
Jf.clockwiseZ-d
J6z-a
and so
ftfl
fr, - "o':
./r,.ro.r.* rr",
- od'
The integral around f is taken clockwise, the reverse of the usual direction. We may change
the direction if we also change the sign in front ofthe integral:
flfl
frr-"d':frr-ooNowlo on
f, z :
a
I
peiq,so d7
f\f2'l
ipeiq de andthen
l' ' - 'o': Jo fuo''' i ag
t2n
:, I
do
:Zri
Jo
lORemember: Addition ofcomplex numbers is equivalent to addition ofvectors in tbe complex plane (Figure 2.3).
2.3 COMPLEX SERIES
101
Thus, we obtain the first Cauchy formula:
I | ,__ lZni
Icz-o"\ o
if z:alieswithinC
We can easily modify the proof to show that for any function
$ fk, or:[ t'rlll\ar:
Jcz-'
/
that is analytic in and on C,
*+#aYi\ide
fo'"
(2.37)
otherwise
if z:arieswitbinc
otherwise
[o
Then we 7et p --> 0, to obtain
(
6 !''')^ar: 1
Jc1-
if
r2n
ta)
[ 0
"
Jo
d0
:2tif (a) if z :
a lies within
c
e3B)
otherwise
We may extend this result to obtain the general Cauchy formula:
t'rj- "9r": ffi
f('-r'{ztl,-o ir z:
a lies within
c
(2.3e)
The proof of this more general result is in Appendix III. This result verifies the claim
made in Section 2.2.3 that analytic functions possess derivatives of all orders.
Example2.6. Evaluate the integral
/
J circle
lzl:l
'ot'O,
Zt
around the circle of unit
radius centered at the origin.
The function f (z) : cos e is analytic within and on the circle lzl
Cauchy formula with n : 3, we have
cosz
-f
I
./.n.,.
,.,:,
a'
:
7. Using the
d2
: ;2tri fu
t'1.:o : zi (- cos zl':o) : -ti
""t
I
2.3. COMPLEX SERIES
2.3.1. Real Sequences and Series-A Review
Sequences
A sequence of numbers an is an ordered set with a rule for computing lhe nth element in the
set. The number a, is the nth number in the sequence and can often be written as a function
of n. Some examples are
l--
102
oHAPTER2coMPLEXVARTABLES
for which en
:
n, 2;rd
s2
:
{1,
-1,
1,
-1,...}
for which
a,
: (-I)n-r
unbounded if, for any real number M > 0, there is at least one member of the
sequence dn such thatlarl > M.The sequence ,Sr above is unbounded, since we need only
take M - N, any integer, and infinitely mar'y an are greater than N, no matter how large
N is. Conversely, if we can choose an M such that all members of the sequence S satisfy
larl < M, then the sequence is bounded. Sequence 52 is bounded-we could take M :2.
A sequence S is corwergent if therc is a number s such that, given any positive number
r, we can choose anm so that for every n > m
A sequence
S is
lan-sl<e
in which case the limit of the sequence is s. We write
n\on - t
or, more loosely,
an-->s ast -+@
Neither of the sequences .tl or 52 is convergent, but the sequence
s,
r ,;,
r
: (t,r,t t.r 4,
t
)
j
converges to zero. (Take m to be any integer greater than 1/e.)
A sequence that is bounded but not convergent-for example, sequence
oscillate finitely. Another example is
S2-is
said to
s*: {1* t-,,'}
L,?
)
Other sequences may be unbounded and not convergent-for example, sequence
51
above, but also series such as
55
: {/ncos (zn)}
which is oscillating but unbounded (infinitely oscillating).
Further properties of sequences can be found in texts on calculus (for example, see
Stewart, Chapter 11) or on real analysis.
Series
A
series is formed by summing the terms of a sequence. Using sequence 51, we form the
series
i': i,
n:1
n:1
e.4o)
2.3 COMPLEX
SERIES
103
A second sequence is formed from the partial sums of the series:
,+^:io,
n:l
lfthis sequence ofpartial sums is coryergent, then we say that the series is convergent. One
requirement for convergence is for the successive terrns to approach zero;that is, a, -+ 0.
Conversely, the series diverges
if
n\o" 7 o
This includes the case in which the limit does not exist at all.
Clearly, series (2.40) is not convergent, since limr-- a, does not exist. But what about
the
following series?
$t
Ln
(2.41)
n:l
: Iln
do approach zero, but not very fast.
Here the answer is not so clear. The terms dn
It is possible for a series to diverge even though an -+ O. We'l1 need to /esl this series for
convergence (see below).
If the series in which each an is replaced by its absolute value,
it',t
n:I
is said to be absolutely convergent.
Series that converge, but not absolutely, are said to be conditionally convergent. These
series do not have a well-defined sum, since by rearranging the terms we can achieve any
sum that we want (for example, see Stewart, p.736;Arfken and Weber, Section 5.4). Thus,
also converges, then the series
they must be used with caution. In contrast, an absolutely convergent series has a welldefined sum. In addition, absolutely convergent series may be multiplied together to form
a double sum that converges to the product of the original two sums.
There are numerous tests for convergence of a series. A number are listed in Gradshteyn
and Ryzhik, Section 0.22. (See also Stewart, Section 71.2.) They include the following.
The root test. If
]\lo*l'tk
andq
<
1, the series converges absolutely,
fails.
The ratio test.
but
:
q
if4 > l, the series diverges. Ifq :
test
If
r.,, lon*tl:o'
r-ol
ar I
1,11e
104
cHAPTEB
2 ooMPLEXVARTABLES
andq<l,theseriesconvergesabsolutely,butifq>l,theseriesdiverges.If4:1,fl[s
test fails, unless the ratio remains greater than 1 as the limit is approached, in which case
the series also diverges.
Theintegraltest. If
ap: f (k), where /(-r) isdefinedforx > q > l, thentheseries
converges or diverges according to whether the integral
r@
I
Jq
fGsax
converges or diverges.
The comparison test.
Iflanl < bn and ! b, converges, then ! a, converges absolutely.
The alternating series test. If the series is altemating (that is, successive terms in the
series alternate in sign), then the series converges provided
thatlalrlll
.
la*l and
la7.
|
-+
0.
i 1. Oru.rr"r.
Let's apply the integral test. Since oo : tii'i! aefined for k > l, we look at
Example2.7. Show that the series (2.41),
: lnrlf
l,* )0,
which diverges. Thus, the series also diverges.
If the series elements an are notjust numbers but functions of some variable, say x,
then convergence of the series may depend on the value of .r chosen. For any particular
value of -r, we can apply any of the tests listed above. The result determines the pointwise
convergence ofthe series at that value ofx. For example, the series
r"
converges if lxl < I but diverges otherwise. A particularly valuable series is one that converges independent of the value of x and converges equally well for all x. Such series are
said to be unifurmly convergent.
is uniformly convergent to / (x) in an interval 1 if, for any arbitrarily
small positive number €, we can find a number N such that
A sequence
f"(x)
lfo@)-f@)l.e
for all p > N, where N is independent of x in 1.
If
the uniformly convergent sequence f"(x) is the sequence of partial sums of a series
f,(x) : Dk:ou*(x), then that series is uniformly convergent in 1.
One of the major advantages of uniformly convergent series is that they may be integrated
and differentiated term by term, provided that the individual terms are continuous (for
integration) or differentiable (for differentiation).
2.3 COMPLEX
Tests for uniform convergence include theWeierstrass
SERIES
105
M test:
If there is a sequence of numbers M, suchthat lu"(x)l < M, for all x in the interval
la,bf and the series D|rM" converges, then the series
uniformly and absolutely in the intewal [a, b].
f[,
ur(x)
converges
Uniform convergence and absolute convergence are independent properties of a series.
Thus, the most useful series are those that converge both uniformly and absolutely.
2.3.2. Complex Series
A complex sequence is
a sequence
of complex numbers and thus is actually two sequences
of real numbers:
Zn:xrliYn
f
The sequence zn converges if and only
both the sequences
x,
and ),,? converge. Similarly,
the complex series
Drn:D*,*i\t"
if and only if both real series I x, and I y, converge.
We can define absolute convergence for a complex series as we did for a real series:
A series converges absolutely if the series of absolute values converges. But for complex
series, we are now talking about convergence of athird seies,
converges
f l.,l:f
x]+fi
which is different from the previous two. Thus, absolute convergence is a more powerful
constraint for complex series than for real series. If a series converges absolutely for z in
a range lz - zol < R, then R is called the radius of convergence for the series.
Let's look at an important example of a complex series: the geometric series
oo
fon
.L'
n:0
The
patial sums are
Sxr(a)
Now
if lzl <
:
* "' *zN
1 _zN*l
_1-zN+l_
I-z
l-z
l-z
I
*
z
-f
12
+
z3
1, then
t-N+1 I
r-rN+l
ll - z I
11
lt
l-lzl
-
zl
*0
asN--+rc
(2.42)
106
CHAPTER
2
COMPLEX VARIABLES
and so
oo
\-,n
/-'
I
-
n:o
I-z
iflzl <1
(2.43)
The geometric series is uniformly convergent for lzl < r arrd
r (here 1) is the radius of convergence for the series.
r <
1. The limiting
magnitude of
2.3.3. TheTaylor Series
Suppose that a function
z
:
f (z) is analytic in a region R: lz - al < p, centered on the point
f (z) as a series in powers of (z - a):
a. Then we may express
-f(d
: f h) * k - a) f' (at * tJ
t"(o) + "'
This series is uniformly convergent within the circle lz
*"
;i'' Hl,:"*
(2.44)
- al < p, where p is the radius of
convergence.
To prove this result, we'll use the Cauchy formula (equation 2.38) and the geometric
series (2.43). First we construct a closed curve f that surrounds both the points z and a
(Figure 2.15). Then we may wite f (z) as an integral around the curve f :
Next we write
where
lz.-al
I
1.4.1
l€-olsincezisinsideI-,andthusclosertoathanf,foranyfonf.(Wearefreetoconstruct
f
to make sure this is true.) Thus, we can use the geometric series (2.43) to expand the
second fraction:
+:(+) ('.=.(=)'.
t \$/.-aV
:f
:\s-")k\€-
")
)
2.3 COMPLEX
SERIES
1O7
Im (z)
Re (z)
FIGURE 2.15. The curve f for evaluating the coefficients in the Taylor series.
within the region R and encloses the point I : o,.
The geometric series is uniformly convergent on
respect to
f
f
is entirely enclosed
, so we may integrate term by term with
f:
r(z):+in:o e-ayn Ae*rru
27tt
(2.4s)
-
Then we may use the Cauchy formula (equation 2.39) to express the integral in terms of
derivatives of /:
a^f
f(z):i(z-d"
nt dzn lr=o
(2.46)
I
-
as
we set out to prove.
Now suppose there exists some other series for
/
about the same point a:
f (z):Dr,e - o)'
n--0
What can we say about the coefficients
times. Then
!clz :i
nr,(z
n=l
-
c, in this
series? Let's differentiate the series la
ayn-l
@
t2c
u
.I : )4^ n(, - l)cnk dz'
a)n-2
n:z
o: I^
dz'n
:
mtc^
+ i
n:m+I
nfn
-
1).. . (n
-
m
*
t)cnk
-
a)n-m
108
oHAPTER2 ooMPLEXVARTABLES
Evaluating tt 7
:
6, we have
dn
"f |
I
dz^ lr:o
So
if
-
ml.c.
any such series exists, its coefficients are given by equation (2.44).
Example
2.8.
Find the Taylor series about the origin for the exponential function
f (z) :
ez
:
ex-fiy
:
ete'J
:
er cos y
I
iex stny
:
0) is
This function is analytic everywhere:
3u: ,,
cos), :
A*
and
"^
Eu
Ay
3u r.
0u
A*:'-'smY:-ay
The derivatives are
d'f _
dzn
",
for every n, and so the Taylor series about the origin (a
-z2zn
e.:t*r+;+...+1t
This series is exactly what we might expect, since e" has the same form when x is
real.
2.3.4. The Laurent Series
Suppose that the function / is analytic in an annular region R centered ort I : ai pt <
lz - al < p2. Then we may still expand / in a series of powers of (z - a), but the series
will include negative as well as positive powers. This series is called a Laurent series. To
find the coefficients formally, we apply the same techniques that we used for the Taylor
series.
As before, we construct a curve f that surrounds the point z and lies entirely within
the region R. This time the curve cannot also surround a, but we can distort it to form
a composite curve having four parfs (Figure 2.16):
1. The original curve
f
with two snips in it
just inside the inner border of the annulus at lz
,
2. A circular curve C1
- al
3. A circular cuwe C2just inside the outer border of the annulus at lz - al
4. A set of cross cuts linking f to C1 and f to C2
:
:
pt
p2
t
-
e
6
2
3
COMPLEX SERIES
109
Im (z)
Re (z)
FIGURE 2.16. A curve for finding the Laurent series lies within the annulus but excludes the point
2.. C : C1 -l CZ I | - cross cuts.
Now we proceed as before, writing
/
as an
integral around the curve
f
:
t f f(€)dE
I(z):fifrq_,
But, when we traverse our composite curve as shown in the diagram, the integrand is analytic
everywhere inside and on the constructed composite curve, and thus the integral is zero by
the Cauchy theor m.
f,#o' : t'r,*ou * -{..",*0,',,"#.ou *
+$
{(€)-os :o
Jcrosscuts
t'r,.c,ockwise
*ot
I - (
The sum ofthe integrals along the cross cuts is zero, since the function is analytic within
R, and hence continuous at the cross cuts. We can use a minus sign to change the direction
ofthe clockwise integrals to the usual counterclockwise sense. Thus,
{ lLot: JcrE-z
$ {'€' oE - Jcrt-:
$ !"' oe
Jr€-z
Now we proceed as we did with the Taylor series.
For f on Cz, lz - al < l€ - al, and so we expand the denominator as we did before,
obtaining the same result (equation2.45 but not equation 2.46).However, for f on C1,
lz - al . l€ - al, so we have to expand as follows:
I
t-z
-1
(z-a)
1
-1 s/6-oY
(,-Lg\ - z-qk\r-')
\
z-o)
110
cHAPTER2 coMPLEXVARTABLES
and thus
-t t
IIc, €f@dt-$
-a)' d€
z
fr,f(€)(€
7:ok-o)'*a
which gives a series in negative powers of z
$
-
Pou:_i Q-o;n $
JLt 5
.
p
a. Writing
Jcr
,:_,
Here we cannot easily express the coefficents of (z
could for the Taylor series.
Putting together the two parts, we get
- - (n'f l), we get
-
f
(€) (€
-
a)p in terms of derivatives,
r(z):*A*ou: f
o-e-t
dE
as we
c,(z-a)n
where
,,: * t'r4*rtt
ror -oo <n <@
(2.47)
Once we have the result, we can evaluate the integrals over a cornmon circle C centered on a
and lying within the annulus, since the integrands in equation(2.47) are analytic everywhere
in the annulus.
This expression (2.41) for cn amply demonstrates the existence of the Laurent series, but
it is not generally very useful for finding a particular Laurent series. We can resort to a set
of "tricks" that work better. The next example illustrates the geometric series method.
. : t ,o, the function ^f+it;,t*" cannot
rhe function is not analytic at the r*o oo,ni, . : ;;;;:-1.
draw a circular region centered at z : 1 within which the function is analytic. So,
there is no Thylor series for this function about the point z : L However, there are
two annuli centered atz: l:0 . lz - ll . 2,with the point z: -l outside the
annulus and z - 1 inside an infinitesimal hole in the center, and2 < lz - 1l < oo,
Example
2.9.
Find a series about the point
with both points in the inner hole. Thus, we can find two Laurent series centered at
1: oneintheregion0 . lz
In the inner region, we have
z:
- ll
<2andasecondfor lz - Il > 2(Figtxe2.l7).
t/ 1
(z-1)(z+l)
tl
-t\.J-r+t)
I \
2.3 ooMPLEX
sERTES
111
Im (z)
2
Re (z)
4
FIGURE2.17. ThefunctioninExarnple2.9isnotanalyticatz:*l.ThereisaLaurentseriesvalid
in each of the annuli 0
<
lz
- ll < 2 and lz - ll > 2.
Now we can expand the second fraction using the geometric series. In the inner
annulus, lz
z
- Il < 2, so
11loo
+
t
- z - t +, -, (,.
\z/
and thus we have
:
+J :n(-"'(?)
*::l* :n'-"'(=)']
:1S
1-'in*r (t-1\'
\ z )
4'/'t't
/
'n:-1
which is a Laurent series with only one negative power: (z - g-t .
However, in the outer annulus, lz - 1l > 2, so we expand the fraction the other
way:
;-:;;041 :*i'-'r(=)'
and
+.:il-!-i,-D'=nnl
z2-t zlr-l f,:o \4-,, I
:- li
2?
-J-!(z-l)'+r
a series with infinitely many negative powers of z
e.4B)
- I and no positive powers.
112
CHAPTER
2 ooMPLEX
vARTABLES
2.4. COMPLEX NUMBERS AND LAPLACE'S EQUATION
2.4.1. The Equations of lncompressible Fluid Flow
i(i),
giving the velocity of each fluid element,
a vector field
and a scalar field p (i), giving the density at each point. Since the mass within a volume can
change only by flow of fluid into or out of the element,ll we have
A fluid may be described by
dff
; J,'dv:-
Jron'n'ae
where the minus sign arises because the unit vector ff normal to S points out of the volume
V. Now if we apply this relation to a fixed volume V, we can then apply the divergence
theoreml2 and rearrange to get
l,e+n
rpi))
dv:o
Then since this relation must hold for any fixed volume V, we may conclude that
3o
(2.4e)
;*r'(pi):o
which is known as the continuity equation.
Liquids have the property that their density remains almost constant under a very wide
range of applied pressure; they are incompressible. (Gases can also behave incompressibly
under some circumstances.) For incompressible fluids, equation (2.49) simplifies considerably, since the time and space derivatives of the density are identically zeroi
i.i:o
(2.50)
Certain classes of fluid flow are also irrotational,
ixi:O
(2.51)
which means that there are no swirling motions in the flow (no turbulence).
2.4.2. The Velocity Potential
A vector field that satisfies equation (2.51) may be described
function
@, since for any such
function
Vx(V@):Q
In our fluid flow problem, the function
@
as the gradient
of a scalar
i
is called the velocity potential. It is usual to
introduce a minus sign, so that
i:
1 1
See Chapter 1, Problem 6.
12See
Chapter 1, Section 1.2.3
u"i -f ury :
-fi6
l
(2.s2)
2,4 COMPLEX NUMBERS AND LAPLACE'S
EOUATION 113
Then we can put this result into equation (2.50) to get
Y'O
:0
which is Laplace's equation. In this respect, the solutions to irrotational, incompressible
fluid flow problems will resemble the solutions to problems in electrostatics, since the
governing differential equation is the same.
2.4.3. Analytic Functions as Solutions of Laplace's Equation
An analytic function w(z)
and2.33):
:
*
u
iu satisfies the Cauchy-Riemann relations (equations 2.32
0u 0u
Ex 0y
0u
0y
anl
0u
0x
Differentiating again, we have
02u02uA/02\
a*r: a*ay: O (-O/
and thus
02u
02u
ffi+#:y2u:o
(2.s3)
That is, u satisfies Laplace's equation in two dimensions. Such functions are said to be
harmonic. Similarly, we can show that u is also harmonic. Thus, the real part or the imaginary
part of any analytic function will be the solution of an incompressible, irrotational fluid flow
problem in two dimensions.
Let's define an analytic, complex velocity potential function
a:Q*ilr
The fluid velocity is given by equation (2.52).In particular, if we use the real part
as our solution for the real velocity potential, then the velocity components are
a0
a0
U*: anq ,y: 0X
Ay:
al'
A*
and so we can form the complex number
.
:Ux*iry-Then
/a0 .a,/\
(,r-,*)/,
.:-(#.,y):-#
@
of O
114
oHAPTER2coMPLEXVARIABLES
Thus, the velocity components are found from the real and imaginary parts of the derivative
of O; that is,
ux:
/ da\
-*'(a/
and
uy
/ da\
: t- (*/
(2.s4)
Now any physics problem is described mathematically by one or more differential equations plus a set of boundary conditions. For example, fluid cannot flow across a nonporous
boundary, such as a solid wall. On such a boundary we have the boundary condition
i.fi:-n.i4:o
(2.ss)
Thus, to solve such a problem we need only find an analytic function Q : Q * i ry' whose
real part satisfies the boundary condition (2.55).The real part @ then satisfies both Laplace's
equation and the boundary conditions, and the problem is solved.
If the real part of @ is the velocity potential, what is the imaginary part? Note that
3 :
=
V6'V{t
AlL+ A0 ArL :
0y Dy
0x 0x --
A0
--L---:-
u!(_y\ *ua:o
a'\ oy) oyox
(2.s6)
Thus, surfaces of constant ry' are perpendicular to surfaces of constant d. Ghis is the same
relation as that between equipotential surfaces and electric field lines in electrostatics.) The
constant ry' surfaces are thus the streamlines of the flow, and from equation (2.55), a solid
boundary surface must be a surface of constant ry'.
2.4.4. Steady lrrotational Flow Around an lnfinitely Long Cylinder
Suppose that fluid flows from a great distance with velocity i : Voi toward an infinitely
long solid cylinder of radius a (Figure 2.18). We want to find the flow velocity around
the cylinder. To solve this problem, we will use cylindrical coordinates with origin on the
cylinder axis. It is a two-dimensional problem with the boundary conditions
i -+ V6i
as
r -+ oo
(2.57)
and since the flow has to be al,ong the surface of the cylinder, streamlines follow the surface
a.
constant on the circle r
at r
a and
:
:
{ :
Y0
FIGURE 2.18. Fluid flows toward the cylinder (radius a) from infinity with velocity i
:
Voi.
2.4 COMPLEX NUMBERS AND LAPLACE'S
EQUATION 115
First we reformulate the problem in terms of complex functions. The solution for @ is the
real part of a complex function Q
Q * I ry' that is analytic everywhere outside the cylinder
and that satisfies the two boundary conditions. Since O is analytic everywhere outside the
cylinder (that is, in the region r > a), it can be written as a Laurent seriesl3 centered at the
:
origin:
*oo
:
(2.58)
n\'n'"
The real part of this function satisfies the differential equation for our problem. Now we
also need to satisfy the boundary conditions. At infinity, we have
,,: -# - vo:-* (f )
Thus, as
lzl -+ oo, Q : -Voz. The coefficients
cn,
and
n2
uy
:0
2, of the positive powers in
equation (2.58) are all zero, and the coefficient of z must be c1 : - Vo. The other coefficients
cn, n 1 0, are not yet determined, since all the negative powers of z go to zero at infinity.
The second boundary condition is f - constant on r : a, and we may choose that
constant to be zero. (As with electrostatic problems, we may add an arbitrary constant to
the potential without changing the values of its gradient, here the velocity field.) Now the
imaginary part of a complex function may be written as
lb:E(a_a*)
1
and we want this expression to be identically zerc for lxl : a. Inserting the series (2.58)
for O, with both z and c, in polar form (z : aeiq a\d cn : lcnl ei|'), we find
./
o
oo
\
: +(
,-'0 ) +f ,-,o-n r-in' - ,\no-n eine I
z'\ -vsa(eie n:t
/
o:
-voa
sind
:
-voa
sind
- i"n:7
c-nlsin (no - 3-,)
(9
E-,)
- .EJ
a rin - d-r) -to-"lr-nlsin(n0 -
n:z
We can14 satisfy this equation by taking
lc-nl
:0
for n
>
1 and
lc-r I : a2Vo, 6-t
Then
/
a2\
(D:-Volz*-l
\ z/ -
-vo(,.#) - -vo(,.#)
13See Section 2.3.4.
14In Chapter 4, we will show that this is the only possible solution.
-
n.
116
cHAPTER2 ooMPLEXVARTABLES
and on the curve lzl
:
a,
Q:-VoQ*z*):6*
as we require. Thus, our
Laurent series has only two terms, with powers
zl
and z-l
.
The velocity components are given by
ll)
*dQ
:
--
dz
:
vo(r
:
Vol I
-5)
/a2
\r' :ux-iuy
-;€ -rtt)
So
ux
:
vo(t
- 5"o, za)
;
vv
: -Voa2., sin20
The equipotential surfaces are
:
constant
: -Vol/ r - :a2\I sind :
r/
\
constant
0- -ro(, .
+)cosg
and the streamlines are given by
tb
The streamlines are shown in Figure 2.19.
o
FIGURE 2.19. Streamlines of flow around the cylinder. The lines are given by
t :
constant.
2.5 POLES AND
ZEROS 117
2.5. POLES AND ZEROS
2.5.1. Analytic Continuation
a function is analytic in a region R, pt < lz - al < p2, we may describe it using
a Laurent series valid in R. We may also be able to find a second Laurent series in a
different, but overlapping, region. (See, for example, the discussion of series for the tangent
function in Appendix XI.) The second series is an analytic continuation of / into the
second region.
For example, consider the function
If
srn z
frk) z-l
defined in the region
lzl < I (Figure 2.20).We may
evaluate the function with a Taylor
series:
/
-t
\.
-s
fzk):- (.- ;..;..
:
-z
- "-z -e"
-2; -
)(r
5,4
6<.
-
+. +22 +23 +24 +zs +...)
lol -t -.....
l2O. -"'
where f2Q) is valid only in the region lzl < l To evaluate the functionls outside this
region, we can expand about another point in the original region-for example, zt : -r /4.
Im (z)
Re (z)
FIGURE 2.20. The function
'' -- -*t4.
rtl
fzk)
may be analytically continued by expanding in a series about
.I
l5You might think that this would be unnecessary, since we already have the closed-form expression
fi
(3). But
we cannot actually compute sin z except through its series expansion. See Morse and Feshbach, p. 377, for a good
discussion of this point.
118
oHAPTER2 ooMPLEXVARTABLES
For convenience, we define a new variable u)
:
z
- zr :
z
-f
n 14.
Then we obtain the
series
ftk):
sin (z
/sinrl-cosu.,\
t r/a - n/4) sin(w - n l4) I t_t
-lnla)
(r
t)
u
r14/Z \ w-t.78s4 )
w - w3
+..' - (t - w212t+..')
r.78s4(t - w/1.7854)
(z+n1+
t
- -n
131
l/*2w3\
:138s4J-2(.t-'*i*
o*
/ww2w3\
xl1*
" \' ' :.zB54'I
:
0. 39605(1
-
)
r
' 1.78543 ' )
-r--r...
0.4399w * 0.25361w2 + 0.3087t1r,3 + ..' )
1.78542
< l-ln l4,or lz * T l4l < l-lr l4 : 1.7854'
Thus, we have extended the function fzk) into the shaded region shown inFigure2.2l.
andthis series /s(z) is validintheregion lu.rl
We can now pick a point in the new region and continue to extend the function indefinitely. The expressions ftQ), fzk), and fzk) are three different expressions for the same
function.
Im
FIGURE 2.21. The function
fzk)
(z)
has been continued into the shaded region as
fzk).
In some circumstances, we can simply use the original series outside of R as well, but
it does not necessarily describe the original function in the new region. The new function
is also an analytic continuation of /. In this case, we are redefining the function to be the
2.5 POLES AND
series. This result
at the function
is
ZEROS
119
called permanence of algebraic form. For example, suppose we look
111
f(x):i+z*r*:"r*"'
x is on the real axis with lxl > 1. (This series represents the real function
lnfxl@ - l)].) We can use this series everywhere in the annulus lzl > 1 in the complex plane, where it represents a complex function (which we might call the principal
where
lk - 1)]) that is identical with the original function on the real axis. It is an
analytic continuation of the original function.
Notice that we talk about "an" analytic continuation, not "the" analytic continuation.
There is more than one way to continue a given function. In any specific application, we
must choose the continuation that is best for that particular purpose.
branch of ln[z
2.5.2. Zeros
Thepoint
expand
/
z: aisazeroof
in
a
afunction
f it f (a):0. If /
- al < p:
is analytic ata,then wecan
Taylor series in some region lz
f (z)
:i',r,
-
n:0
oY
f , then c0 must be zero. If c1 10, then a is a simple zero of /. If both c6
andclarezeroandrzf0,thenaisazeroofordertwo.Ingeneral,lfcg,cl,...,cn-lare
If a is a zero of
/ of order n. Essentially, the order of the zero of a
6 is a statement about how fast the function goes to zero as z approaches a.
all zero but cn is not, then a rs a zero of
function ?t I
Example
:
2.10.
The function
If we expand in a Thylor
f (z) :
sin (22) has
a
zero at z
:
O.
What is its order?
series, we get
sin (22)
:.2 -
.6
i
+
In this series, both c6 and c1 are zero but c2 is not, so 2
of f .
What is the order of the zero z
:
0 for the function g
:
:
0 is a second-order zero
sin z2
-
z2?
2.5.3. Singularities
Isolated Singularities
It f(z) is analyticintheneighborhoodofapoint z: abut notactually dt 7:6,
then a is anisolated singularity of
f.
12O
oHAPTER2coMPLEXVARIABLES
There are three possible cases.
l. lf Q)l -+ oo as z
--+
a.An example of this
case is the functio n
singularities are called poles.
2. f (z)is
bounded as z
-+ a. An example of this
t
- -. *"r"
f (z): z-a
tot
/ : z-r12 . Then,
(11
case is the function
using I'Hospital's ru1e,16 we have
(z) .. - sinz
,!T1z-7
-,r12: ,!Tp I
-.
cos
: -t
These functions present no problem, since we can redefine the function as follows:
(
cos
f(z):1
(z)
z_n12
[ -l
* r/2
lfz:r/2
if z
The new function is analytic everywhere. This kind of singularity is called removable.
3. f (z)
oscillates. For example, consider the function
f (z):*p f 1)
\z/
Along the real axis, z
: r, we have
f (z) --> a
f(z) --> 0
But along the imaginary axis, 4
trk)t: l*'
as
-r
-+
as
x -+
-
ly,
0 through positive values
0 through negative values
(;)l : l*n (;)l : '
ror
al
varues
orv
Note that the Laurent series
/1\
1
I
".n(;/:|*;+7,7+"'
which is valid up ro the singularity at z : 0, has infinitely many negative powers. This
is a characteristic of this type of singularity, which is called an essential singularity.
(A Laurent series with infinitely many negative powers that is valid in an annulus with
afinite inner radius does not necessarily indicate an essential singularity. An example of
suchaseriesisinSection2.3.4,equation 2.48:theseriesfor
This function has a pole at z
: l.)
l6We could get the same result by expanding cos z in
a
Taylor series about z
+-validfor
zt-l
:
rr/2.
lz
- ll > 2.
2.5 POLES AND
ZEROS 121
Next we shall investigate singularities of the first type, the poles. The function / is
analytic in an annular region 0 < lz - al < p and may be expanded in a Laurent series
centereda;tT:qi
f (z):
-
,t*,,r,
oY
If the coefficientc-1 l0but allc-,-:0 for ffi t l, thenthepole is of order 1 (thepole
is simple).If c-2 is nolzerobut c-^: 0 for ffi > 2, the pole is of order 2 (whether or not
c-1 is zero). In general, if the series may be written as
f (z) :
or'
,t^,,r, -
with c-. # 0, the pole is of order m.lAn essential singularity (case 3 above) is a pole of
infinite order.l
We can test for the order of a pole without finding the Laurent series. The limit
I:!"(, -
a)P
f (z) :
ll{'k - ro ,t^cn(z - a)n
: ,_o
14 i
c'(z
-
a)n+P
p
:
m,and will not exist for
n_^
will
be zero for
p
) ffi,will
be a constant
(c-^)
for
p<
m.
Thus,
The order of a pole at z - a is the lowest integer p for which the
limr-o(z - a)P f (z) exists.
A function that has well-separated poles
limit
as its only singularities is described as mero-
morphic.
Other Kinds of Singularities
Functions may not be analytic for worse reasons than those cited above. For example:
.
t-
The function may have a branch poinr. A branch point is a point from which a branch cut
emerges (the head of the tadpole, see Sections 2.1.3 and2.2.l).To determine where the
branch points zo lie, note that we obtain successive branches ofthe function by increasing
the argument of z - zpby 2n . The function 7 : JVhas a branch point at the origin. The
principal branch ofthe function is not continuous across the branch cut, and therefore the
function is not analytic anywhere along the branch cut. Thus, the singularity at z : 0 is
not isolated.
122
.
cHAPTER2ooMPLEXVARIABLES
We may have an infinite set of singularities that converge to a limit point, so that any
neighborhood of that point contains infinitely many singularities. The function / :
tan (llz) has this property. The singularities are at lf z : nnf2,where n is an odd
integer, or
,n: L
This sequence converges to z : 0. No matter how small a neighborhood of the origin we
pick, say lzl . e, there are infinitely many zn in this neighborhood. To see this, let N be
an integer such that N > 1/e. Then
,*.4.?r.,
Nrt
7t
is inside the neighborhood, and so are all the zm, m
singularity of
f
> N.
Thus, z
:
O
is not an isolated
.
2.6. THE RESIDUE THEOREM
2.6.1. Definition of the Residue
If f(z)
is analytic in a neighborhood
the residue of
f
at a is
tl
,"1
ofe
:
a
axcePt perhaps
ata,then
f, f (z) dz, where the closed curve t "n"tot"t L.r,
By the Cauchy theorem, the residue of f at a is zero if the function / is analytic at a. But
the reverse is not necessarily true: A zero residue at a does not imply that the function is
analytic at a.
Comparison with the expression for the coefficients in the Laurent series (eqtanon2.47)
shows that
the residue of a function f at a isthe c-r coefficient
oI the Laurent series centered I a.
Example
2.11.
Find the residue of the function
f : Y
The function may be written as a series:
1/.2.4\
f (z):;
(t- i.* A+
-22Zl.'4t'
)
at z
:
Q'60)
0.
2.6 THE
The function has a second-order pole at z
residue there is zero.
Example
2.12.
Show that the function
:
O,
RESIDUETHEOREM 123
and since the coefficient c-1
f (z) :
sinz ez
-2-
:
Q, *1s
has a pole at the origin,
find its order, and find the residue there.
We expand the sine and the exponential in Taylor series:
r:Y -r']
:)(,-*.*:(,*.+!+{+
)
I
7- 1_z
<,
--i-.-
Thus,
/(z)
1
.
6.-11" -
3
_3
401"
-.-.
has a simple pole at the origin, and the residue there is
-1.
2.6.2. The ResidueTheorem
If a function
/ is analytic in a simply connected domain D except for afinite number
of isolated singularities and if curve C is within D, then
N
d f Or:2rilRes/(2,)
Jc
7l
wherc zn are the singularities of
/
(2.6t)
contained within C.
To prove this theorem, we deform the curve C so as to exclude each of the singularities
zn, as shown in Figure 2.22.The new curye C/ equals the original curve C, plus a set of
cross cuts leading to, and a small circle f, around, each singularity. We can do this only if
the singularities are isolated, since the function has to be analytic on the cross cuts and the
circles.
Then by the Cauchy theorem,
f
f.f
dz:o
since we have constructed Ct so that / is analytic everywhere in and on C'. Since the
function is analytic and thus continuous along the cross cuts, they do not contribute to the
integral.lT Thus,
^N
d tor+fd/f,,"ro"r*i..
Jc
il'i
lTWe used this argument previously in Section 2.2.5.
fdz:o
124
cHAPTER2ooMPLEXVARIABLES
Im (z)
C
/
Re (z)
FIGURE 2.22. Contolr for proving the residue theorem. The contour excludes the singularities at
zn.Two such points are shown in this diagram.
If the integral around C is in the standard counterclockwise direction, then each of the
integrals in the sum is taken clockwise around its circle f, . Now we move the sum to the other
side of the equation and use the resulting minus sign to change the direction of integration
to the counterclockwise direction. Using the definition of the residue (equation 2.59), we
have
^Nr
to,
{-ra,:Diln:lJl,
JL
N
:ZnilRes/(2,)
n:l
and the theorem is proved.
2.6.3. Finding Residues
There are several different methods for finding residues.
1. Find the Laurent series and pick out the coefficient c-1. This is what we did in
Section 2.6.1.
2. For a simple pole, the residue
at a is
Res/(a):I1E"k-a)f
(z)
(2.62)
2.6 THE
(z
_lim_
z+a
- a) f
(z)
:
RESTDUETHEoREM 125
@
- ol D
lim (e
z+a
cn(z
n:-l
-
a)n
@
: [m f
zta
cn(z-a)n+l
']t
All
the terms with positive powers of (z
Example
2.13.
a) go to zero,leaving c-
-
Find the residue of the functio n
f (z) :
tun'
r,
which is the residue.
at the origin.
First we find the order of the pole. Note that the limit
tanz
lio,.tu1z : lim
z-0 - Zz z+O Z
: li* ttl" :
z--+O I
I
exists, and thus the pole is simple. The above limit also gives the residue as 1.
3. For a pole of order ln, the residue is
Res/(a)
: !:+6+fr#tk - a)^ y
12y1
(2.63)
Again we use the Laurent series to demonstrate the result:
@+iffiu
I:g
-
a)^
rk): lg
- a)^,i^,,,, - o,'
^+fr#(z
Since the Laurent series is uniformly convergent, we may differentiate it term by term.
All terms with nr i n < m - 1 (that is, n < -l) differentiate to zero.
oo
sm_l
tim
*
z+q 42.," '
@
I
.,(. -a)n+^: z+a
lim I
-
-
oru,n
powers or (z
term: ",;;;,""
ld^-1/l\
lim _____:_ :-
7+a (m
as required.
-
1)l dzm-t
cn(nrm)(n*m-l).'.(n-12)(z-a)n+1
- ,;;trzero
in the limit, leaving only the n
_ l)(m _ 2)... I ) ._,
k'- _ a)* f (z) : { ,* l)! @
'
\(tn )
: C_l
: -r
126
oHAPTER2coMPLEXVARIABLES
Exampte
2.14.
The function
there.
cosh z
f : 7
has a pole at the origin. Find the residue
First we find the order of the pole. We find that
coshz: -. coshz
-.
llm
llm.:z-0 Zz z-O Z
-
does not exist. But
-. ,coshz:
..
lim
Itgot --
:
cosh z
I
exists, and so the pole is of second order. The residue is
Resf (0)
:
9
tir.
z-O dZ
coshz
-
lim sinhz
z-0
:0
We can check this result by finding the Laurent senes:
r(z):t-!ri*t
There is no
c-t
:\
*:.*.
term, and thus the residue is zero, as we found using method 3.
4. For a function of the form
J.Q)
where h(z) has a simple zero at z :
dt 7 : 6, and the residue is given by
a
sk)
h(z)
and g(z) is analytic at a,
Res/(a)
f (z) has a simple pole
: z+a
li- n,\Z)
,8,(,tl
(2.64)
To understand this result, first write h(z) in a Taylor series centered at z
h(z):f
r,r, _
:
a:
of
n:l
There is no c0 tenn because h(z) has a simple zero at z
given by (equation 2.44)
,,
l
nt
d"hl
dzn tr:o
I
:
a, and the coefficient c, is
2.7 USING THE RESIDUE
THEOREM 127
Now apply method (2):
Ir:trk
-
a)f (z)
: Irg"(, - ",ffi
:lg(._
',r#=*
gk)
Iim
- i:lr
D*
s@)
c1
tcn(z
_ q)n_l
s@)
h'(a)
as required.
Example2.15. The function f (z)
r 12. Find the residue there.
:Ianz - sinz/cosz has a simple pole at z :
By method 4, the residue is
*.'/ (;) : ,!T,r-"n j- :
5. Finally,
as a last resort, we can
-1
actually evaluate the integral in (2.59).
2.16. Evaluate the residue of the function f (z) : l lz at z
We choose C to be a circle of radius p centered at the origin. Then
Example
:
O.
I f
| ['" | ^,^i0..^ | [2",^
l - :1
t peta' d0: 2ri I ae: - 2ni
*A f k) dz 2triJo
2riJc"
2ti
--1 Jo
-pte
Of course, we could also have obtained this result trivially using method L
2.7. USING THE RESIDUE THEOREM
2.7.1. Evaluating the Integral of a Complex Function
The residue theorem is an extremely powerful tool for evaluating integrals. When using the
residue theorem to evaluate an integral of the form
f
dz
t, f {z\
I suggest that you always use the steps below
1. Draw a diagram showing
the contour C in the complex plane. Also mark on your diagram
any poles or other singularities of the integrand, f (z). If the function has a branch cut,
be sure to show it on your diagram too.
128
oHAPTER2coMPLEXVARIABLES
have to deform the contour C so that the entire branch
cut is excluded from the contour. The contour and the branch cut may not intersect at
any point!
2. If there is a branch cut, you will
3. Note which poles are inside the contour C.
4. Evaluate the residue of f ateach ofthe poles that are inside the contour'
5. Apply the residue theorem.
6. If the contour runs around a branch cut, you will have to evaluate the integral explicitly
along both sides of the cut.
Example
2.17.
Evaluate
/
sinz
f,-d'
where C is a square with comers at the points
Step 1: See Figure 2.23.
(-i),
(2
- i), (2 *
i), and (+i).
Im (z)
FIGURE 2.23. ^fhe contour for Example 2.77. The integrand has
a
pole at q
:
l. This pole is inside
the contour.
Steps 2 and 3: The integrand has a simple pole at Z
There are no other poles or branch cuts.
Step 4: Using method l, we have
Res/(l)
:.lTl
(.
:
- t)
: \.It is inside the contour.
sin(l)
=
Step 5:
"tn' o, :2ni
[ z|
Jc
(sinl)
:
5.287]i
2.7 USING THE
RESIDUETHEOREM 129
2.7.2. Using the ResidueTheorem to Calculate Real lntegrals
Integrals of Tiigonometric Functions
Integrals of the form
f2n
Jo /(sin
9,
cos9)d0
(2.6s)
frequently arise in the solution of physical problems. For example, with 0 : arl, the integral
gives 2t times the ayerage value of the function over one period. We may evaluate such
integrals around the unit circle in the complex plane. On the unit circle,
z:reio:eio,
dz:ieiod1+de:4
tz
e.66)
and
cos6:
eie+e-ie:i(..:)
22
(2.61)
Similarly,
sine:
eio-e-io:+G-:)
2i2
(2.68)
These relations allow us to convert the integral over g to an integral over z.
Example
2.18.
Evaluate
fltl
J, i#ade
This integral is not in the form (2.65) because the limits are 0 to r rather than 0 to
: (- sin g)2 : sin2 d] and so
2n.But notice that the integrand is even [sin2 (-O)
["#o':l:,;fu*'*lo" ffio'
:z [" 3--]-at
Jo + sin'P
130
oHAPTER2coMPLEXVARIABLES
Then we convert to an integral over the unit circle, using relations (2.66) and (2.68):
L"
#o'
: : I:"
u.*L.,o'
:
I
:lni,"i."r"
d.z
,.1+(,-:))'"
dz
I
I f
I
iXnitrirrtr7 \-,2 4' 422 '
:2i6 "zr
..ifunir circle z4 - l4zz +
1
Now we can apply the general method for evaluating contour integrals'
Step 1: SeeFigure2.24.
-QZ
Im (z)
Re (z)
FIGURE 2.24. The contour for Example 2.18 is the unit circle. There are four poles, but only two
are inside the contour.
Step 2 is not needed here.
Step 3: The integrand has four poles, given by
zl-t+zl*1:o
,
t4+
JTP= :7 r4J5:7
2
All four roots are real. The values
+6.9282
are
zr,2: +\nT
69282
:
+3.732r
and
zz,+:
LJT-
e9282: +0.26796
131
2.7 USING THE RESIDUE THEOREM
Of these four poles, only the last two are inside the unit circle (Figare 2.24).
Step 4: The relevant residues are
Res/(z:):lim(z-d#
z+23 -' (z
- z)(z - z)(z - z)(z -
where we used the fact that z2
zq)
(z-z+)#
z--,24'
(z - z)(z - z)(z _ zik _ z+)
D
:
45
k?
-
zl)kz
-
z+)
: -zr, and
Res/(z+): lim
Thus, since Z4
-
z4
-
, a
a.
,
\z'4-zi)k+-zz)
-23, we have
23z4l
'
_:______._____________T-_
residues inside
tzl
e
-
z,fi(zz
-
z+)
{zl
-
zl)k+
-
n)
k?
-
z?)
- -Jj
24
-4Jr-0 +4Jr-
Step 5: Applying the residue theorem, we find
I
..
f"
I 3+sin'e
Jo
Let's
see
_
( /3\
Jj
-- 24/I ::. 6
\
I
n
how an integral of this type arises from a physics problem.
-d0:2tri(2i)
Example 2.19. A circular wire loop of resistance R and radius a has its center at a
distance ds > a from a long straight wire. The loop is oriented so that a diameter of
the loop points directly at the long wire, as shown in Figure 2.25.The current in the
long wire is increasing atarate a : dI ldt. What is the current in the loop?
FIGURE 2.25. The physical system in Example 2.19 has a long straight wire carrying current 1 and
a circular wire loop whose center is distance dg from the wire.
132
CHAPTER
2
COMPLEX VARIABLES
According to Faraday's law, the emf induced in the loop has magnitude
:l!ldtJ[n dool
- ldtl
":1491
I
where O is the magnetic flux through the loop. The magnetic field produced by the
wire forms circles centered on the wire, and at a radial distance d from the wire,
D_,'
"o -
unI
27t(t
Now at a point inside the loop,
d:do*rcosd
and
dA:rdrd0
so the flux is
,: ['" fo2n (do,*o'
.,rdrdo
*rcosd)
Jo lo
The integral over
I
is of the form (2.65), so we may convert it to an integral over the
unit circle:
fzolfldz
_)O
Jo @o* rcosd)""t
/^
/unitcircle
:-2i f
,./unit
do+;(r*)),,
I
circre 2doz
+ r(* + Daz
The poles of the integrand are given by the roots of the quadratic in the denominator:
-2do
i. 4al -
+rz
2r
Now since do
> a > r,dsf r >
1. So only one of thetwo poles,
is inside the unit circle (Figure 2.26).T\e relevant residue is
2.7 USING THE RESIDUE
THEOREM 133
and thus
tf2n_)n:
Jo
I
(do*rcos0)
Im (z)
Re (z)
FIGURE 2.26. The integration contour for Example 2.19. Only one of the two poles is inside the
contour.
Then the flux is
6
: ltol ['
'D:
zir
2o
- ,.. _ Fol [d|-o' -du
I, prctr:
z Jot A
wherez:fi-r2.So
ur7 (3-o'
a:-+#l:;,
--ro1 (
un1
4-o' - d.)
(2.69)
and since 1 is the only time-dependent quantity in this expression, the current in the
loop is
: !:11491 - r'ooao
'-R-RltulR [' r
L'
We can check the result by looking at the limit
approximately
a 11 do.We
e = FoI oo, _ ltol oz
2nd0
2do
expect the flux to be
134
oHAPTER2coMPLEXVARIABLES
Now if we expand the square root in equation (2.69), we get
a:
uorao(t
-
ft) =
pordolt
I d2\l
I
- (t _ __
24))
I
/la2\
uol "
: psrds\ZA):;doo
as expected.
2.7.3. lntegrals Along the Entire RealAxis
Closing the Contour
IiS f U> dx may be converted to an integral in the complex plane
The idea is to "close the contour" by adding additional pieces
some
circumstances.
under
zero or some multiple of the original integral along the
is
either
the
integral
which
along
real axis.
The most common way to close' the contour is to add a large semicircle at infinity
(Figure 2.27).For example, the integral
An integral of the form
r*oo
I
I ----i-d*
' - J-*
7r'+o'y"^
r-
may be intepreted as an integral along the real axis in the complex plane:
r
: [
---]--------=az
Jreal axis (22
+ az)'
-RR
FIGURE 2.27. Closing the contour with
a
big semicircle of radius R
-+
oo in the upper half-plane.
Next we note that the integral along a large semicircle at infinity is zero. First look at the
absolute value of the integrand:
llrl
tt_
|'
)''' l -
kt-
l,
orl'
l1+?l
2.7 usrNc rHE RESIDUE
where we used the result (2.13) lzt f- zzl > I Izr I
big enough that lal I lzl < I /J2, and then
4
kl
-
THEoREM 135
lzzl l. We can make the large semicircle
on the semicircle
Then
lr
tt
r
I
;dzl (length of curve)(max
l,/se'icirct" (22 + az)2"* =
value
44n
:(zR)Ro:Rr*0
of linte$andl
on curve)
asR-+oo
Thus,
flftf
C
.d::
J (zz + az)'
I
+
Jrear axis (22
^d:*
oz1'
I
Jsemicircre
: Ifl ----: ^dz*o
Jreal axis (22 + oz)"
Thus, the integral around the closed contour composed of (a) a straight line
fte{qil
axis and (b) a large semicircle at infinity equals the
'g-tt\'
We may evaluate the integral around the closed con
d the
result equals the integral along the real axis.
The integrand has poles at z : X.ia. Each pole is of order 2. Only the pole at z : lia
is inside the contour (Figure 2.27).Using method 3, we find that the residue there is.
Res/(i
' a)
-
lim
z_ia
Idz,, -
ia12
--J--------=
+ az)"
(22
: ,'Yl"adt
k+;"j
-2
:- lim -2
;;" k + ia13 (2t612 Thus, the value of the integral is
and hence
4ia3
136
cHAPTER2 coMPLEXVARIABLES
Integrals of the Form
I jS "'k' f @) a*
Integrals of the form [lS ,ik' 71*) dx, where k is a real number, may be evaluated by
closing the contour with a semicircle at infinity. Fourier transforms (Chapter 7) are important
examples of this class of integrals.
ES fiOx, where k is real and fr > 0.
The first step is to close the contour using a large semicircle, as discussed above.
Onthelarge semicircle atinfinity, z: R(cos0 * I sin0), with0 < d < n. Thus,
Exampte
2.20.
Evaluate
exp
(ikz)
:
exp (jkR cos 9) exp
(-kR
0)
sin
(2.70)
and so
lexp
(ikz)l
-
lexp (iftR cos d) | exp
sincesin0 ispositivefor0 < 0
(-kR
sin
0)
:
exp
(-kR
sin
9) < I
<tr.
Thus, we have
l,["*"on"
ry#
(ength or path) max lintegrandl
"1=
I I
:rRn.Lax lexp fftz)llF
+
< tt
2
Rl=
R4
for R
2n
:Rr-0
The poles of the integrand are the fourth rootsls
exp(in14linr/2),n:0,1,2,3.
I
tl
> 2t/a
asR-+oo
of -1, exp(it14l2nni14)
Of these poles, only two, n
:0
and n
:
inside our contour (Figure 2.28).The residues are (by method 4)
/ik
nes/(er"r+)
: #1.:",",* :'SiI
\
/k
-
:tz"-'
\
*i!'
and
Res/ (e'3"/a)
18See Section 2.1.2
:
'*r(#r'-"
4e9iit /4
1/2exp
/-k
(VZ
(1
4(r + i)
+
\
t)/
:
1, are
- D)
2.7 usrNc THE REsTDUE THEoREM
137
FIGURE 2.28. Contour for the integral in Example 2.20. We close with a contour in the upper
half-plane, enclosing two of the four poles.
Thus,
E "!-o * : f #* -'tf
*, (-
+)(#
*#)
:,,*^,(-+x###.ffix)
:,,*"-(_+)
i
("r,
tA + r-ki /&) + t (r*ira - r-ki/a)
"(
a
k r\
:7r A*p (Jz*\r
(*'
2
[\- 2 /
a*"" a)
We include real integrals of the form /1S sin /c-r f (x) dx una fS cos kx f (x) dx in
this class of integrals, since each trigonometric function is a linear combination o1
"ikx
*6
. The sine and cosine must be treated as combinations of exponentials because
"-ikx
./semicircre/(.r)cos (kx)dx doesnotvanish. If youare surethattheresultoftheintegration
is a finite real value, then you may write
ll* ""0, f
(x)
dx:
*",[: ,ik*
y 1*1
d*
v
and
,fI
,t"
o' f (x) dx:
t-,ff
"ik'
7 7*1
d*
/
138
oHAPTER
2 ooMPLEXVARTABLES
and evaluate the integral as in Example 2.2o.However, if there is any chance that the integral
may have an imaginary part (and this does happen in physics problems; see the section below
on integrals with poles on the real axis), then you must be more careful and evaluate
f
+oo
''-"o,
/
J-Exampre
r
kx
2.21.
.f
i
r+m
a+m
(x)a*: ! I ,'k* f (*)a* + )2J-*
I ,-ik'7{*\d,
2J-*
Evaluate
l::
.ti#dx,whereft
is real.
Since the cosine is an even function, we may assume that the real number k is
positive. We expect this integral to have a real value, but let's check this assumption
by evaluating
coskx ,..
/+-
_
: 1 /*-
(ik{).t.- I /*- exp(-ikx)
d*
+,
p
fudx
"*p
J-- ;rraa' i J-*
J-* *o,
The first step is to close the contour. Look at the first of the two integrals. On the large
semicircle atinfinity, z
:
R(cos0
*i
sind), with0 < 0 <
r.
Using equation(2.70),
we have
lexp (lkz) I
Thus, for R
.
I
-
lexp (i kR cos d) | exp
olJ2,
(-kR
sin
0)
:
exp
(-kR
sin
0) < I
we have
t
lJsemicircle
o rl < (lengrh
'*! \'o'r)
z' I a' |
:
of path) max lintegrandl
zR max lexp (ikz)l
22n
toRn,
l-]--lz'+ a'l
I
0
asR-+oo
Thus,
(ikxl
I exp(ikz) oz
:
J-* ;riFo* fr* z2 + a2
/+-
exp
The integrand has two poles, at z
,
,
: !ia, but only the one at z - *ia
is inside the
contour (Figure 2.29).T\e pole is simple, and the residue is
Res/(ia)
: )y,(z - ia) #
:,ry,# : #
Using the residue theorem, we have
{
Ic+
or:zni(r=!"\:,a ,_oo
+at \2ia /
"*!r,n1)
zz
(2.jt)
I
2.7 USING THE RESIDUE
_R
THEOREM 139
R
-ra
C-
FIGURE 2.29.
lf
f (x)eikx, with k > 0, we close the contour in the
(C+). If & < 0, we close in the lower half-plane (C- ).
the integrand is of the form
upper half-plane
Now we turn to the second integral. Evaluating exp
(-ikz)
over the big semicircle,
we find
exp
(-ikz) :
exp
(-ikR
cos
9) exp (kR sin 9)
and the real exponential becomes arbitrarily largelg as R -+ m. (This is why we
cannot evaluate the integral ofthe cosine directly, but instead have to write the cosine
as a combination of exponentials.) However, if instead we close the contour downward
(Figure 2.29), where n < 0 < 2r, thensin 0 < 0 and the real exponential is bounded
by unity everywhere on this semicircle. Then we can show that
tr
(_ikz)
, -- 0
----; ---;-azl
I
a'
z'
+
lJsemicircle downward
exp
I
I
-+
as R
oo
I
Thus,
/+-
J-*
Now the pole at
-la
(-ikx) ,
I
-7 *7-o' :
e*p
is inside the contour
Res/(-ia) :
exp(-ikz)
fr- ,z a p
,
az
C-. The residue is
e-ikz
,-ikz
ia) ., , , : lim.
*
az
z+-ia
z--+-ra
zz I
z - ia
-
lim.
(z
e-ko
-2ia
lgAlthough we cannot show that the integral on the upper semicircle is zero, it is not necessarily infinite. Once we
for the integral along the real axis using the lower contour, we can use the upper contour
to calculate the (finite) integral along the upper semicircle. While possible, this is rarely useful.
have obtained the result
140
oHAPTER2ooMPLEXVARIABLES
As we apply the residue theorem, we have to remember that it applies to contours
traversed counterclockwise. This time we are going clockwise, so we have to add a
minus sign:
{
h_
exp(-ik.z)
q4oz: -2riRes
-J"r\vo/\ /(-ra)
z2+az
: -2ni( +\ \-zta/
o
a "-oo
which is the same result that we got for the first integral.
Finally, we add the two results to obtain
[.* r:2:#o*
J--
: ].(inr" *Tu*) :I,-o'
The result is a real number. Notice that we could also have obtained this result by
taking the realpart of equation (2.7I).
Jordan's Lemma
Integrals of the type considered above may be converted to complex contour integrals
providedthatthefunction
f (x) --> 0sufficientlyfastasx -+ oo.Theproofthattheintegral
along the semicircle is zero is facilitated by the use of Jordan's lemma:
It f (z)
converges uniformly to zeto whenever 4
rim
t
4*oo JCn
.f
-+
k)eik' dz
oo, then
:
o
where ft is any positive real number and Cp is the upper half of the circle lzl
Note. /(z) converges uniformly to zero if, given any e, there
f Q)l < e whenever lzl > M, no matter what the argument of z.
exists an
:
M
R.
such that
I
To prove the lemma, choose
R(cos0 * I sind).
R > M. Then on Cn, lf k)l <
e and
z:
Reiq
:
arl ,l [" .*r(i kR cos d) exp (-kR sine1 ni ei| ael
t f k)eik' --l
-= l/o
l"/c*"
I rn/2
exp (-kR sine) del
rtol{o
=
I
I
I
Since we need only an upper bound to the integral on the right, we note that sin d > 20 /n
sin e <
/n . This latter
throughout the range of integration (Figure 2.30) and thut ,-kR
"-2kR0
function may be integrated easily to give
llr -
r,rr,'o' o,l
='#
r, -,-oo t
=
T
Since e may be chpsen as small as we like, the integral goes to zero and the lemma is proved.
2.7 USINGTHE RESIDUETHEOREM 141
f(0)
0
o2
o4
08
l0
t.2
1.4
l6
l8
FIGURE 2.30. The graph ofthe two functions shows that sin 0 (curve) is greater than 20 /n (straight
line) throughout the range 0 < I < tr /2. Since the two functions have the same
valueatd :0 and 0 :n/2butthederivativeofsind iscosd andequals lat0:O
while the slope of the straight line equals 2 / n : 0 . 63662, the curve rises above the
straight line as 0 increases from zero, before falling back to the line at 0 : n /2.
Closing the Contour with a Rectangle
Integrands that contain hyperbolic functions do not lend themselves to the methods used
above, since the integrand does not go to zero on the contour Ca, whether closed upward or
downward. However, closing the contour with a rectangle may work. We choose the height
ofthe rectangle to make the integral along the top side ofthe rectangle equal to a constant
multiple20 of the integral along the real axis.
Example
2.22.
Evaluate the integral
/+- I
/-- .**""
If we try to close the contour with
a semicircle, we find that on CR,
,kRcos0 rikRsin0
1
6,-kRcos0 r-ikRsin0
- ,kRcos9 f 9-kRcos0
and the lower bound equals
I
on the imaginary axis (0
:
r l2).
So we cannot show
that the integral along the semicircle is zero. However, notice that on the line
20See Problem
31(c)
for
a slight variant on this theme
142
CHAPTER
2
COMPLEX VARIABLES
z:x+inlk,
coshkz: coshk
(, *rI) :!(ro*r'o a r-*xr-n):
-coshkx
and so the integral along this line from +oo to -oo, parallel to the real axis, equals
the original integral. Thus, we are led to consider the rectangular contour shown in
Figure 2.31.
FIGURE 2.31. Closing the contour with
a rectangle.
In Example 2.22,the upper side is at Im (3)
ir/k.
: i(2nll)n l2k. Only one pole, the one atz
(Figure
2.3I).
On the vertical side x : R, we have
is inside the contour
The integrandhas poles atz
It
lf,o"
r o'l:
I lf'1(/k
I
;kR;tF4;=Fa;-6i
.'rr*.
lJo
:
ni l2k,
I
dYl
r
:tlr'o
,kR lJo ;ir4;-rFn;:6dYl
r
ln
S-k-tO
A similar argument holds for the side at x
f
I
/+-
asR-+oo
: -R.
I
Thus,
4",*r," ainro': J-* *i*o*t
f-a+in/k I
J+*+inrt *rnoro'
f+ooI
.t_dx
- ' J-*
.o.tttr"^
Using method 4 (Section 2.6.3),we find that the residue at the pole is
I :
/ir\
Rtt/ (.;,/ :
,li,Ttrol.tt"h(/.2)
, \:l
I
:!(
n
krinh (ir/2) k \isin
"/2):
-
2.7 USING THE RESIDUE THEOREM
143
and thus the integral is
[**-l xo:!$
"/-cosh
|-o,:
1
/l\
,(2ni)
(nJ:
2 Jreqangle cosh kz
k
it
Z
Integrals with Poles on the Real Axis
Integrals such as
/+@
sin ftx
J-* * -rd*
in physics applications. The integrand is unbounded at x : 2, within the range
of integration, so we must calefully state exactly what we mean by this mathematical
occur
expression.
In our first definition, we simply remove the offending point from the range of integration.
The principal value of the integral is defined to be
/+m
PI
J_*
kx
/ fz-t sin kx -f /"+m sin k-x \
lim I I
"' I
|
-dx: ;-o
\/_m x -2-dx Jz+, x -2-dx)
-2
sin
x
(2.72)
provided that the limit exists.
When evaluating the integral using the residue theorem, we are not allowed to have any
singularities on the contour, so we must deform the contour so as to avoid the pole. For
example, we could put a small semicircle of radius e over the pole (Figure 2.32). Next we
split the integral up into two exponential integrals:
/+-
I
J-*
sinftx
x -2
I / f+*
,
'
2i \/--
exp(ikx)
x -2
_AX_
,
exp(-iftx) , \
f+* _AIl
I
J-*
x
-2
/
-Ar:-1
FIGURE 2.32. When there is
pole.
a
pole on the real axis, the contour must be deformed to go around the
144
cHAPTEB
2 coMPLEXVARIABLES
(In this discussion, we shall take k to be a positive real number.) To do the first integral,
we close the contour upward, as shown in Figure 2.32. (Compare with Example2.20.)The
integral along the big semicircle is zero, by Jordan's lemma, since the function I lk - 2)
goes to zero uniformly on the semicircle as R -> oo. There are no poles within the contour,
and so
f
exoikz
A #dz:o
JCPy <- z
For the second term, we have to close downward (Figure 2.33) so that the integral along the
big semicircle contributes zero.2r
FIGURE 2.33. The contour for the integral of e-ik' 11x - 2) is formed by closing with a semicircle
in the lower half-plane. The path along the real axis remains unchanged.
Now the pole at z
:
2 is inside the contour. The residue is
Res/(2)
:
linl
,-)''
(z
-
2rexPFilcz)
' z-2
:
exp
'
(-2ki)
Thus,
$
JCn-
exp_(-i!z)
1- L
oz: _2rie_2ki
Finally, we have to evaluate the integral around the little semicircle at the pole. On this
curve, z :2 * eei9 , where I varies from z to 0, so
!:*1,"fi0,:]To +t
: ){"t'o - e-izk) ,t
I:
:
2l Refe. to
i sin
(2k)(-r)
th" discussion in Example 2.21.
i
eieiede
2.7 USING THE RESIDUE
THEOREM 145
The principal value is the integral along the real axis, up to the beginning of the semicircle
and continuing from the end of the semicircle, so finally
.
sin k-r
p Ir*@ _dx
_zisin2k: I/+6 sin kx dx
J-* x-2
J-* x-2
:
1
(-2nie-2ki)l
E[o -
Thus,
+oo
sinkx :
n'
vs-'^t + :!(e2ki x -2
2i'
-dx
::G2ki + e-2ki)
2'
: n cos2k
e-2ki
I
In physics problems, however, the physics usually determines the path of integration
around the pole. We do not always want the principal value. More often we need the integral
along a continuous path from -oo to *oo. The pole settles onto the real axis as a result of
some approximation in the modeling process (zero friction,22 for example). In such cases,
a better interpretation of the integral may be
/n*-*tt
sin ftx
lTo/_-*,; * -rdx
where the path of integration passes over the pole,23 as shown inFigure 2.34.
FIGURE 2.34. Integration path that passes above
the pole.
22We
shall have to wait until future chapters for specific examples of this phenomenon. In Chapter 7, Example ?.4,
one pole moves onto the real axis as the damping parameter y approaches zero. The simpler equation may be
integrated directly to show that the correct answer is obtained
if
the path
ofintegration is
a straight line that passes
over the pole at the origin.
23This path is equivalent to a path along the real axis with the pole pushed slightly downward off the real axis.
146
CHAPTER
2
COMPLEX VARIABLES
We may evaluate this integral as we did above, but there is no integral around the little
semicircle. Thus,
1+oo*ie rin1*
}To"/--*,, * -ra*
: tI / r+oo+it exp(ikx)
O,
Uro/--*,, x -2
(-iftx)
/*-*'" expx-2 r"\
;-b/-oo+r'
/
: lto - (-2nie-2ki)f : nr-zki : o "o, 2k - ni sin2k
2i'
_ ti*
which differs from the previous result by having an additional imaginary part -ni sin2k.
On the other hand, the path may pass below the pole (Figure 2.35). In this case, the
integral around the lower curve is zero, but around the upper curve we have
$ Y*or:2nie2ki
1- L
JCn*
and so
-. 1*a-ie sinkx
I / ..
f+"o-i'
exp
(ikx)
Jro/---,. ,-ra': t Urr J-*-,, -=;dx
,
exp (-ikx)r"\
/*--t"
e+oJ-*-;" x-2
/
: !(zoir'ot - 0) : trr2ki : n cos2k * in sin2k
_ ti,,'
2i'
-R
R
FIGURE 2.35. lntegration path that passes below the pole.
Again the imaginary part of this result differs from both of the previous integrals. Thus, the
correct choice of path is crucial if we are to obtain physically meaningful results. We'll see
examples of these different choices in future chapters.24
d.
24See also Problem 36.
2.7 USING THE RESIDUE
THEOREM 147
Integrals of Multivalued Functions
We can also evaluate integrals of the form
fn*
*" f (") d*
where cv is not an integer. The function zo has a branch point at the origin.
branch cut along the positive real axis (that is, 0 < I < 2n),then
zo
-
Ifwe
choose the
lreie;o :Yariao
and so our real integral is an integral along the top of the branch cut where d : 0 and
,iuq - 1. We construct a closed contour, as shown in Figure 2.36, so that the branch point
at the origin and the entire branch cut are excluded from the interior of the contour.
FIGURE 2.36. A contour that excludes the branch cut along the positive real axis.
Example
2.23.
Evaluate the integral
"
, : [* -{-a*.
Jo xz*I
We choose the principal branch of the square root, as above, so 1 is the integral
along the top of the branch cut. Note that the integrand is analytic everywhere inside
the keyhole-shaped contour shown in Figure 2.36, except for poles at the two points
z : *.i. The residues at the poles are
Res
/(*i) :
{tlr
2i
:
and
Res
,Fi
/(-i ) : j;; :
-zt
148
oHAPTER2ooMPLEXVARIABLES
(Remember:
-i :
"3tti1z.,
0 < 0 < 2n for our chosen
branch of the square root function, so
Then, using the residue theorem, we find
f,ftro':zoifz:
oJ2
Along the big circle of radius R, we have
/F
|"fr l- ntr;Vie
l?, +--rl=
.
lnzl -
1
4
s tN/2
R3/2 (l - r/O
>
ror R
2
Thus,
-tgi l.{^ *a.15
*h
2'
R#: *F- \h :
o
Next we investigate the integral around the small circle at the origin. On this circle of
radius e, we set z : eeio i
:
n0
ie3/z
^i3g/2
-->
J",fudo
o
as e
-+
o
Finally, look at the integral along the bottom of the branch cut, where 0
: 2r:
l:ffidr : - L* ffi,, : fi,, :,
fo*
Thus, we have
fr*":
I,oo*"o,* lr-*l",,o,"or"u,
*
Ir":21
:nJ2
and so
fn Ji
':Jo iua':
,J2
z
2.7.4. Dispersion Relations
We have already noted how powerful the property of analyticity can be. In the 1920s,
H. A. Kramers and R. de L. Kronig discovered how to use this properfy to relate the real and
imaginary parts of the dielectric constant of a material, thus deriving relations that relate
the dispersive and absorptive properties of a material in its interaction with electromagnetic
waves. (See, for example, Jackson, Section 7.10.) The Kramers-Kronig relations arc a
specific example of a more general class of relations called dispersion relations. In recent
2.7 USINGTHE RESIDUETHEOREM
149
years, these relations have proved important in other branches of physics as well- for
example, in particle physics.
Suppose a function (z) is analytic everywhere in the upper half-plane. Then the first
"f
Cauchy formula (2.38) allows us to express the value of the function at a point z6 in terms
of an integral around a curye C that surrounds zo:
f
(zo):
(z)
:$ f o,
znt JcZ_ZO
where both zo and C are in the upper half-plane. Now if I f k)l --+ 0 as z -+ oo, we may
choose C to be a large semicircle with its flat side along the real axis. The integral along
the curved part is zero, and so we find
f
I
(zd: 2"i
/+oo f (x\
J-* ;-;a*
Now we let the point z0 approach the x-axis from above. The path of integration must
remain below the pole, so we put a small semicircle under the pole and obtain
r(xo)
: *1,
l:: *dx * ryl:"{k+P\i,," aef
t
I I I/"+oo f (xt
o*+ tzl(xo)l
-_tP
2ni I J-* x-xo
I
Thus,
f (xil:
From this it is clear that
[**
-L,
7t J-a
f (x) o*
X-xO
f (x) has both real and imaginary
parts, and they are related by
(x)l
I f+oo Im t'"'"dx
f
Re[/(xo)]--P
I
7t J_a .r-x0
(2.73)
and
Im [/(x6)]
If+oo
: --P
I
7t J--
(2.74)
Example2.24. The permittivity of any material approaches ee at very high
frequencies,
e(a\
€0
t + fko)
150
cHAPTER2coMPLEXVARIABLES
(a) is analytic in the upper half-plane. lf f : u * i u,where25 f * (a) : f (- a)
and f (o) -+ 0 as a) --> @, express the dispersion relations over the (physically
ard
e
meaningful) positive half of the co-axis.
Since e(a,l) does not approach zero at high frequencies, we work instead with the
function f Qo) : e(a) les - 1. The given condition on / shows that
u(a)
-
iu@;)
: u(-a) * iu(-a)
That is, u is an even function while u is odd. Then, from equation (2.73),
I r+o
u(as):-p
1
Ir J-a
o u(a\
or:lrl[o
'@) a,,]
'(') dr* J0
[** a-@o
I
n lJ-a@-aj
@-ao
I I roo u(-to\
'-'do*l /t+- ufui\dalI
::-Pll
7r LJO -o-@O
JO @-aO I
au(a) ,..
_lo t[*, uQo) [/ I + I \ldo:-P2 |f@-;---:nda
--'T Jo
n Jo @'-@6
\@+oo ,-roo/
and, conversely, from equation (2.74),
u(rr.,6):
f*
-!,7t J-o
'@) dr-
(D-aO
f
-l7t ,l LJO
uGa)
-a-@0
do* [** @-aO
'r'') arf
JO
J
- If* -l- --P
n Jo
a'-(D6"da
2
@ou(o)
These are the Kramers-Kronig relations. Measurements of the absorption properties
of a material, u(ro), determine the dispersion, and vice versa.
2.8. CONFORMAL MAPPING
2.8.1. Definition of a Conformal Transformation
In Section 2.I.3, we noted that complex functions are mappings of the complex plane onto
itself. If the function is analytic, then the mapping has some particularly nice properties.
If f (z) is analytic
at zo
ffid,f'(zo) does not vanish, then the mapping x --> f (a) is
conformal at zg.
Conformal mappings have the following properties:
1. Conformal mappings preserve angles.
25We shall see
in Chapter 7 that this condition arises from the fact that
function of time.
e(a-r) is the Fourier transform
of a real
2.8 CONFOHMAL
2. The magnification is the same for all curves passing through
3. Infinitesimal circles map to infinitesimal circles.
MAPPING 151
zo.
see how these properties follow from the definition. Consider two sets of curves:
constant and h(z) : constant (Figure 2.37a). Under the mapping w - f (z), we
get two new sets of curves in the rl-plane (Figure 2.37b).If we move along the original
curves from z6 by infinitesimal amounts dzt and dzz in the original plane, this movement
Let's
BQ) :
corresponds to the displacements dwy and dw2 along the mapped curves. (Remember that
addition of complex numbers corresponds to vector addition in the complex plane.)
8v-r(ol
FIGURE 2.37b. The mapped curves in
FIGURE 2.37a. Small displacements away
from 26 along the curves
g(z) : constant and h(zl :
the
ur-plane. If the mapping is
conformal, then?t ,the angle
between
constant in the z-plane.
dwt and dw2,
equals d, the angle between
dz1 and dz2.
Then
a*r: {ldz dzt and dwz
lzo
So
for
j :1,2,
df
dz.
dzz
-
the absolute value of dru; is
lawll:
df
dz
ldril
zo
and the argument is
ary(aw1):
+ arg(dz)
152
oHAPTER2ooMPLEXVARIABLES
Arg (dwi)differs
from
arg (d.21)by
the constant
^rr*
between the two curves in the u.r_plane is
0t
:
TE @w)
-
arg
*r( ll ). **,
\d'l'o/
(dw) : aq (dz) -
arg
and so the mapping preseryes angles (property 1)'
If a small circle about zo has radius , : ldzl, then each
a distance
(dz) :
the angle
0
dw on the mapped curve is
rt:ldwl:r
from
u.r6
: f kO,
and so the mapped curve is also a circle (property 3), with magnification
ldf ldzlrol (nroperty 2). Note that this works only
mapped "circle" reduces to a point.
if
df ldzlro is not zero; otherwise,
the
2.8.2. Some Examples of MaPPings
TVo trivial cases
f(z)
: z*
This is
a
a
translation of the whole plane by an amount a (Figure 2.38)'
:
z * a is a translation of the plane through the vector d, whose
components are the real and imaginary parts of the complex number a.
FIGURE 2.38. The mapping w
f (z):
a4
:
Petdl
This is a rotation by an angle a plus a magnification by p (Figwe2,39).
These two examples are not very useful because they do not change the shape of figures
in the plane.
j
2.8 CONFORMAL
MAPPING 153
FIGURE 2.39. The mapping u) : az conesponds to rotation plus magnification.
More interesting and useful maps
w:lnz
Writing z in polar form, z
:
reio , we find
tD:u+iu:lnr*i0
This function maps circles in the z-plane (constant r) to segments of lines in the u.r-plane
(constant u,0 I u < 2r) and maps radial lines from the origin (0 : constant) to lines
parallel to the a-axis (u : constant) (Figure 2.40).
w-Plane
z-Plarle
FIGURE 2.40. The function u = lnz maps circles centered at the origin to straight lines.
2a
z
w
2a
2a
: ---e-Iv
- -rr
(cosd
-
i sind)
:
u
+ iu
154
cHAPTER2coMPLEXVARIABLES
What kind of curves in the z-plane map to straight lines in the ur-plane? The points that
map to the line u : uo satisfy
2a
2a
-lsind-uo+r---sind
ru0
Thus, we have
2a
x:rcose:--sindcos9: -Lu0
sin20
U0
and
y:rsino --2o
sin2g: -Le-cos2o)
UO
UO
Then
:
r
(+)' . (++ t)2:
t
sin2
20 + cos2 2o
^ / *
x'+
(r
These curves are circles centered at
e-plane: circle centered at
FIGURE 2.41. The function
y
:
-a
-
y
a
:
-qf
o\2
^)
/a\2
: (;/
us with radius R
:
laluol figure 2.41).
w-plane: the circle maps to
the straight line u : a6
^
w :2a/z also maps circles
to straight lines. The circle centered at
/ u0 with u6 negative maps to the straight line u
:
u0.
2.8 CONFORMAL
MAPPING 155
2.25. To see how to use this transformation in a physics problem, let the
circle represent a metal cylinder ofradius R at electric potential V, separated by an
insulating strip from a metal plane on the x-axis at zero potential. Find the potential
everywhere outside the cylinder.
We use the transformation w : 2R2 /z.Then the cylinder maps to the line u - -R.
The plane at y - 0 (0 : 0 and 0 : n) maps to
Example
v
:2Rz
rr
sin (o)
: Q and u :2Rz
sin
(n)
:
Q
which is the real axis in the u-plane. Notice, though, that the origin maps to infinity
and the points at ) : *oo map to the origin. The region outside the cylinder is
described by circles of radius lR2/u6 | greater than R. Each such circle maps to a line
u : -'u0, with u6 < R. Similarly, the region inside the cylinder maps to points with
uo > R. The region below the plane (y < 0) maps to u > 0. The physical region of
interest (outside the cylinder and above the plane) maps to the region of the u-plane
with 0 > u > -R. Thus, in the ur-plane, the potential @ equals 0 at u : 0 and @
equals V at u : -R-a parallel plate capacitor! The potential between the plates in
the u-plane is given by the function Q : -V (u/R). We want this to be the real part
of an analytic function (see Section 2.4.3), so the complex function we need is
<D(u):
-vw:iL*
Rt
R
Mapping back to the z-plane, we find that the potential is
V
O(z):,8
2R2 2RV
,:i-s-tv
2RV
The physical potential is the real part of this expression, or
,
Q:2RV 'ine
And the equipotential surfaces (Figure 2.42) are described by
r:
2RV
0-sind
The field lines are described by the function
2RV
COSP
:
Constant
-
Clearly, the first step in using a conformal transformation in a physics problem is to find
the right transformation. This is a nontrivial task! The Schwarz-Christoffel transformation
maps the interior of a polygon with N sides to the upper half-plane (Morse and Feshbach,
Section 4.7). Circular arcs may be transformed to straight lines using the transformation
w : a2 l(z - zp),where zo is a point on the circular arc. A few additional cases are
explored in the problem set.
1s6
CHAPTER
2 COMPLEXVARIABLES
FIGURE 2.42. Equipotential surfaces around a conducting cylinder on a grounded plane (Example2.25).
2.9. THE GAMMA FUNCTION
The gamma function provides an example of a function that is defined in terms of a complex
integral. We will have an opportunity to see how to extend the definition of the function
from one with a purely real argument to one with a complex argument-another case of
analytic continuation.
The original definition of the gamma function (due to Euler) for x real and
f (x) :
fo*
"-'t'-t
x>
0 is
(2.7s)
at
From this definition we can determine the value of the function for integer arguments. First,
f(1)
: lfoo r-' dt :
Jo
-e-tl;"
:
I
Integrating the definition (2.75) by parts yields a recursion relation between
f(x -
1):
r(x)
:
lo*
'-'t'-r at
,'-r1-"-r;l-
-
Io*
o - t)(-e-t)t"-2
dt
f (x)
and
2.9 THE GAMMA
f(x):(x-
l)f(x- l)
FUNCTION 157
(x > l)
(2.76)
Then, when x is an integer,
f(n *
1)
: nl(n) :
n(n
- l)f(n -
1)
: "' :
n(n
- l)(n -2) "'
l(n]-l):nr.
1f(1)
(2.77)
For integer arguments, the gamma function is just a factorial.
An important special case of a noninteger argument is
r fl\ : [* e-trt/2
\2) Jo
To evaluate the integral, we change variables. Let
zau:
. (;) :
lo* "-u'
t:
l*
u2
dt
.Tlten dt
:
2u du
:
1_
e-u-
du: Jr
2tr/2 du. So
(2.78)
Appendix IX for the evaluation of the integral.)
We'd like to extend the definition of the gamma function to all real and complex arguments. The definition (2.75) remains valid for complex arguments z provided that Re (z) > 0
to ensure convergence of the integral, but it fails for.r < 0 because the integral diverges
at the lower limit. There is another difficulty in that the complex integrand lz-t,-t aut
a branch point at the origin if z is not an integer. We can deal with both problems if we
redefine the function as a contour integral with a well-chosen contour. The new definition
must coincide witir the original definition when z is a real number greater than zero. To
achieve this, we put the branch cut along the positive real axis and create a contour C in
the complex /-plane that runs along the top of the cut, around the branch point at the origin,
and back to infinity along the bottom of the branch cut (Figure 2.43).
(See
Im (t)
Re (t)
FIGURE 2.43. Contour for evaluating the gamma-function integral.
158
cHAPTER 2 coMPLEXVARIABLES
Let's evaluate the integral along this contour. The integrand is
f (2,t1:f-tr-r :
7z-t1r-t:
exp (ln
exp [(z
- 1)lnr -
r]
i0,where / : reiq. Onthe tophalf of contour C, the argument0
equals 0, while on the bottom half , 0 : 2n .The integrand differs by exp f2ni(z - l)l :
exp (2niz) on the two sides. Around the little circle lrl : 6,
Recallthatln/
:
ln
r
*
,'-' e-' dt : ['" Gr''1'-'
[
lc.
exp
{-eeio)ierio de
Jo
r2n
- ,' loLi'ei de
-
^2.
c-1"2niz
-
1)
as
* 0
e --+ o
as e
-+ 0 for Re (e) >
0
Thus,
lr*-'r-'
for
x > 0. Then
dt
:
(e2oi*
-
tl
l,
,x-tr-t dt :
(e2ni*
- l)f (x)
we define the gamma function through the integral expression:
f(z):
#-
r(z): -
tf
(2.79)
frt'-tr-'at
or, equivalently,
";e
Jr{-t)'-t
,-' at
(2.80)
so that the old definition and the new definition are consistent where they are both valid.
The contour C is the one shown in Figure 2.43'
The complex function f(z) defined by equation (2.79) has a singularity where
,2niz - 1 : 0, ot l : n, where n is any positive or negative integer' For positive n,
we must use a limiting process, since the numerator is also zero. Let Z : n * 6. Then
e2ntz-t
Jc
tu: a;=lJ;'+,-t"-'at
*
i(n't6) rn-t6-'
ln*,2tr
"-'
d.t)
: u:"',',n:":,- t [* ,,+t-rr-t dt: [ ,n+6-tr-t 4,
e2ftr\n+6t_lJo
Jo
-+f(n) as6-+0
2.9 THE GAMMA
FUNCTION 159
as required. So these singularities are removable. The singularities at negative integer values
of z are not removable; they are simple poles.
Properties of the gamma function are summarized in Gradshteyn and Ryzhik, Section 8.3.
One useful relation satisfied by the gamma function is
f (x)f (l - x):
,o
SIN
JT
-r
from which it follows that
rf1\r(t-1\:
'
\2) \ 2/ sin(r12)
f
/
a
\ 12
(;)l :7'
L.
/l \
:
'-(rr Ji
as we
found above by doing the integral. Also,
.
(j).
-(
'
l\_
('* :) :
*f_\:
\ 2/
-"
So
-7r
\-r)-;/{
\2/
-ir
t_
- _tr6
---vJ.
.>
2 \2/
where we used relation (2.76).
To compute the gamma function for large real arguments, we may use an asymptotic
series due to Stirling26:
rnlf
(x+l)l:ln(x!): )rnrn*(.* j)
'"'-.***...
Finally, we note that the incomplete gammafunctions are defined by the indefinite integrals
l@. x)
f
(a.
tx
: | ,-t ra-t tr,
Jo
A:
f6
J*
,-t ru-t 7,
where Re (cv) t 0.
The gamma function has numerous applications in physics and in statistics.
26This expression may be derived using the method of steepest descent (Optional Topic D).
160
cHAPTER2coMPLEXVARIABLES
PROBLEMS
fl
ff zr
:
5
+ 2i
and z2
:
3
4i, find zr/ zz and zt x
-
zz.
@)Use the polar representation of z to write two expressions for z3 in terms of
your result to express cos 30 and sin 30 in terms of cos and sin 0.
r and 0. Use
I
3. Prove De Moivre's theorem:
(cos0
* i sind)' :
eosn)
I i sinn?
4. The equation (y - yd2 : 4q(x - xs) describes a parabola. Write this equation in terms
of 7 : a I iy. Hint: Use the geometric definition of the parabola.
[-
Strow that the equation
lz-cl+lz-dl:q
represents an ellipse in the complex plane, where c and d are complex constants and cv
is a real constant. Use geometrical arguments to determine the position of the center of
the ellipse and its semi-major and semi-minor axes.
6.
Show that the equation
z:
aeiQ
*
be-iQ
represents an ellipse in the complex plane, where a and b are complex constants and @
is a real variable. Determine the position of the center of the ellipse and its semi-major
and semi-minor axes. Hint: Use the geometric interpretation of arithmetic operations in
, Section 2.1.I to determine the location of the axes.
7t-find c// solutions of the equations below Show your solutions in a plot of the complex
\-,/plane.
(a) zs
: -1
(b) za
:
16
E. Find all solutions of the equations below.
(a)
cosz:
100
(b)
sinz:6
: -5.
S
10. Find all complex numbers z such that z : ln (-5).
11. Investigate the function w : 1/ JZ.Find the functions u(r,0) and v(r,0), where rr) :
u I iu. How many branches does this function have? Find the image of the unit circle
fina all solutions of the equation
cosh z
under this mapping.
12. T\e function w(z) : zr/a . Find the functions u(r,0) and u(r, 0), where w : u * iu.
How many branches does this function have? Find the image under this mapping of
a square of side 1 centered at the origin.
E}l OUtate spheroidal coordinates u, u, w are defined in terms of cylindrical coordinates
p, Q, z by the relations
p
*iz:
ccosh (u
-liu), w:0
Show that the surfaces of constant u and constant u are ellipsoids and hyperboloids,
respectively. What values of u and u correspond to the z-axis and the z : 0 plane?
I
l
PROBLEMS 161
14. AnAC circuit contains acapacitor C in series with a coil with resistance R and inductance
L.The circuit is driven by an AC power supply with emf e : eo cos ot.
(a) Use Kirchhoff's rules to write equations for the steady-state current in the circuit.
(b) Using the fact that cos arl : Re (ei't),findthe cunent through the power supply in
the form
1
and show that 16
-
es
f
Z interms of l, R, C,
:
Re (Ioei.r)
Z, where Z is the complex impedance of the circuit. Express
and a.
(c) Use the result of (b) to find the amplitude and phase shift of the current.
(d) How much power is provided by the power supply? Show that the time-averaged
power is given by
p
with e
:
eseiat and 1
15. Small amplitude waves in
-
:
(esf
1ne (e/*)
22
- 1*" (r.r)
Z)ei.t.
a plasma are described
by the relations
0nA (nsu):g
^ *^dx
dt
AE
to- : -"n
and
^!At
: -eE -
mvu
where n6, e,m)v,and e6 are constants. The constant u is the collision frequency. Assume
that n, E, and u are all proportional to exp (ikx - iart). Solve the equations for nonzero
n, E , ard u to show that ar satisfies the equation
,2 + ir, :'o"'
:
m€o
,'n
where at, is the plasma frequency. Solve this equation to find the frequency ar and hence
---, show that collisions damp the waves.
[re.)wi."therealandimaginarypartSUanduofthecomplexfunctions.1
\,/
(a)
.f
: z2 sinz
(b)
/ : ;!
l+z
.t M )'
n 1,'^:-*,7 uL{
.I'tr4'/!)
)'i'(''';'"'f
t,
/\
:-r' +t "/
In each case, show that z and u obey the Cauchy-Riemann relations. First find the
derivative d.f ldz in terms of .r and y, and then express the answer in terms of z. Is the
result what you expected?
)
162
cHAPTER
2 ooMPLEX vARIABLES
The variables x andy in acomplex number
and its complex conjugate z*:
Z: x +iy
may be expressedinterms of
Z
1
16:;(z*z*)
z
|
,-.
Y:
- rlz-z*)
Show that the Cauchy-Riemann relations are equivalent to the condition
0f _n
0z*
.,<\
x3 ^ is thereal part of an analytic
^
xy2
On. of the functioos u1 : 2(x - flz anduz:
Q!
T function w(z) : u I iu. Which is it? Find the function u(x, y) and write ro as a function
of z.
19. A very long cylinder of radius a has potential V on one half and
The potential inside the cylinder may be written as a series:
-V
on the other half.
sir[2(2n + 1)0]
2n -11
Express each term in the sum as the imaginaly part of a complex number, and hence
sum the series. Show that the result may be expressed in terms of an inverse tangent.
20. The function -f
:
sin (22)
{t"" Example 2.10) also has a zero
a1a
: Ji.
What is its
order?
series for the following functions about the point specified. In each case,
determine the radius ofconvergence ofthe series.
21. Find the Taylor
ka)lzcoszaboutz:0
(b) ln (1 * z) about z :
(c)
0
sln z
raboutz:Tl2
z
(d) ''. about;:2
z"-r
1
22. Determine the Thylor or Laurent series for each of the following functions in the
immediate neighborhood of the point specified. In each case, determine the radius
of convergence of the series.
cos z-
(b)
z-l
,)
sln z-
aboutz
-
I
z
(c)
ez
-aboutz-0
about z. : ir
z-in
I
PROBLEMS 163
(d)
lnz
z- I.aboutz:l
(e) tan-l (z) about z :0
23. Determine allTaylor or Laurent series about the specified point for each of the following
functions.
eZ
(b)
(c)
-aboutz:3
-:
zz -9
(d)
" zz* +9
1
A.
about the orisin
zz+l
- I
'=-aboutz:i
zz+l
about the origin
Find all the singularities of each of the following functions and describe each of them
completely.
I
(a)
-ez --sincos z
sln z
(b)
477
.
tanhz
. . -(c)
z
(d) h(1+22)
25. Incompressible fluid flows over a thin sheet from
a
distance Xs into a corner, as shown in
:
i :
the diagram. The angle between the barriers is n f 3, andat x
X0,
Voi. Assuming
that the flow is as simple as possible, determine the streamlines and plot them. What is
the velocity
atr
:
Xo13,0
:
n16?
PROBLEM
25
E6.l Prove the Schwarz reflection principle: If a function
the real axis and /(x) is real when x is real, then
f*(z):
f(z*)
/ (z) is analytic in a region including
164
oHAPTER2ooMPLEXVARIABLES
Show that the result may be extended to functions that possess a Laurent series about
the origin with real coefficients.
(a) Verify the result for the function f(z) : cosz.
(b) Verify the result for the function f (z) : tan-l(z).
(c) Show that the result does not hold for all z if f(z)
:
lnz (the principal branch is
assumed).
27. Find the residue of each of the following functions at the point specified.
=1
atz:r
til #
<. -r
/1 I\ atz:O
\z- - /I
(b) exp |
(c)sin"z
at the ongrn
COS Z
(d)
'l/2
atz:116
-srnz
Evaluate the following integrals.
-[ 2Ydr,where C is a circle of radius 2 centeredat the origin
6 ) Jc
z
l0 ,{ fior,where c is a square of side 4 centered at the origin
q ,{ f:}ar,where C is a circle of radius 1 centered at the origin
@
,4
ff
uor,where
c
is a square of side 1 centered at the point z
:
(l + i) 14
29. Evaluate the following integrals.
@
(o)
f2n I + cos7
Jo z_rr*-do
f1T
sinz 0
Jo 1 +
f2n
(c)
" I
cosz
ede
I
lo l+sin'e
t(ol /" sinz, e de
Jo
-d0
30.-Evaluate each of the following
/+- I
,u)
J__ x4rdx
n(bl f+oo x
I
v P J-*
x5 ll
, -1 /n*- cos-dx
arx
G)
xzandx
J__
A
/+oo
_r
sinx
(g /_- xz +2x +2dx
integrals.
l
l
PROBLEMS 165
6/Ur" a rectangular contour to evaluate the integrals.
.1 r+- eo*
(91
where b is real and 0 < Re (a) < b
J-* t a ru,-dx'
t2- /+- sinh a-x
(V
sintr4o*dx
!l
J-*
1+m
xz
R /. *rt,*d'
32. Ev)luate the integrals
(")
/+/
x3/z
,rjdx
f@ xlll
(b) I
Jo x"+I
-dx
b3.'l Evaluate the integral
byintegratingoverapie-slicecontourwithsidesatd:0and
atf - r/N,0 < r < oo.
34. Evaluate the integral
fo*
""
a"
along the positive real axis by making the change of variable u2 : -ix2. Take care to
discuss the path of integration for the u-integral. Use the Cauchy theorem to show that
the resulting a-integral may be reduced to a known integral along the real axis. Hence
show that
(The result has numerous applications in
physics-for example, in signal propagation.)
35. The power radiated per unit solid angle by a charge undergoing simple harmonic motion is
dP
_-Ksinz0
dA
Q+pcosdsina-l/)s
where the constant K : e2cfla l4na2 and F : aatlc is the speed amplitude/c (see, for
example, Jackson, p. 701). Using methods from Section 2.7.2,performthe time average
over one period to show that
/dP\\
/ _
\ao/
K ,^
4+B2cos20
8 ""'" (t-prrorro),/,
_ _-i^Lo_
166
oHAPTER2 ooMPLEXVARTABLES
36. Langmuir waves. Waves in a plasma may be described by a wave form n :
n1exp(ikx - iri;) (compare with Section 2.I.4), where the relation between ar and
k (the dispersion relation) is given by
o:
4 [**
k J-*
I -F
3f (u)/ou
a-ku
o,
where roo is the plasma frequency ne2 f esm and/(u) is the one-dimensional Maxwellian
f (u) : tl
I m
*r;,
exe
/
mu2\
\- 2W )
Notice that the integrand has a singularity at u : co I k, which is on the real axis, if ar and
k are both real. Landau showed that the integral is to be regarded as an integral along
the real axis in the complex u-plane, and that the correct integration path passes around
and under the pole.
(a) Show that the integral may
be expressed as
(ut
If+@ af" "/au dt,:P
J-* a-ku
I
l
1
l
(Compare with Section 2.7.3,page I43.)
(b) Evaluate the principal value approximately, assuming a/k
)) v7 :
l
JEpT/m.
a series in powers
of ku la. Neglect the small effect of the pole, and find the frequency a-l as a function
of k. Now include the imaginary part due to the pole at o I k. Show that the wave is
damped. The result has been confirmed experimentally.
Hint:Firstintegrate by parts, and then expand the denominator in
(c) How would
the result change if the path of integration were to pass over, rather than
under, the pole?
Fil
ts the mapping
, :
z2
conformal? Find the image in the u.r-plane of the circle lz -
il :
I
in the z-plane, and plot it. Comment.
38. Is the mapping w
(a) the
x-axis
:
z
*
(1/z) conformal? Find the image in the u-plane of
(b) the
y-axis
(c) the unit circle in the z-plane
a cylindrical bump of radius 4 on it. The second plate is a distance
a away. One plate is maintained at potential V , and the other is grounded. Find the
potential everywhere between the plates.
A capacitor plate has
d
))
39. Show that the mapping z : tD * eu is conformal except at a finite set of points in the
z-plane.Aparallelplatecapacitorhasplatesthatextendfromx: -l tox : -oo.Find
an appropriate scaling that allows you to place the plates aty : ls . Show that the given
transformation maps the plates to the lines u : ln . Solve for the potential between the
plates in the ru-plane, map to the z-plane, and hence find the equipotential surfaces at
the ends ofthe capacitor. Sketch the field lines. This is the so-called fringing field.
PROBLEMS 167
@ f*o
conducting cylinders, each of radiu s a, ate touching. An insulating strip lies along
the line at which they touch. One cylinder is grounded, and the other is at potential V.
Use one of the mappings from the chapter to solve for the potential outside the cylinders.
: I I Q - 2) maps to straight line segments
(a) lz - 4l :2 with endpoints at z - 3 + Jii
(b) lz - (2+i)l: l withendpoints atz:3 *l and z: I*i
42. Show that f(x) < 0for -1 < x < 0.
41. Show that the mapping w
the arcs
f (z) is analytic and bounded in a region R, lz - zol = R
< M on the circle lz - zol: r, then the coefficients in the Taylor series
E3j Prove Cauchy's inequality:If
and if l/(z)l
expansion of
/
about z6 (equation 2.44) satisfy the inequality
M
," 1V
Hence prove Liouville's theorem:
If
f (z) is analytic and bounded
in the entire complex plane, then it is a constant.
44. A function f (z) is analytic except for well-separated simple poles at z
and zn
0. Show that the function may be expanded in a series
*
N
f(z):/(o)+;",(l+
\z'
,1
f
Zn,n
: I-
N,
t
Z-zn/)
where a, is the residue of at zr. Is the result valid for
H int : Ev aluate the integral
Ju
:
N -+
oo? Why or why not?
(r)_4.
2tti Jr^ a, - rl
: t t
f
where C1s is a circle of radius R1,' about the origin that contains the N poles. You may
assume that I f (z)l < eRlr on C1s for e a small positive constant.
CHAPTER 3
Differential Equations
Any physical situation can be described by a mathematical problem-usually one involving
the solution ofa differential equation subject to a set ofboundary conditions. Thus, solving
a physics problem frequently reduces to solving a differential equation. Because we are
concerned with a real physical situation, we expect a solution to exist. In this chapter, we
shall review some methods for solving differential equations that occur in physics. Much of
the rest ofthe book is devoted to developing additional techniques. A complete discussion of
this topic would require more space than is available here. For important theorems that prove
the existence and uniqueness of solutions, see, for example, the texts by Ince or Murray and
Miller. Interested students should refer to additional texts listed in the bibliography.
3.1. SOME DEFINITIONS
A differential equation is an equation for a function / ofone or more variables that includes
derivatives of / with respect to one or more of those variables. The order of a differential
equation is the order ofthe highest derivative that appears in the equation. Thus, an equation
of the form
d2t,
dl
+3--:- +2,'t :6
--+
dxt
dx
(3.1)
is a second-order differential equation for the function y(x).
Equation (3. l) is also a linear differential equation, because each term in the equation is
a
linear function of y or its derivatives. On the other hand,
dy _a
'dx
(3.2)
is a nonlinear differential equation, since it has a term with a product of y and y'.
These two equations are ordinary differential equations, which means that the solution
y is a function of the one variable x. When we have a function of more than one variable,
such as y (x , t) , and the equation involves partial derivatives with respect to both x and t ,
169
170
cHAprER 3
then we have
a
DTFFERENTTAL EeuATroNS
partial dffirential equation. These equations will be discussed later. (See
also Appendix X.)
Equations (3.1) and (3.2) are also inhomogeneous, since each equation has a term on
the right-hand side that is independent of y and all its derivatives. On the other hand, the
equation
d2v
.dv
-=1+3x'-il2y:o
dx'
clx
(3.3)
is a homogeneous equation. It is also linear. However, unlike equation (3.1), which
has
constant coefficients, equation (3.3) has a nonconstant coefficient, with x2 multiplying the
term in dy ldx.
3.2. COMMON DIFFERENTIAL EQUATIONS ARISING
IN PHYSICS
Some of the common differential equations that arise in physics are described below.
3.2.1. Newton's Second Law
Newton's second law, i : md, is a second-order differential equation for the position of
a particle. If the force is constant, such as the gravitational force on a particle near the
Earth's surface, then the equation is very simple. With the y-axis chosen vertically upward,
we have
m
d2y
or2
:
-m8'
This equation is quite easy to solve, so we may use it to draw some general conclusions.
We may integrate twice to obtain the solution:
l)
y:y0+-:dv t- t8'dto
(3.4)
There are two integration constants, 1lo and dy /dtls : u0, that must be specified to complete the solution.
The functions yr?) : I and yz1) : / are solutions of the homogeneous equation y// : Q.
Thus, the general solution is a linear combination of these two functions pllus a particular
integral-a function that satisfies the inhomogeneous equation. Here the particular inte-
gralis -!rgt2. Oft"n we obtain the values of y6 and dyldtls from the specified initial
conditions-the values of position and velocity at t : 0.
- Nowletussupposethatthereisairresistancethatisproportionaltotheparticle'svelocity:
Fa, : -ai. Then we get
d2v
m--:;:-m8-q,
dt'
dv
dt
(3.s)
This is a second-ordeq linear, inhomogeneous differential equation with constant coefficients. We shall solve this equation in Example 3.2.
3.2 COMMON DIFFERENTIAL EOUATIONS ABISING IN PHYSICS
171
3.2.2. Simple Harmonic Motion
If the force applied to a particle is given by Hooke's law and if y describes the displacement
of the particle from its equilibrium position, then we have
d2y
m7V
: -kt
(3'6)
which is a second-order, linear, homogeneous differential equation with constant coefficients.
If the system also has a damping force proportional to velocity, then the equation becomes
d2v
fll-:
dtz
d2y
dP
where
2q
: y lm *rd ,4 :
dv
'
dt
+2oQ *of,y:g
(3.7)
k/m. This is the equation for a damped harmonic oscillator
3.2.3. Bending of a Beam
A beam will bend when a load is placed on it. Let the coordinate x run along the undisplaced
beam, and let y(x) be the downward deflection of the beam's center line. The elastic properties of the beam are described by the Young's modulus E of its material. Its response to
applied loads is determined by the shape of the cross section, as measured by what engineers
call the moment of inertia 1 of the cross section about the long axis.l Then, with origin at
the left end of the beam, the deflection
y(x) may be computed
as
follows:
1. The beam bends as a result of the net torque acting on it. In equilibrium, torque due to
the internal stresses balances the external torques. Torque balance is expressed by the
differential equation2
d2v
I
:
--+
dx2 --m(x\
EI
(3.8)
where rn(x) is the net counterclockwise torque of all forces acting to the right of point
r. The minus sign arises because y increases downward.
2. The torque is due to the shearing force:
dm
_-f(x)
(3.e)
dx
where r (x) is the sum of all vertical (downward) components of forces acting to the right
of point x.
lLet x' , y' z/ be coordinates with origin at the centroid, with y/ (vertical) and z/ (horizontal) lying in the beam
,
I:
cross section. Then
[(y')2dy' dz'.
2This is the Bemoulli-Euler law See, for example, Long, pp.
9l-94.
172
CHAPTER
3
DIFFERENTIAL EQUATIONS
3. The shearing force results from the load per unit length:
dt
'
clx
Here the minus sign appears because t is
integral.
4. Putting these equations
: -q(x)
fi
q(x') dx' ,where x is
(3.10)
the
lower limit of the
together gives us the differential equation satisfied by the beam:
d4t,
dro:
1
(3.1 1)
EI(t@)
where q(x) is the load per unit length along the beam. This is one of the few equations
in physics that have order greater than2.
3.2.4. Electric Circuits
Applying Kirchhoff's laws to an electric circuit gives an equation for the charge on a capacitor or the current in the circuit. For a single-loop L RC circuit (Figure 3.1) with appropriate
choice of the circuit variables for charge q and current,3 we find
r#*o#*[:e
(3.12)
where t is the applied emf. The circuit is a driven, damped harmonic oscillator. Compare
equation (3.12) with equation (3.7).
C
FIGURE 3.1. Circuit diagram that leads to equation (3.12). The algebraic variables
i
and
q
are
defined here.
Multiloop circuits give rise to a set of coupled linear differential equations. In all such
problems, particular care must be taken in the definition of the variables and the consequent
relations among them.
3See, for example, Lea and Burke, p. 990.
3.2 COMMON DIFFERENTIAL EOUATIONS ARISING IN
PHYSICS 173
3.2.5. Diffusion
Processes involving many random interactions ("collisions") are usually described by
a diffusion equation. Heat conduction is an example of this kind of process. Heat transfer along a rod of cross-sectional area A, for example, is described by the heat flux
H : -kA 0T l0x, where k is the thermal conductivity of the rod's material. The minus
sign indicates that heat flows toward lower temperature. The rate of change of temperature
in a segment of length dx depends on the heat capacity of the segment and the imbalance
of heat flow into and out of that segment. Heat flow out of a segment of length dx reduces
that segment's temperature:
H(x -f dx)
- H(x) : -*r 4*!At
where m is the mass per unit length and c is the specific heat. Dividingby dx, we have the
differential equation
AT
AH kA_
A2T
'-'^
0x
0x2
AT KA A2T_n A2T
"'- 3t
dl
a-^a
mc dx'
dx'
(3.13)
(3.r4)
:
D kA lmc is called the diffusion coefficient. This is a partial differential equation,
since it contains partial derivatives with respect to r and r. The solution is a function of the
two variables x and t.
where
3.2.6. Waves
The wave equation relates the second time derivative of a function to its space derivative:
32y ,o2y
atz-" uz
where u is the phase speed of the wave. If we assume a solution of the form
Re ei't y1x\ (compare with Chapter 2, Section 2.1.4),then we find
(3.1s)
y(x,t) :
"
"d2v
u'-i
-@'Y(x):
dx'
d2y
dr, lk2y:g
(3.
r6)
where k2 : @2 /u2. We have reduced our partial differential equation to an ordinary differential equation known as the Helmholtz equation.
L
174
cHAprER 3
DTFFERENTTAL EeuATtoNS
3.3. SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS
A linear, homogeneous, ordinary differential equation of n th order has n linearly independent4 solutions. The general solution is a linear combination of the n independent solutions.
Thus, to completely specify the solution, we require n pieces of information (boundary
conditionss). For a second-order differential equation, we may know the value of both y
and yt ar one boundary (say x : 0, in which case we have an initial value problem) or
perhaps we may know the values of y at two boundaries (say x - 0 and x : l). In this
section, we shall focus on finding the two linearly independent solutions of a second-order
equation.
3.3.1. Equations with Gonstant Coefficients
The easiest class ofdifferential equations to solve is linear, ordinary differential equations
with constant coefficients, so we shall start with those.
Homogeneous Equations
A linear, homogeneous differential equation with constant coefficients, no matter what the
order, is solved by an exponential function y : et' , where s is a real or complex constant.
Simply substitute this solution into the differential equation to obtain an algebraic equation
for s. Each distinct solution for s corresponds to a linearly independent solution of the
original differential equation. Since a second-order equation gives a quadratic for s, a thirdorder equation gives a cubic equation, and so on, we can expect to find n solutions to an
nth-order equation. (The case ofrepeated roots is discussed below.)
Solve equation(3.12) with t : 0 and with the initial conditions 4
Q att :0 and the current I : dqldt : 0 at / : 0.
Substituting the trial solution est into the equation, we find
Example
3.1.
[.s2sst
a
Pssst
:
*Tett -0
,
1
and so
Ls2
+
Rs -l-
1
-C -0
(3.t7)
which has two solutions:
: -- R +
-2L
.S-L
4A set of
: -a tia
N functions ya(.r) is linearly independent if there are no nonzero coefficients an that make
DI:t onyn: 0 for all values of x. See Chapter 1, Section 1.5.2 for a related concept applied to vectors.
5In the particular case of the variable t (time) with conditions specifi ed att :0, the boundary conditions are called
initial conditions.
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 175
where
R
2L
and
.
-:
.,2 _
r I/R\2
_
LC- \n)
I
it Rl2L > IlJIrC, and both solutions decrease
with time. Butif Rl2L < llJLC, then the roots form a complex conjugate pair
with a negative real part. Since the complex exponential may be expressed in terms
Both roots are real and negative
of sines and cosines, these solutions are damped oscillations.
The general solution is a linear combination of the solutions e"+' and et-t
q
:
exp
(-at)[Aexp (iot)
f
B exp
:
(-iat)]
where the values of the constants A and B are given by the initial conditions. At
we have
/
:
0,
q(0):Q:A+n
and
I:+ldt
l,:o
:o:(-cu *ia)A+(-d
-iro)B
So
B:.4-oli'
ulia
and
/
-l- iro\
2io +-4: O r
\
O:-41 ll- -a
l--4=ll-i-l
u*ia
u*ito
2\
o/
\
/
cY
Then
B:e-A-f,Q+f)
The solution is then
n
:
3,-"'lQ - ,z),''' * (t *,!),-'''f
:
Qe-at
(.orrt +Y"inrt)
A special case arises lf RlzL
: I/J
LC. The
square root equals zero, andthere is
- R /2L. Thus, we find only one
only one solution to the quadratic equation for s: s
:
176
cHAprER 3
DTFFERENTTAL
EeuAroNS
:
:
e't .We look for a second solution to the differential equation,6
u(t)e't , and substitute into the original equation to obtain a new
differential equation for the function u:
solution for q, q1
ofthe form q2G)
dqz
dt
: utett *
st)ett
and
#
:
t)"e't +2su'e't + s2ue't
Substituting into the differential equation gives
Le't (u"
+
2su/
+
szu)
+
e't R1u'* su)
1
|
l\
:
o
Lu"+u'(2sL+n+(rs2
'+Rs+e)r:O
\
The coefficient of u is zero (equation 3.17). Then, using our solution for s, we have
Lu"+rlt(#) r+Rl ro:o
u":o
Thus, the solution for u is v : A + Bt. So the second, independent solution for 4
is te't . We may readily extend this method to the case where the equation for s has
a root repeated n times (see Problem 2).
Inhomogeneous Equations
To find the general solution to a linear, inhomogeneous differential equation, we first find
any solution of the inhomogeneous equation [the particular integral I p@)). This is not as
hard as it might seem, because we can often find the particular integral we need by setting
one or more of the derivatives to zero. Next we
ofthe homogeneous equation. The general sol
combination of y1 ard y2:
y(x):!p(x)|-ct
We can adjust the constants c1 and c2 to satisfy
3.2. Find the solution to equati
resistance) with the initial conditions y - y
First we find a solution yr (/) to the inhomo
tive to zero. (This means that the particle r
Example
6This is a general method that we discuss in more detail in
]
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 177
gravitational force is balanced by air resistance and the acceleration is zero.)
0:
-ms
-
"+dtq + yr(t): -^8 ,
Since the homogeneous equation is linear and has constant coefficients, its solution
is of the form e", where s satisfies the equation
*r2 +os :0 f
Thus, the general solution
s
:0,
m
yn(t) to the homogeneous
equation is
yHQ):A+8.*p(-"t)
\ m/
Next we add y7 to yp to obtain the solution to the complete equation:
!(t) : tn t
Y1
-
A+n
"*P
Finally, we make use of the initial conditions ),
the constants
A
and B:
(-o t) - I€t
\ m,/ a
-
),0 and dy
ldt :
us to determine
y0:y(0):A+B
and
dv
+
dt
a
mp
---B----::uolB:-[-l
"
m
d
t:o
rmr2
m
g--uo
\a/
a
Then
t mt,2
m
A:!o+(;,)
B*;uo
Thus, the solution is
y(t)
: * * (:)" l' - *n (-;')] *T @[r - "*n (-I,)] -',)
Check the limit as cv -+ 0 and convince yourself that we get back the solution (3.4).
What happens as a becomes very large?
3.3.2. Linear Equations with Nonconstant Coefficients
When the coefficients in a linear equation are not constant, the solution is no longer a simple
exponential. Before proceeding to a general method, we need to look at some properties of
these equations.
178
cHAprER3DTFFERENIALEeuATtoNS
Singular Points
The behavior of the solutions to a homogeneous differential equation is easiest to analyze
when the equation is written in standardform, where the coefficient of the highest derivative
is unity. For example, the standard form of a second-order equation is
dz,t
dv
dx
d)('
-+f(x)-i*g(x)y:o
Then the values of
y
and dy I
dx at any point determine the value of the
d2v
ji:
-f
A)
dv
dx
(3.
l8)
second derivative
- sk)v
By differentiating this expression, we may obtain the value of all higher derivatives that
exist.
The singular points of the differential equation are those points that are singularitiesT
of the function f (x) or the function S(r).
All other values of x
y
and dy /
are regular points of the differential equation. At a regular point,
dx may take on any values, and the resulting value of the second derivative will be
finite. We may then use the differential equation to compute the higher-order derivatives, and
hence find a Taylor series for y about the regular point. Providing the series has a nonzero
radius ofconvergence, the solution exists.
In contrast, at a singular point, y and dy/dx may take on only a set of special values.
The set may be empty (that is, no values exist). Thus, if we are given values of y and dy /dx
at the singular point and they are not in the set of special values, the Taylor series will not
exist.
The equation
d2v
*7?*2Y:o
has a singular point at x : 0. [In standard form, /(x) : 0 and 8(r) : 2/x,whichhas
a pole at x : 0.1 In order for the second derivative to exist at x :0, y(0) must be zero.
In fact, y must go to zero as r -+ 0 at least as fast as x. The value y(0) : 1 (or any other
finite nonzero constant) is not in the set of special values for this equation, and so there is
no analytic solutions with y(0) : 1.
If the equation is linear, the singular points are fixed. But the singular points of nonlinear
equations also depend on the value of y (and perhaps its derivatives). For example, the
TSee
Chapter 2. Section 2.5.3.
8See Problem 5.
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 179
equation
,#*(#)':,
has a singular point where
):
(3.19)
0. The solutione to this equation, with initial conditions
+ yil, and thus the singular point occurs at x : -yo/2y'o,
a value that depends on the values of y and yt at x : 0. This variability of the singular
),0 and y6, is y : {iQfi
points accounts for much of the challenge in solving nonlinear equations.
The Wronskian
If y1 and y2 are both solutions of
a linear, homogeneous differential equation, then any
By2
is
also
a solution.
|
TheWronskian of the two solutions is
linear combination Ay1
w(yt,
y) : lDL -
vzvl
(3.20)
The two solutions )1 and y2 arelineaily dependent if there are nonzero coefficients A and
B such rhat Ay1 * Byz : 0 for all values of x. In this case, the Wronskian is identically
zeto.
Now observe that
dW,
;:YtYi-YzY't'
We may evaluate this derivative using the differential equation (3.18). Then
# : ytGfyL - sJz) - vzGfvi- Blr)
:-.fw
(3.21)
This is a linear, first-order differential equation for W. To find an expression for 17, first
rewrite equation (3.21) as
1dw
Wdx
d
*(lnw): -f
and then integrate to obtain the solution:
w:w(xo)*n(\ou
l'"feoe)
can verify this solution by direct substitution into the equation. See also Problem 12.
(3.22)
180
cHAprER
s
DTFFERENTTAL
EeuATroNs
Since the exponential function is never zero for real arguments, V[ is zero for all x if it is
zero at a{.r : .r0; otherwise, it is never zero. When the Wronskian is not zero, the two
functions 1lr and y2 are linearly independent.
Equation (3.22) shows that I4z is a specific function of x. We can evaluate it by using any
form of the solutions 1,r and y2 or by performing the integration in equation (3.22).
Once one solution yr (.r) of the original differential equation is known, we can use the
Wronskian to find a second, linearly independent solution. Note that
d / vr\:
d. \n )
v'vl - vlvz
:7,w
---T-
Thus,
y2: yt [ \oJyi
(3.23)
While equation (3.23) provides a formal solution, it is not always a useful one, because the
integration may be difficult. It does prove, however, that a second-order differential equation
posseses two solutions.
3.3. One solution to the equation y" - 4y' * 4y - O is eb (compare with
Section 3.3.1). Use equation (3.23) to show that the second solution is xeb .
For this equation, the function f (x) = -4, and hence, from equation (3.22), the
Wronskian is
Example
w : w(o)"*n
(lr'
+ a6)
:
woe+'
Then the second solution is
rz',):
r' I
ffidx
: e2'wo
I
o'
: wsxe2*
and thus the second solution is xe2' . The constant W6 is arbitrary.
Method of Variation of Parameters
If one solution yr (x) of the differential equation has been found, we may search for a second
solution of the formlo yz(x) : u(x)yt@). This yields a second, and we hope simpler,
differential equation for the function z(.r).
low" did thi. in Example
3.1.
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
Example
3.4.
One solution to the Legendre equation
(I
is y(x)
EOUATIONS 181
. d2v
- *')li -
r*
dv
*
t2y
: Pr(x) : x. (Verify this by substituting
: xu(x).
:0
into the differential equation.)
Search for a second solution yz@)
Differentiating, we find
dYz
;:u*xu'
and
d'y, ^t,
+xu ,
dx2 -zu
Now substitute into the differential equation:
1l
-
x2112u'
* xu") x1r
2x(u
I
xu'I
- xzSu" + 2(l -
l2xu :
o
:
o
2x27u'
which is artr$-otdet equation for u/:
u" _ _, l -2x2_
-x(l-xz1
ttt
This equation may be integrated as follows. Let x2
:
u. Then 2x dx
:
d.w and
l-2x? dx:- [ r-2w ,.
rnr':-2 [ x(l-xz)
J
J G-w)w
:
- Jt (\u!- .l-)
l-uu/ dw -lnr,u -ln(l - u,)
:,"(a+;5)
Tbking the exponential ofboth sidesll gives
.
I
u':
x\t-x\:
t/t
Z (.P
1 \
t rl I
l \
+;7):2r2
-a (r -' * 1+.)
Integrating again, we get
,:-L+1mfti)
2x' 4"'\t-"/
1
lwe have omitted
y(x).o yz@).
the integration constant. You should
verify that including it adds a multiple of our first solution
182
cHAprER 3
DTFFERENTTAL EeuATtoNS
Finally,
yz:xu:-1*+,nf'*'\
2 4 \l-rl:1Qt@)
I
where in the last step we inserted the usual definition of the Legendrel2 function Q I .
With minor modifications, this method works for inhomogeneous equations as well. If y1
and y2 arc solutions of the corresponding homogeneous equation lh(x) :01, the solution
to the inhomogeneous equation
y" + f (x)y' -t s@)y
:
h(x)
may be written as
y(x)
:
ur@)yr@) -l uz@)yz@)
The differential equation leads to one equation for the two functions
free to impose an additional constraint. The first derivative is
y'
(x)
:
u'1
@)
I t @) *
u r @) y',
ul
and u2, so we are
(r) + u'2@) y2@) + u z@) yL@)
One convenient constraint is
u\(x)m@)*u'2@)yz@):O
(3.24)
in which case the derivative simplifies:
y'
(x)
:
u(x)yl(x) +
uz@)yL@)
The second derivative is then
y" (x)
:
u'r1x;y'r1x;
+ a@)y{(x) + u'r1x;y'r1x)
*
uz@)yi@)
and the differential equation becomes
h
(x)
:
u'r1x\ y'r1x1
+ u (x) y( (x) +
+ f lu@)vl(x)
:
:
[u(x)
*
't
l'2(x) +
uz @)
uz@)vL@)l-t slut@)vr(x)
u2@)ltyi +
u\(x)yl(x) +
u'2@)
yi @)
*
uz@)vz@)l
fyl + 8y1] + u\(x)t\@) + ui@)vL(x)
uL@)yL@)
Using equation(3.24) to eliminate u'2,wehave
h(x)
: r'r@ ly'r(*) - f,or,.,f : -u r@ff
l2The Legendre functions are discussed in some detail in Chapter 8.
Q.2s)
3
3
SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 183
where W(x) is the Wronskian (3.20) of the two solutions )1 and y2. Thus,
f
I
ut(x\:-
J
h(x\v'>(x\
'"
w(x)
d.r
(3.26)
and, similarly,
u.t(xt: I
I
h(x)YtG)
w(x)
o*
(3.21)
Note the similarity between equations (3.23) and (3.27).
Methods of Solution
We have discussed several methods for finding a second solution to a differential equation,
but how do we find the first one? Methods for solving an ordinary differential equation
include the following:
1. Guess
1
Form
the form of the solution.
a
power series-type solution.
3. Find an asymptotic solution.
4. Recognize the equation, or use a change of variable to transform the equation to a rec-
ognizable form.
5. Integrate numerically.
Guessing is not abad option. A given differential equation with a specified set ofboundary
conditions has a unique solution (we will not prove this theorem here), so if we can guess the
solution, we have the unique solution. Physical intuition and experience are useful guides
to the guess.
Later in the text we will identify some equations whose solutions are "named" functions
with properties that are well known. Legendre's equation (Example 3.7; Chapter 8, Section 8.3.1) and Bessel's equation (Example 3.9; Chapter 8, Section 8.4.1) are examples.
Once we recognize the equation, we may simply write down the solution. Some references
(for example, Polyanin andZaitsev, Murphy) list differential equations and their solutions.
These references allow us to look up the solution in the same way that integral tables allow
us to find the value of an integral.
Numerical integration gives a numerical solution for a specified set of boundary conditions, but the solution must be repeated for each new set of conditions. Thus, an analytic
solution is preferable if one can be found. Numerical methods can be very useful for nonlinear equations, however.
Additional methods for solving linear equations, such as expansion in eigenfunctions and
transform methods, will be developed in the chapters that follow. Here we shall begin by
studying power series solutions.
184
CHAPTER
3
DIFFERENTIAL EQUATIONS
3.3.3. Power Series Solutions
SolutionAbout a Regular Point
The solution to a linear, homogeneous differential equation can be expressed as a Taylor
0,
series about a regular point. Frequently (but not always) we want a solution about -r
so that the solution is a power series in .r. The method is to assume a solution of the form
:
I -lanx"
(3.28)
n:0
and substitute into the differential equation. Since a Taylor series is uniformly convergent
within its radius of convergence,13 we may differentiate term by term. Thus,
,oo
dY
' - \-) narxn-l
dx
(3.29)
?:o
Notice that the n : 0 term disappears because of the factor n multiplying ar. Then the
second derivative is
,')
o.-l
dxz
oo
:5-n(r -
(3.30)
l)anxn-2
n:0
where now both the n : 0 and n : 1 terms are zero.
For example, consider the Helmholtz equation (3.16)
d2v
7?+k"Y:o
(We can guess the solution to this equationi ) : sin kx or ) : cos k-r. Thus, we can check
our result.) This equation has no singular points, so .r : 0 is a regular point. Substitute in
the series (3.28) and (3.30):
anx' : o
irr, - r)anx'-z + k2f
n:o
n:o
The algebraic equation that results can be satisfied only if the coefficient of each power
of x is separately equal to zero. We always start with the lowest power that appears in the
equation.
"0:
We obtain this power by taking
Thus, we get
2. I.
l3See Chapter 2. Section 2.3.3.
a2
n
:
I
k2ag:
2 in the first term and
0
+ o,'2: -k'2
n
:
O
in the second term.
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 185
xl:
:3
We take n
in the first term and n
3. 2.
a3
: I in the second. Then
I k2at:
0
+
a3
: -ftzlt 3.2
until we come to the lowest power of x for which every sum
in the algebraic equation contributes to that power. Then we can draw a general conclusion
valid for all higher powers of x. Notice that both sums here contribute to the coefficient of
x I , so we are ready for a general statement. Let's look at the coeffic ient of x^-2 .
We have to consider special cases
x^-2i
We need n
:
m in the first term and n
m(m
:
m
- l)a^ + k2o*-r:
0
-
2 in the second.
* a^ : -k2-!!-2
rn\m - I)
a--2 interms of a*-4,
We can now use the same relation again to express
"'
-m-__--
-Pz
m(m
-
3)
etc.
(-k')'o^-o
-k2o^-4
r) (m - 2)(m -
(3.31)
m(m
-
l)(m
-
2)(m
-
3)
Continuing in this way, if m is even, we get
a^: (-I)m/'#oo
(3.32)
a^: (-I)@-t>PYlat
(3.33)
whlle if m is odd, we get
Equation (3.31) is the recursion relation forthe series. It relates each coefficient in the series
to the preceding one. In this case, the coefficents skip one: a* isrelatedto am-z rather than
a^-l.Eventually we can relate each coefficient to ag or al. But we have no relations that
determine the first two coefficients as and a1. These are the two arbitrary constants in the
general solution. The two solutions are
(kxta
yt:ao(t/ -; tkx\2+i
+ \ :aocoskx
)
and
y2:at
(
["-
*,3
k4x5
* _+...
,
5!
\
o,
l:_sinkx
)k
in agreement with our guess. Since (kx)z is positive, the series is alternating, and each term
decreases toward zero for m > lkxl * 1. Thus, these series converge everywhere.
186
cHAprER 3
Example
3.5.
DTFFERENTTAL EeuATroNS
Use a power series method to solve Hermite's equation:
dzv
dv
-2x
A?
* +2ay -
0
This equation arises in the quantum mechanical treatment of a harmonic oscillator.
The point x : 0 is a regular point of this equation, so we expect to be able to find
a Taylor series solution about x : 0. Substitute in series (3.28) and its derivatives
(3.29) and (3.30):
+ 2u\anx' :
irr, - r)anx'-2 - z*Dn&nxn-r
n:0
n:o
o
n--0
oo
D"@ -
l)anx'-2
@
- 2Dna,x' +2"Da,xn
n:O
n:0
:o
n:O
Now consider the coefficient of each power of x, starting with the lowest power.
r0:
The second term does not contribute .Take n
:
2 in the first term and n
:0
in the
third:
2' |'
a2
I
2aao
:
0
t
a2
: -aao
xl:
All terms contribute:
3' 2'
a3
-
2or +2aa1
- 0 + a3 : -To,
For all larger powers, all the terms in the equation contribute. So let's look at x^-2:
l
I
l
m(m
- l)a* -
2(m
-
2)a--z
*
2aa*-z
-
0
and thus
4^:
(m
^
ZA^-2-
Since the recursion relation relates
a-
-2) -a
to a*-2, one solution contains only
even
powers of x while the other contains odd powers. The two series give two independent
solutions of Hermite's equation. As in our previous example, the differential operator
in Hermite's equation, D(x) : d2 /dx2 - 2xd /dx I 2q, : D(-x), is even, and thus
the solution must be purely even or purely odd. The general solution is found by
choosing values of the two constants as arrd at; that is, it is a linear combination of
the two series.
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 187
For the even solution, we find
am
(m-2\-a
:2--- .--: ^ x 2 (m-4)-a Qm-4
(m-2)(m-3)
mlm - I)
q," (a - m -l 2)
: (-2ym/2d(a - 2)(a ao
ml
Notice that if a
:
2p for
some integer
a2p+2:
p, then we have
zoro,r!ffin:
o
Then a2ra4, a2pt6, and so on are all zero also. Thus, the series terminates with the
coefficient a2o; it is a polynomial of order 2p. These solutions are called Hermite
polynomials, Hzp@). Since the solution is a polynomial and not an infinite series,
the solution exists everywhere.
In this case (a :2p), the second solution to the differential equation does not
terminate. With
z
odd,
Q2n-ll
:2azn-t
(2n-t)-2p
(2n
*
l)(2n)
: (-2)n I2(p - n) - tll2(p
-
n)
and the numerator never vanishes. Indeed,
a2ny1
(2n
- 3l'.' (2p -
-l t)l
t)
al
Solution About a Singular Point
In the region surrounding a singular point, a Taylor series may not exist, so we must modify
our approach. If the singularity of the solution y (x) is isolated, we may express it in terms
of a Laurent series
't*o'"
- xo)n
valid for 0 . lx - xo I = p. However, the singularity may not be isolated; for example,
it may be a branch point. Thus, we may have to allow for noninteger powers of -r. [It is
also possible that, even though x6 is a singular point of the differential equation, one of the
solutions y(x) may be analytic at x6.l
We can allow for all these possibilities by choosing a series of the form
y(x)
:
(x
-
xdP
Do,@ -
xo)n
(3.34)
n:o
where p may be any number, positive or negative, integer or noninteger, real or complex.
This is called the Frobenius method.
188
cHAprER 3
DTFFEHENTTAL
EeuATroNs
If the solution is a Taylor or Laurent
series,
A < lx - r0l < p2, and we may differentiate
,'
it is uniformly convergent within a regionr4
term by term. If p is not an integer, then
: lfta-'o)o] (8" -.,') + (r - (n"",r" - "or'-')
^,'
The derivativeof (x - xs)p exists everywhere except on the branch cut. We may differentiate
everywhere in a region that is keyhole-shaped-that is, an annulus with a channel cut out
at the branch cut. The derivatives are
,oo
d\l
: )-(, + p)an@ i{
dx
(3.3s)
xo)n+P-l
n:o
and
d2,t
3
u.'- :f(n
dx' n:0
Example
3.6.
+ p)(n + p
-
l)an@
-
(3.36)
xs)n+n-z
Solve the hypergeometric equation
r*,-*tfi*(r.-)*+]r:o
Obtain a solution as a series in powers of x.
The equation has singularpoints wherex2 - x : O-that is, at r : 0 and atx : 1.
We are looking for a solution about x : 0, so we choose a series of the form (3.34).
Substituting into the equation, we find
oo@
9 : f{, * p)(n * p - t)anxn+e -D(u. t
n=O
n:O
p)(n + p
-
dr@
l)an76n*n-t
oo
+zfrn * p)a,xn+p - )D, * p)a,x'+P-t a ! lon*n*o
n:o
z=0
n:0
The lowest power that appears in this equation is
and fourth terms), and its coefficient is
1) + Iol"o
-loroZJ
L
Since we want a6
I
a
Taylor series, p1
-
:,
0 for a nontrivial solution, we must have
/
o(o
l4For
xP-l (with
r\
- ,):o
0 and the condition becomes lx
-
xol
.
pz.
n
:
0 in the second
EQUATIONS 189
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
This equation is called the indicial equation. It has the solutions p : 0 and p : I /2.
Looking atthe coefficientof rm +pgives us therecursionrelation forthecoefficients:
- l)a* - (m -f p -f l)(m *
l1
I .am-O
-;(mJ-P]-I)a^a1
z+
(m
*
p)(m
*
p
an.:.
:
p)a^+r
*
2(m
I p)a*
l- p)(m * p * 1) -f I /4
(m-rp-tl)(m+p+ll2)
(m
In this case, the two independent solutions arise from the two different possible values
of p. The recursion relation relates each a^a1to the preceding a^.
Forp-0,
am*t
+ rl4
+ wam
* tl2)2 :
: (m 111^
1 ry21a^
@+
r^ *
I /2m+l\ /2n-l\
:o\^at)\
:
m(m
* r)
r)(m
2m
2(m
*|
+ta^
* )^-'
-t l)ll
2m+t(m + l)! "
(2m
--erl
and the first solution is
S
yt:aoLn:0
- r)!! .
2^;r,:r"
(zn
This series is well behaved at the origin and converges for
The second solution has p - l/2.Then
am*t
lxl <
1.
(m+I/2)(m+312)+114^ _ m2+2m+t
am :
^
1* a3J2y^ trga^
@+3/r)@ + D
2(m * l)
2m+tQn I l)l
- 2m + 3 *^ - (2m -13)ll uu
:
:_ *m+r
+3424^
and
v,
-
Jiaoi-z:!--"
,_n r2n * l)!!
Once again the series converges for lxl < 1, but this function has a branch point at
the origin. The constant as in each solution may be chosen freely; it need not have
the same value in the two solutions.
The Frobenius method may fail to provide two independent solutions if the functions
and/or
l-
/(x)
g(x) in the standard form (3.18) are too badly behaved. In particular, the method
190
cHAprER 3
DTFFERENTTAL EeuATroNS
may fail to provide two solutions about x : r0 unless (x - xd f @) and (x - xs)z g(x) are
analyticls at x : xs, in which case the singular points are called regular singular points.
This result is known as Fuch's theorem. However, one solution may sometimes be obtained
about an irregular singular point, or two solutions may be obtained by choosing a different
central point that is a regular singular point.
The Frobenius method gives two independent solutions of equation (3. 18) about a regular
singularpointprovided thatthe indicial equation has two roots that do not differby an integer.
If the equation has a repeated root or two roots that differ by an integer,l6 the method may
provide only one solution to the differential equation. This problem arises because the
Frobenius method does not provide functions with logarithmic-type singularities. In such
cases, the second solution is of the forml7
oo
y2:
yt lnx +
!
(3.37)
anxn*P
n:0
where y1 (x) is the first solution and the coefficients on are to be determined.
3.7. Legendre's equation arises in the solution of Laplace's equation in
spherical coordinates (Chapter 8). It has the form
Example
(t
dv
^ d2v r*E+/(/
- x")7-
+ l)y
:0
This equation has singular points at x : tl. Let's look at the special case / : 0.
Then one solution is a constant and is valid everywhere. The second solution may
be found as a series in powers of x, but the series converges only for lxl < 1. Find
a solution valid for
x > l.
We can look for a solution about the singular point-that is, a solution in powers
of (x - 1). First we change variables to r, : r - l. Then we rewrite the equation in
terms of u.r:
dv
^ dzv 2x -!
: g
- x')---:dx' - dx
d2v
dv
(u +2)wA# *2(w * 1)A;:o
(1
The equation has singularities at u.; : -2 ard aI u) : 0, as expected. We assume
a solution of the Frobenius type in powers of u.r, and substituting into the differential
15See,
for example, Ince, Section 15.3, where this result is also extended to equations of higher order.
l6See Problem
1
l. Also
see
Problem 28(a) for
a case
the roots of the indicial equation differ by an integer.
lTSee Problem 10.
in which two independent solutions are obtained even though
l
l
I
I
EOUATIONS 191
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
equation, we get
0: t(n
-f p)(n
n:0
t p-
r)a,wn+p
*2f
fn + p)(n + p
-
r)anpn*t-r
n--0
+zl@ i
p)a,wn+P
+2\(n *
n:O
p)an6n+P-l
n:O
We can simplify by adding like powers:
0
: t(n
-t p)(n -t p
*
l)ansntn +
zl@ I
n:0
p)2anry"-rt-l
n:0
The lowest power that appears is
wP-r.Its coefficient is
2P2ag
:
and so the only possible nontrivial solution is
The coefficienr of w*tP : u- is
Q
p
-
0, a repeated root.
a*m(mf l) + 2a.ay(m*
1)2
:
g
of
I
l
N
t
r
,
i
i
am*l
:
an
-m
4^ t, l)
However, notice that for m :0, we get at : 0. There is no series! This is just the
constant solution mentioned above.
The second solution is found by introducing the logarithm:
I
@
I
y2:
where y1 is the first solution,
yt ln r.u *
wPlanw"
i:o
lr : I in this case. The derivatives
are
m
dtnl"f':
aww- +ftn
n:0
*p)anwn-+P-r
and
,1
d'v';
3ctw' :-
-m
I
"
+ftn
n:lJ
+ p)(nI p - l)anl2'+n-z
192
cHAprER 3
DTFFERENTTAL EouATtoNS
The differential equation becomes
o:
(u
it, * p)(n t p - t)anw'*o-')
-++
t''
/
7:o
\
(
+2),
-t 2(w
*
r, ( ! +
\r,ft/ir,
* p)a,w'+e-r\
Rearranging, this becomes
-yi2*rg=)
\rru/7:o
(
-12(w+ rl
i<,
n:o
0
+ (r, +
zlia * p)(n * p - t)ansn+p-l
* p)a,w'+P-l -
: I * D@ + p)(n -f p I
0
r)anwn+n
+
n:O
zl{" * p)2a,s'+n-t
n:O
The first term is a constant (u.,0 term). Thus, at least one term in the two series must
also contribute an equal and opposite constant. Therefore, p carrnot be a fraction,
since in that case n + p + 0 for any n.lf p were a negative number, P : -s, then
fot n :0 in the last term we would have
O
:2s2aow-t-1
requiring as = 0. Similarly, we would find ar: 0 for eachn < s. The first nonzero
term in the series would be erut-t : asrDo. Thus, we may as well take p : 0 and
start the series with a6. Then the x0 term is
I l2at:0 4 aI: -!2
The xm term has coefficient
m(m
or
I l)a* +2(m -t l)2a-a1- 0,
m
>I
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 193
The constant term as is not determined, since it does not appear in the differential
equation. It may be combined with the first solution, yl : constant. Thus, the second
solution is
@-oo
rn n,
*
i,-t,'*' (+)^.' ;n
m:l
:rn, - tt-tl' (:)^.'
#
The series may be identified as the series for
rn
(r +
\
:tn (t * *)
1)
2/
2 /
\
t)
:* f"l
\ 2 )
Thus, the second solution is
yz@)
:ln
- 1) - ln (r *
:n(r-1\ *,r,,
\x+l/
(x
1)
f
ln2
We may combine the constant ln 2 with the first solution if we wish, as we already
argued for the constant a6. The usual definition of the Legendre function Qo@) for
x>1is
I /x+1\
QoG):2"("_,i
which differs from our )2 only by an inconsequential overall factor of
-l
/2.
Indicial Equation with Complex Roots
The indicial equation may have complex roots.
Example
3.8.
Apply the Frobenius method to the Euler differential equation:
*2y"+xy'+y:g
:
Inserting the series (3.34) with r0
0, we find
@@oo
\f"
n:O
+ p)(n
* p-
l)a,xn+e
+l{" I
p)anxn+P
+lanx"+p :
n:0
o
n:O
The indicial equation is
P(P-1)*Pf1:o
:
which has the solutions p
general recursion relation is
[(m
*'i. This problem
-l P)(m i
P
-
has a second curiosity as well; the
:
l(m*i)2*lfa^-0
l) +
(m
-l P) + lfam
1m2
O
+2im1a*:g
194
cHAprEB 3 DTFFERENIAL EouATtoNs
and the only solution for m
of the Euler equation are
>
O
is
a* :
0' There is no series. Thus, the two solutions
!l: x'
and
!2: x-'
We may rewrite the solutions as follows:
yl :
exp
(lnx'; :
exp (i
lnx);
yz
:
exp
(-i lnx)
Thus, by taking appropriate linear combinations, we find that for real positive
have the real solutions cos (lnx) and sin (lnx).
x
we
Asymptotic Methods
Sometimes the equation simplifies for large values of x, allowing us to find the limiting
form ofthe solution.
Example
3.9.
The modified Bessel equation has the form
dzt
ldv
71+; d,-
/
(l
+
m2\
=,J
):o
(3'38)
Find the form of the solutions (the modified Bessel functions) for large argument.
For large x, the terms with powers of llx become very small, and the equation
simplifies to
d'Y*
t a
" -^
)'oo:u
with the solutions
!*:
ett
The two solutions have exponential behavior at infinity.
We can then find the complete solution by finding the function u(x), where
y
(x)
: u(r)y-(x) :
u
(x)eL'
Then
d! _
,rr+* +ue*,
dx
and
ue*'
#:It"eL* +2u'et' +
3.3 SOLUTION OF LINEAR, ORDINARY DIFFERENTIAL
EQUATIONS 195
Substituting this into the full differential equation (3.38), we have
It,,et, +2u,e*, +ue*,
*!
:O
(r *
5) r"*,
,,, + ( !+z),, *! (+t- 4),:o
x
1r,r*, +ve+*)_
\x
/
\
(3.3e)
"/
Once again we can simplify. Look at the order of the terms in the equation when x is
large:
Term
Order of
n"
u lx2
terml8
t)'
/x
lx
/x2
u'
u
u
u
u
/x
lx
u
u
lx2
lx2
Thus, we need to keep only the two terms of order uf x. Then, dividingby 2u,we
have
,)'
I
u2x
I
lnu:--ln-x
2
1
Jx
Thus, the general solution for large x is ofthe form
']::
e-)c
_ Of V:
'/x
e'
_
(3.40)
Jx
These are the asymptotic forms of the modified Bessel functions.
Solution About the "Point at
But what
values of
Infinity"
if we want more terms in our solution? Since we want a solution valid for large
x, we change variables to tt : l/x, so as to obtain a series in powers of l/x.
A solution valid in a neighborhood of a
:
0 is a solution valid about the "point at infinity"
in x.
Example
ers
3.10.
Obtain
a series
expansion for the modified Bessel functions in pow-
of If x.
We start with equation (3.39) in Example 3.9 and change variables to
The derivatives are
du
dx
du du
du dx
#(:,)
,du
du
u
: Ilx.
(3.4t)
l8We approximate hereby noting thatdifferentiating reduces each powerin a series expansion by 1-the same result
dividing by.x. We can (and should) check the approximation once we obtain the solution for u.
we obtain by
196
cHAprER o
DTFFERENTTAL EeuATtoNS
and
# : *(#) # : -,' (-,# - ;tt*) :t#
* 2,.ilr
Q
42)
Substituting in, we get
^d2u "du
"d
"o# -t 2f? - tu *ztu2fi*
u(*l -
.d2u
du
,' -fr + (u'^+2u)i+
(+1
-
m2u)u
:o
.
m'u)u
:o
This equation has a singular point at Lt : 0, so we look for a series solution of the
form (3.34) and substjtute in:
0
: D
an(n
*
p)(n + p
n:O
-
l)un+o+l
+l
a,1n
*
p)un+n+l
n--0
oo@@
+2Da,(n *
p)u"+p
!.1
anu"+P
n:O
n:0
- m2lonun*o*'
n:O
The lowest power that appears is up, and its coefficient is
ao(+2P
+ 1):0
The indicial equation gives p : I l2fotboth the positive and the negative exponential
terms. This is the leading term (3.40) that we found in Example 3.9.
Now look at the power uk-tp-tr .Its coefficient is
atl&* p)(k+ p -l) + (k+ p) -mzlLa1,a[l -2(k+ 1+p;1 :9
which gives the recursion relation:
ak+t
t : +*t
: +*gJl48(ft + l)
2(k + t)
-S*!
Thus, the two solutions are
tl\^t:
e'' /, (4m2 -1)
utJ--F I r -F ------;-'
yz@):
oo
8x
vr \'
-1-
(4m2 -9)(4m2
- l)
2(8x)2
-'-'
\
/t
and
e-r*
6
/'
[t
-
(4m2
- g4m2 - rl + \
- l\ + ---2'fi-,"
)
Am2
-
3.4 NUMERICAL
METHODS 197
The leading term is independent of the value of m, as we found above. One solution
is exponentially decaying, and one is exponentially growing. These are the functions
K^(x) and I*(x) (see also Chapter 8, Section 8.4.7).
3.4. NUMERICAL METHODS
The computer is an increasingly useful tool for the solution of physical problems. Many
useful pieces of software come with the computer's operating system or may be purchased
at a reasonable cost. Spreadsheet programs may be used to solve differential equations
numerically. Mathematical packages such as Mathematica and Maple have more sophisticated routines for solving differential equations. For more complex or unique problems,
you may find it necessary to write your own program.
3.4.1. Dimensionless Variables
To prepare a problem for numerical solution or even to plot up the results of an analytic
solution, the first step is to write the equation in dimensionless variables. For example, when
solving equation (3.11), we would use the variable u : xlL. The solution obtained can
then be applied to a beam of any length. Then
dud ld
dxdu Ldu
d
dx
Rewriting the equation, we have
I day q(u)
- EI
Scaling the deflection y similarly, we define p : y f L.T}.ren
L4 du4
I dau q(u)
dU4 EI
dau L3q(u)
du!: EI
L3
The source term q is load, or force per unit length, and
footnote 1), so the right-hand side has dimensions
,zM I
L
Tz
---
and since
lEf
:
forcelarea:
M
lLTz,
t
has dimensions of (length)a (see
I
L4lEl
the right-hand side is also dimensionless.
198
cHAprERoDTFFERENTTALEeuAloNs
Example
3.11.
Convert equation (3.5) (motion with air resistance) into dimension-
less form.
We would like to divide the space variable y by a characteristic length and the time
variable t by a characteristic time. Notice that each term in the equation is a force, so
the dimensions of the constant d, may be found from
I dvl
ML
MLT M
+lal:
,zT:T
P;l: 12
This suggests that we choose our characteristic time tobe m f a and the dimensionless
time variable to be r : tq/m. Then the equation becomes
ot dy
o2 d2y
*--lng-C(" mdt
mzdtt
q2 d2y
. o? dy
-'|
aia.=- - aiE
lll
1
Thus, we choose our characteristic length tobe m2 g f a2 and take the dimensionless
space variable to be u - ya2 1m2g. The equation is then
t!:-r-du
dtt
dr
Let's check the dimensions of the characteristic length:
lmzsl
| "t M2L T2 -r
aAl -frw-"
as required.
In what follows, we shall assume that the equations have been put into dimensionless form.
3.4.2. Difference Equations
The basic idea behind all numerical methods of solving differential equations is to replace
the differentials with finite differences. Recall the definition of derivative:
d.f .. f(x+h)-f(x)
E: i:\
h
With finite differences, we do not take the limit but allow ft to be small but finite. Thus,
a differential equation is replaced by a difference equation.
3.4 NUMERICAL METHODS
If a function is differentiable
at the point
x
:
199
r0, then the derivative may be calculated
equally well by taking the limits:
dfl :
dx l*n
-l
llm
h-o
f (xd - f @o -
h)
:
lim
f(xo+hlz)- f(xo-hl2)
h+O
Similarly, we may form finite differences in three different ways:
.
Forward dffirence:
Lf(x):f(x+h)-f(x)
.
Central dffirence:
/ h\
/
h\
d/(x):/(x+t)-f\,-Z)
,
Bachuard dffirence:
v
f(x)
:
f(x)
- f(x - h)
Higher-order differences may be formed similarly. Thus, the second forward difference is
^"
-irj:,; :';i.:'
The
differenceratio
Lf I h is an approximation to the derivative.
=?,i\:i
-
r (x,t
Inaccuracies occurbecause
ofrounding error in the numerical calculation, and because ofmistakes
in one or more of the values. These inaccuracies increase in opposite directions; making
h smaller improves the approximation to the derivative but increases the rounding error.
Thus, obtaining an accurate numerical approximation to a derivative is very difficult, and
we should attempt it only if it is absolutely unavoidable.
ft is not zero, because
3.4.3. Numerical Solution of a First-Order Differential Equation
Suppose we want to find a solution to a first-order equation
y'
: f(x,y)
that we have been unable to solve analytically. The equation need not be (and usually is not)
linear. The output of any numerical method is a set of values for the function y at a set of
y(rg) (a boundary condition) from
values x; for x. We need at least one given value )0
:
which to start.
200
cHAprER 3
DTFFERENTTAL EouATroNS
If the function f(x)
can be differentiated analytically, we can use the Thylor series to
step away from our given value:
yr
:
y(xo + h)
:
yo
: yo I
+ hf (xo, yd -l
hy' (xO
+
f;t"
@il +
"'
h2 d.f
t d- ro*"
Then
d.f
y2:yr*hf(x1,y)a h2
2dx,,
(3.43)
where x1 : .{o * ft, and so on.
As a first approximation, we drop the terms in powers of h greater than 1, effectively
expressing the derivative d y ldx using a forward difference. This is called the Euler method.
It is said to be first-order accurate, because we kept only the first power in ft.
A better approximation is obtained by taking more terms in the Taylor series expansion
(3.43). Thenumber of terms needed at each step is determined by the rate at which the series
converges, which usually depends on the value of h as well as the particular form of the
solution y. If a large number of terms is needed, this method can be inconvenient because
of the amount of analytic work that is required before beginning to compute.
Example
3.12.
Solve the differential equatron
Y'
:3xY - I/3
on the range 0 < x < 1, subject to the boundary condition y(0)
The function f :3xy - 1/3 has derivatives
:
1.
df
^.^dv
:'3y
-.*'3x dx
- :3[y * x(3xy - l/3)]:3y(1* 3x2S - x
dx
d2f
. dv
l8xy *3(l -l3x\!! - -l + lSxy * 3(1+3x2)(3xy +
-l
d*r!:
: -2 + 3x(9y *
9x2y
-
l13)
x)
At this point the labor is getting noticeable, so let's see if we can get away with
only these two derivatives. Let's take h :0.1, one-tenth of our interval. We can use a
spreadsheet to calculate the various terms and add them together. The spreadsheet is
shown in Table 3.1 ; the numerical values appear in Table 3.2.We can check the result
by choosing a value of ft equal to one-half of the previous value and repeating the
calculation. The maximum difference between the two results is 0.3Vo (Figure 3.2).
3.4 NUMERICAL
METHODS 2O1
TABLE 3.1. Spreadsheet for Example 3.12
D
I
Example 3.12
2
h
0t
3
x
0
I
4
5
0.1
6
0.2
7
0.3
8
0.4
--F7
9
0.5
=F8
l0
0.6
:F9
l1
0.7
=
12
0.8
l3
0.9
14
I
f= 3xy- U3
f'=
- ll3
- 1/3
- tl3
:3*A7*87 - ll3
:3+A8+B8 - l/3
: 3* A9*89 - | /3
: 3*Al0+Bl0 - 1/3
:3'All*B11- 1/3
: 3r Ar2*Bt2 - | 13
= 3*A13*B13 - 1/3
: 3* Ar4*Bt4 - | 13
- A4
- A5
- A6
3+B7* (l + 3* A7^2) - A7
3+88*(l + 3*A8^2) - A8
:3+B9+(l +3+A9 2) - A9
: 3+B10*(l + 3*A10^2) - A10
= 3*B1l+(1 + 3*A11^2) - A11
: 3*B12*(l +3+A12 2) - Al2
: 3+B13+(l + 3*A13^2) - A13
: 3*814*(l +3*Al4^2) - AI4
:
:
:
:
:
:3*
A4*84
= 3*A5+B5
:34 A6*86
1
:F4
:F5
:F6
F10
: Fl1
:FI2
: F13
-x
3y(1 + 3x^2)
3*B4* (1 + 3+ A4 2)
3+85*(1 + 3*A5^2)
3*86*(1 + 3*A6^2)
I
2
3f'
4
5
6
7
8
9
l0
l1
12
13
t4
y(n +
:
=
_2 1
(g*84 +
:
:
:
:
A4 2+84 - A4)
+9rA5^2*B5
- A5)
-2*3*A5+(9*B5
3+ A41
9*
- -2a3*A6r(9*B6+9'A6^2+86 : -2 + 3*A7i(9+B7 +9+A7^2*87 : -213+A8t(9*B8 +9*A8^2*B8 : -213*A9+(9*B9 +9*A9^2+B9 -
A6)
A7)
A8)
A9)
=
:
= -2 *3+Al0+(9+Bl0+9*Al0^2*810 -A10)
:-2*3*Al1*(9*Bl1+9*All^2+B11-A11)
= -2 * 3* Al2a (9*Bl2 + 9* Al2 2*Bl2 - Al2)
: -2*3*Al3*(9*Bl3+9+A13^2*B13-A13)
= -2*3+Al4+(9*Bl4+9*Al4^2*Bl4 - Al4)
l)
84 + $B$2'C4 + $B$2 2/2*D4 + $B$2^3/6*E4
85 + $B$2*C5 + $B$2^2/2*D5 +
86 * $B$2*C6 + $B$2^2/2+D6 +
B7 + $B$2*C7 + $B$2 2/2*D7 +
B8
$B$2+C8 + $B$2^2/2*D8 +
89 + $B$2*C9 + $B$2 2/2+D9 +
I
= Bl0 *
:
$B$2*C10
+
Bl 1 + $B$2+C11 +
= B12 * $B$2+Ct2 +
: Bl3 + $B$2*C13 +
: B14 + $B$2*Cl4 +
$B$2^3/6*E5
$B$2^3/6*E6
$B$2^3/6*E7
$8$2^3/6*88
$B$2"3/6+E9
$B$2^2/2*Dl0 + $B$2^3/6*E10
$B$2^2/2"D1 1 + $B$2^3/6+El I
$B$2 2/2*Dt2 +$B$2 3/6*Et2
$B$2 2/2"D13 + $B$2^3/6*El3
$B$2^2/2+Dl4 + $B$2^3/6*E14
TABLE 3.2. Numerical Values for Example 3.12
I
I
t
i
L
1
Example 3.12
0.1
h
c
D
f=3ry-1/3
f'= ly(l+ 3x2)
3
2.93232
0.646096
0.992209283
-x
f'
y(n + 1)
3
x
v
4
0
1
5
0.1
0.98
-0.3333
-0.0389
6
o.2
0.99
0.26199
3.t3382319
3.452247331
r.o34652997
7
0.3
1.03
0.59785
3.64202792
6.86495 1309
1.t1379273r
8
o.4
1.1
I
4.5452397f
tr.47359534
r.23875299
9
0.5
1.24
t.oo322
r.5248
6.OO34532
18.15395671
1.42427s53r
10
0.6
1.42
2.23036
8.28747931
28.2996385
r.693465796
1l
t2
o.7
1.69
3.22294
I 1.8485815
44.2t969028
2.082373136
0.8
2.O8
4.66436
17.M15887
69.84598597
2.647658296
13
0.9
2.65
6.81534
26.3444039
112.0219549
14
I
3.48
10.1054
40.7550206
182.89'75926
3.479585048
4.724385265
0.98 1333333
202
CHAPTER
3
DIFFERENTIAL EOUATIONS
+
+
h:0.05
h:0.1
\J)
/
1
0L
l0
o2
0
FIGURE 3.2. Numerical solution of the equation y/ - 3xy - 1/3 with boundary condition ) (0) : 1
(Example 3.12). The values computed with /r : 0.05 and h :0.1 do not differ
noticeably in this plot.
3.4.4. The Runge-Kutta Method
From Example 3.12, we observe that the derivative of
/
is
df af dy af af "af
dx- ox' d*ay- ax'r ay
Thus, the Taylor series is
y(xo
*
h)
:
y(xs) -l hf (xs, yO a
+(#.,X) +"'-!o*11
where all the terms on the right are evaluated ?t xo, yo. Thus, the increment 17 in the solution
y depends on values of / and all its derivatives at the starting point. We can improve the
accuracy of the truncated series if we can use values of f and its derivatives throughout the
range.r0 to xo * h. We can proceed by a process of successive approximations.
The first guess is to use
nr hf(xo, yo)
:
as the increment.
This takes us to the point P1 with coordinates (xo + h, )o
*
41). Now we
stepbacktothemidpointat(x6 +h/2,W*r11/2).Weevaluatetheslope yt
point and use this new value to compute an increment
nz
h
/
nr\ :
: hf(t*
Z' vo+ ; )
: hr (^. i. r, * T)
: hr(xo.
th,yo *
vo)
+
h
: f
atthis
where
/hAf
hf(xo,til -t h(Za*
This takes us to the point P2 with coordinates (x6
process to find a third increment:
qt
r12,
*
* ;nr0f\
u, )
ryz).
We can now repeat the
(r#. +#) .
3.4 NUMERICAL METHODS
203
Now we can evaluate the fourth increment by using the values at our last endpoint P3 with
coordinates (xo + h, Jo I rt):
ry+
: hf (xo * h, yo t ry) : hf (xo, yO + h(r# . ,tK) *
The best result is obtained by taking an appropriate weighted average of all these estimates
of the true increment 4. That average is
4:
1
(3.44)
O(4r*2nz+2qz'fn+)
This increment makes the expression for y(x * lr) correct to fourth order in h, and thus the
method is called the fourth-order Runge-Kutta method. In practical use, we can calculate the
numerical values we need fromthe function / without evaluating any 4erivatives analytically.
Example 3.13. Use the Runge-Kutta method to solve the equation y' :3xy - 713
with boundary condition y(0) : 1.
Again we can use a spreadsheet to calculate each of the 4s and sum them to
compute the increment in y. Our spreadsheet has columns containing the values
y0 + ry/2,q2,y2: yo*qz/2'xr: x0+h,
:
y4
y0
yo I ry, n, and finally
* 4, which becomes the y6 for the next value x1
of x. The beginning of the spreadsheet is shown in Table 3.3; the numerical solution
appears in Table 3.4. This time we find that the values computed with h :0.05 differ
from those with h :0.1 by at most 0.0037o (Figure 3.3).
x0,yo,ttr,xtl2: r0 * hl2,yt:
y3 :
h:
+
0.05
h:0.1
\))
/
I
0L
0
o.2
08
04
l0
x
FIGURE 3.3. Runge-Kutta solution of the equation
solutions for h
\.
:0.05
and h
:
yt : 3xy - ll3
(Example 3.13). Again the
O.l are not distinguishable.
204
cHAprER 3
DTFFEBENTTAL EouATroNS
TABLE 3.3. Spreadsheet for Example 3.L3
BC
Exsmple 3.13
hnx
0.100
l
2
= c3 +A$3
:c4+A$3
yl
I
v
xU2
I
=c3+A$3/2 =A$3*(3+C3*D3-l/3)
:c4+ A$3/2 = A$3-(3.C4.D4 - l/3)
:N3
:N4
eta
el^2
:
:
=D3+F3/2
=D4+F4/2
:c5+A$3/2 =A$3*(3*C5*D5-l/3) :Ds+Fs/2
A$3.(3-83*G3
A$3+(3*E4*G4
= A$3'(3+85*cs
- l/3)
- l/3)
- l/3)
TABLE 3.4. Numerical Solution for Example 3.13
Example 3 13
htr
0l
9
l0
1l
12
l3
0
I
202
3
4
5
6
1
8
909
l0
v
xl.l2
etal
0
I
-0
01
0 981,145
0.05
0.15
099
0.25
0.026213
0.05982
0.100393
1.005545
0 152618
3159E7
38
03
1.035032
04
1.114386
I 2396't8
I 425735
1 69581
2.086218
2.6541
3 490613
05
06
07
08
I
035
045
055
0.65
0.75
085
095
r.05
033333
-0.00389
o 223299
0.322',t8'l
0.46'1359
o.683274
I 01385
y1
0.983333
o.9795
1.(]f,4941
164583
53'1384
857203
2319897
2.995',737
3.99'7538
et^2
-0.018583
0.010744
0.042083
0.078486
0 123885
0 183805
0.266451
0.38453?
0.558241
o 820452
1.225891
The Runge-Kutta method, or indeed any numerical algorithm, works best if the step size
chosen is the largest possible one that will still achieve the desired level of accuracy. Press
et al. show how to write routines that adjust the step size automatically to obtain maximum
efficiency while retaining accuracy.
3.4.5. Higher-Order Equations
Higher-order differential equations may be reduced to systems of first-order equations
through the introduction of additional variables. For example, the second-order equation
'
y" +2y'2
-6:o
(3.4s)
may be reduced to two first-order equations. First introduce the new variable
U:
y'
(3.46)
Then the original differential equation becomes
u':6-2u2
(3.47)
Each of these equations may now be tackled using one of the standard methods.
3.14. Solve equation (3.45) subject to the initial conditions y(0) : 0.5
and y/(0) : 0. Use the Runge-Kutta method, and compute values for 0 < x < 1.
The differential equations are
Example
y': f(x,!,u):u
205
3.4 NUMERICAL METHODS
TABLE 3.3. (Continued)
!2
:D3+H3/2
=D4+H4/2
=D5+H5/2
eta3
y3
eta
eta4
:
:
:A$3*(3*(E3*r3-ll3) :D3+J3
:A$3*(3.(M*I4-l13) :D4+14
:A$3-(3.(Es.rs-l/3) =D5*J5
A$3.(3*C4-K3
A$3.(3-Cs-K4
= A$3*(3-C6-K5
- l/3)
- l/3)
- 1/3)
=
=
=
y4
(F3 + L3 + 2*(H3 +
(F4 +LA + 2*(H4+
(F5 + L5 + 2*(H5 +
I3))/6
=
rq)/6 :
J5))/6 :
D3 + M3
D4*M4
D5
*
M5
TABLE 3.4. (Continued)
I
t
I
I
y2
eta3
y3
eta4
eta
y4
i
0.990708
-0.018473
o 981527
0.986817
0.01 1073
0 992518
-0.003888
0 0262t8
0.981445
o.992438
1.01348
o 042618
o.o'79465
0.1254',n
1.035 l
t.074274
1.176329
1.331581
Q.186377
1.558963
o2'7M64
1.888079
2365338
0.391484
0.569828
3 064326
084
4.103558
I
l6
0 059827
I
114497
0 100406
018555
0.010994
o.042593
0 079355
I
23985',t
o.152645
o.\25292
1
426056
t.696399
2 087294
2 656046
3.4941
4;1499
o 22335',1
1.425',t35
0.683',199
0.186057
0.2'70015
0 390408
0.567882
I 014897
0.8365
I
259288
-0
o 322911
0 46',761',7
-0
033333
l2
0.9918 12
1.035032
1 I 14386
I
239678
69581
2086218
2 6541
3.490613
4 482425
and
: B(x,!,u):6 -2u2
We step forward from the boundary at x : 0, using an increment h in x. Then we
ut
compute the increments in y and u using
:
my :
rlz :
mz :
Qt
: hus
hg(xs, Y0, u0) : n$ hf (xo,
Ys, us)
hf (xo + h /2, yo -l
hg(xo
+ hl2, lo
*
2ul)
nr 12, vo
-f mt 12)
qr/2,
*
uo
mt12)
and so on. Then we use the increment 4 given by equation (3.44) with a similar
expression for m.
We set up a spreadsheet with columns for x,
x + h/2, and r * h; q; andm;,i :
1,2,3,4; q and m; and the new yalues ),(-r0 + h) : y(xo) * q and u(xo + h) :
u(xo) * m.The beginning of the spreadsheet is shown in Table 3.5; the numerical
solution appears in Table 3.6. The results for h :0.1 and ft : 0.05 are shown in
Figure 3.4. Splitting h inhalf gives results that differ by at most 0.004Vo.
The examples above illustrate the basic ideas involved in finding numerical solutions of
differential equations with given initial conditions. The same ideas are used in problems
with specified boundary values at two points, but the solution is more difficult and usually
involves repeated iteration to converge on a solution. Any one method may prove unsuitable
in a given case, and another method will have to be tried. Important issues of stability and
convergence have not been discussed here. Details can be found in the references in the
bibliography.
2OG
cHAprER 3
DTFFERENTTAL EeuATtoNS
TABLE 3.5. Spreadsheet for Example 3.14
I
C
D
: V4
:V5
ml
etal
: $B$3*C4 : $B$3+(6 -2*C4"2)
: $BS3*C5 : $B$3.(6 -2*C5'2)
: $B$3.C6 : $8$3*(6 - 2*C6'2)
G
Example 3.14
h
0.1
xyv
00.50
0.1 --T4
0.2 :T5
2
3
4
5
6
M
o
y3
eta3
: $B$3*I-,1 : B4 + M4
: $B$3*L5 : 85 + M5
: $B$3*L6 : 86 + M6
m3
yl
xl
: A4 + $B$3/2 :84
: As + $B$3/2 :Bs
: A6 + $B$3/2 :86
+D4/2
+Dsl2
+D6/2
a
1
2
3
4
5
6
:
:
:
$B$3+(6
-
elz4
m4
v3
: C4-l 04 : $B$3*P4 : $8$3+(6 - 2*P4 2)
: C5 + 05 : $B$3*P5 : $8$3*(6 -2+P5 2)
: C6 + 06 : $B$3*P6 : $B$3*(6 - 2*P6 2)
2*LA Z)
$8$3*(6 -2+L5^2)
$B$3* (6
-
2*L6 2)
TABLE 3.6. Numerical Solution to Example 3.14
c
I
2
Example 3.14
h
0.1
3
x
v
4
0
0.5
5
0.1
6
7
o.2
0.3
o.4
0.5
0.6
o.7
0.8
0.9
o.5294
0.6115
o.732
0.8766
8
9
t0
l1
t2
t3
t4
1
1.0348
0
o.5770
1.0386
1.3464
r.52'75
1.6264
t.2004
1.6783
1.3697
1.541
t.7049
t.1184
1.7t32
1.7252
1.88s9
t.7286
etal
ml
0.6
o.5334
0.3843
0.2375
0.1333
0.0709
0.0367
0.0187
0.0094
0.0047
x1
0.0s
0.15
0.25
0.35
0.45
0.55
0.65
0.75
0.85
0.95
yl
0.0000
0.0577
0.1039
0.1346
0.1528
0.1626
0.1678
0.1705
0.1718
0.1725
0.1729
0.00.24
1.05
1.9723
0.5000
0.5583
0.6634
o.7993
0.9530
r.tr62
t.2843
1.4550
t.6269
t.7994
N
o
eta3
y3
m3
v3
eta{
m4
0.o29t
0.s291
0 6100
0.7302
0.8752
0.5831
0.4701
0.3181
0.1900
0.1049
0.0553
0.0285
0.0145
0.0073
0.0037
0.0018
0.5831
0.0583
1.047 t
1.3568
0.to47
0.5320
0.3807
6
0.1 187
7
0.t432
8
1.0339
l1
0.1573
0.1650
0. l 690
0.1711
12
0.172t
t.7r3r
13
'14
o.1127
1.8859
o.r729
2.0588
l0
G
M
0.0806
9
D
t.t999
t.3694
1.5408
a
1.6818
1.7068
0.1357
0.1536
0.1632
0.1682
o.1707
t.7r94
0.r719
1.7257
0.t726
t;7289
o.1729
0.1730
t.5364
1.6324
1.7305
0.23t8
0.t279
0.0670
0.0343
0.o174
0.0088
0.0044
o.0022
0.0011
3.4 NUMERICAL METHODS
207
TABLE 3.5. (Continued)
:
:
:
:C4+84/2
:
:
y2
etaz
v1
c5 +Es/2
c6 +86/2
:
$B$3+H4
m2
$8$3*(6
$B$3*(6
$8$3*(6
:
:
:
84 +14/2
:Bs +r5/z
$B$3*H5
:86 +t6/2
$B$3+H6
v2
-
2*H4 2)
2)
2*H6 2)
-2*H5
-
:
:
c4 +K412
cs + Ks/2
:c6+K6/2
U
eta
(D4
(D5
= (D6
:
:
y(n+
m
1)
:
:
:
:84+54
:B5+55
:86+56
+ 2+t4 + 2*M4 + Q4) 16
+ 2*r5 + 2*M5 + Q5)/6
* 2x16 + 2*M6 + Q6)/6
(E4+2*K4+2*O4+R4)/6
(E5
(E6
+ 2+K5 + 2*O5 + R5)/6
+ 2xK6 + 2*06 + R6)/6
TABLE 3.6. (Continued)
K
v1
ela2
y2
m2
0.3
0.03
o.0843727
0.12307497
0.582
o.45762
o.29705
1.59418
o.t4650973
o.r594r777
0.5150
0.5716
0.6730
0.8052
0.9563
t.66t9t
0.16619128
l.rt79
t.69663
o.t6966329
0.t7142303
r.2852
0.17231005
r.627t
0.02429
0.01228
0.00618
0.t7275595
t.7996
0.0031
I
1.7267
0.r729798t
1.9724
0.00156
t.7294
0.84373
1.23075
1.4651
r.7t423
r.723r
t.72756
1.7298
1.4554
e!tr
y(n+
0.0294
0.5770
0.t205
0.t446
0.5294
0.6115
o.7320
o.8766
0.1582
m
v2
0.2910
0.8058
1.187
r
0.r707
r.4317
o.09172
r.5734
0.M76r
1.6502
1.6904
1.7110
t;12t5
0.46t6
v(n + 1)
o.5770
1.0386
o.3077
t.3464
l
1.5275
1.0348
0.0989
0.1655
0.1693
1.2004
0.05 19
r.6264
t.6783
r.369't
0.1712
1.5410
0.1722
r.7132
0.t'127
0.1730
1.8859
o.0266
0.0135
0.0068
0.0034
0.0017
0.0821
1)
2.0588
fr,.'11
0.181
1.7049
r.7r84
r.7252
1.7286
1.1303
v(n + 1)
C4 +U4
:
:c5+u5
: C6 +U6
208
cHAprER
s
DTFFERENTTAL
EeuAloNS
+
+
h:0.05
h:
O.I
\))
-{
F
4
F
I
-{
n-
0
0.2
0.6
0.4
0.8
1.0
x
FIGURE 3.4. Runge-Kutta solution of the second-order differential equation y" + 0.2y'2
with y(0) : 0.5 and )/(0) : 0 (Example 3.14).
-
6
:
0
3.5. PARTIAL DIFFERENTIAL EQUATIONS: SEPARATION
OF VARIABLES
Many of the differential equations that describe physical processes are partial dffirential
equations. The solution is a function of three space variables and one time variable, and
the equation relates the partial derivatives with respect to the four variables. The diffusion
equation (3r13) is an example. Maxwell's equations are also of this form. If we introduce the
potentials A and O, Maxwell's equations reduce to wave equations for these two potentials.
In SI units,
V"AIn the special case where
j
t a2L :
AM -tlo:
(3.48)
is independent of time, the equation simplifies to
V2A: _poj
A similar equation holds for O:
v2a: _L
t0
e.4g)
This equation is called Poisson's equation.
The mathematics of partial differential equations is extensive and complicated, and we
shall not delve into a general treatment here. Many of the partial differential equations in
physics may be solved by the method of separation of variables.lg In this method, we aim to
lgAdditional methods will be discussed in later chapters.
See,
for example, Chapter 7, Section 7.5.
3.5 PARTIAL DIFFERENTIAL EQUATIONS: SEPARATION OF VARIABLES
209
replace the partial differential equation with a set ofcoupled ordinary differential equations,
one in each of the variables. Then we can make use of any of the techniques we have already
discussed for the solution of ordinary differential equations. In particular, series solutions
are often used.
For example, let's look at equation (3.49) in the special case p
0. This simpler homo-
geneous equation
operator2o is
-
is called Laplace's equation. In Cartesian coordinates, the Laplacian
. a2o a2o* a2o
V-@:
art - af
u7
We look for a solution in which O is the product of three functions, each a function of one
(and only one) of the three variables:
a:
X@)Y(y)Z(z)
First we substitute into the differential equation:
a2o+ a2o+ :-i
a2o
: x"yz + xy"z * XyZ" :0
3
3
ctx' dy' OZ'
where the prime means differentiation with respect to the argument of the function.
Now for a nontrivial solution (O not identically zero),2r we may divide through by @
to get
Xtl ytt
_l_r_
X,Y,Z
Ztl
-0
At this point our equation has separated; each term is a function of only one of the three
variables
x,y,andz. If wenowvariedx, leaving yandz constant,wecouldchangethe
value of the first term without changing the second two. The equation would no longer
be satisfied. Thus, the only way we can satisfy this equation for all values of x , y , and z is
ifeach term separately equals a constant:
Xtt
yt,
XY
--4.
--8.
ztl
Z
and
A+B*C:0
Thus, the method of separation of variables leads to a set of coupled ordinary differential
equations. The constants A, B , and C are called separation constants.
20See Chapter 1, Section 1.2.4, equation (1.50). Solutions in curvilinear coordinate systems will be explored in
Chapter 8.
2lTechnically, we need the function O(i) to be nonzero except on a set ofmeasure zero or, roughly speaking,
within a region of zero volume.
210
CHAPTER
3
DIFFERENTIAL EOUATIONS
In this example, each function is an exponential. Let
-u2-p2.ThesolutionsareoftheformX
:
eLo",Y
-
A:
a2 and
:
e+Pv,andZ
B:
exp
F2.Tlten
C:
(uJdTP)
The boundary conditions must be invoked to complete the solution.
/ x w x hhas all its walls
(Thin rubber strips
potential.
is
at
a
nonzero
grounded except for the top, which
general
for the potential
solution
Find
the
separate the top from the other walls.)
Example
3.15. A rectangular
copper box of dimensions
everywhere inside the box.
The solution follows a set of steps that
we'll want to use for all problems of this
typ".'2
l.
Choose
an appropriate coordinate
system. Here
we
coordinateswithoriginatonecornerof theboxand0 <
choose rectangular
x < 1,0 < y .
w,
0<z<h.
2.
Separate the equation and determine the sets of solutions that exist. Here the solutions are real and complex exponentials in all three coordinates, as we determined
above.
3.
Choose one set of coordinates that has null boundary conditions. Choose the
function that satisfies the zero boundary condition at both boundaries. Here we
need
X(0) : 0 and X(l) :
0. We can combine the complex exponentials to
give a sine function, which is the solution we want. We reject the hyperbolic sine
because it has only one zero, at x : 0. We need a second zeto at x : /. Thus, we
must choose A to be negative: A : -k2 and X(x) : sin ftx.
4.
Choose the value of the separation constqnt to make the function zero at the second
boundary. Here we need
:0
srnk/
which will be satisfied if
kl
:
wT, or
k-
NT
T
This set of constants is the set of eigenvalues for the problem. The functions
X':sin
nrf
x
,
are called eigenfunctions.
5. Repeat the procedure for
the second set of coordinates:
Ym
22See also Chapter 8, Section 8.2.
-
mIf \)
sin ------'
w
PROBLEMS
211
For the last set of coordinates, the separation constant is already determined by
the two values of A and B that we have chosen in steps 4 and 5. Thus,
rmn:,2
c:-A-t:\i)tnn12+(;J
and is positive.
6.
Choose the final function to satisfy the one remaining zero boundary condition,
given the value of C. In this case, with real exponentials as the possible solutions,
we must choose the hyperbolic sine as our solution:
Z:
rinn
(T)' .)
(
Thus, the solution contains terms of the form
. wfx
mTy
Sln-SlIl-STDII
7. Write the general solution
.)
,
as a linear combination of such terms:
(T)'
This answers the question posed, but there is one final step:
8.
Use the final nonzero boundary condition to determine the cofficients
is the subject ofthe next chapter.
an^.This
PROBLEMS
f,
A vehicle moves under the influence of a constant force F and air resistance proportional
to velocity (compare with equation 3.5, with i replacing the gravitational force). Find
the speed of the vehicle as a function of time if it starts from rest at t : 0.
2. Find the general solution to the differential equation
y"'-3y"*3y'-y-o
3.
Hint: ExIend the result for a double root from Section 3.1. l.
A capacitor C, inductor L, and resistor R are connected in series with a switch (see
the figure). The capacitor is charged by connecting it across a battery with emf t. The
battery is disconnected, and then the switch is closed. Find the current in the circuit as
a
function of time after the switch is closed.
212
CHAPTER
3
DIFFERENTIAL EQUATIONS
PROBLEM
3
4. The Airy differential equation is
- xy:o
y"
Find the two solutions of this equation as power series in x.
S
Solve the equation xyl/
*2y :0
(Section 3.3.2) using the Frobenius method. Show that
as discussed in Section 3.3.2.
y(0) cannot equal any nonzero constant,
6. Find
a solution of Laguerre's differential equation
xy" + (l
that is regular at the origin. Show that if
of degree k.
7.
- x)y'*
cv
ay
:6
is an integer k, then this solution is a polynomial
Solve the Bessel equation
4x2y" +4xy'
as a Frobenius series in powers
ofx.
I
(4x2
- l)y :
o
Sum the series to obtain closed-form expressions
for the two solutions.
8.
Solve the hypergeometric equation
(*2
- *)y" + (3x -
112)yt
t
y
:
0
as a series
(a) in powers of
x
(b) in powers of x
-
1
9. Find two solutions of the Bessel equation
/
o\
x2y"+*v'+(*2-t-)v:o
' \
4/"
as series
in.r. Verify that your solutions
agree with the standard forms
fT ,"ot*
V;( "
\
*sinx/
PROBLEMS 213
EOl Consider
a
linear differential equation of the form
*2y"+xfy'+g!:o
Expand the function
f (x) in a power series
f(x):
of the form
fo* ftx
I
fzx2
+...
and expand the function g(.r) similarly. Find the indicial equation. What is the condition
on /6 and 96 if there is only one root? What is the value of the root in that case? Use the
method of variation of parameters to show that the second solution of the differential
equation is given by equation (3.37).Ifint.' Show that the equation for u may be reduced
to the form
dl
*(lnu'}:--+h(x)
where ft(x) is a series of positive powers of x. Integrate this equation twice to obtain
equation (3.37).
[iL
linear differential equation of the form x2 y" + xf y' + g/ : 0, the indicial equation
h(p) : 0, where ft (p) is a quadratic function. Show that in determining
the recursion relation, the coefficient of the cn teffi is h(p -l n). Hence argue that the
method fails to provide two solutions if the solutions of the equation h(p) : 0 differ by
an integer. Are there any exceptions?
For
a
may be writte n as
12. Solve the equation
:
* (*)'
,#
0 (equation 3.19) by writing it in the form
y"
y'
yty
and integrating twice.
13. Find the two solutions of the equation
y"-y'*I:o
x
as series
in powers of x.
14. Determine a solution of the equation
(l
I
x)y" +
(3
+ 2x)y'
* (2* x)y :o
x. Hence determine the solutions for all .r.
li5.l Oete.-lne the large argument expansion of the Legendre function
at large
tion of the equation
(l - *2)y" - 2*y' -l2y :
as a series
in powers of I lx.
o
Q1 by finding a solu-
i
214
cHAprER 3
DTFFERENTTAL EeuATroNS
L6. Solve the equation
*2y"-4xy'|_(6+x2)y:o
17. Solve the equation
xy"
-
y' -l 4x3y
-
O
18. The conical functions are Legendre functions with / : -L, + iX.
(a) Starting from the Legendre equation (Example 3.7), find the differential equation
satisfied by the conical functions P-;+ixt) and Q_y*,^(x).
(b)
Show that one solution is analytic at the point
r:
1
,
and determine a series expansion
fortheconicalfunction P_11ir(cos0) inpowers of sin(0 12). Henceshowthatthis
conical function is real.
) : 0 in standard form, and use Fuch's theorem to show that
the Frobenius method may not give two series-type solutions about x : 0. Change to
the new variable u : Ilx (as in Example 3.10) and show that the new equation can be
solved by the Frobenius method. Obtain the two solutions.
19. Write the equation *4y" +
[20.] Sotve the equation
Y"+Ycosh'r:0
Hint: Firstexpand the hyperbolic cosine in
a series, and then use a
power series method.
21. The Stark effect describes the energy shift of atomic energy levels due to applied electric
fields. The differential equation describing this effect may be written
xy"
+ y'*
(o
- f,r*' +T - *t), :
o
where the term Exz 14 is the perturbation due to the electric field. Obtain a power series
solution for y and obtain explicit expressions for the first four nonzero terms. How many
terms are needed before any effect of the electric field is included?
22. Show that the indicial equation for the Bessel equation
*Q#) *xy-0
has a repeated root. Show that this root leads to only one solution. Find the second
solution using equation (3.37). Try to get at least the first three terms in the series.
23. Attempt to solve the equation
*2y" +
!' :o
using the Frobenius method. Show that the resulting series does not converge for any
value of x.
PROBLEMS 215
[24.]
W"b"t's equation is
I x2\
l"+(* +___lv:0
2 4 /'
Show that the substitution 1l - exp(--x2/4)u(x) simplifies this equation. Find two
solutions for u(x) as power series in x.
25. The Schrcidinger equation in one dimension has the form
h2 d2il.,
2^ip
Develop a series solution for
of two nucleons:
lt
+ (E - V) tL :0
in the case where V is the potential due to the interaction
,,y
--L -'-o'
Obtain at least the first three nonzero terms.
26. The Kompaneets equation describes the evolution of the photon spectrum in
a
scattering
atmosphere:
!dt : n"orr+
\ !vo@, + n + n2)l
mcL x' dx
Here n is the photon number density, x is the dimensionless frequency, and o7 is the
Thomson scattering cross section. We may find a steady-state solution (A lAt :0) when
photons are produced by a source q(x) and subsequently escape from the cloud. When
n remains (( 1, the Kompaneets equation becomes a linear equation:
\lvorn'
x'dx
where
*n )l +
y is the Compton "y" parameter,
scattering)
x
q.")
equal
-on
y
:o
to (fractional energy change
(mean number of scatterings). Assume that
q(x)
-
0 except for
x(
per
1.
(a) Show that for x )) 1 the solution is an exponential. This is the Wien law.
(b) Show that in the special case ), - 1, with q(x) = 0, the solution is a power law in .r.
(c) Verify your answers to (a) and (b) by letting n : e-x D and finding a power series
solution for u. Obtain a solution for general y, and then let y -+ 1 to verify your
answer to (b).
27. Aparticle falls
d under gravity. Air resistance is proportional to the square of
.Wittethe differential equation that describes the particle's
position as a function of time. Choose dimensionless variables, as in Section 3.4.1, and
show that the equation may be put into the form
a distance
the particle's speed:
F
:
kuz
y"+oy'2-F:o
Use the Runge-Kutta method to solve this equation with cv : 0.1. Assume the particle
starts from rest. How long does it take for the particle to fall 10 m?
216
cHAprEB
s
DTFFERENTTAL EeuATtoNS
[ZAl tn astrophysics, the Lane-Emden equation describes the structure of a star with equation
the equation of hydrostatic equilibrium
of state P - Kp(n+t)ln.If we define p -
^.0",
becomes
i*e#) *Q':0
where x is a dimensionless distance variable with
the Lane-Emden equation.
x
:
0 at the center of the star. This is
(a) Find a series solution for @ in the case n : l.
(b) Find the first three nonzero terms in a series solution for @ for arbitrary n. Verify that
your result agrees with the result of part (a) when n : l. Hint: Begin by arguing
that the solution contains only even powers of
r.
(c) Solve the equation numerically for n :2, QQ) : I, and d'(0) : 0. At what value
of x does @ (x) first equal zero? (This corresponds to the surface of the star.)
29. Investigate the effect ofair resistance on the range ofa projectile launched with speed us.
Assumethatairresistanceisproportionaltovelocity:Fres: -ai.Writetheequationsfor
the x- and y-coordinates in dimensionless form. Scale the coordinates with the maximum
range
R
:
,E/5. What is the dimensionless air resistance parameter? Determine the
dimensionless range for values of the air resistance parameter equal to 0, 0.1, 0.2, 0.4,
and 0.5. Determine how the maximum range changes, and also determine how the launch
angle for maximum range changes as air resistance increases. Hint: lf there is no air
resistance, you can obtain exact expressions for the increments in position and velocity
in a time interval Ar. Use the same expressions when s I 0, but with acceleration
computed from the value of i at the beginning of your time interval.
30. The equation that describes the motion of a pendulum is
,r8.
I - -VsnY
(a) When y remains small, the equation may be reduced to the harmonic oscillator
equation. Solve this equation to obtain the solution
(b) With the initial conditions y(0)
: rl3,
yt(0)
y(t).
:
0, solve the nonlinear equation
period.
does
the period differ from your
By how much
numerically to obtain the
result in (a)?
31. Bessel's equation
oforder
u has the form
d2v
ldv
/
v2\
ii+;d*+(t-p,/
):0
Show that the differential equation
d2f
77 lz'7:g
may be converted to Bessel's equation through the relations
f:JZv
l
PROBLEMS 217
and
2
,l*r/,
--r+2'
resulting Bessel's equation? (The solutions are given in Chap-
plus sign is changed to a minus sign, so that the equation is
u"+u'+\u:o
has a solution of the form
u:1Qs-x/'*,
(;)
and find the order v of the modified Bessel function.
i
I
I
:
I
:
CHAPTER 4
Fourier Series
4.1. FOURIER'S THEOREM
The principle of superposition for waves states that when two or more wave disturbances
are present at the same time in a system, the total disturbance is the sum of the individual
disturbances. This physical principle remains valid so long as the linearity of the system is
preserved; that means that neither any of the individual disturbances nor their sum is too
large.l Mathematically, this principle is due to the linearity of the governing differential
equation (3.15). The functional form of the sum of the disturbances need not be a simple
sine function-it can be quite complicated and irregular.
This principle is also the idea behind Fourier's theorem, which says that any moderately
well-behaved2 function /(x) defined in a domain 0 < x < 2r may be expressed as a sum
of sines and cosines:
f (x) :\,{o"sinnl
*
bncosnx)
(4.1)
n:0
Equivalently, we may write the series as a sum of complex exponentials:
f (x):
(4.2)
n\r,dn*
llfthe equation is written in terms of suitable dimensionless variables
(see Chapter 3, Section 3.4.1), the necessary
conditionisy(1.
2we'll give specifics of how well-behaved the function must be in Section 4.5. For now, you may assume that any
function that is the solution to a physics problem is well enough behaved.
219
i
I
t
l
i
220
CHAPTER
4
FOURIER SEBIES
Here we are using a physical principle (the principle of superposition for waves) to
motivate the mathematical result that a function f (x) may be expressed as a superposition
of wavelike functions (the sines and cosines) no matter what use we may make of the
function /. Result (4.1) or (4.2) can be used in a variety of different physical systems, as
we shall see.
Since the sines and cosines are periodic functions, the Fourier series is also periodic. Thus,
f (x t2mr) : f (x), where m is an integer. We can find a Fourier series for a function that
is not periodic, but it will represent the original function only in a finite range. Outside of
that range, the original function and its Fourier series representation will differ; the series
gives a periodic extension of the function in the original range. In many physical situations,
we want a solution valid in a bounded region of space (for example, inside a box of side
L), and a Fourier series with period I works well, since the solution does not even exist
outside of the range 0 to Z.
4.2. FINDING THE COEFFICIENTS
4.2.1.The RealSeries
Let's assume that a series of the form (4.1) exists. Then how do we find the coefficients a,
and bn? First we note that the sine and cosine functions form a set of orthogonal functions
ontheinterval 0<x<2n(or,infact,onanyintervaloflength2r).Thefunctionsare
orthogonal3 in the sense that
fo'" "rnn,
sinmx dx
:o
(4.3)
l
and
j
fo'" "o"n*
unless
z:
cosmx dx
:o
cosmx dx
:o
(4.4)
nin both cases, and also
fo'" "rnr*
(4.s)
for all integers m and n. Multiplying two trigonometric functions and integrating over one
period is analogous to taking the dot product oftwo basis vectors.
3We shall give a more precise definition of the terrn orthogonal in Chapter 8.
4.2 FINDING THE COEFFICIENTS 221
To prove relations (4.3)-(4.5),
we'll
sincvcosp
:
p
:
sina sinB
:
cos
cy
cos
need the following relations:
jtrtn
)f"o"
jt"o,
-
p)l
cos (cv
-
p)l
cos (cv
*
P)l
(a +
fl)*
sin(o
(a +
fl)*
- fl)-
(a
Thus, we may write the integral in (4.5) as
t?n
/
Jo
sin
nx cosmx a*
If m : n, the integrand
: *I /12, trtn (n * m)x * sin (r LJo
m)xldx
\121t
I
I
:ll-| /
cos(n-d*ll
cos(n*m)x2 \ n*m
n-m
/lo
- 0 form fn
is j sin2mx and the integral is still zero.
Next we shall prove result (4.3):
fzosin nx sin mx dx : I f2" - (n
[cos
Z Jo
J,
m)x
: ^I I I sin(r zLn-m
-0 formfn
I
If n
:
-
cos (n
*
m)x1dx
tt2n
I
m)x ------:- sin (n +rn)xl
n+m
I
llo
m, the integral is
['"
Jo
"rn'
nx dx
:: ['" (l 2Jo
cos 2nx)
dx
: :( .- I ,in zr*\l'" : o
2\ 2n
/lo
if n : m :0, in which case the result is trivially zero.
Finally, the integral ofthe cosines is
except
r2n
J,
cosnx cosmx O.
: iI f2o [cos (n - m)x * cos (n -l m)x]dx
Jo
I I
I
l-sin(n-- 2ln-m
:0 formfn
and again the result is
r
1f
m
- nf
0. However, for m
r2t
I dx:2n
Jo
tt2n
I
mrx*n*m
sin(n+zlxll
'Jlo
: n:
O, we
now get
(4.6)
222
CHAPTER
4
FOURIER SERIES
Expressing a function /(x) as a Fourier series is akin to expressing a vector in terms
of its components. To find a component of a vector, we take the dot product of the vector
with the appropriate basis vector. Similarly, here we expect to find the coefficients an in
the Fourier series (4.1) by taking the "dot product" with the basis function sinkx. That is,
we multiply the function /(x) and its series (a.1) by sin kx for some integer k and integrate
from 0 to 2z:
r2n
l-"
Jo
.f1.xlsinkx dx
\
r2r/*
: l-. I Io,sinnx * b,cosnx I sinkxdx
Jo
/
\,=
If the series is uniformly convergent,4 we can interchange the sum and the integral. (We'll
test this hypothesis later.) Then
['" f ,rrsinkxdx:i.(",
['"
Jo
?:o\"Jo
:
*b,"Jo
['" cosnxsinkxdx)
,rnn*sinkxdx
/
AkTt
since all the other terms in the series are zero by relations (4.3) and (4.5). Thus,
ak:
: 1,," .f (x) sinkx dx,
kZ
l
(4.7)
1
Similarly, multiplying by cos kx gives the result
r
t2n
b1,:!1T I .fg;coskxdx, k>l
Jo
(4.8)
Whilefork:0,
uo:
I f2,
*
f
Jn
(x) dx
(4.e)
which is the average value of the function'over the interval.
Example
4.1.
Find the Fourier series for the step function
and/(x):1forr<x<2n.
f (x) :0
for 0 <
First we define the series to be
:i
,or: {l
U
n:O
"lo,=..,1",
4In Chapter 6, we show how to relax this restriction.
r,
sinnr
* bncosnx)
x<z
4.2 FINDING THE COEFFICIENTS
223
Then, using results (4.7) and (4.8), we obtain the coefficients:
an:
: t;"
f (x) sinnx dx
1l (-cosnx)
"' fin
:l
|
0
ifn
l-21
nn
:;1J,
12,
I
nT
-l-1
sinnx dx
+ (-1)'l
is even
if n is odd
and
br:-
tf 2n
I
nJo
/(x)cosrx O* :;
I f2"
J,
cosnxdx
:11
(sinnx)l2f :0
fin
We have to evaluate b0 separately:
bo:
lf2"1f2nl
2"J, fktdx:2oJ, o*:t
As expected, b6 is the average value ofthe function over the interval 0 to 2n. Thus,
the series is
r,"r
: {f ;l:=.i 1or, : :\
-
:i't:*
P:rJ
Figure 4.1 shows the first four partial sums5 of this series. Notice that the value of the
series at x
0 and at x T iq I/2, not 0 or 1. When a function has a discontinuity, the
Fourier series always gives a value equal to the midpoint of the jump. The series is a good
:
-
representation ofthe function, but it is not identically equal to the function at every point;
there can be significant differences at a finite number of points.
As we include more terms, the sum more closely represents the step function. Just beyond
the discontinuity at x : r,the series overshoots the value y : 1. Increasing the number of
terms moves the peak of the overshoot closer to x : n but does not reduce its magnitudean effect called the Gibbs phenomenon. The value of the series at the peak 0f the overshoot
is 1.179. (SeeAppendixV.) The Gibbs phenomenon occurs at every jump of adiscontinuous
function.
The series we have found actually represents a square wave. Figure 4.1 could be repeated
indefinitely to both the left and the right. Thus, the jump from 0 to 1 visible at x : n in
Figure 4.1 also occurs at every odd multiple of n, and, in reverse, the jump from I to 0
visible at x : 0 occurs at x : 2mn.Therefore, the series takes on the value L : LfO + I
when x equals any integer multiple of n.
5See
Chapter 2, Section 2.3.1 for the definition of partial sum.
224
CHAPTER
4
FOURIER SERIES
N:3
f(x)
1.0
0.8
0.6
0.4
o.2
0
FIGURE 4.1. Fourier series for the step function (heavy line). Shown are the partial
t'n!t',*,t)'
1 - tt
^
2 .un:O
2n I I
:
(solid line), and
N
for N :
0 (dashed
line). N : l
(dottedtine),
sums
N:
2
3 (dot-dashed line) terms. The series overshoots the step by
187o. As the number ofterms is increased, the overshoot does not decrease, but moves
closer to the step. This is the Gibbs phenomenon (see Appendix V).
If the function is defined over a range of values that does not span a range of 2n , then
we can proceed by changing to a new variable. For example, if the step function is
f
then we define the variable u
the series is
:
(o
(x\: { ifo<x<0.5
l1
\- if0.5<x<1
2nx
I
Itw\:
so that
2
S
\
0 < u < 2z corresponds to 0 <
2 nu^
n:U
sinl(2n
_
In general, if the function is defined for 0 <
2tt x I
L
so that u ranges from 0 to
f (x):E
2r
[,,
(4.10)
x<
-f l)2trxl
1. Then
(4.11)
2nll
x I L, then we choose the variable a :
anddefine the series
as
"r"(r) *b,cos(4r))
With this argument, the orthogonality relation for these sine functions is
L
1L /2nnx\ /2mnx\
/2"
sin(nr'r) sin(mu);du:
t'" (--) tt" ( r o*:
J,
/
Jo
L
,3^,
(4'12)
4.2 FINDING THE COEFFICIENTS
22s
and thus the expression for the coefficient an is
o": r2 fL /(x) sin / 2nnx\
Jo
\ , )0.
(4.13)
The factor in front of the integral is one divided by (half of the range of integration).
Similarly,
b,: ? lo' ro""" (rf1o.,
n>o
(4.14)
For bo, we get
bo:
;t;
f (x) dx
(4.1s)
Again we find that bs is the average value of the function over the interval.
For a function defined on the interval -L < x I L, we choose the variable u : ltx/L,
whichrangesinvaluefrom-n to1n,againarange of 2n.Thecoeffrcientsof thesinesin
the Fourier series are given by
f+L
o,: i,I J_r,or'^(T)
The coefficients
o-
(4.t6)
b, of the cosines are found similarly.
4.2.2. The Complex Series
Complex exponentials also form a complete set of orthogonal functions. In the onhogonality
relation for complex functions, we must multiply one function by the complex conjugate
of another:
r2tt
J,
y,yi,d*:
r2r
Jo
i(n "inx"-imx4*: --=
: o,
lo
m),itu-m)xl'_"
wlileifn:m,
f2n
J"
,'n'e-'n" dx
:
f2n
Jo
I dx
:2tr
m+n
226
cHAPTER
4
FouRrER SERTES
This expression is valid for n
:
m
:0
also. Thus, the coefficients in the senes
+oo
f(x): D
,nr'"'
(4.17)
are given by
Ln
-
+ l,'" f (x) e-'"* dx
(4.18)
In this case, the number in front ofthe integral is one divided by (the range ofintegration).
Ifthe function f(x) is real, then
\*
r
/ I f2, (x)e-inxdx) :
,;: l*
+ J,
l, f
p2n
f (x)ei"* dx:
c-,
(4.1e)
Let's evaluate the complex series for the step function (4.10):
(o
u ifo<x<o- : +m
:
/(')
D'nn',n',o*
iii.=:,':
; n:-6
\
1
Notice that we have chosen the variable 2n x as the argument of the exponential, since this
variable's value goes from 0 to2r as.r ranges from 0 to 1. Then
e
u^-Jrp'
c,: I fG)r-i2n,,a*:
[' ,-i2nnx4r:f
^L.)r-iznnxlllrP
Jo
\-2nri1
: !-k-2'ni - r-nni1: .Ltt - (-l)'l
znr
znTt
Like the an and bn we obtained previously, this coefficient is also zero unless n is odd, in
which case the term in parentheses is 2. We must evaluate the integral differently if n :0,
since our expression has a zero in the denominator in that case:
co:
Thus. the series is
r1
I o*:)
Jrlz
4
Now notice that the coefficient for n
n:
N:
: -N,
3
FOURIER SINE AND COSINE
SERIES 227
c_N, equals the negative of the coefficient for
c-N : -cN
Thus, we may rewrite the series as
f (x):
1lgi
;* i
t
n:1,
;ru,rn",
n odd
-
r-i2nrx1
1 2 S
sin(2nnx)
\_
_::____.
,
n
2 T ,:ioaa
A.2l\
where the sum is over odd values of n. The result is the same series (4.I 1) that we obtained
using sines and cosines from the start. One advantage ofusing the exponential series is that
the integrals are often easier to do.
A function /(x) may be represented by aFourier series of sines and cosines (equation 4.I
and Section 4.2.1) or of complex exponentials (equation 4.2). Since the two series represent
the same function,
ir,sinnx*
Expanding the sines
""0:"lr**
@ (
ii",
;'ot
ut
(Chapter
,inx - n-inx
-
b,cosnxl:
'2i
,,
f
,n"''*
"rrurton"l.i^tur.rr,we
i-* \
get
+m
,inx
+6--l!l:|',''n'
2 ) n?*
oo
b,
! b, .-in*] : f
iL) [-ia,_+
,inx
lian
'
'
2
z
t n:-6
n:O '
'
-
.,",,"
Thus, the relations between the coefficients are
c--:unI'o',
L-ncr:b-!--J!,
12
2
n>l
't-'
(4.22)
and
co:
bo
4.3. FOURIER SINE AND COSINE SERIES
The function shown in Figure 4.2 is obtained from equation (a.10) by shifting the origin
vertically and multiplying by - 1. The resulting step function,
(-tn
-L<x<o
S.(x)-< '-.- ififo<x<*z
l+t1z
(4.23)
CHAPTER
4
FOURIER SERIES
FIGURE 4.2. The step function 56(-r) is an odd function on the interval
(-L,
L).
is clearly odd in x over the range -L < x < ,L and so should be represented by a series
of sines, which are also odd in x. We did find that series (4.2I) for our previous step
function/square wave contained only sines. The origin shift in y is represented by the
constant 1/2 in equation (4.21). Thus,
g
(nr
S.(x)--T2 ) sin n x /L)
n:Goaa
(4.24)
When the function is odd on the range - L < x < L, the argument of the sines is n x I L
because the period is 2L. [In the previous section, we used the step function (equations 4.20
and 4.2I) with
I l2.l However, we can express the coefficients as an integral over half
the period, 0 < x < Z. The coefficients in a sine series ofthe form
-
I :
f (x):i,o,"inff
n:l
are given by
on
I fL
/nrx\
J-'f r*l ( J
'
fL
/nnx\ \
/nftx\
: ! (f
r\/-rl(x)sin(. )o'*Jo "f(x)sin\ t )d')
-- T
sln
ax
1 / fo
/ nTrxr
/lL
,nnrt, \
:T\Jrf
t-x)sin(- r )d(-x)+Jo /(r)sin\, )dr)
Then, since
an
/(x)
is odd,
I /
: L\-
t
/nlrx\1
1L
,r11)a*\
fo
o* *
/(x)sin
(,
-fa)
ll
\-r, /
l-sin
Jr
Jn
4.3 FOURIER SINE AND COSINE SERIES
an:
Conversely,
sented
/nlfx\
?t: /(x)sin \
if a function is even on the range
by a series of cosines with argument
nxI
f (x) :bo + i
' n:l
-Z
(4.2s)
, )d*
<
x<
229
L, the function will be repre-
L:
a,
nr )c
"o, L
The coefficients are given by
br: t2fL
Jo
/(x)
cos
/nfix\!)o
d*
\t
(4.26)
)
and
bo:
; t,' f (x) dx
We can make the odd square wave (4.23) into an
along the x-axis by L12 (Figve 4.3).
(4.27)
:n fu nction by shifting it to the left
f(x)
x
L
:
FIGURE 4.3. By shifting the origin to the right, we can create a step function Ss(-r) that is an even
function. This function is represented by a cosine senes.
230
cHAPTER
4 FouRrER
SERTES
Then series (4.24) becomes
oo
S"(x)
: -2 \lJ
1f
n:l,n
2
7t
2
7f
1
/x
-sinnzl-+
,?
\L
odd
.xwtxwf
-L2L2
sin nn
@
\./2
n:I,n
@
-l- cos nz
-
sln
-
n
odd
el)(n-r)/2
nL
odd
\,/-
n:l,n
)
nrx
(4.28)
which is a cosine series, as expected.
As with the complete series, if a function is defined only in the range 0 < x < L
(Figure 4.4a), then the Fourier sine (or cosine) series will represent a periodic extension of
the function outside ofthe original range (Figure 4.4b). The sine series gives an odd periodic
extension, while the cosine series gives an even periodic extension (Figure 4.4, c, d). Each
of these series has peiod2L.
f(x)
FIGURE 4.4a. The function /(.r)
is defined on the
range0<x<L.
f(x)
_LOL2L
FIGURE 4.4b. The complete Fourier series gives
period is L.
a
periodic repetition of the original function. The
4.3 FouRrER srNE AND cosrNE
sERrEs
231
f(x)
slne senes
FIGURE 4.4c. The Fourier sine series is a periodic repetition of the odd extension of the original
function. The period is 22.
f(x)
even extension
coslne senes
-LOL2L
FIGURE 4.4d. The Fourier cosine series is
a
periodic repetition of the even extension of
/(x).
The
period is 2.L.
Thus, in principle, we can find three different series for a function
rangeO<x<L:
1. The full series
f (x):
with coefficierts cn
: !
L
.f"t
f Al
,f_rn",rnn,,"
,_2innx/L
dx
andperiod
I.
/(x)
defined on the
-t
CHAPTER
4
FOURIER SERIES
2. The sine series
J$l:
with coefficients an
3. The cosine
a
z
r- anstnnTX
)
L
n:l
rL
I
. nJLI
: ; I /(x) sin
.
LJo
L
-
dx and period 2Z
series
3
nrx
f(x):bo+Lb"cos ,
n:l
: 2 fL / (x) cosnrx d * and b6 equal to the average value of the
L Jo
L
over the range 0 to I. This series also has period 22.
with coefficient" b"
function
Different applications require the use of different series,
as
we shall see in the next section.
Example 4.2. Find the Fourier sine series and the Fourier cosine series for the
function f(x) :x, 0 < x < L.
The sine series /(r) : DansinnnxfL represents an odd function on the range
-L
< x < L (Figure 4.5). The coefficients
o'
2 fL
/nlrx\
: T/
xsin(-l
o'
are (equation 4.25)
2/
L
: r[-';cos,
nr* L
fL L
nTrx \
o* Jo -cosi"
)
2 / L2
/ L\2
,rz*rr\ :
:- L\
I -1 cosn1+ (
l sin--:l
L to/I -ZLeD'
nr
ntT
\ro/
-
FIGURE 4.5. The first six terms in the sine series of the function
f (x) :
-r defined for 0
< x < Z.
233
4.3 FOURIER SINE AND COSINE SERIES
Thus,
-t:--
S) (-l)'
2r
o?
nrx
n
L
-
range
-L
<x<
/(x) : Dbncosnrxf L represents
an even function on the
(equations
(Figure
4.26 and 4.27)
are
L
4.6). The coefficients
The cosine series
bo:
IfL
Jo
i
*o': iL
and
L
,or.\
nrLl -dx
I
f(x)
x
L
FIGURE4.6.Thefirstfourtermsinthecosineseriesofthefunction/(x)=.rdefinedfor0<x<L.
Compare this graph with Figure 4.5. A few terms of the cosine series represent the
function more closely than does the sine series.
Thus,
is
b, :
0 for even n , and b,
: -4L / (ntr)2
for odd n. Therefore, the cosine series
I
L 4tS
z_t^ (2p * l)z
z nz P:O'
'
-
Qptt)nx
L
234
4
CHAPTER
FOURIER SERIES
Compare Figures 4.5 and4.6; the cosine series converges much faster than the sine series
(the terms decrease as If n2 rather than 1/n). This happens because the odd extension is
a discontinuous function with a jump at x : lL, whereas the even extension is a continuous
function. In Problem 2, you will be asked to look at the full series for this function.
4.4. USE OF FOURIER SERIESTO SOLVE DIFFERENTIAL
EQUATIONS
Fourier series may be used to solve differential equations under some circumstances. In this
section, we shall look at two of the most common applications.
4.4.1. An Inhomogeneous Linear Equation Has a Periodic Driving Term
When the source term in an inhomogeneous linear equation is periodic, we expect that the
solution will also be periodic. One example is an electric circuit connected to a power supply
that supplies a periodic emf.
Example
4.3.
A circuit has a 10 O resistor, a 2 mH inductor, and a 0.3 pF capacitor
connected in series with a power supply that generates a square wave emf t(r) with
:
a period of z
0.3 ms (Figure 4.7). The applied voltage varies between zero and
*1 V. What is the current in the circuit?
FIGURE 4.7. The series ZRC circuit in Example 4.3 with
current and charge variables
(I
a
periodic (square wave) emf. With the
and Q) as defined in this diagram,
I:
dQ/dt.
The charge and current variables I and Q are defined in Figure 4.7. Kirchhoff's
rules applied to the circuit give the differential equation
L-dI +RI+
dt
f,:erO
where 1
:
d Q / dt .
If we write the equation in standard form (equation
d2o
R do
- -),
dt2'Ldt----1
t-
ot
' I.C
I.
3. 18),
i
4.4 USE OF FOURIER SERIESTO SOLVE DIFFERENTIAL EQUATIONS
the coefficients of
dQ/dt
235
and Q arc
R 10s2
2a: lL 2mH
and
,3=
I
1
LC
(2 mH)(0.3 rr.F)
1010
6
s-2
Then
l0s
@o:
ftradls
:4 x lOa radls
We have already found the series for the square wave emf
t(t):
t
+oo
^(:.:
\
n:-
where z is the period of 0.3 ms and t6 :
series with period
t
(equation 4.20):
('':))
"ooi"'o
I V. Next, we write the solution
as a
Fourier
r:
Qft):
,t*r,*r(,'+)
(Note: It is much easier to use the series in exponential form here, because the equation
has both first and second derivatives. When we differentiate the series, each term in
the equation contains simple multiples of the original exponential terms. In contrast,
the odd-order derivatives mix sines and cosines: dsinxldx: cos-tr. Instead of one
equation for each coefficient cr, we would have two equations to solve simultaneously
for the coefficients of the sines and cosines.)
Now we substitute the series for t(t) and QG) into the differential equation:
i
i - ('+)'q,ri2nnt/t +2' n:-6
i22o nn"'"nt/'+@?0
:?(:.: r
\
n:-oo.
,,
f
nn"'"n'/'
n:-@
n:-@
Lr,r,,,t,\
odd
/
Next we make use of the orthogonality of the exponentials by multiplying the whole
equation by exp [-i(2mnt lt)] and integrating over one period. Only the terms with
n : m survive. Since this is true for any integer m, we aar' equate the coefficients of
each exponential separately.
The constant (n : 0) term gives
)
a54o:
c
ol)
-
+qo
toC
2
CHAPTER
4
FOURIER SERIES
The other terms are given by
| /zno\2
a,;-\;)*
4otinr "'l h
. +a6):ii
i
i
tstz
Qn:
I
L n l-(2wr)2 + 4qrinru + c'Sr2l
tsr2 4atr +[r3r'- (2nr)2]i/n
r L lr|t, - (2nn)212 I (4arnr)2
n
Notice that the real part of q, is even in n, while the imaginary part is odd. This is
exactly what we expect if the resulting series is to be real. The real parts combine to
give cosines, while the imaginary parts combine to give sines:
i
g(t\:t:9*t:!
2
nL
4wr + la;3r2 -
(2nn)2li
ln
^_^
(
,2nrt\
/
fl@t^o\'r
2ntrt I ^^ (2nr)'lsin
^ 2nnt
4cYrz cos
toC
r ;lafir'
_
- ,L $
- 2 -2tot2
k$t2-(2nr72J2+{4atnrS2
2
-@,n
Finally, we can differentiate to get 1. Then
4Est
I(t):
L
t
4narn sin
2nnt
@3t'
n:1
-
^
(2nr)'l
+ la'ot'
-
(2nn)212
a (4utnr)2
2nnt
The constant in front ofthe sum is
4tst
4(l V)(0.3
L
(2 mH)
ms)
:0.6 V.s :0.6A:10
-H
Thus,
I7t'1: -t0.6A;
p
L
4narn
,in2n!! + b3r2 - (2nr)2l"or'no'
Notice that if the natural frequency ar6 of the circuit times the period z of the emf
is very close to 2nn for some integer n, then the current will be very large if cv is
small; there is
a
resonance at frequency @n
aNf n
:
(4.I x
:
Znn
10a s-1;10.3
lr
.
In our example,
x l0-3 s)/r :3.9
4.4 USE OF FOURIER SERIES TO SOLVE DIFFERENTIAL EQUATIONS
237
which is very close to 4. The solution is shown in Figure 4.8. The current is dominated
by the n : 2 term, which has twice the frequency of the square wave emf.
1(A)
0.02
r (ms)
0
-0.02
-0.04
t--\
\n:2
-0.06
FIGURE 4.8. The first three terms of the solution for the current in Example 4.3. The thick solid line
is the sum. The thin solid line is the n : I term, the dashed line is the n : 2 term,
and the dotted line is the r : 3 term. The solution is dominated by the n : 2 term.
The square wave emf is shown schematically by a thin solid line. The square wave
indicates the phase (but not the amplitude) of the applied emf. The vertical axis does
not apply to this curve.
4.4.2. The Solution We Want Equals Zero at Two Boundaries, .r = 0
attdx=L
If a physical quantity is described by a function /(x) with "f (0) : f (L) : 0 and exists
only in the interval 0 < x < Z, we can make an odd extension of the function /(x) to the
range -L < x < L and represent it as a Fourier sine series. The sine series is always zero
at the endpoints r : 0 and x : I and thus automatically satisfies the boundary conditions.
For example, consider the problem of waves on a string of length L.The differential
equation for the string deflection y (x , r) is (equation 3. I 5)
.,32
u-
y
o2y
*z:
6P
We can solve the equation using separation of variables.6 Let
u2x"T
Dividing through by y
I
I
t
t
t
XT , we get
.X"
XT
t
i
:
: xr"
6See Chapter 3, Section 3.5.
T"
y(x,t): X(r)f
(/). Then
238
CHAPTER
4
FOURIER SERIES
Since each side of this equation is a function of only one variable, each side must equal
a constant if the equation is to remain true for all values of both x arrd t. Since we want the
stringdisplacementltoequal
X to be a sine,
zeroatx:0andx : Lfor alltimes/,wewantthefunction
so we choose the separation constant to be negative:
X,,
,-z
positive integer. Then the equation for Z
T"
T
:
n:l
(T)
k:
nnf L, where n is
t nn:,2 ^
t-2,.2 ___l_l
uz
sin (nn ut I
y(x.t):!sin
0; that is,
IS
"
and the solutions are
be written
sin(ft-r)
sin(kl) :
Next, we choose the constant ft to make
a
) X:
L)
\t )
and cos (nn ut I L) . Thus, the general solution may
fo,'rn
(t:)
* bn cos (tf)]
(4.2e)
which, for fixed r, is a Fourier sine series in x, as expected. The amplitude of each Fourier
component is oscillatory with angular frequency @n : nft u / L. The initial conditions determine the constants an and br.
4.4. Suppose we pull the string up at the middle so that it forms a triangle
of height h (Figure 4.9) and then let go. Find the subsequent motion of the string.
Example
L
FIGURE 4.9. The initial shape of the string (t
The solution has the form (4.29). At
v(x. 0)
:
/:
and the string velocit! at
t
:0
0) in Example 4.4.
0, we have
(xQh/L)
ifO<x.L/2
l(t - x)(2hlt) if Ll2 < x < L
{
:
:i0,"^(ry)
is zero:
Dvl
/nirx\
'l
:0:) Srzu
0rlr:o ?- -sinl-la_
L \ L/
4.4 USE OF FOURIER SEBIES TO SOLVE DIFFERENTIAL EQUATIONS
Thus, all of the coefficients an arezero, and, from equation (4.25), the coefficients bn
are given by
2 fL
bn: i
Jo
o)
nrx
sin
dx
"*'
'
: # (lr''' * sinffa' + l',rrt-
x) sin
#0.)
To do the first integral, we integrate by parts:
L ,
no*rlL/2 fr/z L
l-cos-llL /lo + I nr
nn\
Jo
72 /
wtr /L\2
wv
:rno
lL/2 nrx
I xsin-dx:xL
Jo
wtx
,
----=-dx
L
(-cosi)*\;)sin7
The cosine term is zero unless n is even, and the sine term is zero unless n is odd.
The second integral is
1L nftx
L2 r
nnxt.tL :- L2 t nnr
Ll
-Jrp- si11!!24*:1(-.or"',^)l
2 -,-tl')
L /lt/z nn\IcosjLnn\
and the last integral is
wrx
1L
nrx
L,
nnxr, lL
fL L
I xsin'-'---dx:y{cos )ltp
ll - I
--nn\--- Jtp"L
L Jtpntr
-cos-dxL
:
# (rr-rr' -*"7). (h)' sin)
Adding the three terms, we get the result
b,:4h
z -(-l)'
2
Z nTr 2 +cosf
wrLl-l""rT*Lsinf
*)(r,-r^ -*"7) * *'^T)
:- th
(nr)t
wr
2
which is zero unless n is odd, in which case the result is
u,
where we wrote n
:
ffir-g@-rtp
:2p *
1. Then the
v(x,t):Y\#{",in
P:l
: 12ol*1t-9n
solution for y is
(rzz
+ DT)"o, (tro
. r+)
240
CHAPTER
4
FOURIER SERIES
Figure 4. l0 shows the first four terms of this series. The plot shows the dimensionless variables7 y f h versus x
attimes ut f L :0,I14,I12, and l.
lL
v
n
./
--l---+---F--l---+-
\
t-l,",- - f - - -f ---t--
!
L
FIGURE4.10. Thestringdisplacementattimes/: L/utimesO(thicksolidline),714(dashedline),
1/2 (short dashes), and 1 (thin solid line). The first four terms of the series were used
to make this plot.
4.5. CONVERGENCE OF FOURIER SERIES
Now that we understand why Fourier series are useful, we should worry a little bit about
how well they converge.
4.5.1. Pointwise Convergence
A function is very smooth if at least the first two derivatives of the function exist and the
second derivative is continuous. It is piecewise very smooth if we can divide the interval
(-L, L) into a finite number of subintervals and the derivatives exist within each of the
pieces. The step function (4.23) satisfies this requirement, since the derivatives exist within
each of the two subintervals (-1, 0) and (0, I), even though the derivatives do not exist at
x
:0.
The Fourier series of a function f (x) Ihat is piecewise very smooth 6n an interval
-L < x < I converges pointwise to )U{, * 0) + f(x -0)l for each x in the
interval.
TSee
Chapter 3, Section 3.4.1
4.5 CONVERGENCE OF FOURIER
At the endpoints, we identify
f(L -
x : -L
and
SERIES 241
x : Z to obtain the limit ittef
+ 0) +
o)1.
Fourier series are not, in general, uniformly convergent. They are uniformly convergent
only in closed subintervals where /(x) is continuous. But Fourier series can be integrated
and differentiated term by term. We'll see why in Chapter 6.
4.5.2. Convergence in the Mean
Fourier series converge to the corresponding function
f (x) in the mean.
That is, the square
deviation
**
: J-rl
ft lru, - f ,nr,no,,,l' a* --o as N -+ oo
,7x
I
(This idea should remind you of a least-squares fit of a model to a set of data points.)
The function and the series may differ substantially at a finite number of isolated points
and still converge in the mean. We have already seen this happen with the step function
series (Figure 4.1), where the series always takes on the value at the middle of each step,
independent of the value of the function at that point.
The Fourier series of any piecewise continuous function converges in the mean. This
condition is sufficient but not necessary; that is, there are functions that behave more badly
yet whose series also converge in the mean. As physicists, we do not need to concern
ourselves with such functions because they do not arise in the solution of physics problems.
Now let's ask if we can do better than the series we have. Is there any set of coefficients
cn for which the remainder R1,, is smaller than that given by the Fourier coefficients?
Let
rLN2
RN: | ,f(xlJ
cnei""/L
|
-r.
(4.30)
dx
where the cn are unknown. Then, to minimize R1g, we choose the coefficients cp so that
ff:o
* l:,(t,', - ,t*'n''nn*") (r.u, -,i
Ifthesumff;=
-N
cneinrx/L isreal,thenc[
[ (-
kn x / L
r't
:
a 2'i ktr x / L
"
rL
J-r-f
''0"'tL
dx +
i
---N
c-m,and,if
^t- *'
-
che-i^'.rr) dx :0
f (x)isalsoreal,thederivativeis
*'-' ^"'')
IL ,;*nr/rr-^s-imnx/L
J-L
x
:
o
dx
:
o
d
242
cHAPTER
4 FouRtER SERIES
In the second term, every term in the sum integrates to zero except the one withm
:
k.
Thus, we have
rL
I ytik"r/tdx :2Lc_r
J-r
and so the value of cp that minimizes R1y is
ck:
r
;
rL
,-,0"*rtd*
J_rf
(4.31)
which is the usual Fourier coefficient (equation 4.18). Thus, the Fourier coefficients are
optimum in the sense that they make the square deviation R,ntr a minimum.
We can use the result that Rlr -+ 0 to prove another important result about the Fourier
coefficients. We begin by expanding expression (4.30):
rL/
N
N
\/
Rr: l- ( frrl- t
/-'\'
n:-N
cneino*/L
) (f,rr.- t
m:-N
/ \
\
cf,e-i^"'/Lldx
)
rLrLN
: I lfulP a* - l, \ lft*)*r, r
J -L
- -- n:_N
f')c\nleinttx/14*
rLNN
+ I \) ,nutnn'/' t
J_r,?r
cfle-i^n*/Ldx
m:_N
The last term may be rearranged to give
t
"i -t:-,,,i, I
:
and the integral is zero unless m
x/
L'-imn
x/L
r'iwt
4*
n, when it equals 21. Thus, the last term
rs
NN
2L
I
D c,ctr-zrn:-N
lc,lz
n:-N
We can also simplify the middle term, since
TLNNTL
I c\nrin"'tLdx : L r\, J|-L- tr*r rinnx/1tr,
l--t tr-l n?u
n:Jp
J
NN
: It
*
c* n2Lc-n
: Zf I
n:-N
lc,l2
PROBLEMS 243
and, similarly,
TL/VNN
>, f(x)*c,ei"ft'/Ldr:2L n:-N
D cncfi:ztn:-N
I - n:-N
\
J-L
b,l2
Combining the last two terms, we have
tLN
nx
: |
-L
1ye)12
dx
J
Since Rp
t
-2L \
n:-N
lr,t2
0 by its definition,
rIt
N
tL
\t"t2
;J-,trG)t24*' n=-N
which implies that the series on the right converges. Thus, for a series that converges in the
mean, R1g
-+
0 as
N ->
oo, and so
rIt
@
tL
\
; l_, f(*)2dx:n:-@
(4.32)
cn2
This is one version of Parseval's theorem.8
We can understand this result by thinking about the energy stored in the capacitor in the
circuit of Example 4.3. Averaged over one period, the energy stored in the capacitor is
'
:
I f, QG)2 _
: I \-,^,2
; Jo 1;o'
zc
Lb^r
From this we conclude that the time-averaged energy equals the sum of the energies in each
of the individual Fourier components.
PROBLEMS
I
Snow that the Fourier series (equation 4.1) for a function
f (x) :i
n,
n:O
"o,
/(x)
may be written
(nx -t Q,)
and find expressions for kn andQr.
8See Problem
variant.
2l for Parseval's theorem for
the sine and cosine coefficients, and see Problem 22 for an additional
244
2.
CHAPTER
4
FOURIER SERIES
f(x) : x
(a) overtherange0<x<1
(b) overtherange -l <.r < 1
(c) Make a plot showing the original function and the sum of the first three nonzero
Develop the full Fourier series for the function
terms in each series. Comment on the similarities and differences between the two
series.
3. Develop the Fourier series for the function f (x) : x2 over the range 0 < -r < 1 .
@ An odd function /(.r) on the range (-L, L) has the additional property that f (x + L)
:
- f(x).
(a) Make a sketch showing the important features of this function.
(b) Which kind of Fourier series (sine series, cosine series, or full series) represents this
function on the range -L < x < L?
(c) Show that the series has only terms of odd order (n :2m * l), and find a formula
for the coefficients as an integral over the range 0 < x < L 12.
(d) How does your answer change if f (x -f L) : t f (x)?
(e) How do your answers change if the function is even, but /(x + L) : - f (x)?
5. Which series, the sine series or the cosine series, do you expect will converge more
rapidly to the function f (x) : x3 on the range 0 < x < 1? Give reasons for your
answer. Evaluate the first four nonzero terms in the optimum series. How large is the
fractional deviation ls4
I {G) | u, :
"
lf@)l
sum of the first four terms.
0.5 and
x : l? In this expression,
Sa
is the
6. Find the Fourier series on the range 0 < x < 2n for the function f (x) : sin cvx, where
a is not an integer. Check your result by evaluating the limit a, --+ n. With the value
a : 0.7, plot the original function and the first three terms of your series on the range
0<x<2rr.Comment.
7. Find an exponential Fourier series for the function sinh cvx on the range 0 < x < 2n .
S
9.
By combining terms, rewrite your answer as a series in sines and cosines.
finO the first four nonzero terms in a Fourier series for the function tan r on the range
-n/4<x<n/4.
Use numerical integration to find the first ten terms in a Fourier series for the function
sin x2 on the range 0 < x < n. Discuss the percent error between your series and the
function sinx2 over the given range.
10. Find
a
Fourier series for the ramp function
.f
(x):
{r
ifO<x<1
ifl<x<2
ontheintervalO<x<2.
11. An electric circuit contains a 3 mH inductor, a 50 pF capacitor, and a 200 Q resistor
in series with a power supply that supplies a rectified sine wave voltage (see the figure)
with amplitude 110 V and period 2 ms. Determine the capacitor voltage as a Fourier
series.
system is driven by a periodic driving force with period
\') r",.,,_lo,
7:
ifO<r.T/2
\a(r-r) ifT/2<t
Find the response of the system.r(r) as a Fourier series.
<T
246
CHAPTER
4
FOURIER SERIES
14. A simply supported beam of length L bears aload W that is uniformly distributed over
the first 1/4 of its length. Determine the deflection of the beam as a Fourier series. Make
plots showing the first one, two, and three terms of your answer. How many terms are
needed to obtain a result accurate to l%o? (The differential equation satisfied by the beam
deflection is equation 3. I I , and the displacement is zero at the two ends.)
@e beam rests on supports at its ends, x : 0 and x : L. The load q(x) varies linearly
along the beam: 4 : ax.What are the boundary conditions? Find the displacement of
the beam as a Fourier series. Plot your results, and comment.
16. A guitar string of length L :65 cm is plucked by pulling it to the shape
v(r, o)
ifo<x.L/3
: Io*'
\{"/+'ttt-r12
if Ll3
<x <L
and then letting go. Determine the subsequent motion of the string. Which harmonics
are excited? Plot the string displacement as a function of x for t :0,0.4, and 0.8 times
I/u. Also plot the original string shape for comparison. Comment.
17. A violin string is plucked to the shape of a triangle with apex one-quarter of the way
along the string, as shown in the figure, and then let go. Find the displacement of the
stringatlatertimes.Plotyourresult(uptothen:10term)for/:
Lf2u,wherc
v : ltlJ
1t
L/lou,Ll5u,allid
is the wave speed. Are all harmonics excited?
I
L
0.010
0.008
0.006
0.004
0.002
0
o.4 0.6
PROBLEM
0.8
x
L
17
18. ApianostringoflengthZishitbyahammeroflengthl: tr/l0.Thehammeriscentered
at x : Ll4, and the impulse it imparts is 1. Assume that the impulse is uniformly
distributed over the hammer's length. Determine the subsequent displacement of the
string as a function of x and /. Which harmonics are excited? Plot the string displacement
as a function of x for t :0.1,0.5, and 0.75 times Llu.Use the first five nonzero terms
of the series to make the plot. Comment on your results.
PROBLEMS 247
19. Fourier series may be used to evaluate certain series of functions of integers.To illustrate
the method, develop the Fourier series for the function x2 on the rarrge -7r to z. Set
x
:0
and hence evaluate
I
St-l)'-r
\
2
n2
Which sum do you obtain by setting x
:
I
4's'
-t-.*-*...
n? Finally, use Parseval's theorem to evaluate
$r
Ln4
n:1
EO.'l
Us" the Fourier series for the step function to evaluate the sum
S
t-t.r'
L^2* *
1
m:o
(See Problem 19 for the method.) Use Parseval's theorem applied to the same series to
obtain the sum
$r
z-^
12m
21. A function
-f l)2
f (x) is represented by the Fourier senes
3 r
nrx
f G) : L lo, sin , *
n:O
nr(x\
bncos -----
)
I
on the range (-L, L). Derive a form of Parseval's theorem (equation 4.32) applicable
to this series; that is, express [j! f t*f a* in terms of the coefficients a, and bn.
22.
If f(x) is represented by the series D .fne'"' over the interval 0 < x < 2n,
: D grri'" over the same range, prove the genenlizedParseval theorem:
and
g(x)
1,2noom
L .f,t-,: n:_@
\ f,sl
- I f(xtg(xtdx: n:_@
ztf Jo
where the second expression applies when the function g(x) is real.
23. The capacitor shown in the figure on the next page is charged by the battery and discharges
through the bulb when the potential across it equals 0.9V. (See, for example, Lea and
Burke, Example 31.3,p.993.) Assuming that the capacitor discharges very rapidly, show
that the potential across the capacitor as a function of time is
vc:v(l
-"-,,*r1,
-\)
and repeats periodically with period
that represents this function.
T
:
o < r < RCtn lo
RC ln 10. Find a Fourier series with period ?
248
CHAPTER
4
FOURIER SERIES
PROBLEM
@l n
rectangular box of dimensions
y
23
a x b x a has conducting walls. All the walls
:
are
b. This wall is separated from the others by a thin
insulating strip, and it is at potential V. Using the method illustrated in Chapter 3,
Example 3.15, find the potential everywhere inside the box.
grounded except for the one at
x b x c has all its walls at temperature T1 except for
s, which is held at temperature 72. When the box comes to equilibrium,
the temperature function T(x,y,z) satisfies equation 3.14 (extended to three spatial
dimensions):
25. A rectangular box measuring a
the one dt 7
:
AT
u -
Dv2T
with the time derivative on the left equal to zero. Using the method of Chapter 3, Example 3.15, find the temperature T in the box in the form
T(x,y,z):Ttlt(x,y,z)
where z is expressed in a Fourier series
t (x , y , ,,
Find the function
f (z) nd
:
na*
sinlla
snff f fr>
the coefficients amn.
26. An infinitely long conducting tube with circular cross section of radius a is divided into
four equal pieces by insulating strips running along its length. One of the four pieces
is at potential V, and the other three are grounded. Solve Laplace's equation in two
dimensions, using the method of Example 3.15. Evaluate the solution at p - 6 411
show that the result is a Fourier series. Determine the coefficients, and hence find the
potential everywhere inside the tube.
Wl n Fourier series of the form
f (x):Dr,r'n*
PROBLEMS 249
may be expressed as a power series
f (x):D"nrn
where z : lirnr+t re'r ar'dr < L The function f (x) may be identifiedby summing
the power series. Use this technique to sum the Fourier series
@
s\_ Slfl/l-T
where 0
<x<
you found.
=n
rr . Check your result by evaluating the
Fourier sine series of the function
CHAPTER 5
Laplace Transforms
5.1. DEFINITION OFTHE LAPLACETRANSFORM
An integral transform allows us to convert an inhomogeneous linear differential equation to
an algebraic equation that is easier to solve. We shall look in detail at two such transforms:
the Laplace transform and the Fourier transform (Chapter 7).
The Laplace transform is an integral operator applied to a function f(t) that is defined
for0 < / <
oo:
L(f):F(s)
:
lo*
rale-'t
(s.1)
dt
The factor e-'t in the integrand causes the integral to converge for a very large class of
functions /.
A function
f
is of exponential order o6 ifthere exists a real positive constant
M
such
that
le-"ot y1tll <
for all
r,0 < t <
u
(s.2)
oo, and og is real.
The Laplace transform exists for all piecewise continuous functions of exponential order.
The transform is defined for Re (s) > os.
Let's look at some examples.
1. Exponential functions: f (t) : eot .The function eot is of exponential order cu-we
take o6 : a and M : I in equation (5.2). The transformis
can
|
F(s): [* ,o'r-"4t:
lo
cv-s "tc-s)rl€
Jo
251
252
cHAPTERs LAPLAoETRANSFoRMS
The transform exists for Re (s)
> q, in which
case
F(r):*
2. Powers: f (t)
:
.The Taylor series for the exponential function,
tm
-, :1+
e''
st
e^ t^
+ e2t2
^ * ...* *t +..'
shows that
,^ am!,"
em
and so
Itm e-€t I .
rrEmem
4rr,
"-r,
:
^t'
for any positive real number e. Thus, these functions are of exponential order e. With
m
:0,
the transform is
L(t): fo* "-"at: -+1, :
For m
>
1, we use
:
[6
Jo
te-'t dt
irRe (s) > o
: l,
integration by parts. For m
LO
:
: ---te-t, l* * ;I /l, /,
:
: -+ls'lo +
irRe (s) > o
foo
o-st l@
_,
e-'t dt
Then, in general,
L(t^): I t^e-'tdt
Jo
:
LL(t^-t)
:
1m
1 r@
s Ilo+:sJoI
ry
LG^-2)
:
.
mt^-re-'tL1
..
:
#
More powerfully, we can use the definition of the gamma function (equation 2.75). Make
a change of variable to u : s/. Then
Lltny
-
fo*
tne-,,d,
=#
which is valid for Re (s) > 0 and
:
lo*
(1)'
,"7
fo*uo"-"a":\fiP
p > 0. The
power p does not have to be an integer.
5.2 SOME BASIC PROPERTIES OF THE TRANSFORM
3.
253
Sines and cosines. These functions are easily taken care of, since they can be written
linear combinations of exponentials:
4(cosa.,t)
:
lo*
as
rye-'tdt
_1( I , I \_
- Z \s - ro- - t + i, ) - ;, +,Dt
.s
which is valid for Re (s)
>
0.
We can now begin to compile a table of transforms (Table 5.1). Each transform exists
when the real part of s exceeds some minimum value sg.
TABLE 5.1. Laplace Tbansforms
.f
(t)
f (t)
F(s)
I
J-0
eot
COS(r)T
1
srn @/
,t
r(p +
1P
F(s)
J
szlaz
a
-s2 to2
+
1)
Jp+1
This table gives transforms of the most common functions, but we will need methods
/(r) conesponding to any transform F(s). This process
is called inverting the transform. Since the Laplace transform is an integral operator, we
can change the value of the function f (t) at a finite number of points without changing the
value of the integral (5.1). Thus, the inverted transform cannot be unique. This does not
usually present any difficulties in the solution of physics problems, as we shall see. Usually
we choose a continuous function as the appropriate inverse.
that allow us to find the function
5.2. SOME BASIC PROPERTIES OFTHETRANSFORM
1. The Laplace transform is a linear operotor. This means that
L(af)
:
lo*
o, U, e-'t dt
:o
lo*
f (t)
e-''t dt
:
aL(f
)
where a is a constant, and
L(fi+f): JoIf6 (fi*f)e -" a, :
: L(f) + L(fz)
Io*
fr(t) e -', at +
lo
fzG) e-'t dt
254
cHAPTER
s
LAPLAoETRANSFoRMS
2, Thetransformof thederivativedf ldt maybefoundusingintegrationbyparts:
If / is of exponential
Re (s)
>
order s6, the integrated term aPproaches zero at infinity for
s6. Then
:
'(#)
-"f
(o)
*
(s.3)
sF(s)
Then the transform of any higher derivative may be found by iteration:
'(#) :'l* (#)l: - #1,. "(#)
:-#lo-"to'+s2r1s;
and, in general,
L(4:J\:
\dt^ /
su
F(s)
-t
d^-"
f I ^n-t
(s.4)
dtr'-^ lo"
n:I
3. The attenuation propertyi If a function / (r) is attenuated (or reduced) by
the factot e-at
,
then
L(r-o, f )
:
lo*
e-o'
f (t)
L(e-"'
e-'t dt
f):
F(s
:
*
lo*
71t1 e_-G+o)t
a)
dt
(s.5)
An inversion may be simplified if we can recognize the transform as a simpler function
of (s
*
a) for some constant d.
Exampte
5.1.
Invert the transform F(s)
: ll$2
-l2as + a2).
We recognize the denominator as (s + a)2.The function F(o) : lf o2 inverts
to give f (t) : t (Table 5.1: powers), and so the inverse of F(s) iste-at '
4. The shifting proper4r is closely related to the attenuation property. Suppose we shift the
function f (t) along the t-axis by an amount /6. Since the original function is defined
5
only for
3
USE OF THE LAPLACE TRANSFORM TO SOLVE A DIFFERENTIAL EQUATION
/ > 0, we must cut the shifted
function off at
/:
255
/o (Figure 5. 1) or, equivalently,
-to:0. (Theshiftedfunctionequals zerofort < /s.)Wecandothisbymultiplying
the shifted function by a step function S(l - rs):
att
shifted
/ :
S(/
-
td
f (t -
to)
f (t)
a function. The shifted function is zero for / < t6, but f (t
must cut off the dashed portion with the step function S(t
rO).
FIGURE 5.1. Shifting
-
- td is not, so we
Then the transform ofthe shifted function is given by
CtS(r- totf\t-rs)l:
Now change variables
/ s(r-r0)/0 -tde-'tdt:
Jo
to Lt. : / - /6. The limits become u :
f,fs(r- b\f(t-to\l:
LIS(I
[* ftt-to)e-'tdt
Jro
0 and oo.
I f(u\e-"'+'otdu:r-"o lo[* ffu)e-"du
lo
-tdf Q - ro)l:e-"'oF(s), /o > 0
(s.6)
Thus, atransformthatcontains an exponential factoris thetransformof a shiftedfunction.
In this result, the parameter /0 must be positive.
5.3. USE OFTHE LAPLACETRANSFORMTO SOLVE
A DIFFERENTIAL EQUATION
If we have a linear differential equation of the forml
d2v dv
oji*ti*cy:f(t)
lThe method is not limited to second-order equations (see Example 5.3). A second-order equation is used here for
illustration.
256
CHAPTER
5
LAPLACE TRANSFORMS
we may apply the Laplace transform operator to the entire equation. Then we can use the
Iinearity property to write
^'(#)
* u'(*sr) + CL(Y):
F(s)
and then use the derivative property:
+l
\ - dtlo-
A(
rytol + r2v(rl)
"/ + Bt-y(o)
* sy(s)l * cY(s):
This algebraic equation is easily solved for the transform
r(s)
:
F(s) + By(0) + A[dy
F(s)
I(s):
/dtls*
sy(O)]
(s.7)
As2+Bs+C
The problem of solving the original differential equation is thus reduced to the problem of
inverting the transform I(s). In many cases, it is not too difficult to invert the transform,
and this is the advantage of the transform method.
Linear differential
Laplace transform
equation for y(t)
direct
solution
for the transform
algebraic
I
J
Algebraic equation
solution
(aimcutt)
I(s)
I
J
(easy;
<- inverse transform <Because of the initial values of y and its derivatives that appear in expression (5.7) for the
transform Y, the Laplace transform is well suited to the solution of initial value problems.
5.2. A series ZRC circuit has a constantemf E and a switch. The switch
0. What is the current in the
has been open for a long time and is then closed at /
Example
:
circuitfort>0?
If the switch has been open for a long time, we are to asSume that the current has
dropped to zero and the capacitor is uncharged with q(0) : 0. Then we can define
the current variable i(l) so that
.dq
dt
(s.8)
shown in Figure 5.2. (It is very important that you take care in defining the algebraic
variables corresponding to the physical quantities ofcharge and current. It is easy to
make sign errors that will cause your solution to be incorrect. See, for example, Lea
and Burke, p. 990.) Then Kirchhoff's loop rule applied to the circuit after the switch
is closed gives the differential equation
as
dioL-+Ri*!:E
dtC
(s.e)
5.3 USE
OF THE LAPLACE TRANSFORM TO SOLVE A DIFFERENTIAL
EOUATION 257
FIGURE5.2. Definitionofcharge(i)andcurrent(4)variablesinExample5.l.Theswitchisclosed
at
I
:0.
Now we transform both equations (5.8) and (5.9):
,=t(#):se-q(o)
and
ot
LIsI-t(O)l+RI+;:;
Use the initial conditions 4(0)
:
0 and
t(0)
:
0, and substitute for Q(s):
I /1\ :; e
Lst*RI+r(;,/
Solve for 1:
t/s
t: l,s*R+(l/Cs)
(s.10)
With the usual definitions q
=
1(s
have
258
CHAPTER
5 LAPLACETRANSFORMS
There are several possible ways to invert the transform. We could factor the denominator and then use partial fractions. The roots of the denominator are
-.1 :
s+:-o*
,:
where we have defined
1@
-
I
s/
r/^\
_ | _
r\J''
-- t
-a !.i
ot."y6"n
\I
-
_-
)--
\(t-t+Xt-tJ
,3-o2 - -aLia
( t _ t \
t
LG+--) \Ft*
s-s-)
From Table 5.1, we can invert each fraction to get an exponential:
iO:
:
go
,hL(e'*'
t
,L,
"'
- e'-'): ,fr7e-"t{e+iot - r-i<t:t1
Sln (@f )
As / increases from 0, immediately after the switch is closed, the current is positive;
that is, it is charging the capacitor.
An alternative method uses the attenuation property (equation 5.5). First we complete the square in the denominator, which may be expressed in terms of s * o:
\
a
*
\l- t- I/
I
\(s +cv)'z +,.,3 -@z ) ofl \tt +a)2 +'2 )
I
r-- t/ I
Now, from Table 5.1, we recognize o/(s2 -f ri.21 as the transform of sin (art). Our
transform has (s * a)2 rather than s2, which indicates that the inverse is the sine
. Thus, the solution is
function multiplied by
"-o'
I
i
(t\ : :;e-dt
sin (ror)
as before.
Even if we do not have values of the function and all its derivatives att : 0, we can sometimes leave the unknown values in the solution and use the remaining boundary conditions
to eliminate the unknowns at the end.
5.3. A simply supported
beam of length Z is one that rests on a support
(Figure 5.3). If the beam supports a load Mg that is uniformly distributed
over a distanc e I at lhe center of the beam, find the displacement of the beam'
We set up a coordinate system with x-axis along the beam and origin at one end.
Then the given boundary conditions are
Example
at each end
y(0)
: 0;
v(L)
:
o
5.3 USE
OF THE LAPI..ACE TRANSFORM TO SOLVE A DIFFERENTIAL
EOUATION 259
FIGURE 5.3. A simply supported beam of length L carrying aload Mg.
The differential equation satisfied by the beam is equation (3.1 1):
dav 1
dr!: EIs@)
(s.1 1)
With a fourth-order equation, we need four boundary conditions. We can find the
remaining two boundary conditions that we need from equations (3.8)-(3.10). In
equilibrium, the torque2 about any origin must be zero, and thus m(0) :0. Also,
:0 since there are no forces to the right of the right end. Thus, we know two
quantities at 0, y and y", and two at x : l.
The load function is
m(L)
q(x)
tr (L -.t)12 < x < (L + t)12
: {yrtt
l0
otherwise
Now we transform equation (5.11):
sar
-
s3y10)
-
r2y'(0)
-sy"(0)
-
y"'(o)
: fiOAl
where
Msl ( L+t\s)-exv(/ L-r\1
-"^dx:i
e(s):[#9r-"'
,
2
, '/l
r+
Lexe(:
At
sl "-t'1''2
sinh
t
2
Put in the known initial conditions:
,aY
-
r(s)
:
s2
y'
10)
-
y"' (o)
- !E-r-t' tzz sinh?
Thus,
9"-u1z2sinh 4 + Y'!o) + Y"'\o)
sr EI [.
2
.sz
.s4
2Recall that m(.r) is the counterclockwise torque due to all forces to the right
ofx
260
oHAPTERsLAPLAoETRANSFoRMS
in the first term suggest that we apply the shifting proPerty
(equation 5.6) after inverting 1/ss. The powers of s invert to powers of .r (Table 5.1).
Thus,
The exponentials
:
up / r\ I /
riz \n )Lr
Y(x)
-'+)-
- +)^s('
(" -
+)^s(' -'+))
+ y'(0)r +
y"'Q)*
Now we need to apply the remaining boundary conditions to find the unknowns y/(0)
and y/'/(0). Evaluating the solution at x : L, we have
y(L)
:
ffil(+)^ - (+)^l *,',0,,
: ffil)ttrt'+ h)+
y'(o)r +
+ v"' (uLi
v"'Q1l
:s
and
_ Ms l('_:t\'_('=\'f
t2l
dr2l.:r-2rrtl\ z ) \ z )
:
ffitt
+ y,,,(o)L
*n,,,,0,,
\v
l''
:o
From this result we obtain
y"'(o) =
Then from
y(L)
:0
y'(o)
-#
it follows that
: -ffirr'
+
e) -
r"'e)* :
#9I;9
So the solution is
v(x)
:
#h{;l (" - +)^'('
-'+)-
(" -
+)^'(' - +))
.
e;r)._,uj
5.4 SOME ADDITIONAL USEFULTRICKS
The displacement of the beam in the case L
:
04
0.2
261
L12 is plotted in Figure 5.4.
06
x
i
YEI
MgL3
FIGURE 5.4. Deflection of th ebeamy EI / MgL3 versts x f L (Example 5.3). Notice the very different
scales on the two axes.
5.4. SOME ADDITIONAL USEFULTRICKS
In this section, we shall compile a list of tricks that may help us to find or to invert
a transform.
5.4.1. The Derivative of the Transform
Since the transform F(s) is a smooth function of s, it may be differentiated:
dFdf@_-fcod
ds
cls
:-
I
JO
ln*
.f (t)
e-" dt
: I
JO
f Q1
. e-"'dt
Cls
tf (t)e-'tdt
Thus,
dF
L(tf ) : --
(s.12)
We can repeat this process n times to obtain
L(t"
f): (-l)^#
(5.13)
262
cHAPTER5 LAPLAoETRANSFoRMS
Example
5.4.
From Table
5.
Find the transform of / sin a,l/.
l, the transform of sin art is o I $2
4(rsin
ar2). Thus,
(r+r)
,,1: -ft
:
+
2so
64 upy
5.4.2. The lntegral of theTransform
The transform may also be integrated:
Now do the integration over o first:
:
l,*
f !t)
Io*
,rolo"
"-"0'
: t(+)
(s.14)
It is essential to this derivation that the upper limit of the integral over o be infinite, so that
the exponential goes to zero at the upper limit. It is also necessary that the transform of //t
exist-that is, that the integral converge. For example, we cannot apply this result to the
function f = I, since the integral [f, (e-'t / t) dt does not exist.
Example
5.5.
Find the transform of the functio n
f (t) : l-f{
From Table 5.1 and the linearity of the transform, the transform of the function
1-cos/is
1s
s
s2+a2
5.4 SOME ADDITIONAL USEFULTRICKS
Then
f@/l
o
\
\ , ):J, \;-;taa)d"
4(L-cost1
:
(rno
\ - l2 +
:limrn-!-m-!
S+oo
rn (o2
J 52 +
:ln1-ttt-l
^/s2
.,r2))
')1,
ot2
I
Js2 +
a]
+ a2
/ o,2\
: lnJ;rT;z : t (.t
s
tln *;t/
5.4.3. Periodic Functions
Suppose the function
it
"f
(l) is periodic with period T (Figure 5.5a). Then we can represent
as
and so on, where
g(t)
is
f(t):g(t
n
gure 5.5b). Equivalently,
)
-2T)+...
0123
FIGURE 5.5a. A periodic function
/(r).
(5.15)
264
CHAPTER
5
LAPLACE TRANSFOBMS
0123
FIGURE 5.5b. The function g(t) is nonzero only in the first period
Now we can apply the shifting property to find the transform. First we transform g:
G(s)
: [' sfrle-'tdt
Jo
The upper limit is I, because the function g is zero for / > Z. Next we use the shifting
property (equation 5.6) to evaluate the transform of the second period of the function:
LIS(I
-
T)sQ
- T)f : e-'r G
and so on. Thus, from equation (5.15),
F(s): (Ile-'r *r-2sr +...)G
F(s):
G
(s. r6)
| - e-st
- series (equation2.43). Thus, a denominator
where we used the known sum of the geometric
of the form I - e-'r in the transform F(s) is the signature of aperiodic function /(r) with
period Z.
Example
period Z.
5.6.
The function
G(s) is
Find the Laplace transform of a square wave of amplitude 1 and
g(t)
:
o:
1
for 0
fo'''
< t < T l2and
e-'tdt:
is zero otherwise. Thus, the transform
+l'o''
:'- ':''''
5.5 CONVOLUTION
Note that the upper limit of the integral is
T
/2 . t < T. Then, using equation
T /2, since the function
(5.16), we have
265
g(t) : 0 for
F(s)::(T#):^;-,
The denominator I - e-'7 is the signature of a'periodic function, but it can be masked
by the form of G, as we see here for the square wave. The same thing happens with sines
and cosines-the prototypical periodic functions-for which G o<. | - e-'r , as you should
verify.
5.4.4. Discontinuous Functions
:
has a discontinuity at t
tr. Then we can find the transform F in
Suppose the function
the usual way. But we have to be more careful with the transform of the derivative, since
the derivative does not exist at /1. Thus, we divide the range of integration into two pieces
and integrate each piece by parts:
/
,(#):
:
:
:
Iou
.f
#u$dt
e-"1:;
+
*, l,
1,,*
#n"0,
ye-'tdt + f
"-,, T
Qr) - "f (tr+)l - /(0) *
- "f(0) - e-'tth
s-st1f,f
sF(s)
*, f,,* f e-,tdt
sF(s)
where ft : f (h+) - f (n-) is the jump in the value of the function at
of higher derivatives may be evaluated similarly.
/1 .
The transforms
5.5. CONVOLUTION
5.5.1. Systems with Memory
t
Certain physical systems have memory; that is, the behavior of the system at time / depends
on what has happened to the system at times less than /. The series LRC circuit is an
example of such a system, as we have seen in prior examples. The current in the circuit at
time r depends not only on the emf t applied to the circuit at time r, but also on the past
history of the circuit (when the switch was closed, for example). The current in the circuit
at any time / is determined both by the structure of the circuit and by the emf t(t) applied
to the circuit over time.
If we choose our current and charge variables as in Example 5.2, so that i : 'fdq / dt ,
Kirchhoff's loop rule results in the differential equation
t#*Ri+3:tG)
266
cHAPTER5LAPLAcETRANSFoRMS
Now we transform the equations to get
I:sQ-s(O)
and
LLst
Let's simplify by choosing
i(0)
-t(0)l + RI + 9:
L
:
q(O)
:0.
E(s)
Then, combining the equations gives
,2gt+sRe*9:zg)
or
E(.s)
Q:-#:E(s)R(s)
szl+sR+l/C
where
:i(r#T*)
R(s):Zi:,r\
t(sr+
tt+ tc1
The transform of 4 is the product of the two transforms E(s) and R(s), one being the
transform of the input function t(r) and the otherbeing the transform of the systemresponse
function r(l). The inverse of the transform E(s)R(s) is the convolution of the functions
€(t) andr(t):
t-t€n :
E*
r
: I to)rtt - t)dt
Jo
(5.17)
This result is called the convolution theorem. (See Appendix VI for the proof.)
Physically, this integral represents the input to the system at time r times the response at
timer(thatis,atimet-tlater)tothatinput,summedoveralltimesrfrom0to/.The
convolution may also be written
as
rt
t +r : r xt : I r(r)e(t - r)dr
Jo
that is, time measured backwards from the present (z
where r represents "time ago"
: t).
(t
to the time the clock was started
Thus, the charge on the capacitor in the circuit problem is
ft
q(t): I eG)rQ-t)dt
Jo
:
0)
5.5
where
r(/)
coNVoLUroN 267
is the function whose transform is R(s). We found this function in Example 5.2:
I
r(t): fie-"t
sinia,t
The circuit response is a damped oscillation.
To illustrate the use of the convolution, consider an AC circuit with
t(z) :
t0 cos Oz.
We have
q(t)
: f€o
(t
;cos or sin ar - r) exp [-a(t - r)l dr
J,
to -_o, tt
: -=-e
cos Or sin @ (t La Jo/
:
r)
edr
dr
['6rnyrr+ (o - o))t)+sin[arl - (a + a)rl]eo'dt
=t=o "_-o' Jo'
2La
The integral is most easily evaluated by writing the sines in terms of exponentials. The result
is
Y
\'/
Eo / a[2aosin or
Z, \
[cv2
+
-
(Q
sin a-lt]
a 7,t21
"-ut
(o
- d2][d2 + + @)2]
, ola2 - Q2 + a.,211cos {2t - e-at
(a2
+
Q2
-)
cos t,lf I
You should check that this solution satisfies the initial conditions q(0) :0 and i (0)
The long-time solution is an AC current. First note that for t )) | /a,
q(tt
to (294la sin Qt + <o[cY2 -- o2 + ar2] cos Qt \
- LrD\
)
and therefore
cos Ar + All - ro3l a2l sin Ar
de_
to
\
irt):E--5,2-\m)
^ nz /2a
: )€t
rocos Qr
* x sin or)
where Z is the impedance,3 given by
22
:-
R2
r x2 = n' + (ar- :)'
\ ac,) :
R2
r-'' )'1
: rrl+o, +\r-\r---@-))
:u
+ a2 ( t- "'
lu
: L2fa2 + (o - d2lwz + (o + d2llsz2
3See,
for example, Lea and Burke, p.1022.
See also Problem 2.14.
t
(aD2e - r3/a\2
:
\
0.
268
oHAPTER
5 t-nPr-AcE TRANSFoRMS
This is the expected long-time behavior. The other term in 4(r),
Es a(a2 + Q2 + a2) sinrirt + a\a2
la2 + (o
-
raD2llo2
+
-
Q2
(sz
+
+
or21
cos
@t
-ot
@)21
is an initial transient that goes to zero with a time scale I I a
:
2L I R.
5.5.2. TheTransform of an Integral
We may apply the convolution theorem in the special case that one of the functions is
g(t) = I with transform G(s) : l/s. Then we have
t(lr' r(t)s(t - do,):
,(1,'
a result
F(s)G(s)
rt"dr) _ F(s)
s
(5.18)
known as the primitive function theorem.
Example
5.7.
Find the Laplace transform of the sine integral
Si
(r)
:
/nt sin
Jn
z
;0"
First we use result (5.14):
,(Y) : I,* L(sint)do
: [* -J-oo:
t--lol*ls
J, o2+l
:;
-tan-r,
Then, using the primitive function theorem, we have
ltsi(r)l:;-itl 1t*-r"
or, since tan (n
/2
-
0)
:
cot 0, we may write the result in the more compact form
.CtSi
(r)l
:
cot_l '
.t
-
.s
5.6 THE GENERAL INVERSION PROCEDURE
269
5.6. THE GENERAL INVERSION PROCEDURE
We have so far amassed a collection of tricks for inverting a given transform F(s) to find
the function /(r). We'd like to have a single technique that we can use every time and that
does not require recognizing a trick that will work. This procedure is the Mellin inversion
integral:
f(t):
_t| 1r+ia F (s) e't ds
2ni Jr-;*
(s.19)
The integral is along a line parallel to the imaginary axis in the complex s-plane. The line
must be positioned so that all of the singularities of the function F(s) are to the left of the
line, as shown in Figure 5.6. This line is also called the Bromwich contour.
Im (s)
Re (s)
FIGURE 5.6. Contour for the inversion integral. The vertical line must be placed to the right of all
the singularities ofthe transform F(s).
To prove the result (5.19), we work backwards and take the transform of the function
/(t) defined by equation (5.19).
.
Ltf(t)l: [-* r-" zfit f*t* F(o)eotdodt
JO
"*,]- Jy_jso
Interchange the order of integration and perform the integration over /:
Ll,f(t)l::
:
['*'* ro>' Jo[*
2ri Jr-;*
1
:----.
'2nt
e-st+otdtdoi
1v+ia
e-$-ov
F(o)
o -,5
I
Jr-1*
-l
l@
lO
do
270
CHAPTER
5
LAPLACE TRANSFORMS
Now, provided that Re (s)
we have
> Re (o) : y, the exponential approaches zero as / -+
oo, and
1 fY+i@ F(o\
' do
Llf G)l: :---- I
/.|tt Jy-i6 S-o
Note that the function F(o) is analytic everywhere to the right of the path of integration,
so we may close the contour to the right, as shown in Figure 5.7.
Im (o)
Re (o)
FIGURE 5.7. Contour for proving that equation (5.19) correctly inverts the transform F(s). The
contour is closed to the right, enclosing the pole at o : r.
Then there is a single pole ofthe integrand inside the contour,4 at the point o : s. The
integrand along the big semicircle is zerg, provided that Max l^F(o)l approaches zero on
the semicircle at least as fast as R-6, where e > 0. Notice that we are traversing the contour
in the clockwise direction, rather than the standard counterclockwise direction, so we must
introduce a minus sign when applying the residue theorem. Then the result is
Llf (t)): -2ni ^L.lim (o zttto+s
:
s)I9
s-o
F(s)
/ is F, then / is the inverse transform of F.
When we use the Mellin inversion theorem to evaluate /, we cannot close the contour to
the right because the exponential e't causes the integrand to blow up on the large semicircle
at infinity. Thus, we extend the function F into the region to the left of the contour, using
analytic continuation, and close the contour to the left, thereby enclosing all the poles of F.
If the transform of
5.8.
Invert the transform F (s)
The denominator factors:
Example
t2 -2t - 3:
4Recall that we have already assumed Re (s)
r
z.
:
3s
I (s2
-
2s
(s
+
1)(s
-
3)
-
3)
.
271
5.6 THE GENERAL INVERSION PROCEDURE
Thus, the transform has simple poles at
inversion integral, we get
s : 3 and at s - -1. Using the Mellin
f (r): :2ni ['*'*
Jr-i*
-,r,
, *.tlr')'
l)(s - 3)
(s
y > 3. We close the contour with a big semicircle to the left (Figure 5.8).
Since F(s) goes uniformly to zero as R -+ oo, we can invoke Jordan's lemmas to
show that the integral along the semicircle goes to zero. The integral along each of
the small arcs to the right of the imaginary axis also goes to zero. Let the real part of
s on this piece of the contour be x, where 0 < x < y .Then
where
ltl
I I l<
J
I
arcl
llengthof path)(Maxof lintegrandl onpath)
: v3ert
' ,,,* ll(x+1
:
v3ert
, --, o
^J@
x2-2x-3\2 4@-t)2
o,V('
^u*
lr
) *-i
as R
-+
oo
Im (s)
Re (s)
FIGURE 5.8. Contour for actually performing the inversion (5.19). The contour is closed to the left,
thus enclosing all the singularities of F(s). Here the singularities are at s : -1 and
s
:
3, so we must choose
5To use the lemma, first make the change
y>
3.
of variable s
:
ico.
272
CHAPTER
5
LAPLACE TRANSFORMS
Then applying the residue theorem gives
/
\
_ 1rr"r,
f(r):r(. -e-t
_o * o )= 4.-_+ e_r),
3e3r
5.9. Invert the transform F (s) : 11 46.
This function has a branch point at the origin. We must choose the branch cut along
the negative real axis so that it does not cross the path of integration, and we must
deform our closed contour to avoid the branch cut, as shown in Figure 5.9. Applying
the Mellin inversion integral, we have
Example
f(t):
7 fY+i@ es'
I
2ri Jr-;* -ds
"/s
Im (s)
Re (s)
FIGURE 5.9. To invert the transform in Example 5.9, the contour must pass to the right of the branch
point at the origin and must exclude the entire branch cut. Here 7 > 0.
The integral around the closed contour is zero, since the integrand is analytic
everywhere inside. Thus,
(lr'-:: * Ir-*,/opo*.-"t *,* lr"*lo,,o,"oro.-.n.",)
#":
o
and so
t*['
['*'*!0,:-(\./a,,
Jr-,* ..6"" -
../ropof branch.u,
+[+[
' /a" '
-/bouo,nofbr-rt
)4r,
6*'
"ur)
The integral along Ciq goes to zero as R -+ oo, by Jordan's lemma, since 1/"6 goes
to zero uniformly as s -+ oo on Cp. The integral along the small arcs goes to zero
s.7 soME MORE
as R
-+ m
(and we can also let
y
PHYSTCS 273
--> 0 here). The integral around the small circle at
the origin is
[ lor: [_" exp(ee,_oj) '-rr"ieo,
Jc' s*" Jn JEeie/z
- r$ I,
exp(eei|t'1ei0/2de --r
0
as e --+ 0
We are left with the two integrals along the two sides of the branch cut. On the top
side,s: ret ,andso
et' ,
-/top or uranctr cu , 'rTot
f
t_
exp(-rt)
a' : - If' -----1---7-6r,
t\/r
Ja
f'exp(retftt)
:
J*-fij;''"
(-rt)
:J"f@ -ff4'
exp
and on the bottom side, s
t
./bottomofbranctrcur
:
Io,
/s
re-to
, so
: Jt[* "*'!'11":)
e-iod,: - [* "'!-!0 o'
Jt
Jrs-'n1z
-iJr
/-
:l-Ql
exp
(-rt)
,
t\/r
Je
Thus,
_tI
fY+ia ett
2ni
Jr-;* -ds
Js
:
-l
2ni
-2
I /rJo
Now change variables to u
f
(t):
h
Curiously, the function
tive constant).
:
exp
(-rt)
,
Jr
r/. Then
lr*
,-1/2"-u4r:
f (t) : t-r/2
J=r(;) :
I
fit
is its own transform (except for a multiplica-
5.7. SOME MORE PHYSICS
5.7.1. Modeling Problems
A difficulty that shows up in the mathematical solution of a problem often indicates that
we have chosen an inadequate model for the physical system. When the Laplace transform
274
cHAPTERsLAPLAoETRANSFoBMS
method is applied to the solution of a differential equation, the initial conditions must be
included in the solution. In some circumstances, the solution has a different value at t : 0,
indicating a physical inconsistency. This often happens in circuits containing inductance.
Example 5.L0. The two circuits in Figure 5.10 are coupled by mutual inductance.
The switch in the circuit on the right is closed at / : 0. Find the current in each circuit
fort > 0.
FIGURE 5.10. These circuits, coupled by mutual induction, exhibit some of the difficulties involved
with using an inadequate model for a physical system (Example 5.10). The current
variables i1 and i2 are defined in this diagram.
As usual, we begin by applying Kirchhoff's loop rule to each circuit. The current
variables i1 and i2 are defined as shown in the figure. For the circuit on the right, we
obtain
dir
V:itR+Li*r;
di't
and for the circuit on the left, we obtain
di'>
O:izR*t;**;
dit
with initial conditions that both currents are zeto at /
:
0.
Now we Laplace transform both equations:
v
-:
/rR
0:
IzR
*sLIt*sMI2
and
I
sLIz
*
sMIl
To solve the equations we use a trick. Adding the two equations, we get
V
-:XR*sLX]-sMX
.J
s.7 SOME MORE
where
X : It 112. Similarly, by subtracting and definingY :
11
PHYSTCS 275
- I2,we get
V:YR*sLY-sMY
Now we can solve:
(s.20)
t \
x:vr
s \s(L+M)+R)
1
cv
(L-fM)sIRis L+M
-J-. "-rt, where
--Then, from the primitive function theorem (equation 5.18),
-
From Table 5. r, the inverse of
r(t)
:
[' "-o' at :\(rR' =:
L*M- Jo
:
R
/ (L
* M).
r-o')
Similarly,
Y:yr
' \
s \s(t-M)+R/
y(t):f,<t-"-u'1
where B : R I (L - M). (P is positive since Z2
principles of.electromagnetic theory.) Now we can solve for the two currents:
h
lv
:
: 1@*y)
: Lr,
2R' -
I
> M2 is required by fundamental
e-o' + | - e-fr')
U<t -
e-ot
- "-ft)
while
iz: ){, - r) : {; -
:
e-ot
- (r -
v
- 2R',"-p'-e-ot\
i,-Y
.R
as expected.
and i,t-+0
e-frt),1
276
oHAPTER5LAPT-AoETRANSFoRMS
Now suppose we make this system by winding the wires for both circuits together
: M, and fl + m. We go back to equation (5.20) and
into a coil. Then we have L
notice that it has simplified:
V
J
:YRlsLY -sMY:YR
Thus,
v
v_
and, consequently,
Y(t)
:
v
R
which is a constant. Then the equation for X becomes
v
-:XR*s2LX
,s
v/
v
I
l \
_l
\2rr + R/
From Table 5.1 and the primitive function theorem (equation 5.18), we obtain
X:
vf'
t e-Ytdt
-2L Jo
: v - e-Y')
2Lv'
--(l
: Itt
e-v')
R' whercy = Rl2L.Thus,
lv
it=;(x*y):+(2-e-Y')
and
I.
iz: i@
-
y)
V
: --e-r'
Both solutions still have the correct behavioi for long times, but now neither satisfies
the corect initial conditions! At the closing of the switch, the currents have instantaV /2R and iz
neously jumped from zero to i1
-V /2R. Of course, this cannot
happen. The model fails6 because it is impossible for ^L to be exactly equal to M.
:
:
6This is, in fact, the exact solution to a somewhat different problem. Imagine combining the circuits by replacing
all the inductances with a single inductor L. The second circuit then forms a parallel combination. The solution
we have obtained would have zero cunent through the inductor at t : 0, and this is physically possible.
5.7 SOME MORE PHYSICS
277
Figure 5.11 shows how i2 behaves as M approaches Z. The absolute value of the
current increases more rapidly from zero as I - M approaches zero.
2R.
v'
t.,
-
t (s)
0
-0.2
-0.6
0.2
-0.8
FIGURE 5.11. Current i2 as a function of time for (L
s
- M)/R:
0.05, 0.1, and 0.2 s. The diagram
also shows the limit L
M (dashed line). As L --+ M, the cunent increases more
and more rapidly from zero. In the limit, the current makes an instantaneous jump to
V /2R at the moment the switch is closed. This is the correct mathematical limit, but
this solution is physically prohibited.
:
5.7.2. Nuclear Reaction Networks
A common application of the Laplace transform is to the solution of nuclear reaction problems. For example, in nuclear decay,T the number of nuclei of one species decreases, at the
same time increasing the number of nuclei of one or more additional daughter species. The
daughter nuclei may then in turn decay. For example, in a three-species chain, the decay
may be represented as
N1
-+ N2 -+
N3
The result is a set ofcoupled differential equations ofthe form
:-rrNr, ry
:
ry
dt
dt
* :xrr,
-LzNz*r1N1, dt
which we can more easily solve by converting them to a set of coupled algebraic equations
through the use of the Laplace transform:
sNz(s) sNr(s)
:
Nz(O) :
Nr(0)
-i.rNr(s)
-),zNzG)
sN:(s)-Nl(0):)'zNzG)
?See,
for e*a-ple, Lea and Burke, pp. 1 161-1 165.
* lrNr(s)
278
cHAPTERsLAPLAcETRANSFoRMS
Part of the natural radioactive sequence for the decay of 237Np involves the cv decay of2r7 At into 213Bi, which then B decays into 213Po98Eo of the time
209p6 by either an
and o decays into 209T1 2Vo of thetime. Both species then decay 1o
20eBi,
which is the stable endpoint
a decay or a p decay. The lead then B decays to
of the chain. We may write a set of equations for this reaction chain as follows.
At decays at rate ),1:
Example
5.11.
dNx :
-f
-ltN'q't
213Bi, produced by the decay of At, decays in two different ways:
dNsi
:.1"r Nr,
dt_/!IrtAt
-
o.oD,2Nsi- o.9g),sNsi
where ,tr2 is the decay rate of 213Bi into Tl and 13 is the decay rate o1 2133i into Po.
The other relations are obtained similarly:
:o'98lrNsi -
+
l+NPo
dN.t
-i:o'o2i"zNsi -lsNrt
)
dN"+.
-i
:
dNpl
_f
:
trsNr
*
)"+NPn
-'leNPr
1
I
LeN*,
(where B2 refers to 209Bi;. These relations transform to the much nicer looking set
of equations for the transforms:
sNe.t
sNsi
-
Net(0)
:
NR,
-i.tNe,t
- -0.0}'zNsi -
0.98),sNsi
t
-
0.98)":Nei
- l+Npo +
rNpo
-
NBi:
llNag
NPo:
Net(0)
sfl.r
ItNet
s*0.021.2*0.98r"3
0.981.:Nsi
s*1.+
0.02),zNet
- lsNr + Nn: s*l.s
lsNn * l+Npo
sNpr - lsNrr * aNp6 - i,6Np5 + Npt s*lo
sNn
-
0.02)'zNH
,tr
and, finally,
.rNs2
- l6tvr, +
Ns2
:
loNPu
.f
PROBLEMS 279
The transforms may be inverted by several of the methods we have discussed. For
example, if we want to find the abundance of polonium, we first solve for its transform:
0.98).gNsi 0.98).g
* l+
0.98).3
(s * r+) G *
(s
s
*
lt
*
r.+) (s
1.1
0.0D"2
+
N,q.t
0.0D"2
*
0.98)'3)
Net(0)
0.98).3) (s
*.r'r)
We may use the Mellin inversion integral to invert the transform. The integrand has
three simple poles, each giving an exponential term. The result is
e-Lqt
(0.0D,2 + 0.98).3
Npo
(r)
:
0.98,].:).r Net(0)
-
)'+) ()"r
-
I+)
,-(0'OD'z-lo'98\)t
+
Q"q
-
0.98).3) ()q
-
O.jD"z
e-Lt't
(0.0D,2 * 0.98i.3
-
).1)
0.0D"2
+ (l+
- lr)
-
-
0.98,r'g)
PROBLEMS
1. Show that
the following functions are of exponential order, and find their Laplace trans-
forms.
(t):
@ f ftl :
(a) f
sinha/
tanhat (Hint: Change variables
1o
,-Zat
: u and expand the integrand in
a series.)
r
(c) f(t): sinJat
(d) ,f (r) : s-dt2
(e) f(t): te{
(f) f (t):
sin(art
*
do)
(g) The ramp function f (t)
: {f, ifo<r<te
.Iro ift > 16
2. Using the shifting prop( rty or otherwise, find the Laplace transform of the function
\
/)
S
fo
f(r):1'
''"'-
itt <2
ltt -2)3s-at ift > 2
ni"a the Laplace transform of the function / cosh
4. Find the Laplace transform of the functio n
cvt.
f 1t 1 : + "t
280
cHAPTER5LAPLAoETRANSFoRMS
5. Find the Laplace transform of
(a) the triangle wave function with period T:
(aQ-nT)
"f(t):{
(b)
-
[a[(n + l)T
-tl
ifnT<t<nT+T/2
if nT *T12 <t < (n1-t)T
the sawtooth function:
if nT
f(t):a(t-nT)
<t <(nal)T
6. Use the Mellin inversion integral to invert the following transforms:
s
(a) F(s) : -=-----:sz
*2s t3
1
(b) F(s) :';--;---1----7'
(s2
(c) F(s)
*
at)z
: e-.6
s
- ln(l + s) (Express the answer in terms of the exponential integral
,
Elrt'): Ei(x):
fr
I
J-a
eu
dw
w
7. Invert the transform F(s)
,wherex<0.)
: +
{s'-a'
(a) by expanding in a series
(b) by using the Mellin inversion integral with a branch cut running from -a to a along
the real axis (A change of variable to z, where
'
: Z('* i)
may prove useful.)
8. Use the convolution theorem (equation 5. 17) to evaluate the inverse transform of
F(s)
9.
:
s4''t- to4'
Use the integration rule (equation 5. 14) and Table
5
.
I to derive
the result of Example
5
.9
for the transform of I /,r,/t.
EO.l fne diagram shows a simplified version of an automobile spark coil circuit. The spark
plug itself acts like an open circuit until the potential across it reaches the breakdown
voltage for air. Thus, you may ignore that branch of the circuit until the end of the
problem (part e).The battery voltage V : I2Y, C :0.1pF, R : l0 g, and L :
l0 mH. Assume that the switch (points) have been closed for a long time prior to t : 0.
(a) How long a "long time" is necessary? Write down expressions for the charge on the
capacitor and the current through the coil att :0.
,t
PROBLEMS
281
spark plug
PROBLEM
10
t : 0, the points open. Qualitatively discuss the circuit behavior. What is the
expected long-time solution for the charge and current?
(b) At
(c)
Use a Laplace transform method to solve for the potential difference across the spark
plug as a function of time.
(d) Plot your solution. What is the maximum potential difference achieved?
(e) If the breakdown voltage of air is 3 MV/m, what spark plug gap would be required
with this circuit? Remember that you would like the engine to start even if the battery
is a bit low!
11. A beam is supported at one end, as shown in the diagram. A block of mass M and length
/ is placed on the beam, as shown. Write down the known conditions at x :0. Use the
Laplace transform to solve for the beam displacement. Plot your results for.rs : Q.Sl
and,l : O.2L.
PROBLEM
11
12. Technetium is used in medical procedures as a diagnostic tool. The technetium is obtained
as the decay product of 99Mo, which decays 1o99m1" with a half-life of 66.02 h. The
technetium in turn decays with a half-life of 6.02 h. A medical radiology department
receives a source containing 100 mCi of 99Mo at 9:00 a.m. on Monday morning. Find
the amount of technetium present in the sample as a function of time after 9:00 a.m.
When is the amount of 99-Tc a maximum?
282
CHAPTER
5 LAPLACETRANSFORMS
13. An overdamped harmonic oscillator satisfies the equation
*dtz*zo!dt +of,x: f (t)
where cu2 > a2o andthedriving force is a square wave of period f . Find the displacement
x(t) if the initial conditions are x(0) : dxldtlt=o: 0. Plot the result for a :2ao,
q,T : l, and0 < t < 3T12.(Hint:Use theresultof Example5.6andexpandthe
transform in a series.)
14.
A harmonic oscillator with resonant frequency as and no damping is driven by a
sinusoidal force F(t) - Fs sin a,lf . If the initial conditions are x(0) : O, dx /dt : O
at t : O, use the Laplace transform to find x(t). What happens 1f at : a>o?
(b) Solve the same problem with the initial conditions x(0) : 0, dx I dt : a at t : O.
@
15. The two circuits in diagrams (a) and (b) show how we might use a capacitor to prevent
sparks across a switch when the switch is opened. Assume that the switch has been
closed for a long time and is opened at t : 0. For each circuit, use Kirchhoff's rules to
solve for the current through the inductor and the charge on the capacitor as a function
of time after the switch is opened. Discuss the merits of each of the circuit designs.
PROBLEM
15a
PROBLEM
15b
PROBLEMS
283
switch has been in position A in the circuit shown in the diagram for a long time.
What are the charge on the capacitor and the current I through the inductor? At time
r = 0, the switch is moved to position B. What are the charge on the capacitor and the
current a long time later? Find the charge on the capacitor as a function of time for t > 0.
Give your answer in terms of a.b, where a;B : l/LC,a : RlL, ar.rd B :l/RC. You
You may assume
may also find it useful to define y : (a + p)/2and O : Jr"pthat I is real.
6. The
I
i
I
i
17. The switch in the circuit shown in the diagram has been closed for a long time, and
a constant current flows. What is the charge on the capacitor? At time t : 0, the switch
is opened. What are the charge on the capacitor and the current through the inductor
a long time later? Find the cunent through the inductor as a function of time for t > 0.
Give your answer in terms of a4 and o, where eE : I / LC and cv : R/2L.
PROBLEM
[&'l
17
The switch in the circuit shown in the diagram has been open for a long time. At t : 0, the
switch is closed. Find the current through the inductor and the charge on the capacitor
284
CHAPTER
5 ilAPLACE TRANSFORMS
as functions of time for t > 0. Give your answer in terms of ag,u, and p, where
a;B : l/LC,a : R/2L, and P : oft1+" : ll2RC.You may also find it useful to
defineo:JZ;A-a2-fl2.
PROBLEM
18
L9. In the figure shown, capacitor Cr has charge Q and capacitor C2 is uncharged. At
t : 0, the switch is closed. The two capacitances are equal. Find the voltage across each
capacitor as a function of time for t > 0.
PROBLEM
19
20. The switch in the circuit shown in the diagram has been closed for a long time and is
opened at
t:
0.
What are the currents in the circuit for
t < 0? Use the Laplace transform
C
fa
fD
t2l
PROBLEM
20
PROBLEMS
to find the currents in the circuit as a function of time for
the initial conditions?
285
t > 0. Does your answer satisfy
Rework the problem, leaving the initial value of the current iz in arm AB as an
unknown to be found. Find the solution for the current i1 through R1 and require that
it satisfy ir(0) : E/(h + Rz/2). What value of iz(0) is required? Give a physical
explanation of this result. If R1 : Rz : R and L1 : L2 :.L, plot both solutions. Plot
current in units of E/R versus time in units of L / R. How long is it before both solutions
give the same result to within 17o?
21. The radioactive series that begins with neptunium 93 contains the following decays:
Decay
237ryn
233pu
233pu
2339 -2339
--r 22976
2297A _- 22spu
22s16
*
22s4"
225;" --r 22tp,
22tp, --r 217 61
217
S1 _- 2t3gi
2t3gi --r 2r3Po 198vo1
2r3gi -* 20eTl (ZEo)
i
I
213po
20911
209p6
Half-life
Type
_-
2.14
p
27.0d
x
1.6
d
7340y
p
14.8 d
d,
10.0d
q,
4.8 min
a
0.032 s
47 min
p
4.2
20ep6
20e91
p
3.3 h
209p6
_*
_*
706
y
105y
d,
a
a
p
--
x
a
trt"s
2.2min
If
we regard any decay that takes less than one year to be essentially instantaneous,
then the chain simplifies to
trtNp _- 233g
i
__s 22eTh
_> 2oeBi
Write a series of differential equations that describes this decay chain. Apply the Laplace
transformto findthe fraction of the original 237Np thatis in the form of uranium, thorium,
and bismuth after 10s and 106 years. Make a plot showing the amounts of each element
as a function of time.
22. Find the Laplace transform of the function g(r) : t df ldt. Express the result in terms of
the transform F(s) of the function /(l). Use the result to solve the differential equation
tY'1-Y:s-t
23. Apply the Laplace transform to the differential equation
y"-t2y:t2
Does the Laplace transform offer any advantages in solving this equation? Using any
method of your choice, solve the original equation or the transformed equation subject
286
CHAPTER
5 LAPLACETRANSFORMS
to the initial conditions y(0) : 1 and y/(0)
may be expressed as a power series in 1/s.)
:
0, and comment. (Hint: The transform
@f,fak.e the Laplace transform of the Bessel equation of order zero,
..
y"+-y'*):0
1
and show that
(s2+t)r'(s)*sY(s):Q
Solve for Y(s) and hence find an integral expression for
for more information on Jo(x).]
y(.r)
:
.Io(x). [See Chapter
8
CHAPTER 6
Generahzed Functions in Physics
6.1. THE DELTA FUNCTION
The parlicle is an often used and very valuable physical model. We can describe an electron
as a particle because its size, as far as we know, is smaller than anything else we might care
about. Sometimes we describe something as large as an automobile as a particle because
its size and shape are not important to the question we are interested in. When something is
modeled as a particle, all ofits physical properties, such as mass and charge, are concentrated
at a single point. In a similar fashion, an impulsive force is modeled as a force that is
concentrated at a point in time. To describe the density of the particle or the force
a function of time, we need a function that has the following properties:
.
.
.
as
Its value is zero everywhere except where its argument is zero.
Its value is infinite where the argument is zero.
The integral ofthe function is 1.
This mathematical object was first used in physics by P. A. M. Dirac and is called the
delta function 6(x). Using this function, we write an impulsive force as
F(t) :1511;
The impulse delivered is
r*oo
r+oo
7: J-I ielat:ilJ-*
3(t) dt
: I
To write the density of a particle at the origin, we need a three-dimensional delta function:
p(i)
:
M
3(i)
:
M
3(x)6(y)3(z)
(6.1)
The particle's mass is
M
: JallI
p1)dv
space
r*oo /+oo /+oo
: M
I I
"^ I
J_* J_* J_*
6(x)d(y)d(z)
d.x d.y
dz:
M
287
288
CHAPTER
6 GENERALIZED FUNCTIONS
IN PHYSICS
In both these examples, we can see that the physical dimensionality of the delta function
is one over the dimension of its argument:
I**
uurdt
: t+ t6(r)r : # : TI
and, similarly,
1 1 1:
td(i)l:td(x)6(Y)6(z)l:
- Ixl lyl lzl -
1=
L3
The properties we have listed for the delta function do not describe any proper mathematical function. Thus, we must extend our ideas about functions to include these objects, which
are known as distributions or generalizedfunctions. To begin our study of distributions, we
shall study the delta function in one dimension in more detail.
5.1.1. Delta Sequences
The delta function is defined as a mathematical object that possesses a property known as
the sifiing property:
ll-rr.rur,)dx:
(6.2)
r(o)
That is, if we multiply a function f (x) by the delta function and integrate over the whole
real line, we sift out the value of the function at the origin. Let's see how the sifting property
follows from the properties listed above:
l*
rortr*)
d.x
:
f' ,f{*)o{*) o* because d(x) is zero except at x
:
f (xo)
l" ,u{r)d",
: f (xd [*
J-a
uG)
o*
where
-
e < xo <
:0
e
because 6(.r) is zero except at
= f (xd because the integral of the delta function
+"f(0) ase-+0
is
x
:
0
1
In the second step, we used the mean value theoreml for integrals.
Delta functions arise from modeling a physical quantity that is distributed over a small but
finite region as being concentrated at a point. Thus, we should expect that the mathematical
I
See Appendix
IV for
a
proof of this result. The application here can be made rigorous by prop€r use of limiting
procedures, as we show below.
6.1 THE DELTA FUNCTION
289
quantity d (x) would arise as the limit of a set of functions that become increasingly concenffated at a single point. We can construct a sequence of proper functions d, (-r) labeled by
apara;rrrctetn suchthatasr, -+ oothesequenceQ"@) approachesthedeltafunction.What
we mean by "Qn approaches the delta function" is that as n --> @ we obtain the sifting
property; that is,
lim [** Q*@)f (x)dx
n-co J_e
: f (o)
/(x). Such a sequence of functions is called a delta sequerce.
The simplest delta sequence is a sequence ofrectangular blocks with unit area:
for any continuous function
o'(x) :
("/2 it-l/n <x <lf
otherwise
[o
n
(6.3)
As n gets larger, the blocks get taller and narrower (Figure 6.1), and so @, "looks like" the
delta function as n -+ oo. Next we must test for the sifting property. We multiply QrQ) by
a function /(x) that is continuous at the origin and integrate:
f_l o.av@)dx:;
I:,,:,' r(x)dx
n2
::'r?lrol
1
,,r'
",,
::'
*t
I
=
6"(x)
-l
FIGURE 6.1. The "block" delta sequence
shown here.
0
-0.5
@"
0.5
I
(-r) given by equation (6.3). The first five firnctions are
290
cHAprER 6 GENERALTzED FUNoIoNS
rN
pHystcs
(We used the mean value theorem to evaluate the integral.) Since we obtain the sifting
property, S"@) is a delta sequence.
We can use this sequence to determine the sifting property of 6(x - a). We look at the
integral:
+oo
I O,U
f
J-*
*oo
: If
J-a
- a)f(x)dx
O,t"l;fu *a)du
1
I
<
: n2
where-;-.f(uola),
zn
n =uo +-n
--> f (a) asr? -+ oo
3(x - a) sifts out the value of the function at x : a.
Similarly, we can determine how to express 6(ax). First let a > 0. Then
So we conclude that
l+@
I
J-*
Qn@x).f
f+oo
rut, du
(I
4,tr)f
'"
\a/)-a
J-a
(x)dx :
I
-f (0)
-->
where we used the change of variables u
gives
l-l
o,r-rrrx)dx
:
ax . But
ast4
-+
oo
if a < 0, the
same change of variables
: l** o,rt
r*oo
: - J_*
0,,
>
1
--"f(0)
a
We can express both results as
8(ax):
I
(6.4)
lo16(*)
Sometimes we need a sequence of functions that are better behaved than the block functions @n (x). For example, to investigate the derivative of the delta function, we need a set
of functions that are each differentiable. The functions
Qn$):
I
n.T
sin2
nx
.
(6.5)
6.1 THE DELTA FUNCTION
291
infinitely differentiable and also get peakier around the origin as n increases (Figure 6.2).
(The first zero of this function occrus zt xg - tr f n, and .16 approaches zero as n --+ @.
The value of the function at the origin is nf n and increases linearly with n.) Again these
functions "look like" the delta function as n -+ oo, but we'll need to test for the sifting
property.
are
Q"@)
,
,
\,
n:3
t
I
l
I
t.
t.'
t..
t
I
t
t
.'l
.'l
:l
I
i
i^
Y.
l'.
l' .-
.//
/n:
Z
-5-4-3-2-1012345
FIGURE 6.2. The first three functions in the delta sequence @4(x) given by equation (6.5).
The derivative is
I ( 2nsin nx cos nx
dQ,
E:
n'T
:
This clearly exists for aIl x
sine and cosine for small x:
2
\-----7-
"inn: @x cosnx-
nr xJ
.
\
-'--#
)
sin2 nx
sin n.r)
10. It also exists for.r : 0, as we can see by expanding
/
'2'2\
il,:o:lTo,o,'1"*(t- ,
2nx
4!tl
I /
/-('"-
the
rz3x3\l
o1
:ri^(-?r"\:o
;+0\ 3 o/
This result (dQ, ldx
slope zero at x :0,
:
0 at
x
:
0) agrees with the graphs shown in Figure 6.2, which have
292
CHAPTER
6 GENERALIZED FUNCTIONS
IN PHYSICS
Now let's check for the sifting property. We write the sine in terms of complex exponentials:
:l::#(
"2inx
_2a"
-4
-2inx
),o,
o.
- ('^' ) /t', r,
: 4nn
.L J-*
[** ('
\ r' /"'
I /.+oo 11-"-2inx1
(x)dx
* 0,,
J_* (---;r-/ f
Now we evaluate each integral along the real axis in the complex plane and use the residue
theorem. Suppose the function (z) is analytic except for a set of isolated simple poles and
"f
approaches zero uniformly at infinity. Then the poles of the integrands are the poles of /(z)
and
of
9n,+(z):
There is
1
_
"L2inz
z2
a pole of gn,1(z) at the origin. Let's look at the Laurent series:
1
_
"I2inz
)
Bn.+k)
: +2i! +2n2 + "'
z
This pole is simple, and the residue of g",+(z) there is a2in (Chapter 2, Section 2.6.3,
method 1). We evaluate the first integral by closing the contour with a big semicircle in
the upper half-plane. The pole is on the real axis, so we shall evaluate the integral by moving the contour slightly downward so that the path of integration lies below the real axis2
(Figure 6.3). Then, if the poles of are simple, the integral3 is
/
2ni I
Ir: 4nn
l{-zin)t'lo)+
:
,f (0)
!Res/1 ,ols,,*frr>f
t _
1_^2inz.^
+ .f(0) as n -+ oo
. +)rtesf (z)::;:
p
2The result for the complete integral is independent ofthe method we choose to avoid the pole, provided that
/(z)
has no poles on the real axis, because the singularity of (sin2 nx) l xz is removable.
3There may not be a pole of the integrand at the origin if
/(3) is zero at the origin. But then the integral is zero in
limit and so still equals /(0). If the poles of / are not simple, the expression for the residue of the integrand
zp is more complicated, but the result 11 -+ /(0) as n --+ oo still holds. Note that zp has a positive imaginary
part, since the relevant poles are in the upper half-plane, and so the exponential approaches 0 as n --+ oo faster
tban any power of n. See Problem 30.
the
at
6.1 THE DELTA FUNCTION
293
Im (z)
Re (z)
FIGURE 6.3. To demonstrate the sifting property of sequence (6.5), we evaluate the integral using
a path that passes
below the real axis, under the pole at z
:
For the second integral, we have to close downward. The pole at z
this contour, and so the integral is zero in the limit. Thus,
/+oo I
sin2
0.
:
O
is excluded from
nx
,qL/-- ,"--;z-f Q)dx: f (o)
The sequence
/,
possesses the sifting property as n
-+
oo.
6.1.2. The Derivative of the Delta Function
Now we are ready to determine the sifting property of the derivative of the delta function.
Let's use the delta sequence4 (6.5) to evaluate the integral:
lll r;o,r,.,0.
We can do the integral by parts:
f_l
o"ort(x)dx:
Q,@)r(x)
_: -
/_
Qn@)rt(x)dx
4since the explicit form of
Qn@) is not needed, any differentiable delta sequence
will suffice.
294
cHAprER 6 GENERALTZED FUNCIoNS tN pHystcs
The integrated term is zero, provided that
take the limit as n --> @, we find
f (x) remains
bounded as .r
-+ *oo. Now if we
f+@
/+oo
f +oo
lim , q"G)f'7x7dx
t'tx)f tx)dx: n-6
lim I O"@)f- (x)dx: - n-@
I
J_oo
J _a
J_Then, using the sifting property of @n, we obtain
ll*
a'a>r@)dx: -f'(o)
(6.6)
which is the sifting property for the derivative of the delta function.
We can repeat the steps above to evaluate the sifting property for the nth derivative of the
delta function. The result is
.
I
ll*
l
u,' t", f (x) dx: 1- t;n Y(') 1s;
(6.7)
l
l
I
l
I
moment p : p*. Express the charge
density in terms of delta functions. Given that the electric potential is
Example
6.1. An ideal dipole has dipole
I t p(7') .a
o(x):4"^l
l*.-*tra"''
t
find the electric potential due to a dipole placed at the origin.
We begin by modeling the dipole as two point charges q ffid -q, separated by
a distance .( along the x-axis. Each point charge has a charge density represented by
a delta
function (see equation 6.1):
p(i) : -qd(x)s(y)s
:
Now we let
[
p(i)
-->
:
0
and
(z)
|
-q6(y)6(z)
*
q6(x
6(r)_
lvtl
-
q -+ oo such that the product
't-oLf
-ps(y)s(z)lim
6(x)
t)6(y)3(z)
j(r_
- {(r - l)l
(
)
21
ql :
:
p remains finite.s Then
-oa1r;a k)6'(x)
5Since the delta function is not a proper function, we should demonstrate that the
derivative. The result rnay be proved using a delta sequence; see Problem 23.
limit below behaves as the
I
I
6.1 THE DELTA FUNCTION
295
We evaluate the potential using the sifting property of the delta functions:
o(i):#',lffio,*'
p
/+oo /+oo /+oo:
4nes J -oa J -* J -* Je
reoJ--J--J--rM +oo
+oo
I-* I-*
4neo
J
J
6t(xt)3(yt)3(zt)
x')2 + (y - y')2 -l
8,(xt)a(y,)
l-G- lY + () - y,)2 + z2
P [**-Lo*'
-- 4reo
-a 1/1x - x')2 * y2 +
J
k-
4xt dy' dz'
z.'12
-
dx'dy'
z2
This integral is evaluated using the siftUg_ptgPg4y_glhe derivative of the delta
function (result 6.6) with /(x') : t11/1*- ,F+ y2T7t
.f)
o(i)
f' {x').,_01 : 4neo
l1a
: -f-F
+T
eO
px
4t
where 12
:
x2
*
r3
e11
y2
+
22.
6.1.3. The Delta Function of a Function
Sometimes the argument of the delta function is itself a function. We can use any of our
delta sequences to determine the sifting property of 6[S(x)]:
r*oo
akt")lf(x)dx
/__
lim [** o,rr@)lf @)dx
==nrooJ_oo
(6.8)
We make a change of variable to u : g(x). Then du : g'(x) dx. Next we divide the
range of integration into N pieces, where g/(x) I 0 within each piece (Figure 6.4). Thus,
g/(x) is either positive or negative within each piece and is zero only on the boundaries
at x : x;. Notice also that there is at most one zero of g(x) in each piece. We label the
values of r where g(x) : 0 as xs;, i : I to N' Then the integral on the right-hand side of
equation (6.8) is
N
I
fxi+1
D
Ixi
i:l
J
Q,ts@)tf(x)dx =
,
du
)- J8$,)
/Tskiattentu\f1r-trr)lih
-ro
Let's look at one of the integrals in the sum. If gt(x) is positive throughout the ranEe xi to
.r;11 then g(xi) is less than g@i+). As we take the limit n --+ @, Qn approaches the delta
296
CHAPTER
6 GENERALIZED FUNCTIONS
IN PHYSICS
g(x), g'(x)
FIGURE 6.4. A function g(x) and its derivative. The function is zero at the points x6;; the derivative
is zero at x2 and x3. Thus, g/(.r) is positive between -a and x2 and between x3 and
oo, and is negative between x2 and x3.
function and thus goes to zero outside any finite range containing u :0. We may expand
the range of integration for r.r without changing the value of the integral6:
p|Gi+r)
,J'!L Jr",,,,
Q,@)
r+oo
,!L J_* Q,{"t f 1g-t (u\ffi\l
f (xoi)
: ,rts-l(o)l g\*r)
,
du
f [e-' tuilV# tun:
du
t1t--:
where the particular zero xoi of g lies between x; and x;a1. If there is no zero of g in this
range, then the integral is zero.
Now if g/ (x) is negative throughout the range xi to x;a1, then g (xi ) is greater than g (x;+r )
and we get
^\
t*Gi+r)
du
-im [-*
'| - du
al7#@i
,,,
Qn@)f ls-t(D1{#@j:,1;i;
J**
Jr,,,,
^-rurfrp-t(r
,f
ts-l(o)l
:
8'rst{dr
6St i"tly, we must also redefine the function
to u
: g(xi+t).
fl7-l
f (xoi)
k(rCI)
@)l to be zero (or any constant) outside the
ran ge
u
:
g(x;)
297
6.1 THE DELTA FUNCTIOI.I
Adding the results for each of the pieces gives
ffurrr.' r@)dx:I#ffi
where the sum is over all the zeros of the function
also write the result in the form
(6.e)
:
g(x): 8(xo;)
0,
i-
6rg(x)l:If#
1
to N. We may
(6.10)
Equation (6.9) (or 6.10) has numerous important applications in physics. It is valid only
if the sum is over a finite number N of zeros and there are no repeated roots. [If there is
a repeated root, then g'@oi) : 0 for those roots, and our demonstration fails.]
As a specific example, consider the function g(x) : xz - 4. There are two roots at
x : !2, and g/(x) : 2x is zero at x : 0. Thus, we divide the range of integration as
follows:
f0
/+oo .
- .
te' -Dfe)dx:
IJ-a
IJ-a 6G2-Df@)dx+ JoIl+* 3@2-4)f(x)dx
Now we change variables to u
and so
x
:
x2
- -4Q 14.f!rus,
-
ll* ur., -gr(x)dx
:
I.:s@)
rGJu
+
4, du
:
2x dx .
o+=
In the first integral, x is negative,
a)#
*
6(D r du +
l*o*
by defining f eJn +4 : 0 for < -4.
Next we extend the range of integration
precise value chosen is unimport nt, since 6(u) is zero for u
I
0. Thus,
r+oo
I
J-*
6(x'
- 4) f (x) dx
f+oo
- t stu)f GJu + +)
-l_*o\w)J\-\4I1t
Then, using the sifting
property,
f
+I
+@
J_o
du
q\---:
t<utf <J,' +t-trlQtr4
e have
[**ou'-ofu)a*:ff+
which is in agreement with the g neral result (6.9).
'
The
298
CHAPTER
5 GENERALIZED FUNCTIONS
IN PHYSICS
6.2. A sheet of charge lies in the z : 0 plane. The surface charge density
is os. Express the volume charge density in spherical coordinates.
The charge is localized to one value of zi Z :0. Thus, we may express p with
a delta function in z:
Example
p(i) :
k6(z)
To determine k, we integrate over a cylindrical volume of cross-sectional area dA
extending from z : -oo to z : *oo (Figure 6.5). The cylinder acts like a cookie
cutter, cutting out an amount of charge og dA from the sheet. The amount of charge
inside the cylinder may also be computed from the charge density:
r+@
pi)dv: I
dq: If
Jcylinder
J
k6Q)dzdA:kdA
-a
andthusk:oo:
p(*)
:
oo6(z)
Now we change to spherical coordinates: z
p(i) :
:
r cos 0, and
so
o66(r cos0)
FIGURE 6.5. Cutting out an area dA from the charged sheet (Example 6.2).The charge inside the
cylinder is o6 dA.
To interpret the delta function, we first note that charge exists at all values of r,
0 < r < oo, so we cannot have a delta function in r. But charge exists at only one
value of 0, 0 : r /2. Thus, we must have a delta function in 0. Then we apply result
(6.4) with e: rl
p(x)
:
6
(cos d)
r
6.1 THE DELTA FUNCTION
299
Now we may use result (6.10) to evaluate 6(cos 0). There is only one zero of cos 0 in
the physical range of interest, 0 < 0 < n,and that is at 0 : n f2.Thus,
:t 12) :
: oor ,5(o
l- srnub:r/2
p(7)
!9xe - r /2)
r
We can check the result by calculating the amount of charge inside a spherical shell
of thickness dr (Figure 6.6). This shell intersects the sheet in a circle of thickness dr,
and so the charge inside is dq : oo dA - os2nr dr.lntegrating the volume density
and using the sifting property to evaluate the integral over the delta function, we get
on
:
I,n"upg)dv
:
Io'"
fi
:2roor sin;dr
z
tr
l:
"|urt - r/2)r2
sinl d0 dr
og2nr dr
as required.
FIGURE 6.6. A thin spherical shell ofradius
of inner radius
r
and thickness
r and thickness dr intersects
the plane in an annulus
dr.
6.1.4. The Integral of the Delta Function
of a delta function, let's
x < -l I n. Then
To interpret the integral
(equation 6.3). First let
[.
J-a
o,@) du
integrate our block delta sequence
:o
since @, (a) is zero throughout the range of integration. Now let
x>
nltl" :
fx
ftlnn
Q"@)du: I
-ul
-du
I
2 l;tn
Jt1n2
J-a'"
so
f'
J-*Q^fu)0":
(o ifx<-1/n
\L
if x >
*r/n
I
I
f n. Then
300
cHAprER 6 GENERALTZED FUNcroNs
rN
pHysrcs
As we let n --> @, we have
:
l'*o''
{0,
o'
ifx <0 :
ifr>0
(D(x)
(6.1
l)
where @(-r) is the step distribution.T It has the property that
/*J-a
",r,,r,
x) dx
: [**
Jo
f (r) o,
Since @(x) is the integral of d(.r), we may reasonably conclude that
The delta function is the derivative of the step distribution:
d@
_ :6(x)
(6.12)
dx
We can check this conclusion by determiningthat d@
/dx
possesses the sifting property:
f** 499\
.vf (*)4v: f (x)@t.r)l1S - J-*
/
ax
J-a
'
/^+oo
:,Tlo fG)-o-
f'(x1@@7dx
r*m
J,
f'(x)dx
:,l*f(x)-/(x)13
:,l* f @) -,$"rt')
: (o)
+ /(o)
"f
as required.
6.1.5. The Fourier Series of a Delta Function
We may find the Fourier series of a delta function by finding the Fourier series of a delta
sequence and taking the limit as n --> oo. Since the Fourier series is periodic, we will
actually get a periodic repetition of the delta function. The easiest sequence to handle is
the sequence ofblocks (6.3). The Fourier series ofthis delta sequence Qn@) on the range
-L<x<Lmaybewritten
*oo
Qn@):
^D*r-r'*"''t
TPreviously we used the notation S(-r) for the step function. The notation @(.r) distinguishes the distribution from
the function.
6.1 THE DELTA
FUNCTION 301
where (equation 4.31)
r rL
c^: )= I O,@)r-'mnx/L 4,
2L
J-L
L f""
2L
n
J-ttn
:,
r-imnx/L 4*
(e-imn/nL-,imn/nL\:
4mr \
lnL +
Now as n --> @, mn
first term, we find
2
,in^o
nL
'
2mr
/
-i
0. Expanding the sine in a Taylor series and keeping only the
c*
nrnfi1
- zmn nL: i
Therefore, the Fourier series of the delta function is
. +oo
6(x): a |Lt
2L
(6.13)
"imnxlL
which is a very odd-looking series! Since the coefficients do not decrease as m increases,
the series does not converge in the usual sense (pointwise or in the mean). That should not
surprise us unduly, since 3(x) is not a proper limit. However, we can check for the sifting
property:
+oo t
*oo
r*L ,
I
'
J-, 2L I
,imnx/L y1x1
^L_*
r*L
dx: \- ' I
l-r
^?*2L
a-^
The integral on the right gives the coefficient
"nimnx/L f e)dx
in the Fourier series
*oo
f (x): I
p:-6
for the function
/(x)
[,' +
to
*L.
on the range
-Z
f
f(x)dx
m:-@
",-o","
orrtoo'/t
Then
:f
m:-@
o-^
:f
m:-@
"*:,f(o)
and so the series (6.13) possesses the sifting property. Thus, equation (6.13) represents the
delta function on the range (-L, L).
We should perhaps worry about changing the order of the sum and the integral, since the
series is not uniformly convergent. In fact, it does not converge at all! The validity of this
procedure will be discussed in Section 6.3.
Fourier sine and cosine series may be found sirnilarly.
302
CHAPTER
Example
x:
L
6
6.3.
GENEFIALIZED FUNCTIONS IN PHYSICS
An initially stationary string of length L is hit by a hammer blow at
Determine the subsequent motion of the string. (The speed of waves on the
:
1t, where Z is the tension and pc is the mass per unit length. See
^1tlJ
4,
also Chapter
Section 4,4.2.)
We may model the hammer blow as an impulse 10 occurring at t : O and at x :
L/3.The impulse per unit length is
13.
string is y
{4!-ts6(x-Lt3t
dx
Using the impulse-momentum theorem,8 we can determine the initial velocity of the
string at / : 0+. The change in momentum of a string element of length dx and mass
dm
-
ptdx is
0*)
d I,(x) .
Uclx: * clx
*
Oy(x,0*) 16 .;-_:6(x_Ll3)
0y(x,
which, together with y(x, 0) : 0, gives the initial conditions for the string. We expect
a solution of the form (equation 4.29)
v(x. 1)
: .!)
d.
srn
nrx
nrut
frLL
which satisfies the initial condition of zero string displacement at r
Ey
0t
@
/nzx\ nTu
: s-)
L'' d.srnl-l\L/
L
n:l
ovl
nnD
'l
:) .a
orl,:o fr,,L
:
0. Then
nTrut
L
wrx
L
Applying the second initial condition, we have
bu,*
p
- L/3\:?L"'Lo-n" ,inno'
L
Now we find the coefficients in the usual way:
2Ig nr
Io ^.
wrx
nftu 2 fL -:3(x-Ll3)sin
dx: jsrn---a,-.:=
I
.
" L
p
L
L1t 3
LJo
(6.14)
where we used the sifting property. As we expect for a delta function series, the term
on the right-hand side does not decrease with increasing n. (Compare equation 6.14
8see Lea and Burke, p. 200.
303
6.1 THE DELTA FUNCTION
with equation 6.13.) However, the coefficients a, that describe the string displacement
do decrease:
,
a,:
L
2
sin)
nnu Lpt" 3 ?'o
nnu
Io
p,srn)3
and the solution for the string displacement is
t): i'
b rirT
L "inY!
L
3 "in\!
*t':rnnu\
Notice that all the terms with n : 3m and m an integer are zero, since we have
ajm q. sinmr : 0 and so every third harmonic is missing. The first few terms are
v(x,
nut I
y: Jitn / nx
o'\ttn7srn-+tsrn
4zut
2trx 2trut I 4nxsrn-+"')
srn
L
4ttn L
t
\
Figure 6.7 shows the sum of the series for the string displacement, up to the n : lO
term, at times utf L: l18,I14,and 1/2. Notice the peak atx : L13 atthe earliest
time.
apy
111
\
\
\
\
\1
\a
t
t
I
\
x
L
FIGURE 6.7. The string displacement (Example 6.3) at times ut/L : ll8, l/4, and ll2.The
vertical axis shows the dimensionless quantity ulty/10. The dimensionless variable
x / L is plotted on the horizontal axis. The hammer blow at x : L /3 causes the peak
in the displacement at L 13 at the earliest time. The displacement spreads along the
string.
6.1.6. lntegral Representations of the Delta Function
We can
find an integral representation of the delta function by finding its Laplace transform.
for / > 0. We can use a set of blocks
To use this transform, we need a delta sequence defined
304
cHAprER 6 GENERALTzED FUNoIoNS tN pHystcs
defined as follows:
(o if r <o
t0^(t):\n if0<t<lln
(0
(6.1s)
lln
if t >
This sequence offunctions has essentially the same behavior as our previous block sequence
(6.3), and the proof that it is a delta sequence is almost identical. The Laplace transform is
op(s)
:
fo*
o,r-"0, : Iot'^ ne-'tdt
:,
::r,
e-'/n)
=lii"
Now we take the limit as n --> oo:
Ll6(t)l:
-
i {'-
.}(:)'. ]}
l'-;
t-t+...-+1
asn-+oo
Thus, the Laplace transform ofthe delta function is 1.
We could also obtain this result from the relatione 6(/)
Laplace transform ofthe step function is l/s. Then
LI6(t)l:
s.C[@(/)]
-
@(0)
:
:
d@
s/s
: I
ldt
and the fact that the
If we invert the transform using the Mellin inversion integral (equation 5.19), we obtain
I
r+ioo
6(1): ' I
2ri J-i*
ettds
where we may integrate along the imaginary axis since the integrand has no poles.
Now make the change of variable s : iro. We find
r
.+m
I /+oo
,i''
ei''ida: 2"
J-J_-
' I
d(r): 2ni
d,
(6.16)
This expression is extremely important and finds many applications in physics. Notice the
similarity to equation (6.13). The integral is the extension ofthe Fourier series representation
to the case of a continuous, rather than a discrete, frequency spectrum.l0
9This demonstration appears less satisfactory because we have to invoke a value of the step distribution ar the
origin. Using the theory developed in Section 6.2, we can show that the initial condition may be dropped. See
Problem 24.
l0Demonstration ofthe sifting property may be accomplished in several ways. For example, expression (6.16)
may be written as limn;*(1/22) /]S exp (- ,2 14n2 + itot) da;. The integral is easily evaluated (Chapter 7,
Example '/ .2), and the result is the delta sequence (6.18)
305
6.2 DEVELOPING ATHEORY OF DISTRIBUTIONS
6.2. DEVELOPING ATHEORY OF DISTRIBUTIONS
The delta function (together with its derivatives) is a very useful mathematical object that
often simplifies calculations in physical problems. Because it is not a properly defined
function, we used sequences of proper functions-the delta sequences-to understand the
properties of the delta function. The delta function is mathematically defined by the sifting
property (6.2).ln this section, we shall begin to formalize and generalize these ideas.
To develop a more rigorous theory of distributions,l 1 we start with the definitions of two
classes of functions: corefunctions andtestfunctions. The properties of these functions determine the paticular distribution theory. The example below is illustrative of the techniques
but should not be regarded as a description ofthe only class ofdistributions that exists.
Test functions. For our illustrative theory we choose the test functions /(x) to be infinitely differentiable and to go to zero very fast as -{ -+ m. Taking ,f : 0 outside some
finite range will certainly be fast enough. We'll explore this feature further as we develop
the theory.
Core functions. For our theory we choose the core functions g(.r) to be infinitely differentiable.
We'll also need to define
a new
form of convergence, called weak convergence.
Weak convergence. Let g"(x) be a sequence of core functions. Then the sequence is
weakly convergent
if
lim I^+oo sn(x) f (x) dx
n+a J-*
exists for any test function
/(x).
The definition of a distribution as the weak limit of a sequence of core functions now
follows.
.
Definition of a distribution.
(x) is a distribution if there exists a sequence of core functions g, (x) that converges
weakly to @(.r), in which case
@
,,g/:
for any test function
gn@)f(x)r,:
l::
Q@)f
(x)dx
(6.17)
/(x).
llsee Lighthill's book for more on this topic. The test functions here are akin to his "good" functions and the
core functions to his "fairly good" functions.
306
cHAprER 6 GENERALTzED FUNcrloNS lN PHYslcs
Many different sequences of core functions may converge to the same distribution.
The sequence of core functions
gnT):
n
\/ lt
,-n"'
(6.18)
illustrates some of these concepts. The sequence does not converge pointwise, since
gr(x) --> 0 as n -+ oo for x I 0, but gr(x) diverges as t? -+ oo for x : 0. However, the
sequence does converge weakly. To see which distribution
it converges to, we evaluate the
integral (6.17):
lim
le-""y1r1d*
1+*
n*@ J_oo
\/iT
:.uu (l_:" .
l:,':: . l:,;,) ft,-""f
:
-f Iz
nBL Qt
*
{,)o*
Iz)
/(x) is infinitely differentiable and goes to zero as t -+ too.
Let l/(.{)l < Ml for 0 < x < oo. Then
2=r-"
2?r-n"'d*
:
Now the test function
/(x)
is bounded.
,,
where
* f*
a11uQ
= 2
o(.fr)
J
;1
: +. [:
Ji
tt
,t
\/1T
is the error function,l2 which approaches
<D(.r):
I
as
du
- aQ-n)l
*rt
2
n -+
oo.
Thus,
+
fr r-''dt:t-+':!x +"'
,/tr JO
41T
[The asymptotic expression for O(x) may be found in Appendix IX, equation (29).] Thus,
13 -+ 0 as r? -+ oo. Similarly, we can show that 11 -+ 0. However, for 12, we have
rJi
: fc)ii't Jo
e-,'du
by the meanvalue theorem, where
-ll^fr
=
€<
+llJn andu:
nx. Thus'
12: f G)aQfr)
and so
(*)dx -- rim 12: f e): [**
lim [**
+r-,'*'.f
n+@
J
J _*
Jn
n-cr,
l2The error function is also called erf (-r). See Appendix IX.
-*
Q@)f (x)dx
6.3 PROPERTIES OF DISTRIBUTIONS
307
The distribution Q@), defined as the weak limit of sequence (6.18), exhibits the sifting
property, and so it is the delta function. We may write
ft"-n"'
-+ 6('t)
weaklY
(6.1e)
Some sequences of core functions may have pointwise limits that are ordinary functions
and may also have weak limits that are distributions. It is even possible that gr(x) -+ g(x)
pointwise as n -->
q,
gn(x)
l**
--> Q@) weakly, but
oarr@)dx
t ll) rr,rrr,ro.
This kind of bizarre sequence is rarely of any importance in physics.
In physics, pointwise convergence is not always important, since a physical quantity
cannot be measured at a point, but only over some small but finite region. Thus, weak
convergence is often all we need.
There is a close relationship between functions and distributions.
The Smudging Theorem. For every continuous function h(x), we can construct a sequence gn(x) of core functions such that
,$
ls,(")
-
h(x)l <
e
for any e > 0 and for all x in any finite interval.r3
According to this theorem, we can "smooth" the function h(x) over the interval
- lln,x -f l/n) to create an infinitely differentiable g"(x), and the two functions
differ negligibly. Since the sequence of core functions converges to a distribution weakly,
this means we can replace the function with a distribution.
(x
For every continuous function, an equivalent distribution exists.
We cannot make the reverse statement; there is not necessarily a continuous function
corresponding to every distribution. The delta function is an example of a distribution for
which there is no corresponding continuous function.
The class of distributions is an important addition to our mathematical arsenal. In the
next section, we shall learn more about how to use them.
6.3. PROPERTIES OF DISTRIBUTIONS
.
Distributions may be added, subtracted, multiplied by constants, or multiplied by infinitely
differentiable functions. These assertions may be easily proved using the definition of
l3For the proof of tbis theorem, see Butkov, p. 243, or Lighthill, p.
2 1.
308
cHAprEB 6 cENERALIZED FUNcrloNs lN PHYslcs
a distribution as a weak limit of a sequence of core functions. For example, let h(x) be
an infinitely differentiable function, and let d(-r) be the weak limit of the sequence gr.
Then
+@
f+oo
I
J-*
hfOO(x)lf
- @) dx
f
: n+mJ-m
lim I
lt (r)s"@)lf (x) dx
/"+oo
,\%
/-""
s'@)Ih(x)
f (x)) dx
f*m
I
J-a
Q@ltntof @)ldx
h(x)/(r) is a test function if / is. So we can define a new distribution ry'(x)
h(x)Q@). Thus, for example, rd(x) is a distribution. Let's see what it does.
since
lllwt<.ttr',)dx
Thus,
.
.
.
x6(x)
:
:
l-:
d('r)txl(x)l dx
:0x
/(0)
:
:
Q
0, the zero distribution.l4
Distributions may NOT be multiplied or divided by other distributions depending on the
same variable. Forexample,t6(x)12 is meaningless. [However, d(i) : 6(x)d(y)d(z) does
make sense becauSe the arguments x , y , and z of the three delta functions are independent
variables.l
lf Q@) is a distribution, so are Q@ - a) and QGx).
Distributions are infinitely differentiable, because the core functions are. Moreover, if
gn(x) --> d (x) weakly, then gl (x) -> Qt (x):
,$"/J
s'^@)f
(x)d,: n\(tror^r*>t
=
-,[:
g^@)f'(idx)
The integrated term is zero by the properties ofthe test functions, and so
lll
o'rnr(x)dx: -
I:: Q@)rt(x)dx
(6.20)
This is a property we have already established for the delta function, and it serves to
define the derivative
.
QI
(x). Higher-order derivatives may be computed similarly.
Since disffibutions can be differentiated, they can be the solutions of differential equations.
Example
6.4.
Find the solution of the differential equation
d,t
@-D**y:0
14we'll need this result in the next chapter.
6.3 PROPERTIES OF DISTRIBUTIONS 309
We can integrate this equation directly to obtain one solution:
dy
dx
y
x-r
lny--ln(x-l)+C
A
t- *-1
:
This solution is validforx > I orforx < I but not at x
L
Now consider the distribution y
6 (x
1) . If this satisfies the differential equation,
-
:
then we must have
/^+oo
I
J_q
x*
-
l)6/(.r
- 1) + s(x - l)lf @) dx :
o
Let's evaluate the first term using the sifting property ofd/ (equation 6.6):
f+*
I eJ_a
l)6'(x -r)f(x)dx
d
: - If+@a@- tlilfx
- t)f(x)ldx
ctx
J_a
r*oo
: - II
J_m
d(_r
- t)t/(x) * (r - t)ft(x)ldx
: -f(l)
The second term is
/n+oo
/
J-a
a("
-
l),f(x) dx
: f (Il
so the differential equation is satisfied. We must choose this delta function solution
the point x
I is within our domain of interest.
:
if
Note that since the differential equation is first order, we expect only one solution. What
we learn here is that the character of that solution depends on the domain of validity.
Some differential equations contain a distribution. The solutions of such equations are
also distributions.
Example
6.5.
Find the solution of Poisson's equation for the electric field due to
p : oo6(x - a):
a sheet ofcharge with charge density
dEt oo^.
E-j6(x-a)
From equation (6.12), the solution is
Oi
E,:A*j@(x-a)
€0
310
cHAprER 6 GENERALTzED FUNcrtoNs lN PHYSlcs
where @ is the step distribution (see Section 6.1.4). Since the electric field points
directly away from positive charge, we can find the integration constant A by invoking
symmetry about the sheet (Figure 6.8):
E1(x <
a):
>a)
A_
#)
and so
.oo
2eo
Thus,
E*
:
o)
(-; .
otx
-
a))
FIGURE 6.8. Electric field due to a sheet of charge (Example 6.5).
6.4. SEQUENCES AND SERIES
We can extend the concept of weak convergence to sequences and series. For example,
the function e'" is a core function and therefore may also be regarded as a distribution.
This means that any Fourier series may be regarded as a series of distributions. The series
converges weakly if the sequence of partial sums converges weakly. Once we make this
identification, some very useful results follow.
The sequence of functions etn' has no proper limit. But let's see if there's a weak limit:
6.4 SEQUENCES AND
SERIES
31
1
The integrated term is zero, by the properties of the test functions [/(x) is zero outside
some finite range a < x < bf . The absolute value of the remaining term is
l_J
# t'(x) dx : : I"' ,in' y'1*\ d*
=: l,' l,'^' f'utl o,
by the mean value theorem, where
a < f, < b
[a, b]. Thus,
e'"
: Il'.'"u r'c>l@ - a)
and lei"€ 7'(6) |
ir
-->
0
as
n -+
oo
bounded in the interval
--> 0 weakly
If a sequence of distributions Q,(x) converges weakly to @ (x),Q @)
:
lirnn+oo Q"@),
then the sequence of derivatives QL@) converges to @/(x):
Q'@):
6.21)
,\QL{*l
To see why, note that
lim [**
n-@J_a
O',r*,
f (x) dx
"
: - n-@J_m
lim [**
"
0,@)
f
t(x)
dx
(result 6.20)
Q@)f'(x)dx
0'@)
f @) ax
A convergent series of distributions can be differentiated term by term.
To prove this statement, let S^,(x) : If:, Qn@).Because the sum has a finite number
of terms, we may differentiate to obtain Sir(x) : DItQ'"@). Then, taking the limit, we
have S(x) : limN-oo Sr,,r(x), and, by theorem (6.21) above, S'(") : limru-oo Sir(x).
This is a powerful result: Series of distributions possess some of the same properties as
uniformly convergent series.
If we start with a convergent Fourier series S(x), we can differentiate term by term, thus
multiplying each coefficientby in. We can do this m times, so the series with coefficients
(in)* a" also converges to dm S f dxm. We have thus shown that every Fourier series, considered as a series of distributions, converges (weakly), even when the coefficients do not
decrease with increasing n. As a consequence, every Fourier series may be integrated and
differentiated term by term.l5
15We made
use
of this result in Chapter 4.
312
cHAprER 6 GENEBALTZED FUNcrloNS lN PHYSlcs
Example 6.6. Find a Fourier series for a delta function by differentiating the Fourier
series for the triangle function
f(x):{
on the range (0,
L)
(x(2h/L)
ifO<x.L/2
l(L - x)(2hlL) if L/2 < x < L
(see Example 4.4):
f(x):Yn##,,"s+r!
This series is nicely convergent.
(compare with equation 4.28):
f,(*):
If we differentiate once, we
{'!#,,
{:-
get a step distribution
@(x
- L/D}
(2ntI)nx
sh s (-l)',
:i1rz"*tlcos
"lorir.='*!":T
awhich also converges, but more slowly, and
function:
if we differentiate
f"(*): -?uu
- L/2):+?.i,-,,'*'
t' I'-)
L
"t
again, we get a delta
gl:l!
6.22)
Actually it is a negative delta function, since the step function steps down at x :
L 12. Compare this result with equation (6.14). Here we have no even terms, because
sinl(nn/L)(L/2)l : sin(nnl2) is zero whenever n is even. The series (6.22) is
a weakly convergent series of distributions. The sum of the first twenty terms of the
series is plotted in Figure 6.9.
The process of finding the coefficients in a Fourier series of a distribution such as 6(x)
requiresaminormodificationtoourtheory.First,theintegralsthatgivethecoefficientsare
notoverthecorrectrange(-oo,+oo),andsecond,thefunctions
e'n'(orsinnxorcosnx)
are not appropriate test functions as we have defined them. We can solve both problems by
differentiable "tudge function" u(x) such
;;l;;G;il
,ii
+
'ora
that
L: "in'u(*)o@)0,: #, ll"' "'"'rr*ro'
(6'23)
l
I
6,5 DISTRIBUTIONS IN N DIMENSIONS 313
L2f,
8h
x
L
FIGURE 6.9. The first twenty terms in the series (6.22) for the second derivative of the triangle
function. The variables are L2 7" 18h versus
delta function at x
/L :0.5.
x/I.
This series represents a negative
Then etnx u(x) is a test function, and the integrals can be evaluated in the usual way.16 Once
we know that such a procedure exists, we can just use the normal expression for the Fourier
coeffrcients-that is, the integral on the right-hand side of equation (6.23). Of course, the
resulting series gives a periodic repetition of the original distribution.
6.5. DISTRIBUTIONS IN N DIMENSIONS
The theory of distributions can be extended to N dimensions. In physics, we are frequently
concerned with problems in three space dimensions and one time dimension. Sometimes we
work in a six-dimensional phase space with three position coordinates and three components
of momentum. The modifications are straightforward.
1. The test functions should possess partial derivatives of all orders and go to zero sufficiently fast outside a finite volume of the space.
2. T\e core functions should
possess partial derivatives of all orders.
3. The integrals that define the distributions become volume integrals over the Ndimensional space.
l6Examples of such functions a can be found in Butkou p. 255, and Lighthill, Section 5.2.
314
cHAprER 6 GENERALTZED FUNcrtoNs lN PHYSlcs
Then all the properties of distributions extend as expected' For example,
:
L#,,o0, l(orrr= - I:: o{a.)a,a,
:- Jv[ oTav
dx
The delta function in three dimensions is readily expressed in Cartesian coordinates.
is nonzero only at the origin, so
6(i) :6(x)6(y)6(z)
We may write this as a
It
(6.24)
limit of the core functions (6.18):
:,\%
fim !--e-n"'4"-n'Y' '
6(i): n+a
t/It
\/iT "-""
JiT
(e)' '*r
(6'2s)
A particularly useful relation in three dimensions is
:
"'(i)
-4zd(i)
(6.26)
First we note that the left-hand side of equation (6.26) has the sort of behavior we expect
for a delta function. If r I 0, then
/a2 * a2 a2\
(P i7*
a?
)ffi:
a /-x\ * a /-y\. *
a" \;t/ r, \;t/
/ 3y
-3 / 3x\ - r\-F
'\-Fl
I
)
-3 r2
:7+3-:o
but as
r
--> O, I /
r -+ oo and its derivatives
cannot be computed; the left-hand side of (6.26)
is undefined.
Next we must test for the sifting property. Since we have already shown that V21t 1 r1
for r 10, the volume integral
./,,,n*.
o'
"'(:) f G)dv: ,/nn"."or.u*o." (i) "o,o'
:
O
I
6.6 DESCRIBING PHYSICAL QUANTITIES USING DELTA FUNCTIONS
315
reduces to an integral over a small spherical volume surrounding the origin. We write the
integrand in terms of a divergence so that we can use the divergence theorem:
./pr,... or,uaiu.,
t' (i)
f G) dv
:./nn",.o,.u0,",,
vl v rfav
lu (rclvl) -
(rrrrvl) .aoo- [
: Jsurfaceofsphere\
[
r/
(rrn])
:[
# aa + [
' ' t'
\"
/
:I
Yo,on-[
0r
-].iyav
Jsphereofradiusel'o
,./sphereofradius,
./surfaceof sphere
/sphereofradiuse
_f
- t
J solid angle of sphere
:
Jsurfaceofsphere-
0r
a, aa
rl)da
lfli=6-ft)lr:elda
f
-"f(o)
\!,,
rz
J"nno,on: -4trf
(o)
Thus, since we have obtained the sifting property, relation (6.26) is proved.
A similar relation holds in two dimensions:
vr'(h
L):znqil
where Vr2 is the Laplacian operator in two dimensions (Vr2
constant length, and p
(6.27)
:
A2
/Ax2
+
A2
/Ay\,a
is any
- \FT|
6.6. DESCRIBING PHYSICAL QUANTITIES USING
DELTA FUNCTIONS
We often want to use delta functions to describe the mass or charge density of a system that
is confined to an infinitesimally thin sheet or line. Let's investigate how to describe such
physical quantities using delta functions.
Example 6.7. A ring of charge of radius a lies in the x-y plane. It has a variable
line charge density ),e cos {. What is the volume charge density? Use (a) cylindrical
coordinates and (b) spherical coordinates.
316
cHAprER 6 cENERALIZED FUNcrtoNs tN PHYslcs
(a) First we note that the charge exists only
at z
:0
and at
p
:
4, so we must have
two delta functions:
p(i):C6(z)6(p-a)
To find C (which may be a function of the coordinates), we integrate over a wedge of
space with -oo < z < +@,0 < p < oo, and angular extent dQ (see Figure 6.10a)'
(This volume is chosen by using the entire range of each coordinate in which we have
a delta function, but only a differential range in the third coordinate.) The amount
charge d4 in this volume is )'adQ. Thus, using the sifting proPerty, we have
il.scosQ dQ
- dq :
-
and so C
:
lo
cos
f*"o""rurrr6(p
d:
cos
FIGURE 6.10a.
To find an expression for the charge density
in Example 6.7, we integrate over a wedge
of an infinite cylinder with
<p<
a) p dp dz dQ
Cad0
p(*) :.r"e
O
-
of
-oo < z <
oo, and angular extent d@.
@,
@
6(z)6(p
-
a)
FIGURE 6.10b.
To find an expression for the charge density
in spherical coordinates, we integrate over
the wedge of angular width d@ that extends
throughout the full range of the coordinates
0 andr:0 < 0 < z and0 < r < oo.
6.7 THE GREEN'S
(b) In spherical
coordinates, the x-y plane is at 0
p(i)
:
FUNCTION 317
n 12, so
: ca(e -|) x, - "t
Now weintegrateoveran"orange-wedge" shapewith0 < 0 < 7T,0 <
extent dQ in the azimuthal angle (Figure 6.10b):
a),scosQ dQ
:
:
Thus, C
: lo (cosd)/a,
f*.or"ca(e
Ca2^n
sin
r < oo, and
- Z) uU - a)r2 sin0 dr d0 dQ
,aO:
CazdQ
and
p(i):
l,6cos@d
(t -;)t+
Check the dimensions of the result!
This example illustrates a general method for finding densities of linear or planar charge
or mass distributions in terms of delta functions in curvilinear coordinates.
First, determine which coordinates take on only a single value, and find the delta functions
in these coordinates. (In the example above, charge exists only at 0 : r /2 and r : a.)
Second, determine the function that multiplies the delta functions by integrating over
a region of space that extends over the full range of the coordinates in the delta functions,
but over only a differential range in the other coordinate or coordinates. The value ofyour
integral should be equal to the value determined by elementary methods. (In our example, we
integratedoverawedgethatextendsfrom@ toQ
of the ring and thus contains an amount d e
:
+ dQ.Tltis wedgeslicesoffalengthadQ
)ra dQ of charge.) This method guarantees that
you will not lose information about the dependence of the coefficient C on the coordinates.
Integrating over all space does not work because you will lose the detailed information that
you need.
6.7. THE GREEN'S FUNCTION
An important application of distribution theory is the Green's function. The Green's function
is actually a distribution that describes the response of a physical system to a unit delta
function input (a "point" source). We have already seen an example of this in Example 6.3,
where we applied a point impulse to a string. Other examples are the electric field due to
a point charge, a point mass on a beam, and a voltage impulse applied to an electric circuit.
If the physical system is linear (and thus is described by a linear differential equation), the
response of the system to a sum of inputs is just the sum of the responses to the individual
inputs. Then ifwe can model any input as the sum (integral) ofa set ofpoint (delta function)
inputs, we can use the Green's function to compute the response.
318
cHAprER 6 GENERALTzED FUNcrloNS lN PHYSlcs
If the impulse per unit length applied to the string is 1(x), we can write it
7L
I(x): JoI
If
t1x'p1x-x'1dx'
the response of the system to the delta function is
y(x)
:
fo"
as
,
G(x,
{*')o{*,
x'7, the response to 1(.r) is
(6.28)
x'; dx'
The Green's function is described in more detail in Optional Topic C, where we show
how to find an appropriate Green's function for several different systems and in different
geometries.
PROBLEMS
1. Show that the following
liail a. r*t
-
n
sequences offunctions are delta sequences:
e-nlrl
: L ( -+-)
/r \l+n'x'/
1 - cosnx
(c) d,(x) : ---------1
nTr x'
(b) d,(x)
2. Find a Fourier
t*nr, Use contour integration.)
series representation of the delta function
6
(x) in the range
(-
L,
*L)
in
two ways.
(a) Start with the Fourier series for a step function (equation 4.24, for example) and
differentiate.
(b)
Start with the block functions (equation 6.3) and form the Fourier series. Take the
limit
as
n
--> @.
Are the results the same?
If not, why
not? Give a quantitative as well as a qualitative
account of any discrepancy.
I at a distance
end. Find the displacement of the beam
kail if tne beam is supported at the left end, as in Problem 5. 1 1
3. A point load Mg is placed on a beam of length
(b) if
Ll3
from the left-hand
the beam rests on supports at each end, as in Example 5.3
Ignore the weight of the beam itself.
4. A damped harmonic oscillator (compare with equation 3.7 andProblem 4.13) has initial
conditionsx(0) : x6 and dxldtl,:s: u0.Animpulse 1is applied att : to' Findthe
motion of the oscillator for t > 0.
5. Distributions may be multiplied by infinitely differentiable functions. Do you expect the
product
yt\ t
-
6(x
-
a)
PROBLEMS 319
to be a valid distribution? Why or why not? Investigate the properties of this quantity by
evaluating the integral
f@ 6-(x - a\
(x)dx
J-*-;;f
where Q"@) is a delta sequence of your choice and f (x) is
determine the result for functions that have the property
distribution in this case? Can you identify it?
a
test function. In particular,
f (a) :0.
Is ry'(x) a valid
6. Evaluate
@ /a s-txt51*2 -l2x - 3) dx
o) /]S e-" 3(x2 -t x - 6) dx
7. A string of length Z, with tension Z and mass per unit length p,, is hit simultaneously
at t : 0 at the two points x : L I 3 and x : 2L I 3. The impulse delivered at each point
@
is 1. Find the subsequent displacement of the string.
Using a general curvilinear coordinate system (Chapter 1, Section 1.3) with coordinates
u, u, and u.r, find the charge density due to a point charge q placed at the point u : t!0,
1) : uo, tr : tDo. Hint: Start with the delta sequence (6.25) and note that as n -->
m, only a differential line element ds2 is needed in the exponent. Then make use of
equation (1.61).
9.
Show that the sequence of functions
nf*o
fn7): t;G J_, ,*r[-r2(* - *')2]d*'
converges weakly to the distribution that gives the average value of any test function on
the interval -a to a.
10. Show that the sequence of functions
f'(x):
n
2coshz nx
converges weakly to the delta function. Hint: Use a method similar to the one used in
this chapter for sequence (6. I 8).
1L. According to the properties of distributions in Section 6.3,
Which distribution is it?
12. Starting with the integral (6.16), show that
6(x)
E
:
e-'6t(x) is a distribution.
: I,* coskx dk
gV expanding the cosines in exponentials or otherwise, show that, for
positive real numbers,
3(x
-
a)
_2 [*
-rJn
cos
kx
cos ka
dk
x
and a both
320
cHAprER 6 GENERALIzED FUNcrloNS lN PHYslcs
and obtain a similar expression as an integral over sines. Are these results consistent
with Problem 12? Discuss.
14. Find the Laplace transform
of 6(r - a).
Express the inverse as an integral using
equation (5.19) and demonstrate that this integral possesses the sifting property.
15. A uniform disk of radius a and mass M lies in the .r-y plane. Express the density in
terms of delta functions
(a) in rectangular Canesian coordinates
(b) in cylindrical coordinates
(c) in spherical coordinates
16. A uniform rod of length I and mass M lies along the x-axis with one end at the origin.
Express the density in terms of delta functions
(a) in rectangular Cartesian coordinates
(b) in cylindrical coordinates
(c) in spherical coordinates
E7J a. me of charge with uniform line charge density ), lies along the z-axis. Find the volume
charge density
(a) in cylindrical coordinates (b) in spherical coordinates
18. A disk ofcharge with radius a and surface charge density o(r)
-
osrfalies in the r-y
plane with center at the origin. Find the volume charge density
(a) in cylindrical coordinates (b) in spherical coordinates
19. Current 1 flows in a circular loop of radius a lying in the x-y plane with its center
at the
origin. Find an expression for the current density
(a) in cylindrical coordinates (b) in spherical coordinates
20. Prove the relation (equation 6.27)
v2ktg:zrs(i)
a
where p is the radial coordinate in a cylindrical coordinate system, a is a constant length,
and p is the position vector in a plane. Use the result to find the potential due to a line
charge
I
running parallel to the z-axis at x
:
Q,
!:
b.
21. A circuit contains a resistor, a capacitor, and a square wave power supply with period Z.
Use Kirchhoff's loop rule to write an equation for the current in the circuit in terms of
delta functions, and solve it to find the current as a function of time.
Ezl
starting with the result
D.r
O: :-r3
for the electric potential due to a dipole placed at the origin (see Example 6.1), calculate
the electric field everywhere, including ar the origin. llint: Show that the electric field
contains a delta function term. Use a method similar to that used in Section 6.5 to prove
relation (6.26).
PROBLEMS 321
23. Using
a delta sequence
of your choice, show that the limit
..
16(r)
-
6(x
libl z
-
t)1
l
exhibits the sifting property of d/(r).
24. Use the derivative property (6.20) to show that, for distributions, the Laplace transform
of the derivative Qt(x) equals s times the Laplace transform of @. Show that the Laplace
transform of lnr is -(y + lns)/s, where 7 is Euler's constant, - If, r-'lnxdx :
0.5'772. Hence show that the Laplace transform
tion, is
ln s. (See Zemanian, Chapter 8.)
of I / t (t > 0), considered
as a
distribu-
-y -
25. Stating from equation (6.16), show that
sin Rx
*9- "r :6(r)
Confirm your result by demonstrating the sifting property. Use contour integration to do
the integral.
Similarly, show that
if the integral
is taken to be the principal value.
Demonstrate the plausibility of the results by evaluating
2
sin Nxrx
I, :
and h' : [*
/@
n J,
x
J,
numerically for a set ofvalues ofe ( I and
proaches unity and 12 decreases toward zero.
N )) l.
cos
Nx
x
Show that as
o*
N increases,
E6.l Show that
dx (x) :261*r,
sizn
*
where sign
(t) :
l"l
27. Show that
**Urnrrrr:
*v'
IO
If-t;'1,t71n -
Hint: Use proof by induction (Appendix III).
if n < m
m)116("--)1v1 ifn > m
11 ap-
322
cHAprER 6 GENERALTZED FUNcrtoNS tN PHYSlcs
28. The integral If, *" f (r) dx : I:S xd @(x) f (x) dr, where o < 0, may be integrated
if xo is intetpreted as a distribution. First show that
xoo(x) :
where a
t" + rlt"
t --.
+zl'.'
4n_[x"+,@(x)]
(cv
* n > O and n is an integer. Use the result to evaluate the integral
[* *-:l'r-*
lo
@l
* n) dx'4*
absorbs light at frequency u; due to an atomic transition. The imaginary part
of the dielectric constant may be approximated as a66(u u1,). Use the Kramers-Kronig
lmaterial
-
relations (Chapter 2, Example 2.24) to determine the behavior of the refractive index
n
JPalt/eS as a function of frequency. Comment.
:
30. Demonstrate the sifting property of the delta
sequence (6.5),
I sin2 nx
""--Fin the case that f (x) has a second-order pole 4t z : 7o in the upper half-plane. Can you
hnE):
extend the result to a pole of order m?
CHAPTER 7
Fourier Transforms
7.1. DEFINITION OFTHE FOURIERTRANSFORM
The Fourier transform is the second integral transform that we shall study. Like the Laplace
transform, it provides a convenient way to solve an inhomogeneous differential equation.
But the Fourier transform proves to have many other useful applications in physics, as we
shall see.
Recall that a function f (x) may be represented as a Fourier series over any finite range
-L<x<lL:
f (x):
*oo
,D_on"'"o''t
Outside of the selected range, we obtain a periodic extension of the original function. The
set of coefficients a, forms a unique representation of the function /(x) over this finite
range. Each an may.be calculated using the integral (equation 4.31)
o^: i,1
f+L
J_, trnr-inrx/14,
Now we'd like to increase the range (-L, L) until / may be represented over the whole
real line. But as I -+ oo, nn I L --> 0 for any finite n, so we cannot use equation (4.31) as
it stands. We begin by defining
k:no
ard an
:
L
a(k)
so that
f
(x)
:
*oo
t
L-4lL-
a(k) eik'
^
and
a(k):
+ I_"" f (x)e-ik, dx
323
324
cHAPTERTFoURIERTRANSFoRMS
We multiply this expressionby
Here we may let
L
-->
a
LJTIi
and define
to obtain
F(k):
# l::
(7.1)
r@)e-ik'dx
provided that the integral converges.
Now the difference between two neighboring values of ft is
ui:-
(n*l\n wr r
L - L:T
Thus, we may rewrite the original series in terms of
,f(")
:D
as
2
7f
: -+l- )
Jzn
-
7
Then, as we let
obtain
F(k) and Ak
F(k) ,ikx
61l'
l, -+ oo and consequently A,k --> 0, the sum becomes an integral and we
I r+f(x): _t.,8; I F(k)eik'dk
-*
(7.2)
Equations (7.1) and (7.2) define the Fourier transform and its inverse. The factor of I / J2n
appears in both expressions-this isthe symmetric Fourier transform. The symmetric transform obeys the following symmetry relation:
If F(k)
is the symmetric Fourier transform of
f (x), then /(-k)
is the transform of
F(x).
An alternative definition of the transform has a factor I l2r in front of one integral and
unity in front of the other. The product of both factors must be ll2n.It is also possible to
define the transform with a plus sign in the exponential in equation (7.1) and a minus sign
in the exponential in equation (7.2);infact, this is usually done when the variable is
variable.l
lThe difference is related to the metric of space-time in special relativily.
See Section 7.3.7
a
time
for an example.
7.2 SOME
EXAMPLES 325
The Fourier transform and the Laplace transform are closely related. To see how, let
ift
:
s in equation (7.1). Then we have
Jzo
In the special case that
transform of
/.
f (x) :
reir: [**
J-*
0 for
.r <
f (x) e-sx dx
0, the righrhand side becomes the Laplace
The inverse (equation 7.2) is
I 1*;oo
f(x):GJ
:FGis),'.0:
I
:
/+'oo
J-,* "o'(-is)
^
i
i
I
e"
ds
which is the Mellin inversion integral (equation 5.19) with y --> 0.
An important difference between the two transforms is that the Fourier transform may be
applied to functions defined over the whole range -oo < r < +oo. However, the function
must approach zero as x -+ *oo for the transform to exist.2 In contrast, the Laplace
transform exists for functions that diverge as r -+ oo, provided that they do not diverge
faster than an exponential (see Chapter 5, Section 5.1). The Laplace transform is restricted
to functions defined for positive values of the argument.
7.2. SOME EXAMPLES
Example7.1. Find the Fourier transform of the function
where cv > 0.
/(.r) - exp(-o lxl),
The transform is given by equation (7.1):
F(k):
:
I
/+oo
h J_*
h
:&
,-utxt
"-ikx
(l_*u'.ik*
7,
dx
+
fo**
"-'*r-'o'
or)
_ I (ro_ltt,1o _r-ro+ilr"l+-\
\"-,-l---
t / r
"+," l, )
-l \
___
- Jr; I\cv-ik
a+ik)
_.1
_ td
Y;aTF
I
(7.3)
2This restriction can be lifted if we are willing to consider the function
/(x) to be a generulized function.
Lighthill's book for an extensive discussion. See also Example 7.3 and Problem 28.
See
326
cHAPTERTFoURIERTRANSFoRMS
Now let's go backwards and find the function that corresponds to this transform. From
equation (7.2),
: ;l/+-a
dk
J-* ;+ tceik'
This integral is of the type that can be evaluated using the residue theorem (Section 2.7.3,
Example 2.20). For x > 0, we close the contour upward with a big semicircle (Figure 7.1).
Im (k)
i
j
I
Re (k)
FIGURE 7.1. To invert the transform (7.3), we complete the contour in the upper half-plane when
x>0.
The integral along the semicircle is zero, by Jordan's lemma. The integrand has two
: Lia. Only one of the poles is inside the contour. Thus, the integral is
simple poles at k
l_-
*,ik'dk: f ,+r,ik'dk
/a\
: 2r i (;i)'i(ia)x
-
7T
e-o'
and so
(x): lbre-"'):
1
.f
s-dx
7V
For
x < 0, we must close the contour downward. The pole at k : -ia
contour, and we go around the contour clockwise (Figute7.2).
is inside the
7.2 SOME EXAMPLES
Im (k)
Re (k)
FIGURE 7.2. For x < 0, we close the contour in the lower half-plane.
Thus, the integral is
[_ *,ikx41,: t' -+r,ikx7p
: -2ri (-:)
,i(-iutx :
\-2ia /
andhence
f (x):
eax.We maycombineourresultsforx
f (x) :
ireux
< 0andx > 0toobtain
exP (-cv lx l)
as required.
Exampte 7.2. Find the Fourier transform of the Gaussian function
The transform is
F(k)
: 1 /+oo
G J"
7yr-o2*2r-ik* 7*
We do this integral by completing the square:
-o*2
-
ikx
j1\
: -ot2
- (\^*, *' 4*q,2n 4oa ) ik\2 _
"/
:_q"\.**)
Now we change variables to u
F(k): N
t*
-
4q2
p2
4A
: olx + ikl2a2):
/ e_\
Jzr'*P\-m)
[+@+ikl2d e
._,zdu
J-a+irpa
-
f (x) :
N
e-o"'
328
CHAPTER
7
FOURIER TRANSFORMS
The path of integration has been moved offthe real axis by the amount k f 2a.However,
since the integrand has no poles, the value of the integral is not changed. To see why,
constructarectangularcontourfromRe (u) : -RtoRe (u) : *R, withheightkl2a
(Figure 7.3). The integral along the two vertical pieces at the ends of the rectangle
goes to zero as R -+ oo:
ll,,o.u*
o"l= *max
*^
le-t+n+;rr'l
le-n'?+ziw+f|
*"*r(*)'"-R'--' 0
asR-+oo
Im (z)
Re (a)
FIGURE 7.3. The transform of the Gaussian is computed using an integral along the upper side
of the rectangle. Since there are no poles inside the closed contour and there is no
contribution from the short sides, the integral is the same along both long sides.
The integral around the entire rectangle is zero, by the Cauchy theorem, so the integral
along the upper path is the same as the integral along the real axis. We know the value
(Appendix IX). Thus,
of this integral-it is
"fi
F(k)
:
N
/ k2\
oJr"*p \)
^d
Therefore, the transform of a Gaussian is also a Gaussian, but their widths are inversely
related. The original function reaches one half its maximum value at x : 0.83/a; the
transform reaches one half its maximum value at k : 0.83(2q).
This result is closely related to the uncertainty principle in quantum mechanics. The
relation between momentum and wave number k is
P:hk
A particle is described by a wave packet, and the uncertainty in its position is determined
by the width of the packet---essentially llu.T\e width of the corresponding pulse in
momentum space is h A,k
:2ha.Thus,
the product of the two uncertainties is
L.x A,p
I
:2h
- -2ha
d
7.3 PROPERTIESOFTHEFOURIERTRANSFORM
329
for a Gaussian wave packet. The quantum mechanical relation is
t*tp2h '2
Thus, a particle that is well localized in space is poorly located in momentum, and vice
versa.
Example
7.3.
Find the Fourier transform of the function
/(.r) : l.
The transform is
F(k\: L
'.,/2n
[** ,-ikx 4,
J-a
We can recognize this integral as the delta function (equation 6.16):
F(k): Jn
ag,>
Then, by the sifting property, the inverse is
f
(x): I
f+oo
h J_* J2n 6(k)"ik'dk: t
[Note that etkr does not approach zero as x approaches infinity, so it is not a proper test
function as we defined them in Chapter 6. We must n€urow the class of distributions
by putting the requirement g(x) + 0 as.r -+ oo onto the core functions rather than
the test functions. This narrower class still includes the delta function as well as many
other useful distributions. l
Examples 7.1-:7.3 demonstrate the most commonly used methods for evaluating trans-
forms and their inverses:
1.
2.
3.
4.
Integrate an exponential with linear argument.
Complete the contour with a large semicircle and use the residue theorem.
Complete the square.
Identify the delta function or use the sifting property.
7.3. PROPERTIES OFTHE FOURIER TRANSFORM
Because the Fourier transform is similar to the Laplace transform,
properties that we found in Chapter 5.
7.3.1. Linearity
The Fourier transform operator
f
is a linear operator:
T(f+il:fU)+F(s)
f
(af)
: aF(f)
it
shares many of the
330
oHAPTERTFoURTERTRANSFoRMS
7.3.2. Complex Conjugate
Ifthe function f(x) is real, then
F*(k\
f+oo
: I1
l:
I
f (*)"-ik" orf. :
lJ2r l_a
# I:: f (r)e+ik'dx: F(-k)
(7.4)
7.3.3. Differentiation
We evaluate the transform of a derivative using integration by parts:
'(#):hI IIJ#''*0.t
s.-ikxl*-"'l-- -- ,D; ,'
/*--
h J-* -ik-se-ik* dx
:
'(#) h ll* ,"-'o' dx - ikF(k)
(7.s)
The integrated term vanishes, since the transform exists only for functions /(x) that approach zero as x -+ *oo. Notice that in the case of the Fourier transform, unlike that
of the Laplace transform, no initial conditions appear. Extension to higher derivatives is
straightforward:
,(#):
(ik)'F(k)
(7.6)
The sign that we use in the exponential when forming the transform (equation 7.1) is
reflected in the sign that appears in the derivative rule (equations 7.5 and 7.6). If the transform
is defined as
F(o):
I
f+- (t)e+i't
dt
f
h J_*
then the first derivative has transform
'(#)
: -iaFko)
(7.7)
i
7.3 PROPERTIES
TRANSFORM 331
OF THE FOURIER
7.3.4. Attenuation and Shifting
Let g(x)
:
eo'
f (x) . Then the transform of g is
G(k):
:
If g(x)
: f (x -
I
/+oo
e"''
h J_* f
(x)r-ik'r 4*
h l** ror"_-i(ia-tk)x 4*
:
F(k -l
G(k)
:
I
: x-
a, we have
ia)
(7.8)
a), then
Changing variables to u
G(k):
:
/+oo
(x
h J_* f - a) r-ikx 4*
I
h
/+- (u)e-iktu+o)
J_* f
4u
e-ik" F (k)
(7
.9)
Thdse relations are analogous to (5.5) and (5.6) for the Laplace transform.
7.3.5. Parseval's Theorem
The integral of the product of two functions may be related to the integral of the product of
their transforms.
r
a+m
a+oo
f+*
.,._ f+*
d*l
a*rQe)eik'I
IJ-- fr*le(x)dx:+l
"
2r J-J-*
J--
:
l::
F&)dk
:
l_:
F&)dk
:
f +oo
J_*
doGlo)ei'r
lll crao,* f+* "io'+'t'd*
l_l
cr,t 6(k-ta)da
F(k)GeDdk
If the function g is real, then we may
use equation (7 .4) to write the result as
l** to, g@)dx
F&) G*&) dk
(7.l0)
332
cHAPTERTFoURIERTRANSFoRMS
This result is called Parseval's theorem.3 An important special case occurs when
/ :
g.
Then
r+oo
r*oo
I Lfe)l'dx:I
J-a
J-*
The absolute value signs are necessary since even
(7.tt)
F&)2dk
if f (x) is real, F(k)
may not be.a
7.3.6. Convolution
Suppose the transform
H(k)
:
F(k) G(k) is the product of the two transforms F and G.
Then the inverse is
h(x):
:
I
/^+-
h
J_*
I
/t+oo
H(k)eik'dk
h J-* 'rorG(k)eik'dk
Now we write the transform G(k) in terms of the function g:
h(x):
+
l:: ,n ll.: s(u)e-iku aul,ik'an
We then combine the exponentials and interchange the order of integration:
h(x):
I
f+oo
h
J
r
*
r+oo
I
s{")-L16
J_*
.+m
I
-L
Jzr J-*
s@).f(x-u)du
which is the convolution5 of the functions g and
h(x)
,rorrik(x-u) 4P4,
f
.
Equivalently, we may write
:
the inverse transform of
:
-l-*
F(k) G(k)
(7.12)
l** ra g@ - u) du
3compare with equation (4.32) for Founer senes.
4See Example 7.4.
5This convolution differs from the one defined in Chapter 5 only by the numerical factor
of the limits of integration.
l/J2tr
and the values
7.3 PROPERTIESOFTHEFOURIERTRANSFORM
333
7.3.7. Extension to N Dimensions
We may extend the theory of Fourier transforms to as many dimensions as we wish.
Frequently, in physics, N : 4: three space dimensions and one time dimension. If / is
a function of the three Cartesian coordinates x, y, z (N : 3), we simply transform with
respect to each of the variables, one at a time.
FG) : F(kx,ky,kz)
:
I:: l-: l-- ,r'!,
#
: : lf
/
/til
(J2r)z Jdl
,pu"" "
exp
z) e-ik"x dx e-ikvv dv e-ik'z
(-ik .i) dv
It is usual to define the time transform with the opposite signs, so that for N
F(i.a): ^: [*" I
\z1r )' J -a Jdl
d.
f G,r)exp
[-i([ .i-
:
4,
att)ld3idt
space
Then the function
,f
lf+@l
(i, r) :
I I
-l\zn ). _a Jail i
J
Fd<, alexp
space
[i(f .i -
c,.,r)]
a3iaat
is a sum of plane waves.6
The transform of a derivative may be calculated using a method similar to the one we
used with the one-dimensional derivatives in Section 7.3.3. For example,
-lt-
F(v
f
)
(t/2n\3
/
Jall
tv./(i)lexp (-ik .*) dv
space
lf
------:= /
tVtf t*) exp (-tf ' *)l - /(DV exp (-tf .h\ dv
- \\/'27r)'
Jall
lf
:: (J2tt)3/surfaceu,-"''
I
fr*)exp(-if .i)fids
space
lf
- ("/'2T)' JallI
-----
Now provided thatT
/
space
-irtt(i)
exp
(-ii.;i dv
approaches zero faster than I / r2 as
r ->
oo, the surface integral
(-ii
.*) dv
is zero, and
rri n :
0
+ irt
ah
l^r"o
*/(*)
exp
6Co.p*" with
Chapter 2, Section 2.1.4
TAgain, this condition is also required for the existence of the transform of
/.
334
cHAPTER
7 FouFtER TRANSFoRMS
F(V
f):
(7.t3)
ikF(k)
Similarly, we may show that
TN2fl:-nzr1i<1
(7.14)
f(ixi):;f"i
(7.1s)
and so on.
7.4. CAUSALITY
In an initial value problem, the initial conditions appear explicitly in Laplace transform
theory. In Fourier transform theory they appear more subtly. Let's look at an example to
see how this occurs.
7.4. An electron, initially at rest, is acted upon by an electric fielA fr1r; :
io"-"|for t > 0, where o is real and positive. The electron is also subject to a damping
Example
: -yi. Find the subsequent motion of the electron.
The equation satisfied by the electron's velocity is
force F4
d_
^fit+yi:-ei,1*..t)
First notice that the motion will be one dimensional. Then we take the Fourier transform of the entire equation with respect to time. Define the transform by the relations
i(r)
:
I
/+-. i{a1e-i't da
h J-*
and
ilar;
: I r+- i(r)ei't dt
h J_*
Then the transform of the electric field is
E1r;
: ErhI
:E-:- "
/"+-
J,
L-at
"-at
uiat 4,
ri<ot 16
J2r -a *
-11
: f^-,:
-l
"J2ra-ia
ialn
335
7.4 CAUSALITY
The range ofintegration reduces to positive values of /, since E
:
0 for
/ < 0.
Transforming the differential equation and making use of relation (7.7) for the
transform of the derivative, we find
-imai
I yi : -rfi,o!
E@-ia)
1
and thus
ilar;
-1
: eEs
Jz"
-
e
@
-
ico)(ima
1
Eo
^""
-
y)
1
(7.16)
J2n @*ia)(a-fiy/m7
Now we invert the transform:
i1r;:
hl_:
e)
' En I'
nx " Jzr
1
ko
* ia)(o *
iy lm) "-iat
7.
We can use the residue theorem to evaluate the integral if we close the contour with
a big semicircle.8 For / > 0 we close the semicircle downward, while for r < 0
we must close upward. But for / < 0 the electron has felt no force, and so i must
remain zero. Thus, causality9 requires that the transform have no poles in the upper
half-planel The poles of our integrand are at (D : -ia and o : -iy /m, and both
are in the lower half-plane (Figure 7.4). Notice that one of these poles is contributed
by the differential equation (-iy lm), and one by the driving force (-icv). It is at this
point that the initial conditions enter our solution.
Im
(rrr)
Re (rrr)
FIGURE 7.4. Contour for inverting the transform (7.16) when r > 0. Both poles are in the lower
half-plane and thus are inside the contour.
8see Chapter 2, Section 2.7.3, especially Examples 2.20
and2.2l.
9This argument proves important in deciding where to put the integration path when there is a pole on the real
axis. See Chapter 2, Section 2.7.3.
336
CHAPTER
7
FOURIER TRANSFORMS
For / > 0 we integrate around the lower contour. The contour is traversed clockwise,
so we introduce a minus sign. The solution for r > 0 is
i(r)
/
: !!ior-ro,,
2rm
/m)t
e-i(-ia)t
\r-n+ota*
"-i(-iy
(-iy
/m -f ia)
:
'"o (r-ot - r-hty
am-y
It is easy to verify that i : O at t :0, and also that i
-+ 0 as t -> oo.
7.5. USE OF FOURIERTRANSFORMS INTHE SOLUTION
OF PARTIAL DIFFERENTIAL EQUATIONS
The solution of a partial differential equation will be a function of two or more variables. If
the equation is linear, we can apply the transform operator in one or more of the variables
to obtain either an ordinary'differential equation or an algebraic equation. In this section,
we show how to apply these techniques to two common partial differential equations.
7.5.1. The Wave Equation
J.5. Suppose an infinitely long cable is pulled up at .r : 0, so that its
shape is described by the function f (x),: \s-lxl/a, and then let go. What is the
subsequent motion of the cable?
The equation of motion for the cable is the wave equation (3.15):
Exampte
o2y: ui*'
,ozy
aP
Let's transform the equation with respect to space and time. We define the transform
as
follows:
i(t<,a):
*rE
y(x, t)
"-ikx-tiat
4*
49
I
l
l
and, conversely,
y(x,t): !2n [** f* ,rn,11"ik*-i't
J-* J-* "Then the transformed equation is
a-
i
a
a-
-(t)'Y : -U'k'Y
ko-uk)(a*uk)j:g
dkao
IN THE SOLUTION OF PARTIAL DIFFEBENTIAL
7.5 USE OF FOURIER TRANSFORMS
EQUATTONS 337
One solution to this equation is the trivial solution i : 0, but this is not the solution
we want. We might be tempted to say that the equation is solved by taking o - *.uk,
but we want a solution for the transform i@, k). The transform is zero except where
a : fsft-v
property that is reminiscent of delta functions. The nonzero solution
that we need is
j :
A(k)6(a
-
uk)
+ B(k)8@ * uk)
since we have already established that x6(x) is the zero distribution (Chapter 6,
Section 6.3). The equation becomes
+ B(k)(ot - uk)(co * uk)6(to'f uk)
: A(k)(to* uk) x O * B(k)(c,t- uft) x 0 : 0
A(k)(a
*
uk)(a
-
uk)6(a
-
uk)
as required.
Then the solution for y(.r,
y(x, t)
t) is
: I /+oo /"+oo tattlo
; J _* ;/__
(o
-
uk)
*
*
B(k)s(a
uk)l ,ikx-iat 4p 4,
: ! [** ,ik'[A(k)"-iukt * B(k)e+i'ktf dk
2n
J-*
This expression shows that the displacement of the cable is a sum of rightward-moving
and leftward-moving waves.
0y
To solve for the remaining unknowns, we must use the initial conditions that
/0t :0 and y - 11"-lxl/a at t : O. Thus,
y(r,0)
:*
I::
,ik'tA(k)+ B(ft)l dk
-
he-txt/a
and
#1,:,:
* I::
iukeik*1-A1k) +
B(k))dk:
o
From the second of these relations we conclude that A : B, while from the first we
may make use of relation (7.3) with a : l/a to write the transform of the initial
condition on y:
t, ha
Y
:
1-...
alL\
zl+kzaz tE-""-'
-'l
A(K):
ha
TTFT
So, finally, our solution is
t- nu
r-Foo
^ikx
I
:----(e-ivt't + e+iukt)dk
y(x,t\ "- 2o J-* lrk2al
338
7
CHAPTER
FOUFIER TRANSFORMS
To do this integral, we use the residue theorem:
hf+*1
^
y(x,t)--L,
ztra J-* &+i/a)(k-ilat
tlsikx-ivkt+eikx+iukt)dk
There are two simple poles on the imaginary axis at k : Ii /a.
lst term: Case I: x - ut < 0. We must close the contour downward, thus enclosing
the pole at k : -i / a. Then the integral along the big semicircle is zero, by Jordan's
lemma. We traverse the contour clockwise. Thus,
2ni ei(-i)(x-ut)/a
:;i ,k-ut)/a
it:_*ii
z
Ji
Case
k
: li
II: x
- ut > 0. We must close the contour
upward, enclosing the pole at
/a. The result is
tr:ih2ni
2nd term: Case III: x
*
ut <
2n
"i(i)(x-ut)/a
x
:h "-(x-vt)/a
2
0. We close downward and get
i
ex+Dt
tz---=-,.-- "i(-i)(x1-ur)/a
-,.
Lft
-ztCase [V:
x
I
ut >
/a
z^
0. We close upward. The result is
iz:
2ni ei(i)(x*utl/a
*ii
,
:;i
e-@+Dt)/a
z
Finally, we put all this together:
If x <
-ut
for
/ > 0, then also x <
y(x,t) If
-ut
h "(x-ut)/a
u/, so we have Case I and Case III:
I
"@)-vt)/a
:
he"/o"orh
A
a
< x < ut, then we have Case I and Case IV:
Y(x't) - oe(x-ut)/a
+2e-G+u')/a
-
|rr-ut/a cosh 1
a
If x > ut, then we have Case I[ and Case IV:
4
Y\x,t): h"-(x-rt)la 2-"-G*ut)/a -
he-xlo
{
"orh a
This solution is shown in Figure 7.5. The initial peak at x : 0 propagates both
rightward and leftward along the string and decreases in magnitude with time. The
7.5 USE OF FOURIER TBANSFORMS
339
IN THE SOLUTION OF PARTIAL DIFFERENTIAL EOUATIONS
displacement is never negative, so the string does not oscillate as we might have
expected. This is a consequence of the infinite length of the string and the fact that
the wave is propagating in one dimension (along a line-the string).
Y(x, t)
I
I
0.F
0.6
..'a't.
\
--::.....
!a
-5-4-3-2-1012345
FIGURE 7.5. Shape of the string in Example 1.5 at
ut
/a:
0, l, and 2.
7.5.2. The Diffusion Equation
7.6. At t : 0, a small amount of mud of mass rn is introduced at the point
:
x { into an infinitely long pipe of cross-sectional area A, containing fresh water.
Example
Determine the distribution of mud in the pipe at times t > 0.
The appropriate differential equation is the diffusion equation (equation
3.
l4):
Dp
. _n _o2p
E-"ar2
where D is the diffusion coefficient for the problem. The boundary conditions are
p:0forallxfort < 0,andp -+ 0asx + oo. Att:0,allthemudisconcentrated
at one point. We may model the density as a delta function:
p(x,O):YX, - t)
Since the differential equation is first order in time, we may be successful by taking
the Fourier transform with respect to x only:
F&,t):
I
h
/^+-
J_*
or*,t1e-ik* dx
Transforming the whole equation with respect to.r, we get the first-order equation
ai :
At
-k2
Di
340
oHAPTERTFoURTERTRANsFoRMs
which has the solution
i :
The initial conditions determine
io
fu"-k2Dt
/s:
: v(k,q :
h lll ,r,,01 e-ik'
dx
1 /.+oo
::+m
I tk - t\e'ik, dx
AJ2n J-*
:7*e-ikl
m
Thus, we have
v&,t):|ff"-|"'
and transforming back gives
p(x,t):ft+,f*"^o
(ikx
-
ikt
-
k2D\dk
To do the integral, we complete the square:
k2
Dt + ik((.
-o
:
(*Jot
.,#)'
*
la =!f
Thus,
p(x,t):T+^,(-i+) /_ *, (r-*,?)'foo
lNow let u
:
kJ Dt +
p(x,,)
i(t - x)/2.,,fii,
: T*
"-,
m I
'72"trDt
and thus du
: tDt
(-#) t::::H
dk. Then we have
e-"'
:h
( (x-l)2\
"^o
\- o- /
Thus, the mud distribution is a Gaussian that spreads with time while the maximum
density decreases (Figure 7.6).
7.6 FOURIER TRANSFORMS AND POWER SPECTRA
341
4 p(x)
m
(''... /'2
S-4.,
-5-4-3-2-1012345
FIGURE 7.6. Distributionof mudin Example 7.6for
and 2 m (dotted line) and I : 0.
x(m)
JDt : l12m(solidline), I m (dashedline),
7.6. FOURIER TRANSFORMS AND POWER SPECTRA
The Fourier transform relates the time dependence of a function f(t) to the frequency
dependence ofthe transforn F(a). Thus, it is a useful tool for analyzing the frequency
components present in a physical system. One of the most common applications is to the
electromagnetic power radiated by a system.
An accelerated charge radiates. The power radiated per unit area of wavefront is given
by the Poynting flux:
S=
ExB
Po
Far from the source,
i
is perpendicular to both fi and fr:
k(E.E)
Using
dA
:
- E(k.E)
r2d{2, we find that the power radiated per unit solid angle is
4! :rr13,1-"t'
dA
LLoc
342
CHAPTER
7
FOURIER TRANSFORMS
where E and hence P are functions of time. Then the total energy radiated per unit solid
angle is
ry:
[** Lfr'ro.fr,(Dat
dA J-Foc
Using Parseval's theorem (equation 7.71), we may write this in terms of an integral over
the transform of E:
ry:
[** Li.td'fr:@)aa
dQ J-tLoc
Thus, the energy radiated per unit solid angle per unit angular frequency is
+:da
Li,@).i.(,)
dQ
(7.r7)
LLoc
which is the power spectrum.
The electric field due to an accelerated charge in nonrelativistic motion has two parts: the
Coulombfieldthatdecreases aslf r2 andtheradiationfieldthatdecreases asl/r.Thus,at
large distances from the charge, the electric field is dominated by the radiation field, which
takes the form (Jackson, Chapter 14)
*E: skx(txd)
4nw+
:
d2*/ dt2 is the acceleration of the charge and fi is a unit vector along the direction
ofpropagation. Therefore, the energy radiated per unit solid angle per unit angular frequency
is independent of the distance r to the source, and d2 W / dQ do may be expressed in terms
of the time transform of the charge's position, since d(ar) : -ro2i.(to) (by the derivative
property, Section'7 .3.3):
where d
d2w :
'" ^
:
q2
iN&;AFaa
)
^,1-,,
cr (4n)zes
where d is the angle between d and [.
Integrating over the angles, with pc
dw :
;
q2
1*'1a112 sin2 e
:
,a
lir(r)12
sin2
(7.18)
e
cos 9, we find
r*l
4,0
l*(,)12 (r",
J_, -
Fffi,a
r,2)drt
: r!3 cs(4n)zeo
,,1'.^ ,a 1fi.(r)lz
: -{ra
bft c" eo
Bko)12
('t.ts)
343
7.7 SINE AND COSINETRANSFORMS
7.7. Find the power spectrum radiated by the electron whose motion we
considered in Example 7.4.
Example
Here
it is more
convenient to write d1a-l; ln terms of
i(ro) so that we can use
equation (7.16) forthe transform ofthe electron velocity. The electron radiates energy
per unit angular frequency:
I _l'
I
e2 .ledw _ q2 .,2e , :
lv(all' l-En
---:-tD- t'r-tt
6o1310* li"t J2" @*irr)@+iy/m)l
d, 6113^,2
,4 E3
-0)z
(7.20)
-
12n2*2r3 eo (a2
*
a2)la2 + 0 /m)21
The power spectrum (equation 7 .20) is shown in Figure -7 .7 . The spectrum peaks at
o-
Juffi.
dw
dt
I
L
I
t
I
i
45
Id
6'7
FIGURE 7.7. Power spectrum radiated by the electron in Example 7.7 for y /m
and y
lm - 2a (dashed line).
The frequency variable is
:
0.5o (solid line)
o/u.
7.7. SINE AND COSINE TRANSFORMS
In Chapter 4, we saw that the Fourier series of an even function contains only cosines while
the series of an odd function contains only sines. Similarly, we can construct sine and cosine
transforms for odd and even functions.
Sine and cosine transforns are used when either
1. the function f (x) is known to be even or odd or
2. the function is defined only forx > 0.
344
7
.7.1
Let
cHAPTEFTFoURTERTRANSFoBMS
. The Cosine Transform
f (x) be defined for x >
range
0, and let us make an even extension of the function to the
.r < 0. Then the Fourier transform of the extended function is
F(k):
# I::
ysle-ikr
dx
: -* (lt
(x)
*ral "-ikx 4, * Io** f e-ik' dx)
:
(i
# (- /: f G,u) "ik' du * lo** f e-tk' dx)
:
# lo** fr*rrrikx + e-ikx) dlx
,
F(k)
: ,l:
lr.* /(x)
cos kx
dx = F"(k)
/(r).
which is the Fourier cosine transform of the original function
transform is an even function of /c:
Fc(-k) :
?
7f
lo**
/(x)
Io** /(x)
cos
(7.21)
Note that the cosine
(-kx) dx
cos kx
dx
:
Fc(k)
The inverse transform is given by
6
.f
r+o
(x): rl1 I
Y 7t Jo
F"(k)coskxdk
To verify this expression, we expand out the cosine:
f(x):
:
h
(1.-
F"(k) eik' dk
+ Io** r"Q<) e-ik' ate)
(7.22)
l
l
I
7.7 SINE AND COSINE
Now let rc --
-k
in the second term, and use the fact that F"(ft) is an even function of k:
f (x) :
:
Thus, equatiot
7.7
.2. The
TRANSFORMS 345
(7
Si ne
h (lr*
#
I**
F"1k1eik' oo
'"n'
"ik'
-
Io-*
F.(-r) ,i-' a*)
dk
.22) corresponds to the usual inversion relation (7.2).
Transform
The sine transform is defined similarly by making an odd extension of
/.
The Fourier
transform of the extended function is
F(k):
:
:
# I_: r@)e-ik'dx
h (l'*ro, "-ikx 6* *
h(-
/:
f ?u)
"iku
du
lo**
*
Io**
f (i e-tk- dx)
d*
lo** fr*rr"-ikx - "ik';
r+*
: -i tl lT
I
/(x) sin k-r dx : -i
UTJo
=
f (x) e-ik' dx)
h
F"(k)
where the sine transform is defined by
F.(k) =
(7.23)
which is an odd function of fr. Again the inverse is
f (x)
: ,l 1 lr.* F,(k) sinkx dk
(7.24)
346
cHAPTERzFoURTERTRANSFoRMS
Let's verify equation (7.24):
tl
lT r+* Fs(k) sinkx dk :
; J,
tl:*/*-
nror< eikx
: ,11+(1.*
: # I**
-
e-ikx)dk
'!!,' "'o' oo
oro,
"ik'
dk
* lo-* Fl--*) "'.' d*)
: f (x)
as required.
7.7.3. Use of the Sine and GosineTransforms
Like the Laplace and Fourier transforms, the sine and cosine transforms prove to be useful
tools for solving differential equations. Thus, wti: will need the transforms of derivatives of
a function in terms of the transform of the original function. Let's start by finding the cosine
transform ofdfldx:
t;
: _V;f
ro1+ftrs(k)
The transform of the derivative brings in the initial condition /(0), as is the case with the
Laplace transform, but it also mixes up the sine and cosine transforms. The Fourier cosine
transform of df I dx is k times the Fourier sine transform of / (x). A similar mixing occurs
with the sine transform:
^(#):,11 l,** ffsinkxdx
,11(tsinkxlf -olo**
:
_kF"(k)
This time no initial conditions appear.
f
cosrcxdx)
7.7 SINE AND COSINETRANSFORMS
347
We get back the transform we started with (sine or cosine) when we look at the second
derivative:
#.:o.r^(#)
4!^
,:o+
df
dx ,:o
kl-kFc&)l
-
k2
p.(k)
(7.2s)
and
'"(#)
: _r^(#) :
:
(#)
^
r,
-/. (-
|]
17 r <ot -
k2
rror+
k*(k))
r"(k)
(7.26)
These results suggest that the sine and cosine transforms will be most useful in solving
differential equations that have only even derivatives or only odd derivatives. The choice
of transform will be determined by the initial conditions that are given. For second-order
equations:
. If the value of the function is known at x : 0, use the sine transform.
. If the value of the first derivative of the function is known at x :
0, use the cosine
transform.
If we were to use the Laplace transform method, we would nee d both initial conditions . It
appears that we can get away with only one condition if we use the sine or cosine transform.
But it is implicit in the Fourier transform (sine or cosine) that the function /(x) approach
zero as r --> oo, and this provides our second boundary condition. No such constraint is
needed with the Laplace transform.
Example
7.8.
Use a Fourier ffansform to solve the equation
d2v
77-d"Y:o
with the boundary conditions y(0)
:
y0 and
y -+ 0 as r
+
oo.
348
oHAPTERTFoURTERTRANSFoRMS
Since we are given y(0), we use the sine transform:
"IT
-k2f,(v) + r1f
;n - o2F,1v1 = s
The solution for the transform is
l,
,rvn
F,(y): l;eld
Now the inverse is
y(x):
,l: I,* li;+rsinkxdk
The integrand is even in rk, so we can extend the range of integration:
I /Poo kvn ,ikx - ,-ikx
t\x):;J__Ffr
, ar
We use the residue theorem to evaluate the integral. The integand has poles at k :
*ia. For the first term, we close the contour upward (remember t t 0), and the
integral around the big semicircle is zero, by Jordan's lemma. Only the pole at k :
*ic is inside the contour (see Figure 7.1), and we get
1 [*
L_
,!!e
t ro
dk _
,i{ro1, _
2ia
2ri J -* & * ia)(k - iq)"ik,
!.o
2
"-ax
Similarly, for the second term we close downward, enclosing the pole at k
Iz:
I
/Poo
fr J--G *
kyo
ia)(k
- ia)"-ikx
41,
: -ia:
- _-ioro "-i(-ia)x - -l!2-,-ox
-2ia
where the extra minus sign accounts for the fact that we go around the contour
clockwise. Combining the two terms, we find
)(x): h-Iz:loe-ax
which is the expected result.
Let's
x
:0,
see
how the solution goes using the Laplace transform. We don't k'now dy
so let's call
it b.
r2Y(r)-slo- b-a2Y:o
-i------ru-slo*b
- s' d,'
-
/dx
at
TRANSFORMS 349
7.7 SINE AND COSINE
Inverting,lo we get
:
y(x)
yo cosh cr.r
+4
a
sinh
cx
Now the second boundary condition requires that the positive exponential terms in this
result sum to zero, so we need
yo+!:o+
b:-ato
a
Then the solution is
y(x):
yg(coshcux
-
sinhcux)
:
yoe-dx
as we obtained before.
If we can always use the Laplace transform in this way, why should we bother with sine
and cosine transforms? The answer is that we cannot always use the Laplace transform.
7.9. Suppose the system in Example 7.8 is. now driven by a function /(x),
where the exact functional form is for the moment unknown. Find y(x) in terms of
Example
f
(x).
In this case, the Laplace transform becomes
b
F
rz-qz- s2-.,2
svn -l-
i:
To invert the second term, we use the
convolution theorem (equation
5
.
17). The inverse
is
y(x)
:y0
coshcvr
+
I
sinhar
aJoq
* ['
,rrrsirtha(x
-
E)
o,
Until we evaluate the convolution integral, we cannot make use of the boundary
condition at infinity. However, if we use the sine transform, we get
.fs())
l, kyo- F,(f)
: y;F
*7 trTA
To invert this expression, we need to work out the convolution theorem for the sine
transform:
r;ttt"(f)a(s)l
: rl:
Ir*
F"(f)F,(d
sinkx dk
lZ /^oo I/ lZ
- foo g)sinkt d€ \
F,@
| ; J,
f
ll ; J,
)
sinkx dk
:?ltJo[* f @ Jo
[* r,(s)]tcosr(x
-f) -cosk(r +€)tdkd€
z
louse partial fractions and Table 5.1, or use the Mellin inversion integral.
3s0
CHAPTER
7
FOURIER TRANSFORMS
F;t[F"(f)F,(g)]
:
r[*
Jzn
Jo
f G)tE@ -
5)
-
E@
+ €)ld€
(7.27)
where
E@): F;rtF,(dl
is the inverse cosine tansform of .Fr(g).
To solve our problem, we need the inverse of G(t) : ll(k2 + q.2). This function
is even, but fortunately we need its inverse cosine transform. The method closely
follows what we did in Example 7.8, the only difference being the lack of the factor
k in the numerator. (See also Example 7.1.)
i@):
r;' (Fld)
:
:
I
:
f* 'ikx 1 e-ik' "
& l-* 4*'ao4"*
li ,-"'
2tri f ,itir)* ,-i(-ia)x\
ft
(td6 - t-,6
): ''li;
(x > o)
greater than zero, x - f is not, because f ranges from zero
since
we took the inverse cosine transform, the function f(x)
to infinity. However,
this
example,
must be even. In
E@) : Ji-f2(s-ulxl/cv). Then the solution of our
is
differential equation
Although
x is always
v(x)
:
roe-o,
*
*
Ir*
f G)@-"1,-E1 _ r-u(x+€)y 6q
: loe-dx * * (lr. f G) s-a(x-1) aE + l,*
-*
Ir*
rro e"@-il d€)
f (ile-d(x+€) d€
: e-ax (*. *
d€
lo' rcleo€ - *
*# l,* f c)e-"€
Ir*
rlsve-"e as)
aq
which clearly obeys the condition y(x) + 0 as .r
/, provided that its Fourier transform exists.
-+
oo no matter what the function
PROBLEMS 351
We conclude this discussion by finding the convolution theorem for the cosine transform:
r;tlr"$)F"(s)l
:
rlZ
f
U 7T Jo
F"u)F"(s)coskx dk
: l? ,"G) (,li
coskxdk
l,*
lo* rc,cos/,6d6)
:1
F;tlF"(f)F"(s)l
lr*
:
r"Glf @)tcosk(x -
llooI
Jn
Jo
f G)tsM
-
5) + cosk(x + €)ldkd€
6l) + s(x + E)ld€
(7.28)
This time the functions are not mismatched with the transform, but we must take the even
extension
ofg
when evaluating S@
-
€) for
f > r.
PROBLEMS
1. Find the Fourier transform of the following functions, and verify your results by computing the inverse transform.
" _x2-l 4x-ll3
(a)
1
(b) e-"'2
cos px
(c) I {x)
@ coshax
1"1
x<
otr,.r*ir.
: *
[o
l(t) :
if 0 <
(
(rr-o'
[o
if
I
t>o
otherwise
(0 --+---t
x'+a'
2. Invert the following
(a)
F(ft):
transforms to find the corresponding functions:
t-2ik
l+4k2
I
l(b)lr(k):11,.1r
(c) F(ft) :
1
i sinh ak
-
352
CHAPTER
7
FOURIER TRANSFORMS
istheFouriertransform of f(x), showthat idF/dk isthetransform ofxf(x).
What conditions must F(ft) satisfy for this result to hold?
4. Verify Parseval's theorem in the form of equation (7.10) by evaluating the transforms
of the functions /(x) : cos pr and g(.r) : s-ax2 and evaluating the two integrals in
3.
IfF(t)
equation (7.10).
S
Verify Parseval's theorem in the form ofequation (7.11) by evaluating the transform of
(t if -1<x<l
[o otherwise
/(x) :
f (x)2 and F (t)
F(/c) isthetransform of f (x),then(lla)F(k/a)isthetransform
2
and evaluating the integral s of
6. Provethatif
7.
I
|
.
of
f
(ax).
Show that the result is consistent with Parseval's theorem.
Find the Fourier transform of the function
:
r(t)
(Acosri,nt if
-T<t<T
[o
otherwise
2
that represents a finite train of data. Plot the Fourier power spectrum I F (ar) | as a function
I andaoT 10. Comment. What happens as 7 increases
of arT forthe two cases ar6Z
:
:
toward infinity?
8. Find the Fourier transform of
if -T<t<T
r,-\ : [l-vtt,
\o
'l trr
otherwise
Hence find the transform of the function
(t
I
g(r):(-1
[o
S
if -T<r<o
ifO<r<I
otherwise
Strow that the square deviation between two functions,
o: [**
J-a
V@)
-
g(x)12 dx
equals the square deviation between the transforms,
o:
r+oo
J_* tF(ft) -
G(DP dk
10. A spring-and-dashpot system satisfies the equation
d2x
dx
.
AV+2aV+ofix:f(t)
I
I
l
PROBLEMS
with
@0
>
cy.
The driving force per unit mass
f (t) :
for
11.
t > 0. Find x(l)
for
"f
(t) is zero for
353
/ < 0 and
e-ot sin Q/
t > 0, and verify thatyour
method gives
x
:
0 for
/ < 0.
An electron in an atom may be modeled classically as a damped harmonic oscillator
(compare with Problem l0 above). The electron is driven by an incoming EM wave with
electric field E(t) : Eo(sin At)/fu for -oo < t < oo. What is the appropriate /(l)
for this problem? Solve for the transform .r(ar) of the electron's position.
Use the results of Section 7.6 to determine the power spectrum of the radiated energy.
Plotyourresultsinthecase q
-
10,9:2on.
ogf
Comment.
12. The electric displacement D is related to the electric field B by the dielectric constant €.
In general, e is a function of frequency, so the relationship is one between the Fourier
transforms of D and E:
rt@,
(a)
r'>)
: e(tDfr,.;, a)
Show that the relationship between f,1x,
D(x, r)
:
E(r, t)
r; and i(-r, t) is
/n
OO
+ I G(z)E(x, t J-*
t) dt
and determine an expression for G(r) in terms of e(co).
(b) Find G(t) for the one-resonance model
e(0))-r*a#_*
where ap, ar6, and y are real positive constants and y < ag.
(c) Discuss the physical meaning of your result. Be specific.
An electron in an atom may be represented by a damped harmonic
trl quency
i
&D ang damping rate f . (Compare with Problem 10 with 2a
electric field E0) acts on the electron. Find the Fourier transform i
position as a function of time. If the electron loses energy at a rate P
use Parseval's theorem to show that the total energy loss is
i
LU:-
I
e2
m
/^oo
I
J-* @?o-rt)t+@:212
Note that the integmnd is sharply peaked at @ - a,l6, while E(ar) is a slowly varying
function, and so the integral may be approximated as
LU
:
-2
LtfrtuDP
.oo
a2l
I _*6=#9**0,
Evaluate the integral by contour integration to show that AU is independent of f , and
AU. (In this expression, @s and f are real positive constants, and cr.ro > f .)
hence find
354
CHAPTER
7
FOURIER TRANSFORMS
14. The radon problem. Radon diffuses from the ground into the atmosphere at a rate
r (atoms/m2.s). Model the atmosphere as a semi-infinite medium with boundary (the
ground) at y : 0. Then the density p(y,t) of atmospheric radon is described by the
equation
Do
02o
u-"aF' -)'P
where D is the appropriate diffusion coefficient and
boundary condition at the ground is
apl
d lr-
:
constant
:
I
is the decay rate for radon. The
-cY
What is the boundary condition at y --> oo? Use the Fourier cosine transform in y to
derive an integral expression for p(y, r) in the case that p(y,O): 0. Evaluate 0pl1t at
/ : 0, and hence determine a in terms of r and D.
Extra credit: Obtain expressions for p(0, t) and p(y, oo), and obtain p(y, r) as an
integral over /.
: I cm2 is initially at 15oC. At time / :
15. A long copper rod of cross-sectional area A
one end (at x
0) is placed into a vat of hot
-
0,
oil at 300'C.
(a) Refer to Chapter 3, Section 3.2.5. Write the equation lhat describes the change of
temperature at position x along the rod at time /.
(b) Write an expression for the temperature I(x) of the rod immediately after the end
is placed in the oil.
(c) Discuss
the use of Fourier and/or Laplace transforms in solving this equation. What
determines the best choice of transform for this problem?
(d) Find
the temperature of the rod as a function of position and time for
/ > 0.
(e) Given the following data for copper, plot the temperature along the first 5 m of the
/ : 0.5, 1.5, 3.0, and 6.0 s. Thermal conductivity: 400
heat: 385 J/kg.K; density: 8.96 kg/m3.
rod at times
16.
Wm.K; specific
A long beam is resting on an elastic foundation. The equation satisfied by the beam
displacement is
ur
d4v
lF
:
q@)
- ay(x)
4(x) is the load and q is a constant describing the elastic properties ofthe foundation. If the load is concentrated toward the center of the beam, then we may assume
that y -+ 0 as x -+ too. Transform the equation, and find Y(k) in terms of Q(k). Solve
for the beam displacement if
where
(a) q(x)
(b) q(x)
: Mg6(x - a)
: (MglL)[S(x - a + L/2) -
S(x
- a - L/2)l
PROBLEMS 355
e-t sinx, find
(a) the Fourier sine transform
(b) the Fouriercosine transform
18. For the function x€-o', where c is a real positive constant, find
17. For the function
@
(b)
m" Fourier sine transform
the Fourier cosine transform
thefunction
19. ShowthattheFouriercosinetransformof
lz t
xp-r for0 < p < 1is
D,t
,,1;*cos7r(P)
Hence show that the function 1 / Ji is its own cosine transform. Obtain similar results
for the sine transform. (The results of Chapter 2, Section 2.9 may prove useful.)
[Z0l Oetermine the form of Parseval's theorem (equation 7.10 and 7.ll) that applies to the
cosine transform.
21. The magnetic field in a conducting medium diffrrses away according to the equation
TH(x,t)
02H(x,t):
lto
ox2
at
Solve this equation by taking the Fourier transform in space. Find I1(.r,
netic field att :0 is a step function:
t) if
the mag-
(no it-d/2<x<d/2
|.0 otherwise
II(x.0) : {
Express your answer in terms of the error function (Appendix IX).
22. At t : 0, the distribution of chlorine in a pipe of water is given by
p(x,
O)
:
pos-'2/o2
t > 0 in terms
of the diffusion coefficient D.
E3.l Develop a three-dimensional version of the convolution theorem. Use the result to obtain
the solution of Poisson's equation
Find the distribution of chlorine for
V2(D
: -P(i)
60
f,
as an integral over space. Hint: lJse spherical coordinates to do the integration ou",
and use the principal value in the integral over k. Evaluate the resulting integral explicitly
if p(i)
:
46(i).
24. Find the three-dimensional Fourier transform of the charge distribution
p(i):
ooI!r
+7t
This transform is called the form factor of the charged particle.
356
25.
CHAPTER
Tiake the
7
FOURIER TRANSFORMS
Fourier transform of the three-dimensional wave equation
02s
ift-u'Y"s:f(*'t)
and solve for the transform S([, a.r). Show that the introduction of a damping force
(through the addition of a term u 0s /0t on the left-hand side) moves the poles off the
real axis. Invert the transform in the case cv -+ 0 for the cases
(a) (i, t) : s-rla61l), where r is distance from the origin
"f
(b) ,f (i, r) : 6(*)d(r)
26. At / : 0, the distribution of salt in a pipe of fresh water is given by
1\
p(x,o):pol/ sinax+ 4)
,*
Solve the diffusion equation (for example, equation 3.14 and Section 7.5.2) to find the
salt distribution at
t > 0 in terms of the diffirsion coefficient D. Hint: The result of
Problem 5 may prove useful.
27. Sum the
series
2p -11
i,-'," x2+(zp+t)2
p:0
by taking the Fourier transform of each term, summing the series in the transform space,
and then transforming back.
[2&l Use the derivative rule (7.6) and the symmetry property of Fourier transforms to evaluate the tralsform of x'. Check your result by ,inverting the transform.
29. The differential equation that describes the evolution of neutron density p is
opp
DY2P: S6(i)6(r)
Dt't -
-!_
r
is the neutron lifetime and D is the diffusion coefficient. The source is a point
source S at i : 0, / : 0, and the initial condition is p(i, 0) : 0.
Using any method of your choice, find the density p(i, t) of neutrons for t > 0.
where
CHAPTER 8
Sturm-Liouville Theory
8.1. THE STURM-LIOUVILLE PROBLEM
Many differential equations describing physical systems can be reduced to one or more
linear ordinary differential equations of the form
d/
dr\
@)E
dx \f
)
where u.r(x) > 0 on the range
conditions of the form
-
e@)Y
*
)"u(x)Y
:
(8.1)
0
o < x < b and the solution y(x)
qt
-l
dv
0 atx:
Ari
dx -
also obeys boundary
a
(8.2)
and
qzY
dv
* flz-i
dx -0
atx:b
We want to determine values of the constant ). for which there are nontrivial solutions
This is called the Sturm-Liouville problem. If
dyldx:
0 (Neumannconditions),otif
p
:
a :0
y(r).
the boundary condition simplifies to
0theconditionis y
:
0(Dirichletconditions).
The constants o and p cannot both be zero.
An example of a Sturm-Liouville problem that we have already seen (Chapter 4, Section 4.4.2) is the problem of waves on a string with fixed ends. Using separation of variables,
we reduced the partial differential equation to two ordinary differential equations
s2v
!+
+k2x:o
dxz
(8.3)
357
358
CHAPTER
8
STURM-LIOUVILLE THEORY
and
tI*k2u2T:o
dtz
with the boundary conditions in x:
X(0):X(L):O'
The differential equation for X is of the form (8.1) with /(x) = 1, g(x) : 0, and w(x) : 1.
Also, the boundary conditions have p; : 0, so they are Dirichlet conditions. The solutions
to this problem are a set of sine functions sin/cr with a specified set of values for k:
k : nr lL. These are the eigenvalues for the problem, and the resulting functions X"(x) :
sinnn x f L are the eigenfunctions. We also found that these functions have a property called
orthogonality (for example, equations 4.3 and 4.12):
- :
fL sinnirxsinmrx dx
a
,
J,
L
^
26^n
We shall see that all problems of Sturm-Liouville type have a similar structure-the general
solution is a linear combination of orthogonal eigenfunctions.
8.1.1. Orthogonalaty of the Eigenfunctions
First, let us show that solutions corresponding to different eigenvalues are orthogonal on
the interval (a,b) with respect to the weight function u.r(x). Suppose that there is a set
of solutions y^(x) to equation (8.1) called eigenfunctions, with corresponding eigenvalues
).., and that these solutions also satisfy the boundary conditions (8.2). Then for one solution
labeled
y.,
- s<aw t
* ('o'*)
)'^w(x)Y*
:
(8.4)
Q
while for a second solution labeled yr,
*(,*,*)
- g(x)y" l')'nw(x)y, -
(8.s)
Q
Now we multiply equation (8.a) by yr, multiply equation (8.5) by
term g(x)y-yn cancels, and we are left with
y^,
and subtract. The
,"ft(ratff) - ,** Q atff) t (),^ - )'n)w(x)v^vn : e
Next we integrate the whole equation over the range of interest, -{
:
a to
x
:
b:
(ratff)ldx-()"n-)"^) f,b '{*)Y^Y' a'
(8.6)
8.1 THE STURM-LIOUVILLE
PROBLEM 359
On the left-hand side, we integrate by parts. The first term is
o
,,* (ror*) dx: vnr(xr*u"-
b
dY'
,r*rdY^ o,
dx"'
dx
Clearly the integrated term vanishes if we have Dirichlet ()" : 0) or Neumann (dy^ ldx =
0) conditions at each of the two boundaries. In the general case, we may use the boundary
conditions (8.2) to eliminate the derivative yt at each boundary, giving
r(b)y,(b)(-;r-*r) - r(a)y,(a)(-;r-rr) - l"'
= -firrot,@)y*(b)
All three parts of this expression
+frf
are
<o)r,(a)ym@)
symmetricinm
- I,'
rofrffo.
,or*ffa-
and n. We get an
identical contribution
from the second term on the left-hand side of (8.6), and they subtract to give zero. Thus,
the left-hand side reduces to zero, and so the right-hand side must be zero too. There are
two possibilities:
o either
Lm: Ln and m:
whichl
means that
n
lm = !n,
oOf
w(x)y^y, dx
:
O
(8.7)
Equation (8.7) is the orthogonality integral that we set out to find. Notice that the eigenfunctions y*(x) are weighted by the function u(x). [In the case of the string problem
(equation 8.3), the weighting function is identically equal to 1.1
There are two other important cases in which the left-hand side of equation (8.6) reduces
to zeto. If the function f (x) has the value zero at x : a and x : b,then the integrated
term is zero no matter what the boundary conditions on y(x), provided that y(.r) remains
finite. We shall see that this is the case for Legendre's equation.2 Finally, if yy' f has period
(b - a), then the integrated term vanishes.
I
See Section 8. 1.3 for the degenerate case )"w
2Section 8.3.1.
:
Ln with m
I
n
l
l
l
360
cHAprER
I
sruRM-LrouvrLLE rHEoRy
When equation (8.7) is satisfied, the set of eigenfunctions yn(x) forms a complete
orthogonal set on the interval [a, b]. This means that any reasonably well-behaved function
/(-r) defined for a < x < b can be expanded in a series of eigenfunctions:
f (x):Donrn@)
(8.8)
n=O
where the coefficients an may be found, as we did with the Fourier series (Chapter 4), by
multiplying both sides of relation (8.8) by w(x)y^(x) and integrating over the range a to
b. Only the one term in the sum withm : /, survives the integration, and
um-
I! f @)v^(*)w(x)dx
(8.e)
Illv^@)lzw(x) dx
As we found with the Fourier series in Chapter 4, the sum converges to the function in
the mean; that is,
*ry,*
l,u [ro, -
L^r,or)
w(x)dx
:o
We can also obtain a useful relation, called the completeness relation, by inserting the
expression for a" (8.9) into equation (8.8):
f (x) :
fn:o I!
f t'')Y'(!')w(x')
dx'
In
,,rr,
where
7b
In: I
Ja
lyn(x)l2w(x)dx
Interchanging the sum and the integral, we have
yn@)yn(xt)w(x')
f(x): I,u ro'ri
-ax
In
,
,
l
361
8.1 THE STURM.LIOUVILLE PROBLEM
Since the quantity multiplying f (x') in the integrand exhibits the sifting property (Chapter 6,
equation 6.2),we may conclude that
i
yn(x)yn@')w(x')
:
In
n=O
6(x
-
x')
(8.10)
Equation (8.10) is the completeness relation for the set of eigenfunctions
yr(r).
8.1.2. Reality of the Eigenvalues
Even if the functions f (x), g(x), and u.r(x) are real, the eigenfunctions may be complex.
i" a solution to
Again equation (8.3) offers an example, since the complex function
"ikx
this equation. But, provided that the weight function is real and w(x) > 0 on (4, D), the
eigenvalues are real even if the eigenfunctions a.re not. To prove this result and demonstrate
the orthogonality relation, we take the complex conjugate of equation (8.1):
ft (r<aff) - sr'rf * Llw(x)vi:s
(8.1 1)
Using a method similar to that used in proving orthogonality, we multiply equation (8.1)
by yfi and equation (8.11) by y^ and subtract. Again the term in g(x) subtracts away, and
we have
,^ft (r<.t#) - *ft (ra>*) .
Now we integrate over the rarrge x
two terms by parts. We then have
(t-ratff
:
e to
- v;f @*)1,
x:
(^tr
- x^)w(x)v1v^:0
b, as we did before, integrating the first
: u, -
^D
I,
w@)vf,v^ dx
We can make the left-hand side zero with appropriate boundary conditions, as we did in the
case
ofreal eigenfunctions (Section 8.1.1). Then, if m : n,
0:
The integral is
(ln
-
),D
pb
I
Ja
> 0 if w(x) >
w|)y|yn dx :
0, since lynl2
)'n
and the eigenvalues are real.
t
,0
pb
(),n
-
),tr)
I
Ja
w@)lynl2 dx
no matter what the form of y,. Thus,
: )tl'
362
CHAPTER
8
STURM-LIOUVILLE THEORY
We also obtain the orthogonality integral for complex functions:
I"u
*'*""'n
: o'
dx
n+
(8.12)
m
8.1.3. Degeneracy
The above proofs fail if ),m - 1., for some m + n. Then we cannot conclude that the corresponding eigenfunctions y. and y, are orthogonal (athough in some cases they are). Ifthere
are N eigenfunctions that have the same eigenvalue, then we have an N-fold degeneracy.
In the case of double degeneracy, we can always construct two linear combinations of the
eigenfunctions so that these new functions are orthogonal. Degeneracy reflects a symmetry
of the underlying physical system.
Suppose that we are looking for solutions of equation (8.3) with periodic boundary conditions: y(0) : y(L) and ]'(0) : y/(Z). (Recall that this was the final case we considered
in establishing orthogonality in Section 8.1.1.) There are two eigenfunctions, sinknx and
eos knx , with the same eigenvalue'. k, : 2nn I L. In this case of double degeneracy, these
two eigenfunctions are orthogonal on the range x : 0 to x : l, as we established in
Chapter 4.
Homogeneity of space allows us to place the origin anywhere that we like. If we shift the
origin by an amount x6, then, for example, a new eigenfunction is
2nnxg
2nr(x-xo):sin 2nttxcos,j-cos
,
,
.
sin
2nnx 2nrxg
sin
,
L
which is a linear combination of the previous two functions. The eigenvalues remain the
ofthe system has not changed.
Example 3.15 (potential in a box) shows what can happen in two dimensions. In that
example, the equation for the functions of x and y was found to be
same, as they must, since the physical behavior
a2.f
. a2 f
ar'2+ uV+k'f
:o
The eigenfunctions are fnm : sin(nr x I I) sin(mr y / w), with corresponding eigenvalues
4- : 1n2 1t2 + m2 1w21t 2.If the box is square, I : u), the eigenfunctions f,^ nd f-n
rtt-" ttr" same eigenvalue, reflecting the symmetry of the square.3
8.1.4. The Sturm-Liouville Operator as a Self-Adioint Operator
We may write equation (8.1) in terms of an operator .C, where
Ly=
d/
E\f
dv\
rr>--=,
-
s@)y
3There are additional degeneracies here, since in some cases we can obtain the same value
values of m and n. See Problem 3 for another example.
^2
+ n2 with
several
8.2 USE OF STURM-LIOUVILLE THEOBY IN PHYSICS
so equation
363
(8.1) is
LY
-l )"w(x)Y(r) :
0
Then the left-hand side of equation (8.6) may be written
t;
We have already shown that
!nL!^ o*
-
I"u
y^Lyn dx
if the eigenfunctions y satisfy appropriate boundary conditions,
then this difference is zero, or
f"u
,'r,^ o, :
I"u
y*Ly, dx
The operator,C is then said to be self-adjoint.
8.2. USE OF STURM.LIOUVILLE THEORY IN PHYSICS
The Sturm-Liouville problem often arises in physics problems as a result of separating
variables in a partial differential equation. Thus, there are usually two or more differential
equations, each in a different variable, that are linked by a relation between the eigenvalues.
The general method4 of solution is as follows:
set of functions that satisfy each differential equation for arbitrary l.
which
coordinates have homogeneous boundary conditions (the function or
2. Determine
is
zero)
at both ends ofthe range. These conditions are necessary ifwe are
its derivative
to have a Sturm-Liouville problem.
3. Choose the eigenfunction that satisfies one ofthe zero boundary conditions'
1. Determine the
4. Choose the eigenvalue to satisfy the second zero boundary condition.
5. Repeat steps 3 and 4 for any other coordinates with two zero boundary conditions.
6. For the remaining coordinate, choose a solution that satisfies a zero boundary condition
at one of the boundaries. The remaining condition must be a nonzero condition if the
solution is nontrivial.
7. Write the general solution as a linear combination of the eigenfunctions you have found.
8. Use the orthogonality of the eigenfunctions to determine the unknown constants in the
linear combination.
Let's see how this plan works in an example.
8.1. Find the electrostatic potential inside an infinitely long rectangular
wave guide with conducting walls. The guide measures a x b. One of the sides of
length a is held at potential V; the other sides are grounded.
Example
4see also Example 3.15.
t
364
CHAPTER
8
STURM-LIOUVILLE THEORY
First we choose a coordinate system that fits the problem. With a rectangular system,
we choose Cartesian coordinates and put the origin at one corner. Then the interior
oftheguideisdefinedby0. x < a and0 < y < b (Figure8.1).Thepotential
is independent of z. The differential equation satisfied by the potential is Laplace's
equation:
:0
V2o
I
b
l_
'I
FIGURE E.1. The task in Example 8.1 is to determine the potential in the region O < x < a,
0 < y < b. The boundary conditions are that @ is zero on the sides at x : O, x : a,
y
:
b has potential V.
Separating variables, we let @
:
X(x)Y (y),
and
y
:
0. The side at
and the differential equation becomes
a2e a2o a2x
a2Y
-r
:
-r
: v
;---^;--T
dx' ;--.dy' ;--t,
dx'
dy'
Dividing through by
@
: XI,
we have
r
a2x
I
a2Y
xa7*Tarr:'
where the first term is a function of -r only and the second is a function of y only.
Thus, we can satisfy the equation for all values of x and y only if both terms are
constants:
I A2X
X 0x2 - -),
and
I
AzY
---A
Y 8,tt -1
l
8.2 USE OF STURM.LIOUVILLE THEORY IN PHYSICS
36s
We have chosen the same constant i, with the opposite sign in the second equation so
that the two terms sum to zero, as required.
Each of these equations is of Sturm-Liouville type with
w(x) : 1. The boundary conditions in.r are
f (x) = I, g(x) :
0, and
X(0):X(a):Q
making the problem in x (differential equation plus boundary conditons) a SturmLiouville problem. But the boundary conditions in y are
Y(0)
:0;
Y(b):
V
and thus are not of the correct Sturm-Liouville form.
Now let's follow the steps of the general method.
. By choosing the
Step 1. The solutions for X are exponentials of the form
"+t"fi,x
constant ). to be positive, we obtain complex exponentials, or sines and cosines that
are linear combinations of the complex exponentials. The y-equation is of the same
form and has real exponentials (or sinh r/-i,x, cosh JT'D as its solutions.
Step 2.We have O : 0 at two values of x, so we look at the functions of .t first.
Step 3. To satisfy the constraint X(0) : 0, we need a solution of the form sin kr,
where ). : k2 has been chosen to be positive.
Step 4. To obtain X(a) : 0, we must choose k so that ta is one of the zeros of
the sine function-thatis, ka : wr, ot k : nn /a, for some integer n. These are the
eigenvalues for the problem. Then the solutions for X are Xn : sinnnxf a.Tltese
are the eigenfunctions in the variable x.
Step 5 is not needed here.
Step 6. Now that the separation constant I has been determined-it is (nn la)2-the
equation for Y is
a2Y r nn t,2
Ar2: \;) '
and the solutions are real exponentials. The function Y must be zero at
appropriate linear combination that we need is the hyperbolic sine:
Y
:
sinh
/nfl
\
\;,
)
y
-
0, so the
Notice thatthe function I(y) contains the lengtha of the wave guide inthex-direction.
This happens because the equations are coupled through the separation constant that
was chosen to fit the boundary conditions in x.
Step 7. The general solution is a linear combination of the eigenfunctions we have
found:
O(x, y)
oo
nrfx
nTv
: ) .cn sin
sinh '
aa
n:I
-
366
CHAPTER
8
STURM-LIOUVILLE THEORY
Since each eigenfunction sin (nn x / a) sirr}r (nn y / a) is zero at the three boundaries
: O, x : a, and y : 0, so is the linear combination.
Step 8. The one remaining boundary condition is the value of O at ! : b:
x
V
:
oo
<D(x,
D
:rrn"inry
rinhno-b
n:laa
which is a Fourier sine series in x . We find the coefficients cn by using the orthogonality
of the sines. We multiply both sides by sin (mt x la) and integrate:
fa
mfix
/ V sin o
Jo
-dx
a /
mnx\p :
V- l-cos-ll
mft \
a
'/tO
/4 3
nfix nnb mnx
I ) cn sin a sinh a sin a
Jo7:,
-dx
3
nrb fa nrx mrx
) .cnsinh- / sin-sin-dr
a JO
The integral on the right-hand side is zero unless
:
14
o
3
,#f-cosmn
+ 1) : Icn
_lu
,.. : a
_ (_l)",1
;cm
m7fzQ
sinh
sinh
a
m,in which
n:l
V
a
case iteqaals a12:
nnbra
T (Zu,^)
mnb
_
Thus,
Cm: V
- (-t)-l
2lt
mn sinh(mnbla)
and the complete solution is
2vs.
O(x,y):-)1Tu
N:T
[1
- (-l)',]
n sinh (nnb
Since only odd values ofn contribute [1
for n odd], we may write n :2m * 7:
o(x.r,)-_4v3
)
2^2m
71 m:(l
/a)
- (-l)' :
sin
nftx
nftv
_
sinh '
AQ
0 for n even and 1
- (-l)" :2
(2m*l)ny
I
* |
|
(2m
I
.,
+ l;zx'l srnn
"
"
.-:OtT7W
| srnh
-
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL HARMONICS
367
. Let's see how this solution looks. Figure 8.2 shows the sum of the first ten terms
at various values of y, with b : 2a. Notice how the solution approaches the value
Q I V : 1 as y approaches b, while maintaining the value zero at the boundaries in x.
q
v
FIGURE 8.2. The solution for the potential with b :2a.The plot shows the first ten terms in the
series for @ versus x for several values of y. The dashed line represents Q(x , y) at
y : 0.5a; the dotted line, y : a; the dot-dash line, y : l.Sa;the heavy solid line,
) : l.8a; and the thin solid line, y - 1.9a.
8.3. PROBLEMS WITH SPHERICAL SYMMETRY:
SPHERICAL HARMONICS
Suppose our potential problem has spherical boundaries. Then we would like to solve the
problem in spherical coordinates. Let's look at Laplace's equation again:
^
vzo
la /"ao\ * r a / ao\+ I
: p;1";
) 1"* aa ('ine- ) 7R
a2a
e aor
:0
(8'r3)
We apply the same techniques that we used in the rectangular problem; only the details
change. We are looking for a solution of the form
E:
R(r)P(o)w(Q)
368
8
CHAPTEH
STUBM-LIOUVILLE THEORY
Substituting into the differential equation and dividing by O, we get
t a /"aR\
I
a/
aP\l
I
a2w
a,zu\'"*) + ,rriner, (ttn'r )A- wZRo aQ, =
To separate out an equation for @, we multiply the whole equation by 12 sinz 0:
sin2
o
o
R0r (r#)*sinB-L
I
ta2w
(""r#) -P + --:0
WA6t
Now the last term is a function of @ only, while the sum of the first two terms is a function
ofr and 0 only. Thus, ifthe solution is to satisfy the differential equation for a/l values of
r,0, and Q, each of these two pieces must equal a constant.
Often the region of interest is the inside or outside of a complete sphere. In such a region,
an increase of / by any integer multiple of 2n corresponds to the same physical point.
Thus, the function O must have the same value for Q - Qr and Q : Qr * 2n; that is, the
function W must be periodic with period 2r.We may achieve this behavior if we choose
the separation constant so that
I A2W
--:
W aqz
'l
--2
1
l
with m equal to an integer. Then the solutions are the periodic functions
( sinmd
W:1
or
w : eti^Q
lcosln@
The equation in
r
and d then becomes
a/ aP\l
e a /^aR\
l+sind-lsine-lR0r lr"
arl
a0\
ae/P -m2:o
\
sin2
Next, to separate the
r
and d dependences, we divide through by sin20 to get
| _
^' _n
.##('',#)
P sin2o-"
+*(r#)
The first term in this equation is a function of r only, while the sum of the last two terms is
a function of0 only. Again both pieces must be constant. The equation has separated:
t a I/.aR\
__
ROr\rz_0r/| -
k
(8.14)
369
8.3 PROBLEMS WITH SPHERICAL SYMMETRY SPHERICAL HARMONICS
and
I a/lsinp_l___*k:0
aP\l
m2
P
Ae
sinz
0
\
/
(8.
sin9 00
When working in spherical coordinates, changing variables to 1t
trick. Then dp, : - sin9 d0, and the theta equation becomes
d /
A
[(t -
m2
"dP\
u")
ou)
-
cos
I
is often a useful
*kP:o
r_ irP
rs)
(8.16)
Equation (8.16) is known as the associated Legendre equation. Let's begin our study ofthis
equation by tackling a special case.
8.3.1. Problems with Axisymmetry:The Legendre Polynomials
If the problem
a
has rotational symmetry about the polar axis, then the function W must be
constant (O is independent of @) and so m : O. Then equation (8.16) simplifies:
h(, - t,#)
*kP
:o
(8.17)
We can solve this equation by looking for a power series solution.5 The singular points of
the equation are at p,: *1, so we should be able to find a solution about /.r : 0 of the form
v:fo,p'
n:0
Substituting into the equation, we have
@@@@
D"@
n--0
- I)o,t n-2 -D"@ - l)a,p' - zDnanFn -l kl
n:0
where each power of
n:0
anp."
:
g
n:0
p must separately equal zero. The constant term in the equation is
2az*kao:0 * a2- -Loo
and the first power of trl has coefficient
3
x
2a3
-
2at + ka1 -
O
)
az
- or'J
'3 x2
For all higher powers, every term in the equation contributes. Looking at
p -f 2 in the first term and n - p in the rest, we find
(p + 2)(p
5See
Chapter 3, Section 3.3.3.
I
l)ara2
-
p@
-
l)ap
-
2pap
I
kao
-
0
p"P
, setttng n
37O
cHAprER 8 sruRM-LtouvtLLE THEoRY
and so the recursion relation is
Apl2:
Ap
p(p
-
l) +zp
-
p(p-tt)-k
t<
(p+2)(p+t)
o
(8.1 8)
(p +2)(p + t)
The first two relations we obtained can also be described by this formula with p : 0 and
p : l, respectivelY.
The solution we have obtained is valid for - 1 < l-L < 1, but the series does not converge
for p, - *1. This is a concern, since p - fl corresponds to I : 0 and & : -l to 0 : r.
These points are on the z-axis, where usually we do not expect the potential to blow up.
Thus, we need a solution that remains valid up to and including these points. We can solve
the problem by choosing the separation constant k so that the series terminates after a finite
number of terms. In particular, if we choose k to have the value
k:l(l+D
for some integer /, then according to the recursion relation (8.18),
aI+2:
and so every succeeding ao for
takes the form
al
t(t+t)-t(t+D :0
(t+2)(t+t)
p > I +2
d/
dp
\'
-tn-
is also zero. The differential equation (8.17) now
t,#) +l(l+l)P:0
(8.1e)
and the corresponding solutions are the Legendre polynomials
choose a6 (for even /) or al (for odd /) so that
hj-t).By
convention,
l
P;(1)
=
(8.20)
I
I
The recursion relation becomes
Ap!2:
p(p*1)-/(/+1)
Clp
(8.21)
(p+2)(p+t)
The first few polynomials are found as follows.
I : 0: The only nonzero coefficient is 46, which must equal
Ps(p)
:
l
I
I to make Ps(1)
:
1, 5q
(8.22)
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHEBICAL
HARMONICS 371
/ : 1: The only nonzero coefficient is at, and again we must take at
&(1) : 1. Thus,
h[-r):
I :2:There
tt
= |
to make
(8.23)
are two nonzero terms,
q2:ao(/-2x3\ --3ao
, ,/
and the subsequent an are all zero. Then
PzQD:as(l-3P2)
Evaluating this at pc : 1, we find
P2Q): a0e2):
1
* oo: -).
Thus,
pzQ-r):)ft*'-t>
(8.24)
Notice the pattern. We use the recursion relation to determine the nonzero coefficients as
multiples of the leading coefficient (as or a1). Then we evaluate the resulting polynomial
at p, : 1 and set the result equal to 1, thus determining the value ofthe leading coefficient.
Let's do one more.
I : 3: Applying the recursion relation (8.21) with / : 3, we find
/
hIl):ot(pr+
Evaluating at
p,:1
lx2-3x4,\
t"
3"2
)
gives
P3(r)
/ s\ :, ] a1- 3
: o' (.t
--;)
and so
PtQ.t):!rS*,-Z>
(8.2s)
372
cHAprERssruRM-LtouvtLLETHEoRY
The first four polynomials are shown in Figure 8.3.
P(tt")
l
j
j
1
l
I
!
FIGURE E.3. The first four Legendre polynomials. P6(g.) and P2Qt) are even tunctions; P1 (g.) and
fuQ.t) arc odd functions. All the functions are chosen to have the value I at p. - 1
(top right).
8.3.2. Solution for the Potential
Now that we have the function of d, let's return to the potential problem and solve for the
function of r. With the separation constant determined, equation (8.14) becomes
L (,r9!\:
a'\'ar)Solutions to this equation are powers of
r(/
{
r)R
r: rP, where
* (rT) : # (,'o,o-') : p@ + t)rp : t(t * t)rp
p - /; there is a second solution with p
/0 + 1), as required. We have
One solution has
and
p(p + 1) :
R:rtor+
: -(/ + 1). Then p I I :
-1,
(8.26)
8.3 PROBLEMS WITH SPHERICAL SYMMETRY SPHERICAL
HARMONICS 373
Thus, an axisymmetric potential may be expressed as
o(r. d)
:
f
(o,,, +
1)
r,ot
(8.27)
where the constants A1 and 8; must be determined by the boundary conditions in r.
8.3.3. Legendre Functions of the Second Kind
Noticethatbychoosingtheseparationconstantk: l(l+l)weforceoneofthetwosolutions
of equation (8.19) to terminate. The second solution does not terminate. This solution is
called the Legendre function of the second kind: QtQ-t). For example, with / : 0, the second
solution begins with a1 and contains only odd powers of trc. The recursion relation (8.21) is
Qp*2:ao
p
p(p+l)-O
1r*2yoag:api+2:ap-z
p-2
p-2 p
p p+2:ae-2 p+2
al
p+2
and none ofthe
a, is zero unless a1 is. Thus, the solution is
/ujus\
QoQt)--arlp*+++ +"'I
\35/
at. /l+p\
:ttn\t-r/
In this function, at is taken to be 1 so that
I /1+u\
eoo.t):r^\r_*)
(8.28)
This expression is valid for -1 < 1t, < l.
These functions Qt are not used in the solution of spherical potential problems, but they
do find uses in other situations. For example, they are used when solving potential problems
in spheroidal coordinates.6 Unlike the Legendre polynomials, which diverge at large values
6See Problem 13 inChapter 2 and Problem 9
in this chapter.
;
374
cHAprER 8 sruRM-LtouvtLLE THEoRY
of the argument, the Legendre functions of the second kind approach zero as the argument
tends to infinity. To obtain the limiting form for Qt(x), we solve the differential equation
in inverse powers of x. We find7
I
QrG) --> 't"-fo+1)
q;S
as
e*t
r
--+ oo
(8.2e)
8.3.4. Orthogonality of the Legendre Functions
The Legendre equation (8.19) is of the Sturm-Liouville form (8.1) with
fQD=t-t"2
SQr)
=
0
w(p) :1
and
The eigenvalue is ,1, : l(l + l). Even without any boundary conditions specified, except that
the function remain finite, the Legendre functions must be orthogonal on the range t- 1, ll
because
f (l): f(-l):0.
I*rt,,r*rr,,et)
dp.
:o
for t
I/
(8.30)
To make use of this relation in forming series expansions in Legendre polynomials (that is,
to use equation 8.9), we will need to find the value of the integral for I : l/. In the next few
sections, we shall collect some useful tools that will allow us to do that integral.
8.3.5. Properties of Legendre Polynomials
The Generating Function
Suppose we put a point charge 4 on the polar axis at a distance s from the origin (Figure 8.4).
Then the potentials at point P is
-^-
r
4T es
Q
D --
I
q
4T es
J?TF - zrs cose
TSee Problem I 1
8See, for example, Lea and Burke, Chapter 25, equation 25.9.
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL
HARMONICS 375
FIGURE 8.4. The potential at P due to a point chatge q on the e-axis at z - s may be expressed
in terms of Legendre polynomials using equation (8.27). It may also be written as
q
/4n esD.
whichwecanalsoexpressintheform(8.27).Weletx:rlsfotconvenience,andthenfor
r < s we can expand the denominator to get
-ct1q1
-
,fr
-
4ness
f-.7
J
4neos
Vt*"2-2;u
uf -
21611a
162
q (,_(*2-z*p)_Let/2\(-3/2\.., - "-...\
:_ G*
-2xtt)'* )
z * --T(x"
(t -
: #*('
: *t*
* "u -
*r, - 3r,2). )
(r +"r, (y) + x2 pzur)
which has the form (8.27) with A1
+...)
: q l(4n e6sl+1)
for each I and Bt
:
O.
Thus, we have
the identity
-oo
t-
\-
1/l-2xp,*xz
-
-./ n 2..'
(8.31)
-r:o
The function
G(x,
tr):
\fr
-z.nTp
(8.32)
376
cHAprERBsruRM-LtouvtLLETHEoRY
is called the generating function for the Legendre polynomials. We can use it to determine
several useful properties of the polynomials.
The Orthogonality Integral
We can obtain the integral we want by integrating the square of the generating function:
Il,'o,r,: l::
F;G?r.: l::f,
l=0
,,r*>i*,'' rurDo,
lt--0
: ii",*,' [*,t ptet)Py(p,) dp.
/=0
1/=0
The integral ofthe Pls is zero unless
G2 by achange of variable to u : I
oo
r+l
n*'
For
x<
I
J_,
/' (equation 8.30). Thus, evaluating
2x 1t
*
;lnl-":;lrt+
the integral
of
x2, we have
1
*
ll-x)2 6,
Ju*,j, ?
(l*x)2 1. 1*.r
:;Inl.
- -tn 1-x
O4
logarithm:
x2t+t
*3 xs
*T+"'+zt+t*
s
2/
:r(,
-
:
ProDPtQ.t)ap:
1, we may expand the
l+x
/
*t* h*
:
\
)
) ,2." I*,'
,,,rrr,e,)d*
Both sides of this equation contain only even powers of x. Equating the coefficients of each
power gives
hQt)hQt") dtt
(8.33)
which is the desired result.
Recursion Relations
Our next task is to find a set of relations between different Legendre polynomials. These
relations are known as recursion relations. We begin by finding a relation among polynomials with successive values of l: the pure recursion relation.9 First we differentiate the
9The name "pure" arises from the fact that this relation contains the eigenfunctions only and no derivatives appear.
HARMONICS 377
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL
generating function with respect to x:
AG:
ax
-@ - tt)
-zxpT6n
0
:
-(x 0
-
tt)
zr'tt +
x\t'
Rearranging, we obtain
(t-2xP'+*\Y:Qt-x)G
dx
We proceed by inserting the expansion of G in Legendre polynomials into this expression,
{1
\r
-
)t6
p*
"rl i/:o
txt-t ptet")
:
Q.t
- *)ir,
r1q"y
t:o
and gathering up in powers of .r:
Da *
r)xl+t pteD
The coefficient ofeach power
ulru
-
t:o
ofx
lht1l) -
pt1.r)
*\xtp1et1+Dtxt-r
t:0
:0
/=0
must separately equal zero; so, for
(21+ I)tthQr) + (/ + l)Pr+r(p)
Equation (8.34), the pure recursion relation, relates each
P1
/>
1,
:0
(8.34)
to the ones above and below
it in the sequence. It may be used to determine the P;s once the first two are known, and it
thus provides an alternative to using relation (8.21).
Next we obtain a relation involving the derivatives of P;. Again we start by differentiating
the generating function, this time with respect to trr,:
AGxx
W - Q-
zxlL +
t)'/' - (l -'rxlt
+
t)
Rearranging and inserting the expansion of G gives
(r
i/:0r'*' ri @l - i/:0"'*t
1
Pr
Q-t)
-
-t
2
2xp.
+ x2)\xl rioD
r, Pl QDI
+
:
PitQD
-
zt't'Pi Q.r)
/=0
il=0 *t r; q4 : o
Setting the coefflcient of each power of x to zero,we have, for
P:QD
: xD*' ,,(r)
/:0
*
/>
Pi+Jr")
1,
(8.3s)
378
cHAprER 8 sruRM-LtouvtLLE THEoRy
If we differentiate the pure recursion relation (8.34), we obtain
t
for
I>
ri q@)
1.
- (2t + r) Pt@) - (2t + r) rr Pi QD + 0 + t) Pi*rQtl :
(8.36)
s
Using this relation to eliminate P/*tfrom equation (8.35), we find
PrQ.t)
:
-
Pi_JtD
-
2pPlQD
t
o,
: 1+I I Pi-t}r)
- PtPiQt) t
and so, for
-
Pi-r@)
1
(2t
+
r)
hQD
l+l
- Qt + t)u'Pi0t)
2l+1
* H r',(r)
+I
I > l,
thQ.D: 1t Pifu)
- Pi_Jt')
(8.37)
Similarly, by eliminating Pi-r, we obtain
I Pt
0.0
:
(2t
+
r) Pt1D
a Qt *
(t
I
r)
t t,
Pi QD
t)hQt)
:
-
0
+
Pl+rQr)
- zltl Pi @) +
- pP{(p)
D Pi +rQD
which is valid for / > 0.
Then, adding equations (8.37) and (8.38), we can express the polynomial
terms of derivatives:
(2t
+
I)hQtl: riu@) -
P7
I
Pi{@)
(8.38)
entirely in
(8.3e)
Pi_rQ")
which is valid for I > 1. This expression proves useful in evaluating integrals ofthe P1s.
Next we can combine (8.37) with (8.38) to eliminate fi-t@).Letl --+ / - 1 in (8.38) to
obtain
t
pt_r1r)
:
ri@)
-
:
pi@)
*
(t !)
t-tpl_t1*)
h-tQ.D:
rrPrQ.D
-
p.l.pPi1D
-
I hQDl
(8.40)
'la>
Equation (8.40) is valid for I > l. This relation is called a ladder operator because it
allows us to step down the series in /. If, instead, we eliminate fiu@), we obtain the
step-up ladder operator:
(l
*
r) P11yQt)
: pPl+rjr) -
Pi @)
: trl(t + t) P1QD + p.PlQ.|l -
piQ")
S.3 PROBLEMS WITH SPHERICAL SYMMETRY SPHERICAL
h+rQ.t):
r.tpre.t)
HABMONICS 379
r,*,
- (#)
(8.41)
These relations prove especially useful in quantum mechanics.
The Rodrigues Formula
The Legendre polynomials may be expressed as
P1(x):
h#o' - r'
(8.42)
a relation known as the Rodrigues formula. To demonstrate the validity of equation (8.42),
let's evaluate the derivatives. First we expand the quantity (r2 - l)t using the binomial
expansion:
(r2
-
t)t
:
(-r)r (t - u,
=
(-
*l(l - t) *++... + t-rl, rr,)
lr
r)r
D{-t), ---J!-.*to
After differentiating / times, only those terms with2p
fiu'-
l)/
:,-,,'
r:,# r,rrr-rr'
: (-1)/
i
2 / remain:
#
r.#r"
1-t1,-SE!-{l*',-'
P)! dxr-t
P=l\t+1t/21 Pl(l I
: (-r)/ t 1-tyo2Jg!-2!)#,r,-,
p:t(t+r)/21
where [(/
+
1)
12] means the integer pan of
{r*'dxt '
r)'
(l + l) 12. Continuing in this way, we find
: (-r)1/! i
1-110*zr-t
**
Qp - l)lpl(l - p)l^p:t(t+t)/zt
(8.43)
Notice that if / is even, this polynomi alhas I 12 even powers of x, while if / is odd, it contains
(t + I) /2 odd powers of x. The ratio of the coefficient of xk+2 to the coefficient of xft is
380
CHAPTER
8
STURM-LIOUVILLE THEORY
found by first setting 2p
ctk+2
*:
- I:
k
*
2 and then setting 2p
- I:
k:
&+z+l)t
lI-k -\
&+Dt(L+/+2\
t
\
)t( , -t)l
(/+r+
0+r)t
2)(t+k.t,(?)
/k+l+2\
(k+2)(k+tt(
z
-k2 +t-k
: - t2(k+
l)(k +2):
/
k(k+1) -t(t+t)
(k+t)(k+2)
which is the recursion relation (8.21) for the Legendre polynomials. The numerical
factor in front of the derivatives ensures the correct normalization. We can easily check
itlo by looking at the first few polynomials (/ : 0 is trivial).
I _ L.1.
I
-
I dt
UIIN@"
ld
.
rd*@"
- 1)' :
- l) :x
which is correct.
, _4.
L-L.
**u'dxt'
r)r:
2tlt
:
h#o^ -zx2 +r): * fte*'-o,t
!B*, 2'
r\
which is also correct.
As a by-product, we can use equations (8.42) and (8.43) to obtain an explicit formula
for
P1:
nfur):
Or, setting
k:
p
(-1)r
-f,
- l/2for
t
I
p=l(t+1 )/2t
(2p)t
(I1n *2r-t
(2p-t)tpt(t-p)l
/ even gives
ho.+):+fr-ttor'o
act
losee also Problem ?.
(8.44)
(2k + t)t
(rc.
i),(i-
(8.4s)
r)'
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL
and setting k
:
p
-
(l
-
l)
/2for / odd
HARMONICS 381
gives
(+t)12
Pto4:--T f- <-rlkw, -'
-1-Y{t_r)/z
,
(8.46)
We can also use these expressions to evaluate &(0). For I odd, the polynomial has no
constant term, and so Pl(O) : 0. For I even, the constant term (ft : 0) in equation (8.45) is
&(o)
:
+fu^,?:
(-Dt/2#:
eDt/2(t
-
r)tt
(8.47)
8.3.6. Solving a Potential Problem: Fourier-Legendre Series
Example 8.2. A hollow copper sphere of radius a is divided into two halves at the
equator by a thin insulating strip (Figure 8.5). The top half of the sphere is held at
potential V, and the bottom is grounded. What is the potential everywhere inside?
FIGURE 8.5. The potential on the top half of the sphere in Example 8.2 is V. The potential on the
bottom half is zero.
Since V2O : 0 inside the sphere and the system has axisymmetry, the solution
must be of the form (8.27):
a(r,0)
:
f (o''' * ft) nra
982
cHAprERssruRM-LtouvtLLETHEoRy
We must set all the coefficients 81 Io zero, because nothing is at the origin to cause
a divergent potential. We find the coefficients A7 by evaluating the potential at the
surface r : ai
a@,0)
:io,o'
1:o
r,Q")
0
: {Y T1
"
ifo>pr>-l
.
to
Next we make use of the orthogonality of the P;s by multiplying both sides of the
equation by Py (t t) and integrating from - 1 to *I :
r*l oo
rl
I-t D,
1:o
PtQt)Pr'oD dp
Arat
J
: I
Jo
v P,Qt) dp'
On the left-hand side, only the term with I : // survives the integration. The integral
is given by equation (8.33). Then, dropping the primes, we get
nl
, 2
:,
t,"'h
Jo
Pr1Ddu
Now we need to evaluate the integral on the right-hand side. We can make use of
equation (8.39), which is valid for I > 0:
rtlfl
Jo
r'toap: ai Joei*'tul - Pillr)ldLL
Here the right-hand side can be integrated immediately:
fl
Jo
Since
r'ul
aP
I
: a?
[Pt+(t't)
- &-r(P)]lA
&(l) : I for all values of l,
I
fl
h@) d r' : - zt t[&+r (O) - &-r (0)]
+
Jo
: -fi-''+rror (r . +) - - Pr+l(o)
where we used the pure recursion relation (8.34) to express Pr-r(0) in terms of
Pl+r(0). Then
v2l+l
Ar:-2-;-Pr+r(0),
For the remaining case,
/:
I>1
0, we can easily do the integral, since P6(9,) - |
(equation 8.22):
Sl
I
Jo
pottDap:
nl
I tdp:r
Jo
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL
HARMONICS 383
oo:I
and
e(r.ot:il,
E'+Pr+r(o)
(L)'
Po"r)
The sum reduces to a sum over odd /, since Pl+r (0) : 0 if / * 1 is odd. Then,
use equation (8.47) for Pl+r (0) and set I : 2n * 1, the potential is
a(r
o)
: : ('.
:,-
u.
;ffi
#+,# (:)'.'
if
we
P,.*,u,))
The first few terms are
a(r.d)
'7 rr:,3
3r
: tVz\ (/ I + icosl cos6(5cos2d - 3;
;l)
+ll6\a)fl)t
"ol'(urcos4e -70cos2
e
)
+' tJ'ts)+
t
)
Figure 8.6 shows the first four terms in the series for the potential versus angle for
rf a:0.I,0.2,and0.5.NoticethatthepotentialequalsV/2attheequator
(0
:
r12)
for all values of r. (This should remind you of a similar property of the Fourier series:
The series for a discontinuous function always converges to the midpoint of the jump.)
polar axis
--I
I
,/
/q
,
r
- :0.5
^^ \
\
,t
\
\r
08
08
equator
FIGURE 8.6. The potential inside the sphere in Example 8.2 at several values of r: r/a :0.1,0.2,
and 0.5. The value of the potential @ / V at a specific angle 0 is given by the distance
of the curve from the origin.
384
CHAPTER
8
STURM-LIOUVILLE THEORY
8.3.7. The Associated Legendre Functions and Spherical Harmonics
Orthogonality of Associated Legendre Functions
When a problem does not have symmetry about the polar axis, we need a set of eigenfunctions conesponding to nonzero values of the separation constant m.Thenthe equation
for the function of theta is equation (8.16), where we keep the value k l(l + 1) for that
:
separation constant:
,,
" dP\
d/
,^- ^P+l(l*l)P:0
a(.(' -*")or)- l-p"
(8.48)
Equation (8.48) is of Sturm-Liouville form with /(lr.) : 1 - t-t2, g(l.t) : *2 /(1 - p\,
w(t-t) : l, and ). : l(I * 1). (Note that z is the eigenvalue for the phi equation.)
The solutions of this equation are the associated Legendre functions P1' (l). They satisfy
the orthogonality relation
l*r'
,r
r*r
r,
d
t','
:o
unless
/
: //
(8.4e)
z
is the same in both functions.
Alternatively, if we disregard the physical origin of this equation, we-may identify it as
a Sturm-Liouville equation with S(p) : -l(t * l), w(p') : ll0 - u'2), and eigenvalue
zr2. In this case, the orthogonality relation is
where the value of
'lY\dp:o
untessm:m'
(8.50)
where the value of / is the same in both functions.ll
Form of the Associated Legendre Functions
To obtain an expression for the function P( , we could solve in a series as we did for the
Legendre polynomials. But it is more efficient to relate the solutions P,n to the polynornials
we have already found. We expand out the differential operator in Legendre's equation (8. l9)
II
See Problem 19
for the value of the integral when m
:
mt
.
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL HARMONICS
385
and then differentiate the whole equation m times:
I-
u'>ffi -
r*#
+ Kt +r)P,
:o
,'tffi -r,t*ffi -r4u* +t(t + ufi:o
e - p\# - z, zwffi+ r/(/ + t) - 2 x ttffi :o
s-
Continuing in this way, we get
^ d-*2P,
(l - lt")
a,^+z -2(m *
1n'tt p,
dn pt
r)uiifr
+U(t + t) - m(m + r)lAi :o
(8.s1)
so the mth derivative of the Legendre polynomial P; satisfies the equation
(r
-
p2)y',i,
-
2(m
*
t)pyk +l/0 + t)
InAppendixVll, we show that the substitution
(r
-
t2)2" -2pz'
- m(m* r)ly, :
0
(8.s2)
l^ : (l - p2) ^/22(p) yields the equation
+,ltu+
1)
- {r1:,
and thus z(9,) satisfies the associated Legendre equation (8.48). Thus, we have shown that
Cz(p) : fifu) for any constant C. In physics applications, it is common to choose
f, : (-l)n. Then
P\QD: (-r)-(1 - p\m/2#Pt1')
(8.s3)
Clearly Pf : 0 for m > /, since the highest power of pc that appears in Pl is tr^r/. The
function & (p) is even if / is even and odd if / is odd. The quantity (t - t"2)^12 is an even
function of pr, and after m derivatives, the highest power in (dm /d lt^) hQ-t) is p/-'. Thus,
fi @) is even if I -l m is even and odd if I * m is odd.
Also, since the associated Legendre equation contains m2, the eigenvalue -la leads to
the same differential equation. Thus, Pfn must be a constant times Pln.It is convenient
to define
rl^@):
r-rrffiP(e")
(8.s4)
386
CHAPTER
8
STURM-LIOUVILLE THEORY
the appropriate solution corresponding to the eigenvalue
shown in Figure 8.7 and Table 8.1.
as
-m. A few of the functions
are
P(p)
l:2,m:2
o*,. l:3, m:2
--f
-------I:3, m:
I
\,:,
FIGURE 8.7. The first few associated Legendre functions
Figure 8.3. Notice that
Pf eD :
TABLE 8.1. Associated Legendre Functions
0 for m
Pf
+
P{.
For functions with
Qt)
m:3
m:7
l:0
l:l
t _a
I
-L
l:3
m : 0, see
O.
1
1^
l.L
1Qw',
u
-
1)
,(5w' -3)
2_
-irt*'-t)1/r-*z
3(l l51t
t,.2)
(I -
p.2)
-15(1
- ,zf/z
The Orthogonality Integral
Next we need to find the value of the orthogonality integral when
/
:
//. Using the definition
(8.53), we have
,,-: I_: pf o,t)pt ar: l*r'{t - t"\^{#{#
o*
8.3 PFOBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL
HARMONICS 387
We integrate by parts:
a^
fi^: (t - t)'2)^1^dp*
!'9) dpm-t
'1ltrll+l
l-r
f'1dp l,tL' - J-t
d*,'!t(,t"t
dp^ ) dp^-' o,
p2)^d':tYL)l
The integrated term is zero, and the derivative in the integrand is
!apl"lo - Pp\^u
!'!P1 :
' dpI
-2mtt(t
- -'
-
u2y-ta^ dl.r^
!r!t) +r\r(r
P4!)
t dpm+l
- *ury{i'
We can simplify this expression by using equation (8.51) with m --> m
- l:
dt-l Pt
.- p")1 d-+l Pt ^ d^ Pt:
(ll) m(m- l)laU^*, -2^p aU^ -II(I+ dt;_l
Thus,
11^
: lt(t+ l) - men
' '- /*' (r
' -l)l
Jt
:(l+m)(l-m*l)11,*-1
rr2)^-t
loii'il'
LdP^-t I
o
u
Now we may step down to get
I1*:
:
:
-lm)(I - ml l)(l -lm - l)(l - m -12)h,m-z
(/ + m)(I -f m - t)(l * m - 2)(I - m + 1)0 - m -l 2)(l - m I
(/ + m)(l * tn - 1) . . . (l + l)l .' . (l - m I 2)(l - rn -t 7) h,o
(I
:
3)
h,^-z
(t*m)t 2
(l - m)l2l -t l
where we used equation (8.33) for 4,s. Thus,
l*r' 'i'{D'1t'o':
(8.ss)
388
8
CHAPTER
STURM-LIOUVILLE THEORY
Spherical Harmonics
The general solution to Laplace's equation in spherical coordinates may be written as
e(r,o,r,:
(8.s6)
* E,(*^,,* #)rrrtoei^Q
[The previous result (equation8.27) is a special case of equation (8.56) witham
: 0.1 Next we define the combination
:
bI^
:
0
unless m
2l*l(l-m)l P{(p.)ei^o
=Yt^(o,Q)
4n (I * m)l
where the constant has been chosen to make the functions
l:r' lr*
Y1a(0.Q)Yf
^,(e.o)dQdtt
-
61v6^^,
I1.
(8.s7)
orthonormal; that is,
: I Y14@'Q)Yf
^,@,0)de
JsPhere
(g.5g)
The functions Yt*@, Q) ate called spherical harmonics (see Figure 8.8 and Table 8.2). They
find application not only in potential problems but also in the quantum mechanics of atoms,
wave mechanics, oscillations of spheres (for example, the sun), and the structure of the
cosmic microwave background.
TABLE 8.2. Spherical Harmonics
m:2
m:0
t:0
l=1
I
=3
m=3
I
J4"
,f ! "o.,
Y4n
- i)
'lE (1"",',
fT
/5 " 3\
1f *coso 11"o"e - z)
-^17
V8z
sins.i6
-1,/E"o.B.;oe"io
4Y n
-!l{<s-"te
-1)sinoeil
!,f3!
8V
! , fT
8V r
r
"in2s.zi6
.o"s
"in2
s"2io
,fT
tlr
"in3
e"til
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL HARMONICS
The defining equation for each hgure is
{Retyi(G,
il)}z
m:0, l:0
m:l,I:I
m:0, l:l
m: O,l:2
m:2, l:2
J
m:2, l:3
The firr
firstt few
plotted here.
ics, with
/
:
O,1,2,3; m
:0, l, 2, 3. The real part is
390
CHAPTER
8
STURM-LIOUVILLE THEORY
Recursion Relations for the
P{
Here we shall derive a relation between functions with the same I but differentm. Let's start
with equation (8.51) and multiply by (-1)n(1 - Lt2)*12:
0
:
(-t)z(1
-
p2)tm+2t/2{2
t
d*-*,
-
Pl
t)*(-t)*(t
\' - *zr^nd!+t
r /
"
dpm+l
2(m -tI t)ft\
L\t't
P:
1ll(t + t) - m(m+ 1)l(-1)-( t - uzr^ndi
' du,^
Now we use the definition (8.53) to get
0:
p{+2 -t2(m-r
Dl+FPi'+l
+ U(/ +
1)
-
m(m
*
t)lPin
(8.se)
Numerous similar relations12 may be obtained by differentiating the relations for the P1.
The Addition Theorem for Spherical Harmonics
Suppose y is the angle between two vectors * and i', as shown in Figure 8.9. The addition
theorem allows us to express the Legendre polynomial P1(cosy) in terms of spherical
harmonics in the angles e, Q, e' , and Qt that describe the vectors i and i/.
polar axis
FIGURE 8.9. The angle y is the angle between the vectors
l2Gradshteyn and Ryzhik has an extensive list. See also Problems
l5 and l6'
i
and
i/
8
3
391
PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL HARMONICS
First we note that rt P1(cosl) is a solution of Laplace's equation. Thus, using a ubai'
coordinate system with polar axis along the vector i', we have
l(l + ll
-)
Vir&(cos y) : -:;z:
Pr(cos Z)
Vfi, is the angul ar part of the Laplacian operator (the second two terms in
equation 8.13). Since P1@osy) is a function of the angles only, the first term in the
Laplacian (with derivatives in r) is zero, so we may add it back in to obtain
where
V2Pr("o*
D
:
(l+!&(cos
-l
rz)
Since V2 is a scalar operator, we may express it in any coordinate system. Returning to our
original system, we note that the solutions of
vrf@.0):-t!i!f(e,o)
are the spherical
harmonics Y1* with-l <
m< l. Thus, P;(cos y) must
be expressible as
asumofthese Y;r:
I
P1@osy):
A.(g',|')Yt*@,0)
D
(8.60)
m:-l
In principle, we can find the coefficients
o^ :
I
A-
inthe usual way,
p1@os
y)yf^@,
O)
da
(8.6r)
but this integral is not easy to do!
Instead, we turn things around and regard Yk(e, il as a function of the angles y and p
i in a coordinate system with polar axis along i'. We can expand Yf*(e, fi in
spherical harmonics in y and B, again with a single value of l:
that describe
I
Yk@,O:
t
B^,Y1^,(y,P)
(8.62)
mt---l
If we let
y
--> O,
i
moves onto the polar axis and we have axisymmetry, so only the m'
term remains.l3 Thus, as
d -+ 0t
hm Yh(e.
y+v
and
il :
Q
--> Ql
, wehave
Yf. (e' ,O')
.
47t
"Yr-(O,d)
:0
unless ln
:0,
because
Pf
0 :0,
:
BoYrcQ,
hQ)
Bs
as you should verify.
fl)
:
0
392
cHAprER 8 sruRM-LtouvtLLE THEoRY
(Of course
Y76
is independent of p.) Therefore,
uo:
lEurYf^(e"Q')
Alternatively, we can find the coefficients
B,
(8.63)
using orthogonality of the Y1^ in the (y, B)
coordinate system:
:
B*,
I
yk@, Q)Yh,(y, fl) dszy
In particular,
no
:
J
Yi^@.
ilYfo(y, p) dQy
'f
: Jr v|^re,ottfltt)-t
rt@osv)de,
l2t+t
: tl'#A.(0',0')
U 4/t
,
(8.64)
where we used equation (8.61) in the last step. Since we integrate over the whole sphere,
does not matter whether we use the coordinates 0, Q or y, fl as the integration variables.
Finally, we combine equations (8.63), (8.64), and (8.60) to get
&(cos z)
: -l
4n
F_,fi1Yr^<e'
ilYh (e'' Q')
it
(8.6s)
This is the addition theorem for spherical harmomcs.
Note the expected symmetry in the primed and unprimed coordinates. [It does not matter
which function has the complex conjugate, since the sum is over positive and negative values
of m, andYt,-^ : (-DmYk.lWe can also check the result by letting 0/ approach zero,in
which case the left-hand side is just Pl(cos 9). Verify that the right-hand side also reduces
to P;(cos 9) in this case.
In the special case y - 0, we obtain the sum rule for spherical harmonics:
I
I vm{e,il:2
m---l
2l +1
4n
(8.66)
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHEBICAL HARMONICS
393
The addition theorem is useful for performing rotations of the coordinate axes. When
combined with the generating function for the Legendre polynomials, it also allows us to
express the inverse distance I / l* - i'I in terms of spherical harmonics.l4 This is a particularly valuable result, because this inverse distance appears in the integral expressions for
electric scalar potential, magnetic vector potential, and Newtonian gravitational potential.
The orthogonality of the spherical harmonics often makes the integrals more tractable.
Boundary Value Problems Using Spherical Harmonics
Example
8.3.
Find the potential inside a conducting sphere of radius a with poten-
V onthehalf 0 < Q. r andzero onthehalf z . Q.2n.
The potential may be expressed in terms of spherical harmonics, as in equation (8.56). Since there is nothing at the origin to cause a divergent potential, we
tial
O:
exclude the negative powers of r. Thus,
ool
Q(r,0,d):
tl:0 m---l
I A6rtY1^@,Q)
To find the constants A1^,we evaluate the potential on the surface and set it equal to
the given expression:
-
e(a,0,0):I I
A6aty1^@
t:om:-t
ifo<Q<tr
ifr<Q<2n
^r:[v [0
To evaluate the coefficients, we use the orthogonality of the Y1-.We multiply both
@) and integrate over the sphere:
sides by Yf
-,@,
i o,^o' l"nnu"y1a(o,Q)yf^,@,o)da:,
i m:_l
Jsphere
I:O
Only one term on the left-hand side is nonzero. With I
l:r'
lo"
rr^,rr,Q)dudQ
: lt and m :
mt , the
equals 1. Then
A1,^,ql'
-
Y
rii' (D o*
lo "-i-'Q
Now we may drop the primes:
tA,
ttm
l4See Problem 20.
v
-
7
Pf0r)rr(#)
dO
integral
394
oHApTERBSTURM-LrouvtLLETHEoRy
The result is zero unless m is odd. Then the integral over LL is zero unless /
also odd, for then f( @) is an even function. Let's call this integral l1^.Then
v-
AI*:
The case I
ot
: m:0
1S
+ | (l - m)l L* 2 for/ odd andm odd
^l2I
4r
\l
e + m\1"* im
is special, for
f+l ft
I
I I Y6s : t/4tr
Ji
J -t Jo
-(2)(n):
grvmg
Am:
If m :
O
I t'
but
v
2
0, the result is zero because
f+t f+t I PoQ)hjt)dry:0
I nU"SdP:
J-l
J-r
(equation 8.30). So the potential is
e(r,o.Q):Yz."i
: Y, *,
We can demonstrate
combining the terms
ffi
,,
-
tt'",
bf:y
2l-ft(l-m)l 16. 2 Y1p(0,Q)
4n (l -f m)l tm
h- ? pi
G)"+
.
r,
b
^ry
t
-":;;
,"t,
Pf{
1p'vsi ^ao
:
ffi
:
ffi"'fie''
,,
*
h
+
) ei
-o
for a real physical quantity, by
Using equation (8.54), we have
as expected
withm: M andm: -M.
3
ot
ffi', * h'; * o")'-i
-
r,, trt@'*Q _ e-i MQ)
rutsinMQ
M Q
8.3 PROBLEMS WITH SPHERICAL SYMMETRY: SPHERICAL HARMONICS
Thus,
(D(r, 0,
a)
:
:. ; i
The result is clearly real and
e(r, 0. il
::.:
{It
t,
> r:l
+ffi;-ri
a;""r"rally
T'-tsin
d
@)
sinmQ
correct. The first few terms are
*, (:)' llt#tu,tu' -
1's 1/
t
-
fi
sinq
*ljf,"ttsin3esinro. I)
r3 sinosin@
:, (t ;o
* rze\ot
' "-\t[:1s.or'e-Dsindsin@
r.- --\r+
*5sin3 osin3Q. ])
We can use the addition theorem to show that this result agrees with the result of Example 8.2. Let's compare the two pictures (Figure 8.10).
FIGURE 8.10. Example 8.3 (on the right)
is
just Example 8.2 (on the left) rotated through 90o.
The result of Example 8.2 is
o::{'- I
where
;
y is the angle between the position vector i and the z-axis in the picture on the left
or, equivalently, between the position vector and the y-axis in the picture on the right. To
along the y-axis, so
relate the two results, we may use the addition theorem with vector
i 0':Q':r/2.
l
'#r,*,(o) (l)1arc",r,r}
i'
396
cHAprER
I
sruRM-LrouvtLLE THEoRy
Let's look at the first few terms:
P1(cos/)
:
cos
4nl3
t' :
rr
i L* "*
:!la,"ine)2cos
3 Lgo '"^"v/:lvr
,cos?
* l8trrin L2 "i11gpi{t-n/z) q r-i(Q-tr/zt1l')
1)l :
\Y' - 2/ )
/a
sin0 sind
and
P3(cos
7
/): +olt
t lGrz@)rt(o)+2G
But Pj"(0)
p3 (cos
:
0 unless 3
/)
:
*
g
^=L-ffiPiQ-t)pi(o)"o''(d
m is even, so there is no contribution from m
2l]" (;)'
G
r.r2 -1) sin
-;)l
e
(-
1) sin
d
* fi rra'
sin3
:2.Thus,
B
(-,
"
3d)]
: -(+(s"or', - t) sind sin@ + fr sin3 e sin&)
and the potential in Example 8.2, rotated into the new coordinates, is
a:I['- [
'+Pr+r(o) 1r)'ar"",nr]
: :{' -' (-;) (1)'i"e'i'o
-:tE Gf
::{t.tuG)sind
l-'
(*1tt
cos2
P't'
e -1) sing sin @ *
*1 "n'
3d)
])
sin@
. *(:)t[3(5cos2 e -
Dsin0sin@
r5sin30sin3{].
}
which agrees with Example 8.3.
Now let's try to get
a
general result. We insert equation (8.65) into the result of Example 8.2
and get
e (r, o, a)
:
:- " I
zff r+r10, (:)'
: : -,
n+&+r
(0)
(:)'
2,hr*@,
p,ffi
,i'
o)Yh
(;, +)
r*;ei^a ry {o)s-int'l 1z
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY BESSEL
FUNCTIONS 397
Again we note that Pi'(0) :0 unless I * m is even. But we also need I * I to be even so
/2 : i-m : 1-9@-t) lz 1 i:
that P711 (0) I 0. Thus, I and m are both odd, and
"-imr
A(r,0,0)
(0) (:)' t F_#rr rryei-a r( {0r#
- v2 -u fu,_,zr 2l+ r pr+r''"'\a/
Lt,(ll
m odd
'*o
.-
zl+r_I
-'
L
-, z l,_rS$
r'
I
-'#',*,(o)
|
L
1
^ (1- m)tpy(ot(_1,,(m-t\/2(r_:t
-Q)eD@-t\/'? (L)'
\a/ r((Dsinrn{
(t +
(t+m):
^).Pf
J
|
-Etfr,
rodd u odd
I
Compare this with the result of Example 8.3:
(b(r, 0,
a)
The results agree
if
::.:
(I)' rr rr, sinmQ
D P- +ffi,,^
,,-: llr' Pi@)dw
:' Pi'Q't)dw
(8.67)
lo'
:
r
Pr
+t (o)
Pi
e)l
;
g<^+tt
t
z
, l,modd
Thus, we have obtained a useful expression for the integral on the left. In Problem 8.21,
you will be asked to verify this result.
8.4. PROBLEMS WITH CYLINDRICAL SYMMETRY:
BESSEL FUNCTIONS
8.4.1. Bessel Functions
Bessel functions arise as solutions of potential problems in cylindrical coordinates. Laplace's
equation in cylindrical coordinates is (equation 1.69)
lalao\
la2(D
a2Q
iar\oao-)*FeF* urz:o
398
cHAprER
I
sruRM-LtouvtLLE THEoRY
To separate the variables, we let
<D
:
R(p)W(Q)Z(z).Then we find
I a / aR\
Rp ap
*
\oa )
1 A2W rA2Z
*o'w * z ar' :t)
The last term is a function of z only, while the sum of the first two terms is a function of p
and @ only. Thus, we take each part to be a constantls called k2.Then
a2z
arr:k
Z
and the solutions are
Z
: etk'
The remaining equation is
lalaR\tazw.
Rpapl'u=)*W uFtk':o
(8'68)
Next we multiply through by p2:
o3 /aR\
..
la2W
iA\'A)*0"0"* * uF:'
Here the last term is a function of @ only, and the first two terms are functions of p only. As
with spherical coordinates, we often want a solution that is periodic in @ with peiod2r,
so we choose a negative separation constantl6:
A2W
ffi:-^2w+w:eri-Q
Finally, we have the equation for the function R of p:
a/aR\
,*lr;)+k2p2n-*2R:o
To see that this equation is of Sturm-Liouville form, we divide through by p:
a/AR\
^
m2
^ lP^dp/l+k'Pn--R-0
p
dp\
(8.6e)
15Here we choose a positive constant (real k). The solution is rather different if the constant is negative, as we
shall see in Section 8.4.7.
16In this application, m is usually an integer. Noninteger values are also of interest; see, for example, Section 8.5.
In that case, the separation constant is usually written as -u2.
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL FUNCTIONS 399
Now we have a Sturm-Liouville equation (8.1) with f (p) : p, C@) : m2 /p, eigenvalue
)' : k2 , and weighting function w(p) : p. Equation (8.69) is Bessel's equation.
It is simpler and more elegant to solve this equation if we change to the dimensionless
variable
x
:
kp.Then
A / aR\ ^2
k_IkD_l+k"oR_2_4:0
akp V akp)
kp
d/dR\
m2
_lx_l*xR__R:0
dx\ dx)
x
The equation has a singular point at
x
:
(8.70)
0, so we look for a series solution ofthe Frobenius
type (refer to Chapter 3, Section 3.3.2):
R: xPio,*'
n:O
Then the equation becomes
oo
!t,
n--0
+ p)(n -f p - r)anxn+o-t
+D@ I
p)anvn-tP-r
n:0
+ io,"'+p+r
n:0
-
m2fanx'+n-r
:o
n:O
The indicial equation is
p(p-l)+p-^2:O+p:aa
Thus, one of the solutions (with p : m) is analytic at x : 0, and one (with
not. To find the recursion relation, we look at the k * p - I power of x:
(k
+ p)(k * p - I)a*+ (ft + p)ax + ak-z
-
-
m2a1,
and so
oo------o!-2-:(k*p)2-m2
ak-z
oo-'=
=
k2+zkp+ p2-^2
=
ak-z
-F**:-*1*+zny
O
p : -m)
is
400
CHAPTER
8
STURM-LIOUVILLE THEORY
p
will always
Let's look first at the solution with
the series with as, then ft
at1,
-l
*
-
2n(2n
-
23n(n
-
: *m. We can step down to find each ap.If
:2n, and
-l
2m) (2n
-2)(2n -2-l2m)
(- 1)3
-
r)(n
-
2)23(n
2n nl.
2n(n
*
m)(n
(-r)'
: AA"
we start
be even, k
* m)(n I
a2n-4
-
1)(n
(m
-l l)
m
* m-
2)
a2n-6
1
*
m
- l).
..
The usual convention is to take
(8.7r)
Then
(8.72)
and the solution is the Bessel function:
oo
/
1\n
J^(x)
r)
(;)-.*
(8.73)
The function J^(x) has only even powers if m is an even integer and only odd powers if m
is an odd integer. The series converges for all values of x. Although we have assumed m to
be an integer thus far, expression (8.73) is also valid when m is not an integer.
Let's see what the second solution looks like. with p : -m' the recursion relation is
Qk:
ak-2
k(k
-
(8.74)
2m)
where again k :2n. Now if m is an integer, we will not be able to determine a2^,because
the recursion relation blows up. One solution to this dilemma is to start the series with a2^.
Then we can find the succeeding coefficients a2@+m):
azu+m): -420-lt+zm
721naDn
-42@-z)+2*
24@+m)(n*m-l)n(n-l)
:
(-l)uf(z*1)
nlllna**1122nu'^
:
lm. (ComThese coefficients generate the same series that we had before in the case P
pare the equation above with equation 8.72.) Thus, we do not get a linearly independent
solution this way. This dilemma does not arise if the separation constant is taken tobe
-v2
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL FUNCTIONS 401
with v noninteger. In that case, the second recursion relation provides a series
is linearly independent of the first.
With u : m, an integer, we find
l-
^
(x
)
: i"
####
:S
(-t)'r(m -f l)2m
n:U
a2*
\ "tr1, 'r^ *,
and
if
762(n
t
m)
-
J-r(.r) that
m
/x\m+2n
"'* \r)
we choose
a2m:
(-D^
r@ + Dz^
then
J-^(x) : (-I)m J^(x)
(8.7s)
With this choice, J, (x) is a continuous function of u. (Notice that we can also express the
series using equation 8.72 for the coefficients, with ffi ) -ffi, n --> k * m, and a2n
=O
for n < m.)
We still have to determine the second linearly independent solution of the Bessel equation
when m is an integer. We can find it by taking the limit as v --> m of a linear combination
of ./, and J-, known as the Neumann function, Nr(x):
Nr(x):
Jr(x)cos vtr
sln
-
J-u(x)
vz
Thus,
-. J,(x) cos urr - J-r(x)
N^(x):limN,(x):lim
v+m
v+m
sln uT
e+0
:
lim
e+0
J^+e(x)
(-l)n sinen
- (-l)^ J-6a"1@)
efi
(8.76)
4O2
cHAprER 8 sruRM-LrouvrLLE THEoRy
where we expanded the trigonometric functions to first order in e. Next we expand the
Bessel functions in a Taylor series in u to first order in e and use relation (8.75):
N,(x)
: JTo *lt. +, *1"=,N-(x)
-
I
*'Tl,:,)]
_(_t)*Tl":_l
It_dJ"
" ldD
<-o^ (t-^
(8.77)
,--^
The derivative has a logarithmic terml7:
L (.,s
t-rl, r1)r'\
:
r,.liT
dv d, \ "-o
+ U \il )
"
dJv
: ,"'
$
dv L^
/x\2, )_r,4 i
L^
\;)
d, n:U
n:O
(-l)'
/xv2n
ntt(n+ , + rl (t,)
and
dx'
dv
d
dv
,ulnx
:lnxeuln* : x, lnx
has a term containing Jvlnx. This term diverges as t -+ 0, provided that
0 for u f 0,
zero (that is, for v : 0). The function Ny (.r) also diverges as x
because it contains negative powers of x. (The series for ,I-, starts with a term x-u.) But
Nr(.r) does not diverge as ,r -+ oo, because "/, -+ 0 faster than the logarithm approaches
and so
dlrldv
/, (0) is not
infinity.
TWo additional functions called Hankel functions are defined as linear combinations
of
,/ and N:
HP @) : J*(x) -f iN^(x)
(8.78)
H#)@): J^(x) - iN-(x)
(8.7e)
and
17This result should not be surprising. We could also find the second solution using the methods of Chapter 3,
Section 3.3.3, particularly in the form of equation (3.37). See, for example, Chapter 3, Problem22.
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL FUNCTIONS
403
8.4.2. "Properties of the Bessel Functions
The Bessel functions (Js) are well behaved both at the origin and as x + oo. They have
infinitely many zeros. All of the J^, except Js, arc zero at,r : 0. The first few functions
are shown in Figure 8.11.
f(x)
1.0
0.8
0.6
0.4
0.2
-0.2
-0.4
FIGURE 8.11. The first three Bessel functions. All the functions except
equal zero at -r : 0,
"16(-r)
and all of them approach zero as r -+ oo. All of the J^ oscillate with decreasing
amplitude.
For small values of the argument, we may approximate the function with the first term
in the series:
J^(x)
x
"- -,
(;)^
for
x <( I
(8.80)
The Neumann functions are not well behaved at x : O. N6 has a logarithmic singularity,
> O, N. diverges as an inverse power of x:
and for m
2
11" for x <<
(m-l\l /2\^
N^(x)ry--l_l
rr \x,/
N6(x) = T
(8.E1)
1
forx((
(8.82)
4O4
cHAprER E sruRM-LrouvlLLE THEoBy
For large values of the argument, both .I and N oscillate. They are like damped cosine or
sine functions:
TT / mn z\
J^(x)xV;"ot\., - o)
lT / rnn z\
N.(x)-r,l
"*sin(x- 2 4)
for
x
)) l,m
(8.83)
for
x
)) l,m
(8.84)
ror x
)) t,m
(8.8s)
Thus, the Hankel functions are like complex exponentialsl8:
se,2)
o
rl*" r[*,(, - ry -il
Noticethatifm>l,thelargeargumentexpansionsapplyforxlmrutherthantheusual
x)1.
8.4.3. Relations Between the Functions
As we found with the Legendre functions, we can determine a set of recursion relations that
relate successive J^(x). Starting with the series representation (8.73), we divide by (x /2)^
and then differentiate:
d
?r)nn (!\r"-,
\ :$
xm / ?:onlf (n*m*l) \2/
(2^J^(x)
dr \
t-tl'
(!\"-'
-$
L.@-l)!r(n*m*l)\2/
n=l
Now let
k:
n
- l:
d (2^J^(x) \ : _$
t-tlo
(L\ro*,
\2/
m*r*t)
dr \ xm /
futlr(k+
om @
(-l)k
x^ t-0 1rty11r 1* a
:_o sL_
: l8Refer to Chapter 2, equation (2.5).
2m
_Jm+t(x)
1x12k+m+l
a g \i)
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL
FUNCTIONS 405
and thus
d (J^(x)\_
J.+tQ)
E\,^ 1-which is valid for m
> 0.In particular, with m :
(8.86)
,-
0 we obtain
J1@): -J6@)
(8.87)
Similarly,
*'^
:
2^
(;)'^
ftk^, ^{,)t
*D"frffir,
n=O
* n)
?Dn(m
:2,
-' S
1x12m+2n-l
4"tf(r+^+D\r)
:,'i
(-l)'
(x\m+2n-r
\2/
l+l)
lnlf(n*mn=U
:
ftu^ r^{*)f
which is valid for m
x^
J^-{x)
(8.88)
> l.
Combining relations (8.86) and (8.88), we get
r^+r
r
Im-t
: -.^ * (#)
-,-
(-*h.
*
i
*o^
h)
+
r*(x)l
I (**^-'r^
+ *^
rl,)
:*b-th+^**Jh:2#U
Jm+t
I Jm-r:
2m
x
-T
(8.89)
406
cHAprERssruRM-LrouvtLLETHEoRy
which is the pure recursion relation for the Bessel functions. Similarly, by subtracting instead
of adding, we find
-dJ^
Jm+t-Jm-t:-2-;ax
The same relations hold for the Ns and the
(8.e0)
.F1s.
8.4.4. Bessel Functions as Solutions of the Helmholtz Equation:
The Generating Function
Like Laplace's equation, the Helmholtz equation (equation 3.16 in Chapter 3 extended to
two or three dimensions)
(v2+t2)o:o
may be written in cylindrical coordinates and separated. Suppose we look for a solution in
two dimensions, with @ independent of z. Then the equation becomes
1 a / aR\ I AzW ,.
Rpap\oA)* *7 aF+k':tr
which is the same equation (8.68) that we obtained from Laplace's equation after separating
out the z-dependence. Thus, the solutions are of the same form:
+oo
':
*E*o*J^(kP)ei^o
We may exclude the functions N^(kp) if the origin is within our region of interest.
As with the Legendre functions, we may exploit a simple physical situation-here a plane
wave-to obtain a generating function for the Bessel functions. A plane wave propagating
along the y-direction has the form
eiky
-
uikpsinQ
which must therefore be expressible in Bessel functions:
*oo
,ikpsrnQ
- t
a_, J^,(kp)ei^'Q
mt:-@
Now we make use of the orthogonality of the ei^Q . We multiply both sides by e-i*Q and,
integrate over the range 0 to 2r . Only the one term with mt : m survives on the right-hand
side:
lo2"
ei*o "ino-i*Q d.Q
:
2r a- J*(kp)
407
8.4 PFOBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL FUNCTIONS
:
0, we can evaluate both sides for p - 0 to obtain ao : l.If m t' 0, we expand
the function ,ikpsinQ using the exponential series and expand the Bessel function on the
right using its series (8.73):
With m
* I'",-'*o (r .|n#-),r
n*m.,
- "^ (ry)-
(T)"
I 0, the first term in the integral on the left-hand side integrates to zero. Now for
small ftp, we keep only the leading term on the right-hand side. Then we have
With m
*>=(Y)"
Look first at the case m
,+.
lr'"
iQ
'-imQf
-e-i')n
dQ:a-.#
: l. Keeping only the leading (n :
(+)^
1) term on the left-hand side,
we have
# Ir'" e-iQ@iQ -
e-iQ)a6:
fi<rn-
0)
: "rT +
at:
r
> 1, the terms with n < m ontheleft-hand side are allzerobecause leiQ - s-iQln :
(l - ,-2ib1n has no ei-Q lrormif n < m. (This must be true for consistency, because the
For m
ei"Q
lowest power of p in the Bessel function series on the right is p- .) Thus, the first nonzero
term has n : m.In the factor leiQ - ,-iQ1m, only the term ei-Q survives the integration
over @. We h e
(ikp)* |
ml(2i)m
['o
2n J11"-,-ar,^e
d6 =
sY
-
Thus, we have a generating function for the
,ikpsinQ
_
o_ t-f
l@ l)
qm
e)^ )a*-l
J*,
+oo
t
J^(kp)ei^Q
(8.e1)
and the integral representation,
(8.92)
408
cHAprERBsruRM-LrouvrLLErHEoRy
Next we write sin @ : (eiQ for the generating function:
e-i\ /Zt and let eiQ :
*p^12
l+ (,\ +)l
tll
8.4.5. Orthogonality of the
t. Then we have an alternative form
:f
l^(kr)tm
(8.e3)
m:-6
"I,z
Since the Bessel equation is of Sturm-Liouville form, the Bessel functions are orthogonal
if we demand that they satisfy boundary conditions of the form (8.2). In particular, suppose
the region ofinterest is p - 0 to p : a, and the boundary conditions are that Jm(ka) : O
and J.(0) is finite. We do not need a more specific boundary condition at
the function f(p) : p iszerc there. Then the eigenvalues are
p
-
0 because
dmn
.
nmn--
a
of J-. (The zeros are tabulated in standard references such
and
as Abramowitz
Stegun. Programs such as Mathematica and Maple can also compute
(8.6) becomes
Then
equation
them.)
where a^n is the nth zero
r * (k*^,
d p!!!#@1"
o
-, ^ o^, o, ofu#@|,',
:
(4n,-
t?*,)
fo oh&^,,p)J^(k*np)dp
The boundary conditions make the left-hand side zero, and thus,
fo
o
h&^,,
p)
J^(kanp) dp
if n
I
(8.94)
n',
:o
(8.es)
n : n' , we replace kmn, with an arbitrary value
(8.94) becomes
the
eigenvalues.
Then
equation
of k, not one of
To determine the value of the integral when
J*(ka)a
ffro^,Alr:o:
,o'
- &,t fo oh&*,p)
J^(kp) dp
l
Now we differentiate this expression with respect to ft:
dJ*(ka) dJ'-11r^,p11 :2k
a-o;ak^n
irk^np ro:o Jo["
+&2
I
pJ^(k^np) J^(kp) dp
- u^, Ir' pr^(k^,p)4!#O
dp
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY BESSEL FUNCTIONS
Next let
k
409
--> k^n. The second term on the right vanishes, and we have
k*nlall(k^na)12
:ro^,
fo
pU*(k^nil12 dp
and so
fo ou^&^ndl2
dp
: lvh{t^,o)l'
(8.e6)
which is the integral we need.le
8.4.6. Solving a Potential Problem
Example 8.4. A cylinder of radius a and height /r has its curved surface and its
bottom grounded. The top surface has potential V (Figure 8.12). What is the potential
inside the cylinder?
FIGURE E.12. The top end of the cylinder in Example 8.4 is at potential V. All the other surfaces
are grounded.
The potential has no dependence on @, and so only eigenfunctions with m :
0 contribute. The potential is zero at p : a, so the solution we need is Jo(/cp)
with eigenvalues chosen to make Jo(ka) : 0. Thus, the eigenvalues are given by
19See Problem
29
for the integral with Neumann conditions.
410
cHAprER
I
sruRM-LtouvtLLE THEoRy
kgna : agn, where don are the zeros of the function "16. The remaining function
must be zeto at z : 0, so we choose the hyperbolic sine. Thus, the potential is
a@, z)
:lc,Jo(ks,p)
:
Q(p, D
z
sinh (k6nz)
:
To find the coefficients cu, we evaluate this expression at z
V
of
:DcnJs(ks,p)
hi
sinh (ft0nft)
Next we make use of the orthogonality lf ,h" S"rr"t fonctions. We multiply both sides
by pJo(ko,D and integrate from 0 to a. Only the one term in the sum with n : r
survives the integration:
V
ra
fa
I oJoko, d dp : I
JoJon
pJo&o,dlcnJs(ks,p)
sinh(konh) dp
: c, Ifq p lJo&0, dl2 dp sinh (ko,h)
Jo
a2
: q7U
6ko,a)12 slrr'h (ko, h)
To evaluate the left-hand side, we use equation (8.88) with m
: l:
!' oJokddp: lfad
E J, ooo[kpJr(kp)]dp
Jo
: 1-a
i pJr&p)16:
So
,,:#Jr&o,^ffi
-
v2
k
,"
h(korq)sinh (ko,h)
J6: -h.
where we used the result from equation (8.87) that
,
cD
:
^,,
zv
Finally, our solution is
^-.S Jo(konp) sinh(ko,z)
: 2v
2 kr*tt&r*)tt"h (ftrlr)
The first two zeros of Js are aot
two terms in the potential are
\v
-JtGa)
:
2.4048 and
cu62
:
5.5201, and thus the first
( JoQ.a048pla) sinh(2.404821a)
\rAo48JtQn4q
{rnb QA048Ua) -
Jo(5.5201p/a) sinh (5.52Dtzla)\
(tsrolv") )
55201fi55201)
"1"h
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL
FUNGTIONS 411
These terms are plotted in Figure 8.13 for h/o - 2 and z/a: 0.5, 1, 1.5, and 1.8.
For zf a: 1.8, the first two terms do not represent the potential well. The three-term
result is also shown for Z /a : 1.5 and 1.8. While the third term makes little difference
at zf a : 1.5, it is vbry significant at zf a : 1.8.
o
n
two terms
three terms
p
a
FIGURE 8.13. First two terms in the series for the potential inside the cylinder versus radius for
h/a :2 and zf a: 0.5, l, 1.5, and 1.8. As 2/a increases, the series converges more
:
slowly and tbese first two terms do not represent the potential well at zf a
1.8hence the dip near the axis. The grey lines show the potential when a third term is
added to the series. At the smaller values of z, the extra term makes a negligible
difference in this diagram.
8.4.7. Modified Bessel Functions
Suppose we change the potential problem in Example 8.4 so that the top and bottom
:
of
the cylinder are grounded but the outer wall at p
a has a known potential V (Q, z).Then
we need to choose a negative separation constant so that the solutions ofthe z-equation are
trigonometric functions
:
a2z
-k2Z
02.2
:
:
+ Z:
asinkz
*
bcoskz
: 0. We also need Z(h) : Q,
nn /h.
This change in sign ofthe separation constant also affects equation (8.69) for the function
R(p) because the sign of the k2 term changes here too. The equation becomes
At z
O,
Z(z)
0, so we need the sine and therefore s€t b
so we choose the eigenvalue
k
:
a/aR\
^
^ lp^dp/l-k"pRdp\
^2 R-0
p
412
cHAprERssruRM-LtouvtLLETHEoRy
or, with variables changed to
x
:
kp,
m2
x
* ("#) -xR--R-0
(8.e7)
which is called the modified Bessel equation. The solutions to this equation are J^(ikp).It
is usual to define the modified Bessel function I^(x) by the relation
I
I*(x): ;J^(ix)
(8.e8)
so that the function f. is always real (whether or not m is an integer). Using equation (8.73),
we can write a series expansion for 1.:
ra(x)
t\4)
('!\^*'"
::
- im i4nr.f.=l:11* + l) \2 )
(n
n=U
m
m
I
I-(-r\: )
4nlf(n*m
N:U
As with the Js, if
z
is an integer,
1-'
I-^(x)
*l) (;)^.^
is not independent of
:- i* J-^(ix) :
I*;
(8.ee)
in fact,
I*(x)
The second independent solution is usually chosen to be
K^(x)
:
Lim+t P$) 6x)
(8.100)
Then these functions have the limiting forms for small argument:
r^(x)
x
i.,
(;)^
forr
(
I
(8.10r)
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL
FUNCTIONS 413
and
Ks(x)x-0.5772-t"t forx <
l(m\ /2\n
K^(x)*i
l;),
(8.102)
1
m>0, forx(
I
(8.103)
At large x , x >> I , m, the asymptotic forms are
+f,
r^(x)
x
K^(x)
x
(8.104)
and
7f
_e
2x
(8.105)
(Compare with Chapter 3, Examples 3.9 and 3.10.) These functions, like the real exponentials, do not have multiple zeros and are not orthogonal functions. Note that the 1s are well
behaved at the origin but diverge at infinity. For the Ks, the reverse is true. They diverge at
the origin but are well behaved at infinity (Figure 8.14).
Is
I1
I2
Ks
K1
K2
0
FIGURE 8.14. The first three modified Bessel functions. The functions Kn (.x) diverge
and the functions
1n
(-r) diverge as
r
--)
oo.
at the origin,
414
cHAprERBsruRM-LrouvrLLETHEoRy
The recursion relations satisfied by the modified Bessel functions are similar to, but not
identical to, the relations satisfied by the Js. For the 1s, again we can start with the series:
d (2*I^(x)\_S
1xlzn-t
d"\ xm )-2ntt(ntm+D\z)
Now let
k:
n
- I:
I
d (2mr^(x)\:s
(!\ro*t
d"\ xm / Aklr(ft+m*r-rt)\2/
d (I^(x)\ /,+r(x)
)cm
dx\ x- )-
(8.106)
Similarly,
:
ft{** r^l
xm
Im-r
(8.107)
Expanding out and combining, we get
I*(x)-mb:I^+t
x
4++:tm-l
,
Adding, we obtain
2Ik:
Im+r
I
Im-t
(8.108)
while subtracting, we get
2m
-x
Im
- Im-t -
Im+l
(8.109)
8
4
PROBLEMS WITH CYLINDRICAL SYMMETRY: BESSEL
FUNCTIONS 415
For the Ks, the relations are
fr{*^
*^)
-
-x* K*-l
and
d
dx
/ K.(x)\
K,+r(x)
*-
\"')
(8.1 10)
and, consequently,
2m
K^-l - Km+t:--Km
x
K.-t * Km+t:-2Kk
(8.1 1 1)
(8.1
l2)
8.4.8. Combining Functions
When solving a physics problem, we start with a partial differential equation and a set of
boundary conditions. Separation ofvariables produces asetof coupled ordtnary differential
equations in the various coordinates. The standard solution method (Section 8.2) requires
that we choose the separation constants by fitting the zero boundary conditions first. In
a standard three-dimensional problem, once we have chosen the two separation constants
we have no more freedom, and the third function is determined.
When we solve Laplace's equation in cylindrical coordinates, the functions couple as
follows.
Zero boundary conditions
J. (o^n
I)
where
J^(a*r) :
in
p.
?,"
The eigenfunctions are of the form
sinh a-n1
*
B^n
0. The set of functions J^(a^npf
set on the constant z surfaces that bound the
Zero boundary conditions in
"*
z.
u
a)eri'd
^,1) "+;*o
forms a complete orthogonal
region.
The eigenfunctions are of the form
(#) lA*, r^ ff)
The set of functions sin (nt zl
surface p : constant.
"osh
+ B*n K -
(T)1,.'-'
h)eti'Q forms a complete orthogonal
set on the boundary
Thus, in solutions of Laplace's equation, the ,Is in p always couple with the hyperbolic
p always couple with
sines and cosines (or real exponentials) in z, while the 1s and Ks in
the sines and cosines (or complex exponentials) in z.
416
cHAprER
I
sruRM-LtouvtLLE THEoRY
8.5. Find the potential inside a cylinder of height /r and radius a when the
top and bottom are grounded and the curved walls have a potential V (Q, z) : Vo for
Example
0<d<
n and-Vs forn < Q.2n.
As we discussed in Section 8.4.7 , the z-functions are of the form sin (nn z/ h) so
that the potential equals zero at z : 0 and at z : h.The Potential should be finite on
the axis at p - 0, so we exclude the K functions. Thus, the potential is of the form
Q(p, z, o
.oo+oo
:D
D o^^ "^ (T),*(T o) "'^'
n:lm:'6
:
Now we evaluate the potential at p
e(a,
z,t,
:
{l'*
41
'rlorlrr'.oro
::.f:*
An^
sin
(?
n(ffo)
,t^o
This is a Fourier sine series, and we find the coefficients in the usual way:
t.^f;r^(7")r":ro
loo
"^(T)0,(1," - f)e-i^,do
h
2t1 - (-l),,11
: Vo-(l
-cosrln)-f
lm
wf
]h
(
: ( v,wrmt,
[o
We must evaluate the terms with
A,0 :0' Thus,
:T
m
:
for n odd and rn odd
otherwise
0 separately. The integral over
i i'"
l=-la
(#)' ^
@
^:
\.i-o)'' *'
n I^(Tt)
1,=".?
Combining positive and negative m terms, we have
. /n!3\
Q(p.:.0\:ryi
.-
n=t,
Figure 8.15 shows the
i aP'r
m:t.
solution;";t"
a:
h/2.
r (wr ^\
sinmolm\ P)
'
m ,^(Tt)
is zero,
so
8.4 PROBLEMS WITH CYLINDRICAL SYMMETRY BESSEL
FUNCTIONS 417
o
V,
h
4
p
0
0.5
n
: h/2. This plot shows @(p)/ V6 versus pf h
The
dotted
line
represents
z : h/8; the dashed line, z - h/4; the solid
/2.
h / 2. The first three values of m and n are included in the sum.
FIGURE 8.15. The solution to Example 8.5 with a
at
0
:
Iine, z
tr
:
8.4.9. Continuous Set of Eigenvalues: The Fourier-BesselTransform
In Chapter 7, we approached the Fourier transform by letting the length of the domain in
a Fourier series problem become infinite. The orthogonality relation for the exponential
functions
+ [,^, (,7)
*p
(-iT)
ax
: a^n
becomes
r[*
2n
J-*
,ikx"-ik'x dx : 6(k _ k,)
That is, the Kronecker delta becomes a delta function, and the countable set of eigenvalues
nn f L becones a continuous set of values -oo < k < oo.
The same thing happens with Bessel functions. With a finite domain in p, say 0 < p < a,
we can determine a countable set ofeigenvalues from the set ofzeros ofthe Bessel functions
J., as we did in Example 8.4. If our domain in p becomes infinite, then we cannot determine
the eigenvalues, and instead we have a continuous set. The orthogonality relation
Io"
,^
("^:) n (,*e)
p dp
:
f;ul{o*,)t't,o
418
cHAprER
I
sruRM-LrouvrLLE THEoRy
becomes
(8. r
l3)
(The proof of this relation is in Appendix VIII.) Then the solution to the physics problem
is determined as an integral over ft. For example, a solution of Laplace's equation may be
written
as
e(p,Q,z) : I
,'^o [* A*(k)f (kz)J^(kp)dk
7Jo
where
/(kz)
exponentials
depends on the boundary conditions
,-kz
un6
in z. It will be a combination of the
"*kz.
The potential on a plane at z : O is given by the function
Find the potential above the plane at z > 0.
sin(pla).
Vo@/p)
Example
8.6.
Theappropriatefunction of zise-k',chosensothatO
V(p)
:
+ 0asz -+ oo(along
way from the plane). Then the solution is of the form
Q(p, Q, r,
:
+oc
nN
*\:*r'-' Jo
Evaluating O on the plane at z
:
A^(k)e-kz J*(kp) dk
0 gives
+oo
V(p,il: o(p,d,0) : l.
*a*
roo
"i'Q JoI
l,-(t)t^(kp)dk
Now we can make use of the orthogonality of the ei*Q. We multiply both sides by
e-i^'o and integrate over the range 0 to 2n. On the right-hand side, only the term
with
m:
nrl suryives the integration, and we get
Io'"
,rr,Q)s-i-'q dQ:2n
fo*
o^,{o)r^,(kp)dk
which is a Fourier-Bessel transform. Next2o we multiply both sides by pJ^(k'p),
integrate from 0 to oo in p, and use equation (8.1 13) to get
/^oo f2o
J, J,
v @, Q)s-ima d0 J^(ktp)p dp
: zo
: r,
20We also drop the primes onthe mt for convenience.
fo*
[@
Jo
fo*
o^{o)r^(kp)J^(k' p)p
3(k - k'\
e^&)*:dk
dp dk
A^(k'\
:2r:T
8.5 SPHERICAL BESSEL
FUNCTIONS 419
which determines the coefficient A^(k') in terms of the known function V (p , Q).
In our example, V (p) is independent of @, so only m : 0 survives the integration
over S,leaving
Ao(k):
ifka>t
' f*--a
'lallT
vs: sin Lro&doap:rovo{o
'
a
rt
Jo
-G8 k:a . r
k
(The integral is Gradshteyn and Ryzhik formula 6.671#7.) Thus,
a@,2):
voa2
:ro
[''" -L]o(kp)e-kz
Jo
1/l - (ka)z
f1
dk
x/
Jn'#n(*2)"-"/'a*
The solution for O(p, z) on the z-axis and at p
-
2a is shown in Figure 8.16.
o
Vs
1.0
0.8
0.6
0.4
o.2
z
a
0123
FIGURE 8.16. This plot of the solution to Example 8.6 shows
p
:2a.
@
(p, z) on the z-axis (P
:
0) and at
The integration was done numerically.
8.5. SPHERICAL BESSEL FUNCTIONS
8.5.1. The Wave Equation in Spherical Coordinates
In Section 8.4.1, we showed how Bessel functions arise as solutions of Laplace's equation
in cylindrical coordinates. Closely related functions-the spherical Bessel functions-arise
as solutions ofthe wave equation in spherical coordinates.
The wave equation (3.15)
I a2F _n
" _ __
V.F
,2 at2 -"
420
CHAPTER
8
STURM-LIOUVILLE THEORY
may be solved by first Fourier transforming in time. The wave equation is then transformed
to the Helmholtz equation (3.16) with P : 02 /c2 ,where a.r is the Fourier transform variable
(angular frequency ofthe wave) and c is the wave phase speed. For waves from a point source
or in some other spherical geometry it makes sense to write the V2 operator in spherical
coordinates:
I a /"aF\ *,,sinl L(rineiF) + r A2F +k'F :0
l;1";
) =l 00\ ap ) 7suzt aF
(8'l14)
Next we separate variables, as we have done before. First we write F : R(r)@(d)O(@),
multiply the whole equation by r2 sin2 9, and divide by F, thus isolating the @ dependence:
sin2
e
a
R0r
(r#).##("",#)
The last term must therefore be a constant, and if our region of interest is 0 < Q < 2r , then
we must choose the constant tobe -m2 so that the solutions are periodic with period 22.
Then
Q
: sinmQ,
cosmQ
and, after dividing by sin2 d, equation (8.114) becomes
ta
Eu
("#)
tk2r2.
#*# ('"#) - k
:o
Now we have separated the equation completely, since the first two terms are functions
of r only while the last two are functions of 0 only. The last two terms are set equal to
us the Legendre equation (refer to Section 8.3, equations 8.15 and 8.48)
with solutions Pf (cos 0) and Ql @os 0). The remaining terms give the equation for R:
-l(l + 1), giving
*# t'#) lk2r2-I(l+1):o
We proceed by letting
(8. r 1s)
n = Z /"fi.Then
dR
dr -
IZ
213/2
'
Z,
rt/2
-J--
and
d
d,
So the equation
/"dR\ : rz
ll; ) -;;E
+ rt/2 z' + 13/22"
for Z is
-ii,
1vt/2zt
+
r3/22" + k2r3/22
-
r(t
+
t's
,,,1-
:o
8
5
421
SPHERICAL BESSEL FUNCTIONS
Dividing by rr/2 and gathering terms, we have
d
/
t\22
*(rz't*k'Zr-(,*ri --0
which is Bessel's equation (8.69) of order /
that we need is
+
I12. Thus, the solution of equation (8.115)
: !*''lo
R(r)
This function is called a spherical Besselfunction.The usual normalization is
irk):
T7
(8.1
\f 2*Jr+rn@)
l6)
Thus, the full solution to the wave equation in spherical coordinates is of the form
F
(r, 0, Q,
t)
:
j1(kr) Pi
@os 7Seimf
r-itcct
(8.
l 17)
We may define a similar "spherical" analog of each Bessel function. For example, the
spherical Neumann function is
n1(x): tf
fiN+tp1,1
In the large
argument
(8.1 I 8)
limit, the spherical Hankel functions (refer to Section
8.4.1,
equations 8.78, 8.79, and 8.8 i) have the form
rtlr){rrr)
=
rEry -'#
: (;,1+t{
asr ->
oo
(8.'e)
Thus, solutions of the form
F (r, 0 , Q
'
t)
:
n[t)
1t
r)
r1' 6os g)ei-Q e-ikct o ,ik(r -ct)
correspond to outgoing waves. Since the wave intensity is proportional to lF l2 , and I F | l/r forlarge r, this result corresponds to the usual inverse square law. The second Hankel
function 7@ (kr) describes incoming waves.
Properties of the spherical Bessel functions follow in a straightforward manner from the
properties of the Bessel functions. For example, the recursion relations (8.89) and (8.90)
422
cHAprER
I
sruHM-LrouvrLLE THEoRy
become
*
it+r@)
yr-r (x)
:
2t+1
+t/2)..
2(t
.
(8.120)
xx
-/t(x)
-j,(x):
and
jr+r(x)
-
jr-t@)
: - 2d
: - 2d
a*Jt*tpG)
.F *1,/x7@)1
6
- -"djr$)
dxx-
jr?)
We can simplify this relation by using equation (8.120):
(21* t)ljt+r@)
(l-t
jr-t(x)l
:
-2(2t + gditr?)
ax
I) jr+r@)
-
tjt-r@)
-
:
-
-(21*
Ur+r(x)
*
"r)-r(x)l
DT
(8.
r21)
We may also write the spherical Bessel functions in series expansions. For example, using
the series (8.73) for h+t/2, we find
i)'
1x1t+!+2"
: ,- ---J-*
irGT:,Fi
\r)
" V k;rT;+T+T
jt@)
In particular, for
I
:
t
_Ji L^nlf
2
N:U
(-l)'
(n-tl*
\
1r
f
(8.122)
0,
io@):+fi##t
Let's simplify the
(;)'.^
(;)^
function:
.(,*'u):(,.j) .(,.;) :(,*;) (,-;) .("-:)
: (. * :)("-;)("-) j.(;)
:s#!r':W#:W#
(8 123)
8.5 SPHERICAL BESSEL FUNCTIONS
423
and so
rrr (n +
7l
\/ -(2n
:):
-l l)l
22n+l
Thus,
sln.t
,,b(x):L^#{r,^
(8.124)
x
In fact, each spherical Bessel function may be written in terms of sines and cosines. For
example,
jt@):
jz(x)
sin
x
----;-
-
cos
.x
(8.125)
x
l\: (/3
lsinx\r'--- x/
3
(8.126)
1 cosx
(See Figure 8.17.) As / increases, the expressions become more and more complicated.
f
(x)
1.0
0.8
0.6
0.4
o.2
0
-/
16
-o.2
FIGURE 8.17. The first few spherical Bessel functions j6(x),71 (x), and 72(x). Compare with
Figure
8.
I
l.
424
cHAprER
I
sruRM-LrouvrLLE THEoRy
For large arguments, the expressions become
simplell:
1 /
T\
j"(x) - -sm(\.r-"r)
for
x
)) I,n
(8.r27)
8.5.2. Orthogonality
Equation(8.115)isof Sturm-Liouvilleformwith
the weighting
function w(r) :
f(r):-12,g(r):l(l+ l),f.:
k2,and
12. Thus, the orthogonality relation is of the form
f"' ,' 1,{t r)
11(k'|r)
dx
:o
(8.128)
unless fr : t', provided that boundary conditions of the form (8.2) are satisfied. As with
the regular Bessel functions, since /(0) : 0, we have
fb
,2
Jo
.irtt
r) jr\k'r) dr :
o
provided that the boundary conditions (8.2) are satisfied at r : b,regardless ofthe boundary
conditions, if any, that apply at r : 0. (As always, we do require that the function remain
finite when its argument is zero.)
For k : kt : qt+tl2,pf b,where dr+r/z,p is the pth zero of J+t/2, and therefore also of
7), we may use relation (8.96) to obtain
pb ^
rz1r{trl1r(kr)dr
Jo
rb2
: ;T fbrJ1,412(kr)J1+rp(kr)dr : ;;tJ|*rp@t+,tz.)12
J,
But
.
i'(x)
: tlfnZeJ' J,:
lit
,,1
J
Zt xy2
,lT (''.
I
: Tir
ur' - ;i
,,1
*)
Thus, here we have
7b
Jo
,,i,ro,,i,(kr)dr
:
:
2lCompare with equation (8.83).
n b22kb
;7'i
jt@+tp,il12
*;##)
liir",*'/2.il
1.,,
f;tiiro,*r12,))2
(8. l 29)
425
8.6 THE CLASSICAL ORTHOGONAL POLYNOMIALS
where we used the fact that dr+r /2, p is a zero of n. We can write the result in terms of the
functions themselves rather than the derivatives22 by using the recursion relations (8.120)
and (8.121):
fn'
,,
i,{*ii,&r)
dr
:
(8.130)
f;tj,*r{o+t/2,))2
Spherical Bessel functions find particular application in the study of electromagnetic
waves (see Jackson, Chapter 9) and also in the quantum mechanics ofa particle in a spherical
cavity (Problem 55).
8.6. THE CLASSICAL ORTHOGONAL POLYNOMIALS
The functions we have studied in this chapter have several common features:
.
They are orthogonal on an interval
example, relation 8.30).
.
.
.
They satisfy a set ofrecursion relations (for example, equations 8.34 through 8.41).
[a,b]with
respect to a weight function ur(x) (for
They may be computed from a generating function (for example, equation 8.32).
They may be computed from a Rodrigues{ype formula (for example, equation 8.42).
These properties are coflrmon to a larger class of orthogonal polynomials that arise in
physics problems. They are defined by the generalized Rodrigues formula:
cp(x)
11dn
- K" r@ dr"lw(x)snl
(8.1
3
1)
u(x), the weight function, is real, positive, and integrable on the interyalla,bl;
s(x) is a polynomial of degree < 2 withreal roots; and the product rus satisfies the boundary
where
conditions
w(a)s(a): u(b)s(b) :0
In addition, the first polynomial Cr(x) is a first-degree polynomial in x. The constant K,
serves to normalize the polynomials. [Notice that the Legendre polynomials satisfy these
conditionswith
22See Problem 48.
?r(r):1,s(.t) :
x2
-l,a:
-l,b:1,
Kn -2nnl,andC1
(x):1.]
426
CHAPTER
8
STURM-LIOUVILLE THEORY
We shall begin by showing that the function
defined by equation (8. 13 I ) is a poly-
C
"(x)
nomial of degree n. To construct the proof, we use the symbol
pr(x) to represent
any
polynomial of degree < k. Then
:
ffi@r" rD
(8.132)
u)sn-m pk+m
where ppy^ is another polynomial of degree < k I m. To prove this result, we start with
the definition of Cr from equation (8.131) to obtain
wK1C1:,# *
whereKrCr
that
r# + ,# :.(*rr, - #)
-dsldx isapolynomialof degree <
1.
(Remember:Wehavealreadyspecified
Cl is a first-degree polynomial.) Then
pil : d,' ,n po + nw!s'-t
!@r"
dx
dx
clx
rr, + wsndlk
ax
:,(*rr, - #)u-'rx + nwfrs"-'po +.u*
=
u,sn-t
l("'t'
+ (n -
"#)rr
*'#f
Sinces(x)isapolynomialof degree <2,dsf dx isof degree< l,anddppldx isof
degree
< k - l, the term in brackets on the right-hand side is a polynomial of degree < k +
d
fttws'P1r'1
:
1:
wsn-t Pk+l
[Given specific forms for the functions u(x), s(x), and a polynomial pp, we could use this
relation to construct a specific form for the polynomial p*+t, but we don't need to do that.l
We can continue to differentiate until we obtain the result (8.132).
Setting n : m and k : 0 in equation (8.132) and using the result in equation (8.131),
we obtain
Cn(x):
Pn
that is, Cn is a (yet unspecified) polynomial of degree < n. This polynomial may be written
as a polynomial of degree < n - l, with the possible addition of a term in xn:
Cn
:
pn-r
I Ax"
Next we shall show that A is not zero, and thus Cn is
a
polynomial of degree n.
(8.133)
427
8.6 THE CLASSICAL ORTHOGONAL POLYNOMIALS
To show that A is not zero, and at the same time establish the orthogonality property
the polynomials Cr, we shall first show that
f"u
n.{*lc^(x)u.'(x) dx
:o
for m
<
n
of
(8.134)
for any p*(x). We begin with the defining relation (8.131) and integrate by parts:
rb
I
J"
p^@)Cn(x)w(x) dx
:
lfbdn
i
J" P^(*)*;l*(x\snldx
tb
rh
tn-l
tn-l
\
| /
: nlo^t*>fr,-tt*t')"11,
- J" o^-,@h*J G)s\dx)
where we used the result that the derivative of a polynomial of degree < m is another
polynomialof degree < m-7.Result(8.132) showsthatallthederivatives(d- f dx^)(us')
withm < n vanishattheendpointsr : a andx: b of theintervalbecause u.rs does, and
thus the integrated term vanishes. We may now continue to integrate by parts, reducing the
order of the polynomial in the integrand by one each time, until we obtain
b
p^(x)C"(x)w(x) dx
:
< n, and the next integration shows that the integral (8.134) is zero.
Now we multiply equation (8.133) by C"w and integrate:
for m
l"u
{r,)'w(x)dx:
f"u
o,-rc,wdx *
:o*A
f"u
t
fob
*nc,*d,
*'c^rdx:1,
where we used result (8.134) with m : n - 1 to eliminate the first term. The left-hand
side is > 0, since the integrand is positive throughout the range of integration, so A must
be greater than zerc, and Cn is thus a polynomial of order n. Finally, we set pm : C* in
relation (8.134) to obtain the orthogonality relation we seek:
I,'
r-orr,(x)w(x)dx
:o
form < n
Properties of some of the polynomials used in physics are listed in Table 8.3.
(8.
l3s)
428
cHAprERosruRM-LrouvrLLETHEoRy
TABLE 8.3. Classical Orthogonal Polynomials
u(x)s(r)abKn
-|
-1
e-"
I
-oo +m
xue-"
r
Irgendre
I
Hermite
Laguene
x2
o
|
!t-i-
I
nl
+oo
+l
-l
2n
(-l)'
1l
Ji2"nl
f(n*u*l)
nl
(-2\nnl ,ffi
Jacobi (l-.r)v(la.r)r l-x2 -1 +l
Tchebichef
2
2nnl
+l
(-l\nr
(Zn\l
2nn!
tt
2
-
PROBLEMS
fl
fina the eigenfunctions for the one-dimensional Helmholtz equation
d2v
7j
+ k'Y
:o
subject to the boundary conditions
)=0
atx
:0
and
!' :o
at
x
:
L. (This corresponds to a vibrating string with one fixed end and one free end.)
2. Find the eigenfunctions for the Helmholtz equation
d2v
7fi
+ k'Y
:o
subject to the boundary conditions
atx
:0
al *
bYt
:Q
*
Fy'
:0
and
ay
atx:
L.
429
PROBLEMS
Determine specific forms for the functions, and obtain the eigenvalues for the following
CASES:
(i) a:aandb:F
(ii) aL:bandF:2aL
You should be able to obtain numerical values for the product
kl.
3. The displacement of a square vibrating membrane of side L satisfies
the two-dimensional
Helmholtz equation
02s 02s
arr+ arr*k's:o
where
k : olu,
rr.r
is the frequency, and u is the speed of waves on the membrane.
x : 0, L and y : O, L. Separate variables
and solve for the eigenfunctions s(x, y). Show that the system exhibits degeneracythat is, that there is more than one eigenfunction corresponding to a given eigenvalue
k2. In particular, show that there are two eigenfunctions s1 and s2 that correspond to the
eigenvalue k2 : 5n2 / L2 .What symmetry of the physical system causes this degeneracy?
(Hints: Where are the nodal lines for the two modes? What happens if one side of
the membrane is slightly shorter, equal to L - e?) Any linear combination of the two
eigenfunctions is also a solution. Find some of the nodal lines for combinations of the
modes-for example, .rt + s2. How do these modes reflect the symmetry of the system?
Can you find an eigenvalue that has threefold degeneracy? If so, what do those modes
look like?
Suppose the membrane is fixed at its edges at
set of eigenfunctions y,(x) satisfies the Sturm-Liouville equation (8.1) with boundary
conditions (8.2). The function g = 0. Show that the derivatives un(x) : y'n@) are
also orthogonal functions. Determine the weighting function for these functions. What
boundary conditions are required for orthogonality? Apply your results to the Legendre
equation to determine the orthogonality of the derivatives fi@).
@A
5.
Use the recursion relations in Section 8.3.5 to show that the derivatives Pi
(f)
of the
Legendre polynomials are orthogonal on the range (-1, 1) with weighting function
0 - 1"2), in agreement with the results of Problem 4.
6. To obtain Fourier-Legendre series, we often need to evaluate integrals ofthe form
,f
:
lo'
tr"
h1t)
dpt
(a) Start by evaluating tf , tl, I[, and Ii.
(b) Next use the recursion relations for the polynomials in Section 8.3.5 to determine
recursion relations for the integrals f . Multiply equation (8.37) by ptn andintegrate
by parts to obtain
(c) Use
a
relation between
ti nd li-rt
.
these results to step down until you can use your results from (a) to obtain an
430
CHAPTER
8
STURM-LIOUVILLE THEORY
explicit expression for
If.
Show that
nl
(n-l)ll(n+/+1)!l
t-n-t (l-n-2\ll
(-l)---rn!'
(n*/*l)!!
0
if n > I
If n <
I
andl
-n is odd
ifn<landl-niseven
7. Yeify that the Rodrigues formula gives the correct normalization for every Legendre
polynomial. Hint: Wite (r2 - t)t : (x - l)l(-x + 1)', differentiate, and evaluate the
resultatx:I.
@
Evaluate the integral
f+l
t
PtG)
J-r Jl - xz
dv
for the function I l\n-'.
and hence obtain a Fourier-Legendre series
cursion relations in Section 8.3.5 may prove useful.
Hint: Tlte re-
9. Write Laplace's equation in oblate spheroidal coordinates (refer to Chapter 2, Problem I 3), separate variables, and hence show that the solution requires Legendre functions
in both the coordinates z and u. Argue that the solution exterior to an oblate spheroidal
boundary requires the use of the Legendre function of the second kind, Q.
10. Expand the Legendre function Qo@) for large values of the argument, and show that
your result agrees with the asymptotic form in equation (8.29), modulo a constant.
11. Rewrite the Legendre equation
d
d. (o -
l1|
*tff) *,0 + t)et :
o
in terms of the variable u : I I x, and obtain a solution as a series in u. Show that for
latge x, Q7(r) goes to zero as If xt+t. Show that for / : 0, the solution Qo@) may be
written as in equation (8.28) but with x - I in the denominator instead of 1 - x.
Cunent flow in aconducting sheetis describ-edby the relations fr : -VO ana j: ofr.
Use the charge conservation law (0pl0t + V .j : 0) to show that, in a steady state, O
satisifes Laplace's equation. Find the eigenfunctions for current flow in a circular copper
plate.
Current 1 flows into and out of a plate of thickness t and conductivity o through
r: a extendingfromd :7t -yl2ton 1-y12 andfrom I : -y12to
*y /2.Determine the potential, and plot the current flow lines.
electrodes at
13. A solid sphere of radius a is immersed in a vat of fluid at temperature 16. Heat is conducted
into the sphere according to equation(3.14).If the temperature at the boundary is fixed at
Zo and the initial temperature of the sphere is Ir, find the temperature within the sphere
as a
function of time.
14. Use the Cauchy formula together with the Rodrigues formula to write hQt) as a contour
integral in the complex plane. Take the contour to be a circle of radius \62=, and
PROBLEMS 431
hence obtain the integral expression
I fr r
r1
hA): - Jo lx + t/x2 -
tcosQ)
dQ
15. Starting from the relations in Section 8.3.5, derive the following recursion relations for
the associated Legendre functions:
:
(a) (/ - * + Dt/TlEPl-t
Pi\1 - wPi
(b) (2t + D\/T=EP{-t Pilr - Piir
:
Hint: Start with the pure recursion relation and differentiate.
(c) From relations (a) and (b), derive the following:
(2t
+ I)p.Pf @) : (t -
m
* r)Pi\+
0 + m)PLt
16. Starting from the definition (8.53), obtain the m.rusing recursion relation for the associated Legendre functions:
-^-+Pi1n
- JTl: dP eitu)
t/l - lt'
r;"+t{u):
Combine this result with equation (8.59) to obtain the m-loweingrelation
(t
*m)(t -m
t
t)Pi'-'
: ,f,4hPi' - mJ+EPf
17. Use the results of Problem 15 to show that, for I + m even,
+ m - r)lt'
Pf Q) : eD(t+D/2(l
(t - m)tt
lE.
Show by direct substitution into equation (8.15) that P#(e) o< sin- d. Use the value
the orthogonality integral (8.55), together with the result
rn /2
["''
Jo
,inz^+t
o
dg
:
of
Qm)ll
Qm 1- l)ll
(for example, Gradshteyn and Ryzhik formula 3.621#4), to show that
P#@)
: (-t1m94
2mml,
"inm
s
This relation, together with the z-lowering relation (Problem 16), may be used to generatethe Pf .
19. Verify the result
If+t
Jt
I (t r m)l
' Ut)12
dLr:-l-p' = ' m(l -m)l
Lpl"
432
CHAPTER
8
STURM-LIOUVILLE THEORY
for the second orthogonality integral in Section 8.3.7 in the cases
(c)l:m
(a) l:m:l
(b)l:2,m-l
(d) Stepping down in m, vse proof by induction (Appendix III) to show that the result
is true in general.
[ZOl Using the generating function G(x, tD (equation 8.32) and the addition theorem (8.65),
derive the expansion
-ml
#u : [,t, h#'^''
ilYfi^ (e"
6'7
where r= and r, are the lesser and the larger ofr and r/, respectively.
Hence find the magnetic vector potential due to a circular loop of wire of radius a that
is carrying current 1. (Refer also to Problem 6.19.)
21. Verify the result (8.67)
I,'
where
/
Pi
@)d w
: (- 1)("+r),'tT r,*rfo) Pf Q)
and m are both odd.
(a) First evaluate the Legendre functions to show that
l-2)lt(l*m-t)tt
_
f+t
It^:
I PiQt)du:
, ,i,, +,(l-m)ll
\r'' ' -mn )i
t
(+t)tl
J-t
(b)
I,m
odd
Use the expression for Pff (0) from Problem 18 to show that the result is true for
m : I . Hint: Use contour integration.
(c) Show that the result is true for m : I and / equal to any odd integer.
(d) Use proof by induction (Appendix III) to show that the result is true for all m, | <
m < I, with both I andm odd. Hint:Usethe result from part (b) and step down in m.
22. Find the electrostatic potential inside a hemisphere of radius a with potential O
the flat side and Q : V on the curved part.
:
0 on
23. Quantum mechanical treatment of the harmonic oscillator results in the Hermite differential equation
y,,-2*y,*ly:0
Write this equation in standard Sturm-Liouville form.
If
the boundary conditions are
y(x) -+ 0asx -+ foo,showthatthesolutionsareorthogonalontherange(-oo, *oo),
and find the weight function u.r(x). Solve the equation to find a series expansion for the
Hermite functions. What value of the eigenvalue l, is required for the functions to remain
bounded throughout the interval, including x -+ Iec? (Hint: Experience with Legendre
functions should prove useful.) Normalize the solutions by choosing the coefficient of
the highest power x' to be 2n , and hence determine the first three eigenfunctions.
PROBLEMS 433
[Z+]fne generating function for Hermite polynomials is
G(x.t): '-t2+2'r: i l"r,t,
7=on!
Use this generating function to establish a pure recursion relation for Hermite polynomials (analogous to equation 8.34 for Legendre polynomials). Also obtain the derivative
dH"ldx in terms of the Hn (analogous to equations 8.40 and 8.41).
Differentiate G(x, t) with respect to t a total of n times to obtain the Rodrigues-type
formula
Hn@):
(-l)'r*'#{"
25. By using
the Rodrigues formula (Problem 24) for the Hermite polynomials or otherwise,
obtain the normalization integral:
/n+oo
I ,-"lHn(x)12
J--
dx
- 2"nt.Ji
26. Starting with relation (8.86), derive the Rodrigues-type formula for Bessel functions:
Jn(x)
/ I dY
- *' (\ -:ilxdx/
Jo(x)
27. A drumhead is a circular membrane of radius a. When it is struck, waves propagate
across the drumhead. The membrane vibrates with displacement f , where f (r,
ry1r,0)e-i't and ryQ, d) satisfies the Helmholtz equation
0,t) :
la (-an\-7flzn
;u\'u)+7aertk"q-s
: ,2 lu2 and u is the speed with which waves propagate across the drumhead.
(The speed u depends on the tension in the drumhead, among other things.) The boundary condition is that 4 :0 at r : a. Separate variables, and find the eigenfunctions.
Determine the first three allowable frequencies co in terms of the drum parameters u
where k2
and a.
28. Sound waves propagating through a tube may be described by a velocity potential (refer
to Chapter 2, Section 2.4) that satisfies the Helmholtz equation
/
a.,2\
"
(o'*7)*:o
where c, is the sound speed in the tube. Assume that for propagation along the length of
the tube (in the *z-direction), the potential may be written
Q : Qteikz
434
cHAprEB 8 sruRM-LrouvrLLE THEoRy
(D1 is a function of the transverse coordinates (x and y or r and 0). Because the
air cannot move perpendicular to the walls of the tube, the boundary condition is
where
-ao
fr.VO : =- :0
0n
on the boundary surface
Write the differential equation and boundary conditions satisfied by
the eigenvalues and the set ofallowed frequencies a-t if
<D1
and hence find
[a)] tne tube has a rectangular cross section measuring a x b
(b) the tube has a circular cross section ofradius a
In each case, show that there is a minimum frequency for waves that propagate along
the tube with <D1 not constant.
29. If y.n is the nth zero of Jh@), show that the Bessel functions satisfy the orthogonality
relation:
F
Jo"
t^
/ o\
(^,*) l.
/ o\
o2 /
(y*ol) o ao : T
lt
^2\
- ;)
u^ (y.)t23,t
30. Use the generating function (8.93) to show that
(a) sinx
(b)
@
:2D
n:o
Jz,+r(x)
1:,/o(x)+zitz,@)
n:l
31. Use the generating function (8.91) to show that
+co
Jn(x*y): t
m:-@
J*@)Jn-^(y)
,
and hence show that
Jo(2x)
: t& @)+ z ir-rl
^ J3@)
m:l
bi]
(Hint: Use the orthogonality of the complex exponentials.)
Starting from the Bessel differential equation, show that
[@ J^(x)Jn@)
. 2 sinl(m -n)r/2] fornr *rr > 0
'
' dx:
I
---#
X
It n7'-n'
JO
and that this result equals ll2n when ltl : n. Does this result constitute
a second
orthogonality relation for the Bessel functions? Why or why not?
33. At time / : 0, the surface of the water in a pond has the form s(p, Q,0) : AJo@d.
By taking the Fourier transform of the wave equation with two spatial dimensions, find
the displacement s (p, 0 , t) at later times.
PBOBLEMS 435
34. (a) Show that
[@l
I e-ox Jn(x\dx ::
Jo
Ja2+r
Hint: Use the integral expression (8.92) for Jo(x) and perform the integral over x
(b)
first. Do the integral over the angle using methods from Chapter 2.
Use the same technique to evaluate the integral
e-o*
fo*
J^(*) d*
35. Show that
*t roro*, a,
fot
: (t - 1) rro
if, instead, J1@) : Ql
(other
than zero) of the Bessel function J^(x), xr, 1 , is an
36. Show that the first zero
increasing function of m-that is,
If Jo@)
:
0. What is the value of the integral
I0,1 <xl,l<XZ,l<Xj,I
and so on. Hint: Use relations (8.86)-(8.90).
@ e pendulum has steadily increasing length /(r) :
/o
*
cvt. Show that the equation that
describes small oscillations of this pendulum is
10"
+2a0'
*
g0
:0
Change variables to tt : (1 -f at I lslr/2 and y : u0, and hence show that the general
solution may be expressed in terms of Bessel functions. Find the solution if the pendulum
is released from rest at an angle 0g at t :0 (that is, when I : ld.
38. The equation that describes the angular displacement of
a vertical pole or column is
d2o
trl7
* P,xO :o
where x increases downward from the top of the pole, E is theYoung's modulus, 1 is the
moment of inertia (see also Chapter 3, Section 3.2.3), and L is the mass per unit length.
Make a change of variables to
2fil"^
':;,,1fr*'' v : Ji
e
and hence show that the solution may be expressed in terms of Bessel functions. Show
that there is no solution that fits the boundary conditions 0 (L) : 0 and d/(0) : 0 unless
the pole has a minimum length 1161. Find an expression for L,;n in terms of the physical
parameters of the pole.
436
cHAprERssruRM-LrouvrLLETHEoRy
39. Establish the addition theorem for Bessel functions:
.Io(kR)
:
J*(kp)J^(kpt)
^t*r'^,
where
n
: Jfi
@,)2
-
2ppt cosQ
Hint: Begin by expressing I/l* - i'l as an expansion in Bessel functions, as in
Section 8.4.9, and set 3 : z' : O and 0' : 0. Evaluate the constant A.(k) using the
result of Problem 34.
FO.l Starting from the definitions (8.100), (8.76), and (8.78), show that
Ku(x)
n l-u(x) - Ir(x)
- ,
,i"r"
where u is not an integer. Hence, show that
41. The potential on a plane is V6 for p < a
K-u@)
and zero for
:
Kr(x).
r > a. Find an integral expression
for the potential everywhere. Evaluate the integral with
p:
0 to find the potential on
the z-axis.
42. A cylinder of height h and radius a has the top and bottom grounded. The potential on
the wall at p - a is Vs. Find the potential inside the cylinder.
43. A cylinder of height ft and radius a is grounded except for its base at z : 0. On the base
the potential is Vs f \/ I - p2 / a2 . Find the potential inside the cylinder.
E4.l
(al
Use the series for "Io(;) to show that its Laplace transform is
LtJod.)l:
(b)
-L
Jl*sz
Then use the recursion relation (8.87) to find the Laplace transform of Jr (x). Extend
the result to show that
LIJ-(x)l: 1uffi_gm
'JP
+l
Compare with Problem 34.
(c) Use the convolution theorem to establish the relation
['
JO
,or,
-
u) Jo@)
du
:
sinx
45. Obtainexpression(8.126)for jz@) fromtheexpressionsfor jsand j1 andtherecursion
relation (8.120).
46. Starting with the recursion relations (8.86) and (8.88), derive the relations
a (it@)\_
7+t@)
E\-;-)----7-
PROBLEMS 437
AIId
d
t+l
ftt
ir{il:
xt+t
it-r@)
lZTl Use proof by induction (Appendix III) to establish the Rodrigues-type formula
jn(x)
:
(_t)n xn
(: *)" (Y)
48. Use the recursion relations to show that the orthogonality relation (8.130) is equivalent
to (8.129).
49. Starting from the definition (8.76), show that
n1
(x)
: (- 1)l+l j-rl+ri (")
Hence show that
ns(x)
- -cos'r
x
50. Starting from relations (8.111) and (8.112), establish the recursion relations for the
modified spherical Bessel tunctions kt@)
: JT/iiKlayp@):
h-r-kt+r:-('zJlf\k,
\r
/
and
(t
i
t)k1a1-t tk1-1
:
-(2t + lft*t<x1
[it--] rne Fresnel integrals
s(x)
: ,E
I'
sint2 dt
and
may be expressed as series of spherical Bessel functions. First show that
I f*2
s(x):hJ,
Jiio@)au
and obtain a similar expression for
C(x). Use the recursion relation to do
and hence establish the result
6oo
S(x)
: \l:.D
I J'
n:l
Determine a similar expression for C(x).
jzn-t@2)
the integration
438
CHAPTER
8
STURM-LIOUVILLE THEORY
52. Sound waves in a spherical cavity satisfy the differential equation (V2 + t<2) F : 0 for
r < R with 0F l0r : 0 at r : R. Find the eigenvalues k, for the problem and hence
find the allowed frequencies @n : knu for sound waves inside.the cavity.
53. Electromagnetic waves in a spherical cavity may be described by a mathematical problem
similar to that described in Problem 52, with r) : c, the speed of light. The boundary
conditions depend on the polarization. Find the allowed frequencies if the boundary
condition is F(R) : 0.
54. The modified spherical Bessel functions are defined23 as
i1e):
and k1@): rlZK+trzi)
,l!t,*r,r(r)
Y |Tx
U 2x
Using expression (8.99) and the result of Problem'8.40, verify the expressions for the
modified spherical Bessel functions 16 (x) : sinh x I x and ko (r) : e-x / x. Use proof by
induction to show that
h@)
:1_7)txt(:*)'+
55. We may model the force between particles in an atornic nucleus with
a
three-dimensional
square-well potential
v(r\:[-vo
|.0
forr<R
forr >
R
Schrodinger's equation for this system takes the form
m
(o, - zftvr,t) v : -2-Eill
hz
Write the differential operator in spherical coordinates and show that the solution may be
writtenintermsof sphericalBesselfunctions.Withcy :2(m/h2)VoR2 ande: -ElVo
with E < 0, show that the energy levels are determined by the equation
Jr -irt("J;) i+r("Jr [iCl
:
') J;i,(",h - ')r,t@1r61
Find the energy ofthe lowest energy level for / : 0, cy : 10.
fn" density of neutrons in uranium is described by the equation
!:pyzrqo,
At
where D (the diffusion coefficient) and a (the net production rate) may be taken to be
constants in space and time. Solve the equation using separation ofvariables. Look for
a solution with spherical symmetry that satisfies the boundary condition n : 0 atr : R.
Show that the density increases exponentially if R exceeds a critical value Rs;1, and
determine that value in terms of
D
23Some authors choose different normalization.
and a.
OPTIONAL TOPIC A
Tensors
A.1. CARTESIAN TENSORS
In Chapter 1, we showed how physical laws, such as Newton's second law (equation 1.8),
may be-represented as mathematical relations between vectors. In equation (1.8), the force
vector F and the acceleration vector d are parallel. It is sometimes the case that one vector
is linearly related to another, as in equation (1.8), but the directions of the vectors are not
the same. An example from mechanics is the angular momentum L of a rigid body, which
is determined by the body's angular velocity <i but is not parallel to 6 utless the body has
sufficient symmetry.1 In electricity and magnetism, the current density j in a magnetized
plasma is not necessarily parallel to the electric field E that drives it, because the magnetic
force causes particles to gyrate. In these cases, the vectors are related by a linear operator
that mixes the components. For example, the angular momentum is
i :
ll<ir
where ll is the inertia tensor.2 The vector components are related by a matrix. In index
notation,3
L;:
of
I;ia1
(A.1)
The matrix ll has 3 x 3 : 9 components.4 It is an example of a rank-two tensor. The rank
a tensor is indicated by the number of indices needed to describe its components.
TABLE 4.1. Tensors in Three-Dimensional Space
Object
Notation
Number of Components
Rank of Tensor
scalar
m
Ui
1:30
3:31
9 :32
0
vector
rank-two tensor
I ij
I
2
I
See, for example, Lea and Burke, Chapter 9; Marion and Thornton; Goldstein, Chapter 5.
2This is not the identity matrix. Here II stands for inertia. In Chapter 1, we used the symbol IR for this operator.
3Throughout this topic we shall use the summation convention discussed in Chapter 1, Section 1.1.2.
4The number 3 here represents the three dimensions of space.
439
44O
oPTroNALToPrc A TENSoRS
As with Newton's second law, it is essential that a physical relationship such as (A.1)
remain true when we change the coordinate system (refer to Section 1.1.1). In the new
system,
L'i: Ilj@'j
(A'2)
We already know how the vector components transform under coordinate rotations, so let's
fransform them:
L'i: AitLt
where A;p is the rotation matrix (equation 1.21). To go in the reverse direction, we multiply
on the left by {-t : A7:
A[,L',
:
:
Al;A;pL1,
6nkLk
:
Ln
so (equation 1.25)
Lr:
AinLti
rom:
Aim@}
Similarly,
so equation (A.1) becomes
A;nLt;: In6Aiaoj
Now we multiply on the left by,A,, and we have
Apn
A[, L',
:
Akn
Ir^ A j^tD'j
6P; Lti
:
L'k
:
Ak,
A
i^In^r'j
Comparing with equation (A.2), we find the transformation rule for the tensor
It,
:
AprAi^In*
ly:
(A.3)
Notice how the indices match up in equation (A.3); the repeated indices indicate that we
must sum over both n and m.
l
A.1 CARTESIANTENSORS
441
We can use matrix multiplication to perform the calculation indicated by the indices if
we first put the summed indices next to each other.5 To do this, we have to transpose one
of the matrices:
Ilri:
Alrnlr^Afi1
or, in matrix notation,
['
:
^AJl.Ar
(A.4)
The index notation (A.3) is more general and more powerful. It extends easily to tensors of
rank greater than two, whereas the matrix notation does not.
The transformation laws for vectors, scalars, and tensors are compared in Table A.2.
TABLE A.2. TFansformation Laws
Object
Transformation Law
scalar
ml:m
u'i: Aiju j
It^ : ApnAyli Ini
vector
rank-two tensor
We need one transformation matrix for each index of the tensor. Extending the rule we
have found, we get the transformation law for a rank-three tensor:
Tlip:
Example
A.1.
A;nAi-AkpTn^p
(A.s)
The inertia tensor has components
ti,:.1{r2ti1 -xix)dm
(A.6)
A uniform
square plate of side s is rotating about an axis perpendicular to the plane
of the square and through its center. The angular momentum of the square about its
center is
L:'t'
6
io
Compute the components of the inertia tensor in a coordinate system with origin at
the center of the square ard x- and y-axes parallel to the sides of the square, and
verify the expression for the angular momentum. Next, compute the components of
5See footnote 8 in Chapter 1.
-.!
442
oPTroNALToPrc A TENSoBS
the tensor in a primed coordinate system with axes rotated by 45" about the original
x-axis. Hence find the angular momentum about its center when the square rotates
about an axis at 45o to the plane ofthe square.
FIGURE
A.1.
The square plate in Example I rotates about (a) the z-axis and (b) the z/-axis.
Since the square is a planar object and all mass elements are at
we may immediately conclude that
I;3-13;-Q
unless i also equals 3. Then, with dm : o dA :
(M /s2) dx
dy
we have
tr,
: \ [''' f '' @2 + v2) dx dv
S' J
J
-s12
-t /2
Pl2 113
:5M l_;,(;.
:5 I::,(*
'
's/2
*,')l_,,,0,
. ,,,)
a,
M /s3
u31rs/2
: ;r
(i, *'?)l_",,
M s4
-s26-
Ms2
6
The other components are
,,,
: 5 l::,1::,,,
+ v,
- *27 d. dv
yt f /z l'''
Ms2
:M
,, Tl_,,r,1_,,r: T
z:
0 (Figure A.1),
and 12
:
x2
+
y2,
A.1 CARTESIANTENSORS
443
and
r,,
: 5 I::, I::,-xy dx o, : -5
Similar calculations give 122: 1rr and
lzt : In.
+l:::,,,*1,'_,,,:
o
Thus, in this system,
tr: ,''/'1S\
o ;;)
lB
Then, when the system rotates about the z-axis,
L;:r;iai:#(i:
g
)
(I
:+( I
)
)
i is parallel to <i in this case.
Now we change to a coordinate system rotated by 45o about the x-axis (Figure A. I ).
The transformation matrix is
Thus,
'/& -;??\i )
^:z\3
The new components of the inertia tensor are found from the transformation rule
(A.3):
I!,:
A;nIn^A[,
::(f, ii)#(s:!Xf
:+(f ii)(f i i)
i i)
:*"24 (3 St ?\
3)
\o
In the new coordinate system, the angular velocity is along the z/-axis and has only
one component, so the angular momentum is
L,i:r,@,j:#
(i?
i)(I)
:+( i )
OPTIONALTOPIC A TENSORS
andisnot parallel to dr.
The angular velocity has components @(A/D(O, -1, l) in the original coordinate system. We may check our result by computing the components of L in this
frame and transforming the result into the prime frame.
',:ry(i: l) ,+(-i ): M,"*( ;)
Transforming the resulting vector to the new frame, we have
i i)(-l ): +( i)
L,i:AiLi:u,,,fit"(f,
which agrees with the result from the ffansformed inertia tensor.
A.2. INNER AND OUTER PRODUCTS
The components of the inertia tensor (4.6) are made up of components of the position
vectors of all the elements of the body. Such definitions are common. In fact, the products
a;b i of vectorcomponents are the components of a tensor called an outer product of the two
vectors d and b. All such products are tensors. In an outer product, the rank ofthe resulting
tensor is the sum of the ranks of the tensors making up the product.
When we sum over two or more indices, as in the dot product a;b; or in the product of
a tensor and a vector to give another vector l;itoi, the result is an inner product. All inner
products are also tensors. The process of reducing the rank of a tensor by summing over
a pair ofindices is called contraction. Every contraction reduces the rank ofthe tensor by
two.
It is fairly straightforward to show that an inner or outer product oftwo tensors obeys the
ffansformation law for a tensor of the appropriate rank.6 In fact, we may prove a yet more
powerful result called the quotient theorem.
If a tensor b is the inner or outer product of c and d
and
if c is a tensor with arbitrary
components, then the set of components d is also a tensor.
For example, let
b;i
Then
:
c;PdPi
if b is a tensor, we can transform the components using equation (A.3):
bL^:
6See Problems 4 and 5.
An;Aaib;;
:
An;Aaic;pdpi
A.sPSEUDO-TENSORSANDCROSSPRODUCTS
445
But from the definition of b in the prime frame and the fact that c is also a tensor,
bL^:
c'^od'p-
-
An;Apkcrkd'p*
Setting the two expressions for b'n^ equal, we have
Ari A^i c;pd11i
Now we multiplyT on the left Ay
enl :
Anq
A^icqtd*i
Next we multiply on the left bV
A,l :
(6,jd*j
(d1r,
-
-
Ani A plrc itd'p^
and use
the result An/ Ani
:
6qi
to obtain
Aorcqtdtpm
Amri
: O
A4rApnd'o_)cq* :0
A^, Appdlo^)cq*
Because the components c4ft are arbitrary, the term in parentheses must equal zero, and so
we must have
dk,
Thus, the components
: ApnA^rdp^: e;) e;)alo^
ofd transform
as a tensor, as required.
A similar proof works for tensors of any rank.
A.3. PSEUDO.TENSORS AND CROSS PRODUCTS
In Chapter 1, we saw that the vector cross product is not a true vector because it does not
possess the proper behavior under reflection of the coordinate axes. Also recall that we may
express the cross product using the Levi-Civita symbol (see Chapter 1, equation 1.30):
(fr x
i), :
eijku
jvk
i are both vectors yet fr x i is not, the symbol e;;1 cannot be a tensor. It represents
pseudo-tensor,
a
or a tensor density. The transformation law for a tensor density is similar
to that for a tensor (Thble A.2) except that we must also multiply by the determinant of the
transformation matrix.8 Thus, for the Levi-Civita tensor density,
Since fr and
eli p
:
det (A) A; o A I q A p, e pqr
From this result and the fact that det (A)
: tl,
we may easily see that
1. The product of two tensor densities is a tensor.
2. The product ofa tensor and a tensor density is another tensor density.
7We cannot divide out the "factors"
8See Problem I 1 and Appendix I.
Ar; andcit
because we are summing over
i
and k.
446
oPTroNALToPrc A TENSoRS
We may avoid the difficulties associated with the cross product if instead we represent
the components of the cross product as components of an antisymmetric tensor. A ranktwo antisymmetric tensor has only three independent components, and these will be the
components of the cross product il x i if we choose
Tij:uiuj-uiuj
In matrix form, the components are
0
/
uruz
- u2ut
utu3
- urrt \
u2u3 - u3uz
T: I u2Dt - utuz 0
0
/
\ ,rr' - utui uyu2 - u2u3
(dxi;, -(frxi)r\
o
/
:l -lilxi)j
(ixi)1 |
o
\ tdxn)z -1frxi)1 o I
I
Then we may also define the vector density:
di
:
which is known as the dual of the tensor
,,
at
:r(Tzl -
az
:
1
1
1Qt -
1
(A.7)
t€ijkTjk
T;1 .
The components of d; are
Tn)
:
T23
:
u2u3
-
u31)2
:
(fr x i;1
Tn)
-
T3r
:
u3ut
-
u1u3
:
(fr x
and similarly for d3. Thus, the vector density d
The inverse dual relationship is
T;.i
:
i)z
ir ttr" cross product i x
i.
(A.8)
e;ipdp
We may demonstrate this relationship using equation (1.34) from Chapter 1:
r;1
:
:
r,1t
)rkrmTrm
1
1(T;1
-
:
){t,,ai^
Tii)
Since ?l; is antisymmetric, the result follows.
-
t;^6i)Tm
l
A.4 GENERALTENSOR
CALCULUS 447
Example A.2. Express the angular velocity and angular momentum of a particle in
circular motion as tensors.
We choose to put the origin at the center of the circle. The angular velocity of the
particle is given by
i:rixi
Since
i
and
i
are vectors,
6 is a pseudo-vector. We can find an expression for dl by
ri is perpendicular to i:
taking anothei cross product and using the fact that
ixi:ix(6xi;-drr2
Thus,
- 1,
@-;rXv
_
We may express these components using the antisymmetric tensor:
to;i:
"Giu1
-u;ri)= pn,t
The angular momentum is
i:ix
i:mixi
We may also define an antisymmetric tensor with components equal to the components
of L:
L;i : m(r;ui -
uir j)
- *?ii
Thus, returning to the pseudo-vector representation, we can relate the components
L to the components of 6 by
of
Li : m'2ai
A.4. GENERAL TENSOR CALCULUS
a non-Cartesian coordinate system, we need to identify two different classes of vector-like
entities that obey different transformation laws. First let's write the transformation matrix
for vectors in terms of the old (u') and new (D') components. A differential displacement
vector d* has components dTi that are functions of the original coordinates x'. Thus, we
may use the usual rules of partial differentiation to express dit as
In
dii :
oT',
0xl
d.*j
448
OPTIONALTOPIC A TENSORS
Since all vectors transform the same way, we obtain the transformation law for any vector
i:
ATt
-iut:rjur:A,jul
where the transformation matrix has components
A,j
ATI
(A.e)
flxJ
You should check that this expression is consistent with our previous result (equations 1. 1 1
and I.L2 in Chapter 1) in terms of rotation angles.
Notice that we are now writing the index on the vector component as an upper, rather than
a lower, index. The transformation matrix is written with one upper index and one lower
index. The lower index corresponds to the vector in the denominator on the right-hand side
of equation (A.9). We'll discuss this convention further below.
Now let's look at the gradient of a scalar function O. The gradient has components
- lao ao ao\
v<D:l_._._l
\0x' 0y' 0z /
3;O:
'
ao
]xI
-
(A.10)
The lower index on the left-hand side reminds us that the upper index vector component is
in the denominator ofthis expression. To transform the gradient to a new coordinate system
(labeled -'), we use the chain rule for derivatives. Since <D is a scalar, O : <D.
aO |xt
3;O:
'
0Q
aT' -07'}xJ
-:
and thus the ffansformation matrix for the gradient has components
,
B,t:,
DxJ
AT'
(A. r l)
The gradient operator is written with a lower index to indicate that it transforms with the
matrix IB. Upper index and lower index quantities transform differently, in general. The upper
index quantities are called c ontravariant tensors, while the lower index quantities are called
covariant tensors, or forms. Thus, the differential displacement vector is a contravariant
vector (or just a vector), while the gradient is a covariant vector (or one-form). If the
geometrical image for a vector is an arrow with a specified length and direction, then the
image for a one-form is a set of level surfaces with a given orientation and separation
(Figure A.2).
A.4GENERALTENSORCALCULUS 449
FIGURE A.2. Level surfaces offer a geometric image for a one-form. The dot product d3. i Q : d.@ .
When the vector dd is laid across the level surfaces representing iO, aO ir ttr"
difference in the value of O between the head and the tail of the vector.
In the case of Cartesian coordinates, where the allowed set of transformations is rotations,
these distinctions are not necessary, because the two transformation matrices are the same.
But let's look at what happens when we transform from Cartesian coordinates to cylindrical
coordinates. The relations between the coordinates are
x-pcosQ; y-psinQ; z:Z
(A.r2)
and the inverse relations are
p: ,rF + t';
Q
: t*-l I; Z: z
x
(A.13)
We may differentiate tan@ to obtain an expression for 6Q/0x,
y
06 :- tand 1
v
. .06
tanQ:'-+sec'Q^
,cos'Q:
^
pcosQ
dx --= x' + dx
x
sin@
p
and hence find the components of the transformation matrices.
At,1ox,0o,0o
- ax- p.:COS@; A'Z:-:Sind;
"Idy
Ad
sind
Ad cosd
AL" _
AL" _
'0xp"0yp"0Z
A\:
^ -Q
-- az-"
Ad _ N
AZ" _
'
and
-E-g;
'0x-dyJ1z
A3,
A3r:E:gi
,fr:!:1
while
urt:H-cos@; 4:H:sind:
Bl:93:s
,0x..Eyr]z
---: - -psinQ;
B.r':
'a0-dQ"a0
Bz": * - pcosQ:
Bj: ^ -Q
lv
g]:!:-:g1
B1J:-:l
and
_r
0x ^
"azazaz
B1r:-:0.
^
_z
0z
450
oPTroNALToPrc A rENsoRS
Thus, the two matrices9 are
@
-sinQlp
cos
00
sin
@
cosQ/p
(A.14)
?)
and
cos@
-p sinQ
00
sin@
p cosQ
(A.1s)
t)
which are clearly different. In fact,
m.T
- n-l
(A.16)
Let's verify this result:
f cosQ -p sin
lEr.A:lsind pcosQ
0
@
\o
:(
o
?)
0
1
cos@sin@ - sin@cos@
cos2 4 + siri24
sin@cos/ - cos QsinQ cos2 4 +sin2 6
00
)
i):(i:?)
Relation (A.16) is true in general. (See Problem 15.)
Since the components of the transformation matrix are not constants, in general, but
functions of position, we cannot add vectors defined at different points of space because the
sum would not have a well-defined transformation law For example, we cannot subtract
velocity vector_s at two different points of space to determine an average acceleration. A finite
displacement D43 from point A to point B is no longer a true vector. However, operations
such as adding the electric field contributions E1 gndES at qpoint P due to two different
charge distributions to give the total electric field E : Er * E2 remain valid.
Example
A.3.
The electric field due to a long straight wire may be written in Carte-
I
j
sian coordinates as
i
-),/xv\
E:2ir€o
(7;7';rrP'o)
Transform the electric field vector into the cylindrical coordinate system.
i labels the rows, and j labels the columns. Even though one index is up and one is down, we shall still
write one index to the left ofthe other so that we can maintain the conventions introduced in Chapter I for labeling
rows and columns.
9Here,
1
I
A.5 THE METRIC
TENSOR
451
The electric field is a contravariant vector. Thus, in the new coordinate system, the
components are
E,:AE
(; )
:( -iiif,,
)=-=
"J!"f,,
o
0 rf
3
\
zneoo'
\o/
/ xcos@*Ysin@ \
^ 1l ,-rsind*OycosQllp I
zneop\
/
pcos2q*psin2f
: ^ I t-'cos@ sin@ * P sin Acoso)/o \
2orw2
)
\
:
x
:#G(
f
):h(
i)
The electric field has only a radial component. This is the expected result.
A.5. THE METRIC TENSOR
The line element (Chapter 1, Sections 1.1.1 and 1.3.2) is an important example of a scalar
that is formed as an inner product of a tensor, called the metric tensor, and the differential
vector di:
ds2
:
gii dxidxi
(A.17)
Since the lefrhand side is a scalar and the vector d*. is a vector whose components may
take on any values, we may apply the quotient theorem to conclude that g;1 is a tensor. In
Cartesian coordinates, gii : 6;i.In any orthogonal coordinate system, g;; is diagonal. For
example, in the cylindrical coordinate system (equation 1.4),
dsz
:
dp2
a p2 dq2 +
dz2
and so
/r o o\
,,,:186, ?)
(A.r8)
452
oPTroNALToPlcA TENSoRS
We may obtain these components from the Cartesian components by applying the transformation law in the usual way. The lower indices imply that g;; is a covariant tensor, and so
we transform with the matrix IE:
8,: EsBr: (
_,$tr ,"xf,
?
) (i
: ? ) (qr l;il, I)
: (-'.;tr, ,.'::, (:t, l#,
?)
l)
: ('*'d+sin2d p2sin2elor"o"rQ
l): (i (
?)
in agreement with the result above (equation A.18).
A.6. CONTRACTION
To form an inner product of two tensors of ranks m and n, we must contract the tensors by
summing over (at least) two of the indices. The result is a tensor of lower order (by two for
each pair of summed indices) than the sum of the ranks of the two tensors in the product,
and thus it must transform as a tensor of rank m -l n - 2. As an example, let's consider the
contraction of two vectors to form a dot product-a scalar. If we try to contract by summing
over two upper indices, we find that the result is not a scalar.
d-u'
:
Airuj Aiouk
But the product A'i A'o + 67r except for rotations of a Cartesian coordinate system, and so
this product is not a scalar in general. However, if we contract an upper index with a lower
index, then
-u'u;:
and the product Ai, B ,k
:
6f
Airui
B1ku1'
(eOuation A. 16), so
U'ui
-
L
U"uk
and this inner product is a scalar. Thus, we have the following rule:
In general tensor algebra, an upper index may be contracted with a lower index, but
not with another upper index.
'
i
i
I
A.7 BASIS VECTORS AND BASIS FORMS
4s3
In order to form an inner product of two vectors, we must first form the lower-index
covariant vector that corresponds to the upper-index contravariant vector. The line element
(equation A.17) shows us how to do this. Writing the line element as a dot product of di
with itself,
:
ds2
dx;
:
dxi dx;
:
Bi; d.xi
dxi
we observe that
gii dxl
In this way, the metric tensor maps any contravariant vector to a corresponding covariant
vector (or one-form):
u;:
g;iul
(A.1e)
Similarly, the contravariant metric tensor g'J, whose components form the inverse of the
matrix of components gij, maps a covariant vector to a contravariant vector:
u' : g'luj
(A.20)
Sometimes we say that the metric tensor g'J is used to raise an index, while the tensor grj
is used to lower an index.
These expressions show that there is a covariant vector corresponding to every contravariant vector, and vice versa. The metric tensor is an isomorphism that maps one space to
the other, its "dual" space.
When an inner product is formed, it does not matter which index of a contracted pair is
up and which is down. For example,
lt'L);
:u'gijuJ:ujul
A.7. BASIS VECTORS AND BASIS FORMS
in any coordinate system are defined to have components (1,0,0),
(0, 1,0), and (0,0, 1). However, in a non-Cartesian coordinate system, the basis vectors
are not necessaily unit vectors. We may find the magnitude of a vector from its dot product
The basis vectors
OPTIONALTOPIC A TENSORS
with itself:
fil:
Ju'ui
Thus, for the nth basis vector, we have
t-
16,,,1:
i and j
where (n) labels the basis vector and
ei6siiett,t
1f
eln',
label the components. But
:
6',
and so
li<,rl
: {r,,tfil:
^E;
(A.21)
where in these expressions we do not sum10 over n. Thus, in cylindrical coordinates, the
second basis vector has magnitude ,En : p, and the basis vectors in this system are p, pQ,
and2.
The basis forms are defined similarly. So, for example, the first basis form has components
(1, 0, 0), and its magnitude i" t/if .
A.4. Starting from the components of the velocity vector in Cartesian
coordinates, ffansform to cylindrical coordinates to find the components of i in the
new system, and hence write the velocity vector in cylindrical coordinates.
First we transform the velocity vector:
Example
,
l
)
1
-u' :
A'juJ
sin
@
cos
@
p
0
0xcosd 3v
+ *sind
*
3x
sin@
0y cos@
r__
M p'at
p
0z
u
l0There is no inconsistency here: n is a label, not an index.
I
455
A.8 DERIVATIVES
Next we write the derivatives in terms of the new coordinates, using equations (A.12):
"':
cos
@
sin
@
l)
(ff*'r-o
(ff*"r-o
""a(ysin@*0,",r#) )
ryesind* r""'r#)
I
)
as expected from the definition. To write the vector, we must multiply each component
by the corresponding basis vector. Then
^ 0z^z
- 0p"l) + AQ^ * 32" : Ap^0t. AQPQ-t
o:
^
^
^
* ",
t",r,
dt
dt
dt
which is the standard result.
"e(2)
A.8. DERIVATIVES
We have already shown that the gradient of a scalar is a covariant vector. We must be
more careful when taking the derivative of tensors of rank one or higher. For the moment,
let's discuss vectors. In taking the derivative, we must compare the values of the vector
components at neighboring points. But if the coordinates are not Cartesian, then even if
the vector is unchanged (its magnitude and direction remain the same), its components at
the two neighboring points are not the same (see Figure A.3). Thus, before we can subtract
FIGURE A.3. Parallel displacement. When
the
vector is moved in a non-Cartesian
system, its components change
'
even if the magnitude and direction do not. Here a vector that is in
the r direction at one point has both
r and 0 components when moved
to a second nearby point.
456
OPTIONAL TOPIC A TENSORS
the two vectors, we must first displace the vector parallel to itself and evaluate its new
components. This process is called parallel displacement.
It is customary to start with a covariant vector. Since we are considering differential
displacements, the change in the component u; under parallel displacement is proportional
1
to the displacement. It should also be proportional to the vector components,l so we demand
that
6v;:li.nr'4*t'
(A.22)
where the components f ,{o are called an affinity that specifies the affine connection of the
space. These components are not a tensor. Now, since the dot product u'v; is a scalar, we
require that it not change under parallel displacement,rz 61ui u;1 : 0. Thus,
u; 5u'
* u' 6t)i :0 :
ui
3ui
+ uilirnu i dxk
Since this result must be true for any covariant vector u;, we may conclude that
: -lj*uj
6ui
dxk
(4.23)
Now if the change du' inayector's components is equal to the change 6z' due to parallel
displacement, then the vector has not actually changed, and so its derivative is zero. Thus,
we define the covariant derivative z'., in terms of the difference between the actual change
in components and the change (A.23) due to parallel transport:
du'
-
3u'
:
u';.i dxJ
(A.24)
And so we obtain an expression for the covariant derivative in terms of the affinity:
,t.i
:#
tripiuk
(4.2s)
Working from equation (A.22), we obtain the resultl3
ui:j:
0u;
a*l -l^ij'*
(4.26)
Similarly, by looking at invariant combinations, we may write the covariant derivative of
any rank tensor. It turns out that we need one term in the affinity f for each index of the
llThis is actually a consequence ofrequiring that the parallel displacement in general be consistent with the
familiar definition in Euclidean space. See Lawden, p. 98.
l2Thir i, clearly true in Euclidean space. Once again we are requiring that the new ideas be consistent with the
old. This is a constraint on the kinds of space we are willing to consider as physically interesting.
l3See also Problem 28.
457
A.8 DERIVATIVES
tensor. It will be added, as in equation (A.25), for an upper index, and subtracted, as in
equation (A.26), for a lower index.
The covariant derivative of a vector (equation A.25) is a rank-two tensor. This result
follows from the definition (A.24), since the quantity on the left-hand side is the difference
between two vectors defined at the same point of space and thus is also a vector. Since
the components of dxi are arbitrary, we may apply the quotient theorem to show that the
covariant derivative is a tensor. Similarly, the covariant derivative of any tensor is also
a tensor.
It remains to find the components of the affinity. 14 The result we obtain should be valid in
any space, including ordinary Euclidean space. But if the space is Euclidean,15 we may set up
aCartesiancoordinatesystemxtattheoriginalpointandcomputetheCartesiancomponents
tr
of the covariant vector rr,. These components do not change under parallel displacement:
6ui - Q. We may obtain the components ui from ui using the usual transformation matrix:
a7l
ui:
axiuj
Then
6u;
/a]j\
a7l
- 3\e; )o, + *\ui
:
^r-;
L'!!-d*kv1+o
Dxt Exk
: a2tj
*o*
)-L
,-ox^
A-*j
r^
and thus, comparing with equation (A.22), we find
nm
I i*:
azti
go;
6**
0x^
67
(
.27)
This expression shows that the affinity is symmetric in its lower two indices. However, it is
not very convenient for calculating f.
To obtain a more convenient expression, note that we can obtain the covariant derivative
u;;i in two ways:
1.
Tiake the
covariant derivative u'., and then lower indices.
or
2. Lowq indices and then take the derivative.
l4ln their Section 1.5, Morse and Feshbach give a different derivation of this result and provide an expression for
the affinity in terms of the metric coefficients h;, discussed in ChaPter 1, Section 1.3.2.
l5This requirement is more restrictive than necessary, but allows us to present a more transparent argument. In
fact, even in non-Euclidean space we can set up a local Caftesian system, provided that det (S) * 0, and then
show that the parallel displacement of the components in this system is zero to first order in tbe displacement dx.
458
oPTroNALToPrc A TENSoRS
These two methods must give the same result, and so
gi-u\j : uiij :
(gi*uk);j
:
gik;juk
*
git uk;j
In the second step, we assumed that the product rule applies to the covariant derivative in
the same way as for ordinary derivatives.l6 Then it must be true that
qi*;i
:0
(A.28)
That is, the covariant derivative of the metric tensor is zero, a pleasing and intuitive result.
This requirement actually determines the f . We write it explicitly as
#-tf,1si^-tfis^t,:o
We may permute the indices in this expression to obtain two more similar expressions:
#-rlisi^-t!*E*i:o
and
Y-tfil*^-rfrs^i:o
Now we add the first two, subtract the last, and use the symmetry of the
f
s and the
g;; to
get
#
-2rf,1ti^*H
-ffi:o
Thus,
rf,1si*::(#*H-Y)
Multiplying by g'n, we find
8" ( |gir , }gii asu \
-n
'kj-- 2 \arl - a'l'- a.,)
(A.29)
Expression (A.29) is called the metric affinity. It is also called the Cbristoffel symbol of the
second kind.
We now have the mathematical apparatus that we need to describe physical systems in
any coordinate frame.
l6See Problem 29.
A.8 DERIVATIVES
A.5.
Example
459
The covariant divergence of a vector u'. t is a scalar. The electric field
inside a uniform cylinder of charge is E : poil2eo, where p6 is the charge density.
Evaluate the divergence ofthis vector field in Cartesian coordinates (d : xi + yi).
Evaluate the necessary components of the metric affinity in cylindrical coordinates,
and hence calculate the divergence of the electric field in cylindrical coordinates.
Verify that the results are the same.
In Cartesian coordinates, the
i .i:
f
Ei;
are zero, so
:
Po 0E'
2es 0TI
(0x
Po^
,0Y\
--%r\t-ay)
2eo
Po
p0
so
which is the expected result from Gauss'law.
In cylindrical coordinates, the only derivative of the metric tensor components that
is not zero is
ogzz
ffi:20
The divergence is
. AEi
Eii:
U, +l'kiE"
:9!'
* t(W *V+0x' 2 3x' \xk
\
|
:
pn /2e0, E2
:
Eq\ro
1xn
/
:
-
po l2eo.
O,and E3
0. Thus, in this system, A Ei I axi
The metric tensor is diagonal, so in the second term we get contributions from the
n. With the sums put in explicitly, the term becomes
sum over i only when i
where E
:
lilrrEk
8n'
- \L2
n,k
W*W-#)*
The terms in gr1, arc nonzero only if n
eachn. Thus, the only nonzero term is
t
n,k
gnn ogrn
2 ]xk
:
k, but 1gnrf 0x" (not summed) is zero for
,o - B" ogzz ,t - | .^,po
'tp\zot
2 1xl
:
^0
and so
V.E:
as we
ui,
:#+r',,iEk
found in Cartesian coordinates.
p0a_-_ p0 po
2€o' 2eg- eg
po
6
460
OPTIONAL TOPIC
A TENSORS
Although we have used flat space in our examples here, the maximum benefit of general
tensor mathematics accrues in applications such as special and general relativity, where the
metric cannot be reduced to the simple form 3;; with +1 along the diagonal. Some examples
are given in the problem set.
PROBLEMS
fl
oete.mine the velocity of an electron driven by an electric field E : tor--t't in the
presence of a constant uniform magnetic field Bo. Choose the z-axis along 86, but do
not make any assumptions about the direction of Eo. If there are n electrons per unit
volume, write the current density in the form ji : oij E j and determine the components
of o;i.
2. Starting with the expression ?
: 6xi
expression (,4'.6) given in Example
3.
A.l for the inertia
tensor.
Compute the inertia tensor (equation ,4..6) for a uniform cylinder of radius R and height
ft. Hence find its angular momentum when it rotates with angular speed a,l about
(a)
(b)
(c)
4.
for a particle in circular motion, derive the
an axis through its center and along its length
an axis through its center and along a diameter
an axis through its center making
Show that the outer product aib
j
a45"
angle with each of the axes in (a) and (b)
of two vectors obeys the transformation law for
a
rank-
two tensor.
S
Strow that the inner product a;i1rb1, of a rank-three tensor and a vector obeys the transformation law for a rank-two tensor.
6.
Show that the Kroneker delta tensor 6;7 has the same components in every coordinate
frame.
7.
Show that if a tensor b;y is symmetric in one frame (thatis, bi.i
:
bji
), then it is
in every frame. Similarly, show that the property of antisymmetry (bii
preserved under coordinate transformations.
8. (a) The following
symmetric
: -bj;)
is
set of components is defined in two-dimensional Cartesian space:
a,;:
(/ -y'i,
:J-ry \)
Does this set of components transfonn as a tensor? Why or why not? Hint: Try to
express the components in terms of inner and outer products of known vectors and
tensors. Alternatively, form inner or outer products with vectors and use the quotient
theorem.
Repeat the problem for the following sets of components:
x2\
y- -xy/
(b) Bi; : (' -*l.t
"
\
I
PROBLEMS
(c)
cii:
G, ?)
(d) In three-dimensional
space,
Dij
E fn" magnetic
(j : i)
moment tensor has components
Mik:
where
(a)
(b)
j
461
|
,;lrav
is the current density and the integral is over all space.
Show that Mip is antisymmetric for any steady, finite current distribution.
Show that the corresponding cross product (the dual vector, equation A.7) reduces
to the usual magnetic moment vector fr : I Aff in the case of a planar current loop.
10. The electric quadrupole tensor is given by
g,t
: f (3x;xi J
r2lii)P(*) av
Calculate the quadrupole tensor for a set of four point charges, two of charge q and two of
charge -e, at the corners of a square of side a.Tlte charges alternate in sign, so charges
of equal sign are at opposite ends of the diagonals of the square. Use a coordinate system
with 7- and y-axes along the diagonals of the square.
The force on a quadrupole charge distribution placed in an external electric field is
I
E2E,r
Fi: =Qik
b a& a*k
where the derivatives are evaluated at the origin. Find the force on the square when it
is placed in an external electric field i : o(2*y, -i2 , x2 + y2). The square's normal
lies in the x-z plane at angle I to the z-axis, and its center is at the origin.
11. Show that the components of the Levi-Civita symbol
e;;7.
transform as a tensor density
under coordinate rotations and reflections.
12. Starting from the components of the velocity vector in Cartesian coordinates, transform
to spherical coordinates to find the components of i in the new system, and hence write
the velocity vector in spherical coordinates. (Refer to Example A.4.)
Ell
tn
a
region of space, the electric scalar potential has the form
<D
:
-EoZ.
(a) Working in Cartesian coordinates, compute the gradient to obtain the electric field
components. Transform to a spherical coordinate system, using the appropriate ffansformation law from Section A.4.
462
OPTIONAL TOPIC
A TENSORS
(b) Write the potential in spherical coordinates, and compute the gradient using the
operator 3; (equation A. 10).
Confirm that both methods give the same electric field.
14. Write the components of the gradient form in (a) cylindrical coordinates and (b) spherical
coordinates. Use the metric tensor to raise indices, thus mapping to the corresponding
vector. Finally, multiply by the basis vectors to obtain the conventional expression for
VO, as derived in Chapter I (equation 1.62).
15. Use equations (A.9) and (A.11) in Section A.4 to show that the relation lBr
true in general.
Aii - 4l', show that Ai 1 : Aji ardthat
Can you find a relation between Ai i and A/;? Why or why not?
16. If the tensor Ai"/ is symmetric,
:
.A,-l is
A;i - Aji.
17. Which of the following relations between tensor components could possibly be true?
Say what is wrong with the incorrect expressions.
(a) Vi
(b) fii
(c) V'
(d) y'
: eiik(Jr
: Xikyrj
: Xik(Jr, + Wi
: eiikIJiWlXl,Yi
[f8..l fn special relativity, space-time is described by four-component vectors u". (It is con-
ventional to use Greek letters to label the indices 0,1,2, and 3 in four-dimensional
space-time.) The coordinates of an event in space-time are xo : (ct, x, y, z), and the
metriclT is
/t o o o\
lo-r o ol
'*:lB B-; -i)
The Lorentz transformation matrix relating two coordinate systems moving with relative
speed u along the x-axis is
I
l
I
/ v -vP o o\
nt:l -YP ,n o, Sl
\ o o o t)
where
y
: ll \fr}
and
B
:
y1s.
The electromagnetic potential is described by a four-vectorrvith components Al' :
(O, a", Ar, Ar), where <D is the electric scalar potential and A is the magnetic vector
potential. The electromagnetic field tensor has components
Fl"u
_
AP
Av
_ Av AtL
lTThir i, the sign convention used by Jackson. There are others.
I
l
j
l
PROBLEMS 463
Show that, in the Gaussian unit system, the components of the field tensor in terms
ofE
and B are
Fllv
-
(?'
i,"; fr)
Two particles, each with charge 4 and mass z, are moving along lines parallel to the
x- axis and a distance d apart. Each particle moves with speed u << c. Stan in a reference
frame in which the two particles are at rest. Compute the components of the field tensor
in this reference frame, and hence find the force acting on each particle. Now transform
the field tensor to the lab frame, and again compute the force on each particle. Verify
your result to first order in the small quantity 0 : u lc by computing E and B in the lab
frame using Coulomb's law and the Biot-Savart law. (In the Biot-Savart law, the source
1dl should be identified with a point source qi atthe position of one of the charges.)
19. Using the metric of Lorentz space-time and the electromagnetic field tensor (see P-roblem
18 above), verify that an electromagnetic wave has the same field structure (E I B,
E : B) (in cgs Gaussian units) in any inertial frame.
20. What invariants can you form from a tensor Zpv? Compute these invariants for the
electromagnetic field tensor in Lorentz space-time (see Problem 18).
the wave four-vector has components ktr : (alc,k',ky,kr).
Use the Lorcntz transformation matrix Ap, (see Problem 18) to find the components
of the wave vector in a second frame moving-with velocity i - ui with respect to the
first. What is the result if (a) [ : ki and O) f : ki? Compare with the nonrelativistic
Doppler shift formula, and comment.
[Zil fn Lorcntz space-time,
22. Use the metric gpy for Lorentz space-time (see Problem 18) to compute the line element
ds2 : Bp, dxp dx' . The proper time r is defined by the relation dt : ds f c. Compfie
the proper
timeintewaldt between two neighboring points
movingatspeedu.(Letthepointshavecoordinates
on the world line of a particle
ct,x,y,zandc(t*dt),xldx,y*
z*dz,where dx : Dx dt and dy and dz are defined similarly.) Express your result in
terms of the time interval dt, P : u f c, and y : I I \/ I - p2. Compute the components
of thefour-velocityuP : dxpldt of aparticleandcomputetheinvariantprodtctupuu.
Comment.
dy ,
23. The set of components
*,
I -l
[0
(
t"fv6 :
if a py 6 :
if apy 3 :
0123 or an even permutation of this
1023 or an even permutation of this
otherwise
maybeusedtoformthetensorFof
-
je"fr6 Frt dualtoF(comparewithequationA.T).
(a) Show that the components of edfv6 ffansform as a tensor in Minkowski space-time.
(b) Find the invariant F"0 Fop if F"P is the electromagnetic field tensor defined in
Problem 18. Comment.
464
oPTroNALToPrc A TENSoRs
24. IJse Gauss' law in integral form to find the electric field inside a uniformly charged
sphere. Compute the necessary components of fir, and hence find the divergence of
this electric field in spherical coordinates. Show that the divergence equals the (uniform)
charge density divided by e6.
[2il
In two-dimensional flat space described with polar coordinates, a vector d is in the radial
direction. Displace the vector to a neighboring point (refer to Figure A.3), and compute
the new components in terms of the displacement (6p, d0). Compare with relation (A.23)
in the text and hence compute the components f 1;; of the affrnity. Perform the same
operations with a vector in the I direction to find the remaining components of l. Hint:
Remember that the basis vectors a.re not unit vectors in this system.
:
ff;
26. In three-dimensional
(equation A.29) has 33
space, the affinity
27 components.
Show that the affinity for Euclidean space with cylindrical coordinates has only three
nonzero components, and compute them.
Use equations A.14, A.15, and A.27 to obtain the same result. (Remember that in
equation A.27 i are the Cartesian coordinates.)
27
.
The tensor density etift is defined in Cartesian coordinates as in Chapter I (Section l. 1.2,
equation 1.29). Transform to flat space with cylindrical coordinates, and determine the
p.Compute
components of e'lft . Lower indices to fi nd the components of e, ik, E
rk, andEi i
:
the cross product 6 x i
d for a particle in uniform circular motion with <il : ari.
r
Hint:
16 x
i;t : eiikajuk:
eijkroi uk
28. By lowering indices, show that the covariant derivative of a covariant vector may
be
]
written
ui:j :
as
0u;
a*,] -l^iju*
in equation (4.26). Obtain the same result from equation (4.22).
l
i
[2q.lVe.ify the product rule
l
I
ui.i : (Tikulr);j : ukTik;j + Tikut, j
l
j
I
I
j
.t
I
I
I
OPTIONAL TOPIC B
Group Theory
In modern physics, group theory is becoming an increasingly important mathematical tool.
As we develop the theory we shall see that groups form the natural mathematical description
of symmetries of physical systems. These symmetries are in turn related to the conservation
laws of physics, and thus the mathematics of groups allows us to investigate these laws in
a very general way. Space constraints here do not allow us to do more than touch on the
basics of group theory. Students who plan to pursue an interest in topics for which group
theory is especially important, such as particle physics, will need to consult more advanced
texts to supplement the material here.
B.1. DEFINITION OF A GROUP
A group G is
a set of elements {a}, together with an operation
*, that obeys the following
set of rules:
l.
If a
and
b
are elements
of the group, then so is a x b.
2, T\eoperation * is associative: (a * b) * c - a * (b * c).
3. Thereis aunitelement I in G suchthat 1+ a: a*l: afor everyelement ainG.
4. EveryelementainGpossessesaninverseelementa-t,alsoinG,whichhastheproperty
that
I
-l
a*a':a'*a:I
where
1
is the identity element (see rule 3).
In the special case where the operation is commutative (a + b
:
b * a),the group is said
tobe abelian.
The number of elements in the group is called the order of the group. The order may be
finite or infinite.
465
466
OPTIONAL TOPIC
B GROUP THEORY
8.2. EXAMPLES OF GROUPS
The group of order one The most trivial group is of order one and consists of the single
identity element. Convince yourself that this set obeys all of the rules listed above.
two
The group of order
There is exactly one group of order two. It consists of the identity
1. This group represents the
and one other element a that is its own inverse, so that a r( e
physical symmetry of reflection in a plane or of rotation through an angle z about a single
:
fixed axis.
The group of order three There is exactly one group of order three. It contains the identity
and two other elements a and b, where b is the inverse of a. We may write a multiplication
table for this group, as shown in Table B.l.
Since b : a2, wealso have the relation a * b : e * a2 : aJ : l.This group and the
previous one are examples of cyclic groups. A cyclic group is generated by a single element.
Every element of the group is obtained as a power of this single element, and in a group of
order n, the n th power gives the identity. The cyclic group of ordet n is called Cn. Every
cyclic group is abelian.
TABLE B.1. Multiplication Table
for the Group of Order Three
The cyclic group C, represents a symmetry of a polygon with n sides. We can think
the element a as a rotation through 2n ln that rotates the polygon into an identical copy
of
of
itself.
The set of integers under the operation of addition The set of (positive and negative)
integers forms a group of infinite order. The operation x is addition:
mtrn=mln
Then the sum of any two integers is another integer. The operation is associative:
n*(m*p):(n*m)*p
The identity element for this group is the integer zero:
nl0:0ln:n
for any integer n. The inverse of an element n is the negative of n:
-n.
Then
n*(-n):(-n)ln:O
1
8.2 EXAMPLES OF GROUPS
467
This group is abelian, since the order of the integers in the sum does not matter: n+m : mIn
for all integers m andn.
The set of integers does not form a group under ordinary multiplication, because the
inverse of n,lf n, is not in the set of integers.
The set of all rational numbers m ln under the operatian of multiplication, where n,
m+0
* *P :*
nqn
p
q
X 1-:
mxp
nxq
:"e":) : (T"")":
Theidentityelementis
and has infinite order.
l:1/l,andtheinverse
of m/nisn/m.Thisgroupisalsoabelian
The rotation group Spatial rotations may be represented by rotation matrices (see, for
example, Chapter l). The identity is a rotation through 0 radians (represented by the identity
matrix), and the inverse of a rotation through 0 is a rotation through -9 about the same
axis. This group is not abelian. The set of rotations in a plane is given the name SO(2).
The rotations are represented by orthogonal2 x 2 matrices with determinant +1. The O in
SO(2) refers to the orthogonality of the matrices, the 2 refers to the fact that they are 2 x 2
matrices, and the S stands for special and refers to the condition that the determinant is +1.
This group is abelian.
Notice that in this group the elements themselves are "operations" (rotations), represented
mathematically by matrices, while the gro up operation is "follow one rotation with another"
and is represented mathematically by matrix multiplication.
Permutqtion groaps In these groups, also, the group elements are things we do ("operations"). Tlte group operation is "follow one permutation with another." As an example,
consider the set of permutations of three objects, 53. One element of this group is the
permutation {interchange first two elements}, which we'll label Gp.
Gn: (qbc) --> (bac)
The identity element is {do nothing}.-
l:
(abc) -'+ (abc)
Consider the two elements GB : {interchange lst and 3rd} and G23
and 3rd ) . The product of these two elements may be written
GzzGn:
{@bc) --> (cba)} followedby {(cba)
:
{interchange 2nd
-+ (cab)}
468
OPTIONAL TOPIC
B GROUP THEORY
The result is the group element Gsrz, which means {take the last element and put it first}.
(In standard notation, the operations are done in order from right to left. This is consistent
with the notation for matrix multiplication.) Now if we multiply by Gr: again, we find
GpGlGs :
{@bc)
+
(cba)l {(cba) --> (cab)} l(cab) --> (bac)}
- Gn
where G12 is the group element {interchange lstand 2nd}. Eachofthesethree "interchange"
elements is its own inverse, since interchanging two elements twice gets us back to where
we started. Thus, we can also write this relation as
GnGnGr]
-
(B.l)
Gn
We say that the element G12is conjugate to Gzt.
The sixth element is Gnz, which means {take the first element and put
it last},
or
l+3-->2--+L.
GB2:
{@bc) -+ (bca)}
Using these symbols for the group elements, we can set up a multiplication tablel listing
all of the 3l : 6 group elements for the group 53. (See Thble B.2.) Note that this group is
not abelian, since, for example,2
GrczGn:
{@bc) --> (bac) --+ (acb)l
GnGnz:
l@bc)
:
G23
while
-->
(bca)
-->
(cba)\
- Gn
TABLE B.2. Multiplication Thble for the Group 53
Gn
Gn
I
Go
Gn
Gzz
Gzt
Gnz
Gnz
Gzn
1
Gvz
Gn
Gn
Gzt
Gnz
Gzn
Gn
I
Gtz
Gnz
Gzt
Gnz
Gvz
Grrz
Gn
Gzz
Gzn
Gnz
1
Gnz
Gzz
Gn
Gzn
1
Gn
Grc
Gzz
Gn
Gs
Gzz
Gn
Gn
Gztz
I
I
Gnz
The elements of this group may also be written using the cycle structure, in which G13 is
written (13)(2): atwo-cycle (13) in whichthe element 1 goes to 3 and 3 goes to 1, and a onecycle (2), showing that element 2 goes to itself. The one-cycle is sometimes omitted, since
it is redundant. The element G132 is written (132). The sequence of numbers in the cycle is
I
Since this group is not abelian, the table must be read as follows: Multiply the element in the column on the far
left by the element in the top row to obtain each table entry.
2Remember:We do the operation on the right first.
8.2 EXAMPLES OF GROUPS
469
important, but it does not matter which one we write first, since 2 -+ 3 -> I -+ 2 --> 3.In
this notation, Gyp is written as the three-cycle (3I2) : (123).
We can understand how the operations in this permutation group are related to physical
symmetries by labeling the vertices of an equilateral triangle 1,2, and 3, as shown in
Figure B.1. Then each ofthe two-cycles corresponds to rotation by n about an axis in the
plane of the triangle and through one apex. The cycle (12) cornesponds to rotation about
axis A in the figure. The three-cycles correspond to rotation by 2n 13 or 4n /3 about an
axis through the center of the triangle and perpendicular to the plane of the figure. Each
of these operations leaves the triangle unchanged, except for the positions of the labels
on the vertices. Once we make this identification, it is easy to see why (I23) + (123) :
GyzGzo: Gnz: G32) and (123) * (132) : l.
FIGURE B.1. The group
,S3
describes the symmetry of this equilateral triangle.
The unitary groups Another set of groups that is important in physics is the set of unitary
groups. The unitary n x n matices form the group U(n). A matrix is unitary if its adjoint
equals its inverse. The adjoint is formed by taking the complex conjugate and then the
transpose:
adj (A)
:
G*.)
If A is real, then the condition of unitarity becomes tr : A-r;that is, the matrix is orthogonal. The special group SU(n) is the group of n x n unitary matrices with determinant *1.
The general form of a group element of SU(2) is
(-i. :.
)
with aa*
*
bb*
For example,
l/
eiq
eif \
fi\_r-w ,-'t )
would satisfy these constraints.
:
t
(8.2)
47O
oProNALToPrc B GRoUPTHEoRY
B.3. CLASSES
If a, b, and c
are three elements
of a group and
.
b:
_l
CnC '
(B.3)
then b is said to be conjugate to a (compare with equation B.1).
1. An element is always conjugate to itself, since we can take c to be the identity element.
Then
,or-l : lal:
a
2. If a is conjugate to b, then b is conjugate to a:
a: cbc-l
We multiply on the right by c and on the left by
c-l.
Then
,-1o, : c-lcbc-lc : b: fof-l
where
/ : c-r.
3. If a is conjugate to b and c, then b and c are conjugate to each
Ifa
is conjugate to both b and c, then there are elements
f
other.
and S such that
o:fb.f-t:gcg-l
Thus, if f -r g : h, then h-r - U-r il-r : g-r
on the left by f -r and on the right by /, we find
f .Multiplying
the above expressions
7-t767-17:b:hch-l
and so b is conjugate to c.
The complete set of elements C : at, a2,. . . that are conjugate to one another is called
an equivalence class, or simply a class, of the group. The identity element forms a class
by itself, since ala-t : I for each element a of the group. To find the elements that are
conjugate to a, calculate gag-rfor each element g of the group. The results may not all be
distinct.
For example, in the permutation group S:, the elements conjugate to G n are computed
in Thble B.3. The elements in the class ne {Gp,Gs,Gzzl. This seems sensible, since
these three elements are the "interchange" elements; that is, they do the same kind of thing.
Similarly, the two "cycling" elements form a second class for this group, as you should
check.
I
SUBGROUPS 471
B.4
TABLE B.3. Classof Elements
Conjugate to G12
IGpl-l :GpGoGt) -Gn
GpGpGsr:GzztGn-Gzz
G4G2G2]:GnzGzz-Gn
GyyGpGytl: GnGnz - Gzz
Gp2GpGl]2: GztGzzr - Gn
8.4. SUBGROUPS
Sometimes we can find a subset of the original group that obeys all the rules for a group;
this is called a subgroup. For example, the subset ll, Gn| is a subgroup of 53 because
(Gn)2 : 1. A second example is the set of all even integers. This set forms a subgroup
of the group of integers, since the sum of any two even integers is also even, the subset
includes the identity (zero), and each element has its own inverse within the subgroup. This
subgroup also has infinitely many elements. The group of all rotations by multiples of n 12
around the x-axis is a subgroup of the rotation group. This group has only four elements:
rotations by n 12 : -3r 12, T = -n,3n /2 : -T 12, and2n : 0. This last element is the
identity element. This subgroup is abelian, while the group of all rotations is not.
If a group G has a subgroup S and element a is not in S, then the sets of elements as;
and s;a are the left and ight cosets of the subgroup S with respect to a. Note that none
of the elements in the cosets are in the subgroup if a is not. We can then write the group
elements as
G:Sfa1S*a251...
Thus, the order of the subgroup S is a divisor of the order of the group G.
The set of conjugate elements
aSa-l
forms a subgroup S'. To see this, note the following:
1. The product of two elements is in the
set.
If
s1 and ,t2 are elements
of S, then
(asp-l)(as2a-l) : as1(a-r a)s2a-1 : asrs2a | : as3a-l
where s3
:
sls2 is also in S, and so aqa-r is in S/.
2. The associative property is preserved:
(as1a-l
*
as2a-11
* orjo-l : asra-l * (as2a-l *
3. The identity element is in S/. If S is a subgroup, it contains
the element ala-r : aa-7 : l.
aq,a-l
)
the identity. Thus, S/ contains
472
oPTToNAL ToPrc
B GRoUP THEoRY
4. The inverse of asa-l is as-la-l:
(asa-t)(as-r o-r)
-
alss-r1a-r
: aa-r :
1as-r a-t)(asa.-r)
-
a(s-r s)a-r
: ee-r : I
I
and
If the new subgroup S/ is the same as the original subgroup S, that subgroup is called
invariant, normal, or self-conjugate.
Note that the subgroup of even integers is an invariant subgroup, since
n
l2m -t (-n) :2m
for any integer n. However, the subgroup {l,Gnl of 53 is not invariant,
GpGpGpr - Gzt (see Table B.3) and Gzl is not in the subgroup.
because
B.5. CYCLIC GROUPS
a group G, then a' is also an element, for any integer n. By the first
rule of groups,at( a: a.2 : b must alsobeinthe group. Thena +b: a3 must also be in
the group, and so on. If the group has finite order, then an : I for some n.
If a is an element of
If N is the smallest integer for which aN : l,
then
N is called
the order of the
element a.
The set {o, o2 , . . . , aN : 1} is called the period of a.Itis a group called a cyclic group
Clv. If the order of an element is less than the order of the group, then its period forms a
cyclic subgroup of G. In the permutation group of three objects, ,S3, each of the interchange
elements has order two, while the other two elements (Gzy ard G3p) have order three.
Then, for example, the period [Gn, GLz: 1] forms a cyclic subgroup of order two. The
symmetry of cyclic groups was discussed in Section B.2.
8.6. FACTOR GROUPS AND DIRECT PRODUCT GROUPS
If S is an invariant
subgroup of a group G, then we can identify another group, called the
factor group G/S, whose elements are the cosets aS. The group operation for the factor
group is defined by the rule
(aS)*(bS):(ab)S
where the product ab is computed using the group operation of the original group G. The
invariant subgroup itself acts as the identity element. The inverse of aS is a-l S:
(aS)
* (a-lS; : 1aa-r)S :
S
j
8.7 ISOMORPHISM
473
The group structure of the factor group follows directly from the group structure of G itself.
If the subgroup has m elements, then the order of the group is mn for some integer n, and
the factor group has order n.
A group may be written as a product of subgroups under some circumstances.
G is a direct product of subgroups 51 and Sz, G
:
51
x 52, if
(i) all the elements of 51 commute with all the elements
(ii) every element in G may be written as
8
:
of 52 and
sl't2
where s1 is a member of the subgroup 51 and s2 is a member of 52.
Then the product of two group elements is
gi
I
i
:'t1'is2't't1'is2'i
: sr,isr,jsz,is2,j by property (i)
: sl,ks2,k since 51 and 52 are subgroups
:8k
and is
ofthe correct form (ii).
B.7. ISOMORPHISM
We can relate the elements of two groups to one another by a mapping. The mapping may be
one-to-one, in which one element of Gt maps to exactly one element of Gz; many-to-one,
in which two or more elements of G1 map to a single element of Gz; or one-to-many, in
which a single element of Gt maps to more than one element of G2. A mapping is said to be
"onto"if themappingmapsallof Gr ontoallof G2.Forexample,wecanmaptheelements
of the permutation group of three elements, 53, to the group of integers by assigning each
permutation a number, I through 6.
I -+l
Gp --> 2
Gp --> 3
Gv --> 4
Gp2 --> 5
G3e -+ 6
This mapping is one-to-one but not onto.
We can also construct a mapping from the integers to the permutation group 53. We divide
each integer by 6 and note the remainder. That element maps to the permutation group as
474
oPTroNALToPrc B GRoUPTHEoRY
follows:
Remainder maps to permutation group element
0
1-+1
2
3-+Gn
4
5
-+
Gzrz
-->
Gn
-->
-+
Gzz
Gnz
This mapping is onto but not one-to-one, since, for example, both 6 and 12mapto G312.It
is many-to-one.
In an important class of mappings called homomorphisrns, the mapping also "preserves
that maps group G to group
the operation." To see what this means, consider a mapping
f1. Then we can write
/(g; : h.If /
/
is
a
homomorphism, then
fQr*s):f$ixf(sz)
(B.4)
For example, the mapping that maps the rational fractions to the integers via
n
-m
--+ nm
is a homomorphism, since
, (h.'a) : , (#)
:,'P'?'Q
: (nm)(pq): f (;) - ,G)
This mapping is onto (since every integer is also a rational fraction of the form n I l), but
not one-to-one (l and f both map to 6).
A homomorphism that is also one-to-one and onto is called an isomorphism. The group
of rotations in a plane is isomorphic to the group of 2 x 2 orthogonal matrices, since each
rotation maps to one matrix, and vice versa.
8.8. REPRESENTATIONS3
B.8.1. Reducible and lrreducible Representations
are isomorphic to one another are the same in a fundamental sense. An
isomorphism allows us to represent the elements of an abstract group with the elements of
another group. This is a useful idea because the elements of the isomorphic group can be
mathematical structures such as the matrices that represent rotations. The group of order
two can be represented by the numbers 1 and -l through the isomorphism /(1) : I and
All groups that
f (a): -1.
3In this section, we choose to draw conclusions from examples, ratber than by explicit proof. For proofs of the
results presented here, the reader should refer to the bibliography.
,
I
l
I
475
8.8 REPRESENTATIONS
We can use a homomorphism to construct a representation of an abstract group as follows.
We allow the group elements to operate on a vector space of dimension n. For example,
to represent the permutation group Sr, we can label the points of a triangle with x and y
coordinates, thus allowing the group elements to operate on two-component vectors. Then
A representation of an abstract group G is defined as a homomorphism /: G
where M is a group of n x n matrices. This representation has dimension n.
-->
M,
If the mapping is an isomorphism, the representation is said to be faithful.
Every group has a trivial one-dimensional representation in which every element of the
group is represented by the number 1. This mapping clearly preserves the group operation,
but we have lost a good deal of information about the group structure; the representation is
not faithful.
Another representation, of dimension equal to the order of the group, may be formed
using the group multiplication table. We form n-dimensional vectors whose components
are the elements of the group, in some order, We label the group elements gt, 92, . . . , Bn.
Then we can represent any element g; with an n x n matrix M; that has only one nonzero
element in each row and each column. If
gigj
:8k
then (M;) 1a : 6mk. This matrix represents the multiplication properties of element g;. For
example, the cyclic group C3 of order three whose multiplication table is given in Table B.l
may be represented by the following matrices:
Mt:
(r l
l) M2:(l i ;) M3:(: ? i)
where M2 represents element
a
and M3 represents b. Then
*(i) :(li:X',):( i)
which correctly reproduces the second row of the multiplication table. Note also that
M2M2
-
(li3X?il):(:ls)
-Mz
M2M3
-
(ii:Xris):(i:i)
-Mt
and
(B.s)
476
OPTIONAL TOPIC
B GROUP THEORY
The matrices themselves satisfy the same multiplication table, as required for a representation. This representation is called the regular representation.
A one-dimensional representation of the same group may be formed from the complex
numbers
fi(1) = 1, fr@) : ,2ni/3, f1(b) :
safti/3
(B.6)
Recall from Chapter 2, Section 2.1.1 that multiplication by a complex number represents a
rotation. So each of these elements may be visualized as a rotation in the complex plane.
Extending this idea, we may form a two-dimensional representation using the 2 x 2
rotation matrices:
(t?)
We may also use the
r/-r
-t
z
\ -./r
J1
-1
) :G-f)
(B.7)
3x3 matrices:
Ml:
ML:
(-f,, f,i: i)
-Jt/2
-r/2
0
i)
(8.8)
These 3 x 3 matrices operate on three-component vectors, but they leave the third component
unchanged. Labeling the components .r, y, and z in the conventional way, we may write
any vector in the space as the sum of a vector u in the x-y plane plus a vector w parallel to
the z-axis: v : u * w. The group elements leave w unchanged.
We may generalize this result. If an n -dimensional space V may be written as the sum of
an m-dimensional space and an (n - m)-dimensional space, (V : U I W) and if, under
the group operations, vectors in 14/ remain in I4z, then the subspace 17 is invariant under the
group. As we saw in Chapter 1, we can change the basis vectors of the vector space. The
entries in the matrix will also change. The matrices with respect to the two bases are related
by a similarity transformation (equation 1.78):
M'
:
SMS-r
(B.e)
The corresponding representations M and Mt are said to be equivalenl. But in the primed
basis, the invariance ofthe subspace W is less obvious.
If we reverse this argument, we can see that invariance of a subspace under a group means
that there will be a basis for which the group representation makes the invariance obvious.
The representation is said to be reducible.
8.8 REPRESENTATIONS
477
A representation is reducible if g; is represented with respect to some basis by a matrix
Mi thattakes the form
Mi:
(
f' ";,)
where A;, B;, and D; arc submatrices of dimensions rn x m, m x p, and p x p, respectively, and n : m * p is the order of the representation. Then the group element g1g; is
represented by
(
f' '; )(t'
where the submatrices A*,
u;,): (
^'f,
Bu
o''b!,u,'',
)
: ( f- ';r)
and Dp are of the specified form and
A,u * Biur \
(f,3; )(L): /(r,,)
so that the subspace I4z is invariant under the group.
completely reducible and is written4
If B
:
0, then the representation M is
M:A@D
This corresponds to the case in which both subspaces U andW are invariant under the group,
as is true for the representation (B.8). For finite groups, every reducible representation is
equivalent to a completely reducible representation.
If the submatrices A and/or D are reducible, we may continue this decomposition until
M is diagonal or the remaining submatrices are not reducible. The resulting representation
is then said to be ireducible.
In the representation (B.8), the matrix D is a I x I matrix-that is, just a number. The set
of matrices D; is the trivial representation of the group C3; each Di : 1. We can continue to
reduce the representation (B.8) by diagonalizing5 the matrices Ai. We can do this because
41 is already diagonal and A2 and A3 give rise to the same equation (equation 1.80) for the
eigenvalues:
), +Jit2 :0
I +Jitz -) - x
| -+ -
4The symbol O does not imply addition in the usual sense. It cannot, because the matrices A and D may not have
the same dimension. They are to be "added" by forming a larger matrix with A and D along the diagonal.
5See Chapter
l, Section
1.6.5.
478
oProNALroPrcBGRoUPTHEoRY
So
| . J,.
.
),: -;*;,
- ,2ni/3 o, "4ni/3
This solution is just the representation (8.6) that we found above. There is, in fact,
a
second
representation here:
: l,
f21;.)
fz@)
: u4tri/3,
fz(b)
:
r8ni/3
- ,2tri/3
G.10)
Thus, the irreducible representations are the trivial representation, (8.6) and (B.10). The
regular representation and (B.8) may be reduced to the form
/t o o\
lt o o
r
o
M;:l
M',':l
o
l, ' s ,2ni/3 o
'
\o o r)'
\o o
/t o o \
o I
M;:l s ,4nit3
g
,zni/3
\o
\
I,
,+"iF)
(B.u)
f
The representation has now been fully reduced and may be written
M:T@\@Fz
@.12)
where 7 is the trivial representation and F1 and F2 are as given in (8.6) and (B.10).
In a direct sum ofinequivalent, irreducible representations (irreps), as in equation (B.12),
a given representation may appear more than once. The representations in the direct sum
need not, and usually do not, all have the same dimension.
B.8.2. Orthogonality of the lrreps
Using the representation (B.11), we can form a number of three-component vectors by
selecting one component from each of the matrices that represent the group elements.
These vectors are orthogonal:
3
/
..\*
)-.(ui)/P rrrt;n Ei.
K6rq
K is a constant. Here the different irreps are labeled by
group elements. For example, with p : 1 and Q :2, we have
where
I x 1* l r
r2tri/3
+t
x e4oi/3 : |
p and q,
and k labels the
- ++ |tJl - l- jiJi :o
while with p : q : l, we find K : 3.
Now we want to generalize this result to a group of order m in which one or more of the
irreps have dimension greater than one. Let us label the distinct, inequivalent, irreducible
l
l
479
8.8 REPRESENTATIONS
representatio\s M1, M2, .. .. Within representation i, with dimension ni, a gtoup element
g is represented by the ni x ni matrix MiG). The representations satisfy an orthogonality
relation that we may write as
:
{ill rn[M i G-\],,
\tu,
K 3 ij 3 p,
I
where
6;;
-+
K
depends on 4 and
r. We can determine the value of K by setting
; : j.
Then
1, and
{il) rnlMi (s-r)1,,
\tu,
-
K 6 p,
c
Now we take the trace; that is, we set p
matrix product:
\tu,
p
1
{s- )l, r[u
Since the trace
of
i {B))
6os
:
q
p
:
lM i G-
:
1
)
s and sum from
M i (E))r q
:
I to n;. On the left, we have
1
lM i G- g)1, q
:
lM i (l)1, n
:
the
3, o
ni,
Du,n -,n6ra - Kni
and
u
:
#u,n
Thus, the orthogonality relation is
\
I
ru, {dl rn lM 1 G-r)),,
When the M; arc unitary, we may write this relation
:
L
6,n
;i
E
p
(B.13)
as
{il) onlM G\!, : L lij
l1u,
I
1
3
6
p,
Dq,
(B.14)
We can use this relation to determine a limit on the number and dimensionality of the
irreps. As we did with the irreps of C3, we can think of this relation as the dot product of
zz-dimensional vectors, wherc m is the order of the group. Here the vectors are labeled by
two indices, p arrd q, each of which runs over ni values. There are nf such vectors, and
they are all orthogonal. Similarly, s and r each run over n j values. Furthermore, these n2,
480
oPTroNALToPrc B GRoUPTHEoRY
vectors are orthogonal to the first nl vectors. In total, there are
vectors in the m-dimensional vector space. Thus,
!,
nf mutually orthogonal
t n?.^
i
In fact, it can be shown that the equality holds:
(B.rs)
\"?:*
We found three irreps for the group C3, each of dimension one, and so the left-hand side
of
equation (B.15) equals 3, the order ofthe group, as required.
B.8.3. Character
Equivalent representations are essentially the same. It would be nice to have a convenient
way to characteize representations that labels all equivalent representations the same way.
Again we can review the properties of matrices (Chapter 1) to find the one that we need: It
is the trace,6 since
Tr (SMS-r)
:Tr
(M)
The character of a representation is the set {1; } of traces of the matrices that represent
the group elements.
The traces are not all distinct. In fact, every element
elements g; and gi are representedby M; and Mi,then
TrlM(sisi7it)l
:
Tr
in a class has the same fface. If
(M1M;M,\:rr 1u)
(8.16)
The character of the representation (B.8) of the cyclic group C3 is the set
{3, 0, 0}
and the character of the regular representation (B.5) is the same. Since the characters are
the same, the two representations must be equivalent.
It is most instructive to consider the characters of the ineducible representations. In
the case of C3, or indeed any abelian group, the irreducible representations are all onedimensional,T and the traces are just the numbers themselves. The characters of the three
ineducible representations that we have found m
6See Chapter 1, Problem 40.
TSee Problem ?.
8.8 REPRESENTATIONS
481
vectors, where m is the order ofthe group (equal to 3 for C3). These vectors are mutually
orthogonal in the following sense8:
lfr
:l.xf
<x!)* :6ob
(B.17)
i:l
The index i in this expression labels the elements of the group, and a andb label the different
characters.
We can obtain a second relation by noting that all elements in the same equivalence class
are represented by matrices with the same trace (equation B.16). Thus, equation (B.17)
becomes
*Lo,^;rx!)*
:6ob
(8.18)
where N is the number of distinct classes in the group and p j is the number of elements in
the 7th class. This constitutes a second orthogonality relation for the N-dimensional vectors
J p14. Since there can be no more than N of these, we find that the number of inequivalent,
irreducible representations is less than or equal to the number of classes in the group. Once
again it turns out that the equality holds.
Let's check this result for the group C3 . There are three distinct classes, since each element
forms its own class. Thus, there are three inequivalent, irreducible representations, as we
have found.
We can use the results found so far to determine the character table for any finite group.
Let's find the character table for the permutation group9 53. There are three classes,
and so there are three irreps. One is the trivial representation of dimension 1. Then, by
equation (8.15), the other two irreps have dimensions that satisfy
nl+ n2r:5
Thus, n2 : 1 and n3 : 2. With a one-dimensional representation, the multiplication
table for the characters is the same as the group multiplication table. Therefore, for the
"interchange" class, we must have Xl : I and thus 1; : *1. For the "cycling" class,
x? : l. We also have x1y" - Xi , so Xi : -l and X" : 1. For the last representation, the
character of the identity equals the dimension of the representation, here 2. We have now
obtained the results shown in Table B.4.
8Forone-dimensionalrepresentations,equation(B.14)withp:q:r:r:landn;:lreducesto
I has been replaced by i, and i and j have been replaced by a and b.
equation (B.17). In equation (B.17), the label
9Recall that this group has order six.
482
oPTToNAL ToPrc B GRoUP THEoRY
TABLE 8.4. Character Table for
53
Identity
Class
Three-cycles
Interchange
Number of elements in class
Trivial rep I-dimension 1
1
J
2
1
1
1
2ndrep M2-dimension 1
3rd rep M3-dimension 2
1
-1
1
p
a
2
Then the orthogonality relation (B.18) gives us two equations for
u
and
p:
2-3a-l2P:g
2+3ot-l2B:g
Thus,
0:-l
and cv:0
We can understand these results by referring back to the symmetries in Figure B.1. Let
the triangle be in the x-y plane. Then the second representation has basis 2; the cycling
elements correspond to rotations about the z-axis and so leave the z-axis invariant. The
interchange elements correspond to rotation about an axis in the plane; these operations
invert the z-axis. The third representation must have two basis vectors in the plane of the
triangle. With axis A labeled as the x-axis, the interchange elements are represented by the
reflection matriceslo
",,
:(;
-?
)
GB
: (-'fr,, -{;/'
),
G23:
( -!^fr,, {i')
while the cycling elements are represented by rotation matrices representing rotations of
2r I 3 and 4n f 3, rcspectively :
G23t:
(-_lJir,,
_{;/') -o Gt3z: (-'fr,, _{i')
Notice that these matrices correctly reproduce the character table. Check that they also
satisfy the multiplication table for the group.
The character of a fully reduced representation is the sum of the characters of the irreducible representations that compose it. This result can be helpful in determining the
decomposition of any particular representation.
B.8.4. Representations and Physics
Here we give a brief example to illustrate how group representations can simplify our
understanding of physical systems. The ammonia molecule is a pyramid with the nitrogen
loRefer to Chapter 1, Problem 8.
8.8 REPRESENTATIONS
483
atom at the top and three hydrogen atoms in an equilateral triangle below (see Figure 8.2).
The group 53 is the symmetry group for this system, because we can permute the hydrogen
atoms but we must leave the nitrogen molecule where it is.
FIGURE 8.2. An ammonia molecule is shaped like a tetrahedron with the nitrogen molecule
at
the top.
We can obtain a physical representation of the symmetry group by using real threedimensional space as the vector space ofthe representation and asking how a vector in threedimensional space transforms under these symmetries. This representation is reducible. To
see how it decomposes, we can compute the character. The trace of the representation of the
identity is, as always, the dimension of the rep, here 3. We can take one of the interchanges
to be reflection in the x-z plane, represented by the matrix
'*:(3 ;l)
with trace
* L The cycles
(B.re)
are rotations about the z-axis. One of these is represented by
M{,":
('f;,, -f/'
I)
with trace zero. Thus, the character is
:
^v
Comparing with Table B.4, we see that Xv
13,
:
r,oy
Xt
*
Xr and thus
Mv :T @Mz
484
OPTIONALTOPIC B GROUPTHEORY
In particular, the trivial representation is included. This means that we can find a vector that
remains invariant under the symmetry group and hence, for example, the molecule can have
an electric dipole moment.
A magnetic dipole moment, however, is a pseudo-vectorll that transforms differently
under reflection. The representation (B.19) changes to
/-t o o\
,,f,": (, .;
_i )
3
with trace - 1. The character of the new rep is {3, - 1, 0}, and the decomposition is Mz@ Mt.
The trivial representation is not included. Thus, the molecule cannot have a permanent
magnetic moment. Notice that we did not need to know any of the physical properties of
the system----other than its geometrical structure-to draw these conclusions.
8.9. GENERATORS OF GROUPS
The groups we considered in Section B.8 were primarily finite groups. But many of the important groups in physics have infinite order. In groups such as the rotation group SO(3),'the
group elements depend on a set of continuous parameters-the rotation angles in SO(3)and we can express the group elements in terms of these parameters. The group elements may
be expressed in terms of generators that relate each group element to the set of parameters.
It is convenient to relate the identity element of the group to the parametet zeto.
We can understand the idea behind generators by considering the group of rotations in a
plane-that is, SO(2). Each group element may be represented by a matrix of the form
R(o): / cos?
[_sind
d\
)
parameter is the angle 0, and with 0 : 0 we obtain the identity
sin
cose
element,
Here the relevant
as required. We can express the cosines and sines in exponential form and then use the Pauli
matrices as a basis. These are
"t:(? I
) *:(?
1)
and
",:(i
-? )
Thus,
R@):"*, ( ;
?
).'i.o ( -? I )
lcos0 *iozsin0
For very small rotations
so we may write
d
((
I (that is, "near" the identity element), cos d ry l and sin 0 ry 0,
R(9)
t1
See Appendix
:
exP(io20)
I and Optional Topic A, especially Problem 9.
(8.20)
B.gGENERATORSOFGROUPS 485
At this point, you might ask what it means to have
The meaning is clear
a
matrix
if we define the exponential by its
as the argument of an exponential.
series expansion:
eM:t+MI+1****...
2
Note that, in general,
,A,B:
(t*^+jn-A,+..
-rr
- . -A+lB
+.A.*lE
I
Xt.n+jm-u*
* rA*
^4,
*B
*
I
I
r1-r
7, -rA+lB +lB x.A* 2B xA*A + rB
:
+lE
*'..
+A
*...
1
1,\*lE
*,lE
)
eBea
Expanding the exponential in the rotation matrix, we have
R(g):
t*
io20
-!o?e'-lto"tt
2-z- 6--2- + loteo
24-. + *ioles
120 " -
But
"?:(l
f-.ofeu
720 "
+..'
tXl -t):(l ?)
so we may simplify:
R(g): t*io20 -!e'-lioze3
2 6' + !.eo
24 a liozes
120'
- *e6+...
720
:, -)r' * *t^ - fit'+...+ o(e -it' * *t'.
)
:cosd lio2sin9
This suggests that the representation
R:
exp(ioz9)
holds for all values of 0, not just small ones. The matrix
S:io":(
-
o I\
\-t
0)
is the generatorl2 for the rotation matrices.
We can extend this idea to get the generators for the full set of three-dimensional rotations.
We need the three matrices
s,
/o o o\
/ o o l\
:lo o -l I, sr:l o o o l,
\0, o)
\-r oo)
and
o\
/o
s::11 -lo o
\o oo)
l2Whether S or a2 is taken as the generator is primarily a matter of taste. ackson uses S.
I
486
B GRoUP THEoRY
oPTToNAL ToPrc
Then any rotation may be written as
R
:
(B.2r)
exp (-<o . S)
where the dot product is
ro
If the vector
<o
.S
:
arrSr
*
azSz
*
@3S3
:
I\-rt ;'
/
o
-(D1, ar \
o'-;;
at o
(B.22)
I
)
has only one component o3, we get back our previous result for rotations
about the z-axis. The other components generalize the result for a rotation about an arbitrary
axis described by the vector c). Note that since the rotation matrix has determinant +1, the
(MI)1.
. S is traceless
ldet (eM) "Tr
It is useful to note the relations between the matrices S;:
matrix
<,r
SrSz
:(l? ;X r li):(lls)
and
SzSr
:(-l
So
SrSz
li)(sl :):(lil)
(:sl)
-
S2S1
:
-Sa
Similar relations hold for the other products. Since the gener4tors do not commute, neither
do the rotations they generate.
The rotation group is a subgroup of the Lorentz group (Jackson, Section I 1.7;Arfken and
Weber, Section 4.5). The full group has six generators: three for the three components of the
relative velocity of the frames and three for the three parameters of the rotation (direction
ofrotation axis and angle ofrotation).
The unitary group SU(2) is generated by the three Pauli matrices and is closely related
to the rotation group SO(3). They are homomorphic, not isomorphic. That is, we can find a
mapping that maps two elements of SU(2) to one element of SO(3).13
Now let's generalize. Suppose that elements near the identity in a group G can be described in terms of N real parameters ay. and that the identity element corresponds to ap - 0
for each k. The identity element is represented by the identity matrix:
M\a)lo'_.o
1
3See
Arfken and Weber, Section 4.2, pp.232-236.
:1
B.1O LIE
ALGEBRAS 487
where we have written the N parameters as a vector cr. Since the parameters describe the
matrices continuously, we may use a Taylor series to write elements near the identity asla
M(du)
: M(0).
#|":ooon+
. - M(0) + t (-,
#1":,)
:l*iXtdo&
,*
(B.23)
where the repeated index ft implies summation. The quantities
xn: -i
Yl
dqr la:o
are the generators of the group. If cr has only one component 6u, we can generate the group
element represented by M (a) by repeating the operation represented by M (a I fl) a total of
p
B times. (You can understand this by thinking about rotations-rotate through an angle 0 I
about the same axis B times.)
M(a)
:
I
l,
/
a\1f
\e ))
This element must be a member of the group if M(ql F) is. Now we let B --> a, af p
becomes differential, we obtain M(ulfl) from equation (B.23), and we generate M(a)
through the limit:
M(u)
: nl,(;)]' : /11 (, .,+)'
Expanding the expression on the right, we have
(,.,+| -, * u'+ . P+1,+)' *
M(a)
* iax**)fioxol' +...
-+
1
-
si"xr
asB -+ oo
(8.24)
Thus, the structure of the group is defined by the elements near the identity.
B.10. LIE ALGEBRAS
Suppose a group G with elements R; has generators
equations B2l and 8.24)
$. Then we can write (compare with
R : et'S
l4The introduction of
i
here is to make the resulting generators Hermitian.
488
oPTroNALToPrc B cRoUPTHEoRY
In particulaq there is a set of elements near the identity for which
R;
with e;
(
1.
-
asisi
1
*eiSi +:t?t
+
For such an element,
Rlt :
Now
:
e-eisi
:l -eisi +)t?s? +
if R is unitary, then
det (R;)
and so Tr
(&)
:
RtrR'Ri
:
exp [ei Tr (S;)]
:
1
0. Let R7 be defined similarly. Then
-R;'(l +r,s, +:r?s?* )o,
-rr
-' r
_rrr
-,
:
eiR; rS;R;
Ei
*
/
(l -e;s;
I ^
tt?Rjtsr2n;
+
I *eiSi *e;e;(SiS;
+...
1..\ /
I ^ ^\ I " "
+
yisi )$(t +e;s; * r'iti
) ;isf
-
S;S,)
+:t.S?
where we have dropped terms of third order in the small parameters ei and
compute
RlrR;lnin, : (t - e;s; *
)'f *)(r
:l+e,€j(&Si-SiS,)
+ r,s,
r
e;e;(s;s;
-
srs;)
e
j.
Finally, we
+;'?S?)
to second order in the es. Now this element of the group can also be expressed in terms of
the generators, so we must have
1
*
e;ej(s;si
- si$) :
s.'ii's
: t +lrilr;rs*
k
In particular, if we write
,!, :
(SiS;
t;t 1o!r, then we must have
-
57S;)
= tsi, S;l :1"!,Sn
(B.2s)
k
This is a fundamental property of the generators
Rl andR; thatwepick"d.il;';;ff
,S;,
inde
particular
elements
"i;;;;lledthe #:i,H;l:fft:
i
j
I
B.1O LIE
ALGEBRAS
489
Because of the antisymmetry of the commutator [S;, Sy1 (equation B.25),
u!,
: -aji
(B.26)
We can now compute the double commutator:
rsi' rs''
s'lrr:;;fJ
,l:;3
_l]:ll,-i#
tl
ui1,s^l :\ufi,lsi,
lt,,I
Lm)*mn
s-l
: D"ToDoi^s,
Then the Jacobi identity
ls;, [s;, sr]l + [s;, [sr, si]l
* [S*, [si, s;JJ :0
(8.27)
which follows from the definition of the commutator, gives another constraint on the structure constants:
u!^s, +t t afiaf,^sn : o
tnrnD ul1,a'!^sn +DD"t
nm
nm
tnmt s, (afi,ui* + uf;a|- + affai^) : o
\
("Tr"f* + afraj
^
+ uffai*)
:o
(8.28)
For the rotation group SO(3), we found
/o -r o\
0 0 l:sl
\o o ol
SrSz-SzSr:l I
so
of, -
1.
Then, from equation (8.26), we have
for this group are the Levi-Civita symbols
a), : -l.In
ofj : tij*
fact, the structure constants
@.29)
i
490
oProNALroPrcB GBoUPTHEoRY
Although we used a specific representation to calculate the structure constants, they are
independent of the particular representation.
We can check that relation (B.28) is also satisfied:
D ("yo"y^ + af,aj* + ufiai^) :l
:l
(ei*^enim
: j,6*i 6
*
6ii6*n
Eki*trr*
-f
6p$;1
-
(ei**eimn
I
e1a;*ei^n
I
e;i-e1,*n)
* rrr^iru^)
3;r3it a
3;n31*
-
6i"6*i
_n
-u
If we define the commutator as the multiplication operator for the generators, the vector
space of generators, with both addition and multiplication defined, becomes a field or
algebra called the Lie algebra for the group G.
A Lie algebra is a vector spacels of elements u; with an operation (the bracket or commutator) that satisfies the following rules.
1. The bracket is linear:
faur +
bu2,4l : alur,
uzl
I
bfuz, uzf
and
lu1. au2
-f bql
: afut, uzl I
b[u1,
4l
2. The bracket is antisymmetric:
lut,uzl
- -[uz,ui
3. The Jacobi relation is satisfied:
lq,luz,uzlf *fuz, [u:, ur]l *
[us, [ur, uzff
:O
A
group whose generators form a Lie algebra is called a Lie group. Lie groups are
tremendously important in physics. The group structure is determined by the Lie algebra
of the generators, as expressed by the structure constants. The rotation $roup, the Lorentz
group (which includes the rotation group as a subgroupl6), and the symmetry groups of
particle physics are all Lie groups.
PROBLEMS
I
Strow that the set of permutations of two elements is a group. What is its order? Write the
multiplication table for this group. How many classes are there? Is the group abelian?
15SeeChapter
1.
l6See Problem 15.
PROBLEMS
491
2. The symmetry group of a square contains those operations that
leave the square unchanged. Show that this group has eight elements, and write the multiplication table.
What are the classes? Are there any subgroups?
3.
4.
S
6.
7.
Show that the set {1, -1, l, -l} forms a group under algebraic multiplication. Write the
multiplication table. How many classes are there? Show that this group is isomorphic to
a group of rotations that preserve the symmetry of a square (compare with Problem 2
and Problem 10).
Show that there are two goups of order four, and determine their multiplication tables.
Strow that any group of order n is isomorphic to a subgroup of S,.
Show that unitary matrices of the form (B.2) form a group under matrix multiplication.
Show that in any abelian group, each element forms its own class. Hence show that all
of the irreps of an abelian group are one-dimensional.
8. Showthatthe setof 2x2matricesof theform
/ * y\
(-';)
forms a group under the operations of (a) addition and (b) matrix multiplication. Show
that the group of complex numbers x I iy is isomorphic with this group of matrices in
each of the two cases.
9. The quaternions are four-dimensional complex numbers of the form Q : a + bi +
cj * dk, where a, b, c, and d are real numbers and the quantities i, i , and k obey the
multiplication rules
i2:j2:k2:-7
and
i.i
:k;
ji:-k
(a) Show that the set {+1, +i, +j, tk} forms a group under this multiplication.
(b) Show that i, j , and k may be represented by the matrices
':(;lil)':(i:;i)
.:(,il')
Determine the classes of this group.
(c) Determine the number and dimension of irreps of this group, and find the character
table.
(d) Is the representation in (b) reducible? If so, how?
492
OPTIONAL TOPIC
B GROUP THEORY
[tO.] Strow that the integers I through 4 form a group under the operation of multiplication
mod 5. Write the multiplication table. What is the identity element? How many classes
are there? Is the group abelian? Another group with four elements is a group of rotations
that preserve the symmetry of a square-that is, rotations by multiples of t 12. Are these
groups isomorphic? Why or why not?
11. Consider the mapping
/
that maps the group of rationals to the group of integers by
/m\
r \;):m*n
Is the mapping a homomorphism? Why or why not?
12. Show that all elements in the same class have the same period.
13. Show that if a set of elements {e; } forms a class of a group G, then the
[f+l
I
set {e, } of tne
inverses of {e;} is also a class.
fne center Z of a group G is the set of elements that commute with every element in
the group. Show that the center is an abelian subgroup of G. Is it possible for a group to
have no center?
15. A homomorphism / maps group A to group B. The kernel of the homomorphism is the
set of all elements of A that map to the identity element of B. Show that the kernel is an
invariant subgroup of A.
16. A one-dimensional translation operator f, translates a function along the x-axis by an
amount nd , where d is a fixed step length: Tn f @) : f (x -l nd) .
(a) Show that the set of operators 7l, forms a group that may be represented by the
complex numbers Tn
=
e-ik'd . What
are the conesponding basis functions?
(b) Work out the orthogonality relation (B.14) for this representation, and comment.
You will have to make some changes to account for the infinite order of the group.
(c) Nowlettheoperatortranslatebyanarbitraryamount xt:T(xt)f (x1 : f @*xt).
What are the generators of this group?
17. The water molecule is an isosceles triangle with the oxygen atom at one vertex and the
hydrogen atoms symmetrically located on either side. Determine the symmetry group
for this molecule. May this molecule possess a pefinanent electric dipole moment? What
about a magnetic moment?
[ft.l fne molecule SbSs is square-pyramidal. Four
S atoms form a square base with the Sb
atom at the center. The fifth S atom sits at the top of the pyramid. Determine the symmetry
group for this system. What is the order of the group? Work out the multiplication table.
How many classes are there? Determine the character table. May this molecule possess
a permanent electric dipole moment?
19. The Lorentz group has generators that are 4 x 4 matices with mostly zero elements.
The matrices K; are given by
/01 o o\
Kt:
l'Isl,)
I
PROBLEMS
493
and so on. (The nonzero elements of K; are the ith elements in the top row and the first
column, where the first element is labeled with 0, not 1.) Similarly, the generators S; are
given by
/o o o o\
St=
[,'sl;,J
(The nonzero elements of S; are given by a1*6) : sijk, whete ai* is the submatrix
formed by removing the top row and left-most column and j,k : 1,2,3') Find the
group element generated by K1 and also by 51 .
Compute the product of the two group elements eoKr and ebKz. Hence show that the
elements generated by K; do not form a subgroup. Do the elements formed by the $
form a subgroup? Are there any other subgroups? If so, what are they?
: ax * b (where x' , x, a, and b are real numbers and
two-dimensional representation of this group that acts on
20. Show that the transformations .r'
a+
form a group. Form
the vectors (.r, 1).
O)
a
21. A homomorphism / maps a group G to a group FI. Show that the image f (G) in H
is isomorphic to the factor group G / K, where K is the kernel of the homomorphism
(Problem 15).
@l
Xgroup G has an invariant subgroup S. If element a of group G has period N, where N
is prime, and a is not a member of the subgroup S, show that element aS of the factor
group G/S also has period N.
OPTIONAL TOPIC C
Green's Functions
Many physical systems are linear and consequently are described by a linear differential
equation. An important example is the electromagnetic field, and we shall treat this system
in detail in Section C.5. Such systems obey a superposition principle: The response of
the system to a sum of inputs is just the sum of the responses to the individual inputs. As
aresult, these systems can often be analyzed using a Green's function. The Green's function
is actually a distribution (Chapter 6) that describes the response of a physical system to a unit
delta function input.
In general, any physical system in the region 0 < .x < I is described by a differential
equation for some quantity, say y(x), plus a set of boundary conditions. The boundary
conditions provide a sort of shorthand for representing additional sources that exist outside
the region of interest. The system is given an input 1(x), and we want to find the response.
The Green's function is the solution to a similar physical problem, but with the source term
replaced by a delta function (a "point source") and with zero (or, at worst, constant) values
on the boundary. The Green's function problem is thus easier to solve than the original
mathematical problem, which is the advantage of the Green's function method.
We shall begin by discussing one-dimensional problems for which the solution itself is
zero at the boundaries. A physical system exists in a region defined by 0 < x < L.Ifthe
input to the system is 1(r), we can write it as
I (x)
:
Io'
I (x')3(x
-
x'])
dx'
If the response of the system at-x to the delta function at x/ is G(x, r'), then, by the principle
of superposition for linear systems, the response to the input 1(x) is
y(x)
:
fot
,
{*')o{*,
x'; dx'
(c.1)
The Green's function is symmetric in the two variables:
G(x, x')
:
G(x' , x)
(c.2)
495
496
oproNALToprc
c
GBEEN's FUNcToNS
If the variable is a time variable, then
G(t, t')
: G(-t' , -t)
(c.3)
This sign change is necessary to preserve causality-the response cannot precede the event
that caused it. (See Morse and Feshbach, Section 7 .3, for a more detailed discussion of this
point.)
Because the differential equation satisfied by the Green's function contains a delta function, strictly speaking G is not a function, but a distribution (Chapter 6). Indeed, the physical
solution is obtained as an integral over G, ad expected if G is a distribution. The method is
advantageous provided that the integral (C.l) is relatively easy to do.
The methods used to find the Green's function for a given problem may be classified into
three groups: (a) divide the region of interest into two pieces with the point source on the
boundary between them, (b) expand the Green's function as a series of eigenfunctions, or
(c) use an integral transform.
C.l.
DIVISION.OF-REGION METHOD
First we write the differential equation and boundary conditions that describe the system
in the absence of sources. (The boundaries may be in space, time, or both.) With no sources,
the differential equation will be a homogeneous equation. We determine the solutions of
this homogeneous equation.
Next we write the differential equation for the Green's function problem. The source is
nowadeltafunction[forexample, S(x-xl)linwhichthepositionx/ofthesourceisforthe
moment consideredfxed.This source may be imagined as dividing space into two regions,
x < x' and x > x'. We write down the appropriate solutions of the homogeneous equation
that match the boundary condition or conditions in the two regions. There will be one or
more unknown constants in these solutions.
We also have to consider the boundary conditions that apply at the intermediate boundary
that we have constructed at x : x' .We need enough boundary conditions to determine the
unknown constants in G.
Once G(x, -r') has been determined, we can use it to find the response of the system to
any input, using the integral (C.1). In evaluating this integral, we considerx/ to be variable
and x to be fixed.
To illustrate the method, consider a particle moving under the influence of a driving force
F(r) with damping proportional to velocity. The equationl describing the system is
*Edu *au:
lCompare with equation (3.5), with rng replaced by
F(t)
,
F(l).
C.1 DIVISION-OF-BEGION METHOD
The boundary conditions in time are u
function is
-+
0 as
t -+ too.
The equation for the Green's
dG
m . -fqG:6(t-t')
tlt
For
tI
tt
497
(c.4)
, the delta function is zero and the equation simplifies to
dG
^E IaG:O
which has the solution (Chapter 3, Section 3.3.1)
G: A*r(_;,)
This function approaches zero as / -> *oo, butblows up as / -+ -@. Since the appropriate
solution for / < // must approach zero as t --> -@, it must be identically zero. The system
is set in motion by the impulse applied att : tt. Thus, we can write the solution for t < tl
andfor
t > tt:
. : (o
G(t. t'\'
Since G is a function of both
t
ift<t'
1
[A
exp
(-at
lm)
if t > t'
(c.5)
and tt , we expect that the "constant" A is actually a function
of t/. Our next task is to find this function.
We imagine the delta function input at t : t/ dividing the system into two regions, with
the point t : tt on the boundary between them. To find an expression for A, we integrate
the differential equation (C.4) across the (imaginary) boundary at t : tt:
dc) o' :
[,]"' (*#+
1,,',:,"
s(t
- /) dt :7
where we used the sifting property to evaluate the right-hand side. On the left-hand side,
the first term integrates easily:
mct:|,!:,
*, 1,,',:"' G dt :
t
The remaining integral is bounded:
I',1,'
oarl s
'u*
tct(2e)
<2elAl
Provided G remains finite, this integral approaches zero as e -+ 0. Now we insert our
solution (C.5) for G in the two regions into the resulting equation. At the lower limit, t < tl
498
and
oploNALToprc
G:0;
c
GREEN's FUNoIoNS
attheupperlimit,
t > ttandG:
^lt"*p
?Ir)
-
Aexp
o]
:
(-at/m).Thus,
as
e
-+
0,
'
I zo .r
A--exp[-t')
m\m,/
and so
(o
G(t,t'):{
L'\"I'
- t
ift </
/
l;'-o(-;
l' -''l) it'
(c.6)
'''
A piecewise-defined solution for G always results from the division-of-region method.
Notice that the reciprocity relation (C.3) is satisfied by the function (C.6).
Example
times r :
C.1.
Find u(r) if the input F(t) is a force that is a constant F6 between
0 and
t
: T andzero otherwise.
The resulting velocity is (compare with equation C.1)
ift'<O
r+*/0
f+oo
ro
ifo<t'<T \lc1t,fiat'
u(r): I re)c(t.t')d/: I I
J_a \o
J__
ift,>T
I
Since the force becomes zero for
t/ < 0 and for tt > 7, the limits
of the integral are
0tof.Thus,
ift<0
u(t;:0
andforr > 0,
u(t)
:
1T
Fo
Jo
G(t,t') dt'
If 0 < r < Z, since Gis zerofort < // (equationC.6),theupperlimitof theintegral
is t:
u(r)
:
,o
F6
m
Fs
a
/l\ at'
[' ! "*p-\(-!1,
m - /
JOm
C.2 EXPANSION IN EIGENFUNCTIONS
On the other hand,
499
if t > T,
u(t)
fTl
/ d
: Fo"Jo
t']\ at'
I -exp
m '\ [ --V
m' - '/
/d,
--[t
Fs exp [\m
m
- t'l)
T
qlm
0
l
(-!v
zl\ "-- /-9,\
- &d f"^p
\ m - ) - "^'\
t
tn')l
& exp^\(-gt) f.^p^\m
(" r)/ - rll
d,
m/L
Once the force ceases, the speed decreases exponentially (see Figure C.l). Notice that
both expressions give the same result for u(t) at t : T .
ua
F11
t
T
as a function of time for the particle in Example C.l
velocity decreases exponentially at large times.
FIGURE C.1. Velocity
withaT/m: l.The
C.2. EXPANSION IN EIGENFUNCTIONS
Instead of dividing our region into two pieces and obtaining a piecewise-defined Green's
function, we can sometimes expand the Green's function in a series of eigenfunctions.
If the governing differential equation is of the Sturm-Liouville type (equation 8.1) and
the boundary conditions are of the form (8.2), then there exists a set of eigenfunctions ), (r)
for the problem, with associated eigenvalues l"n. We will assume that the eigenfunctions
have been normalized so that
l"'
r rrrr^rx)yn@)
d,x
: 3-n
(c.7)
500
oproNALToprc
c
cREEN's FUNcroNs
We require that the differential equation for our Green's function problem be of the same
form but with a constant I that is not equal to any of the eigenvalues ).r. The source is
a delta function:
d/
dc\
n \f al::- ) - s{x)c * )"w(x)G: -4n6(x - x')
(c.8)
The factor of -4n is raditional-it arises from the use of this method in potential theory.2
Then the Green's function may be expanded in eigenfunctions:
G(x, x')
:Dy,
n:0
(c.e)
@') l^@)
For the moment we consider the value x/ tobefixed.We substitute this assumed form (C.9)
into equation (C.8):
i
r^<.'
t
lft (t o ft/,
(,)) -
g (x)
*'l,
v,(x)
(x)v, (x)]
: -4r6(x -
xt)
(c.10)
Now we know that each eigenfunction satisfies an equation of the form
- sr'rr' r
* ('o'*)
)'lnw(x)Yn
-
Q
so we use this to simplify equation (C.10):
oo
D
y,
@'
)
[-
)'nw (x)
yn
(x)
*
)'w (x) y n(x
)]
:
-
41 6 (x
-
xt )
n=O
Next we make use of the orthogonality of the eigenfunctions by multiplying the whole
equation by
y^(x)
I' 2"
and integrating over the ran9e
-
x
)')w(x)yn(x')vn@)v^(x) dx
:
& to
: -0,
iU - ),)vn@) f"b ,{r)v^{r)v^(x)dx:
x
:
I"u
b:
6(x
-
xt)v^(x) dx
-4nv^(xt)
()'- )'^)y^(x') : -4ny^(x')
where we used the sifting property to evaluate the integral on the right and equation (C.7)
to evaluate the integral on the left. Then
y^(x') =
4o!-92
lum
lt
-
2See Section C.6 below.
c.2 EXpANsloN
rN
ETGENFUNoToNS 501
and the Green's function is
(c.ll)
Note the symmetry in
x
and
r/,
by equation (C.2).
as required
Example C.2. Find the Green's function forthe one-dimensional Helmholtz equation in the region 0 < x < I with y(x) :0 atx : 0 and at x : L.Hence find y(x)
it f(x) equals I fot L14 < x < 3L/4 and equals 0 outside this range.
The equation for the Green's function is
:
7fi + k'y
d2,v
which is of the form (C.8) with ).
replaced
:
6(*
-
*')
k2 and the factor
(c.12)
-4n
on the right-hand side
by 1. Thus, the equation for the eigenfunctions is of the Sturm-Liouville
form
dzY'
a;
+
r,)
^
:0
with 1", I k2.The solutions to the eigenfunction equation that satisfy the boundary
condition atx:0are
ln :
To satisfy the second condition at
x
cnsin
:
(r/l''.r)
L, we need
1/fi,L:
nn
and so the eigenvalues are
tnnt2
^,--\i)
We still need to normalize the functions. We choose the constant cn so that
1L
I
Jo
,1
lo'
Yn@)Yn@)
,in'
dx =
1
(74 dx:,]L- :1
-,-y ItL
5O2
oproNAlToprc c
GREEN's FUNcIoNS
So the normalized eigenfunctions are
!n(x):
6
tli"^(T-)
and the Green's function is (compare with equation C.11 with
4n replacedby -1)
too
G(x, x')
: '\rlif k :
nn lL for any n. Physically, this
corresponds to driving the system at a resonant frequency. The displacement becomes
arbitrarily large in the absence of damping, and there is no solution.
Notice that the expression is not valid
Thenif
f(x) : I for L14 < x < 3Lf4,
1L
13L/4
I cG,x')f(*')dx': Jr1+
I
-y(x): Jo
:?L
J
GQ,x')dx'
["'o Y-"tnT"nT
or'
rtq ? tQ - (nn lL)z ---. wrx
a
Slh
n
SlIl-
nT x,
:1\- = - L = I13Llasin-'77
L
L?kt-(nrlL\zJ11+
:
2
L
. wfx
\-
L
,
cos
I
2
iwr - cos iwt
n,
+F - <r,rtUt'
nT nft
. nrfx 2sin-5inoo sinL
2
4
-o\- L 1n, 1712 1r2
-2,
wr
n:l
fnrxlnr
-4
i odd(-1v{n.1121#F)
L'
I
n:1.
-
n
The sum is over odd values of n, because
Figure C.2.
'3
sin(nnl2):0 if n is even.
C,3 TRANSFORM METHODS
o
FIGURE
0.2
1.0
0.6
0.8
:
:2/L.
5
are
Terms up to n
C.2. The result of Example C.2 with k
Including tems up to n :9 does not change the plot noticeably.
503
*t
0.4
plotted here
C.3. TRANSFORM METHODS
We have already found the Fourier (Chapter 7, Example 7.3) and Laplace (Chapter 6,
Section 6. 1.6) transforms of the delta function. Thus, we can solve for the Green's function
by transforming its defining equation.
The Green's function for the damped harmonic oscillator is given by
d2cft.
i;
ac (t. t'\
t'\ 12qi--trrJ
+ aloc1t,t,):6(t
-
t,)
(c.13)
We may Fourier transform this equation to get
-r2c1r, tt) - 2iodc(to, t') -l ,20c1r,1'; - -J-ri'"
Jnr
Then the transform of G is
1
"iatl
G(o, t') t/2r afi - 2iao
-
o2
Inverting, we get
G(t, t,)
:*
I_: d-4, _-,-ia,
4,
(c.14)
We may evaluate the integral using the residue theorem. The integrand has two simple poles,
where the denominator is zero. These are
aP: -ia t
,2o
-
o2
504
OPTIONALTOPIC C GREEN'S FUNCTIONS
Both poles are in the lower half-plane, as we expect from causality (Chapter 7, Section 7.4).
For / < //, we must close the contour upward. There are no poles inside the contour, and
the result is zero:
fort <tl
G(t,t'I -g
For
t>
//, we close downward, enclosing both poles. Then, writing
have
: J-e-o{t-tl ("-ia<t-t) - eial-/))
- tt) for t > tl
G(t, /) "-a1t-tl,sit1{2(ta
(c.ls)
{o}* "'=
S), we
(c.16)
The result is of the same form that we would get from the division-of-region method
(Section C.l).
C.4. EXTENSION TO N DIMENSIONS
Problems defined in a region with two or more dimensions are handled in a similar manner.
The Green's function is defined in the same region of space (and/or time) as the original
function /, but both the differential equation and the boundary conditions are simpler.
1. The source term on the right-hand side ofthe original differential equation is replaced
by a delta function representing a point source.
2. The boundary conditions are replaced by
(a) Dirichlet conditions: G(*, i/) : 0 on S
or
(b)
Neumann conditions: n '
VG(i, i/)
:
constant3 on S
where S is the surface that bounds the volume of interest, V.
In case (a), we have the Dirichlet Green's function; in case (b), we have the Neumann
Green's function.
The method used to find the Green's function will be a combination of the methods used
in the one-dimensional case.
We'll begin with an example in two dimensions with homogeneous boundary conditions.
We'll consider inhomogeneous boundary conditions later.
A potential problem is defined in a two-dimensional rectangular region of dimensions
x
a b.The boundaries are conducting and are grounded. We'll choose a coordinate system
3The constajrt may be zero in some cases. See Section C.6 below.
i
1
I
C.4 EXTENSION TO
If
DIMENSIONS
s05
with x- and y-axes parallel to the sides of the region and the origin in one corner. Then the
potential satisfies Poisson's equation:
a2o+ a2o :
A*,
Ay,
-4no(x'
Y)
where the term on the right-hand side describes the sources for the potential (the charge
distribution in the region).
The Green's function is the solution to the similar, but simpler, equation
a2c_(ii')
dx'
where for the moment we consider
+
a'c;!l_.
dy'
*'
i')
:
_4n
6(i _ i,)
(c.17)
tobe fixed.
Method. I Our goal is to reduce equation (C.17) to a one-dimensional equation so that we
may integrate across the discontinuity, as we did in Section C.1. We do this by expanding in
eigenfunctions. This is equivalent to expressing the delta function in one ofthe coordinates
using the completeness relation (8.10).
We imagine the delta function dividing the rectangular region into two pieces. We can
choose the interior boundary through the point *' to be parallel to the x-axis or parallel
to the y-axis. In this example, we'll choose to divide in y (Figure C.3). Then within each
region there are no sources, and the differential equation for G is simpler:
a2G6,1') , a2c(i(, *') _ .,
------;----;.----;--;-_dx'
d!'
(c.18)
0e
0
FIGURE C.3. Dividing
into two regions: region I with y < y/ and region
y/) is on the boundary between the two regions.
a two-dimensional space
II with y > y/. The source
at (x/,
506
oproNAlToptc c
GREEN's FUNcrloNS
We begin by solving this equation by separation of variables, subject to the boundary
conditions at x : 0 and x : a. Let G : X (-t) I(y). Then
X,,Y
We divide through by G
:
+XY,,:O
XY:
X,,
Y,I
i*T:o
To satisfy the equation at aII vahes of x and y, we must have
X,,
-k2
x
and
Y,,
Y -Lz
where we chose the separation constant to be negative so that the solution for X is
a function with more than one zero. (Compare with Chapter 8, Section 8.2.) Here we need
X : sinkx. Then to satisfy the boundary condition at x : a, we need to choose the
eigenvalue kn
:
nr la
so that
sinkna:
0. The equation for Y becomes
.. tnn12
Y
,,,:\;)
with solutions4
Y
In region I, 0
: sirrhnftl and Y - ssshlll
. y { }', we must choose the hyperbolic
sine function so that
y(0)
:
0.
Then our solution has the form
Gr(i,
i') :f
,,{*' ,
y')
sirlla sinn!!)
for
v < v'
n:l
InregionII,b> y > y',wemustchooseasolutionthatiszero ary: b.Theappropriate
choice is sinh
lnn(b
G u(*,
-
y)/al.Then we have
i') :
in:l D,(x' , y') sinlla ,in1",Y9-2
for
v > v'
Now G represents the potential due to a unit "point" charges placed at */, and so the
potential
it produces must be continuous everywhere in our region. This gives our first
4Alternatively, we could choose the functions exp (*nrry/a), but the hyperbolic functions are more useful here.
5The word "point" here refers to a source located at a point in the two-dimensional space. Equivalently, this
solution also applies to a region that is infinite in the third (z) dimension, with no z-dependence, in which case
our source would be a line charge.
1
507
C.4 EXTENSION TO N DIMENSIONS
boundary condition at the internal boundary at
G1(x, yt:
y
:
yt:
,' , y'), Gg(x, y': x' , y')
oo
,, r,, y, \ sin !!! rinh'o;!'
fn--tQa7=taa
r
:i
r, {r,,
Thus, by orthogonality ofthe eigenfunctions
)
r-,
"in
(C.19)
!!a
r'
nn
nn (b
-
v' )
sin(nnxfa),
. - nr(b-y')
,.sinh
. nTTv'
' : Dr(x' , y'), sinh
Cn@' , y')
Defining the new "constant" En(x', y'), we have
n
-
Dn(x' , y')
Cn(x' , y')
sinh
[m(b - y)'laf -
sinh
(nny'la)
(The "constants" are functions of x' and y/, but remember that for the moment we are
considering these values fixed.) Then the Green's function becomes
o,
: i
En(*'
,
y')
,
y')
n:laaa
*
"inno
sinh
'o) ,' *nn
(b
-
yt)
(c.20)
and
oo
cr,
: i
En(x'
n:laaa
rin"* "'*nt(b -
Y)
(c.2t)
,inh'o-J'
We need one more condition to find the unknowns Er. We obtain it by returning to the differential equationfor G (equationC.lT). Writing G intheform D S"O, x', y') sin (nn x /a),
we have
(a\'
i {\ \a/
n:l
-l
+ f!: ,innn' }
""snsin!!!
a
lYt
a )
:
-+o01.,
- x,) 6(y -
y,)
We begin by reducing this relation to an equation in the one coordinate y that we chose to
divide the region. To make use of the orthogonality of the sines, we multiply both sides by
sin (n' n x l a) and integrate from 0 to a. Only the one term in the sum with n : rzl survives
the integration. Then, dropping the prime on n, we have
;lfff '^*#l- -o't'inn'-''
t1v
-v'1
(c.22)
Thus,E, cxsin(nnx'fa).Wecanincorporatethisresultbywriting En: Frsin(nnxtfa)
and, equivaleafly, gn : yn(y,y')sin(nnxt/a). Our expression for G now has complete
508
oproNAlToptc c cREEN's FUNcloNs
symmetry in the primed and unprimed coordinates
equation (C.22) across the boundary at y : yt:
I,'il""
if Ii
is a constant. Next we integrate
;l- ff)' ".#)dY : -4n I,''*"' u' -
Y')dY
The first term goes to zero as we let e --+ 0, since G(*, *'), and hence yn, is continuous
y : y' . We evaluate the right-hand side using the sifting property. Then we have
!
^ tt/+6
oY,
f :
2 oy ly,_, -4n
At the upper limit, y
:
y' + €
)
J',so G
:
(c.23)
G1 and, using result (C.21),we have
I oYnl : -!!! rl" cosh nn(b a
2a
2 |lly,+
At
the lower
limit, y
:
y'
- I < !',so G :
at
Y')
a
"i'hno!'
G1 (equation C.20), and so
l ovnl ::o,sinhnn(b
z 0y ly,- 2- " ------ a
Y')
(Y"ornn")')
ct /
\a
Thus, equation (C.23) becomes
nr /
l)rinh no)' *
-;Fn(cosh-"o9:a
a
sinh
nn(b
a
y')
no)') :
-4tr
"orh a /
The term in parentheses simplifies, leaving
Fn-
nsinh (nnbla)
and so the Green's function (equations C.20 and C.21) is
G(*,
i,)
: i -'=j
,inno''
4nsinh(nnb/a) a
n:l
-
a
"innn'rinh
,'*nn(b 'o)
a
&
y')
G.z4)
fory<y/and
G(i, *,)
: i -.=9 '^ ,inn'*'
a
a "in'o*
n:l
-nsrnn\nrb/a)
for y > y/.
a
"inhno!'
,'nnnn(b
a
- y)
G.z5)
Here again we note the symmetry of the function: G(i, i') : G(*', i). The symmetry
may be made more obvious by writing the function as a single expression. We define
y. :
min (y, y/)
\
C.4 EXTENSION TO N DIMENSIONS
509
and
y, :
max (y, y/)
(compare with Jackson, Classical Electrodynamics) so that we may write
@
G(i,i')' :t-.
.j . ^ rinno*'
z-/.nsinh(nnb/a)
a
a "in"'rinhno)'
a
e r'*nz(b-y')
(c.26)
n=l
Once again we find that division of the region leads to a piecewise-defined Green's
function, but this time each piece is expressed as a series ofeigenfunctions in the undivided
coordinate.
2
An altemative method allows us to expand the Green's function in a double
x and y. But eigenfunctions of which equation? Observe
(C.18)
that the differential equation
is a special case ofthe equation
Method
sum, using eigenfunctions in both
a2c_(ij
dx'
i')
+
a2c_(i_,
dy'
i')
I
),a2G
:
O
(c.27)
with the constant ), : 0. Thus, the eigenfunctions we need are solutions of equation (C.27)
with 1., not equal to zero. This is the Helmholtz equation (compare with equation 3.16 or
Chapter 8, Section 8.4.4). The solutions of this equation are again found by separation of
variables:
X,,
Y,,
V+Tl)'^n:O
Again we choose the separation constant for the X equation to be negative and then choose
the constant tobe -(nn/a)2 so as to obtain the eigenfunctions sin (nnx/a). Then the
equation for Y is
Y"
t nt:,2
v tL^n- \;) :Q
This time we also want eigenfunctions in y that ate zeto at both boundaries, so we choose
Then
and
51
0
oproNAL roptc c cREEN's FUNcrtoNS
Thus, the eigenfunctions are
XY
:
Cmn
. wrxsin mTy
sin'--'
-;:
(C.28)
AD
where we must choose the coefficient C*n to normalize the functions-that is,
lr'
Iru
(c-,"in?!a
dx dy
"inry)'
:
I
Thus,
2
wmn
-
-----
4ab
The corresponding eigenvalue is
r nn..,2 (*o
,
\2
L^,:\;)
*\
r/
Now we make use of the general result (C.l
c(i. *')
l) with
^ "in'TX
: o"F_#u
. nrx
.1,
:
,in^o!
rotlt,
(c.2e)
0 to get
"in"*'
+
,in*T!'
<^,llrr,
mntyttnnfix'
b
mry'
o= Itn a
o "n 4
-= YIy'ttn
ob
fur/a)2 -l (mrlb)2
^L^
(c.30)
Here again the symmetry in the primed and unprimed variables is obvious.
C.5. TNHOMOGENEOUS BOUNDARY CONDITIONS
In the examples we have seen so far, we have imposed the value zero on our solution at the
boundaries. This is not always the case. As an example, let's look at a problem involving
diffusion in one dimension. Suppose heat is conducted along a rod that extends from x : 0
to r : oo. The value of the temperature at x : O is a specified function of time,
T(0,t): t(t)
and T
(x,0)
:
0 for
x > 0. The differential equation for the problem is equation
AT
(3.14):
A2T
i - D ax,:o
We can convert this problem into a one-dimensional problem by using a transform. The sine
transform applied to the space variable is useful because T (x, /) is defined only for positive
CONDITIONS 51 1
C,5 INHOMOGENEOUS BOUNDARY
values of r, our equation has only second derivatives in space, and we know the value of
our function T ar x :0. (Refer to Chapter 7, Section 7.7.3.) When we apply the transform,
T(k,t):
our equation becomes
(E
ai
-ur) :0
(r)
D
at - (kV -u
a7-tk'DT : Dkl;t(t)
lT
*
(c.31)
Equation (C.31) is a one-dimensional inhomogeneous differential equation that we can
solve using methods already discussed.
From Section C.l, with the correspondence
Green's function is (equation C.6)
G(k'
t
.
t')
'
:
Thus, the solution for the transform
u
--+ T, uf
(o
lexr [-kz De
i
(t , t)
- t)l
m -->
k2
D,
the appropriate
ift<t'
ir
t>
(c.32)
t,
i"
: o*rl7
yTJo[' "*p1-t'o(t -
t')lr(t') dt'
Transforming back, we have
T (x ,
t)
:
?
J'
, /JO
o ,rn
or [' ,*,l-k2 D(t - t)lr (t)
JO
dt'|
dk
(c.33)
The expression (C.33) is most easily evaluated by doing the k-integration first and noting
that k sin kx : - (d I dx) cos kx:
T
(x, t)
:
- n?
* lr' , tt' t
cos kx exp
fo*
[-k2 D(t
-
t)f
dk dt,
We write the cosine in terms of exponentials and complete the square to obtain
: Jo[' --i:!L
O(t
J4tr
-
t')3"*n'
*' \ r,,
4D(t
\
- t') /
(
(c.34)
512
oproNALroptc c cREEN's FUNcrloNS
This last expression6 has the form that we expect for a Green's function-type solution, but
now it is the boundary condition rather than the source function that appears in the integral.
In the next section, we generalize this result to show how both the boundary conditions and
the sources can contribute to a solution.
C.6. GREEN'S THEOREM
Here we shall develop a general theorem applicable to solutions of Poisson's equationT in
a region where the solution is not necessarily zero on the boundaries.
Let O and V be scalar functions of position defined in a volume V. Then
i .roivl :9o. iv + ov2{/
and also
V
: Vv . io + q/v2o
.tvVol
Now we subtract and integrate over the volume V and use the divergence theorem (Chapter 1, equation 1.44):
f
/Jv-tV- .toVw) - v .({rvo)l-dv : Js| (ov\v r
\vvo) .ffds
where S is the surface of the volume V and fi is the outward normal. Then
lr(ro'* - vv2o) o,
:
Ir(onv -
vio)
.nas
(c.35)
This relation is known as Green's theorem. It is valid for all scalar functions with derivatives
that exist within the volume V and on the surface S.
Now we choose <D to be the required solution of the Poisson equation
YzcD:
PG)
€0
in the volume V. We choose V to be the Green's function that satisfies the related equation
v2c(i,
*'y: -4n6(i - i')
(c.36)
where * and i/ are within V. Physically, the Green's function G(*, i/) is a constant times
the potential at i due to a unit point charge at i'. In SI units,8 the constant is 4ne6. Then
: 0 in this expression. This is an artifact ofour use ofthe sine transform. The resulting
function is necessarily odd and thus equals zero at x : 0. To obtain the correct boundary condition, we must take
the limit as -r -+ 0 for positive values of x. See Problem 10'
TSee also Problem 5.
8In Gaussian units, the constant is 1, a more pleasing result.
6It appe-, that Z(0, r)
C.6
GREEN'STHEOREM 513
equation (C.35) becomes
t^^f
I @v2c Jv
cv2o) dv
: Jsl(ovc - cvo).ffds
And using the differential equations satisfied by O and G, we get
i')r
," : l,raic- cvo) . nds
l,la-o"u(i - - " (-*)]
The integrals are over the variable
have
-o(i')
.#
i,
i/
with
lroru.i')p(i)
held fixed. Thus, using the sifting property, we
: * l;ovc-
cvo)'nds
o, - G tovc /
GVo) .fids
o'
giving the formal solution for O:
,lflf
o(i') :
4"^ Jrc(i,
*')p(i)
The boundary conditions specify either O or iO ' ff on the surface, but not both. (See
Appendix X.)
(a) If O is specified on the surface (Dirichlet conditions), we choose Go(i, i') : 0 on S
and obtain
o(*')
: J[ Gn(i,i')p(*)dv - !+lt JS[ rvco.ffds
47t€g Jy
(c.37)
This solution9 is consistent with our previous expressions in the special case O : 0 on S.
(b) If iO . ff is specified on the surface (Ne rmann conditions), we choose iCrq(i, *') . n
: constant: K on S and obtain
o(i'):
+#f
hl,G^r(i,*')p(*)dv
In general, we cannot choose
K:
o-u..nds-
0, because
f vrcdv :
J,
I u<n-;(.')dv
-4tr rv
-
-4n
But according to the divergence theorem,
Iro'o
gco-p-"
o,
:
lrfc(i,
this result with equation (C.34). See Problem 18.
ir'y
.ffds
: KA
[
lrr**1ot
51
4
oproNAL Toprc c cREEN's FUNcIoNS
where A is the arealo of surface S. Thus,
K : -4nlA
and so the solution for (D becomes
o(*'):
h
lrGr.r(i,i')p(*)dv
o(*'):
#
l,GN(i,i/)p(i)dv
+
[
+
f / c.vo
frc*vr'fids+
.
nds +
]
lrras
(o)
(c.38)
where (O) is the average value of <D on the surface S. This additional constant does not
affect the physics, which is governed by the gradient of O, and for most purposes can be
removed by redefining the reference point for O.
C.7. THE GREEN'S FUNCTION FOR POISSON'S EQUATION
IN A BOUNDED REGION
C.7.1. Symmetry
The Dirichlet Green's function is symmetric in the primed and unprimed coordinates. To
see why, we begin with equation (C.35) with O : G(i, i) and V : G(fr, i';. Then
Irloru,i;v,2c1d,
:
*')
-
c(d,
*)vlcln,ll]
a3n
*;i,c1t, i') - c(i, i')i,c(i, *l]
,{ [t,u,
.nas,
The Dirichlet Green's function is zero on S, so the right-hand side is zero. On the left, we
use the defining differential equation (C.36):
-+, lv[
[c(fr, *)6(il
-
*')
-
c(d, i/)E(i
-
*)] d3fr
:0
Thus, using the sifting property, we have
G(*"
i) : G(i, i')
which is the symmetry relation we need (compare with equation C.2).
For the Neumann case, the right-hand side becomes
-!
f;ort
i) - c(i, *)tdsu
lONote that A is the area of the total bounding surface of the volume V.
pieces, A is the sum of the areas of all the pieces.
If the surface comprises several separated
C.7 THE GREEN'S FUNCTION FOR POISSON'S EQUATION IN A BOUNDED
REGION 515
Thus, symmetry may be proved only for infinitely large bounding surfaces (A -+ oo).
However, symmetry can always be imposed as an additional condition on the Neumann
Green's function.
C.7.2. The Division-of-Region Method
Because G satisfies equation (C.36), the solution is of the form
c(i.*'):-]-+f(i.i')
lx x'l
-
where V2(1/li - i'l) : -416(* - i/) (equation6.26) andY2lr: 0. The function ry'
is required for G to satisfy the boundary conditions. This form of solution is not always
useful, however, because it can be hard to integrate. Instead we can expand the. Green's
function in a set ofeigenfunctions appropriate to the region ofinterest and make use ofthe
orthogonality of the eigenfunctions when performing the integrations in equations (C.37)
and (C.38).
Here we outline the divison-of-region method for calculating the Green's function in
a bounded region of space. The method works for either the Dirichlet or the Neumann
Green's function. In the Neumann case, it produces the symmetric Green's function. Remember that in calculating the Green's function we take the coordinates of i' to be fixed.
1. Draw the volume under consideration.
2. Choose a suitable coordinate system. (We'll call
the coordinates u, u, w.) Each boundary
of your volume should correspond to a constant value of one of the coordinates.
3. Place a point charge of magnitude 1 at an arbitrary point i' within the volume.
4. Determine the solutions of Laplace's equation V2O : 0 in your coordinate system.
Note which of the functions are orthogonal functions.
5.
Choose one of the coordinates, say u, ar'd use it to divide your volume into two regions,
with the point charge on the boundary between those two regions-that is,
Regionl:u <u'
Regionll:u>u'
It is important that the functions in the other two
coordinates (u, u') be orthogonal
if you are using spherical coordinates, you may only dlide
space in r, since the only functions of r in the solution of V2O : 0 are powers and
functions. This means that
the
are not orthogonal functions. With other systems, you have some choice.
6. The equation for the Green's function is equation (C.36). Thus, in either of your two
regions (butnot on the boundary between them), V2G : 0. Thus, you may expand G in
the eigenfunctions identified in step 4. Use the orthogonal functions in u and u. You must
choose the eigenvalues and eigenfunctions so as to satisfy the boundary conditions on
G at the edges of your volume. Remember: For the Dirichlet Green's function, Go : 0
on the boundary, and for the Neumann problem, \Gyl0n : -4r lA on the boundary.
51
6
oproNAL Toprc
c
GREEN's
FUNcloNs
You will have two different linear combinations of the (nonorthogonal) eigenfunctions
in u: one valid in region I (u < u') and one valid in region II (u > ut). At this point,
your function will have two sets of unknown constants.
7. Use continuity of the potential across the inner boundary at u
G(u: u'-,u,w): G(u:
:
uti
ut+,t),u))
Equivalently, you may invoke the symmetry of the Green's function: G(i,
i') :
G(i', i). This condition will allow you to express one set of unknown constants in terms
of the other. Now you should have only one set of unknown constants remaining.
8. Substitute your expression into the differential equation for G and use orthogonality to
reduce to an equation in u.
9. Use discontinuity ofthe field at i : i' by integrating the differential equation across the
boundary atu : ut. The last set ofconstants is thereby found.
10. Check for symmetry, correct dimensions, and so forth.
C.7.3. Dirichlet Green's Function for the Region Outside a Sphere
Let's find the Dirichlet Green's function for the region outside a sphere of radius a.
Step 1. The diagram is shown in Figure C.4.
FIGURE C.4. The sphere's boundary is at r
:
a. The region outside the sphere is divided into two
regions,r<rtandr>rt.
Step 2. In spherical coordinates, the inner boundary of the region is at
r:
a, and the
outerboundary is r -+
oo.
Step 3. We put a point charge at
i/
with coordinates r'
, e'
,0' . Then we
need to solve the
equation
v2 G(*.,
*')
:
-4n 6(i
- i')
(c.3e)
with the boundary condition
G(*,i'):0
forr:a andforr -> oo
(c.40)
C.7 THE GREEN'S FUNCTION FOR POISSON'S EQUATION IN A BOUNDED
Note: At this point,
i
HEGION 517
is the variable and*t isfixed.
Step 4. Within each region, we have the simpler equation
vzG(*.,i')
:o
with solutions (equation 8.56)
c(i, i,)
:Y
(
"l- \
e,^r,* r'-'/
?3)
ym@,o)
in each region.
Step 5. The functions Y7^(0, @) are orthogonal functions, but the powers of
we divide the space in r;
Regionl:a<r<rl
Regionll:r/<r<oo
Step 6. Next we apply the boundary conditions atr :
a and
In region II, as r -> oo, G must approach zero, so AI^ :0,
Gr(i. i/)
r
are
not, so
r -+ oo.
:Dfut,^@,Q)
(c.41)
I,M
while at
r:
a, equation (C.40) requires
,
A1^at
Br,
I ffi:o
and so
BI^ : -o2ltl AlThus,
Gr(i, *')
:lt,^
(, - #)
Step 7. Next we look at the boundary at
Ym(o. Q)
r : r'. The potential
(c.42)
(G) must be continuous
across this boundary, so
o,-
F-
(t'' t' -
#)
Yt
^(0,
Q)
:
n r-!ft
h^ (0, Q)
Nowweusetheorthogonalityofthe Y1-.WemultiplybothsidesbyYf
{) andintegrate
^,(0,
must be separately
over the whole range of 0 and Q, to show that the coefficients of each I;.
equal. Then
Bt^: At*I(rrlzl*l -
o2l+r1
51
I
oproNALToprc
c
GREEN's FUNoIoNS
Inserting this result into equation (C.41), we have
Grr(*.
i')
: D !4!Yh.(0,
Q)
G
while G1 is given by G.aD. We can make this look more symmetric by writing
At* :
a1^f (r')t+l .Tlten
Gr(i,
g2t+r i') : L,.l'1,,,
ur^ (7t)r+tYt+t'
Grr(i.
i') :
a2t+t)Ym(e, Q)
DY4!#J#\Y^@,
\r'), ''r
Q)
-t_
which exhibits the symmetry of G in r and r' .
Step 8. Now we have one set of constants left to find,the a1*, and one remaining boundary
condition atr : rt . To see what it is, we go back to the differential equation (C.39). Writing
C : D ghYh and expressing the delta function in spherical coordinates (see Chapter 6,
especially Problem 8), we have
nG+YY*
+Y2uoeYt^r,^)
: -a'6(';!)srpt -
1't'st1q
-
q'1
t
l.m
Again we multiply both sides by Yi-,and integrate over the full range of 0 and d. On the
left-hand side, we get zero, except when / : lt and m : mt . We use the sifting property to
evaluate the integral on the right. Then, dropping the primes on / and m, we get
:+y -,ffr,^
,
-aos(';!)yk@,,Q,)
Now we can relabel again: Let gtm : f6Yf^(0',Q') and, correspondingly, arm:
PhYk(0' , //). Then the Green's function is symmetric in all three coordinates if pl.
is a constant, independent of all the coordinates. With this relabeling,
6(r - r')
102rf1. /( + tl -+n
,z
12 ''^
r 0r2
Step 9. Now we multiply this equation by
f :
ft:
r
and then integrate across the boundary at
C.7 THE GREEN'S FUNCTION FOR POISSON'S EOUATION IN A BOUNDED
REGION
51
9
On the righChand side, we use the sifting property to evaluate the integral:
arf,*l'*" _ [''*' /(L+l) fi^dr:
1,,-,-
J,,-, ,
f
4n
-7
Next we let e -+ 0. We have already ensured that f1- is continuous at r : rt , and r is also
continuous, so the second term on the left-hand side goes to zero. Then we have
ar!t^1,,+
: _y
r'
e.43)
3r 1,,At the upper limit,
r : rtl,
lrft^l
we are in region II, and so
a pr, le')2t+t
-
azt+lf
|
F,- lv)''*t - a2t+tf
-' , ---CryT7iiil-
u *: fi ----Gi+Gr- 1,,*:
: -rn-utr:!!!:!!I: -ttu^ () - #;)
1,,
At the lower limit, we are in region I, and so
+1,
*' t(r') +'#)
-: *fft(''*' - +)1,,*: #ft(u
: frm\/l+l,,
+l
d21+l
\
,ryo )
Then equation (C.43) becomes
/l
d2l+l \
/l+l
-ttu^\l - 6W ) - tu^\
,,
d21+l \
+t77p
)
-ft^
The r/ cancels, which it must, since
coordinates. Then
p6
2l+1
4n
r,
4n
1
should be a constant, not a function of either set of
4n
flm: 24
1
and so finally we have
Gr(i,
i') := 5-
4' 9!!'1]'*t'
t-Girrr-Ym@' ilYi'(e'' O')
|,, *
o''*'l
Grr(i. *') := \- .1t , k'f'll-l r, * t--irni-Yr^@'
QtYh(e'' o'\
52O
In
a
oproNAL roprc c cREEN's FUNoIoNS
more compact form,
(c.44)
where r. and r, are the lesser and the greater of r and r/, respectively.
Step 10. Since G is 4neo times the potential due to a unit point charge, it should have
dimensions of l/length, as our expression does. This series should converge nicely fot r > a.
As an example of using this result, let's find the potential due to a ring of charge of radius
b > a and uniform linear charge density )., lying in the equatorial plane of a grounded
sphere of radius a. Using equation (C.37), we have
o1i;
: -t
I Gp(i.i';p1*')dv' -l4tr Js/or*'lV'Gp'ffds'
4treoJv
When we are doing the integration, the coordinates * are fixed while */ is the integration
variable. With O : 0 on S, the second integral is zero:
o(i):
I
4treo
[ )\6(r' -b)8(P')
b
Jv
" rLt =2l11
+ |,
:
I
i.s
tr2!+r-r,.' 'r,'
4----------------r+t
'
',
h^@, Q)yh(0' . Q')o')2 dr' d p' d4'
.02j+t-o2t+t1
2l
+l
4n
^*24'Yt^(o'olu-ffi
ffir;"r', l,
"-imQ
4q'l
where now r. and r, are the lesser and the greater ofr and b, respectively.
The integral over Qtyields zero unless lrt :0. Then
rb \o(*): zto
1r2l+1
-
ozl-1r,,
? ffih(o)hIt)
Now we check dimensions: 2n),b is the total charge on the ring, and the term in r has
dimensions of l/length, so the answer is of order charge/(es x length), which is correct for
potential.
Note that & (0)
:
0 for / odd, so only even / appear in the sum. The first few terms for
0<r<bare
o(i)
-:t
(' :2
4 rrbz
4reob\ r " -!"
3 19 -a9
+' +
.:-(35
64 rrba
r3cos2e
cosa
g
-
-
ts
3o cos2 e
+ 3) +
521
C.7 THE GREEN'S FUNCTION FOR POISSON'S EQUATION IN A BOUNDED REGION
whileforr >
b,
O t'q1*' -o2t-tt1 n(o)Pr(P)
o(i): 4neobl
/+rbt'+r
o (r-1\
- 4zeor
/
\
b
"
o
4T esb
l(r, i)Q:*1
-*'=#(35cosae-30cos2
0+3).
]
The first term is the potential due to a point charge, and this term dominates for large r. The
magnitude of the point charge is the sum of the charge on the ring plus the charge induced
on the surface of the sphere. Thus, the total induced charge on the sphere is Qind : - Qa I b.
The solution in the equator and on the polar axis is shown in Figure C.5.
r
a
@ times 4n esb /Q in the equatorial plane (0 : n /2, solid line) and on the
polar axis (0 : O, dashed line) versus r/a. The first three terms in the series are shown
FIGURE C.5. Potential
here, with b
:2a.
C.7.4. Dirichlet Green's Function for the lnterior of a Cylinder
Again we follow the steps of the standard method. Our region is an infinitely long cylindrical
tube ofradius a.
Step 1. The volume is shown in Figure C.6.
Step 2. We use cylindrical coordinates. The volume is bounded by the surfaca p : a.
Step 3.We place a point charge at i/ with coordinates p' , Q', z'.
Step 4.The solutions oflaplace's equation in cylindrical coordinates are given in Chapter 8, Section 8.4.8. We may choose to divide the space in any one of the three coordinates.
522
OPTIONALTOPIC C GREEN'S FUNCTIONS
region
II
region
I
I
l_
FIGURE C.6. The interior of the tube is divided
Step 5. Here
dt
I:
zt
.
we'll divide the volume in z:
Regionll-z<z'
Region
ll'.2 > z'
Step 6. The eigenfunctions in p that take the value zero at p : e are the functions
J-(x*"p/a),where x^n is the nth zero of J^.We exclude the Ns because they are not
well behaved on the cylinder axis at p - 0. The appropriate functions it z are the real
exponentials,
"-x^nzfa
Gr(i,
for large positive Z
i') :l
vnd.
elx^"2/o for large negative z. Thus, we have
A^nJ^(.*":)
ex^nz/arimt for z < z'
m,n
and
Gn(i, i') :\B^,J^(.^":)
r-xmnzfa"imQ for z > z'
Step 7. Setting the two functions equal at the inner boundary, Z
DA^nJ^(-.,:)
m,n
ex^,z'/arimQ
:I
B^nJ^(-*,:)
m.n
:
Zt, we have
r-xnnztf arimQ
REGION 523
C.7 THE GBEEN'S FUNCTION FOB POISSON'S EOUATION IN A BOUNDED
Since the functions J-(x^np la)ei^Q form an orthogonal set, we may equate the coefficients
of each term in the sum separately:
A^rgx^nz' la
:
fi-ng-xmnZ' lo
= d^,
Thus,
Gr(i, i') :
,-o^,
Gu(*,
t^ (.-,:)
exnntz-z'
*') : D o^, J^ (r^":)
)
ta
eimQ
-z) /a rimQ
"x^n(z'
m,n
Step 8. Next we substitute into the differential equation for G, which we now write as
c
: D s^,(z)J^(r^^I) ,'^r
with s^nQ)
:
c"mnQXP
(TU.- .'))
(c.45)
z. and z. equal to the lesser and greater of z and 4/, respectively. Then, with the delta
function expressed in cylindrical coordinates, we have
with
:
td'ef\@]
)-.r. (*^^L\
- p,)a(Q - 6')6{z \ d/ r,^r l-fur^,k)
dz' ) -!ur,
P
L
z')
We multiply both sides by pJ^, (x-,n, pf a) e-i^'0 and integrate over p and @. On the left,
we use the orthogonality of the eigenfunctions and equation (8.96); on the right, we use the
sifting property.
!
(zD
V ko^,r1'
l-* B*nk) .
e#) -
-4n
r^(r^"+)
,-i*Q'5(z
Now we rewrite:
E^,(z)
-
ymnexp
(:u.-.,))
t- (.-"+)
Step 9. The next step is to integrate across the boundary at 7
a2 lt'^1x.,11'
"
I: "' l-*
_
r.,,*
(z' - z' |
*
-
e-i^o'
7t to find the y^n:
t-#)
rz'+e
: -4n I tk - z') dz'
2.,_t
J
As e
-+
0, the first term on the left vanishes, and the equation simplifies to
o2l${r^,)]2
dymneryQ'-z'\
dz.
-_L
o
r'
-
z')
524
oproNAlToprc c cREEN's FUNoIoNS
At the upper limit, z > z', so the derivative is
*" r(Tu' -.,)1.:., : -ry
while at the lower limit, z < z',
so
xmn
d
/xmn. ,.tl
l.;(t
)1,=,,:
o
E"*P
"
Thus,
a2
1fi 1x ^n112 r*nlY g21 : -4
and so
l^n:
,^noLi^r*^rll,
Inserting this value into the expression (C.45) for G gives
c (i,
i')
:
i2 ;frh;rt r ^ (*^n l) t - (, ^, +)
ei ^(Q
-o'
)
"
v
(! <r.- ., ))
Step 10. The result exhibits the necessary symmetry in * and i' and has dimensions of
inverse length, as required.
What is the potential if the cylinder has a band with potential V6 between z : -a and
z: *a and is grounded everywhere else?
This time only the second integal in equation (C.37) is nonzero. Remember that fr is the
outward normal, here p.
a@.Q,i: -GI f2n f+o,o AG
Jo J_" fr\,,=,adQ'dz'
I
: -#
I,* l:," ?Xnni;mr^(.*':)
rh(x^)ei^(Q-o')
" "*o (fu<r.
- ,))
a dO' dz'
: O terms survive the integration over @/, as expected for a system with
azimuthal symmetry. Thus,
Only the m
e(p, z)
: -++
W
ll."
"*o
(Yu- - z)) az'
C.7 THE GREEN'S FUNCTION FOR POISSON'S EQUATION IN A BOUNDED REGION
For e
525
> a, this becomes
a@' z)
On the other hand, for
: 24!HP ll"' "*, (Tu' - D) az'
: voD !ffiP *sinh x6, "*v (-*o,l)
-a < z
14,
Q(p,z):VoDffi# ll,*' " , (Tu - '\) a'' + f',"*v (Tu' - o) atl
: vo"ut
Jo@9'plo)
lero^11-s-,o, + e''o,t)
*onJr(xor) L
:zvoDJ-q(xo'Pla)
"
?
*orJt(xor)
l1
L
n-'o'"*h
-
I
e-xon*
k'o^I-
e-'0,)l'l
(r*1)]
This solution is plotted in Figure C.7, where we show <D/ Vs versus p /o. A few terms in the
series represent the solution well for z > a, but many terms are required for lzl < a, and
even then the expression
failsll
at p
:
a.
o
Vg
p
0.4
0.6
0.8
1.0
a
FIGURE C.7. Solution for the potential O(p) inside the tube. The solid line shows the potential at
z - a /2 (42 terms). The dashed line shows tbe potential at z : 2a (10 terms). For
z > a, fewer tenns are needed in the sum to obtain an accurate result. The additional
terms decrease exponentially.
C.7.5. Expansion in Eigenfunctions
An alternative method of solution produces a single expression for G(i, i/) throughout the
region. We write G as an expansion in eigenfunctions, as in Sections C.2 andC.4, method 2.
The penalty we pay for having a single expression is that we have a triple sum rather than
llW" hun" seen this phenomenon before in Section C.5. Here our eigenfunctions are zero at p : a, and so the
solution, a superposition of the eigenfunctions, is also zero on the boundary. The solution has the correct limit as
p -+ a from below In Figure C.7, the solid curve @(p) approaches Vg as p --+ a, but dives towatdzero aI p : a.
s26
OPTIONALTOPIC C GREEN'S FUNCTIONS
the double sum we obtained using the division-of-region method. The eigenfunctions we
need are solutions of the Helmholtz equation:
(v2 +
k\f (i; :
(c.46)
o
To demonstrate the method, let's look at the Dirichlet problem in the interior of a cylinder of
radius a and height ft. The solutions of equation (C.46) that satisfy the boundary conditions
h are (Chapter 8, Section 8.4.4)
G 0 tt p a and
:
z:
-
.f
: J^(r*,:)
sinPlrz si^o
,2
/ x.n\2
k-^np:\;)*\l,/
I PT \2
(*)
and the eigenvalues are
where x^n is the n th zero of J^(x).We must normalizethe eigenfunctions, using equations
(8.96) and (4.I2). Then, from equation (C.11) extended to three dimensions, with ). :
k2
:0,
c(*, *')
: +n \
m,n,p
r^(,-,*) '*
1^("^,+)
,rnP!Z' ,-i^o'
! [ri,,*-,,)'f;r,l(+)' . (+)')
- Ih \./'L
L \-\^+mooo
a:-a
Plz ,i-o
n:l p:l
r^(.*,:) r^(,-,#) '* ffsinPTz' ,i-1q-Q')
r * +t (x * )t2
* (T)')
t
l.r,
The dimensions of this expression are l/length, as required.
PROBLEMS
fl
Use the division-of-region method to find the Green's function for a damped harmonic
oscillator. Hence find the response of the oscillator to the input /(r) : 1 - t lT for
0 < / < T andzerootherwise.
2. Find the Green's function for a beam supported at one end (refer to Chapter 5, Problem l1) using a Laplace transform method.
Find
the Green's function for a wave on a string of length I,
3.
A2G
"a2G
,;, - "# :6k -
x')a(t
-
t')
by taking the Fourier transform of the equation in time and using the divison-of-region
method for the space dependence. Give your answer as an integral over the Fourier
transform variable ar.
I
!
,!
I
I
j
PROBLEMS 527
4. Find the Green's function for the diffusion equation
AG
AzG
rp:3(x- x')8(t-t')
;-r
by taking the Fourier transform in space and using the division-of-region method in time.
Take -oo < r < ooand0 < / < oo.
Use the result to compute the function f (x, t) that satisfies the diffusion equation
af
a2f
S(x, r)
-
;-oup:S(x,,)
with the source function
S
,-*2 /o' 3(t)
Strow that we can use Green's theorem (Section C.6) to obtain a solution for O, where
O satisfies the Helmholtz equation
(v2
+t<z)o(i): s(i)
with a source function S(*). Determine the solution for O in terms of the Green's function
when O satisfies the Dirichlet boundary conditions O (i) : f (i), a known function, on
the boundary surface S.
6. Find the Green's function for the one-dimensional Poisson equation
d2e
dx2
with boundary conditions O (x)
when
:
0 at
x
: -P(x)
:0
and
x
:
a. Hence find the solution for O
(a) P (-t) : sit (n x la)
(b) p(r) : *2(o2 - *2)
7. Use a division-of-region method to find the Green's function for the one-dimensional
Helmholtz equation
d2v
d*r+k-Y:f(x)
y(x):0atx:0and.x :
a. FindtheFouriersineseries
intheregion0 < x < a,with
for G(x, x/) and hence show that your result agrees with the result of Example C.2.
8. Sometimes we may expand the Green's function as a series of eigenfunctions even if
the differential equation is not of the Sturm-Liouville form. The governing differential
equation for the displacement of a beam is equation (3. I 1):
dav q(x)
dx4 EI
528
oproNALToprc
c
GREEN's FUNoToNS
A beam of length Z rests on a support at each end so that the boundary conditions are
y(0) : !(L) : 0. Show that the Green's function may be expanded in a series of
f,
eigenfunctions, and determine the form of the Green's function. Use it to find the beam
displacement when it is subjected to a load q(x) : ax lL. Compare with Chapter 4,
Problem 15.
fina the Green's function for heat transfer along a rod with insulated ends. The relevant
differential equation is
+At - r#:
0x
e@,t)
and the boundary conditions are 0 G / 0 x : O at x : 0 and at x : L; G(.r, 0) : 0. Treat
the problem as a two-dimensional problem, and use method 1 in Section C.4, dividing
the region in time.
10. Verify that equation (C.34) gives the correct result z(t) in the limit x -+ 0 from above.
Hint: Expand t (t') in a Taylor series about t/ : t.
11. Find the Green's function for the wave equation in three space dimensions, using spherical coordinates. The wave equation is
u2vzc
62
- atrc:
6(i
-
i')6(r
-
r')
G(i, i' , t, t'| is the displacement and V2 is the Laplacian operator in three dimensions. Transform the equation in space and time, and solve for G. Hence find the
displacement f (i, t) if the source is h 3 (t) e- o2". Co*p*" with Example 7.5.
where
12. Find the Dirichlet Green's function for the two-dimensional Helmholtz equation in
a circular region ofradius a. Obtain the result as a double sum over appropriate eigen-
ltil
functions.
pma the Neumann Green's function for the one-dimensional Poisson equation
d2Q p(x)
dxz
60
withboundary conditions dGldx:0 atx :0 and x: L. Express youransweras
series of eigenfunctions. Hence find the potential O if
lporlL ifO<x.Llz
Dlxl:
and
dQ/dx
:
0 at
x
:
0 and
x
a
<
|.0
:
otherwise
L.
14. Use a Fourier transform method and cylindrical coordinates to find the Green's function
for the wave equation in two dimensions.
15. Using the division-of-space method, find (a) the Dirichlet and (b) the Neumann Green's
function for Poisson's equation in the interior of a sphere of radius a.
16. (a) Find the Dirichlet Green's function for Poisson's equation in the half-space z > 0
using Canesian coordinates.
PROBLEMS
(b)
529
Evaluate the integral to show that G may be interpreted as the potential due to a point
charge and its image in the plane z :0. Hint: Use polar coordinates to evaluate
the integral, and use contour integration to evaluate the integral over the angular
variable, as in Chapter 2, Section2.T .2.
17. Find the Dirichlet Green's function for Poisson's equation in the interior of
a sphere
of
radius a as a triple sum over appropriate eigenfunctions.
Eal
OUtain a relation analogous to equation (C.37) for the diffusion equation
af
At
-
Dv2.f
:
S(i, r)
Define the Green's function through the equation
AG
E - Dv2G: Dr(i - *')6(t - /)
Ffint.'Notethatsince G(t,t'): G(t-t'),0Gl0t' : -0G/0t.Integrateoverbothspace
and time, and let G : 0 on the bounding surface.
Apply your result to the example in Section C.5. Apply the sine transform in space to
the equation
AG
A2G
; - o up -
D6(x
-x')6(r
-
r')
and obtain the Green's function. Show that the solution (C.34) may be expressed in terms
0Gl\xt and that this solution is consistent with the general result you found above.
of
19. Find the Dirichlet Green's function for Poisson's equation in the interior of
of radius a.
a
hemisphere
(a) Choose O < 0 <z and0 < 0 < tr.
(b) Choose0 < P < n/2 and0 < Q < 2r.
(c) Using one of the two Green's functions, (a) or (b), evaluate the potential inside the
hemisphere if O : 0 on the spherical surface and O(r) : Vo(l - r/a) on the flat
face.
20. Obtain the Green's function inside a cylindrical tube (Section C.7 .4) by dividing space
tn p.
OPTIONAL TOPIC D
Approximate Evaluation of Integrals
D.1. THE METHOD OF STEEPEST DESCENT
Often functions are defined in terms of integrals in the complex plane. The gamma function
(Chapter 2) is one example that we have seen. Another example is the Hankel functionl
nJ,)(6)
:: l,*, l; (,_:))f"
where the contour C runs from the origin to
-oo,
as shown
(D.1)
in Figure D.1.
Im (z)
Re (z)
FIGURE D.1. Path C for evaluating the first Hankel function U{t) @).The path starts at the origin
and goes to
-oo
in the upper half-plane.
Such integrals may be written in the form
I(€): II sk)expl€f (z)ldz
Jc
lThis function is introduced in Chapter 8 from
a different
(D.2)
point of view.
531
532
OPTIONALTOPIC D APPROXIMATE EVALUATION OF INTEGRALS
where f is real, (z) is an analytic function, and C is a specified contour in the complex
"f
plane. We wish to evaluate the function 1(6) in the limit that f becomes large. In this case,
the exponent
Ef
:
e)
€lu(x, y) + iv(x, y)l
has a large absolute value. The imaginary part causes the integrand to oscillate wildly, contributing very little to the integral, except where u is approximately constant. The integrand
is maximized where the real part u is maximum.
We know that we can deform the contour C without changing the value of the integral,
so long as we do not cross any singularities of the integrand.2 We can evaluate the integral
most easily if we deform the contour so that it passes through a point z0 such that
1. u is approximately constant near z6 and
2. u is a maximum at zo md decreases rapidly along
the deformed contour C' as we move
away from zg.
Then the integral will be dominated by a small region of the path very near zo.
To make u a maximum, we need
{:o
clz
atz-
(D.3)
zo
since 0u f 0x and 0u l0y are then both zero at zo. (Refer to Chapter 2, Section 2.2.2.) Now
if the function f (z) is analytic, then the function u(x, y) is harmonic; that is,
^ 02u
Y'r,t:ip+
E2u
Ayr:O
if a is a maximum with respect to x (02u/0x2 = 0), it must be a minimum with
respect to y (02u I Ey2 r 0). Thus, the point z6 that we seek is a saddle point. In order that
the contribution of the integrand to the total integral be concentrated in the vicinity of zo,
we must choose our path C/ so that it crosses the saddle in such a way that u decreases from
its value at zo at the maximum rate. (Ct looks]ike the rider's legs on the saddle.)
We also know that for analytic functions Va is perpendicular to Vu (Chapter 2, Section2.4.3).Thus, by choosing the path that maximizes the change in a, we are also choosing
the path along which the imaginary part u remains almost constant. This is exactly what we
Thus,
want.
Now
if
the function g(z) varies relatively slowly in the neighborhood of 26, we can
approximate the integral as follows:
1(6)
: If sk)expl€f(z)ldz
Jc
- skil
:
fonnn*.0 "*o
gkilexp [5,f (zo)]
lf (rt.ol
*)u - ro>'f"ko))) dz
lru,nn"*,0*o
[e
2See Chapter 2, Section 2.2.4.We may not cross a branch cut either
if.
-
zo\2
7" {zs1fdz
(D.4)
D.1 THEMETHODOFSTEEPESTDESCENT
533
where we used the Taylor series to express f (z) near eo -d truncated after the third
term. The second term [involving f'(z)l is zero. To choose the path, we set z - zs =
reiQ and f"(zd : aeia, where a and cv are real constants. Then the Taylor series
becomes
f (z) : f (zo) +
u
:
u(xo, to)
+
u
:
u(xo, to)
+
)r2
"2ia
orio
(2Q
+ q)
sin (2@
* cv)
)or'cos
and
1
1ar2
To keep u approximately constant, we choose the path so that 2Q
*u:
nn; that is,
an
Q:Qo:-t*t,
(D.s)
This path is a straight line in the neighborhood of eo. Then
u
: u(xo,ld +
)or2
cos (nn)
:
u(xo, ys)
+ (-D')ar2
Now we war,ttu to decrease as we move away from 20, so we choose n
u
:
u(xo,
yil
: *1. Then
- !or,
2
The choice of n : *l or - I is made by requiring that r go from a negative value3 before
passing through 46 to a positive value after. The integral (D.4) is then reduced to an integral
over r, with the value of r increasing throughout the range of integration:
[
(-+\z
I (E) =8(zo) exp lEf {zd)
"*p \
./pathnearzg
:
lkd
exP tE
f Qo)te'^
/,: ; #:
a, ,'oo
/
".'
(-+)
o'
3In the usual polar representation of a complex number, the amplitude r is a positive real number, and negative
real parts are described by phase angles between r /2 and 3tt 12. But here we want to describe the straight line
path with a constant phase angle {g, and so we must allow r to take negative values.
i
534
oproNALToprc D AppRoxtMATE EVALUAIoN oF
TNTEGRALS
because f is very large. Thus, we may
The exponent larz 12 becomes very large for r
extend the limits in r to aoo without appreciably changing the value of the integrala:
l0
r
(e =g(zo)
I
(il :
exp t€f kdt,'^
l::
o,
2n
k)leih
g(zo) exp tEf
^, (-+)
(D.6)
.
q,a
The result (D.6) is the asymptotic form of the integral (D.2).
Let's use this method to evaluate the asymptotic form of the Hankel function (D.l).
Comparing equation (D.1) with the standard form (D.2), we find that the function "f (z) is
t/
l\
f(z):l\'-;)
and is analytic except at z
:
O.
Similarly,
1
sQ): ,,+t
We want the path of integration to pass through 20, where
dfl
r\
t/
;1,:,,:t(t*d:o+Zo:ti
Of these choices, only
pointz6
:
*i
lies near the original path C. So we deform C to pass through the
i. Then
..
I
f"(zd: - -l : -t :
- tt
I
7J
So
c: I
4, :
and cv
r /4 -f r 12
:
:
-n
le-i1t/2
|,
12. Thus, the new path C/ must pass through zo at an angle
-
3n /4 (equation D.5). Before zo, the difference z
zo has a positive
so r is negative, as
real part on this path (see Figure D.2) and ,3it/4
+
- -+A
required.
Then
r(zo):f(i):;(t-i) :,
4See Appendix
IX for the evaluation of the integral.
\iJ1,
D.2 THE METHOD OF STATIONARY PHASE
s35
Im (z)
Re (z)
FIGURE D.2. PathCtfor evaluating the first Hankel function n{t) @).The path passes through the
point z
: i at45o to the imaginary
axis.
Thus, applying the result (D.6), we have
n{')G) =
,Eu*'r(e
1f
+
3n
4
z\
-o+2)r)
z\
|
-u
2 - -4/
which is the expression given in Chapter 8, Section 8.4.2 (equation 8.85).
D.2. THE METHOD OF STATIONARY PHASE
A similar method applies to integrals of the form
I (x)
s(x) exp [iQ@)]dx
(D.7)
when the function d(x) (the phase) is real-valued and g(x) is a slowly varying function.
Integrals of this type arise through the use of Fourier transforms, for example, and are of
importance in determining the signal that arrives after propagation through a dispersive
medium (see, for example, Jackson, Chapter 7). The major contribution to the integral
comes from the neighborhood of the point (or points) where @ (x) is stationary-hence the
name of the method. Away from the stationary points, the integrand oscillates wildly and
there is very little contribution to the integral.5
Again we expand the phase in a Taylor series in the vicinity of the stationary point .rr:
Q@):d(x,) +
5lt.
)a - x,)2 Q"(x,)
(D.8)
-o." precise statement may be made in the language of generalized functions. See Chapter 6, especially
Section 6.4.
536
oproNAL Toptc D APPRoxIMATE
EVALUATIoN
oF INTEGRALS
Then, since g(x) varies much more slowly than eif
I(x)
,
- B(xJeiQGr l)1,' "-, (Uo - *,;2q"6"1) ax
Here also we can extend the range of integration to (-oo, *oo) with negligible change in
the value of the integral.
To evaluate the integral, we make a change of variables to
tu:(x-r,)tltO"@,)
The limits of the integral also change correspondingly. Let's assume for the moment that
Q" @r) is positive6 and equals 2A2.Then
Q"
u-(x-xt)
(xt)
-ir/4_A@_x")e-io/4
2
and the integral becomes
I (x)
-
g1x";si\GJeila
f""-" au
Im (z)
Re (u)
FIGURE D.3. The.contour for the u-integral runs from -oo to +oo but at a 45o angle to the real
The path of integration C for u is at 45o to the real axis (see Figure D.3). However,
since the integrand has no poles and the contribution to the integral ftom u : *oo is
zero, the value of the Gaussian integral (equal to Ji when evaluated along the real axis)
6lf
6't 1x"1 is negative, the path is rotated to cross the real axis at an angle of +T
is the same.
f4 ftther
than
-t
/4. The result
PROBLEMS
537
is not changed by this path shift. Thus,
(x)
-
I (x)
-
r
g1x;siQl)eila
ll* ,-"'a,
(D.e)
g(xJsiQ@J.i'/4
If there is more than one stationary point, then the integral is the sum of the contributions
from each ofthe stationary points.
PROBLEMS
f,
Use the method of steepest descent to evaluate the asymptotic form of the gamma function
f (t):
[* tE-tr-'4t = tE6E-rtzr-e
Jo
where the path of integration is along the real axis. This is Stirling's formula.
2.
The modified Bessel function
Kr(f ) has an integral representation
K,(E):
I roo _t ( "_.1\l d'
"*p L-z \.'*;/l ;|
1Jo
with path of integration along the real axis. Use the method of steepest descent to find
the asymptotic form of Kr(f ) as f -+ m.
3. The Bessel function may be represented by an integral
J,(E)
lf
: 2"
Jrexp
(-if
sin z
*
ivz) dz
where the contour C is as shown in the figure. Use the method of steepest descent or
stationary phase, as appropriate, to derive the asymptotic form (8.83).
PROBLEM
3
538
@
OPTIONALTOPIC D APPROXIMATE EVALUATION OF INTEGRALS
Expand the function g(x) in equation (D.7) in a Taylor series about the stationary point.
Show that there is no contribution to the integral from the second (linear) term in the series
if the expansion of the phase @ is truncated at the quadratic term, as in equation (D.8).
5. The function n
j2) 1xl has the
integral expression
nJ,)c):
(,-:)]f"
* l,*'[l
where the path of integration goes from -oo to zero along a path in the lower half-plane
that is the mirror image of the path in Figure D. 1. Verify the asymptotic form (8.85) for
this function.
6. An alternative integral expression for the Bessel functions is
Fu@)
:
O
l"exp
(ivu
- ix sinu) du
where
(a) for I1(1) use contour C1 and k : I lt
(b) for I1(2) use contour C2 andk : lln
(c) for ,/ use contour C3 and k : I l2n
and the contours are as shown in the figure. Evaluate the integrals for large values of
and verify the asymptotic forms in Chapter 8.
PROBLEM
6
7. The Airy integral
Ai(x)
:
+"
I:""n; (r' *+)0,
-x
PROBLEMS
539
arises in the study of diffraction. The path of integration lies slightly abovethercalaxis.T
Use the method of stationary phase to show that
Ai(x)
*, (-'r. '')
- #*
for large, positive x
E tr"
amplitude of a signal arriving from a distant source after propagation through a dispersive medium may be written as a Fourier integral of the form
s(x, r)
:
I*
orr',explik(a)x
-
iorldo.t
where k(ar) is the dispersion relation for the medium (see, for example, Jackson, Chapter 7). Use the method of stationary phase to show that, at time r, the largest amplitude
signal is contributed by frequencies with group speed dro ldk : D I t , where D is the distance from source to receiver. Find an approximate expression for the amplitude at time
t. Obtain
an explicit form for the solution
TSee Jeffreys and Jeffreys, Section 17.07.
if ck(a)
:
F4,
and a, is a constant.
OPTIONAL TOPIC E
Calculus of Variations
E.l.
INTEGRAL PRINCIPLES IN PHYSICS
Physical systems are often characterized by being an extremum (maximum, minimum, or
point of inflection) of some physical property. I For example, a system in equilibrium is at an
extremum of the potential energy. For stable equilibrium, the potential energy is a minimum.
According to Fermat's principle, light rays follow the path of minimum time. The physical
quantity being minimized in these examples (energy or time) is often expressed as an integral
over the system. For example, the time for light to travel a distance dt is dt : dl / u, where
u is the light speed, and thus the total time for light to travel from point A to point B is
fB d(.
t:
I
Je
(E.1)
u
where the integral is taken along the path from A to B. The path from A to B is described
by one or more parameters, and the path actually followed by the light between A and B
will cause the value of the integral (E.l) to be a minimum.
If we want to find the true path between points A and B, we must adjust the parameters
so as to make the integral (E.1) an extremum. To see how to do this, first recall that if a
function /(x) has an extremum at the point rs, then df /dxlro: 0, and the change in /
due to a small change dx
in.r is
6f= df
dx
- 0
0x
*
terms of order (dr)2
xo
to first order in 6x
if the integral (E.l) is an extremum, its value will not change when we make
small changes in the parameters. That is, the integral along paths that lie very close to the
true path will be the same as the integral along the true path, to first order in changes in
the parameters that describe the path. The difference 61 between the value of the integral 1
along the ffue path and that along a neighboring path is called the variation in the integral 1.
Similarly,
I
Note the language carefully; "a" maximum is not the same
as
"the" maximum. We
are interested in local extrema.
541
542
OPTIONAL TOPIC
E CALCULUS OF VARIATIONS
E.L. As a simple first example, consider the path taken by light moving
from point A in a medium with refractive index n1 to point B in a medium with
refractive index n2 (see Figure 8.1). For now, let's take it for granted that the light
travels in a straight line within each medium. Then the path meets the boundary
surface at point P, and we can parameterize the path by the distance s of the point
P along the boundary surface, as shown. Since the speed of light in each medium is
u : c /n, the total time taken for light to travel from A to B is
Example
,
:
lo'
!!dt
n1
+
n2
I" -dt
s2+tfi+
c
.^
c
n2
c
(w
-
s)2
where in this case the integrals are trivial. Now we compute the change 3r in r due to
a small change 6s in s:
6/
:
At
-3s
0s
,,I
t
A
FIGURE E.1. The path of a light ray from point A in medium
I
to point B in medium 2 passes
through point P on the boundary between the two materials. We use Fermat's principle
to determine the location of point P and also to show that line A P is straight.
This change must be zero for any 6s when the value of s corresponds to the true
path. Thus, the value of s we need is given by
nl
S
n2
U-.t
-0
543
E.1 INTEGRAL PRINCIPLES IN PHYSICS
or, in a more familiar form,
n1
sinfi:
nzsin)z
which is Snells'law.
Now let's ask whether the light does travel in a straight line from A to P. Let the
path be described by the function y(r), where y(0) : 0 at point A and y(s) : ftt is
point P. A differential piece of the path has length
d(.:
\/dx2
+
dy2
: 1[* (!oj
o-
(8.2)
and the total time taken for light to travel from A to P is
,:T l,
(#)' dx
Here the integrand is a function of the derivative y/ of y. Our goal is to find the
function y(x) that minimizes the time r, subject to the constraints that the endpoints
y(0) and y(s) are fixed. The quantity / is a functional; that is, it is a function of the
function y(x).
Now as the function y(x) changes, so does its derivative y'(x).If the derivative
changes by a small amount 6y/(.r) (the variation in y'), then the integral changes by
an amount 6r, where
tt6t: T
l,'
T
l,' t/l +
13yt)2 dx
(y')\
zyny'
a,
to first order in 6y/
:T l,'JTrory('.ffi)'.
-+-!-
nl tfs
v'6v'
gT6,y", lo ---J==--:-)-
When y(x) describes the true path, 3t
by parts:
'' :!)
-;
6t
:
0. To obtain 6t in terms of 3y, we integrate
u, "'"
o*
- !-291!:l
-Ll' * 1 Jot' l-L
Jt+ryflo',
t\frTory 2 0 +o))3/2)"r
With the endpoints fixed, the variation 6y must equal zero at x :0 and x : s. Thus,
the integrated term is zero. The remaining integral must equal zero no matter what
the variation 6y(x) if y(x) describes the true path. Thus, the term in square brackets
544
oproNAL Toprc E oALcULUS oF vARrAroNs
must be zero. This term simplifies to
t),1
11+(Y')zlztz-"
and thus the second derivative J" : 0. This means that the slope of the line, y/(x),
must be a constant, and thus the path is a straight line.
E.2. THE EULER EOUATION
Let's review the general procedure that we used in Example E.l. We have ai integral
r (y)
: I"u , o, y, y')
(E.3)
d.x
evaluated along some path between the points a and b. That path is described by the function
y(.r). The endpoints a andb are fixed, and thus the function y(x) has fixed values l(a)
Yr
and y(b)
Y2 at the endpoints. Our goal is to find the function y(x) such that the integral
:
:
1(y) is an extremum.
We begin by expressing the variation 61 in the integral
function y and its derivative y/:
u,:
tb / af
J"
I
in terms of the variation in the
{un,)
[rrrr. ,r, ) ,*
_
To express the second term in terms of the variation 6y, we integrate by parts:
I' (#r') a* : ffiul,'"- I' * (#)',
The integrated term2 is zero because
6r:-
6/
:
0 at
x
:
e and .r
:
r.
b. Thus,
1"'l*ffi -#)ro.
I
We want the variation 61 to be zero for arbitrary3 small variations 8y(x) from the true
function y(x), and so the term in square brackets must be zero:
d (y\_Ef :o
dx \|yt)
2In th"
andx
: "ase
b.
(E.4)
Ay
that the value of y is not fixed at the endpoints, we obtain a second conditi on: 3f / Tyt
3Some restrictions apply, most importartly that 3y be differentiable.
:
0 at
x
:
a
i
1
I
E.2 THE EULER
EQUATION 545
The arbitrariness of 6y is essential to this argument. We can perhaps find a specially contrived
Ey to make the integral zero, but if 6y must be arbitrary, then we are forced to make the
square bracket zero. Then equation (E.4) is the desired equation for the function y(x). This
equation is known as the Euler-Lagrange equation.
An alternative form of equation (E.4) may be derived by noting that the total derivative
d.f a.f af dy 0f dzy
d*: a** u**87?
Solving tor 0f l0y and substituting into equation (8.4), we obtain
L
(
y\ -af : L ( y\ - ! (4L -af -914\
y'
oy - dx
ox
ax \ay'
)
\ay'
)
\dx
ayt dx2
:o
)-"
and so
df _af _ Ln,, _ r,L (91\
'
dx 0x 0y''
dx \Dy'/
:o
We may simplify by noting that
d (.,,a/\ _:' .r,Ef .__.,,d
*'E (af \
a"
V
8
ar,)
\av,)
So finally we have
*(, -r#) -#:o
This version is particularly useful when
/
does not depend explicity on x
since then we may integrate once to obtain
, - ,'#:
constant
."s)
(0fl3x =
O),
(E.6)
E.2.1. Application to Mechanics
The Lagrangian of a mechanical system is
L:T -V
where
Z is the kinetic
V the potential energy of the system. These quantities are
qi that describe the position and velocity
parts of the system. For a system of particles, the coordinates qi arejust the
and
expressed in terms ofthe generalized coordinates
of all of the
546
OPTIONAL TOPIC
E CALCULUS OF VARIATIONS
ordinary coordinates of the particles in the system. Hamilton's principlea says that the
motion of the system will be such as to minimize the integral
Itz
I:
I L(qi,qi,t)dt
Jt,
Thus, we can determine the motion of the system [that is, determine the
calculus of variations.
qi(t)l
using the
E.2. A particle of mass lzl slides down a wedge whose sloping side of
L makes an angle 0 with the horizontal (see Figure E.2). The wedge of mass
Example
Tength
M isfree to slide horizontally. How long
does the particle take to reach the ground?
FIGURE E.2. A particle of mass la slides down a wedge of mass M that is itself free to slide
horizontally.
It is convenient to choose our generalized coordinates to be the distance s that the
particle has slid along the wedge surface and the distance r that the wedge has slld
over the ground. With these coordinates, the kinetic energy of the wedge rs lU*2.
The velocity of the particle with respect to the ground is
- ldx ds \^
n:
(, - Vcoso )*-
ds
Esnav
and thus the total kinetic energy is
,:l* (#)' .;^l(# -ff*",)' *
:
I
r(M
+*)
/ dx\2
dx
ds
\i ) -*iicoss*
I /
ds\2
r^ \A )
With reference level at the top of the wedge, the potential energy is
V
:
4See, for example, Goldstein, Chapter 7, Section 5.
-mgs sin9
E.2 THE EULER
EOUATION 547
Thus, the Lagrangian is
1
/ dx\2 dx ds
| / ds\2 jmgssinl
L:r -v - r(M
+^)
-*iicoso*
r*\i )
\i )
The Euler-Lagrange equations for this system are
!dr (9!\ -oL
:o
0x
\0i /
/dx\ d'-"ore'l
lla,L(M**)\A)-mA
-o:o
I
(8.7)
and
!dr (9!\ -oL :o
\3.i
o (-**cosg*-+)
""dt)
dr\ "'
)
0s
-mgsino:o
(E.8)
Equation (E.7) may be integrated immediately to get
(M + m)
(#) - mfrcose : c1
@.e)
which may be recognized as conservation of momentum in the .r-direction. Thus, if
the system starts from rest, the integration constant Cr : 0. Integrating again, we
find
(M
and again, if
x
:
s
:
0 at
t:
*
m)x
-
0, then Cz
ms cos9
:
:
C2
O.Equation (E.8) may also be integrated
to get
/ dx
ds\
(-tcoso* i)- grsind:o
where again we used the information that the system starts from rest. We may now
dx/dt using equation (E.9) and integrate again with the initial condition
eliminate
s(0)
:0:
!!(tdt\
m
M*m "or'e):ptsino
/ "
/ M+m \gr2
|
" - \,vl +msin20)
2 ""'"
-Slnd
-
548
OPTIONAL TOPIC
E CALCULUS OF VARIATIONS
The particle reaches the bottom of the wedge (s
Tn
t:V'*"\
: I)
in
a
time
/ M +,,,sin2o\
M+^
)
+
oo (fixed wedge) to obtai n
letting M
the correct result for a particle accelerating at g sin 9.
We may check our result by
t
-
JTLI
9FF'|0
,
E.3. VARIATION SUBJECT TO CONSTRAINTS
Sometimes the variations cannot be completely arbitrary because of additional constraints
If we want to find an extremum of an integral 1 (equation E.3) with
integral J : I: g(x, y , y') dx temainconstant, we proceed by
a
second
that
the constraint
(but
yet unknown) multiplier.tr" to form the combination
as
a
constant
introducing
on the physical system.
K:I*)'J
Then
if
1 is stationary and "I is constant,
K
must also be stationary. Proceeding
as
Section E.2, we obtain the Euler-Lagrange equation (E.4 or E.6) but with the function
replaced by the functio11 11 : f | )'9.
E.3. A uniform cable of length L hangs between two points at the same
height and a distance D apart. Find the shape of the curve described by the cable. The
curve is called a catenary.
In this problem, the relevantphysical quantity is the gravitational potential energy of
the cable, which will be a minimum when the cable is in its equilibrium configuration.
If we choose the reference level at the height of the two endpoints of the cable and let
y(x) describe the distance of the cable below this level at coordinate x, 0 < x < D,
the potential energy is
Example
u(y):
rD
|
-t@)s a^ = _u.c JoI y\/t + (y,)2
dx
(8.10)
where g, is the mass per unit length of the cable, dm : It dt, and we express the
length element dt of the cable as in Example E.1 (equation E.2). At the endpoints,
y(0)
: y(D) :0.
However, the length of the cable is a constant, so we may not vary
completely arbitrary manner. We must have
L: I
dL:
loD
G/t) dx:
Ioo
,/T+
1y,y
a,
y(x) in
a
(E.1r)
[Here we choose to call the integrand G(yt) to avoid confusion with the acceleration
due to gravity, g.l None of the integrands contain x explicitly, so we may use equation (E.6), including the constraint. We may absorb the constants p and g into the
in
f
E.3 VARIATION SUBJECT TO CONSTRAINTS
constant multiplier l,:
f
+ ),G - y'a$
!!G) :
0y'
+ ))Jt + 0'Y :
W - r' !t,
dy'
o + xlJ*
o+DJt+ry'Y(r-#h)
constant
c
:.
y+^:
stfrT(Y'Y
Solving for the derivative, we find
dv
ll
]:trl
dx
-.(y*x)'-l
As the cable hangs, the slope dy/dx is positive on the first half of the cable and
negative on the second. Now we may integrate to relate the coordinates (X, Y) of
apointonthecable. For0 < x < Df2,
tX
:Jodx
1,,
Let y
* ), :
C cosh0, dy
:
40
+r)2
-t
C sinhd dd. Then
lcosn-rtr+r)/c C sirn0 dg
J"ort-'(t
/c)
c l"osr'-r fI+)
C
L
\
: x
'inha
:t
CJ
/ -cosh-r1l
Rearranging, we can solve equation (8.12) for
!:
C cosh
I
for0 <
x < Dlz
on the first half of the cables:
/x
(a *cr"n-tl\-l'
C/
The slope of the cable is then
4 :
dx
sinh
(1
\C
+
(E.12)
"ortr-t
1)
Cl
sHaving completed the integration, we no longer need to distinguish between Y and y.
(E.13)
550
oproNAL roprc E cALcuLUs oF vARtATtoNS
At the midpoint, x
:
D 12, the slope is zero:
o:
sinh
1\
/D
l#.""tn-' ;)
Thus,
It is convenient
D
,I
cosh-'2CC
to choose C to be negative, C : -Y, and then
),
:
-y
"o"n
fi
Our solution (E.13) becomes
t
D
-\l
Y:zfcoshr-cos,n(Dl2\ y /l
with slope
dy .. /Dl2-x\
d":stnn\
, )
Next we apply the length constraint (E.
the first half of the cable.
11
) to find y . Again it is easiest to work with
LfDl2fD/2
::zJoJo
| ,/t+(t,)zdx: I
1*sinh2
( Dl2 - *) ,, :
: [''' "orn
---y
lo
\ v )
sinh
(r7)r.
( D/2
-
\ v
x\lD/'z
/lo
D
-ysinh'2y
which is a transcendental equation for y that can be solved numerically. If the cable
length t is twice the distance D between the supports, then
DD
_
: sinh
with solution
Dly :4.355,
shown in Figure E.3.
and hence
2y
y :0.2296D.
The shape of the cable is
551
E.4 EXTENSION TO FUNCTIONS OF MORE THAN ONE VARIABLE
0 0.2 0.4 0.6 0.8 1.0
0
x
D
0.2
0.4
0.6
0.8
v
D
FIGURE E.3. The cable's shape is such as to minimize its potential energy. Here we
length Z : 2D. Remember that y increases downward.
see a cable
with
E.4. EXTENSION TO FUNCTIONS OF MORE THAN ONE
VARIABLE
The theory of calculus of variations may easily be extended to integrals over more than one
variable. The integral is of the form
r: JI f $i;u)d,*
x;, i -- I - n, arc the independent variables and a is a function of the variables -r;.
Upon taking the variation, we have
where
u,:
| (#'".D#rn,(#))"-
Once again, we integrate by parts to convert the variation in each derivative to a variation
in a. We obtain the Euler equation:
Z*ffi^)-{:o
(E.14)
In this expression, the derivative E l0x; is apartial derivative in the sense that we are holding
all the other coordinates fixed, but it is a total derivative in the sense that we must include
the implicit as well as the explicit dependence upon r. For example, if n :3,
aFll-l
-tax
aFl
tT
f
I"on.ty.z dx l"onr, u.y.z.ux.uy,uz
I-
aF 0 (0ul0y)
' A Gu/Ay) 0x
552
OPTIONALTOPIC E CALCULUS OF VARIATIONS
If we already knew the function u(x, y, z), we could obtain the same result by explicitly
writing out the coordinate-dependence of the function u and all its derivatives to obtain
F (x , y , z) md then evaluating the usual partial derivative 0 F I 0 x . But in cases where we
want to use equation (E.14), the function u is the solution that we seek, and so the explicit
dependence onr is notknown apriori.
Example E.4. _ The electrostatic field in a volume V may be described as the gradient
of a potential, i : -id, where the potential Q@, y, z) takes on specified values on
the bounding surface S. Show that the function @ that minimizes the electric energy
in the volume V satisfies Laplace's equation: Y'Q :0.
The electric energy is given by the integral
u:'; lun'av:') lrr'ofav
:7 l,l(*)'. (#)' . (y)'),,
The integrand
tion (8.14) is
f :
(0Q/0x)2
+
a (0Ql0z)2, and so the Euler equa-
@Q1AD2
*(#).*(#) .*(#) :Y2Q:O
PROBLEMS
fl
u : us(l * y). what is the path of a
ray? This model describes the propagation of seismic waves through the Earth's outer
rne
speed of waves in a medium varies with
)
8S
layers.Ifthewavesstartat):0,.tr:0,findthevalueofratwhichthewavesreturn
to the surface as a function of the initial slope of the ray dyldx : m : tan0' Also
determine the total time of travel for each ray.
2. Find the path of a light ray through a medium whose refractive index increases linearly
with depth: n : I *x/a. Assume that the path starts from the origin, I : y : 0, and
dyldx:latx:0.
3. Rework Problem 2, reversing the roles of the labels x and y and describing the time as
an integral over the new r. Is the result the same? Which method is easier?
4. Repeat Problem 2 with refractive index function n(x) - nsex '
A(0, 0) and
S fne brachistochrone. A smooth wire runs between two fixed pointsfriction
on the
y).
without
particle
sliding
a
Find the shape of the wire such that
B(X,
point
A
from
particle
starts
the
that
wire reaches point B in minimum time. Assume
that
of
0.
Show
y
functions
as
x
and
obtain
with speed u6. Hint: Set y/ - tand and
the coordinates of the two fixed points are sufficient to determine the initial and final
values of 0 and the two integration constants. Determine an explicit solution in the case
u0 : 0, X : Y : -1. Plot the shape of the wire in this case.
PRoBLEMS 553
6. Show that the Sturm-Liouville equation (8.1) anses from the problem of finding the
exffemum of the integral
t
:
l"ou{r,),
+
eyrldr: I,u Fdx
subject to the constraint
,:
l"u
,y2 d,
:
l"u
odx
: constant
7. Show that the catenary (Example E.3) is symmetric about the midpoint-that
is,
y(D-x):y(x).
8.
f,
Show that the curve that encloses the greatest area with a fixed perimeter is a circle.
Investigate the problem of finding an extremum of the integral
r:l{H{*dx
subject to the constraint
J:lty'ty'*d.x:t
where 11 is the Hamiltonian operator
rr2 d2
ttu --2^o*'
_ -lV(x)
and V(x) is a known function. Show that the resulting differential equation is the
Schrcidinger equation.
Hint: First integrate by parts to eliminate the second derivative.
10. Using polar coordinates, write the Lagrangian for a particle moving in the potential
V
: - G M m f r, and form the Euler-Lagrange equations.
Show that the equation in the
angular coordinate indicates conservation of angular momentum.
Ll.
A spherical pendulum is a mass free to move on the end of a string of fixed length l. Write
the Lagrangian in terms of the spherical angles 0 and Q, and hence find the equations of
motion. Show that one possible motion is the conical pendulum with constant 9. What
is the value of dQ I dt in this case?
12. Consider the one-dimensional motion of
a particle with potential energy V(.r) that is
independent of time. Show that the Euler-Lagrange equations may be written in the form
of equation (E.6). Give a physical interpretation of this equation.
[if]fne
Lagrangian for a vibrating string may be written
,: lo'l:-(#)' -:, (#)'1,,
554
OPTIONAL TOPIC
E CALCULUS OF VARIATIONS
where y(x, t) is the displacement of the string, p is the mass per unit length, and Z is
the tension. The first term in the integrand is the kinetic energy density, and the second
is the potential energy density. Determine the Euler-Lagrange equations for the system,
and comment.
14. As an alternative approach to Problem 13, we may expand the displacement y(x, /) as a
Fourier series in x (refer to Chapter 4, Section 4.2):
y(x, t)
:ia^(t) sinff
n:o
Write the Lagrangian as a function of the generalized coordinates ar(t) and the time /.
What are the Euler-Lagrange equations now?
1.5. A volume V is formed by rotating a curve y(x) defined for -a < .r < a around the
x-axis. Given that the curve is symmetric about the y-axis, y(a) : )(-a) : 0, and
y'(0) : 0, show that the curve that gives the maximum volume for a given surface area
is a circle and the corresponding volume is a sphere.
16. The Lagrangian for a panicle moving under the influence of electromagnetic fields is
g: !^r2 - qQ + qi-i
2
Eil
where 4! and A are given functions of position and time. Find the equations of motion, and
hence show that the force acting on the particle is the Lorentz force F : q(E * i x B).
Strow that the shortest distance between two points on the surface of a sphere is a great
circle. Hint: You may place the polar axis through one of the points.
APPENDICES
I. TRANSFORMATION PROPERTIES OF THE VECTOR
CROSS PRODUCT
In Chapter 1, we noted that the cross product oftwo vectors does not transform as a vector
(Section I.1.2).Here we shall investigate the transformation properties of the cross product.l In a second, primed coordinate system, the definition of the cross product in terms
of components (Chapter 1, equation 1.30) is unchanged. We can then express the vector
components in the primed system in terms of the original (unprimed) components, using
the transformation law for vectors:
(il' x i'), : eij*u'juL:
eijkA
jutAkmum
(1)
We want to compare this expression with the usual transformation law (1.23) for vectors:
A;o(it x i) p
:
AipE pjku
juk
These expressions are not the same.
As a first step toward finding the correct transformation law, we write the determinant2
of the transformation matrix using the Levi-Civita symbol:
An An
det.A:
:
:
An
Azr Azz Azt
Att Azz Azz
An(AzzAtz
-
A1i e iu AzkA3r
AzzAz)
-
Ap(A21A3z
-
AzzAy)
i An(AuAzz -
AzzAz)
: - A2;e;i pAli A3
Similarresultsareobtainedforallpermutationsoftheindices
and r :3, the first expression for det A gives directly
epnr detA,
:
eluApiAq*Art
l,2,and3.Forp
- l,e:2,
(2)
l See also Optional Topic A, Section A.3.
2See also Chapter l, Section 1.6.2.
555
556
APPENDICES
This relation is confirmed fot p - 2, Q : 1, and r : 3 using the second expression for
det A and the index name changes i -+ i, i --> k, k --+ l. Similarly, we may verify the
relation for each set of values of p, q, and r .
Now we can make use of this relation to see how the cross product transforms. We start
with the definition ( I .30), apply the transformation matrix, and then write the result in terms
of the components in the primed system:
A;p(it
xi)o :
A;pepikuiuk
: Aipepi*Ailtuie;)'h
Since the inverse of the transformation matrix A equals its transpose, we have
Ai
r(it x i),
:
e
oi* Aip Ari
A*pulufi
Now we use equation (2):
A1r(fr x
i)r:
ei1^ut1ul
detA,: (fr' x i'),
detA'
(3)
The usual transformation law for vectors does not lead to the correct expression (1) for the
cross product, so we multiply both sides of equation (3) by det A and use the result that
det.A,
: tl
to obtain
(d' x
i')i :
(detA)A;p(d x
i)o
(4)
which is the transformation law we seek. Because of the determinant in equation (4),
(il'xi'):A(frxi)
for rotations, but
(i'xi';:-A1fixi)
for reflections.3
II. PROOF OFTHE HELMHOUTZTHEOREM
The Helmholtz theorem states that any vector field F may be decomposed into the sum
V x A.
two vectors:
+ i, where fr = VO and
i:
i:i
of
The proof takes the form of a demonstration of how to construct the two functions
A(r, y, z). First we construct the vector function:
@(x, y, z) and
fi(x, y, d:
[ ..space
Jdt
3For additional information, see Optional Topic A.
,u'o:,]';z')
dx, dy,dz,
(5)
II PROOFOFTHEHELMHOLTZTHEOREM 557
where
R:li-i,l
and the integral is over all space. This function satisfies the equation
v2fr': -i
(6)
Let's demonsffate this. Since the operator I differentiates with respect to r, y, e but not
x', y' , z', we may move it inside the integral, where it operates only on the function 1/R:
v2fr
: oz
:
I
fi'ul]t
t>
1
dx,
*r*', y', z')l-6(i.
dy, dz,
:
- *')ldv'
I
:
,r*,, t,,
z)v2J-
dx,
dy,
dz,
-fr(x, y, z)
where we used equation (6.26) andthe sifting property. Thus, we have verified equation (6).
But from equation (1.51),
v2fr:ifi.*l-i"(ixfr)
and so
F:ix(ix*l-Vfi.*l
Then we define the scalar function O by
o
and the vector function
i
: -i .fr'
(7)
Uy
i:ixfr
(8)
so that
i:Vr,.i+io
as required
(9)
by the theorem.
Next we can find explicit expressions for
<D
and
.i. L"t't
start with @ (equations 7
and 5):
cb(x,
y,.)
: -i
.*
: -i *o;l;,,
I
dx,
: - [ U .l(x'-,Y'az') dx, dy, dz,
4nR
J
: - I i(r'. v'. z') .i=l dx' dy'dz.'
4nR
J
fl
dy,
dz,
558
APPENDToES
Then we use the result
(z-z')o
(x-x')^ (y-y')^
: V-------=-2:-\:
-Rr
Rr "
R3
:l17
'R
where
i'
--I-
3,1
R
is the grad operator with respect to the primed coordinates. Thus,
:
a
fF(x"
J
_l',, z',)
.\-'' I
4tr R
dx'dv'dz.'
Now we move the grad operator back through the vector F again, using the result
/V
d+n. ir2 withi :
Fand qr
:
i
'
1,2f1
:
I/R:
o: llr, (t'a.l.tt) #l dx,dy,dz,
(10)
We can rewrite the first term using the divergence theorem (equation 1.44):
q: I i.x"Y"z')'fidA'[n,'!or'
4rR
J 4rR
./sThe surface integral is zero provided that
i +
0 on the surface at infinity at least as fast as
.p-(1+e).11ur,
,:- lY
4tr R
(11)
V,
We can obtain an expression for A similarly:
:i
A :-Yi x *vr-'^l
dx' dv' dz'
* [ i(r''+"nY''z') u^srs<
First we move the curl inside the integral, where it operates only on the function 1/R:
v|' t') dx'dy'az, :
L: Jfv " Ft*"4rR
[(v]=)
J\4rR/
"
F1*',
y'.2')dx'dy'dz'
Next we convert to a derivative with respect to the prime variables and use the result of
Problem 1.23:
A:- Ji (v'-!)
\ 4rR/
:-
"F1'',
y'.2')dx'dy'dz'
llu'" (o*t) - #u'"'l "' dv'dz'
lll
PROOF BY INDUCTION:THE CAUCHY
FOBMULA 559
Now we use a variant ofthe divergence theorem ( Problem 1.30b) to convert the first integral
to a surface integral:
A:- Js[ ,i'' f-L\ ae'+ I[i'"i4nR
\+nn)
dx'dv'dz'
i
i: .lI i','
o*' dy'dz'
4rR
(r2)
again provided that F -+ 0 on the surface at infinity at least as fast as iq-(l+e;.
Equations (11) and (12) allow us to calculate the vectors [ : iO and i : V ,. A.
lll.
PROOF BY INDUCTION:THE CAUCHY FORMULA
To prove the Cauchy formula (Chapter 2,equation2.39),
f,j+r,:#rr@-'){ol,_"
(13)
we use the idea of proof by induction. First we assume that the result is true for some value
of n, say n : m. Then we shall show that the result must also be true for n : m * 1. Since
we have already shown that the result holds for n : I (Chapter 2, equation2.38), it must
be true for all values of n > l.
We start with n : ml1 and express the derivative on the right-hand side of equation (13)
in terms of the (m - l)st derivative:
f '^' k)|,--o: lgb
i (r'*- ",rrl,:"*^ - f '^-D <al,-")
Now we use the assumed result for n
:
m to express the (m
-
l)st derivatives:
1,)t ( {
ri^tu{ f@ orl
'-'lr:o : h-o
- Jck-a1^
---Jo - ar.-)
"f,,,etl
2nih \/clz-(a*h\)n
t
:1-('-l)l
''\k-a-h)'
h+0 2nih f r,rr(
I \
- tr-oy)d'
Jc"
k-a)--(z-a-h)*
_1. fu-l)l f
,:\
2"ih
frf{')ffid'
560
APPENDIcES
We expand the second term
in the numerator of the fraction in the integrand,
using
the binomial series:
(z- a)* - (z - a'- h)^
(z-a-h)*(z-a)*
(z-a)^-lQ-a1^
(z-a-h)m(z-a)m
-
mh(z
a)m-r
-
m(m
-
2
l)
t'r,
-
q'
oY-27r2
a
"'
(z-a-h)^(z-a)^
Then, since
z-
a
+0
anywhere on C, wemay
divideoutafactorof (z -
a)--r toget
(m-l\
I-""; "(z-a)-th+"'
L
---l-
Then the
(z-a-h)^(z-a)
nth derivative becomes
.f'-'e)l z:a
I
which shows that the result (13) is also true for n
:
m
I l. Hence the general result (13)
is proven.
IV. THE MEAN VALUETHEOREM FOR INTEGRALS
If the functions
/(x)
and
g(x)
are continuous on the closed
interval la, bland g(x)
>
O
on the open interval (a, b),then
pb
1b
lf@)s?)dx=f(c)ls@)dx
- Jo
J"
for some c in the interval (a, b).
(14)
INTEGRALS 561
IV THE MEANVALUETHEOREM FOR
The proof uses the mean value theorem for derivativesa applied to the function
F(x)
:
fx
J" f ltls{tl
o,
Afx
- i J" sQ) dt
(ls)
where
o:
I"'f(x)g(x)dx
is the left-hand side ofequation (14) and
u
:
1b
J, sk)dx
is the integral on the right-hand side of equation (14). Notice that
continuous, then
F is differentiable. Then there exists
F'(c): -'F(b) -_
;
But F(D) :
F(a):
0, so F'(c)
equation (15):
-
if
some c between
and g are
and b such that
both
a
/
F(a)
"
0. Next we evaluate the derivative directly from
F'(x)- f(x)s@)-1s';,)
6
Setting
x
:
c, we obtain
A
f (c)s@) - ,sG)
Then, since g(x)
> 0 for all x in (a, b),
:o
we may divide out the factor g(c) to obtain
f(c)B:A
or
f (d
l"u
g(x)dx
which is the result we set out to prove.
In the special case g(x) : 1, then B
:
b
:
-
l"u
f {*)r{*)o*
a, andwe obtain the simpler result
pb
I ff*)dx :(b -a)f(c),
Ja
4See, for example, Stewart, p. 289
wherea < c
<b
(16)
562
APPENDICES
The area under the curve
width b
-
/(x)
a andheight /(c);
between
see
Figure
x : a andx :
l.
b equals the area of a rectangle of
f(x)
0
FIGURE 1. In the simpler version (16) of the mean value theorem, the area of the rectangle equals
the area under the curve /(-r). For this function with a : 0 andb : 4, there are two
possible values ofc, as shown.
V. THE GIBBS PHENOMENON
The Fourier series of a function that has a discontinuity always converges to the midpoint
of the discontinuity, and it overshoots as it makes the jump. To investigate these phenomena
further, we will concentrate our attention on the step function.
We will denote the sum of the first N terms of a Fourier series by Sls:
N
S1v
: !(a,
sinnx
*
bncosnx)
n--0
where (equation 4.7)
t
r2n
an:! JoI
7T
.f<*>sinnxdx
and bn is defined similarly (equation 4.8). Then we can write
an sirtnx
r b, cosnx : !tr ['" f (u)(sinnusin nx *
Jo
r
t2n
: ! I
7T Jo
for n
>
1, and (equation
/(r,r) cos n(u
- x) du
4.9)
bo:
I
f20
z" J,
f
(u) du
cos t?u cos nx) du
V THE GIBBS PHENOMENON
We'll also need the combination
an cosnx
-
In this case, the term with n
:
T
b,
sin
: -I If2n f (u) sinn(u nJo
nx
x) du
0 is identically zero. The series
(x)
: D
@"cos r?r
n:0
-
bn sinnx)
is called the allied series. The sum of its first N terms is denoted by ft,,. Then
t r2r
rt
Srs*iT,v-I "f@)1
" trJo
'tZ * $
u_t,i'tu-'t\ 7u
)
where the first term in the braces (l/2) is the n
Chapter 2, Section 2.3.2, equation 2.42 with ,
(
I f2,
t
Sr,r*iZr:_l
ftu)\-;*
"
rJo
t 2
:
-
0 term. We can evaluate the sum (refer to
"i(u-x))
6
get
1-ri(N+l)(a-.r))
I-et\u-x) )ldu
I f2o
,-itu-xt/2 _ ,i(N*l/2)(u-x)
:iJo
/(I/)t@
(
;\,"
The quantity in braces may be rewritten as follows:
cos
(f) - i sin (5:)
_
-cos f(n + *) (u - x)
(x+l
-t
1
(?) - cos [(ru + j) (z - xt] * sin [(ru + j) (rz - xt]
zsln (f )
2sin (f )
-cos
'
Thus, the real part of this equation is
S1g(x):
1
^
zlr
f2o . sin [(N + +)(u - .{)]
1
Jo .f @S---frdu
,ini r-"'l
\2 /
^
,
(17)
Now let's applythisresulttothe stepfunction f (u):1for0 < u < ir and f (u):0for
rr < u < 2n . For this function, the upper limit of the integral in equation ( 17) becomes n :
S1v(x)
:
I
fJT
2" J,
du
564
APPENDICES
Now let N
+
+
=
r.,, arrrd
(u
-
x)4
-
P:
1 p@-x)n sin u du
S1s(xl:;'2n '
J-,n sin(u/2D n
1
I fxn sin u
:2ort
-J--r*ffid'+ z-
1@-x)n sin u
sinlr/2il4v
J,,
For0<xlT,thefirsttermismuchlargerthanthesecond,sincesinu/2r1
isnotcloseto
zero in the second range of integration. In the first integral, the integrand is even, and so
srv(x): f- ['n ,sin!^ du foro < x < z
srnu/24
n4
Jo
Thus, as we let N, and hence 4, become large, sin
u
/2n
-
u
/2r7 and we have
2 fxn sinu
for0<x<z
Srs(x)=- /
TTJo u
-du
The integral is zero5 for x : 0 and increases steadily as x increases until 4x :
(18)
z,
because
the integrand is positive throughout this range. For u > n, the integrand becomes negative
and the integral begins to decrease. Further cycles of the sine cannot increase the integral
rr because of the larger denominator. As 4 -+ m, we find6
back to its value dt x4
:
s(,r)- ?
:,
[*tin'dr:?1
12
u
nJo
as expected. The peak
foro<
x<n
of the overshoot is thus greater than one and is given by7
? [" "-:7r:
T|JO u
?si Qr):1.179
1r
(1e)
The peak occurs at
'
and gets closer to
5wh"n
x
:
r = 0, the second
7f
N+t/2
0 as N increases (see Figure 2).
integral becomes
* ff' "|a"
-- |
as
n
--+
@,so we retrieve the expected result
that the series converges to the midpoint of the jump (Chapter 4, Section 4.2' 1).
6See Chapter 2, Section 2.7.3, for the evaluation of the integral.
TThe integral may be evaluated numerically. See also Abramowitz and Stegun, Table 5'1.
VI THE LAPLACE TRANSFORM AND CONVOLUTION
565
r.2
r1
tl ^
1.0
S7,,'(x) o.s
0.6
ru
o4ot
I=5
0.1
S1y (.r) (equation l8) for N : 5, 10, 20, 40, and 80. The peak moves closer
0 as N increases, while the height of the peak remains approximately constant
FIGURE 2. The function
to
x
:
for large N.
VI. THE LAPLACE TRANSFORM AND CONVOLUTION
Toprove the convolution theorem (5.17) forLaplace transforms, let's calculate the transform
of the convolution of the functions f and g:
L(f
*il:
lo*
,-"
Io'
f G)e(t -r)dtdt:
,ti!*lr' lr'
e-'t f
(r)s(t -r)drdt
The integral is taken over the half-space in the /-r plane bounded by the t-axis and the line
r : t.T\e integral is evaluated by summing over vertical strips, as shown in Figure 3a.
FIGURE 3a. The Laplace transform of the convolution is an integral over the triangular
the diagonal line
r:
r, summed over the vertical strips shown.
area below
APPENDICES
OT
FIGURE
3b.
We can equally well compute the area as a sum of the horizontal strips shown here.
We can equally well evaluate the integral by summing over horizontal strips, as shown
in Figure 3b. Then the integral becomes
L(f
x
il:
Now we change variables
L(f
*
il:
:
.gn- lr' l,'
to u : t - ri
,lg- lo'
o,
lo'-'
e-'t f
(t)g(t
ou
e-s(u+r)
rT / rT-t
rrr*Jo (/
stu)e-'u
-i
dt
dr
f G)sfu)
\
au)tttle-"
dr
The integral is over a triangular region of the t -u plane, as shown in Figure 4.
As Z + oo, the integral over the square gives
du [''' f ,rre-" dr
lim [''' ,(u)e-'u
T-a Js
JO
:
_[*
Jo
sfu)e-"
du
f@
ft)e-" dt :
Jo f
G(s)F(s)
We can show that the integrals over the two smaller triangles go to zero in the limit:
I
Both
/
ltop ,,i-er. I
:l [' f ft) e-,, o, Jo
['-'
lJrp
g1u)
e-"
dul
I
and g are of exponential order, so, from equation (5.2) in Chapter 5, max lg(u) I
Mlsotu ontherange 0 < u <T12,arrdasimilarrelationholdsfor
'
f: lf G)l 1M2eo2t
VI THE LAPLACE TRANSFORM AND CONVOLUTION
567
FIGURE 4. In the u-z plane, we integrate over the area to the left of the diagonal line. We may divide
this area into the square and the two smaller triangles shown here.
forT/2 < z < T.Thus,
l4op
ri*gr"l <
MrMzllr',r"''-u' dt
: MtMz [' ,'o'-"'
du
1-
Jrtz
:
#llr',r7u--')t
-
r(or--s)r
-
e@,-')') dr]
- e@z-or)T12
02-01
n{or-rlTe@2-ot)T
-ozT
-e -sT'
which approaches zero as I -+ oo provided thats Re (s) > max (ot,o). We can use a
similar argument to show that the integral over the lower triangle also goes to zero. Then
we have
L(f * d:
F(s)G(s)
and, equivalently,
c-t@c): f * I
8The argument rnust be modified
slightly
ifal -
o2. The result is unchanged.
568
vl.
APPENDTcES
PRooF THAT Pf QD= (-1)m(1
- p\^/2#PrQ.r\
In Chapter 8, we showed that the mth derivative of the Legendre polynomial Pr satisfies the
equation (8.52):
(t
Now let
y^(lD
- t"2)yk-2(m'rDpyk+Il(l
+1) -m(m
* l)lv,:0
(20)
: (l - lr2)'z(t ). We will be able to show that z(l.t) satisfies the associated
y^:
Legendre equation with a suitable choice of the power r. We begin by differentiating
yk
: 0-
p'2)' z'
-f r(-2p')(r
-,2;r-r,
and
yk:
-4rlt(l - *27r-t/ -2r(l - 2rpl-2(r - 1)pl(1 - ,2;r-2,
Q
-
1"2)'2"
: (l - ,27r-rl{, - *'rr" - 4rpz' - {*l
1"2)'-rz
+ t2
-t*'af
Substituting this into the differential equation (20), we get
0:6 - p\(t -,zy-tlr,- u\r" -4r1tz'- {p<,+
-
2(m
*
+
U(l
+ I)
1)pt(1
-
-
m(m
p,2)'z'
+
We can divide out a factor of (1
o
:
-
-
1)l (1
-
2r
-
-
-r*'D)
t'r2)'-1 zf
p2)'z
1t2)' to get
+ rr2 -
- 4r1tz' - {ro
+ll(l+t)-m(m+1)lz
frr
p'(l
t'2
p2)2"
rr'.)- 2@ r lp(r' - ;+tr)
Gathering up terms, we have
0:
p.2)2"
-2p.(2r +m*l)zl
(1
-
*
rl,u+ t) - m(mtt>
- , --"rrr+
pt2
-2p2, -2(m-tDt"\)
QD
We want this equation to become equation (8.48). We'll get the correct coefficient of zt
we choose r : -m12.Inserting this value into equation (21), we get
0: (r - p.2)2" -2112' t
1,,,*
t) -m(m+
: (l - p2)2,, -21t2, + rlru * r, - +)
l)+
lVrt
-
p2
- u'nf
if
Vlll PRooF oF THE REI-ATION
If;
and we have the equation we seek. Thus, we have shown that Cz :
C. In physics applications, it is usual to choose C : (-1)^ and then
piQ"):
as
(-1),?(l
: !!i*'t
pJm(kp'tJ^(k' pl ap
Pl
569
for any constant
- ,zynffirt@)
in equation (8.53).
Vlll. PROOF
OF THE RELRTION J[*n
J*(kp) J^&' p) d p =tS;!)
To prove the delta function relation above (equation 8.113), we start with the Bessel differ-
ential equation (8.69):
* ('ry)
With
a
+
k2 o
t^1r,pt
-
( n<r,o't :
o
different eigenvalue k/, the equation is
0 / 0J_&,o\\
,.
&,p)_^2J^1k,py_g
p
uo\o-i )+&')'or^
Following the usual procedure for proving orthogonality (Section 8.1.1), we multiply the
first equation by J^(ktp), multiply the second by J^(kp), and subtract:
l - (k' D
h
Qry) - r
^
(kd
+
(r*P)
+ w'
-
&' )2t
p
t
^
(rc p)
l
^
(kt
p)
:e
Now we integrate from 0 to oo. We integrate the first two terms on the left-hand side by
parts, and the integrals cancel, leaving only the integrated terms:
olt^ro'off - r^(kd9J!P]f : t&)2 -o', Io
pr^(kp)r^(k'p)dp
The integrated terms vanish at the lower limit p : 0. To evaluate these terms ur,n" ffi
limit, we use the asymptotic form (8.83) for J*(kp) and the recursion relation (8.90). The
first term is
;*T
t^n'
d
lJ^-tQrp)
- J^+(kp)]
z\
: lim kP 2
\
2-4)
P-a2npJkk' "o"(k'o-^'
" f"", (oo -'- -t'"
-l
-"o, (oo
-gy-
;)]
570
APPENDTcES
Using the identity
cos A cos
f:
jt"o,
(A
+ B)*
cos (A
-
B)l
we find the first term to be
,:_, I lk I
cos[(ft +k')p-mirl*cos[(k -k')p+nl2]
I
Vft'I -cos[(k +k')p-mlt -z]-cos l(k- k')p-"/zl I
tTk
: p-*nY
lim -^l ; leD^ cos (k t k')p - sin (k - k')n]
k
i*n
The second integrated term is found by interchanging k andk' in this expression. Thus, the
difference of the two integrated terms is
tTk
lim lr/ _
p-ar
Y kl
[t-t)-
cos (k
*
k')p
-
sin(k
-
kt)p]
tlu
k')p-sin(k'-k1pl
- -\l
ftu Ik' ff-rl'cos(k*
:
!lt.- - *' r-r),cos
lim
p+6
7r
L
\/kk'
(k
+ k,)p +k+k'
sin(/<,
'/kk'
-k)p'lI
Inserting this result into equation (22), we have
tf(-l)'+l
,:*
llm
-l
p-@rv
"ot(!J!)t- k+k,
L Jtctc,
I sin(k'-k\o1
' Jklt ;;):
f@
J,
pJ'(kp)J*(k'p)dp
(23)
Now we already know9 from Chapter 6 (equation 6.16) that
rim I [*
n-o2n
J_p "'t'dr:6(k)
1 sin/cR
:6(ft)
,i^
"'oo - "-'n^ : lim
"
R+m 2ikn
R*oo z k
and thus the sine term on the left-hand side of equation (23) is
r ^., .. 3(k'-k)
@6&', -k):The cosine term is zero in the limit. (See Chapter 6, Problem 25.) We can understand this
result by looking at plots of the two functions; see Figure 5.
Thus, we have the desired result:
fo*
gSee
also
Lighthill, p.29.
ot^{tro)J^(k' p) ap
:
ltgr' -
t1
IX THE ERROR FUNCTION
571
f(k)
FIGURE 5. The functio",
{F#
(heavy lines)
""0
for Rkt :
a
anffi24
(thin lines) versus k/k/
20 (solid line) and l0 (dashed line). While the function with the sine has
prominent peak, the function with the cosine has no peak for ft and k/ positive.
IX. THE ERROR FUNCTION
The Gaussian function
*'(-+)
(24)
occurs frequently in statistics. For example, it describes the distribution of a set of measurements around the mean at x : xo, provided that the number of data points is large and
the errors are random. As a consequence, it also appears in physics-for example, in the
Maxwellian velocity distribution, where, for each component,
f (u,)
/ m 1/2 /
*p
- \r"* ',)
The function (24) has a maximum value of
value where
t;4:
-rn
I
mu?\
\-rkr )
at
x
(;) :
:
x6. It reaches one-half its maximum
".2:o6e3t5
or at
x
:
xo
I aArA:
xo
* 0.83255a
572
APPENDICES
The spread x - xo : 0.83255a is called the half-width half-maximum, while the spread
2 x 0.83255a : l.665la is called the full-width half-maximum (Figure 6).
f
-4-2024
FIGURE 6. The Gaussian
function
e-"
(x)
.Thedashed line shows the half-width half-maximum.
Frequently we need values of the area under the curve of this function. The function
o(x)
2fx
- Ji
-I
(2s)
lo "-u'd,
is called the error function or the probability integral. It is also sometimes wriften as ed (x).
The normalization arises from the value of the integral
fo*
"-,'au::
I:: "-u'du : +
(26)
To evaluate this integral (26), we begin with its square:
,': l_: "-"d* Il* ,-nor: I_:
ll*
"-"'*v\dxdy
This is anintegral overthe wholex-y plane. Wemayrewritethis integralinpolarcoordinates.
The area element is
dxdy-dA:rdrd0
and
*2+y2:v2
rx rHE ERRoR
FUNcroN
573
Thus,
,,: Io fr'" ,-,'rdrdo:r, lo* "-*+
where we have made the change of variable w
12
:
12. Then
: n(_ e_,1f,) : n
giving
,
/^+m )
e-'' dx :
:
J_*
Ji
The complementary error function
erfc (.r)
:
1
-
o(x)
:
+
f,*
represents the area under the Gaussian curve from the point
normalized.
For small
r,
(27)
"-u'du
r
out to infinity, appropriately
we may usefully approximate the error function using the series expansion:
@
o
z
i_
,O(-t): -: ) (-l)r-
J" ?o'
A-zk+l
Qk
(28)
+ t)kl
which may be easily derived by integrating the series expansion for
ofx, the function approaches the value I exponentially,
o(x)
I
..2/
-, - {r*r-^
(.t
e-"' . For large
I
3
15
\
- ;t * o*o- 816 + "' )
values
(2e)
and the complementary function likewise approaches zero:
erfc(.r)
=hr'(t-#.
Values of the function erf (-r) are shown in Thble
I
)
and Figure 7.
(30)
574
APPENDTcES
o(x)
FIGURE
7.
The error function erf (x).
TABLE 1. Values of the Error Function
erf (x)
erf (x)
0.7
0.8
0.9
0.6'n8
0.3286
0.4284
1.0
0.5205
0.6039
2.0
0.8427
0.9661
0.9953
2.5
0.9996
0
0
0.1
o.tt25
0.2
0.3
0.4
0.2227
0.5
0.6
1.5
0.7421
0.7969
X. CLASSIFICATION OF PARTIAL DIFFERENTIAL EQUATIONS
Here we shall give a brief summa.ry of the types of partial differential equations most
frequently encountered in physics. For a more extensive discussion, see, for example, Morse
and Feshbach, Chapters 2 and 6. The discussion here is presented in two dimensions, but
the extension to higher dimensions is relatively straightforward.
Suppose we have a function f (x, y) that satisfies a linear partial differential equation of
the form
-u'{ + ctl
et4
'- 0x2 +' r.u
-- 0x
0y - 6yz
:'("'
''''Y'H)
(31)
where A, B, and C are each functions of the coordinates. The function / exists in a specified
region with boundaries defined parametrically by the curyes x : EG), y : 0(r). The type
ofboundary conditions that we need in order to find a solution depends on the form ofthe
X CLASSIFICATION OF PARTIAL DIFFERENTIAL EOUATIONS
575
differential equation, as specified by the relative values of the functions A, B, and C, and
also on the form of the boundary. The boundary conditions are classified as follows:
1. Dirichlet conditions: The value of the function / is specified on the boundary.
2. Neumann conditions: The value of the normal derivative of / is specified on the boundary. For example, if one of the boundaries is at x : a,we would specify 0f l0y at x : a.
3. Cauchy conditions: Both / and its derivative are specified on the boundary.
The boundary is closed if it forms a single closed curve, part of which may be at infinity,
and values are specified everywhere on the boundary, including at infinity. It is open if it
extends to infinity in some region and no values are specified on the part at infinity.
The differential equation (31) is classified according to the values of the functions A, B,
and C, as shown in Table 2. This table also shows examples of each class from the text,
along with specific values for A, B, and C for each example.
IABLE 2. Classification
of PDEs
Name
Hyperbolic
Definition 82 > AC everywhere
Example
Variables
Wave equation (3.15)
B
:0,
x,t
AC
:
Elliptic
Parabolic
82
:
AC everywhere
Diffusion equation
(3. 14)
x,t
A:D,B:C:O
-u2
82 < AC everywhere
Poisson's equation (3.49)
x,y
A:C:1,ll:0
To determine what kind of boundary conditions we need for a solution (Table 3), we step
away from the boundary using a Taylor series. We find that this is possible given Cauchy
TABLE 3. Existence of Solutions for Given Boundary Conditions
Boundary Boundary
Condition Tlpe
Ilyperbolic
Dirichlet
insufficient
to determine
open
Parabolic
Equation
Equation
closed
solution not
Neumann
open
insufficient
to determine
Neumann
closed
Cauchy
open
Cauchy
closed
solution
in one direction
to determine solution
(Example 7.6)
no solution;
overdetermined
unique solution exists
unique
(Examples 8.1, 8.2, 8.3)
unique solution exists insufficient
in one direction
to determine solution
(Problem 7. 14)
no solution;
overdetermined
unique solution exists no solution;
(Examples4.4,'1.5) overdetermined
no solution;
no solution;
overdetermined overdetermined
solution not
Equation
uniquesolutionexists insufficient
solution
unique
Dirichlet
Elliptic
unique solution exists
(Problem 8.12)
unstable,
unphysical solution
no solution;
overdetermined
I
576
APPENDICES
conditions, provided that the boundary does not coincide with one of the characteristic
curves for the equation, which are specified by
Ady:(a+Jnr_ ec)a,
B
(32)
To understand this result, consider the wave equation (3.15). Here we have A:
: O, and C : -1, and the characteristics satisfy
dx
dt
u2,
*u
with solutions
x
Lut:
constant
These are the wave fronts. It is possible to find a solution for a wave on a sffing, for example,
ut t't.
and Ef
at t
O (see Example 4.4). But the line x
if we know both
constant is a wave front. Any function of x ut satisfies the differential equation. Infinitely
:
l|t
/(x)
:
-
-
:
many functions f(u) have a specified value and a specified derivative at one given point,
and so the solution is not determined if the boundary is a line x - ut : constant.
The directionality of the parabolic equations is well illustrated by our example: the
diffusion equation. We can integrate forward in time, but not backward. Physically, this
is related to the increase in entropy with time. The solutions smooth out as time increases,
and it is not possible, in general, to determine the initial state from a given final state.
xl. THETANGENT
FUNCTION: A DETAILED INVESTIGATION OF
EXPANSIONS
SERIES
The tangent function offers us the opportunity to investigate Taylor and Laurent series in
some detail. While the Taylor series for tan x is familiar and easily found in reference books,
the Laurent series for the function tan z are rarely seen. They do have uses, however. We'll
use this function to discuss the relation ofthese less familiar series to the Taylor series and
to discuss some methods for finding them.
First let's see whether the function tan z is analytic by checking the Cauchy-Riemann
relations. We start by finding the functions u and u:
tartz:
z.
srn z
e'z
- e-'z
2i
cos zz
e'
stz
2
I s-tz
_
"-i(x+iy)
si(x-liy) a r-i(x+iy)
-i- "i(x-liy)
e-2y +r2ix _r-2ix -,2y
e-2y + r2ix L,
+ g2l
"-2ix
* i sinh2y :uliu
cos2x I cosh2y
sin2x
e' ,g v
-t x ey
_T *e
v
x
1 ei ; ey
,"
,"
,
,
,
-t
et
+ e a Xe
+ e' a
9
,_
_,
2i sin2x - 2sinh2y
2cos2x | 2cosh2y
Xl THE TANGENT FUNCTION: A DETAILED INVESTIGATION OF SERIES
EXPANSIONS 577
Thus, for this function,
u(x, y)
:
u(x, y)
-
sin2x
cos2x
*
cosh2y
and
sinh 2y
cos2x
I
cosh 2y
You might want to check that these expressions are correct in the case 1l : 0.
Now let's compute the partial derivatives to check the Cauchy-Riemann conditions:
2cos2x
-2sin2x
--" (cos 2x
* cosh 2y-sin2xI cosh2y)z
I * cos 2x cosh 2y
_ 2 cos 2x (cos 2x I cosh 2y) + 2 sinz 2x _,
(cos2x+cosh2y)2 = " 1"o"2* 1*"h2ry
0u
0x
-:
"^^^
cos 2x
-
and
0u
0y
--
2sinh2y
2cosh2y
'
-sinh2 (cos2x * cosh2y)2
cos2x * cosh2y
2 cosh
2y(cos 2x
I
(cos2x
-
cosh2y)
*
cosh
^- I + cos 2x cosh 2y
2sinhz 2y
(cos 2x * cosh2y)2 -
2y)2
0u
A*
Similarly,
0u
0y
-2
sin2x sinh 2y
(cos 2x
I
cosh2y)2
and
Du -2(- sin2x) sinh 2y
Ex - (cos 2x I cosh2y)2 -
2sin2x sinh 2y
0u
(cos2x * cosh 2y)2 -
So the Cauchy-Riemann relations are satisfied.
Where, if ever, does this fail? It fails if the numerator blows up
denominator goes to zero. The denominator is zero where
cos2x
which can only happen
(n
+ l)n.
if y :
*
cosh 2y
0 and cos2x
:
: -1,
0y
(y -+
oo) or
if
the
Q
or 2x
:
(2n
* l)n;that
is,
x:
We already know that the tan function approaches infinity (for real arguments)
when x equals an odd number times n 12. So this is consistent.
578
APPENDToES
Now let's find some series representations for tanz and look at the relations between
them.
Series
1
We can find a Taylor series about the origin in the region
tanz
lzl < r
12 (see Figure 8):
-io,r'
n:o
FIGURE 8. The region within which we can find
a
Taylor series for the tangent function: lzl
The coefficients are given by (equation2.44)
tdn rtanz)l
"r- aazr,
lo
I
Thus,
ao:ttn(0):0
at
!: 1rd
rdz tunzllo :
I
|
or: i1d2
O*ru"rlo: i,
1
r""2 (0)
:
I
secz(secztanz)lo
:0
Before proceeding, it helps to simplify this derivative:
d2
42tan
z
:2^ sinz
cos3 z
Then
I t d sinz. l | /cosz -sinz(-sinz)\l
3---*J, )1,
ot: tI d3
1rttan.lo: j A*J;lo:3 (;e; -
I cos2z-l3sin2zl I l+2sin2zl:- I
3 cos4z lo 3 cosaz le 3
<
n 12.
Xl THE TANGENT FUNCTION: A DETAILED INVESTIGATION OF SERIES EXPANSIONS 579
oo:
t dl+zsin2zl
:
o * E--;;{, l,
4,. rttan.lo
I d4
1
*",
I
4sinzcos2z
-
4(l +2.inz.;1-sinz)
""r=
I
ro
1l
303
srn z
(z+
cos5
Iz n2z )
:Q
0
and
t""1.:* adt2sinz+sin3z
cos5 z
0
.)
5x3
(2cosz13cosz sln- e)cosz-5(- sin x)(2sinz 1sin3 z I
cos5 z
0
2
l5
and so on. Thus, we have
l, * 2.
rrz'*.'.
tanz- z*12'
Let's have the computer package Maple check this. Asked for the series expansion of
tan z,
it outputs
tanz-
z+lf * trr'* #r'
+
ffif + o(zro)
So the program agrees with our calculations (and gives us some more terms, as well).
Series
2
z
/2. The radius p
:
at z
n
- n l2l < z that surrounds the singularity at
by the position of the neighboring singularities
3n /2 (see Figure 9). To minimize confusion, we change to a new
Now let's look in the annulus lz
: tt is determined
: -n /2 and at z :
: z - n 12. Then
variable u)
J^n'
sin
-
cos
z
z
sin (w
-:
cos
* n 12)
(u -l
n
cos u,
12) - sin u;
580
APPENDToES
Now we use the previous series for tan
u.r:
l1
tanuJ w+lu3 +?srt +#.7
,(r + lwz + ?s.o + #*6
+
+
&.n
fu.t
+...1
*o(u.,10)
, (, - *,, - h.o - #,u +... + (-t)(-2) (!., * ?=,0 *#,u* f&*t * )' \
- -;
+ elt?@ (*., * ?r*o * #.u *#3.,'*. )'* .
)
\
:-; l' -!.' - tr.* -#.u* *(f,' **.u.
=
)- *,'l*
!,oo
*...)
-!u\ ( t - !,,2
3 - 45 -aru
945
/
lll.2.
---+-u+-uJ+-uJ+,,,
u345945
I
I / ft\
| r zr3
2 r rrr5
:-e-n/r)*r(t- z)+ qs\'-z)
*sos(t-r)
+'
which is the series we set out to find. It is a Laurent series with only one negative power
and is valid for 0 < lz - r /21 < r.
FIGURE 9. Both Series
I
and Series 2 for tanz are valid in the shaded region, so they must be
identical there.
Now in the region of overlap, the two series are supposed to be identical. Let's check. The
overlap region is 0 < lzl < n/2 (see Figure 9). In this region, we can expand the second
series. We begin by writing the first term as a function of 2zf n, whose absolute value is
Xl THE TANGENT FUNCTION: A DETAILED INVESTIGATION OF SERIES EXPANSIONS 581
less than
I in the overlap
tanz- -
1
region. Then
1
/
*
e-,tlz) 5(t -
: ;2 ,, 1
-rrr*
I
,t -
,t
r
I z
rr\3
t)+ E\'- r)
I
l,
+
ir' *23
+
2 r z\5
%s\'- Z) +"'
1"
1 n
- i"" + ftzn2 -
I
360-o'
| ,rot* | ,oo- | ot*...
2 ,t- r ,or*J-rtor*- g4i"
l5l2o"
r89" " ' r89" "
378" " ' r5r2'"
g " 76,
I
I
2/ rl-lz*
2 4, *l-zt
\I lo
: 1l
+llzo
+.
^zz
=)=o3
z\ tr- rr2- 7rr- n+/ 6 360
_ 1 ot*''ar+ I ^ 7 / - I 1 - | o 3+!rt
t5t20
60ro'* 1gV1zn* *z'n ,Uz"n- * +Sr'
'
1."
I
2
* *z'tr' - lggr"o -l *rz' *...
I . 4
2 r
I .
6o
-
360o"
I
I
n
o2
*#ro
I
8n
n
- t|nor' + F, + az + 60ro' -l ,rzur- + orr'
- *r'n - *,,.r'o' * #r' * *r'
:6.65 x t0-3 *0.972*7.1x
+
fif
lo-222 *0.2423
- firoo +..'
*8.8 x l0-2za +..'
Each term in the expanded series is itself a series, of which we have evaluated only the first
few terms. We can compare with the first series:
l.
2.
tanz-z*12' *,rz-*...
The a6 term should be zero; with four terms, we have 0.006 and are decreasing toward
zero. The er tem' should be 1; we have 0.97. If we could add enough terms, we should be
able to get back the first seies emctly.
Series 3 Next we'll look at the series in the annulus n /2 < lzl < 3r /2. This region is
centered on the origin, but excludes the singularities at z : *n /2 (see Figure 10). We have
a
Laurent series with coefficients (equation2.47)
I f
an: 2ni
tanz
frVa
o'
The contour C must lie within the annulus. Thus, it contains three poles, at
Using the residue theorem, we have
3
an:DRes
p=l
(zp)
I-
Q,
*.n 12.
582
APPENDTcES
v
2T
\
-7f
_7f
FIGURE 10. Annulus
tt
/2 < lzl <
3n 12
within which Series 3 for tanz is valid.
The residue at z : 0 is a'n, the nth coefficient of the Taylor series valid in the immediate
neighborhood of z - 0. To see this, note that near z : 0,
tanz D*o'^z^ \L
Zn+l
Zn+l
-:
a'^
,n*l-m
The residue is the coefficient of the z-1 term in this series (m : n)-that is, the coefficient atr.
To evaluate the residues at the other poles, we use the series valid about each of them.
For example, neur z - n f2, we found
tanz---
I
+!('-!t-
e- ft/D* :(' t)
The integrand has a simple pole at z
:
+
1r
zr3
n\'- z) +"'
r 12, so the residue there, by method
t *!(,_1)+!(,_,,t
n.r/1):
r.^(z-nl2)(_
\2) 7+n/2
V.l-\-(r-"/r)+:\t-t)+u\'-Z)
:
-l
/ 2\'+l
1o121*': - (;/
Intheneighborhood of
-rf2,letu: z- (-n12): zln12.Then
sinz _ sin(a - 1rl2) _ -cos&
tut4a"*-
*t("-"/D
-
*-
1, is
+ \)
Xl THE TANGENT FUNCTION: A DETAILED INVESTIGATION OF SERIES EXPANSIONS 583
andthus the series fortanzrra?t 7
The residue at z : -r 12 is then
n",
"*
- -n12is Series 2, with r.u replacedby, - z*n12.
: (- ,rn (z\'+r
: -' ,, : - (\.-?\'*'
\ z):;*r*r:
tt /
Y" )
(-1'\
These results are valid for both positive and negative values of n. Thus, the integral that
gives the Laurent coefficients is
o^:
$ Ho=
" ^L.
2ni Jc
z'*t
fu,:, *", (zi): a',* (?)".' t-l + (-1)'l
0 for negative n. The term - 1 + (- 1)' is zero for even values of n and equals
odd values ofn (positive or negative). Thus, Series 3 is
atn:
where
-2for
tan
T: -'(+)^ + -'Gf ; -:..('- #)
..' - + a'). ul+ - : (1)').
[l
'-
-
12.
r124.9348
z - - +z(0. 18943)
-
I'76-
+23(4.8219
This is
lzl <
a
x
10-3) + z5(t.9zee
x t0-4) +...
Laurent series with infinitely many positive and negative powers, valid for ir 12 <
31 12.
:
n and compare with the known value tan zr
Let's evaluate this series 4t 7
sum of the six terms listed above gives
tanrr
:
0. The
:
-+
rr 3 tr 15
23n 2r r
13651,2
: "' - ff
* fr"t+ "' - -3'2x t0-2
+r +
3tr3
7T
The next two terms are
( ?\' 1 * f ?)'.,.l * 11 ,, :2.s246x ro-2 at z:
-rl
"L\")
"l':ts
,''\''/
Thus, the sum
of eight terms of the series is -3. 1981
-6.735 x l0-3. The series
n
x l0-2 + 2.5246 x I0-2 -
is approaching the correct value, but rather slowly.
584
APPENDToES
At z :3n 14,the value should be -1. With six terms, we get
,un3n
--'4
:...-
. (?)'
:
/ 4 \3 I
1\t -4.g3,*t(;/
-i+
\3n/
t2.t76f
,o.rrrnx 10-3).
(?)'
3n
4(o'r8e43)
(1.s266x r0-4)
... _ 0.97 +...
These series clearly converge more slowly than the standard ones we are used to using,
but they do have their uses. Notice how we used the first two series to obtain the coefficients
for the third; this is a good demonstration of the idea of analytic continuation.
The region of validity for the third series overlaps with that for the second. Again, the
two series are identical in the region of overlap.
Bibliography
Abramowitz, Milton, and Stegun, Irene A. Handbook of Mathematical Functions,9th printing. National Bureau of Standards, Applied Math Series 55,1970.
Anton, Howard, and Rorres, Chis. Elementary Linear Algebra,6th edition. Wiley, New
York, 1991.
Arfken, George B., and Weber, Hans
J. Mathematical Methods for Physicists,4th edition.
Academic Press, San Diego, 1995.
Bauerle, G. Studies in Mathematical Physics, Volume l: Finite and Infinite Dimensional
Lie Algebras and Applications in Physics. North Holland, Amsterdam, 1990.
Bowman, Frurk. Introduction to Bessel Functions. Dover Publications, New York, 1958.
Bradbury, Ted Clay. Mathematical Methods withApplications to Problems in the Physical
Sciences. Wiley, NewYork, 1984.
Butkov, Eugene. M athematic al P hy s ic s. Addison-Wesley, Reading, I 96 8.
Chattopadhyay, P. K. M athe mat i c al P hy s i c s. Wiley, New York, I 990.
Chow, Thi L. Mathematical Methods for Physicists: A Concise Introduction. Cambridge
University Press, Cambridge, 2000.
Cohen, Harold. Mathematics for Scientists and Engineers. Prentice Hall, Englewood Cliffs,
1992.
Cook, David. Computation and Problem Solving in Undergraduate Physics. Brooks/Cole,
Pacific Grove, CA, in press.
Courant, R., and Hilbert, D. Methods of Mathematical Physics. Wiley, New York, 1953.
Dennery Philippe, and Krzywicki, Andrd. Mathematics for Physicists. Harper and Row,
NewYork, 1967.
Edwards, C. H., Jr., and Penney, David E. Dffirential Equations and Boundary Value
Problems. Prentice Hall, Englewood Cliffs, 1996.
Ford, Lester R. Dffirential Equations. McGraw-Hill, NewYork, 1955.
Garcia, Alejandro L. Numerical Methods for Physics. Prentice Hall, Upper Saddle River,
2000.
Georgi, Howard. Lie Algebras in Particle Physics. Perseus Books, Reading, 1999.
Geroch, Robert. Mathematical Physics. The University of Chicago Press, Chicago, 1985.
Goldstein, Herbert. Classical M echanics. Addison-Wesley, Reading, 1959.
Golub, Gene H., and Van Laon, CharlesF. Matrix Computations. The Johns Hopkins University Press, Baltimore, 1983.
Gradshteyn, I. S., and Ryzhik, l. M. Table of Integrals, Series and Products, corrected and
enlarged edition. Academic Press, Orlando, 1980.
585
586
BTBLToGRAPHY
Halmos, Paul R. Finite Dimensional Vector Spaces. Springer-Verlag, New York, 1974.
Ince, E. L. Ordinary Dffirential Equations. Dover, NewYork, 1956.
Jackson, J.D. Classical Electrodynamics,3rd edition. Wiley, NewYork, 1999.
Jeffreys, Sir Harold, and Jeffreys, Bertha Swirles. Mathematical Physics. Cambridge University Press, Cambridge, 1962.
Jones, H. F. Groups, Representations and Physics, 2nd edition. Institute of Physics Pub-
lishing, London, 1998.
Lawden, Derek. Tensor Calculus and Relativity. Science Paperbacks, Methuen Publishing
Co.,1962.
Lea, S. M., and Burke, J. R. Physlcs: The Nature of Things. Brooks/Cole, Pacific Grove,
cA, t997.
Lighthill, M. I. Introduction to Fourier Analysis and Generalised Functions. Cambridge
University Press, Cambridge, I 958.
Long, Robert R. Mechanics of Solids and Fluids. Prentice Hall, Englewood Cliffs, 1963.
Margenau, H., and Murphy,G.M. Methods of Mathematical Physics. Van Nostrand, Princeton, 1956.
Marion, J., and Thornton, S. Classical Dynamics of Particles and Systems,4th edition.
Saunders, Fort Worth, 1995.
Mathews, J., and Walker,
York, 1964.
R.L. Mathematical Methods of Physics.
W. A. Benjamin, New
Morse, Philip M., and Feshbach, Herman. Methods of Theoretical Physics. McGraw-Hill,
NewYork, 1953.
Murphy, G.M. Ordinary Dffirential Equations andTheir Solulions. Van Nostrand, Princeton, 1960.
Murray, Francis J., and Miller, Kenneth S. Existence Theorems for Ordinary Dffirential
Equations. New York University Press, New York, 1954.
Polyanin, A. D., and Zaitsev,Y. F. Handbook of Exact Solutions for Ordinary Dffirential
Equations. CRC Press, Boca Raton, 1995.
Press, W. H., Teukolsky, S. A., Vetterling, W. T., and Flannery, B. P. Numerical Recipes,
2nd edition. Cambridge University Pres s, Cambri dge, 7992.
Schiff, Leonard L Quantum Mechanics,2nd edition. McGraw-Hill, New York, 1955.
Stewart, James. Calculus,4th edition. Brooks/Cole, Pacific Grove, CA, 1999.
Temme, Nico M. Special Functions: An Introduction to the Classical Functions of Mathematical Physics. Wiley, NewYork, 1996.
Tinkham, Michael.GroupTheory andQuantumMechanics.McGraw-Hill, NewYork,1964.
Tucker, Alan. Linear Algebra. Macmillan, New York, 1993.
Tung, Wu-Ki. Group Theory in Physics. World Scientific, Philadelphia, 1985.
Wallace, Philip. Mathematical Analysis of Physical Problems. Dover, NewYork, 1984.
Wilkes, M. V. A Short Introduction to Numerical Analysis. Cambridge University Press,
London, 1966.
Zemanian, A. H. Distribution Theory and Transform Analysis. Dover, New York, 1965.
Zwillinger, Daniel. Handbook of Dffirential Equations. Academic Press, San Diego, 1998.
Index
The main reference or definition is given in boldface'
Abelian grouP, 465
Absolute convergence, 103' 105
Absolute value, of complex number, 78
Absorption, 150,322
AC current, 161,267
Addition
of complex numbers, 76
as vector addition, 80
of matrices, 42
of vectors, 5, 41
Addition theorem
for Bessel functions, 436
for spherical harmonics, 392
Adjoint matrix,44' 469
Affine connection,456
Affinity,456
Air resistance, 17 O' 2ll, 215, 216
Airy equation,2l2
Airy integral, 538
Algebra, Lie,49O
Algebraic form, Permanence of, 119
Allied series, 563
Altemating series, 104
Ammonia,483
Ampere's law, 35, 86
Amplitude
of complex number, 78
ot of , 44'l
71
oPerator,
Angular momentum
tensor rePresentati
Angular velocitY, 15
Antisymmetric matix, 42
Antisymmetric tensor, 446
Aperiodic function,
Fourier series for, 220
Argand dragram,TT
Argument, of comPlex number, 78
and branch cuts, 84, 121
Associated Legendre equation, 369, 384,
38s, 420, 568
Associated Legendre functions, 385' 569
integral of,387,431
with negative m, 385
orthogonalitY of' 384
table of, 386
Associative ProPerty, 3E, 465, 47 L
Asymptotic form, for modified Bessel
functions, 196
Asymptotic methods
for differential equations, 183' 194
for integrals, 531
Atom, 353,389
Attenuation, and Fourier transform, 331
Attenuation ProPertY, 254
AxisymmetrY, 369, 409, 419
ofwave,85
Analytic continuation, ll7 ' 156,21O
bac-cab rule, 17
Analyticity,97
Basis, change of,51, 58,476
Basis forms, 454
Basis vectors, 39, 220, 453' 47 6
change of, 53
and tangent function, 584
Angles, between curves, 1 5 I
Angular frequencY, 85
Angular momentum, 42, 439, 441
ofparticle,
Backward difference, 199
15
587
588
Branch point, 85,
Beam
bendingof, 17l
on elastic foundation, 354
l2l,
147,157,187,272
more than one, 91
Bromwich contour, 269
simply supporte d, 258-261
Bernoulli-Euler laq 171
Bernoulli's laq 68
Cable, hanging,548
Bessel functions
Calculus of variations, 541
with more than one variable, 551
Capacitor
energy stored in, 243
parallel plate, 155, 166
Cartesian coordinates, I
asymptotic form of, 195, 404, 413, 424
generating function
fol
407
integral of, 409, 418, 434
integral representation of , 286,
407,531,538
Laplace transform of , 436
with negative index, 401
orthogonality of
in finite domain,408
in infinite domain, 418,434,569
relations between, 405406
series, 400
small argument approximation of, 403
zeros of, 41O,522
Bessel's equation, 212, 214, 398, 399
Laplace transform of, 286
modified, 194,217,412
series solution of, 400
Biot-Savart law, 15, 463
Boundary
closed, 575
open, 575
Boundary conditions
cylindrical, 409, 415, 418
for fluid flow problems, I 14
and Fourier series, 237
and Green's functions, 495, 510, 513
for Laplace's equation, 210
and numerical solution of differential
equations, 199
and ordinary differential equations, 169,
174,176
and partial differential equations, 575
spherical, 381, 393
and Sturm Liouville theory 357, 363
Boundary value problem, 381, 393, 409,
415,418,5t2-516
Brachistochrone, 552
Bracket,490
Branch, offunction,84
Branch cut, 84, 90,91,96,121, 188
and contour integrals, 128, 147 , 157 ,272
curl in, 19
divergence
gradientin,
in,
19
18
Green's function in, 501
Laplacian in, 28
potential in, 364
vectors in,5, 10
Cartesian tensor, 439
Catenary, 548
Cauchy conditions, 575
Cauchy formula, 99, 106, 107, 149, 559
Cauchy-Riemann relations, 95, 162
Cauchy theorem, 97,109, 122
Cauchy's inequality, 167
Causality
and Fourier transform, 334
and Green's functions, 496,504
Center,
of agrotry,492
Central difference, 199
Chain, of nuclear reactions, 277
Change of basis, 53
Character,480
orthogonality of,48l
Character table,482
Characteristic curve, 5 76
Characteristic equation, 5 3
Charge, 172,234
induced, 521
Charge density
dipole,294
expressed in terms of delta
functions, 317
ring,315
sheet, 298
Choice
of coordinates ,28,210
of transform, 347
INDEX
Christoffel symbol of 2nd kind, 458
Circle
in complex plane, 153, 154
great,554
lunit, 129
Circuit, LRC, 172, 17 4, 234, 256, 265
Circular motion, 15, 447
Circulation, 21
Class,470
and number of irreps, 48 I
Classical orthogonal polynomials, 425428
Closing the contour, 134
Coefficient, in Fourier seies, 222, 225,
226,227,229,242
Cofactor,45
Collisions,
16l,l73
Column vector, 4l
Commutative property
ofaddition,38
of multiplication, 10, 13,43
Commutatot 488,490
Comparison test, 104
Complementary error function, 573
Completeness relation, 361, 505
Complex conjugate, 44, 7 7
and differentiab ility, 9 6, 162
of Fourier transform, 330
offunction,163
and orthogonality, 225, 362, 388
as reflection in real axis, 8l
Complex exponential, 79
weak
limit
oi
311
Complex impedance, 161
Complex integral, path independence of, 98
Complex number, polar representation
ol
78
Complex plane,77
Complex potential, I 13, 155
Complex roots, of indicial equation, 193
Components, of vector, 5, 37, 39, 41, 439
Compton scattering, 215
Conditional convergence, 103
Conductivity, 87,460
Conformal mapping, 150
Congruent transformation, 5 8
Conical functions, 214
Conjugate elements, 468, 470
Connection, affine,456
Conservation of momentum, 547
Constants
separation, 209
structure, 488
Constrained variation, 548
Continuity, 89
of Green's function, 506, 5 16
Continuity equation, 66, ll2
Continuous function, 89
Continuous set of eigenvalues, 417
Contour
Bromwich, 269
closing, 134, 136, 138, 270
deforming, 100, 108, 124,532
for gamma function integral, 157
for Hankel function integral, 531
keyhole-shaped, 147
rectangular, l4l
semi-circular, 134
Contour integral, basic method for
evaluating,127
Contraction, 444,452
Contravariant tensor, 448
Convention
for labeling matrix components, 7, 42
summation, 8
Convergence
absolute, 103, 105
of Fourier series, 240
in the mean, 241,360
pointwise, 104
of Fourier series, 240
radius of, 105, 106, 184
of series, 103
uniform
of function, 140,241
ofseries, l04,24l
weak,305
Convergence tests, 103-104
Convolution
cosine transform,35l
Fourier transform, 332
Laplace transform, 265, 565
sine transform, 350
Coordinate rotations, 6, 476
Coordinates
Cartesian, 1
curl in, 19
divergence
gradientin,
in,
18
19
589
590
Coordinates (continueA
Green's function in, 501
Laplacian in, 28
potential in, 364
vectors in,5, l0
choosing,28,2lO
cylindrical, 2
charge density in. 3 I 6
curl in, 35
divergence in,33,459
gradientin,32
Green's function in, 521
Laplace's equation in, 397
Laplacianin,3T
transformation matrix into, 449
unit vectors in, 29
velocity in, 454
general curvilinear, 28
curl in, 34
delta function in, 3 19
divergence in, 33
gradient in, 32
Laplacian in, 36
generalized,546
skew, 66
spherical, 4, 28, 298, 314, 315-316
Green's function in, 516
Laplace's equation in, 367
wave equation in, 419
spheroidal, 29, 160, 430
Core function, 305, 313, 329
Coset,471
Cosine function
Laplace transform of, 253
series
transformation law for, 445,556
Cube root function, 90
Cube roots, 83
Curl
in Cartesian coordinates, 19
in curvilinear coordinates, 34
in cylindrical coordinates, 35
expressed with determinant, 35
Fourier transform of , 334
Current
in circuit, l3l, 172, l7 4, 234, 256, 265
in conducting sheet, 430
Current density, 87
Curvilinear coordinates, 28
curl in, 34
delta function in, 319
divergence in, 33
gradient
in,32
Laplacian in, 36
Cycle,468
Cyclic group, 466,472
Cylindrical coordinates, 2
charge density
in,316
curl in, 35
divergence in, 33
covariant, 459
grad and curl in, 31
gradient in,32
Green's function in, 521
Laplace's equation in, 397
Laplacian in, 37
transformation matrix into, 449
unit vectors in, 29
velocity in, 454
for, 185
Cosine rule, 67
Cosine series,229,232
Cosine transfomt,344
of derivative, 346
Cosines
Damped harmonic oscillator, 171
product of,22l
sum of, 221
Coupled pendulums, 58
Covariant derivative, 45 6
of metric tensor,458
Covariant tensor,448
Cramer's rule, 51
Cross product, 14, 445, 555
and antisymmetric tensors, 446
Deflection, of beam,261
Deforming integration contour, 532
Degeneracy,362
Damping,87,17l
Daughter species,277
De Moivre's theorem, 160
Decay, mclear,277
Del squared operator, 28
Delta function, 287 -304
derivative of, 293
expressed as sum of eigenfunctions, 361
Fourier series of, 300
Fourier transform of, 329, 503
INDEX
of
a function, 295
and Green's functions, 495, 505, 512
integral of,299
integral representation of, 303-304, 319,569
Laplace transform of, 304
physical dimension oi 288
as weak
limit, 307
Delta sequence, 288, 3 1 8
Densities, expressed with
delta functions, 317
Derivative
of complex function, 94, 96
covariant, 456
directional, l8
of distribution, 308
expressed as line integral, 101, 559
Fourier transform oi 330, 333
Laplace transform of, 254
of Laplace transform, 261
numerical, 199
Determinants,44, 5l
and curl, 35
and Levi-Civita symbol, 45, 555
product theorem for, 46
Deviation, squ'are, 241, 352
Diagonal, of mafrx,42
Diagonalize a matrix, 53,47'7
Dielectric constant, 86, 148, 353
Difference, backward, central, and
forward, 199
Difference equation, 198
Differentiability, 93
Differential area,34
Differential equations
with constant coefficients, 1 74
with distributions, 308, 495
Fourier series solution of,234
homogeneous, 169, 174
inhomogeneous, l7 O, l7 6
Laplace transform solution oi 255
linear, 169, 17 4, 177, 251, 255,
334,336
numerical solution of, 199
orderof,169
ordinary, 169
partial, 169,173,363
classification of, 574
existence of solutions for, 575
and Fourier transforms, 336-340
separation ofvariables in, 208
power series solution of, 184-196
Differential operator, even, 186
Differential volume, 32, 33
Differentiate term by term, 104, 311
Diffusion, 173
of magnetic field, 355
Diffusion coefficient, 173
Diffusion equation, 173, 57 5
and Fourier transforms, 339
Green's function for, 5lO, 529
Dimension
of group representation, 475
of vector space, 38, 39,475
Dimensionless variable, 1 97
Dipole,294
electric field due to, 320
Dipole moment and molecules,484
Dirac, P. 4.M,287
Direct product grotp, 473
Direct sum of irreps, 478
Directional derivative, 1 8
Directionality, in solution of differential
equations, 576
Dirichlet conditions, 357, 513, 57 5
for Green's functions, 504
Dirichlet Green's function
cylindrical, 524,526
rectangular, 505-5 l0
spherical, 516, 529
Discontinuity, in electric field, 516
Discontinuous functions, 240
Fourier series for, 223
and Laplace transform, 265
Dispersion relations, 148
Displacement, 450
electric, 353
parallel, 456
of string, 303
Distributions, 305
derivatives of, 308
general theory of, 305
in N dimensions, 313
product of, 308
properties of, 307
as solutions of differential equations,
308-309
Distributive law, 38
591
592
Divergence, 19
covariant,459
in curvilinear coordinates, 32
Divergence theorem, 22, 32
Dividing space, 505
Division-of-region method, for Green's
function, 496,515
Dot product, 12-13, 40, 220, 444, 452
Drum,433
Dual, of tensor, 446
Dual space,453
Eigenfunctions , 2lO, 358, 499-502, 505,
509,515,525
complex,361-362
and delta function, 361, 417, 418, 569
and Green's functions, 499
orthogonality of, 359
Eigenvalues, 210, 358
of associated Legendre equation, 384
choosing, 363,409
of commuting matrices, 57
continuous set,4l7
generalized,64
of Legendre equation, 370
of matrix,53
reality of, 53
in Sturm-Liouville theory, 361
Eigenvectors, 53
of commuting matrices, 57
orthogonality of, 54, 56
Electromagnetic waves, in conducting
Equipotentials, I 16, 156
Equivalence class, 470
Equivalent representations, 476
Erf function, 572
Erfc function, 573
Error function, 306, 57 2
asymptotic expansion for, 573
series expansion
values of, 574
ol
573
Essential singularity, 120
Euler differential equation, 193
Euler equation, 544
Euler-Lagrange equation, 544
Euler method, 200
Euler's constant, 321
Euler's definition of gamma function, 156
Euler's formula, T9
Even differential operator, 186
Even function, 129
Existence of solutions, for partial differential
equations, 575
Exponential, of matrix, 485
Exponential function, 79, 85
as asymptotic form, 194
and attentuation, 254
Laplace transform of, 25 1
series for, 108
as solution
of differential equation, 174
as solution of Laplace's equation, 210
Exponential integral, 280
Exponential order, 25 I
Extremum,541
medium,87-89
Elementary matrices,48
Elementary product, 44
Ellipse, 160
Elliptic differential equation, 575
Energy
electrostatic, 552
gravitational potential, 548
Entire function, 97
Entropy,576
Equality of matrices, 42
Equation of state, 216
Equations
that reduce to Bessel's equation,
216,2t7
transcendental, 550
Equilibrium,58
Factor grotp,472
Factorial, 157
Fairly good functions, 305
Faithful representation, 475
Faraday's law,86, 132
Fermat's principle, 541
Fieldlines,114
Fluid,21, 66,68,112
Flux,21, ll2,l32
electric, l3
magnetic, 132
Form,448
basis,453
Form factor, 355
Forward difference, 199
Fourier-Bessel transform, 4l 8
INDEX
Fourier cosine series, 229,23I
Fourier-Legendre series, 3 8 I
Fourier series,2ll, 219 etseq,323
for aperiodic function, 220
multivalued, 84, 147
odd,512
periodic, Laplace transform
263
Functional,543
convergence of , 240-243
of delta function, 300
Gamma function, 156
as distribution, 311
Fourier sine series, 228,231
Fourier transform
of delta function, 329,503
of derivative, 330, 333
as distribution, 337
evaluating, 329
asymptotic form of, 159,537
incomplete, 159
and Laplace transform of powers, 252
Gauss-Jordan method, 48
Gauss'law, 86,459
Gauss'theorem, 22
Gaussian elimination, 5l
Gaussian function, 340, 57 I
as delta sequence, 306
Fourier transform of, 327
Gaussian integral, 157, 572
along path off real axis, 328, 534,536
Generalized eigenvalues, 64
Generalized function, 287
Generalized Rodrigues formula, 425
ofGaussian,32T
of grad, curl, and Laplaciary 334
and Green's function, 503
inverse, 324
and Laplace transform, 325
in N dimensions, 333
symmetric, 324
Fourier's theorem, 219
Fourthroots, 136
Generating function
for Bessel functions, 407,408
for Herrnite polynomials, 433
for Legendre polynomials, 375
Generator, of group, 485, 487
Geometric series, 105, 106, 110
Gibbs phenomenon, 223, 562
Fractions, partial, 258
Free index, 8
Frequency
nafirzl,236
normal mode, 63
resonant, 236,502
Fresnel integrals,437
Fringing field, 167
Frobenius method, 187
Fuch's theorem, 190
Full-width half-maximum, 572
Function
analytic,9T
with branch point, integral of, 147
complex,83
continuous, 89
and distribution, 307
Global2'l
Good functions, 305
Goursat, 97
Gradient, 18,31,448
Gram-Schmidt orthogonalization, 40
Great circle, 554
Green's function, 317, 495
for beam, 526,527
for diffusion equation, 5lO,529
division-of-region method for, 496,
505,515
core, 305
expansion of, in eigenfunctions, 501,
delta,287
discontinuous, 223, 240, 265
508,510, 519,526
for Helmholtz equation, 50I,527
in N dimensions, 512
for Poisson's equation, 505, 5 12
entire,97
even,129
garnma, 156,537
generalized, 288
meromorphic,
oi
test, 305
l2l
with more than one branch point,
with multiple zeros, 210
91
symmetry of,495,514
transform methods for, 503,
Green's theorem, 26, 512
5 10
593
594
as solutions ofLaplace's equation, 211
Hypergeometric equation, 188, 212
Group,465
abelian, 465
cyclic,466,472
direct product, 473
factor,472
generator
Identity element, 465, 466
character of, 481
of,484
for factor grotp,472
Lie,490
Lorentz,486,492
multiplication table for, 466,468
operation,465
permutation, 467,482
rotation, 467,484
unitary,469
Group speed, 539
Guitar,246
Half-width half-maximum, 572
Hamiltonian, 553
Hamilton's principle, 546
Hankel functions, 402, 421
contour for evaluating, 531, 535
Harmonic functions, ll3, 532
Harmonic oscillator, l7 l, 186, 282,
352-353,432
damped, 171
Heat conduction, 173,
5
l0-5 I I
Heat flux, 173,354,430
in semi-infinite rod, 510
Helmholtz equation, 173, 433
in cylindrical coordinates, 406
eigenfunctions of, and Green's
function, 509
Green's function for, 5Ol, 527
in spherical coordinates, 420
Taylor series solution of, 184
Helmholtz theorem, 37, 556
Hermite polynomials, 187, 428, 432, 433
Hermite's equation, 186, 432
Hermitian matix,44
Ha@\402
Homogeneity of space, 362
Homogeneous boundary conditions, 504
Homogeneous differential equation, 169, l7 4
Homomorphism, 474
Hooke's law, 171
Hyperbolic differential equation, 575
Hyperbolic functions, 79, l4l
and catenary, 549
and generators, 484. 486
Identity matrix, 8
ra@),412
Imaginary part,76
Impedance,267
complex, 161
Impulse, 287
applied to string, 302
Incomplete gamma function, 159
Incompressible flow, 68, 112
Indices, raising and lowering, 453
Indicial equation, 189, 190
complex roots of, 193
repeated roots of, 190, 213,214
roots of that differ by an integer, 190, 213
Inductance, 16l, 172, 234,256
Induction, proof by, 559
Inequalities, for complex numbers, 80
Inertia tensor, 42, 439, 441, 460
Infinity, point at, 195
Inhomogeneous boundary conditions,
510,513
Inhomogeneous differential equation, 170,
176,334
choice oftransform for, 349
Fourier series solution of,234
general solution of, 183
Laplace transform solution of, 255,265
Initial conditions, 170, 174, l'77
and Fourier transform, 334
and Laplace transform, 254, 256, 27 4
and sine and cosine transforms, 346-347
for vibrating string, 238
Initial value problem, 256
Inner product, 39, 444, 452
Integers, 75
group of,466
Integral representation, of delta function, 304
Integral test, 104
Integral
Airy,538
asymptotic form of, 534
INDEX
complex,9T
contour, 127-148,269
general method for,127
of delta function, 299
exponential, 280
ofGaussian,328
of Laplace transform, 262
mean value theorem for, 560
of multivalued functions, 147
orthogonality, 359
path,2O
path independence of, 98
of P1-,38'7,431,432
Ladder operatoq 378
Lagrange's equations, 60, 54'l
Lagrangian, 57,59,545
with EM fields, 554
for vibrating string, 553, 554
Laguerre polynomials, 428
Laguerre's eqtation, 212
Landau damping, 166
Lane-Emden equation. 2 I 6
Langmuir waves, 165
Laplace development, 45
Laplace transforms, 251, 325, 347, 348, 436
ofBessel function,436
probability,572
and convolution, 265, 565
along real axis, 134-148
of delta function, 304, 503
of a derivative, 254
of a discontinuous function, 265
sine, 143,564
of trigonometric functions
over one peiod, 129-134, 220-221
along real axis, 137
Interior, of curve, 100
Invariance,9,
13
Invariant subgroup, 472
Invariant subspace, 476
Inverse, of matix,8,47
Inverse element,465
Inverse square law, 342,421
Inverting a transform, 253, 269, 324, 344-345
Irrationals, 75
Irreducible representations, 477
Irreps, 478
number of, 480, 481
Irrotational flow, 112
Irrotational vector field, 20, 70
Isolated singularity, ll9, 123
Isomorphism,4T4
Isotherm, 18
Jacobi identity, 67, 489
Jacobi polynomi als, 428
Ja@),400
Jordan's lemma, 140
Kernel,492
Kinetic energy, 57-58
Kirchhoff's laws, 161, 172,234,256
KmG),4t2
Kompaneets equation,
21 5
Kramers-Kronig relations, 150, 322
Kronecker delta, 8, 39, 42
595
and inversion, 256, 258, 269
of a periodic function, 263
ofa square wave,265
table of,253
Laplace's equation, 367, 552
and analytic functions, I 13
in cylindrical coordinates, 397
solution of
in Cartesian coordinates, 209,364
in cylindrical coordinates, 415, 418
in spherical coordinates, 373, 388
in spherical coordinates, 367
use of in finding Green's function, 515
Laplacian,28,36
and delta function
in three dimensions, 314
in two dimensions, 315
Fourier transform of, 334
Laurent series, 1.08-11.1, 1 15
and differential equations, 187
and residues, 122,124
and singularities, l2O, l2l
for tangent function, 580, 583
Legendre functions
with complex l, 214
of second kind, 193, 373,430
Legendre polynomials, 37V37 2, 425, 428
derivatives of, 385
explicit expression for, 381
integral of, 376
orthogonality of,374
value with zero argument, 381
lrgendre's equation, 214, 370, 420
and Laplace's equation, 367-369
second solution oi 181, 190
series solution of, 190, 369
Levi-Civita symbols, 14, 16,45
in four dimensions, 463
product of, 16
as tensor density, 445, 464
Lie algebra,490
Lie group,490
Limit point, of singularities, 122
Line charge, 506
Line element, 31,319, 451
Line irtegral,20,97
path independence of ,
2l
Mathernatica,4S, 197
Matrices, 7, 41, 469, 47 4480
indices of, convention for, 42
inverse of, 8,47
multiplication of , 10,
4L4!
441, 467, 479
orthogonal,48
positive definite, 64
sum of,42
transpose of, 44
unitary,48
Maxwellian velocity distribution, 166, 57 1
Maxwell's equations, 86, 131, 208
Mean value theorem for integrals, 288, 560
Mellin inversion integral, 269
Memory systems with, 265
Linear differential equation, 169
Linear independence
of functions, 174, 179
of vectors, 39
Linear operator, 253, 329
Linearity, of Fourier transform, 329
Liouville's theorem, 167
Load, on bearn,172
Meromorphic function, 121
Metric coefficients, 3l
Metric tensor, 451
Modeling, 145
problems in, 274
Modified Bessel equation, 194,217 , 412
Modified Bessel functions, 412
asymptotic form of, 196,413
Local,27
integral representation of, 537
recursion relations for, 414415
series representation of, 412
Moment of inertia, 17 l, 439, 441
Log function
branches of, 84
as
conformal mapping, 153
for, ll9, 193,376
Lorcntz force,554
Lorentz grotp,486,492
Momentum,328
Motion
with air re-sistance, 176
in E and B fields, 65,554
Multiplication table, of group, 466, 468, 47 5
Multivalued tunctions, 84,90, 147
Mutual inductance, 274
Lorentz transformation, 462
Lowering indices,453
Neighborhood,97
cosine and sine of, 194
and delta function, 315
and second solution of differential
equation, 190
series
ZRC circuit, 172,256
with periodic driving emf,234,267
Magnetic field, 15, 35,86,132,432
diffusion of, 355
Magnification, 81, 151
Magnitude
of complex number, 78 (see alsoAmplitude)
ofvector,
13
Network, nuclear reaction, 277
Neumann conditions, 357, 513, 528,
for Green's function, 504
Neumann function, 4Ol, 421
Neutron density, evolution of, 356
in uranium,438
Newton's second law, 5,9,170
57 5
N^(x),402
Nonsingular matix,47
Map\e,48,197,579
Normal modes, 58,60,72
Mapping,473
Normal subgroup,472
conformal, 150
Normal to a surface, 24
function as, 83
Normal vector, and Stokes'theorem, 24
INDEX
Nuclear reactions, 27 7, 28I, 285, 438
Nucleon interaction, 215
Numerical solution
of lst order differential equation, 199
of 2nd order differential equation, 204
Oblate spheroidal coordinates, 160
Laplace's equation in, 430
One-to-one, 83,473
Onto,473
Operation
group,465
preservation of,474
Operator
ladder, 378
self-adjoint, 362
translation,492
Order
of differential equation, 169
of element,472
of group, 465,471,479
of numerical integration method, 2OO,2O3
ofpole, l2l
ofzero,119
Ordinary differential equations,
coupled, 209
Orthogonal coordinates, 1,29, 451
Orthogonal functions, 220, 35V359, 384,
388,408, 418,424,425
Orthogonal matrix, 8, 48,467,484
Orthogonal polynomials, 37 0, 425428, 432
table of,428
Orthogonal transformation, 56
Orthogonal vectors, 39
orthogonality, 220, 35u359
of associated Legendre functions, 384
of Bessel functions
in finite domain, 408,434
in infinite domain, 4I7,569
of characters, 481
of complex eigenfunctions, 362
of complex exponentials, 225
of derivatives of eigenfunctions, 429
of eigenfunctions, proof of, 358-359
of eigenvectors, 54
of eigenvectors with equal eigenvalues, 56
of group representations, 479
in infinite domain, 417
of Legendre functions, 374
of sine functions,224
of spherical Bessel functiots,424
of spherical harmonics, 388
of trigonometric functions, 220
ofvectors,39
Orthogonality integral, 359
for associated Legendre functions, 387
for Bessel functions, 409
for Legendre polynomials, 376
for spherical Bessel functions,425
Orthonormal basis vectors, 40
Orthonormal functions, 388, 499
Oscillations
damped, 175
small,57
Outer product,444
Parabola, 160
Parabolic differential equation, 575
Parallel displacement, 456
Parameters, variation ol 180
Parseval's theorem
for Fourier seies, 243,247
for Fourier transform, 33 l, 332
Partial differential equations
classification of,574
and Fourier transforms, 336
Partial fractions, 258
Partial sums, 103
of Fourier seies, 224, 241, 562
Particular integral, 17 0, 17 6
Path independence, of integrals, 2 I, 98
Path integral, 20, 97, 531, 541
Pauli marices,484
Pendulum,216
conical, 553
coupled, 58
with increasing length, 435
Period
ofgroup element,472
integral over, 129, 220-221
Periodic function, Laplace
transform of, 263
Permanence of algebraic form, 119
Permittivity, 149
Permutation group, 467,
characters of, 481
Permutations, 467
even or odd,14,44
47
0
597
598
INDEX
Phase
of a complex number,
78
535
85
constant,533
85
l'73
Physical dimension
ofdelta function, 288
of Green's function, 520
Physical law,5
Physical model,I45,273
Piano,246
Piecewise smooth, 240
Plane wave, 85, 333
in cylindrical coordinates, 406
Plasma waves, 161, 165
Point at infinity, 195
Point charge, 287
Point source, 317,495
Poisson's equation, 208
as elliptic equation, 575
and Fourier transform, 355
Green's function for,5l4
in one dimension, 309
in two dimensions, 505
Polar angle, 4
Polar axis, 4, 395
symmetry about, 369
Polar form, of complex number, 78-79
Pole, 120
order of, 121
onrealaxis, 143, 149
stationary,
a wave,
Phase angle,
Phase constant,
Phase speed, 86,
of
in Cartesian coordinates, 363-367
in cylindrical coordinates, 409,415
due to dipole, 295
due to point charge,375
in spherical coordinates, 388
vector,208
velocity,
ll2
Potential energy, 58, 545,548
Power, 13, 15
in AC circuit, 16l
radiated, 165,341
Power series, 105-lll, 184-196
Power spectrum, 342
Poynting flux, 341
Preservation, of operation, 474
Primitive function theorem, 268
Principal branch, 84
Principal value, of integral,t43
Principle, uncertainty, 328
Probability integral,5l2
Product
inner and ofier,444
of Levi-Civita symbols, 16
Product rule for covariant derivatives,
458,464
Product theorem for determinants, 46
Projectile with air resistance,2l6
Proofby induction, 559
Pseudo-tensor,445
Pseudo-vector, 15,445
Pure recursion relation for Legendre
polynomials, 377
Pythagoreantheorem,2,3,4
Pole expansion, 167
Polygon
83
symmetry of,466
Polynomials
classical orthogonal,425
in complex plane,
Qo@),193,373
QtG),182,213
Quadratic form, 58
Quaternions,49l
Quotient theorem,444
Hermite, 187, 428, 432, 433
Laguene,428
Legendre,369
Position vectors, 23
angle between, 390
in polar coordinates, 33, 66
Positive definite matrix, 64
Potential
axisymmetric, 373
complex, 113
Radiation due to accelerating charge, 341
Radius ofconvergence, l05, 184, 190
Radon, 354
Raising indices,453
Range, ofprojectile with air resistance, 216
Rank, of tensor, 439
Ratio test, 103
Rationals, T5
group of,467
599
Recursion relations
for associated Legendre functions, 390,431
for Bessel functions, 405
for Legendre polynomials, 377 -3'1 8
for modified Bessel functions,474
for power series, 185
Reducibility, of group representations, 477
Reflection
ofcoordinate axes, 8, 15,66,556
in real axis, 81, 163
Reflection matrices, 8, 66, 483
Regular point of a differential equation, 78
solution about, 184
Regular representation, 476
Regular singular point, 190
Relativity, special, l, 5, 462, 463
Remainder,24l
Removable singularity, 120
Repeated roots, 17 6, 297
of indicial equation, l9O,2l4
Representations
dimensions of, 475
equivalent,4T6
faithful, 475
of group, 475
regular,476
trivial,4T'l
Residue, 122
methods for finding, 124-126
Residue theorem, 123
use of, 127-150, 269-27 3
Resonance, 236, 282, 502
Response function, 266
Right-hand rule, 14
Right-handed coordinates, l, 29
Ring of charge, potential due to, 520
Rodrigues formula, 379
generalized,425
Rodrigues-type formula
for Bessel functions, 433
for Hermite polynomials, 433
for spherical Bessel functions, 437
Root test, 103
Roots,82, 136
Rotation matrix, 7 -9, 467, 484
Rotations
in complex plane, 81
of coordinate axes, 6, 395,467, 556
group of, 467,484
repeated,
l0
Rounding error, 199
Row vector,4l
Runge-Kutta method, 202
Saddle point, 532
Scalar field, 18
Scalar product, 12, 40, 452
Scalars, 5, 12,31, 439
Schrodinger eqtation, 215, 438, 553
Schwarz-Christoffel transformation, 1 55
Schwarz reflection principle, 163
Second derivatives, ofvectors, 27
Seismic waves, 552
Self-adjoint
oper ator,
3
62
Self-conjugate subgroup, 472
Semicircle at infinity, 134, 149
Separation constants, 209
for Helmholtz equation, 420
for Laplace's equation, 365, 368,
37 O,
398, 41 I
for waves on string, 238
Separation of variables, 208
and Green's functions, 509
and Helmholtz equation, 420
and Laplace's equation, 363, 367, 398
and waves on string, 237
Sequences, 101
delra, 288
of disributions,3l0
of partial sums, 103
radioactive,2TS
Series, 102, 184, 356
allied, 563
complex, 105, 106, 108
complex Fourier, 219, 225
convergence of, 102, 105
of distributions, 3 I I
Fourier, 219
Fourier-Legendre,382
geometric, 105
Laurent,108, 187
for tangent function, 580, 583
for log tunction, ll9, 193,376
power, 105-l I I, 184-196
for tangent function, 576
Taylor,106, 184
Series solutions, of differential equations, I 84
Shearing force,172
I
I
I
I
600
INDEX
Shifting property
of Fourier transform, 331
oflaplace transform, 254
Si (.r), 268
Sifting property,288,361
Similarity transformation, 53
Simple harmonic motion, 165,l7l
Simple pole, 121
Simple zero, 119,126
Simply connected region, 98
Simply supported beam, 258-261
diagonalization,64
function
Laplace transform of, 253
series for, 185
Sine integral, 268,564
Sine rule, 67
Sine series, 228,232
Sine transform,345, 510
of a derivative, 346
Sines
ofcomplex numbers, T9-80
sum of, 221
Singular matrix, 46
Singular point
of differential equation, 178
solution about, 187
variability of, 179
regular, 190
Singularities, 119-122
essential, 120
of gammafunction, 159
isolated, I 19
of Laplace transform, 269
removable, 120
Skew coordinate system, 66
Small oscillations, 57
Smooth function, 240
Smudging theorem, 307
Snell's law,543
5C(2),467
Solenoidal vector, 19, 70
Solid angle, 70
Span, 39
Specific heat, l'73
Spherical Bessel functions, 421
integrals of, 425
orthogonality of ,424
series representation of, 422
Simultaneous
Sine
Spherical coordinates,4,28,298,
Green's function in, 516
Laplace's equation in, 367
wave equation in, 419
3
14, 3 15-3 16
Spherical harmonics, 388-389
addition theorem for,392
sum rule for,392
table of, 388
Spheroidal coordinates, 29, 160,430
Spreadsheet, 201,204,205,206,207
Square deviation,24l,352
Square wave,223,234
Laplace transform oi 264
Standard form, of differential equation, 178
Star, structure of, 216
Stark effect, 214
Stationary phase, method of, 535
Steepest descent, method of, 531
Step function
eve\229
Fourier series for, 224,226,228,230,563
odd,228
and shifting property, 255
and step distribution, 300
Step size, in numerical solution, 204
Stirling's approximation,159,53'l
Stokes' theorem,24
Streamlines, 114,116
Structure constants,4S8
ofSO(3),489
Sturm-Liouville differential equation,
357,499,553
Sturm-Liouville problem, 357
Sturm-Liouville theory, general method for
use of,210, 363
SU(n),469
Subgroup, 471
Submatrix,45
Subspace,
invaiant,476
Sumrule for spherical harmonics,392
Sumrnation convention, 8
Superposition, principle of, 219
Surface spanning ctwe,24,98
Symmetric matix, 42,56,60,64
Symmetry
and degeneracy, 362
of Fourier transform, 324
of Green's function, 495,496,514
and groups, 465, 466, 483
INDEX
function,576
106
Tangent
Taylor series, 58,
and approximation ofintegrals, 532,
for cross product, 556
for tensors,44l
535
Transient, 268
anddifferentialequations,lTS,lS4,lST Translation,l52,492
200,202
579
Tchebichef polynomials,428
Technetium,28l
Temperature,inrod,5lO
Tensor density, 445
Tensors, 17
Cartesian, 439
and numerical solutions,
for tangent function,
ofmatrix, 8, 44
Triangle function, 238,312
Trigonometric function, integral of
one period, 129-134,220-221
Transpose,
alongrealaxis, I37
Triple scalar product, 15, 29, 5l
Triple vector product, 17
Trivial represeniation, 475,482,484
contravariant and covariant, 448
general,441
metric,45l
Terminal velocity, 176
Testfunction,305,313,329
Tests for convergence, 103
Theorem
addition, for spherical harmonics, 390
Cauchy, 97
divergerce,22
Uncertainty principle, 328
Uniform convergence, 104, 106, 140
Unit circle, 84
integral around,l29
Unit vectors, 3, 29,30, 40, 453
Unitary group,469
Unitary matrix, 4E,469,486
Upper and lower indices, 448
Uranium,438
Green's, 26,512
Helmholtz,7,556
Liouville's,167
560
function,268
product, for determinants, 46
quotient,444
smudging, 307
Stokes', 24
Thermalconductivity, lT3
Torque, 15, l7 , 259
on beam, 171
Trace, of matrix,44,71,480
Transcendental equation, 550
Transform
choice of, 347
Fouier,323
Fourier-Bessel,4l8
of integral, 268
Laplace,25l
Transform method, for Green's function,
503,510
Transformation
mean value, for integrals,
primitive
conformal, 150
congruent, 58
orthogonal, 55
Transformation law
Variables, separation of,208
Variation
constrained, 548
of integral,54l
of parameters, 1 80
Vector area, 15, 67
Vector field, 17, 19
Vector potential,208,554
Vectorproduct, 14
Vector space, 37 , 47 5
Vectors, 5, 38
basis, 39, 53,220, 453,476
column,4l
component of, 5, 39,4l
definition of, l0
row,4l
sum
oi
5,
38,4l
Eansformation law for, 7,9, 441
wit,29
Velocity potential, 112
Yiolin,246
Volume, of parallelepiped, 15
Water molecule, 492
Wave equation
in cylindrical coordinates,406
601
602
INDEX
Wave equation
(c ont inue
d)
for finite sting,237
and Fourier transforms, 336,356
Green's function for, 528
as hyperbolic equation, 575
for infinite string, 336
in one dimension, 173
in spherical coordinates, 419,438
Wave front, 576
Wave number, 328
Wave packet, 328
Wave vector, 85
sound, in cavity, 438
on string, 237, 302, 501, 553, 554
in three dimensions, 356,419421
Weak convergence, 305
Weber's equation, 215
Weighting function, 358, 425
Weirstrass
M
test, 105
Wien law,215
Wronskian, 179
Y6,388
Young's modulus,
17
l,
435
Wavelength, 85
Waves, 173
and complex variables, 85
electromagnetic, 85-89, 34 I
on infinite string, 336-339
plane, 85,406
seismic, 552
Zero distribution, 308, 337
Zeromatix,46
Zero vector,38,41
Zeros, ll9
of Bessel functions, 410,522
Gradient
i<Ort>:Oirl'+,LiA
ira. Bt : G. i)6+ r$. ila+d x 1i'
+B x
B)
d
x dt
Divergence
i
: di .t + tn.ito
i. 1a " B; :8. (i x d) -i. fi
Curl
rotl
x 6t
9x(dn):id*i+d(ixi)
i x (d " f,; : ad .Bt + r[.ila - B(i' at - fd.vt6
Second derivatives
v2i:ifi.nl-i"(ixi)
ix1i4;:s
i'.ti'xd):g
Divergence theorem
Standard form:
Variants:
Stokes'theorem
Standard form:
lrr7 "
Variants:
i)
.fit dA
:
r-ro (v0\ dA:
/, "
frt,
h0
ai
dr
I
f tff x V) x fr oo:
;/
trdrxit
eoiico,tsr$
#NMFR;
ITIONIREA?}IBIilRIE$
rdxBtxi=B(i.i)-dd i)
ti ,. 6) .ri " i) : (d.06.dl - ra.al<[,tt
(i x B) x G x dy :
6ta,
i,dt -
dtf,, a, dl
Cylindrical coordinates
- - ao + lao" -t- ao
p0Q' -2
0p' --d
0z
-o
V.i: pta@fp)
p0Q*of,
0p *lafa
0z
VO
Vxf= /t0f,
\p ad
*)u. (*
v2o:
- #)o * i lftr,ra - W),
i#Q*\*i*4.#
Spherical coordinates
ao lao^
I ao^
vo-_i+__0+__6
0r r00 rsin90Q'
I o(sind/6), I afo
;,;_
v'r-7 lav2fr),
-rti"e
a,
ao -rti"eaO
vx
r:
t /a sino[o
^r"B
(m
t/a
+ ;\*rfe
"
v'o
a \ t/ r
- uok )i * ; (*" O' - *,o)u
a \^
- *f,)+
| a / ao\
l a20
: 71aa, /"ao\
* ;rr'r** (ttntm + ;;rR eW
a,
\'" )
)
The first term rnay be written in the alternative forms
r a2
a2e+-,. zaa
--(rO):
r 'drz '
0rz
r 0r
I
a
Download