18.303 Problem Set 1 Solutions Problem 1: 10+(5+10)+10 points

advertisement
18.303 Problem Set 1 Solutions
Problem 1: 10+(5+10)+10 points
(a) It has a solution if b is in C(AT A) = N (A)⊥ , and since N (A) is spanned by n this means
it has a solution if b ⊥ n, i.e. nT b = 0 . Given any solution x of AT Ax = b, we can get
another solution if we add anything in N (AT A) = N (A) to x. i.e. all solutions are given by
x + αn for any scalar α. (Notice that, for this problem, the left nullspace of A is irrelevant.)
(b) Suppose that A is a real 21 × 21 matrix with eigenvalues 1, 12 , 212 , 213 ,. . ., 2120 and corresponding
eigenvectors x0 , x1 , x2 , . . . , x20 , respectively (that is, the eigenvalues are λk = 2−k for k =
0, . . . , 20).
(i) Since it has 21 distinct eigenvalues, A must be diagonalizable—defective matrices only
occur when there are repeated roots of the characteristic polynomial. It is not necessarily
symmetric: symmetric matrices have real eigenvalues, but the converse is not necessarily
true. [For example, suppose C is a symmetric matrix; a non-symmetric matrix with
the same real eigenvalues is the similar matrix A = B −1 CB for any non-orthogonal
invertible matrix B.]
(ii) First, let us write the solution of the recurrence in a simpler form: y2 = Ay1 = A2 y0 ,
y3 = Ay2 = A3 y0 , . . ., yn = An y0 . (You can also prove this more formally by induction,
but the pattern is obvious.)
P
Since A is diagonalizable, we can write y0 in the P
basis of the eigenvectors: y0 = k ck xk
for some coefficients ck . Then yn = An y0 = k ck λnk xn , since An simply multiplies
each eigenvector by λn . However, all of the eigenvalues have |λ| < 1 except for λ0 = 1,
which means that λn → 0 for large n. Thus, for large n, everything except for the x0
term goes to zero, and yn ≈ c0 x0 . However, we can be more explicit than this because
A was given to be symmetric in this part, which means that the eigenvectors are orxT y0
x0 xT
xT y
thogonal and we can write ck = xTk xk0 . Hence, yn ≈ T0 x0 = T 0 y0 for large n,
k
x0 x0
x0 x0
where for aesthetic reasons I rewrote it in the form of a projection matrix x0 xT0 /xT0 x0 .
If you want, you could use a more accurate approximation, and write yn as the sum
of thehx0 term plus the
i lowest-order correction by including the next-biggest term x1 :
x xT
x xT
yn ≈ xT0 x00 + 21n x1T x11 y0 .
0
1
Note that you can not assume that the eigenvectors are normalized to length = 1,
i.e. you cannot assume here that xT0 x0 = kx0 k2 = 1. Although one can always choose
the eigenvectors to be scaled in this way, it is not automatic, and their length was not
specified in this problem.
(c) Suppose x is an eigenvector of A, i.e. Ax = λx. Take the dot product of both sides with
x , i.e. multiply both sides with x∗ = xT (noting that we cannot assume that x is real),
obtaining:
x∗ Ax
= λx∗ x = λkxk2
∗
=
(A∗ x) x
=
(−Ax)∗ x = (−λx)∗ x = −λ̄kxk2 ,
where in the third line we have used the fact that A is real-antisymmetric, so that A∗ = AT =
−A. Then, since by definition an eigenvector has x 6= 0 and thus kxk 6= 0, by comparing the
1
first and third lines we obtain λ = −λ̄ , and hence the eigenvalue is purely imaginary.
Problem 2: 10+5+10 points
(a) The eigenfunctions must still be sines, cosines, or exponentials, as noted in class, with the
boundary conditions determining which one. u(0) = 0 means that they must be sines. To get
u0 (L) = 0, they must reach a maximum or a minimum at L (a phase that is an odd multiple
of π/2), i.e. the eigenfunctions must be of the form:
1 πx
un (x) = sin n +
2 L
π 2
] . (Of course,
for integers n = 0, 1, 2, . . ., with corresponding eigenvalues λn = −[(n + 12 ) L
any constant multiple of this works as well, as well as n < 0, but those variations do not
give linearly independent solutions.) We can check that these are orthogonal, using the trig.
identity sin A sin B = [cos(A − B) − cos(A + B)]/2:
Z
L
un (x)um (x)dx
=
0
=
1
2
Z
L
πx πx i
cos (n − m)
− cos (n + m + 1)
dx
L
L
0

L 
πx sin
(n
+
m
+
1)
L  sin (n − m) πx
L
L 
−
2π
n−m
n+m+1
h
0
=
0
for n 6= m (and hence distinct eigenvalues λn 6= λm ), since the sine of any integer multiple of
π is zero.
(b) The are not a vector space, e.g. 2v(x) will not have the same boundary values.
(c) If we simply write v(x) = u(x) + 2 + x, for example, then u(0) = 0, u0 (L) = 0 implies v(0) = 2,
d2
v 0 (L) = 1; that is, q(x) = u(x) − v(x) = −2 − x. Plugging v = u − q into dx
2 v(x) = g(x)
d2
00
00
yields dx2 u(x) = f (x) with f (x) = g(x) + q (x) = g(x). That is, r(x) = q (x) = 0.
This is not the only possible answer, of course. For example, q(x) = −2 −
gives r(x) = q 00 (x) = −1/L.
x2
2L
also works, and
Problem 3: 10+15 points.
In 18.303, we will mostly solve PDEs in convenient arbitrary units where we don’t have any pesky
dimensionful constants flapping around the equations and annoying us. This represents no loss of
generality because, even if someone hands you a a PDE in SI units, you can always rescale things
to get back into dimensionless form.
∂
(a) If we rescale x̃ = x/X and t̃ = t/T , then by a simple 18.02 exercise ∂t
= T1 ∂∂t̃ (dt = T dt̃)
˜ 2 . Therefore, the Schrodinger equation in these new coordinates
and similarly ∇2 = X12 ∇
becomes:
~2 ˜ 2
~ ∂ ψ̃
−
∇ + V (x) ψ̃ = i
.
2
2mX
T ∂ t̃
We then obtain the desired equation
h
i
˜ 2 + V (X x̃) ψ̃ = i ∂ ψ̃
−∇
∂ t̃
2
√
by choosing X = ~/ 2m and T = ~.
This looks a bit strange because our x̃ and t̃ are now in “weird” units (not dimensionless). If we
wanted to “fix” this, we would divide through the whole equation by some typical energy scale
V0 . This will replace
V by the dimensionless Ṽ (x̃) = V (X x̃)/V0 , and in that case we would
√
obtain X = ~/ 2mV0 and T = ~/V0 , which have units of distance and time respectively so
that x̃ and t̃ become dimensionless quantities.
˜ 2 u = ∂ 2 2u . As
(b) As suggested, let’s begin by rewriting the u equation in “tilde” coordinates: ∇
∂ t̃
in the previous part, we want to relate these to the “real” coordinates by some rescaling:
x̃ = x/X and t̃ = t/T for some X and T . Similar to (or rather, the reverse of) the previous
T 2 ∂2u
part, this will give the transformed equation ∇2 u = X
2 ∂t2 in the (x, t) coordinates.
Now, what are X and T ? First, the X rescaling has to transform from x̃ on the unit circle to
x on the circle of radius R, so clearly X = R = 0.2 m. Second, by comparison of the u and h
equations, c = X/T and thus we must have T = X/c = R/c = 0.001333̄ s.1
Finally, in addition to rescaling the coordinates, u can be related to h by some overall scale
factor H, since this just corresponds to multiplying both sides of the wave equation by H:
u(x̃, ỹ, t̃) = h(Rx̃, Rỹ, Rt̃/c)/H = h(x, y, t)/H. To get H, we must compare the initial conditions: h(Rx̃, Rỹ, 0) = (0.1 m) sin[1 − (x̃2 + ỹ 2 )] versus u(x̃, ỹ, 0) = sin[1 − (x̃2 + ỹ 2 )]. Clearly,
H = 0.1 m. Thus:
h(x, y, t) = (0.1 m) u(x/R, y/R, ct/R) .
1 This
corresponds to a typical timescale of this system, which here is the time for waves to propagate from the
center to the edge of the drum.
3
Download