An introduction to non-smooth analysis and geometry

advertisement
Departments of Mathematics
Montana State University
Fall 2015
Prof. Kevin Wildrick
An introduction to non-smooth analysis and geometry
Lecture 8: An introduction to Cheeger Differentiability
1. An interpretation of differentiability
Let f : Rn → R be a function, and let z0 ∈ Rn . We say that f is differentiable at z0 if
there is a linear function Lz0 : Rn → R such that
lim
z→z0
|f (z) − f (z0 ) − Lz0 (z − z0 )|
= 0,
||z − z0 ||Rn
which we write in shorthand as
f (z) − f (z0 ) = Lz0 (z − z0 ) + o(||z − z0 ||) as z → z0 ,
and say “f is differentiable at z0 if there is a linear function approximating it to first order
at z0 .”
Let {e1 , . . . , en } be the standard basis of Rn . Consider the coordinate functions xj : Rn →
R defined by
xj (w) := hw, ej i
so that
w=
n
X
xj (w)ej .
j=1
Clearly, these functions, and linear combinations thereof, are linear function.
Using these functions, for any linear mapping L we may “express it in the standard
basis” as:
n
X
L(w) =
L(ej )xj (w).
j=1
Note that each L(ej ) is just a real number. Hence, we have characterized linear mappings
as linear combinations of the coordinate functions.
In this way, we can say that f is differentiable at z0 if there is a linear combination of
the coordinate functions approximating it to first order at z0 .
Moreover, we note that the coordinate functions are 1-Lipschitz. This leads us to
the first big idea: we can in fact characterize differentiability in Rn almost everywhere,
without using any thing more than the metric on Rn , by replacing the concept of coordinate
functions with the concept of Lipschitz functions!
Consider the following:
Definition 1.1 (Toy definition). Fix Lipschitz functions φ1 , . . . , φn : Rn → R. We say
that a function f : Rn → R is Cheeger differentiable at z0 ∈ Rn with respect to (φ1 , . . . , φn )
if there are constants c1 , . . . , cn ∈ R such that
!
n
X
f (z) − f (z0 ) =
cj (φj (z) − φj (z0 )) + o(||z − z0 ||) as z → z0 .
j=1
1
Definition 1.2 (Toy definition, metric version). Let (X, d, µ) be a metric measure space,
and let φ1 , . . . , φn : X → R be Lipschitz functions. We say that a function f : X → R
is Cheeger differentiable at z0 ∈ X with respect to (φ1 , . . . , φn ) if there are constants
c1 , . . . , cn ∈ R such that
!
n
X
f (z) − f (z0 ) =
cj (φj (z) − φj (z0 )) + o(d(z, z0 )) as z → z0 .
j=1
We have just shown above that in Euclidean space and for the choice φj = xj , j =
1, . . . , n, Cheeger differentiability and classical differentiability coincide.
Now, let φ1 , . . . , φn be arbitrary Lipschitz functions, and suppose that f : Rn → R
is Cheeger differentiable with respect to these functions at almost every z0 ∈ Rn . By
Rademacher’s theorem, all the functions c1 φ1 , . . . , cn φn are differentiable at z0 for almost
every z0 ∈ Rn , and hence, for almost every z0 ∈ Rn .
!
!
n
n
X
X
cj (φj (z) − φj (z0 )) =
cj ∇φj (z0 )(z − z0 ) + o(||z − z0 ||) as z → z0 ,
j=1
j=1
Hence we also have
f (z) − f (z0 ) =
n
X
!
cj ∇φj (z0 )(z − z0 )
+ o(||z − z0 ||) as z → z0 ,
j=1
Note that
w 7→
n
X
!
cj ∇φj (z0 )(w)
j=1
is a linear function on Rn , and so we see that f is classically differentiable almost everywhere as well.
Now, if we choose the functions φ1 , . . . , φn poorly, it may not be the case that a clasically
differentiable function is Cheeger differentiable. For example, suppose that φ1 = x1 and
φj = 0 for j = 2, . . . , n. Then the function f (x) = hx, e2 i is not Cheeger differentiable
with respect to these functions.
This leads us to the conclude that in order for Cheeger differentiability to be a reasonable concept, the choice of Lipschitz “coordinates” has to reflect the structure of the
space they are defined on.
This leads us to a deep question:
Question 1.3. Let (X, d, µ) be a metric measure space. Are there Lipschitz functions
φ1 , . . . , φn : X → R so that every Lipschitz function f : X → R is Cheeger differentiable
almost everywhere with respect to (φ1 , . . . , φn )?
2. Measurable Differentiable Structures
A metric measure space might be look rather different in different parts of the space.
For example, we might take two spaces and glue them together along a nice subset. Our
intuition says that apart from the set where we glued, the “almost everywhere differentiability” of Lipschitz functions on such a space should not be ruined under this gluing.
However, the structure of the two spaces might be very different. We are led to the
following definition.
Definition 2.1 (Cheeger). A measurable differentiable structure on a metric measure
space (X, d, µ) is a countable collection {(Xα , φα )} of pairs, called coordinate charts,
where
• : each Xα is a measurable subset of (X, d, µ) with µ(Xα ) > 0, and
!
[
µ X\ Xα ) = 0.
α
• for each α, there is a number N (α) ∈ N, called the dimension of the coordinate
chart, so that φα : X → RN (α) is a Lipschitz function; in other words, each φα is
N (α)
a tuple (φ1α , . . . , φα ) of Lipschitz functions from X to R,
• for each α, the coordinate function φα spans the differentials on Xα , i.e., for every
Lipschitz function f : X → R, there is a measurable function df α : Xα → RN (α)
so that for µ-almost every z0 ∈ Xα ,
(2.1)
f (z) − f (z0 ) = df α (z0 ) · (φα (z) − φα (z0 )) + o(d(z, z0 )) as z → z0 ,
• if g : Xα → RN (α) is another measurable function satisfying (2.1) for µ-almost
every z0 ∈ Xα , then g = df α , µ-almost everywhere,
• the dimensions N (α) are bounded above independently of α.
The dimension of the measurable differentiable structure is defined to be the quantity
max N (α).
α
The main goal of this course will be to find a class of metric measure spaces so that
(1) each space in the class posses a measurable differentiable structure,
(2) the class contains many classical smooth spaces,
(3) the class is stable under appropriate limits,
We can begin by giving some examples of spaces that posses a measurable differentiable
structure.
First, and of course most importantly, is Euclidean space Rn . Here we may choose
(Rn , id : Rn → Rn ) as the single coordinate chart. Then the condition that id span the
differentials is exactly the assertion of Rademacher’s theorem.
Next, it’s a simple definition chase to show that if (M, dg , µ) is a Riemannian manifold
equipped with its volume measure, then any covering collection of smooth charts also
forms a measurable differentiable structure. Again we will need to apply Rademacher’s
theorem, in the charts.
Now, suppose that E ⊆ Rn is any subset of positive Ln -measure. Then (E, || − ||Rn , Ln )
again supports a measurable differentiable structure simply by restriction. There is, of
course one small detail to check (you may have missed this in the previous example as
well) - uniqueness of the differential df α . This is an excellent exercise (hint:use density
points and Lipschitzness).
We can also glue structures together. Suppose (X, d, µ) and (Y, ρ, ν) are metric measure
spaces with measurable differentiable structures {(Xα , φα )} and {(Yβ , ψβ )}. Let E ⊆ X
and F ⊆`Y be subsets such that there is an isometry g : E → F , and consider any metric
d0 on X Y /g with the property that d0 |X×X = d and d0Y ×Y = ρ. Then
{(Xα , φα )} ∪ {(Yβ , ψβ )}
`
is a measurable differentiable structure on X Y /g.
3. The Cantor set has no measurable differentiable structure
It’s easy to see that an infinite dimensional normed vector space cannot have a measurable differentiable structure. However, it’s a more subtle to construct a doubling metric
space that has no measurable differentiable structure. To do so, we’ll use the following
convenient terminology:
Definition 3.1. Let (X, d, µ) be a metric measure space and let z0 ∈ X. We say that a
collection of functions {f1 , . . . , fn : X → R} is dependent to first order at z0 if there is a
vector (λ1 , . . . , λn ) ∈ Rn \{0} such that
n
X
λi (fi (z) − fi (z0 )) = o(d(z, z0 )) as z → z0 in X.
i=1
Definition 3.2. Let (X, d, µ) be a metric measure space. We say that the Lipschitz
differentials in X have dimension at most N if every collection of (N + 1) Lipschitz
functions is dependent to first order at µ-almost every point of X.
Proposition 3.3. If (X, d, µ) supports a measurable differentiable structure with with
dimension N , then the Lipschitz differentials in X have dimension at most N .
Proof. Let f1 , . . . , fN +1 : X → R be an (N + 1)-tuple of Lipschitz functions on X. Let
(Xα , φα ) be a chart, and consider that for any z0 ∈ Xα the vectors df1α (z0 ), . . . , df αN+1 (z0 ) ∈
RN(α) are necessarily linearly dependent. Hence there is λ ∈ RN +1 \0 such that
N
+1
X
λi dfiα (z0 ) = 0.
i=1
Moreover, at µ-almost every point z0 ∈ Xα , for every i = 1, . . . , N + 1,
fi (z) − fi (z0 ) = dfiα (z0 ) · (φα (z) − φα (z0 )) + o(d(z, z0 )) as z → z0 .
Hence
N
+1
X
λi (fi (z) − fi (z0 )) =
i=1
N
+1
X
λi (dfiα (z0 ) · (φα (z) − φα (z0 )) + o(d(z, z0 )))
i=1
=
N
+1
X
!
λi dfiα (z0 ) · (φα (z) − φα (z0 ))
+ o(d(z, z0 ))
i=1
= o(d(z, z0 ).
Since this is true at almost every point of an arbitrary chart, it is true almost everywhere.
In fact, the converse of Proposition 3.3 is also true, as we will later show.
We will use Proposition 3.3 to show that the standard 1/3-Cantor set does not support
a measurable differentiable structure.
Recall that the 1/3-Cantor set is constructed as follows. Let S denote the collection
of all finite or countably infinite sequences with entries in {0, 1}. Denote the length of a
sequence σ ∈ S by |σ|. We declare that the empty sequence has length 0.
Define I∅ = [0, 1]. Now, suppose that Iσ has been defined for all sequences σ ∈ S
with |σ| = k ∈ N. We then define Iσ,0 and Iσ,1 to be the left-most and right-most closed
subintervals of Iσ with length 1/3 of that of Iσ .
For each k ∈ N, let
[
Ck =
Iσ .
σ∈S,|σ|=k
The Cantor set C is defined by
C=
\
Ck .
k∈N
Further, for each number k ≥ 1, set
Uk = Ck−1 \Ck .
Then Uk is the union of 2
k−1
open intervals of length 3−k .
The Cantor set has a natual “uniformly distributed measure” µ. It can described in
many ways:
• Define a sequence of probability measures µk on [0, 1] that restrict to Lebesgue
measure on each Iσ with |σ| = k. These measures converge weakly to a measure
µ on [0, 1] that is supported on C,
• normalized log3 2-dimensional Hausdorff measure,
• consider C as the compact topological group of infinite binary sequences; then let
µ be the Haar measure on this group,
• consider the Cantor function c : C → [0, 1], a continuous surjection, and consider
a measure µ on C satisfying µ(E) = L(c(E)).
Of course, we endow C with the restriction of the Euclidean metric on R. Let’s check
that µ is a doubling metric measure space. Let x ∈ C, and let r > 0; without loss of
generality we may assume that r ≤ 1. Find k ∈ N such that 3−(k+1) < r ≤ 3−k . By
definition x ∈ Iσ for some σ ∈ S with |σ| = k + 1. Hence, B(x, r) ⊇ Iσ ∩ C and so
µ(B(x, r)) ≥ 2−k+1 ≥ rlog3 2 /2.
On the otherhand, the 3−k -neighborhood of Iσ in C consists of the parent I of Iσ ; hence
µ(B(x, r)) ≤ 2−k ≤ 2rlog3 2 .
Thus
µ(B(x, 2r)) ≤ 21+log3 2 rlog3 2 ≤ 22+log3 2 µ(B(x, r)).
Theorem 3.4. There is no N ∈ N so that the Lipschitz differentials in (C, | · |, µ) have
dimension at most N .
Proof. For an infinite sequence σ ∈ S denote the kth entry of σ by σ(k), and define
uσ : [0, 1] → [0, 1] by
X
uσ =
σ(k)χUk .
k∈N
Then define vσ : [0, 1] → [0, 1] by
Z
vσ (x) =
x
uσ (t) dL1 (t).
0
We may of course consider the restriction of vσ to C. Suppose that σ1 , . . . , σN are infinite
sequences in S, and suppose that the corresponding functions vσ1 , . . . , vσN are dependent
to first order at almost every point z0 ∈ C. Let z0 be such a point, and let (λ1 , . . . , λN ) ∈
RN \{0} be such that
N
X
λi (vσi (z) − vσi (z0 )) = o(|z − z0 |) as z → z0 in C.
i=1
We claim that
(3.1)
lim
k→∞
N
X
λi σi (k) = 0.
i=1
Towards this end, let > 0, and choose δ > 0 so that if |z − z0 | < δ and z ∈ C, then
N
X
λi (vσi (z) − vσi (z0 )) ≤ |z − z0 |.
i=1
Choose k0 to be the smallest natural number so that 2 · 3−k0 < δ. Let k ≥ k0 . Then
z0 ∈ Iσ for some σ ∈ S with |σ| = k. Let U be the interval in Uk adjacent to Iσ that is
further away from z0 . Denote the endpoints of U by y and y 0 where y 0 is between y and
z0 . Then,
N
N
N
X
X
X
0 0 λi (vσi (z0 ) − vσi (y ))
λi (vσi (y) − vσi (z0 )) + λi (vσi (y) − vσi (y )) ≤ i=1
i=1
i=1
0
≤ (|y − z0 | + |y − z0 |)
≤ 33−k .
On the otherhand,
vσi (y) − vσi (y 0 ) = σi (k)3−k ,
so that
N
N
X
X
0 −k λi σi (k) ,
λi (vσi (y) − vσi (y )) = 3 i=1
i=1
showing that
N
X
λi σi (k) ≤ 3
i=1
and proving the claim.
It remains to show the following: for any N , there is a collection σ1 , . . . , σN of infinite
sequences in S so that for every vector (λ1 , . . . , λN ) ∈ RN \{0}, the limit (3.1) fails to
hold. Let
(
1 k = i mod N,
σi (k) =
0 k 6= i mod N.
Then
N
X
i=1
(
λi
λi σi (k) =
0
k=i
k 6= i
mod N,
mod N.
Since (λ1 , . . . , λN ) is not the zero vector, at least one of the limits
N
X
lim λi σi (ik) = lim |λi | =
6 0.
k→∞ k→∞
i=1
Download