The Four-Vertex Theorem and its converse In memoriam Björn Dahlberg Dennis DeTurck, University of Pennsylvania with Herman Gluck, Dan Pomerleano, Shea Vick The Four Vertex Theorem says that a simple closed curve in the plane, other than a circle, must have at least four "vertices", that is, at least four points where the curvature has a local maximum or local minimum. There must be at least two local maxima, separated by two local minima. The Converse to the Four Vertex Theorem says that any continuous real-valued function on the circle which has at least two local maxima and two local minima is the curvature function of a simple closed curve in the plane. Preassign curvature ..... then find the curve That is, given such a function κ defined on the circle S1, there is an embedding α: S1 R2 whose image has curvature κ(t) at the point α(t) for all t ∊ S1 . History. In 1909, the Indian mathematician S. Mukhopadhyaya proved the Four Vertex Theorem for strictly convex curves in the plane. Syamadas Mukhopadhyaya (1866 - 1937) In 1912, the German mathematician Adolf Kneser proved it for all simple closed curves in the plane, not just the strictly convex ones. Adolf Kneser (1862 – 1930) In 1971, Herman Gluck proved the converse for strictly positive preassigned curvature, as a special case of a general result about the existence of embeddings of Sn into Rn+1 with strictly positive preassigned Gaussian curvature. In 1997, the Swedish mathematician Björn Dahlberg proved the full converse to the Four Vertex Theorem without the restriction that the curvature be strictly positive. Björn E. J. Dahlberg (1949 – 1998) The work of Dahlberg completes the almost hundred year long thread of ideas begun by Mukhopadhyaya. I. Why is the Four Vertex Theorem true? A counter-example would have a nonconstant curvature function with one max and one min, and be weakly monotonic on the two arcs between them. Let's try to build such a counter-example from a few arcs of circles. Start with the largest circle at the bottom, giving the minimum curvature there. Then go part way up to the right, switch to a smaller circle as shown, go up a little more and switch to a yet smaller circle, and go to its top. As we increase curvature by switching to smaller circles, their vertical diameters move to the right. Hence the top point lies to the right of the vertical diameter of the original circle. When we try the same construction, but this time move up to the left from the bottom of the original circle, the top of the third circular arc will lie to the left of the vertical diameter of the original circle. Thus the convex curve that we are trying to build does not close up at the top...and so does not exist. If we permit one self-intersection, then it's easy to get just one max and one min for the curvature. r = — 1 — 2 sin θ Recommended reading. Robert Osserman "The Four-or-More Vertex Theorem" American Mathematical Monthly Vol. 92, No. 5 (May 1985), 332 - 337. II. The Converse to the Four Vertex Theorem for strictly positive curvature. The Generalized Minkowski Problem. Does there exist an embedding of the n-sphere Sn into Euclidean space Rn+1 whose curvature has been preassigned as a strictly positive continuous function? In the early 1970's, Herman Gluck proved the following result. GENERALIZED MINKOWSKI THEOREM. Let K: Sn R , for n ≥ 2 , be a continuous, strictly positive function. Then there exists an embedding φ: Sn Rn+1 onto a closed convex hypersurface whose Gaussian curvature at the point φ(p) is K(p) for all p ∊ Sn . When n = 1, the Four Vertex Theorem places a constraint on the curvature, so the above theorem says that by contrast, there are no constraints for n ≥ 2 . The Converse to the Four Vertex Theorem. The proof of the Generalized Minkowski Theorem, when rewritten for n = 1 , shows that the required embedding a exists if and only if the curvature is either a nonzero constant or else has at least two local maxima and two local minima. The spirit of the argument is shown in the following picture. What story does this picture tell? A snake searches unsuccessfully for its tail Each of the eight little figures shows a map of an interval into the plane, which begins at the origin, moves off in the positive x-direction, ends up pointing the same way, and is positively curved throughout. We imagine that the snake is following a concealed loop of instructions, from which the above pictures are samples. The error vector in each picture extends from the snake's tail to its head, and the pictures are arranged so that the one at location θ on the circle has an error vector pointing in the direction θ . The resulting loop of error vectors circles once around the origin. If the loop of instructions is contractible in the space of all instructions, then some instruction in that space will have a corresponding error vector which is zero, and hence will tell the snake how to close up. A specific example of the snake dance. The most elementary curvature function which can illustrate these ideas is a step-function, and the simplest case among these leads to curves built from four circular arcs, two cut from a circle of one size, alternating with two cut from a circle of another size, as shown below. Bicircle Since our curvatures are strictly positive, we can express the preassigned curvature as a function of the angle of inclination φ of the outward pointing normal vector. This has the advantage of making total curvature 2π automatic. Given a curvature function κ(φ) , there will then be a unique map of the interval [0, 2π] into the plane which realizes this curvature function, subject to the initial condition that it begin at the origin, and start moving in the positive x-direction. The figures on the left report the curvature κ as a step function of the normal angle φ with the heavier markings indicating the larger circle (hence the smaller curvature). The figures on the right show the resulting curve in the plane, built from arcs of these two circles. Preassign κ0(φ) Get this bicircle Preassign κ(φ) Get this curve Explanation of this picture. The coloring of the left circle reports the preassigned curvature stepfunction κ(φ) , while the corrresponding curves are shown on the right. Each curvature step function κ(φ) is a distortion, via a diffeomorphism of the circle, of the step function κ0(φ) which corresponds to the bicircle. The diffeomorphisms comprise a loop of diffeomorphisms, which is easily seen to be contractible in the space of all diffeomorphisms of the circle. The error vectors turn once around the origin as we go once around the loop of diffeomorphisms. It follows that there is some diffeomorphism whose corresponding error vector is zero, and so the corresponding curve closes up. Of course we knew this, since a bicircle closes up. But this argument is robust, and so will apply to curvature functions which are close to κ0(φ) . Analogy with standard topological proof of the Fundamental Theorem of Algebra. This is analogous to proving that the polynomial p0(z) = an zn has a zero by noting that it takes any circle in the complex plane to a curve which winds n times around the origin, even though we already know that p0(z) has a zero at the origin. Again the point is that this argument is robust, and will apply just as well to the polynomial p(z) = an zn + an—1 zn—1 + ... + a1 z + a0 , once we pay attention to a suitably large circle. Proving the converse to the Four Vertex Theorem for strictly positive curvature. Given a preassigned strictly positive curvature funtion κ: S1 R having at least two relative maxima and two relative minima, we first agree to parametrize our proposed curve by the inverse of the Gauss map. Then we find a diffeomorphism h: S1 S1 so that the curvature function κ ◦ h: S1 R is ε-close in measure to the two-valued step function of some bicircle, meaning that the function κ ◦ h is within ε of this step function on almost all of S1 , and violates this only on a subset of measure less than ε . The robustness of the winding number argument for the two-valued step function then implies that, for sufficiently small ε > 0 , there is another diffeomorphism h1 of the circle so that the curve built with curvature function κ ◦ h ◦ h1 closes up smoothly. Since the curvature is positive, this closed curve is convex, and since its normal vector rotates by just 2π as we go once around, the curve is simple. Reparametrizing this curve, it realizes the preassigned curvature function κ . III. Dahlberg's proof of the Full Converse to the Four Vertex Theorem. When asked to draw a simple closed curve in the plane with strictly positive curvature, and another one with mixed positive and negative curvature, a typical response might be... But in response to the second question, Dahlberg envisioned the following curve. Its four major subarcs are almost circular. They are connected by four small wiggly arcs, each of which has an almost constant tangent direction, but largely varying curvature, including negative curvature. This curve "marginalizes" its negative curvature and emphasizes its positive curvature, and from a distance looks like a bicircle. Dahlberg's Key Idea. You can construct such a curve with any preassigned curvature which has at least two local maxima and two local minima. You can use the winding number argument to get it to close up smoothly, and you can also make it C1 close to a fixed convex curve, which will imply that it is simple. Dahlberg's proof plan. (1) Choose as common domain the unit circle S1 , with arc length s as parameter. Since most of the curves will fail to close up, you can also think of the domain as the interval [0, 2π]. All the curves will have length 2π , and at the end can be scaled up or down to modify their curvature. (2) Given a preassigned curvature function κ: S1 R which is not identically zero, evaluate ∫02π κ(s) ds . If this is zero, precede κ by a preliminary diffeomorphism of S1 so as to make the integral nonzero. Then rescale κ by a constant c so that ∫02π c κ(s) ds = 2π . When we later modify this curvature by a diffeomorphism h: S1 S1 , we will rescale the new curvature κ ◦ h by a constant ch so that the total curvature is 2π: ∫02π ch κ ◦ h(s) ds = 2π . (3) Construct an arc-length parametrized curve ψh: [0, 2π] R2 with curvature ch κ ◦ h in the usual way, starting at the origin and heading to the right. In general, the curve ψh will not close up, and the gap is measured by the error vector I(h) = ψh(2π) — ψh(0) . If the error vector is zero, then the curve closes up, and does so smoothly, because its total curvature is 2π . (4) Use the winding number argument to solve the equation I(h) = 0 for the unkown diffeomorphism h: S1 S1, with the search for h restricted to a certain 2-cell D (yet to be defined) within Diff(S1) . Finding h will give us a smooth closed curve with curvature function ch κ ◦ h , which we then rescale to realize the curvature function κ ◦ h , and finally reparametrize to realize the curvature function κ . We can find such a curve arbitrarily C1-close to a bicircle, and conclude that it is simple, finishing the proof. Dahlberg – using arclength as the parameter Configuration space. A central role in the proof is played by curvature step-functions with just two values, and these correspond to curves built from arcs of two different size circles. To deal with this kind of data, let CS denote the configuration space of ordered 4-tuples (p1 , p2 , p3 , p4) of distinct points on the unit circle S1 , arranged in counterclockwise order, as shown here. The configuration space CS is homeomorphic to S1 R3 . The diffeomorphism group Diff(S1) acts on CS in the natural way: h(p1 , p2 , p3 , p4) = (h(p1) , h(p2) , h(p3) , h(p4)) . The core of configuration space. If the ordered 4-tuple of points (p1 , p2 , p3 , p4) guides the construction of a curve composed of arcs cut alternately from circles of two different sizes, then the curve will close up if and only if opposite arcs are equal in length, equivalently, if and only if p1 and p3 are antipodal, and also p2 and p4 are antipodal. These conditions are easily seen to be equivalent to the equation p1 — p 2 + p 3 — p4 = 0 in the complex plane. We will call the set of such points (p1 , p2 , p3 , p4) the core of the configuration space CS. This core is homeomorphic to S1 R1 . Reduced configuration space. To aid in visualization, we can reduce the dimension of the configuration space from four to three, without losing any essential information. Define the reduced configuration space RCS CS to be the subset where p1 = 1 . Then RCS is homeomorphic to R3 and we can use the group structure on S1 to express the homeomorphism S1 RCS CS by (eiθ, (1 , p , q , r)) (eiθ, eiθp , eiθq, eiθr) . Notice that this homeomorphism preserves cores, that is, 1—p+q—r = 0 iff eiθ — eiθp + eiθq — eiθr = 0 . Now change coordinates by writing p = e2πix , q = e2πiy and r = e2πiz . Then RCS ≅ {(x, y, z) : 0 < x < y < z < 1}. The reduced configuration space appears as an open solid tetrahedron The core of the reduced configuration space. A point (1, p, q, r) in RCS is in the core if and only if 1 and q are antipodal, and also p and r are antipodal. In the x, y, z coordinates for RCS , this means that 0 < x < y = ½ < z = x + ½ < 1. In the open tetrahedron in xyz-space which represents RCS , the core corresponds to the open line segment connecting the point (0, ½, ½) to the point (½, ½, 1) . Special Möbius transformations. Dahlberg's choice of 2-cell D Diff(S1) consists of thespecial Möbius transformations gβ(z) = (z — β) / (1 —β z) , where |β| < 1 and β is the complex conjugate of β . These special Möbius transformations are all isometries of the Poincaré disk model of the hyperbolic plane. The transformation g0 is the identity, and if β 0 , then gβ is a hyperbolic translation of the line through 0 and β which takes β to 0 and 0 to —β . The point β/|β| and its antipode —β/|β| on S1 (the circle at infinity) are the only fixed points of gβ on the unit disk. Mapping Dahlberg's 2-cell into configuration space. Start with Dahlberg's 2-cell D Diff(S1) , and map it into the configuration space CS by picking any point (p1 , p2 , p3 , p4) in the core of CS , and sending gβ (gβ(p1), gβ(p2), gβ(p3), gβ(p4)) . KEY PROPOSITION. The above map of D into CS is a smooth embedding which meets the core transversally at the point p , and nowhere else. We omit the proof, which makes essential use of the fact that the transformations gβ are isometries of the Poincaré disk model of the hyperbolic plane. Finding the image of Dahlberg's disk in the reduced configuration space. Take the embedding of Dahlberg's disk D into the configuration space CS , and then project to the reduced configuration space RCS . In this tetrahedral picture of the reduced configuration space RCS , the midpoint (1/4 , 1/2 , 3/4) of the core corresponds to the point (1 , i , —1 , —i) . Dahlberg's disk is mapped into RCS by gβ (1, gβ(1)—1gβ(i) , gβ(1)—1gβ(—1) , gβ(1)—1gβ(—i)) , and the image is then converted to xyz-coordinates. Completion of Dahlberg's proof. We start with a continuous, preassigned curvature function κ: S1 R which has at least two local maxima and two local minima, and must find an embedding φ: S1 R2 with curvature κ(t) at each point φ(t) . Changing the sign of κ if necessary, there are real numbers 0 < a < b and four points on S1 in counter-clockwise order where κ takes the values a , b , a , b in succession. The points 1 , i , —1 , —i divide the circle S1 into four equal arcs of length π/2 . Let κ0 by the step function which takes the values a , b , a , b on these arcs. The value of κ0 at the four division points is irrelevant. Given any ε > 0 , it is easy to find a diffeomorphism h: S1 S1 such that κ ◦ h is "ε-close in measure" to the step function κ0 . For notational simplicity, replace κ ◦ h by κ . Since κ is bounded, we can choose ε small enough so that the total curvatures of κ and κ0 are arbitrarily close. We then rescale both of these to achieve total curvature 2π, and they will again be ε-close in measure, for a new small ε . Suppose now that h = gβ is one of the diffeomorphisms selected from Dahlberg's disk D , and consider the curvature functions κ ◦ h and κ0 ◦ h . If β is close to 0 , then h is close to the identity, so κ ◦ h and κ0 ◦ h have total curvatures close to 2π . Hence they can be rescaled to have total curvatures exactly 2π : ∫02π ch κ ◦ h(σ) dσ = 2π = ∫02π c0h κ0 ◦ h (σ) dσ . Then in the usual way we construct curves φh and φ0h : [0, 2π] R2 , with these curvatures, and consider their error vectors I(h) = φh(2π) — φh(0) and I0(h) = φ0h(2π) — φ0h(0) . The finale. Let the diffeomorphism h = gβ circle around the identity in Dahlberg's disk D , keeping |β| small and fixed. Since the image of the disk D is transverse to the core, the error vector I0(h) will circle once around the origin in the complex plane. If the curvature function κ is sufficiently close in measure to the step function κ0 , and |β| is sufficiently small, then the error vector I(h) will also circle once around the origin in the complex plane. Hence there must be a diffeomorphism h close to the identity in the Dahlberg disk, for which the error vector I(h) = 0 . This means that the corresponding curve φh closes up, and since the total curvature is 2π , it does so smoothly. For ε and |β| sufficiently small, this curve φh is C1-close to the bicircle corresponding to the original step function κ0 , and hence must itself be simple. The simple closed curve φh realizes the curvature function ch κ ◦ h , rescaling it realizes the curvature function κ ◦ h , and reparametrizing this realizes the curvature function κ . This completes Dahlberg's proof of the Converse to the Four Vertex Theorem.