Hadwiger’s Theorem, Part 1
Posted by Tom Leinster
Hugo Hadwiger was a Swiss mathematician active in the middle part of the 20th century. An old-fashioned encyclopaedia might follow his name with something like “(fl. 1930–1970)”. Here “fl.” means “flourished”. I love that usage. (Are you flourishing right now?) Here’s a photo of Hadwiger, flourishing:
Hadwiger proved—among other things—a really fundamental theorem on the geometry of Euclidean space. Simon and I have both mentioned it on this blog before, but it’s so beautiful that it deserves its own space.
It’s all about ways of measuring the size of subsets of . For example, perimeter and area are both ways of measuring the size of (sufficiently nice) subsets of . Hadwiger’s theorem neatly classifies all such measures of size.
This is the first of a pair of posts. In this one, I’ll take my time explaining Hadwiger’s original theorem. In the next one, I’ll tell you something new.
Hadwiger’s theorem says something like this:
The only measures of size for subsets of are the following.
So there are three things I need to explain.
First, what’s a “measure of size”?
Second, what’s meant by “the following”? In other words, what measures of size on are there?
Third, why are these the only ones? In other words, how do you prove Hadwiger’s theorem?
What’s a “measure of size”?
I’ll begin by pointing out the major limitation of Hadwiger’s theorem, and probably the reason why it’s not better known: it concerns only measures of the size of convex sets. People have tried to extend the result to more general classes of space: see for instance Simon’s post on the Weyl tube formula. Nevertheless, the class of convex sets is not to be sniffed at: it includes sets with highly irregular, non-smooth boundaries, for instance.
Moreover, Hadwiger’s theorem can easily be extended to cover polyconvex sets, that is, finite unions of convex sets. Any figure can be represented to arbitrarily high accuracy by a polyconvex set. For example, the text you’re reading now is a finite union of square or round pixels, and therefore polyconvex.
I’ll stick to the convex setting, though. And by “convex set” I’ll always mean a compact convex subset of . Here is a natural number that I’ll regard as fixed.
A “measure of size” will be a map
satisfying three properties.
1. It’s a valuation. This means that and that satisfies the inclusion-exclusion principle:
whenever and are convex sets with convex. (Then is automatically convex.)
2. It’s invariant. More exactly, it’s invariant with respect to isometries: for any element of the isometry group of , and any convex set , we have .
I want to emphasize that is being treated here as a metric space only (with the Euclidean metric). These are not necessarily linear isometries. So invariance means invariance with respect to not only rotations and reflections, but also translations.
3. It’s continuous. More exactly, it’s continuous with respect to the Hausdorff metric.
What’s the Hausdorff metric? Well, given a metric space (here, ), the Hausdorff metric is a metric on the set of nonempty compact subsets of . For and , write for the set of points within distance of some point of . Then the Hausdorff distance between and is the least such that and .
So, the informal phrase “measure of size” really means “continuous invariant valuation on convex sets”.
Let’s have some examples.
- -dimensional Lebesgue measure is a continuous invariant valuation on convex subsets of , for any .
- Euler characteristic is a continuous invariant valuation on convex subsets of , for any . Since we’re dealing only with convex sets, it’s extremely simple: if is nonempty, and if is empty. (You might think that “Euler characteristic” is a pretentious name for what is basically the function with constant value . But if you venture outside the world of convex sets—to polyconvex sets, for instance—you’ll see that it really does deserve the name.)
- Perimeter is a continuous invariant valuation on convex subsets of . It’s worth spending a moment drawing a mental picture to convince yourself that perimeter really is a valuation.
- Similarly, surface area is a continuous invariant valuation on convex subsets of .
Actually, I don’t think it’s obvious how surface area, or even perimeter, should be defined in the full generality of convex, not necessarily smooth, sets. We’ll see how later. But for the moment, it should be intuitively plausible that they’re examples of continuous invariant valuations.
The examples listed have something in common: they’re homogeneous. A valuation is homogeneous of degree if
for all convex and all , where . For example, Euler characteristic is homogeneous of degree .
It’s clear from the definitions that any real linear combination of continuous invariant valuations is again a continuous invariant valuation. For example,
is a perfectly good continuous invariant valuation on convex subsets of . It might seem strange, because whereas area is naturally measured in metres squared and perimeter in metres, this valuation has no natural units of measurement. But it’s a valuation, all the same.
I’ll write for the real vector space of continuous invariant valuations on convex subsets of .
What measures of size are there?
Hadwiger’s theorem will describe the vector space . It will do more than just describing it up to isomorphism, that is, specifying . But it will in particular tell us .
Let’s try to guess the sequence
by working out the first few entries.
For , we’re looking at valuations on convex subsets of . Well, is a singleton, so has just two convex subsets: the empty set and itself. A valuation is obliged to take value on the empty set, so we have just one degree of freedom: its value on the subset . This can be any real number. Thus, and .
For , we’ve already seen two different valuations: Euler characteristic and Lebesgue measure (length). One is homogeneous of degree , and the other is homogeneous of degree . It follows that they’re linearly independent, and so . Now, the only convex subsets of are the empty set and compact intervals, and from that you can convince yourself that a continuous invariant valuation is determined by its values on and . So , giving .
For , we’ve seen three different valuations: Euler characteristic, perimeter, and 2-dimensional Lebesgue measure (area). They’re homogeneous of degrees , and , and therefore linearly independent. So . Things get harder now: although no other candidates suggest themselves, it’s not so obvious that these really are the only ones. Nonetheless, it’s true: every continuous invariant valuation on is a linear combination of the ones mentioned. So .
The sequence therefore begins
The obvious guess is that for all . Moreover, we’ve seen that for small values of , there’s a basis consisting of one homogeneous valuation of degree for each . So it’s tempting to conjecture that this pattern continues indefinitely.
It’s tempting—and true. But that’s not obvious.
In fact, we find ourselves in unfamiliar territory as soon as we move up to . We have a continuous invariant valuation that’s homogeneous of degree : Euler characteristic. We have one of degree : surface area. And we have one of degree : volume. But we’re missing one. How can we cook up a measure of size of convex subsets of that’s homogeneous of degree —that is, measured in metres?
Here’s how to do it. Let be a convex subset of .
- Choose at random a line through the origin.
- Take the orthogonal projection of onto , which is a line segment.
- Take the length of .
The expected value of that length is our -dimensional measure of . It’s called the mean width of . If you want to understand this stuff more deeply, you can take a moment to convince yourself that mean width really is a continuous invariant valuation—or you can just take my word for it.
That covers : we’ve constructed continuous invariant valuations of degrees , , and . What about general ?
Our mission is to construct continuous invariant valuations
where is homogeneous of degree . Really I should be writing “” instead of , because the valuations depend on as well as ; but for reasons I’ll explain later, it’s OK not to.
Now we just imitate the definition of mean width. There and ; here is any natural number and . Take a convex subset . For a -dimensional linear subspace , write for orthogonal projection onto . Informally, is
the expected value of for a random -dimensional linear subspace of
(up to a scale factor). Here denotes -dimensional Lebesgue measure on .
Formally, write for the space of all -dimensional linear subspaces of (“Gr” for Grassmannian). There is a natural action of the orthogonal group on , and it’s a fact that there’s a unique-up-to-scaling -invariant measure on . Then
where is a positive constant. I’ll come back to that constant in a moment.
Example This construction gives a 2-dimensional valuation on convex subsets of : it’s the expected area of the shadow cast on a random plane. But we’ve already seen another 2-dimensional valuation: surface area. Hadwiger’s theorem is going to tell us that these two measures must be the same up to a scalar factor.
You can work out what the factor is by considering the unit ball. The expected area of the shadow is , whereas its surface area is . So for any convex body in three-dimensional space, the expected area of the shadow cast is a quarter of its surface area. This is known as “Cauchy’s theorem”.
What about that constant ? Any choice gives us a continuous invariant valuation, but some choices are more convenient than others. Here’s the one I like best. It turns out that there’s a unique family of constants with the following two properties:
- For all , the valuation on is volume (Lebesgue measure).
- Let be a subset of , and let . Embed into in the usual way. Then the value of does not depend on whether is viewed as a subset of or of . (This is what makes it safe to write rather than .)
With this normalization, the valuation is called the th intrinsic volume, for reasons that I hope are self-explanatory.
Why are these the only measures?
It’s time to state Hadwiger’s theorem formally:
Theorem (Hadwiger) Let . Then the vector space has dimension , and the intrinsic volumes are a basis.
It’s a corollary that the only continuous invariant valuations that are homogeneous of degree are the scalar multiples of . (We implicitly used this corollary in the example with surface area and shadows.) That’s just because a linear combination of valuations that are homogeneous of different degrees can never be homogeneous, except in the most trivial way. So is a canonical basis, up to scaling.
What about the proof? Hadwiger’s own proof is said to have been extremely involved. Dan Klain simplified it enormously in the mid-1990s. His original paper is here, and you can find a nice account of it in the book Introduction to Geometric Probability by Klain and Gian-Carlo Rota. I gather that it was simplified further in a 2004 paper of Beifang Chen, but I haven’t seen that.
Here’s a very rough outline, based on my understanding of Klain’s proof. It’s easy to see that are linearly independent: just test against the -dimensional cube for each . The work is to show that they span : that is, every continuous invariant valuation is a linear combination of the intrinsic volumes.
What this boils down to is a theorem characterizing volume. Say that a valuation on is simple if it takes value zero on all sets of dimension . For example, , Lebesgue measure, is simple, as are all its scalar multiples. The volume characterization theorem states the converse: every simple continuous invariant valuation is a scalar multiple of . This must be true if Hadwiger’s theorem is to hold, and it’s not too hard to reduce Hadwiger’s theorem to this case.
To prove the volume characterization theorem, you have to get out the scissors and glue. Let’s try it for . Take a simple continuous invariant valuation, . We may assume that : for if not, subtract from a suitable scalar multiple of area. So, we know that takes value zero on both the square and all convex sets of dimension , and our task is to show that is zero on all convex sets.
Chop the square into four smaller squares of equal size. Using simplicity, invariance, and the valuation property, we find that . The same kind of cutting and pasting argument shows that whenever is a rectangle with rational side-lengths. By continuity, the same is true for all rectangles .
Now take a rectangle and cut it down the diagonal. Arguing in the same way, we deduce that whenever is a right-angled triangle. A couple more steps gives you the same thing for all triangles. It follows that for all convex polygons . And the space of convex polygons is dense in the space of convex subsets of , so by continuity, is identically zero.
Given that sketch of , you can see why it’s no mean feat to produce a graceful proof of the general case. I wouldn’t call Klain’s proof “graceful”, actually, but it’s pretty short.
Next time: What happens if you replace Euclidean space by another metric space?
Re: Hadwiger’s Theorem, Part 1
It seems clear that the scaling group should act on Val_n. It’s not clear to me (absent the theorem) why it acts semisimply, with integer eigenvalues.
That is, given a valuation v, we could let k = log_r v(r * unit cube), then let v’(X) = lim v(r*X) / r^k, and hope to show that v’ must be a multiple of V_k. But why must k be an integer (much less in 0..n)? Huh.