These notes synthesize material from L. Simon’s Introduction to Geometric Measure Theory, F. Morgan’s Geometric Measure Theory: A Beginner’s Guide, L.C. Evans and R.F. Gariepy’s Measure Theory and Fine Properties of Functions, and F. Maggi’s Sets of Finite Perimeter and Geometric Variational Problems, enriched with material from C. De Lellis’s lecture notes and T. De Pauw’s survey articles.
Chapter 1: Hausdorff Measure and Dimension
The classical Lebesgue measure in \(\mathbb{R}^n\) assigns an \(n\)-dimensional volume to measurable subsets of Euclidean space, but it is fundamentally an \(n\)-dimensional tool: it cannot distinguish the “size” of lower-dimensional objects living inside \(\mathbb{R}^n\). A smooth curve in \(\mathbb{R}^3\), for instance, has zero Lebesgue measure, yet clearly has a well-defined length. Felix Hausdorff’s groundbreaking 1918 construction remedied this deficiency by introducing a one-parameter family of outer measures \(\mathcal{H}^s\), indexed by a real parameter \(s \geq 0\), that can detect sets of any fractional “dimension.” This framework not only subsumes Lebesgue measure and classical notions of length, area, and volume, but also provides the correct language for measuring the size of fractal sets, rectifiable sets, and the singular sets that arise in the calculus of variations.
1.1 Outer Measures and Carathéodory’s Criterion
We begin by recalling the general framework of outer measures, which provides the foundation for all measure-theoretic constructions in geometric measure theory.
An
outer measure on \(\mathbb{R}^n\) is a function \(\mu^* : \mathcal{P}(\mathbb{R}^n) \to [0, \infty]\) satisfying:
- \(\mu^*(\varnothing) = 0\),
- (Monotonicity) If \(A \subseteq B\), then \(\mu^*(A) \leq \mu^*(B)\),
- (Countable subadditivity) For any sequence \(\{A_j\}_{j=1}^\infty\) of subsets of \(\mathbb{R}^n\),
\[
\mu^*\!\left(\bigcup_{j=1}^\infty A_j\right) \leq \sum_{j=1}^\infty \mu^*(A_j).
\]
Not every subset of \(\mathbb{R}^n\) need be “well-behaved” with respect to an outer measure. Carathéodory’s criterion identifies those sets for which the outer measure enjoys full additivity.
Let \(\mu^*\) be an outer measure on \(\mathbb{R}^n\). A set \(A \subseteq \mathbb{R}^n\) is said to be \(\mu^*\)-measurable (in the sense of Carathéodory) if for every set \(E \subseteq \mathbb{R}^n\),
\[
\mu^*(E) = \mu^*(E \cap A) + \mu^*(E \setminus A).
\]
Carathéodory's Theorem. Let \(\mu^*\) be an outer measure on \(\mathbb{R}^n\). Then the collection \(\mathcal{M}\) of all \(\mu^*\)-measurable sets forms a \(\sigma\)-algebra, and the restriction \(\mu = \mu^*|_{\mathcal{M}}\) is a complete measure.
A crucial refinement is that outer measures constructed from metric notions automatically render all Borel sets measurable.
An outer measure \(\mu^*\) on a metric space \((X, d)\) is called a metric outer measure (or a Borel regular outer measure) if
\[
\mu^*(A \cup B) = \mu^*(A) + \mu^*(B)
\]
whenever \(\operatorname{dist}(A, B) > 0\).
Carathéodory's Metric Criterion. If \(\mu^*\) is a metric outer measure on \((X, d)\), then every Borel set is \(\mu^*\)-measurable.
It suffices to show that every closed set \(C\) is \(\mu^*\)-measurable. Let \(E \subseteq X\) be arbitrary. Define \(C_k = \{x \in E \setminus C : \operatorname{dist}(x, C) > 1/k\}\). Since \(C\) is closed, \(E \setminus C = \bigcup_{k=1}^\infty C_k\). Moreover, \(\operatorname{dist}(E \cap C, C_k) \geq 1/k > 0\), so by the metric property,
\[
\mu^*(E \cap C) + \mu^*(C_k) = \mu^*((E \cap C) \cup C_k) \leq \mu^*(E).
\]
We set \(D_k = C_{k+1} \setminus C_k\). Then \(E \setminus C = C_1 \cup \bigcup_{k=1}^\infty D_k\), and since \(\operatorname{dist}(D_j, D_k) > 0\) when \(|j - k| \geq 2\), the metric property gives
\[
\mu^*(C_{2m}) \geq \sum_{k=1}^{m} \mu^*(D_{2k}), \quad \mu^*(C_{2m+1}) \geq \sum_{k=0}^{m} \mu^*(D_{2k+1}).
\]
If \(\sum \mu^*(D_k) < \infty\), then \(\mu^*(E \setminus C) \leq \mu^*(C_k) + \sum_{j=k}^\infty \mu^*(D_j) \to \mu^*(E \setminus C)\) as \(k \to \infty\), which yields the desired splitting. If the series diverges, then \(\mu^*(E) \geq \mu^*(C_{2m}) = \infty\), and the splitting holds trivially.
1.2 Hausdorff Measure
With the framework of outer measures in hand, we now define the central object of this chapter. The idea is elegantly simple: to measure the \(s\)-dimensional “size” of a set, cover it by sets of small diameter and sum the \(s\)-th powers of their diameters.
Let \(s \geq 0\) and \(\delta > 0\). For any subset \(A \subseteq \mathbb{R}^n\), define the \(\delta\)-approximate \(s\)-dimensional Hausdorff measure by
\[
\mathcal{H}^s_\delta(A) = \inf\left\{\sum_{j=1}^\infty \omega_s \left(\frac{\operatorname{diam}(C_j)}{2}\right)^s : A \subseteq \bigcup_{j=1}^\infty C_j,\; \operatorname{diam}(C_j) \leq \delta\right\},
\]
where \(\omega_s = \frac{\pi^{s/2}}{\Gamma(s/2 + 1)}\) is the volume of the unit ball in \(\mathbb{R}^s\) when \(s\) is a positive integer, and is defined by the same formula for all \(s \geq 0\). The \(s\)-dimensional Hausdorff measure is
\[
\mathcal{H}^s(A) = \lim_{\delta \to 0^+} \mathcal{H}^s_\delta(A) = \sup_{\delta > 0} \mathcal{H}^s_\delta(A).
\]
For each \(s \geq 0\), the set function \(\mathcal{H}^s\) is a Borel regular outer measure on \(\mathbb{R}^n\). In particular, all Borel sets (and all analytic sets) are \(\mathcal{H}^s\)-measurable.
The properties \(\mathcal{H}^s(\varnothing) = 0\), monotonicity, and countable subadditivity for each \(\mathcal{H}^s_\delta\) follow immediately from the definition. Since these properties are preserved under the supremum over \(\delta\), \(\mathcal{H}^s\) is an outer measure. To verify the metric property, suppose \(\operatorname{dist}(A, B) > 0\) and choose \(\delta < \operatorname{dist}(A, B)/2\). Then any covering of \(A \cup B\) by sets of diameter at most \(\delta\) partitions into disjoint subfamilies covering \(A\) and \(B\) respectively, so \(\mathcal{H}^s_\delta(A \cup B) = \mathcal{H}^s_\delta(A) + \mathcal{H}^s_\delta(B)\). Taking \(\delta \to 0\) gives the metric property. By Carathéodory's metric criterion, every Borel set is \(\mathcal{H}^s\)-measurable. Borel regularity (every set is contained in a Borel set with the same measure) requires a slightly more delicate argument using \(G_\delta\) sets; see Evans-Gariepy, Section 2.1.
1.3 Basic Properties and the Isodiametric Inequality
Scaling. For any \(A \subseteq \mathbb{R}^n\) and any \(\lambda > 0\),
\[
\mathcal{H}^s(\lambda A) = \lambda^s \mathcal{H}^s(A).
\]
Translation invariance. For any \(A \subseteq \mathbb{R}^n\) and any \(x \in \mathbb{R}^n\),
\[
\mathcal{H}^s(A + x) = \mathcal{H}^s(A).
\]
Countable sets. If \(A\) is countable, then \(\mathcal{H}^s(A) = 0\) for all \(s > 0\), and \(\mathcal{H}^0(A) = \#A\) (the counting measure).
The agreement of Hausdorff measure with Lebesgue measure hinges on the isodiametric inequality, which asserts that among all sets of a given diameter, the ball has the greatest volume.
Isodiametric Inequality. For any bounded set \(A \subseteq \mathbb{R}^n\),
\[
\mathcal{L}^n(A) \leq \omega_n \left(\frac{\operatorname{diam}(A)}{2}\right)^n.
\]
Equality holds if and only if \(A\) is contained in a ball of diameter \(\operatorname{diam}(A)\).
We use the Steiner symmetrization argument. For a set \(A \subseteq \mathbb{R}^n\) and a hyperplane \(H\), the Steiner symmetrization \(\sigma_H(A)\) replaces each line segment \(A \cap \ell\) (where \(\ell\) is perpendicular to \(H\)) by a segment of the same length centered on \(H\). By Fubini's theorem, \(\mathcal{L}^n(\sigma_H(A)) = \mathcal{L}^n(A)\). Moreover, one verifies that \(\operatorname{diam}(\sigma_H(A)) \leq \operatorname{diam}(A)\) using the Brunn-Minkowski inequality or a direct geometric argument. Iterating Steiner symmetrizations with respect to a dense sequence of hyperplanes, the symmetrized sets converge (in the Hausdorff metric on compact sets) to a ball of diameter at most \(\operatorname{diam}(A)\), while the Lebesgue measure is preserved at each step. This yields the inequality.
Agreement with Lebesgue Measure. On \(\mathbb{R}^n\), we have \(\mathcal{H}^n = \mathcal{L}^n\).
For any Borel set \(A \subseteq \mathbb{R}^n\), the covering definition gives \(\mathcal{H}^n(A) \leq \mathcal{L}^n(A)\) by using a covering of \(A\) by small cubes and applying the isodiametric inequality. Conversely, by the Vitali covering lemma, given any covering of \(A\) by sets \(\{C_j\}\) with \(\operatorname{diam}(C_j) \leq \delta\), we have \(\mathcal{L}^n(A) \leq \mathcal{L}^n(\bigcup C_j) \leq \sum \mathcal{L}^n(\overline{C_j}) \leq \sum \omega_n (\operatorname{diam}(C_j)/2)^n\), where the last step is the isodiametric inequality. Taking the infimum over coverings gives \(\mathcal{L}^n(A) \leq \mathcal{H}^n_\delta(A)\), and letting \(\delta \to 0\) gives \(\mathcal{L}^n(A) \leq \mathcal{H}^n(A)\).
1.4 Hausdorff Dimension
One of the most striking features of Hausdorff measure is the existence of a critical dimension at which the measure “jumps” from infinity to zero.
Let \(A \subseteq \mathbb{R}^n\). If \(\mathcal{H}^s(A) < \infty\) for some \(s \geq 0\), then \(\mathcal{H}^t(A) = 0\) for all \(t > s\). If \(\mathcal{H}^s(A) > 0\) for some \(s \geq 0\), then \(\mathcal{H}^t(A) = \infty\) for all \(0 \leq t < s\).
Suppose \(\mathcal{H}^s(A) < \infty\). For \(t > s\) and any \(\delta\)-covering \(\{C_j\}\) of \(A\), we have
\[
\sum_j \left(\frac{\operatorname{diam}(C_j)}{2}\right)^t \leq \delta^{t-s} \sum_j \left(\frac{\operatorname{diam}(C_j)}{2}\right)^s.
\]
Taking the infimum and then \(\delta \to 0\) gives \(\mathcal{H}^t(A) \leq \lim_{\delta \to 0} \delta^{t-s} \cdot C = 0\) (after adjusting normalizing constants). The second statement follows by contrapositive.
The Hausdorff dimension of a set \(A \subseteq \mathbb{R}^n\) is
\[
\dim_H(A) = \inf\{s \geq 0 : \mathcal{H}^s(A) = 0\} = \sup\{s \geq 0 : \mathcal{H}^s(A) = \infty\}.
\]
At the critical value \(s = \dim_H(A)\), the measure \(\mathcal{H}^s(A)\) can be \(0\), \(\infty\), or any value in between.
1.5 Examples: Cantor Set and Self-Similar Fractals
The power of Hausdorff dimension is most vividly illustrated by computing it for classical fractal sets.
The Middle-Thirds Cantor Set. Let \(C_0 = [0,1]\) and define \(C_{k+1}\) by removing the open middle third of each interval in \(C_k\). The
Cantor set is \(C = \bigcap_{k=0}^\infty C_k\). At stage \(k\), \(C_k\) consists of \(2^k\) closed intervals, each of length \(3^{-k}\).
We claim \(\dim_H(C) = \frac{\log 2}{\log 3}\). Let \(s = \frac{\log 2}{\log 3}\), so that \(2 = 3^s\), i.e., \(2 \cdot 3^{-s} = 1\).
\[
\mathcal{H}^s_{3^{-k}}(C) \leq 2^k \cdot \omega_s \left(\frac{3^{-k}}{2}\right)^s = \omega_s \cdot 2^{-s} \cdot (2 \cdot 3^{-s})^k = \omega_s \cdot 2^{-s},
\]
which is bounded independently of \(k\). Hence \(\mathcal{H}^s(C) \leq \omega_s \cdot 2^{-s} < \infty\), giving \(\dim_H(C) \leq s\).
Lower bound. We use the mass distribution principle. The natural probability measure \(\mu\) on \(C\), assigning mass \(2^{-k}\) to each interval of \(C_k\), satisfies \(\mu(B(x,r)) \leq C r^s\) for all \(x \in C\) and \(r > 0\), where \(C\) is a constant. By the mass distribution principle (or Frostman’s lemma), this implies \(\mathcal{H}^s(C) > 0\), hence \(\dim_H(C) \geq s\).
Therefore \(\dim_H(C) = \frac{\log 2}{\log 3} \approx 0.6309\).
The computation above generalizes to a broad class of self-similar fractals via the following theorem, due to Moran (1946) and later refined by Hutchinson (1981).
Moran-Hutchinson Theorem. Let \(f_1, \ldots, f_N : \mathbb{R}^n \to \mathbb{R}^n\) be contracting similitudes with ratios \(r_1, \ldots, r_N \in (0,1)\), and let \(K\) be the unique nonempty compact set (the attractor or self-similar set) satisfying
\[
K = \bigcup_{i=1}^N f_i(K).
\]
If the open set condition holds (there exists a nonempty bounded open set \(U\) with \(f_i(U) \subseteq U\) and \(f_i(U) \cap f_j(U) = \varnothing\) for \(i \neq j\)), then \(\dim_H(K) = s\), where \(s\) is the unique solution to the Moran equation
\[
\sum_{i=1}^N r_i^s = 1.
\]
Moreover, \(0 < \mathcal{H}^s(K) < \infty\).
The Koch Snowflake Curve. The Koch curve is the attractor of four similitudes, each with ratio \(r = 1/3\). The Moran equation gives \(4 \cdot (1/3)^s = 1\), hence
\[
\dim_H(\text{Koch curve}) = \frac{\log 4}{\log 3} \approx 1.2619.
\]
The Koch snowflake (the boundary of the region enclosed by three Koch curves) has the same Hausdorff dimension \(\frac{\log 4}{\log 3}\). It is a continuous curve of infinite length that encloses a finite area.
The Sierpiński Triangle. This is the attractor of three similitudes with ratio \(1/2\). The Moran equation gives \(3 \cdot (1/2)^s = 1\), so
\[
\dim_H(\text{Sierpiński triangle}) = \frac{\log 3}{\log 2} \approx 1.585.
\]
1.6 Density Theorems
Density results relate the local behavior of a measure to its global structure. They play a fundamental role in the theory of rectifiability.
Let \(\mu\) be a Radon measure on \(\mathbb{R}^n\) and let \(s \geq 0\). The upper and lower \(s\)-dimensional densities of \(\mu\) at a point \(x \in \mathbb{R}^n\) are
\[
\Theta^{*s}(\mu, x) = \limsup_{r \to 0^+} \frac{\mu(B(x,r))}{\omega_s r^s}, \qquad \Theta_*^s(\mu, x) = \liminf_{r \to 0^+} \frac{\mu(B(x,r))}{\omega_s r^s}.
\]
If these are equal, we write \(\Theta^s(\mu, x)\) for the common value and call it the \(s\)-density of \(\mu\) at \(x\).
Upper Density Estimate. Let \(A \subseteq \mathbb{R}^n\) be \(\mathcal{H}^s\)-measurable with \(\mathcal{H}^s(A) < \infty\). Then
\[
\Theta^{*s}(\mathcal{H}^s \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} A, x) \leq 1 \quad \text{for } \mathcal{H}^s\text{-a.e. } x \in A,
\]
and
\[
2^{-s} \leq \Theta^{*s}(\mathcal{H}^s \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} A, x) \quad \text{for } \mathcal{H}^s\text{-a.e. } x \in A.
\]
1.7 Marstrand’s Projection Theorem
John Marstrand proved in 1954 a fundamental result about how Hausdorff dimension behaves under orthogonal projections. This theorem reveals a deep connection between the dimension of a set and its “visibility” from different directions.
For \(\theta \in [0, \pi)\), let \(\pi_\theta : \mathbb{R}^2 \to \mathbb{R}\) denote the orthogonal projection onto the line through the origin making angle \(\theta\) with the horizontal axis:
\[
\pi_\theta(x_1, x_2) = x_1 \cos\theta + x_2 \sin\theta.
\]
More generally, for \(V \in G(n, m)\) (the Grassmannian of \(m\)-dimensional linear subspaces of \(\mathbb{R}^n\)), let \(\pi_V : \mathbb{R}^n \to V\) be the orthogonal projection.
Marstrand's Projection Theorem (1954). Let \(A \subseteq \mathbb{R}^2\) be a Borel set.
- If \(\dim_H(A) \leq 1\), then \(\dim_H(\pi_\theta(A)) = \dim_H(A)\) for \(\mathcal{L}^1\)-a.e. \(\theta \in [0, \pi)\).
- If \(\dim_H(A) > 1\), then \(\mathcal{L}^1(\pi_\theta(A)) > 0\) for \(\mathcal{L}^1\)-a.e. \(\theta \in [0, \pi)\).
Proof sketch (case \(\dim_H(A) \leq 1\)). The upper bound \(\dim_H(\pi_\theta(A)) \leq \dim_H(A)\) holds for all \(\theta\), since projections are Lipschitz with constant 1 and Lipschitz maps cannot increase Hausdorff dimension. For the lower bound, one uses the energy method: define the \(s\)-energy of a measure \(\mu\) by
\[
I_s(\mu) = \int\!\!\int \frac{d\mu(x)\, d\mu(y)}{|x - y|^s}.
\]
If \(I_s(\mu) < \infty\) for some \(\mu\) supported on \(A\), then \(\dim_H(A) \geq s\). One shows that the projected measures \((\pi_\theta)_\# \mu\) have finite \(s\)-energy for a.e. \(\theta\) by integrating over \(\theta\) and using the explicit kernel computation. This yields the a.e. lower bound on the dimension of the projection.
Chapter 2: Lipschitz Functions and Rectifiability
Lipschitz functions occupy a central position in geometric measure theory: they are the natural morphisms of the theory, playing a role analogous to smooth maps in differential geometry but requiring far less regularity. The fundamental theorems of this chapter — Rademacher’s theorem on almost everywhere differentiability, the area formula, and the coarea formula — provide the technical backbone for all subsequent developments.
2.1 Lipschitz Maps: Basic Properties
A map \(f : A \to \mathbb{R}^m\), where \(A \subseteq \mathbb{R}^n\), is Lipschitz if there exists a constant \(L \geq 0\) such that
\[
|f(x) - f(y)| \leq L|x - y| \quad \text{for all } x, y \in A.
\]
The infimum of all such constants \(L\) is the Lipschitz constant of \(f\), denoted \(\operatorname{Lip}(f)\).
Let \(f : \mathbb{R}^n \to \mathbb{R}^m\) be Lipschitz with constant \(L\). Then:
- \(f\) maps sets of \(\mathcal{L}^n\)-measure zero to sets of \(\mathcal{H}^n\)-measure zero (the Lusin \(N\)-property).
- \(\operatorname{diam}(f(A)) \leq L \cdot \operatorname{diam}(A)\) for all \(A \subseteq \mathbb{R}^n\).
- \(\mathcal{H}^s(f(A)) \leq L^s \mathcal{H}^s(A)\) for all \(A \subseteq \mathbb{R}^n\) and all \(s \geq 0\).
- In particular, \(\dim_H(f(A)) \leq \dim_H(A)\).
Kirszbraun's Extension Theorem (1934). Let \(A \subseteq \mathbb{R}^n\) and \(f : A \to \mathbb{R}^m\) be Lipschitz with constant \(L\). Then there exists a Lipschitz extension \(\tilde{f} : \mathbb{R}^n \to \mathbb{R}^m\) with \(\operatorname{Lip}(\tilde{f}) = L\).
2.2 Rademacher’s Theorem
The following theorem, proved by Hans Rademacher in 1919, is one of the most important results in real analysis and is indispensable for geometric measure theory.
Rademacher's Theorem (1919). Let \(f : \mathbb{R}^n \to \mathbb{R}^m\) be Lipschitz. Then \(f\) is differentiable \(\mathcal{L}^n\)-almost everywhere. That is, for \(\mathcal{L}^n\)-a.e. \(x \in \mathbb{R}^n\), there exists a linear map \(Df(x) : \mathbb{R}^n \to \mathbb{R}^m\) such that
\[
\lim_{y \to x} \frac{|f(y) - f(x) - Df(x)(y - x)|}{|y - x|} = 0.
\]
We give the proof for the case \(m = 1\); the general case follows by applying the scalar result to each component.
Step 1. We first show that directional derivatives exist a.e. For a unit vector \(e \in S^{n-1}\), the function \(t \mapsto f(x + te)\) is Lipschitz on \(\mathbb{R}\), hence differentiable for \(\mathcal{L}^1\)-a.e. \(t\). By Fubini’s theorem, the directional derivative \(\partial_e f(x) = \lim_{t \to 0} \frac{f(x + te) - f(x)}{t}\) exists for \(\mathcal{L}^n\)-a.e. \(x\).
Step 2. Let \(\{e_1, \ldots, e_n\}\) be the standard basis. The partial derivatives \(\partial_{e_i} f\) exist \(\mathcal{L}^n\)-a.e. by Step 1. Fix a countable dense set \(D \subseteq S^{n-1}\). By taking a countable intersection, the directional derivative \(\partial_v f(x)\) exists for all \(v \in D\) simultaneously, for \(\mathcal{L}^n\)-a.e. \(x\).
Step 3. At such a point \(x\), the directional derivative \(\partial_v f(x)\) depends linearly on \(v\) for \(v \in D\) (this follows from the chain rule applied to restrictions to lines, and the Lipschitz bound). Since \(D\) is dense and \(|\partial_v f(x)| \leq L\) for all \(v\), the map \(v \mapsto \partial_v f(x)\) extends uniquely to a bounded linear functional on \(\mathbb{R}^n\), giving the differential \(Df(x)\).
Step 4. It remains to show that this linear map actually gives the derivative, i.e., that \(|f(y) - f(x) - Df(x)(y-x)| = o(|y-x|)\). This is the hardest step. One uses the maximal function estimate: the set where the directional derivative exists but the full derivative does not has measure zero, which can be shown using the Lebesgue differentiation theorem applied to the distributional gradient.
The area formula is the natural generalization of the change-of-variables formula to Lipschitz maps, and it computes the Hausdorff measure of the image of a Lipschitz map in terms of the Jacobian.
For a linear map \(L : \mathbb{R}^n \to \mathbb{R}^m\) with \(n \leq m\), the Jacobian is
\[
J_n L = \sqrt{\det(L^* L)},
\]
where \(L^* : \mathbb{R}^m \to \mathbb{R}^n\) is the adjoint. Equivalently, \(J_n L\) is the \(n\)-dimensional volume of \(L(Q)\), where \(Q\) is the unit cube in \(\mathbb{R}^n\), or equivalently the product of the singular values of \(L\).
Area Formula. Let \(f : \mathbb{R}^n \to \mathbb{R}^m\) be Lipschitz, with \(n \leq m\). Then for every \(\mathcal{L}^n\)-measurable set \(A \subseteq \mathbb{R}^n\),
\[
\int_A J_n Df(x)\, d\mathcal{L}^n(x) = \int_{\mathbb{R}^m} \mathcal{H}^0(A \cap f^{-1}(\{y\}))\, d\mathcal{H}^n(y).
\]
In particular, if \(f\) is injective on \(A\),
\[
\int_A J_n Df(x)\, d\mathcal{L}^n(x) = \mathcal{H}^n(f(A)).
\]
Proof sketch. The proof proceeds in several stages.
Stage 1: Linear maps. For a linear map \(L : \mathbb{R}^n \to \mathbb{R}^m\), the formula reduces to \(\mathcal{H}^n(L(A)) = J_n L \cdot \mathcal{L}^n(A)\), which follows from the polar decomposition \(L = O \circ S\) where \(O\) is an isometry and \(S\) is symmetric.
Stage 2: Approximation. By Rademacher’s theorem, \(f\) is differentiable a.e. On the set where \(Df(x)\) exists and is injective, the implicit function theorem (suitably generalized) ensures that \(f\) is locally “almost linear.” We decompose \(A\) into countably many pieces \(\{A_k\}\) on which \(Df\) is approximately constant, apply the linear case to each piece with controlled error, and sum.
Stage 3: The critical set. On the set \(\{x : J_n Df(x) = 0\}\), one must show \(\mathcal{H}^n(f(\{J_n Df = 0\})) = 0\). This is the content of a Sard-type theorem for Lipschitz maps.
Stage 4: Multiplicity. The general formula (without injectivity) follows by a disintegration argument, decomposing \(A\) into level sets \(A \cap f^{-1}(\{y\})\).
Surface area of a graph. Let \(g : U \to \mathbb{R}\) be Lipschitz, where \(U \subseteq \mathbb{R}^n\) is open. The graph map \(f : U \to \mathbb{R}^{n+1}\) defined by \(f(x) = (x, g(x))\) is Lipschitz and injective. One computes \(Df(x) = \begin{pmatrix} I_n \\ \nabla g(x)^T \end{pmatrix}\), so
\[
J_n Df(x) = \sqrt{\det(I_n + \nabla g(x) \otimes \nabla g(x))} = \sqrt{1 + |\nabla g(x)|^2}.
\]
The area formula gives
\[
\mathcal{H}^n(\operatorname{graph}(g)) = \int_U \sqrt{1 + |\nabla g(x)|^2}\, dx,
\]
recovering the classical formula for surface area.
The coarea formula is a far-reaching generalization of Fubini’s theorem that relates integration over a domain to integration over level sets. It goes in the opposite direction from the area formula: instead of mapping \(\mathbb{R}^n\) into a higher-dimensional space, we project down.
For a linear map \(L : \mathbb{R}^n \to \mathbb{R}^m\) with \(n \geq m\), the coarea factor is
\[
J_m L = \sqrt{\det(L L^*)},
\]
which equals the product of the \(m\) largest singular values of \(L\), or equivalently the \(m\)-dimensional volume of \(L(Q)\) where \(Q\) is the unit cube in \(\mathbb{R}^n\) (suitably interpreted).
Coarea Formula. Let \(f : \mathbb{R}^n \to \mathbb{R}^m\) be Lipschitz with \(n \geq m\). Then for every \(\mathcal{L}^n\)-measurable function \(g : \mathbb{R}^n \to [0, \infty]\),
\[
\int_{\mathbb{R}^n} g(x) J_m Df(x)\, d\mathcal{L}^n(x) = \int_{\mathbb{R}^m}\left(\int_{f^{-1}(\{y\})} g(x)\, d\mathcal{H}^{n-m}(x)\right) d\mathcal{L}^m(y).
\]
The classical coarea formula. When \(m = 1\) and \(f : \mathbb{R}^n \to \mathbb{R}\) is Lipschitz, we have \(J_1 Df(x) = |\nabla f(x)|\), and the coarea formula becomes
\[
\int_{\mathbb{R}^n} g(x) |\nabla f(x)|\, dx = \int_{-\infty}^{\infty} \left(\int_{\{f = t\}} g\, d\mathcal{H}^{n-1}\right) dt.
\]
Taking \(g = |\nabla f|^{-1} \chi_A\) (where \(|\nabla f| > 0\)), this gives
\[
\mathcal{L}^n(A) = \int_{-\infty}^{\infty} \int_{\{f = t\} \cap A} \frac{1}{|\nabla f|}\, d\mathcal{H}^{n-1}\, dt,
\]
a generalized Cavalieri principle. For the distance function \(f(x) = |x|\), the level sets are spheres and we recover the polar coordinates formula.
Chapter 3: Rectifiable Sets and Measures
The notion of rectifiability provides a measure-theoretic generalization of smooth submanifolds. A rectifiable set is, roughly, one that can be covered — up to a set of measure zero — by countably many Lipschitz images of Euclidean space. This chapter develops the theory of rectifiable sets, their tangent structure, and the deep Besicovitch-Federer projection theorem that characterizes purely unrectifiable sets.
3.1 Countably Rectifiable Sets
A set \(E \subseteq \mathbb{R}^n\) is countably \(m\)-rectifiable (or simply \(m\)-rectifiable) if there exist Lipschitz maps \(f_j : \mathbb{R}^m \to \mathbb{R}^n\), \(j = 1, 2, \ldots\), such that
\[
\mathcal{H}^m\!\left(E \setminus \bigcup_{j=1}^\infty f_j(\mathbb{R}^m)\right) = 0.
\]
Equivalently (using the area formula and Rademacher's theorem), \(E\) is \(m\)-rectifiable if and only if there exist \(C^1\) maps \(g_j : \mathbb{R}^m \to \mathbb{R}^n\) such that \(\mathcal{H}^m(E \setminus \bigcup_j g_j(\mathbb{R}^m)) = 0\). This equivalence is non-trivial and relies on Whitney's extension theorem.
A set \(E \subseteq \mathbb{R}^n\) with \(\mathcal{H}^m(E) < \infty\) is purely \(m\)-unrectifiable if it contains no \(m\)-rectifiable subset of positive \(\mathcal{H}^m\)-measure. Equivalently, \(\mathcal{H}^m(E \cap f(\mathbb{R}^m)) = 0\) for every Lipschitz map \(f : \mathbb{R}^m \to \mathbb{R}^n\).
Rectifiable-Unrectifiable Decomposition. Let \(E \subseteq \mathbb{R}^n\) with \(\mathcal{H}^m(E) < \infty\). Then there is a unique decomposition \(E = E_{\mathrm{rect}} \cup E_{\mathrm{unrect}}\) (up to \(\mathcal{H}^m\)-null sets) where \(E_{\mathrm{rect}}\) is \(m\)-rectifiable and \(E_{\mathrm{unrect}}\) is purely \(m\)-unrectifiable.
Rectifiable sets.- Any \(C^1\) \(m\)-dimensional submanifold of \(\mathbb{R}^n\) (or countable union thereof) is \(m\)-rectifiable.
- The graph of any Lipschitz function \(g : \mathbb{R}^m \to \mathbb{R}^{n-m}\) is \(m\)-rectifiable.
- Any countable set is \(0\)-rectifiable.
Purely unrectifiable sets. The product \(C \times C\) of two copies of the middle-thirds Cantor set in \(\mathbb{R}^2\) is purely \(1\)-unrectifiable (despite having Hausdorff dimension \(\frac{2\log 2}{\log 3} \approx 1.26 > 1\)). Any set of Hausdorff dimension strictly less than \(m\) is trivially purely \(m\)-unrectifiable.
3.2 Tangent Planes and Approximate Tangent Planes
For smooth submanifolds, the tangent space at a point is defined via the derivative of a parametrization. For rectifiable sets, we need a measure-theoretic substitute.
Let \(E \subseteq \mathbb{R}^n\) be \(\mathcal{H}^m\)-measurable with \(\mathcal{H}^m(E) < \infty\). An \(m\)-dimensional plane \(V + a\) (where \(V \in G(n,m)\) and \(a \in \mathbb{R}^n\)) is an approximate tangent plane to \(E\) at \(a \in E\) if
\[
\lim_{r \to 0^+} r^{-m} \mathcal{H}^m(E \cap B(a, r) \cap \{x : |(x-a) - \pi_V(x-a)| > \epsilon |x-a|\}) = 0
\]
for every \(\epsilon > 0\). Equivalently, the rescaled sets \(r^{-1}(E - a)\) converge in a measure-theoretic sense to \(V\).
Existence of Approximate Tangent Planes. Let \(E \subseteq \mathbb{R}^n\) be \(m\)-rectifiable with \(\mathcal{H}^m(E) < \infty\). Then \(E\) has a unique approximate tangent plane \(\operatorname{Tan}^m(E, a)\) at \(\mathcal{H}^m\)-a.e. \(a \in E\).
Since \(E\) is \(m\)-rectifiable, it can be covered (up to an \(\mathcal{H}^m\)-null set) by countably many Lipschitz images \(f_j(\mathbb{R}^m)\). By Rademacher's theorem, each \(f_j\) is differentiable a.e. At a point \(a = f_j(x_0)\) where \(Df_j(x_0)\) exists and has rank \(m\), the image of \(Df_j(x_0)\) gives an \(m\)-dimensional plane that is the approximate tangent plane to \(f_j(\mathbb{R}^m)\) at \(a\). The area formula ensures that these tangent planes are well-defined and independent of the parametrization chosen.
3.3 Characterizations of Rectifiability
One of the great achievements of geometric measure theory is the multitude of equivalent characterizations of rectifiability. These connect geometric, analytic, and measure-theoretic viewpoints.
Characterizations of \(m\)-Rectifiability. Let \(E \subseteq \mathbb{R}^n\) be \(\mathcal{H}^m\)-measurable with \(0 < \mathcal{H}^m(E) < \infty\). The following are equivalent:
- (Parametric) \(E\) is \(m\)-rectifiable.
- (Tangent) \(E\) has an approximate \(m\)-dimensional tangent plane at \(\mathcal{H}^m\)-a.e. point.
- (Density) \(\Theta^m(\mathcal{H}^m \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} E, x) = 1\) for \(\mathcal{H}^m\)-a.e. \(x \in E\).
- (Projection) For every \(V \in G(n,m)\), \(\mathcal{H}^m(\pi_V(E)) > 0\) (i.e., \(E\) projects onto a set of positive measure in almost every \(m\)-plane).
3.4 The Besicovitch-Federer Projection Theorem
The following theorem, conjectured by Besicovitch and proved by Federer (1947), provides a striking geometric characterization of purely unrectifiable sets via their projections.
Besicovitch-Federer Projection Theorem. Let \(E \subseteq \mathbb{R}^n\) be \(\mathcal{H}^m\)-measurable with \(\mathcal{H}^m(E) < \infty\). Then \(E\) is purely \(m\)-unrectifiable if and only if
\[
\mathcal{H}^m(\pi_V(E)) = 0 \quad \text{for } \gamma_{n,m}\text{-a.e. } V \in G(n,m).
\]
The set \(C \times C \subseteq \mathbb{R}^2\), where \(C\) is the middle-thirds Cantor set, is purely \(1\)-unrectifiable. By the Besicovitch-Federer theorem, \(\mathcal{H}^1(\pi_\theta(C \times C)) = 0\) for a.e. \(\theta\). This is remarkable because \(\dim_H(C \times C) = \frac{2\log 2}{\log 3} > 1\), so the set is "large" in terms of Hausdorff dimension, yet it projects to a null set in almost every direction.
3.5 Rectifiable Measures
A Radon measure \(\mu\) on \(\mathbb{R}^n\) is
\(m\)-rectifiable if:
- \(\mu\) is absolutely continuous with respect to \(\mathcal{H}^m\), and
- \(\mu\) is concentrated on an \(m\)-rectifiable set, i.e., there exists an \(m\)-rectifiable set \(E\) such that \(\mu(\mathbb{R}^n \setminus E) = 0\).
Equivalently, \(\mu = \theta \mathcal{H}^m \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} E\) for some \(m\)-rectifiable set \(E\) and some locally \(\mathcal{H}^m\)-integrable function \(\theta : E \to (0, \infty)\), called the
multiplicity function.
Preiss's Theorem (1987). A Radon measure \(\mu\) on \(\mathbb{R}^n\) is \(m\)-rectifiable (with integer multiplicity) if and only if the \(m\)-density \(\Theta^m(\mu, x)\) exists and is a positive integer for \(\mu\)-a.e. \(x\).
Chapter 4: Currents
The theory of currents, developed by Georges de Rham and vastly extended by Herbert Federer and Wendell Fleming in their landmark 1960 paper, provides a powerful framework for studying generalized surfaces with boundaries. Currents can be thought of as “distributions acting on differential forms” — a linearized, measure-theoretic substitute for oriented submanifolds. The Federer-Fleming compactness theorem for integral currents is the key tool that enables the solution of Plateau’s problem in all dimensions.
We begin by recalling the algebraic preliminaries.
Let \(V = \mathbb{R}^n\). The space of \(k\)-vectors is the exterior power \(\Lambda_k(\mathbb{R}^n) = \Lambda^k V\), the vector space spanned by elements of the form
\[
v_1 \wedge v_2 \wedge \cdots \wedge v_k, \quad v_i \in \mathbb{R}^n.
\]
Its dimension is \(\binom{n}{k}\). Elements of \(\Lambda_k(\mathbb{R}^n)\) are called \(k\)-vectors; those of the form \(v_1 \wedge \cdots \wedge v_k\) are called simple (or decomposable) \(k\)-vectors. The inner product on \(\Lambda_k(\mathbb{R}^n)\) is defined by
\[
\langle v_1 \wedge \cdots \wedge v_k, w_1 \wedge \cdots \wedge w_k \rangle = \det(\langle v_i, w_j \rangle)_{i,j}.
\]
The induced norm is \(|\xi| = \sqrt{\langle \xi, \xi \rangle}\).
The space of \(k\)-covectors (or exterior \(k\)-forms) is \(\Lambda^k(\mathbb{R}^n) = (\Lambda_k(\mathbb{R}^n))^*\). A differential \(k\)-form on an open set \(U \subseteq \mathbb{R}^n\) is a map \(\omega : U \to \Lambda^k(\mathbb{R}^n)\). In coordinates with basis \(\{e^{i_1} \wedge \cdots \wedge e^{i_k}\}\),
\[
\omega(x) = \sum_{I} \omega_I(x)\, dx^{i_1} \wedge \cdots \wedge dx^{i_k},
\]
where the sum is over increasing multi-indices \(I = (i_1 < \cdots < i_k)\). We write \(\mathcal{D}^k(U)\) for the space of smooth, compactly supported \(k\)-forms on \(U\).
4.2 Currents: Definitions
A \(k\)-dimensional current in an open set \(U \subseteq \mathbb{R}^n\) is a continuous linear functional on the space \(\mathcal{D}^k(U)\) of smooth, compactly supported differential \(k\)-forms. The space of \(k\)-currents is denoted \(\mathcal{D}_k(U)\). We write \(T(\omega)\) for the action of the current \(T\) on the form \(\omega\).
The definition of continuity here means: \(T(\omega_j) \to T(\omega)\) whenever the forms \(\omega_j\) converge to \(\omega\) in the standard topology on \(\mathcal{D}^k(U)\) (all derivatives converge uniformly, supports contained in a fixed compact set).
The boundary of a \(k\)-current \(T \in \mathcal{D}_k(U)\) is the \((k-1)\)-current \(\partial T\) defined by
\[
\partial T(\omega) = T(d\omega), \quad \omega \in \mathcal{D}^{k-1}(U),
\]
where \(d\omega\) is the exterior derivative. Since \(d^2 = 0\), we have \(\partial^2 = 0\), mirroring the topological fact that "the boundary of a boundary is zero."
Current of integration. Let \(M \subseteq \mathbb{R}^n\) be a smooth oriented \(k\)-dimensional submanifold (possibly with boundary). Then \(M\) defines a \(k\)-current \(\llbracket M \rrbracket\) by
\[
\llbracket M \rrbracket(\omega) = \int_M \omega, \quad \omega \in \mathcal{D}^k(U).
\]
By Stokes' theorem, \(\partial \llbracket M \rrbracket = \llbracket \partial M \rrbracket\), where \(\partial M\) is the boundary manifold with the induced orientation. This motivates the definition of the boundary operator for general currents.
4.3 Mass and Normal Currents
The mass of a \(k\)-current \(T \in \mathcal{D}_k(U)\) is
\[
\mathbf{M}(T) = \sup\{T(\omega) : \omega \in \mathcal{D}^k(U),\; |\omega(x)| \leq 1 \text{ for all } x\}.
\]
Here \(|\omega(x)|\) is the comass norm of the \(k\)-covector \(\omega(x)\), defined as \(|\omega(x)| = \sup\{|\omega(x)(\xi)| : \xi \in \Lambda_k(\mathbb{R}^n),\; |\xi| \leq 1\}\).
A \(k\)-current \(T\) is a normal current if both \(T\) and \(\partial T\) have finite mass:
\[
\mathbf{N}(T) := \mathbf{M}(T) + \mathbf{M}(\partial T) < \infty.
\]
The space of normal \(k\)-currents is denoted \(\mathbf{N}_k(U)\).
\[
T(\omega) = \int \langle \omega(x), \vec{T}(x) \rangle\, d\|T\|(x),
\]
where \(\|T\|\) is a Radon measure (the total variation measure or mass measure of \(T\)) and \(\vec{T} : \mathbb{R}^n \to \Lambda_k(\mathbb{R}^n)\) is a \(\|T\|\)-measurable unit \(k\)-vector field (the orientation of \(T\)).
4.4 Integral Currents and the Federer-Fleming Compactness Theorem
A \(k\)-current \(T\) is called
integer-multiplicity rectifiable (or simply
rectifiable) if it can be written as
\[
T(\omega) = \int_E \langle \omega(x), \vec{T}(x) \rangle \theta(x)\, d\mathcal{H}^k(x),
\]
where:
- \(E\) is a countably \(k\)-rectifiable set,
- \(\vec{T}(x)\) is an orienting unit simple \(k\)-vector spanning the approximate tangent plane to \(E\) at \(x\), for \(\mathcal{H}^k\)-a.e. \(x \in E\),
- \(\theta : E \to \mathbb{Z}_{>0}\) is a locally \(\mathcal{H}^k\)-integrable positive integer-valued multiplicity function.
We write \(T = \tau(E, \theta, \vec{T})\).
An integral current is a rectifiable current \(T\) whose boundary \(\partial T\) is also a rectifiable current (or zero). The space of integral \(k\)-currents is denoted \(\mathbf{I}_k(U)\).
The following theorem is the cornerstone of the entire theory. It provides the compactness needed to solve variational problems.
Federer-Fleming Compactness Theorem (1960). Let \(\{T_j\} \subseteq \mathbf{I}_k(\mathbb{R}^n)\) be a sequence of integral \(k\)-currents with
\[
\sup_j \left(\mathbf{M}(T_j) + \mathbf{M}(\partial T_j)\right) < \infty
\]
and all supports contained in a fixed compact set. Then there exists a subsequence \(\{T_{j_\ell}\}\) and an integral current \(T \in \mathbf{I}_k(\mathbb{R}^n)\) such that \(T_{j_\ell} \to T\) in the sense of currents (i.e., weakly: \(T_{j_\ell}(\omega) \to T(\omega)\) for all \(\omega \in \mathcal{D}^k\)).
The flat norm of a \(k\)-current \(T \in \mathcal{D}_k(U)\) is
\[
\mathbf{F}(T) = \inf\{\mathbf{M}(R) + \mathbf{M}(S) : T = R + \partial S,\; R \in \mathcal{D}_k(U),\; S \in \mathcal{D}_{k+1}(U)\}.
\]
Deformation Theorem (Federer-Fleming). Let \(T \in \mathbf{I}_k(\mathbb{R}^n)\) be an integral current and let \(\epsilon > 0\). Then there exist an integral polyhedral chain \(P\), an integral current \(R\) with \(\mathbf{M}(R) < \epsilon\), and an integral current \(S\) with
\[
T = P + R + \partial S,
\]
where \(P\) is supported on a \(k\)-dimensional polyhedral complex with mesh at most \(\epsilon\), and
\[
\mathbf{M}(P) \leq C(\mathbf{M}(T) + \epsilon), \quad \mathbf{M}(S) \leq C\epsilon \cdot \mathbf{M}(T),
\]
for a constant \(C = C(n, k)\).
4.6 Slicing of Currents
Slicing provides a way to “restrict” a current to level sets of a Lipschitz function, generalizing the notion of intersecting a surface with a hyperplane.
Slicing Theorem. Let \(T \in \mathbf{I}_k(\mathbb{R}^n)\) and let \(f : \mathbb{R}^n \to \mathbb{R}^m\) be Lipschitz with \(m \leq k\). Then for \(\mathcal{L}^m\)-a.e. \(y \in \mathbb{R}^m\), there exists an integral \((k-m)\)-current \(\langle T, f, y \rangle\), called the
slice of \(T\) by \(f\) at \(y\), satisfying:
- \(\langle T, f, y \rangle\) is supported on \(\operatorname{spt}(T) \cap f^{-1}(\{y\})\).
- \(\int_{\mathbb{R}^m} \mathbf{M}(\langle T, f, y \rangle)\, d\mathcal{L}^m(y) \leq (\operatorname{Lip} f)^m \mathbf{M}(T)\).
- (Boundary formula) \(\partial \langle T, f, y \rangle = (-1)^m \langle \partial T, f, y \rangle\) for a.e. \(y\).
Moreover, if \(m = 1\) and \(f\) is a coordinate function, then the slicing satisfies the coarea-type identity
\[
T \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} f^{-1}((a, b)) = \int_a^b \langle T, f, t \rangle\, dt + (\text{boundary terms}).
\]
Chapter 5: Varifolds
While currents carry orientation information and allow for a boundary operator, certain geometric problems — particularly those involving surfaces with singularities, soap films, and phase interfaces — require a framework that does not depend on orientation. William Allard and Frederick Almgren developed the theory of varifolds in the 1960s and 1970s precisely for this purpose. A varifold is, roughly speaking, a measure on the Grassmann bundle that records both the location and the tangent plane of a generalized surface, but forgets orientation.
5.1 Definitions and Basic Properties
The Grassmann bundle over an open set \(U \subseteq \mathbb{R}^n\) is
\[
G_m(U) = U \times G(n, m),
\]
where \(G(n,m)\) is the Grassmannian of (unoriented) \(m\)-dimensional linear subspaces of \(\mathbb{R}^n\). We equip \(G(n,m)\) with a natural metric and \(G_m(U)\) with the product topology.
A
general \(m\)-varifold in \(U\) is a Radon measure on \(G_m(U)\). The space of general \(m\)-varifolds is denoted \(\mathbf{V}_m(U)\).
\[
\|V\|(A) = V(\pi^{-1}(A)) = V(A \times G(n,m))
\]
for every Borel set \(A \subseteq U\), where \(\pi : G_m(U) \to U\) is the projection.
A varifold \(V \in \mathbf{V}_m(U)\) is rectifiable if there exist a countably \(m\)-rectifiable set \(E \subseteq U\) and a locally \(\mathcal{H}^m\)-integrable function \(\theta : E \to (0, \infty)\) such that
\[
V(\phi) = \int_E \phi(x, \operatorname{Tan}^m(E, x))\, \theta(x)\, d\mathcal{H}^m(x)
\]
for all \(\phi \in C_c(G_m(U))\). If \(\theta\) takes values in the positive integers, \(V\) is an integer-rectifiable varifold (or integral varifold). We write \(V = \mathbf{v}(E, \theta)\).
5.2 First Variation and Generalized Mean Curvature
The first variation of a varifold captures how its mass changes under smooth deformations, providing a weak notion of mean curvature.
Let \(V \in \mathbf{V}_m(U)\). The first variation of \(V\) is the linear functional \(\delta V : C^1_c(U; \mathbb{R}^n) \to \mathbb{R}\) defined by
\[
\delta V(X) = \int_{G_m(U)} \operatorname{div}_S X(x)\, dV(x, S),
\]
where \(\operatorname{div}_S X(x) = \sum_{i=1}^m \langle D_{e_i} X(x), e_i \rangle\) is the tangential divergence of \(X\) with respect to the \(m\)-plane \(S\), and \(\{e_1, \ldots, e_m\}\) is any orthonormal basis for \(S\).
A varifold \(V\) is stationary if \(\delta V(X) = 0\) for all \(X \in C^1_c(U; \mathbb{R}^n)\). More generally, \(V\) has locally bounded first variation if the total variation \(|\delta V|\) is a Radon measure. In this case, by the Riesz representation theorem, there exists a \(\|V\|\)-measurable vector field \(H\) such that
\[
\delta V(X) = -\int \langle H(x), X(x) \rangle\, d\|V\|(x)
\]
for all \(X \in C^1_c(U; \mathbb{R}^n)\). The field \(H\) is called the generalized mean curvature of \(V\).
The monotonicity formula is a key quantitative tool that controls the local behavior of stationary varifolds.
Monotonicity Formula. Let \(V \in \mathbf{V}_m(U)\) be a stationary varifold (i.e., \(\delta V = 0\)). Then for every \(a \in U\), the function
\[
r \mapsto \frac{\|V\|(B(a, r))}{\omega_m r^m}
\]
is non-decreasing for \(0 < r < \operatorname{dist}(a, \partial U)\). In particular, the density
\[
\Theta^m(\|V\|, a) = \lim_{r \to 0^+} \frac{\|V\|(B(a, r))}{\omega_m r^m}
\]
exists and is a non-negative real number for every \(a \in \operatorname{spt}\|V\|\).
Proof sketch. Choose a radial test vector field \(X(x) = \eta(|x - a|)(x - a)\), where \(\eta\) is a suitable cutoff function. Substituting into the first variation formula \(\delta V(X) = 0\) and using the coarea formula, one derives the identity
\[
\frac{d}{dr}\left(\frac{\|V\|(B(a,r))}{\omega_m r^m}\right) = \frac{1}{\omega_m r^m} \int_{B(a,r)} \frac{|(x-a)^\perp|^2}{|x-a|^{m+2}}\, d\|V\|(x) \geq 0,
\]
where \((x-a)^\perp\) denotes the component of \(x - a\) perpendicular to the tangent plane \(S\). The right-hand side is non-negative, yielding monotonicity.
5.4 Allard’s Regularity Theorem
The following regularity theorem, due to William Allard (1972), is one of the most important results in geometric measure theory. It asserts that an integral varifold that is “close to flat” in a suitable sense must actually be a smooth graph.
Allard's Regularity Theorem (1972). There exist constants \(\epsilon = \epsilon(n, m) > 0\) and \(C = C(n, m) > 0\) with the following property. Let \(V = \mathbf{v}(E, \theta)\) be an integral \(m\)-varifold in \(B(0, 1) \subseteq \mathbb{R}^n\) with generalized mean curvature \(H \in L^p(\|V\|)\) for some \(p > m\). Suppose:
- (Density close to 1) \(|\Theta^m(\|V\|, 0) - 1| < \epsilon\),
- (Mass close to flat) \(\omega_m^{-1} \|V\|(B(0, 1)) < 1 + \epsilon\),
- (Small mean curvature) \(\left(\int_{B(0,1)} |H|^p\, d\|V\|\right)^{1/p} < \epsilon\).
Then \(\operatorname{spt}\|V\| \cap B(0, 1/2)\) is a \(C^{1,\alpha}\) graph over the approximate tangent plane at the origin, for \(\alpha = 1 - m/p\). Moreover, if \(H \in C^{k,\beta}\), then the graph is \(C^{k+2,\beta}\).
5.5 Compactness for Varifolds
Allard's Compactness Theorem. Let \(\{V_j\} \subseteq \mathbf{V}_m(U)\) be a sequence of integral varifolds with locally bounded first variation and
\[
\sup_j \left(\|V_j\|(K) + |\delta V_j|(K)\right) < \infty
\]
for every compact \(K \subseteq U\). Then there exists a subsequence converging (as Radon measures on \(G_m(U)\)) to an integral varifold \(V\) with locally bounded first variation.
Chapter 6: The Plateau Problem and Minimal Surfaces
Joseph Plateau (1801–1883), a Belgian physicist who was blinded by an experiment involving staring at the sun, conducted extensive experiments with soap films in the mid-19th century. He observed that a soap film spanning a given wire frame minimizes area and meets along certain characteristic singularities. The mathematical question of whether every reasonable boundary curve bounds a surface of least area became known as Plateau’s problem and stood as one of the central challenges of 19th and 20th century mathematics. Its resolution required the full machinery of geometric measure theory.
Plateau's Problem (in the language of currents). Let \(\Gamma \in \mathbf{I}_{k-1}(\mathbb{R}^n)\) be an integral \((k-1)\)-current with \(\partial \Gamma = 0\) (i.e., \(\Gamma\) is a cycle). Find an integral \(k\)-current \(T \in \mathbf{I}_k(\mathbb{R}^n)\) with \(\partial T = \Gamma\) that minimizes the mass:
\[
\mathbf{M}(T) = \inf\{\mathbf{M}(S) : S \in \mathbf{I}_k(\mathbb{R}^n),\; \partial S = \Gamma\}.
\]
6.2 Existence via the Direct Method
Existence of Mass-Minimizing Currents (Federer-Fleming, 1960). Let \(\Gamma \in \mathbf{I}_{k-1}(\mathbb{R}^n)\) be a compactly supported integral \((k-1)\)-cycle. Then there exists \(T \in \mathbf{I}_k(\mathbb{R}^n)\) with \(\partial T = \Gamma\) achieving
\[
\mathbf{M}(T) = \inf\{\mathbf{M}(S) : S \in \mathbf{I}_k(\mathbb{R}^n),\; \partial S = \Gamma\}.
\]
The proof uses the direct method of the calculus of variations.
Step 1: Existence of a competitor. Since \(\partial \Gamma = 0\), the isoperimetric inequality for currents guarantees that there exists at least one \(S \in \mathbf{I}_k(\mathbb{R}^n)\) with \(\partial S = \Gamma\). Let \(m_0 = \inf\{\mathbf{M}(S) : \partial S = \Gamma\}\).
Step 2: Minimizing sequence. Choose integral currents \(T_j\) with \(\partial T_j = \Gamma\) and \(\mathbf{M}(T_j) \to m_0\). By replacing \(T_j\) with the cone over \(\Gamma\) truncated to a large ball if necessary, we may assume all supports lie in a fixed compact set.
Step 3: Compactness. The sequence \(\{T_j\}\) satisfies \(\sup_j \mathbf{M}(T_j) < \infty\) and \(\partial T_j = \Gamma\) for all \(j\), so \(\sup_j \mathbf{M}(\partial T_j) = \mathbf{M}(\Gamma) < \infty\). By the Federer-Fleming compactness theorem, there is a subsequence \(T_{j_\ell} \to T\) (weakly) where \(T \in \mathbf{I}_k(\mathbb{R}^n)\).
Step 4: Boundary. By the continuity of the boundary operator under weak convergence, \(\partial T = \lim \partial T_{j_\ell} = \Gamma\).
Step 5: Lower semicontinuity. The mass is lower semicontinuous with respect to weak convergence of currents: \(\mathbf{M}(T) \leq \liminf_\ell \mathbf{M}(T_{j_\ell}) = m_0\). Since \(\partial T = \Gamma\), we also have \(\mathbf{M}(T) \geq m_0\). Hence \(\mathbf{M}(T) = m_0\).
6.3 Regularity in Codimension One
The existence theorem produces a mass-minimizing integral current, but says nothing about its regularity. Could the minimizer be everywhere singular? The regularity theory, developed over several decades, shows that in codimension one the answer is a resounding no: the minimizer is smooth except possibly on a small singular set.
Interior Regularity (De Giorgi, 1961; Almgren, 1968; Simons, 1968). Let \(T \in \mathbf{I}_k(\mathbb{R}^{k+1})\) be a mass-minimizing integral \(k\)-current in \(\mathbb{R}^{k+1}\) (codimension one). Let \(\operatorname{reg}(T)\) denote the
regular set (points near which \(\operatorname{spt} T\) is a smooth embedded submanifold) and \(\operatorname{sing}(T) = \operatorname{spt} T \setminus (\operatorname{reg}(T) \cup \operatorname{spt} \partial T)\) the
singular set. Then:
- \(\operatorname{reg}(T)\) is an open dense subset of \(\operatorname{spt} T \setminus \operatorname{spt} \partial T\).
- \(\operatorname{sing}(T)\) has Hausdorff dimension at most \(k - 7\).
- In particular, if \(k \leq 6\) (i.e., the ambient dimension is at most 7), then \(\operatorname{sing}(T) = \varnothing\) and the minimizer is completely smooth.
The regularity program proceeds through a series of deep results:
De Giorgi's Regularity Theorem (1961). Let \(T\) be a mass-minimizing integral \(k\)-current in \(\mathbb{R}^{k+1}\). If \(x \in \operatorname{spt} T \setminus \operatorname{spt} \partial T\) and the density \(\Theta^k(\|T\|, x) = 1\), then \(x \in \operatorname{reg}(T)\). In fact, \(\operatorname{spt} T\) is an analytic submanifold near \(x\).
Proof sketch. The idea is a blow-up argument. If \(T\) is mass-minimizing and has density 1 at \(x\), then by the monotonicity formula, the rescaled currents \(T_{x,r} = (\iota_{x,r})_\# T\) (where \(\iota_{x,r}(y) = (y - x)/r\)) have mass ratios close to 1 for small \(r\). One shows:
- Excess decay: The excess \(E(T, B(x,r)) = r^{-k}\left(\|T\|(B(x,r)) - \omega_k r^k\right)\) measures the deviation from flatness. One proves that the excess decays at a definite rate: \(E(T, B(x, r/2)) \leq C E(T, B(x, r))^\alpha\) for some \(\alpha > 1\), provided the initial excess is small enough.
- Lipschitz approximation: When the excess is small, \(\operatorname{spt} T \cap B(x, r/2)\) is a Lipschitz graph over the approximate tangent plane.
- Elliptic regularity: The mass-minimizing condition implies the Lipschitz graph satisfies a quasi-linear elliptic PDE (the minimal surface equation). By Schauder theory, the solution is smooth (in fact, analytic by a result of Morrey).
Simons' Inequality and Dimension Bound (1968). Let \(\Sigma^k \subseteq \mathbb{R}^{k+1}\) be a smooth, complete, stable minimal hypersurface (without boundary). James Simons proved:
- The second fundamental form \(A\) satisfies the differential inequality
\[
\Delta |A|^2 \geq -2|A|^4 + 2\left(1 + \frac{2}{k}\right)|\nabla A|^2,
\]
which, combined with a maximum principle argument, yields:
- If \(k \leq 5\), the only complete stable minimal hypercone in \(\mathbb{R}^{k+1}\) is a hyperplane.
Together with the dimension-reduction argument of Federer, this implies \(\dim_H(\operatorname{sing}(T)) \leq k - 7\) for mass-minimizing currents.
6.4 Bernstein’s Theorem and Its Generalizations
Bernstein's Theorem (generalized). Let \(u : \mathbb{R}^k \to \mathbb{R}\) be an entire solution to the minimal surface equation
\[
\operatorname{div}\left(\frac{\nabla u}{\sqrt{1 + |\nabla u|^2}}\right) = 0.
\]
Then:
- If \(k \leq 7\), \(u\) must be affine (i.e., the graph is a hyperplane).
- If \(k \geq 8\), there exist non-affine entire minimal graphs.
6.5 The Simons Cone
The sharpness of the dimension bound \(k - 7\) for the singular set is demonstrated by a celebrated example.
The Simons cone is the \(7\)-dimensional cone in \(\mathbb{R}^8\) defined by
\[
\mathbf{C} = \left\{(x, y) \in \mathbb{R}^4 \times \mathbb{R}^4 : |x|^2 = |y|^2\right\}.
\]
More generally, for any \(p, q \geq 1\), the Clifford cone is
\[
\mathbf{C}_{p,q} = \left\{(x, y) \in \mathbb{R}^{p+1} \times \mathbb{R}^{q+1} : \frac{|x|^2}{p} = \frac{|y|^2}{q}\right\}.
\]
The Simons cone is \(\mathbf{C}_{3,3}\).
Bombieri-De Giorgi-Giusti (1969). The Simons cone \(\mathbf{C}_{3,3}\) is a mass-minimizing hypersurface in \(\mathbb{R}^8\). It is area-minimizing among all integral currents with the same boundary at infinity. Its only singularity is at the origin.
6.6 Higher Codimension: Almgren’s Big Regularity Theorem
Almgren's Regularity Theorem (completed 2000, published posthumously). Let \(T \in \mathbf{I}_m(\mathbb{R}^n)\) be a mass-minimizing integral \(m\)-current in \(\mathbb{R}^n\) for arbitrary codimension \(n - m \geq 2\). Then the singular set satisfies
\[
\dim_H(\operatorname{sing}(T)) \leq m - 2.
\]
6.7 Stable Minimal Surfaces
A minimal surface (or more precisely, a stationary integral varifold \(V\)) is stable if the second variation of area is non-negative:
\[
\delta^2 V(\phi) = \int_M \left(|\nabla^\perp \phi|^2 - |A|^2 |\phi|^2 - \operatorname{Ric}^\perp(\phi, \phi)\right) d\mathcal{H}^m \geq 0
\]
for all smooth compactly supported normal variations \(\phi\). In Euclidean space, the Ricci term vanishes and stability becomes
\[
\int_M |\nabla^\perp \phi|^2\, d\mathcal{H}^m \geq \int_M |A|^2 |\phi|^2\, d\mathcal{H}^m.
\]
Curvature Estimates for Stable Minimal Surfaces (Schoen-Simon-Yau, 1975). Let \(\Sigma^m \subseteq \mathbb{R}^{m+1}\) be a complete, stable, immersed minimal hypersurface. If \(m \leq 5\), then \(\Sigma\) is a hyperplane.
Chapter 7: Sets of Finite Perimeter and Isoperimetric Problems
The theory of sets of finite perimeter (Caccioppoli sets) provides a measure-theoretic framework for studying boundaries of regions, generalizing the classical notion of a surface. This framework, developed primarily by Renato Caccioppoli and Ennio De Giorgi in the 1950s, connects geometric measure theory to the calculus of variations, partial differential equations, and geometric analysis. It yields elegant proofs of fundamental geometric inequalities, including the isoperimetric inequality.
7.1 Functions of Bounded Variation
Let \(U \subseteq \mathbb{R}^n\) be open. A function \(u \in L^1(U)\) has bounded variation in \(U\) if the distributional gradient \(Du\) is a vector-valued Radon measure on \(U\), i.e.,
\[
|Du|(U) = \sup\left\{\int_U u\, \operatorname{div}\phi\, dx : \phi \in C^1_c(U; \mathbb{R}^n),\; |\phi| \leq 1\right\} < \infty.
\]
The space of functions of bounded variation is denoted \(BV(U)\), and \(|Du|(U)\) is called the total variation of \(u\) in \(U\).
BV Compactness Theorem. Let \(\{u_j\} \subseteq BV(U)\) with \(\sup_j \|u_j\|_{BV} < \infty\), where \(U\) is bounded and has Lipschitz boundary. Then there exists a subsequence \(u_{j_k} \to u\) in \(L^1(U)\) for some \(u \in BV(U)\), and \(|Du|(U) \leq \liminf_k |Du_{j_k}|(U)\).
Structure Theorem for BV Functions. Let \(u \in BV(U)\). The distributional derivative \(Du\) admits the decomposition
\[
Du = D^a u + D^s u = \nabla u\, \mathcal{L}^n + D^c u + D^j u,
\]
where:
- \(D^a u = \nabla u\, \mathcal{L}^n\) is the absolutely continuous part, with \(\nabla u \in L^1(U; \mathbb{R}^n)\) the approximate gradient,
- \(D^j u = (u^+ - u^-) \nu_u\, \mathcal{H}^{n-1} \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} J_u\) is the jump part, concentrated on the jump set \(J_u\) (an \((n-1)\)-rectifiable set), where \(u^+, u^-\) are the traces from each side and \(\nu_u\) is the normal,
- \(D^c u\) is the Cantor part, singular with respect to \(\mathcal{L}^n\) and vanishing on all \((n-1)\)-rectifiable sets.
7.2 Sets of Finite Perimeter
A \(\mathcal{L}^n\)-measurable set \(E \subseteq \mathbb{R}^n\) has finite perimeter in an open set \(U\) if its characteristic function \(\chi_E\) belongs to \(BV(U)\). The perimeter of \(E\) in \(U\) is
\[
P(E; U) = |D\chi_E|(U) = \sup\left\{\int_E \operatorname{div}\phi\, dx : \phi \in C^1_c(U; \mathbb{R}^n),\; |\phi| \leq 1\right\}.
\]
Sets of finite perimeter are also called Caccioppoli sets, honoring Renato Caccioppoli who introduced them in the 1920s.
Examples of sets of finite perimeter.- Any bounded open set with Lipschitz boundary has finite perimeter.
- The set \(\{x \in \mathbb{R}^n : |x| < 1\}\) (the unit ball) has perimeter \(n\omega_n\) (the surface area of the unit sphere).
- Any finite union of cubes in \(\mathbb{R}^n\) has finite perimeter.
- A set whose boundary is a fractal (e.g., the interior of the Koch snowflake in \(\mathbb{R}^2\)) has finite perimeter if and only if \(\mathcal{H}^{n-1}(\partial E) < \infty\). The Koch snowflake has infinite \(\mathcal{H}^1\)-measure boundary, so it does not have finite perimeter. However, its area is finite.
7.3 The Reduced Boundary
The central idea of De Giorgi’s theory is to identify a canonical “measure-theoretic boundary” for a set of finite perimeter that has rectifiable structure.
Let \(E\) be a set of finite perimeter in \(U\). The measure-theoretic exterior normal is the Radon-Nikodým derivative
\[
\nu_E(x) = -\frac{dD\chi_E}{d|D\chi_E|}(x),
\]
which exists \(|D\chi_E|\)-a.e. and satisfies \(|\nu_E(x)| = 1\). The reduced boundary (or essential boundary) of \(E\) is the set
\[
\partial^* E = \left\{x \in \operatorname{spt}|D\chi_E| : \nu_E(x) \text{ exists and } |\nu_E(x)| = 1\right\}.
\]
7.4 De Giorgi’s Structure Theorem
The following theorem, due to Ennio De Giorgi (1954-1955), is one of the foundational results of the theory. It shows that the reduced boundary of a set of finite perimeter is a rectifiable set, and that the Gauss-Green formula holds in this generality.
De Giorgi's Structure Theorem (1954-1955). Let \(E\) be a set of finite perimeter in \(\mathbb{R}^n\). Then:
- (Rectifiability) The reduced boundary \(\partial^* E\) is countably \((n-1)\)-rectifiable.
- (Density) \(\Theta^{n-1}(|D\chi_E|, x) = 1\) for \(\mathcal{H}^{n-1}\)-a.e. \(x \in \partial^* E\).
- (Measure representation) \(|D\chi_E| = \mathcal{H}^{n-1} \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} \partial^* E\), i.e.,
\[
P(E; U) = \mathcal{H}^{n-1}(\partial^* E \cap U)
\]
for every open set \(U\).
- (Blow-up) At every point \(x \in \partial^* E\), the rescaled sets \(r^{-1}(E - x)\) converge in \(L^1_{\mathrm{loc}}\) as \(r \to 0^+\) to the half-space \(\{y : \langle y, \nu_E(x) \rangle \leq 0\}\). In particular, the approximate tangent plane to \(\partial^* E\) at \(x\) is the hyperplane \(\nu_E(x)^\perp\).
Proof sketch.Step 1: Blow-up at reduced boundary points. Fix \(x \in \partial^* E\) and consider the rescaled sets \(E_r = r^{-1}(E - x)\). The perimeter is uniformly bounded: \(P(E_r; B(0, R)) \leq C(R)\). By BV compactness, a subsequence \(E_{r_k}\) converges in \(L^1_{\mathrm{loc}}\) to a set \(F\) of locally finite perimeter. One shows that \(F\) is a half-space by proving that \(\chi_F\) is invariant under translations parallel to \(\nu_E(x)^\perp\) (using the existence and constancy of the normal at \(x\)).
Step 2: Density and rectifiability. The blow-up to a half-space implies that \(\Theta^{n-1}(|D\chi_E|, x) = 1\) for every \(x \in \partial^* E\). Combining this with the characterization of rectifiable sets by density (Preiss’s theorem, or more elementarily, the fact that the tangent plane exists at each point of \(\partial^* E\)), one obtains that \(\partial^* E\) is countably \((n-1)\)-rectifiable.
Step 3: Measure representation. The equality \(|D\chi_E| = \mathcal{H}^{n-1} \mathbin{\vrule height 6pt depth 0pt\relax\vrule height 0.5pt depth 0pt width 4pt} \partial^* E\) follows from the density result and a comparison argument using the Besicovitch covering theorem: since the density equals 1 at \(\mathcal{H}^{n-1}\)-a.e. point, the measures must agree.
De Giorgi’s structure theorem yields a far-reaching generalization of the classical divergence theorem.
Generalized Gauss-Green Formula. Let \(E\) be a set of finite perimeter in \(\mathbb{R}^n\). Then for every \(\phi \in C^1_c(\mathbb{R}^n; \mathbb{R}^n)\),
\[
\int_E \operatorname{div}\phi\, dx = -\int_{\partial^* E} \langle \phi, \nu_E \rangle\, d\mathcal{H}^{n-1},
\]
where \(\nu_E\) is the measure-theoretic outer normal to \(E\).
7.6 The Isoperimetric Inequality
The isoperimetric inequality is one of the oldest and most beautiful results in mathematics: among all domains with a given volume, the ball minimizes the surface area (perimeter). Geometric measure theory provides a clean and powerful proof.
Isoperimetric Inequality. For every set \(E \subseteq \mathbb{R}^n\) of finite perimeter,
\[
\min\{\mathcal{L}^n(E), \mathcal{L}^n(\mathbb{R}^n \setminus E)\}^{(n-1)/n} \leq C_n\, P(E),
\]
where \(C_n = n^{-1} \omega_n^{-1/n}\). Equality holds if and only if \(E\) is (up to a null set) a ball.
We present the elegant proof via the Sobolev inequality and a symmetrization argument.
Step 1: Reduction to smooth sets. By approximation (smooth sets are dense among sets of finite perimeter in the appropriate topology), it suffices to prove the inequality for bounded open sets with smooth boundary.
\[
\|u\|_{L^{n/(n-1)}} \leq C_n |Du|(\mathbb{R}^n).
\]\[
\mathcal{L}^n(E)^{(n-1)/n} = \|\chi_E\|_{L^{n/(n-1)}}^{n/(n-1)} \cdots
\]\[
\mathcal{L}^n(E)^{(n-1)/n} \leq C_n P(E).
\]
Step 3: Equality characterization. The equality case requires showing that the extremals of the BV-Sobolev inequality are characteristic functions of balls. This follows from the Pólya-Szegő inequality and the characterization of extremals of the Sobolev inequality by Aubin and Talenti, or alternatively by a direct symmetrization argument: Steiner symmetrization does not increase the perimeter while preserving the volume, and the only sets invariant under all Steiner symmetrizations are balls.
7.7 The Isoperimetric Problem via Direct Methods
The isoperimetric inequality can also be established by solving the associated variational problem directly using the compactness of sets of finite perimeter.
Existence of Isoperimetric Sets. For every \(v > 0\), there exists a set \(E_v \subseteq \mathbb{R}^n\) of finite perimeter with \(\mathcal{L}^n(E_v) = v\) that minimizes the perimeter:
\[
P(E_v) = \inf\{P(F) : F \text{ has finite perimeter},\; \mathcal{L}^n(F) = v\}.
\]
Any such minimizer is (up to translation and a null set) a ball of volume \(v\).
Proof sketch.Existence. The key challenge is preventing minimizing sequences from “escaping to infinity.” One uses the concentration-compactness principle of P.-L. Lions (1984): a minimizing sequence either concentrates (and converges to a minimizer), vanishes (impossible since the volume is fixed), or splits into pieces at large distances (ruled out by the subadditivity of perimeter and the strict inequality for the isoperimetric ratio of split domains).
Regularity and uniqueness. Any isoperimetric set has smooth boundary except on a set of Hausdorff codimension at least 7 (by the regularity theory for almost-minimizers of the perimeter). The boundary has constant mean curvature (the Lagrange multiplier for the volume constraint). By a result of Alexandrov (1962), the only compact embedded hypersurface of constant mean curvature in \(\mathbb{R}^n\) is the round sphere.
7.8 Connections to the Calculus of Variations and Optimal Transport
The theory of sets of finite perimeter provides a natural framework for studying a wide range of variational problems beyond the classical isoperimetric inequality.
A perimeter-penalized variational problem seeks to minimize functionals of the form
\[
\mathcal{F}(E) = P(E; U) + \int_E g(x)\, dx,
\]
where \(g : U \to \mathbb{R}\) is a given function. These arise naturally in phase transition problems: \(E\) represents a phase, the perimeter term penalizes the interfacial energy, and the integral term represents a bulk energy.
Existence of Minimizers for Perimeter-Penalized Problems. Let \(U \subseteq \mathbb{R}^n\) be a bounded open set with Lipschitz boundary, and let \(g \in L^1(U)\). Then the functional \(\mathcal{F}(E) = P(E; U) + \int_E g\, dx\) admits a minimizer among all sets of finite perimeter \(E \subseteq U\).
This follows immediately from the direct method: take a minimizing sequence \(\{E_j\}\), use the compactness of sets of finite perimeter (BV compactness) to extract a convergent subsequence \(\chi_{E_j} \to \chi_E\) in \(L^1(U)\), and use the lower semicontinuity of the perimeter together with the continuity of the integral term to conclude \(\mathcal{F}(E) \leq \liminf_j \mathcal{F}(E_j)\).
7.9 Concentration Compactness
We conclude with a brief discussion of the concentration-compactness principle, which addresses the fundamental difficulty that sequences of sets (or functions) on unbounded domains may fail to converge simply because mass escapes to infinity.
Lions' Concentration-Compactness Principle (1984). Let \(\{\mu_j\}\) be a sequence of non-negative measures on \(\mathbb{R}^n\) with \(\mu_j(\mathbb{R}^n) = 1\). Then, after passing to a subsequence, exactly one of the following occurs:
- Compactness: There exist translations \(y_j \in \mathbb{R}^n\) such that for every \(\epsilon > 0\), there exists \(R > 0\) with \(\mu_j(B(y_j, R)) \geq 1 - \epsilon\) for all \(j\).
- Vanishing: \(\sup_{y \in \mathbb{R}^n} \mu_j(B(y, R)) \to 0\) for every \(R > 0\).
- Dichotomy: There exists \(\alpha \in (0, 1)\) such that for every \(\epsilon > 0\), there exist \(j_0\) and non-negative measures \(\mu_j^1, \mu_j^2\) with \(\mu_j^1 + \mu_j^2 \leq \mu_j\), \(|\mu_j^1(\mathbb{R}^n) - \alpha| < \epsilon\), \(|\mu_j^2(\mathbb{R}^n) - (1 - \alpha)| < \epsilon\), and \(\operatorname{dist}(\operatorname{spt} \mu_j^1, \operatorname{spt} \mu_j^2) \to \infty\).
Summary and Outlook
Geometric measure theory provides the foundational language for studying geometric variational problems in their fullest generality. The key themes running through this course are:
Measure and dimension. Hausdorff measure and dimension extend classical notions of length, area, and volume to arbitrary subsets of Euclidean space, providing the correct framework for studying both smooth and fractal objects.
Rectifiability. The theory of rectifiable sets and measures identifies the class of sets that possess tangent structure almost everywhere — the measure-theoretic analogues of smooth submanifolds. The characterization theorems (Besicovitch-Federer, Preiss) reveal deep connections between geometric regularity and measure-theoretic density.
Generalized surfaces. Currents and varifolds provide two complementary frameworks for studying surfaces with singularities. Currents carry orientation and admit a boundary operator, enabling the solution of the Plateau problem. Varifolds forget orientation but retain tangent information, enabling powerful regularity theorems.
Regularity. The regularity theory for area-minimizing currents, developed by De Giorgi, Almgren, Simons, and Allard, reveals that minimizers are smooth except on small singular sets. The dimension bounds for singular sets are sharp, as demonstrated by the Simons cone.
Perimeter and isoperimetry. Sets of finite perimeter provide a robust framework for studying boundaries, and the theory yields elegant proofs of the isoperimetric inequality and its generalizations.
The subject continues to be an active area of research, with current directions including:
- Quantitative isoperimetric inequalities (Fusco-Maggi-Pratelli, 2008).
- Regularity of area-minimizing currents mod \(p\) (De Lellis-Hirsch-Marchese-Stuvard).
- Applications to general relativity (Penrose inequality, positive mass theorem via GMT).
- Mean curvature flow as a tool for classification of singularities.
- Connections to sub-Riemannian geometry and analysis on metric spaces.