CO 743: Discrete and Computational Geometry
Estimated study time: 1 hr 42 min
Table of contents
These notes synthesize material from J. Matousek’s Lectures on Discrete Geometry, J. Pach and P.K. Agarwal’s Combinatorial Geometry, M. de Berg et al.’s Computational Geometry: Algorithms and Applications, and J. Matousek’s Using the Borsuk-Ulam Theorem, enriched with material from MIT 18.319 and L. Guth’s Polynomial Methods in Combinatorics.
1. Convex Sets and Polytopes
Convexity is one of the most fundamental notions in geometry, and its interplay with combinatorics gives rise to a rich and beautiful theory. In this chapter we develop the foundational results on convex sets, beginning with the classical theorems of Carathéodory, Radon, and Helly, and proceeding to the combinatorial study of polytopes.
1.1 Convex Sets and Convex Hulls
The intersection of any family of convex sets is convex, which motivates the following definition.
A point of the form \( \sum_{i=1}^k \lambda_i x_i \) with \( \lambda_i \geq 0 \) and \( \sum \lambda_i = 1 \) is called a convex combination of the points \( x_1, \ldots, x_k \). The equivalence of the two definitions is a standard exercise: the set of all convex combinations is itself convex and is contained in every convex set containing \( S \).
1.2 Carathéodory’s Theorem
A natural question about convex hulls is: how many points from \( S \) are needed to express any point of \( \operatorname{conv}(S) \) as a convex combination? Carathéodory’s theorem, proved by Constantin Carathéodory in 1907, gives a sharp answer.
with not all \( \mu_i \) zero. This is called an affine dependence.
\[ p = \sum_{i=1}^k (\lambda_i - t\mu_i) x_i, \quad \text{and} \quad \sum_{i=1}^k (\lambda_i - t\mu_i) = 1. \]\[ t^* = \min \left\{ \frac{\lambda_i}{\mu_i} : \mu_i > 0 \right\}. \]At \( t = t^* \), all coefficients \( \lambda_i - t^* \mu_i \) remain non-negative (by the choice of minimum), they still sum to 1, and at least one coefficient is zero. Thus we have expressed \( p \) as a convex combination of at most \( k-1 \) points of \( S \).
Repeating this process, we reduce the number of points until \( k \leq d+1 \).
1.3 Radon’s Theorem
Johann Radon proved the following partition result in 1921. It is a direct consequence of the same linear algebra argument that underlies Carathéodory’s theorem.
is a point that lies in both \( \operatorname{conv}(\{x_i : i \in I\}) \) and \( \operatorname{conv}(\{x_j : j \in J\}) \). Since \( I \) and \( J \) partition \( \{1, \ldots, d+2\} \), we are done.
1.4 Helly’s Theorem
Eduard Helly proved his celebrated intersection theorem in 1913, though it was first published by Radon in 1921. It is one of the cornerstones of combinatorial convexity.
For each \( i \in \{1, \ldots, n\} \), the family \( \{C_j : j \neq i\} \) has \( n-1 \) members, and every \( d+1 \) of them have a common point (since every \( d+1 \) of the original \( n \) sets do). By the induction hypothesis, there exists a point \( x_i \in \bigcap_{j \neq i} C_j \).
We now have \( n \geq d+2 \) points \( x_1, \ldots, x_n \). By Radon’s theorem, these can be partitioned into two sets \( I \) and \( J \) with \( \operatorname{conv}(\{x_i : i \in I\}) \cap \operatorname{conv}(\{x_j : j \in J\}) \neq \emptyset \). Let \( p \) be a point in this intersection.
We claim \( p \in \bigcap_{k=1}^n C_k \). Fix any \( k \in \{1, \ldots, n\} \). Since \( I \) and \( J \) partition \( \{1, \ldots, n\} \), either \( k \notin I \) or \( k \notin J \).
If \( k \notin I \), then for every \( i \in I \), we have \( x_i \in C_k \) (since \( x_i \in \bigcap_{j \neq i} C_j \) and \( k \neq i \)). So \( \{x_i : i \in I\} \subseteq C_k \), and since \( C_k \) is convex, \( \operatorname{conv}(\{x_i : i \in I\}) \subseteq C_k \), hence \( p \in C_k \).
Similarly, if \( k \notin J \), then \( p \in \operatorname{conv}(\{x_j : j \in J\}) \subseteq C_k \).
In either case \( p \in C_k \), so \( p \in \bigcap_{k=1}^n C_k \).
1.5 The Fractional Helly Theorem
Helly’s theorem requires that every \( (d+1) \)-tuple of sets has a common point. What if only a positive fraction of the \( (d+1) \)-tuples intersect? The fractional Helly theorem, due to Katchalski and Liu (1979), gives a surprisingly strong conclusion.
This result tells us that even a “fractional” Helly condition forces a large common intersection. It has found numerous applications in combinatorial geometry and optimization.
1.6 Colorful Carathéodory Theorem
Imre Bárány proved the following remarkable generalization of Carathéodory’s theorem in 1982. It introduces the idea of “colorful” configurations that has become a powerful paradigm.
1.7 Polytopes and Face Lattices
We now turn to the combinatorial theory of polytopes, which are the higher-dimensional analogues of polygons and polyhedra.
The equivalence of these two descriptions (the “V-representation” and “H-representation”) is a fundamental result in polyhedral theory, sometimes called the Weyl-Minkowski theorem.
The set of all faces of a polytope \( P \), ordered by inclusion, forms a lattice called the face lattice of \( P \). Two polytopes are combinatorially equivalent if their face lattices are isomorphic. The study of face lattices is central to combinatorial convexity.
1.8 Euler’s Formula and the Dehn-Sommerville Relations
Leonhard Euler discovered the following remarkable relation in 1752, one of the founding results of topology.
This is the Euler-Poincaré relation for polytopes. It provides one linear constraint on the \( f \)-vector. For simplicial polytopes (those where every proper face is a simplex), much more can be said.
These relations, first discovered by Max Dehn in 1905 for \( d = 4,5 \) and generalized by Duncan Sommerville in 1927, show that the \( f \)-vector of a simplicial polytope has a hidden symmetry captured by the so-called \( h \)-vector.
1.9 The Upper Bound Theorem
How many faces can a polytope with \( n \) vertices have? This fundamental question was answered by Peter McMullen in 1970.
1.10 Neighborly Polytopes
Neighborly polytopes are extremal objects in the theory of polytopes. Their existence demonstrates that high-dimensional polytopes can have far richer combinatorial structure than their low-dimensional counterparts suggest. In dimension 4, for example, a neighborly polytope with \( n \) vertices has every pair of vertices forming an edge — it is a “complete graph” polytope, with \( \binom{n}{2} \) edges.
2. Incidence Geometry
Incidence geometry studies the combinatorial relationships between geometric objects — how many times can points and lines (or other geometric primitives) meet? The central results here have deep connections to additive combinatorics, harmonic analysis, and theoretical computer science.
2.1 Point-Line Incidences
A trivial upper bound is \( I(P, L) \leq |P| \cdot |L| \). Can we do better? If \( |P| = m \) and \( |L| = n \), a simple double counting shows \( I(P,L) \leq m + \binom{n}{2} \) (each pair of lines meets in at most one point) and \( I(P,L) \leq n + \binom{m}{2} \) (each pair of points determines at most one line). But the true answer is far stronger.
2.2 The Crossing Lemma
Before proving the Szemerédi-Trotter theorem, we establish a key tool from topological graph theory.
Now consider a random subgraph \( G' \) obtained by independently keeping each vertex with probability \( p \in (0,1] \) (and keeping an edge if both its endpoints are kept). The expected number of vertices of \( G' \) is \( pn \), the expected number of edges is \( p^2 m \), and the expected number of crossings is \( p^4 \operatorname{cr}(G) \) (a crossing requires all four endpoints of the two crossing edges to be kept).
\[ p^4 \operatorname{cr}(G) \geq p^2 m - 3pn. \]\[ \operatorname{cr}(G) \geq \frac{m}{p^2} - \frac{3n}{p^3}. \]\[ \operatorname{cr}(G) \geq \frac{m^3}{16n^2} - \frac{3m^3}{64n^2} = \frac{m^3}{64n^2}. \]2.3 The Szemerédi-Trotter Theorem
The following theorem, proved by Endre Szemerédi and William Trotter in 1983, is one of the central results in combinatorial geometry. The original proof was technically involved; we present the elegant proof via the crossing lemma due to Székely (1997).
Draw \( G \) in the plane by placing each vertex at the corresponding point of \( P \) and drawing each edge as the line segment between consecutive points on its line. Two edges from different lines can cross in at most one point, so the number of crossings is at most \( \binom{n}{2} < n^2/2 \).
If \( I(P,L) - n \leq 4m \), then \( I(P,L) \leq 4m + n = O(m+n) \) and we are done.
\[ \frac{n^2}{2} \geq \operatorname{cr}(G) \geq \frac{(I(P,L)-n)^3}{64m^2}. \]\[ (I(P,L)-n)^3 \leq 32 m^2 n^2, \]giving \( I(P,L) - n = O(m^{2/3} n^{2/3}) \), and hence \( I(P,L) = O(m^{2/3} n^{2/3} + m + n) \).
2.4 Applications: Distinct Distances and Unit Distances
One of the oldest problems in combinatorial geometry is due to Paul Erdős (1946): what is the minimum number of distinct distances determined by \( n \) points in the plane?
Erdős conjectured that the answer is \( \Omega(n/\sqrt{\log n}) \), achieved by the \( \sqrt{n} \times \sqrt{n} \) integer grid. Using the Szemerédi-Trotter theorem, one can prove a lower bound of \( \Omega(n^{4/5}) \). The Elekes-Sharir framework (2010) transformed the problem into one about incidences, ultimately leading to the resolution by Guth and Katz.
This nearly resolves Erdős’s conjecture and represents one of the greatest achievements of the polynomial method in combinatorial geometry. The proof combines the Elekes-Sharir reduction to a point-line incidence problem in three dimensions with the polynomial partitioning technique.
The unit distances problem, also posed by Erdős, asks for the maximum number \( u(n) \) of pairs at distance exactly 1 among \( n \) points. The Szemerédi-Trotter theorem implies \( u(n) = O(n^{4/3}) \) (by applying it to the \( n \) circles of radius 1 centered at the points). The \( \sqrt{n} \times \sqrt{n} \) grid shows \( u(n) = \Omega(n^{1+c/\log\log n}) \). Closing this gap remains a major open problem.
2.5 Beck’s Theorem
(i) There is a line containing at least \( n/100 \) of the points, or
(ii) The points determine at least \( \Omega(n^2) \) distinct lines.
Beck’s theorem can be proved using the Szemerédi-Trotter theorem. It captures the fundamental dichotomy that a point set is either “nearly collinear” or determines many lines.
2.6 Incidences in Higher Dimensions
The study of point-hyperplane and point-variety incidences in \( \mathbb{R}^d \) for \( d \geq 3 \) requires substantially different techniques. The polynomial partitioning method of Guth and Katz (discussed in Chapter 7) has led to major progress. A key result is the following.
3. Arrangements and Geometric Transversals
Arrangements of hyperplanes and other geometric objects are fundamental structures in computational and combinatorial geometry. This chapter develops the theory of arrangements, proves the ham-sandwich theorem using Borsuk-Ulam, and introduces the powerful framework of VC dimension and epsilon-nets.
3.1 Hyperplane Arrangements
For \( d = 2 \), this gives at most \( \binom{n}{0} + \binom{n}{1} + \binom{n}{2} = 1 + n + \binom{n}{2} \) regions, with equality when the lines are in general position (no two parallel, no three concurrent). This is the classical result on the number of regions determined by \( n \) lines in the plane.
3.2 The Zone Theorem
The zone theorem is a crucial tool in the analysis of algorithms for constructing and manipulating arrangements. It implies that inserting a new hyperplane into an existing arrangement takes time proportional to the zone complexity.
3.3 The Ham-Sandwich Theorem
The ham-sandwich theorem is a beautiful result at the intersection of topology and geometry. Its name comes from the following vivid interpretation: given a ham sandwich consisting of two slices of bread and a slice of ham, it is possible to cut all three pieces exactly in half with a single straight cut.
Then \( f \) is continuous (under suitable regularity assumptions on the sets) and satisfies \( f(-v) = -f(v) \) since reversing the orientation of the hyperplane swaps the two sides. By the Borsuk-Ulam theorem, there exists \( v^* \in S^d \) with \( f(v^*) = 0 \), meaning \( H_{v^*} \) bisects each \( A_i \).
3.4 Geometric Transversals
Helly’s theorem characterizes when a point transversal exists. For line transversals, the situation is more complex and leads to Hadwiger-type transversal theorems.
3.5 VC Dimension and Epsilon-Nets
The theory of VC dimension, originating in statistical learning theory (Vapnik-Chervonenkis, 1971), has become a cornerstone of computational and combinatorial geometry.
More generally, the range space of half-spaces in \( \mathbb{R}^d \) has VC dimension \( d+1 \).
The Sauer-Shelah lemma shows that bounded VC dimension implies polynomial growth of the number of distinct “traces” of ranges on finite sets. This is crucial for the epsilon-net theorem.
3.6 Epsilon-Nets and the Epsilon-Net Theorem
In other words, an \( \varepsilon \)-net “hits” every range of measure at least \( \varepsilon \). The \( \varepsilon \)-net theorem guarantees that small \( \varepsilon \)-nets exist for range spaces of bounded VC dimension.
To apply a union bound, we need to bound the number of distinct ranges restricted to a finite sample. Draw an additional “test sample” \( T \) of size \( t \) (to be chosen). The key observation is: if \( R \) has \( \mu(R) \geq \varepsilon \) and \( N \cap R = \emptyset \), then with reasonable probability, \( R \) contains at least \( \varepsilon t / 2 \) points of \( T \). This can be formalized via a double-sampling argument.
More precisely, consider a random sample \( S \) of size \( r + t \). We bound the probability that there exists a range \( R \) that is “heavy” (captures at least \( \varepsilon t / 2 \) points of the last \( t \) points of \( S \)) yet “missed” by the first \( r \) points of \( S \). The number of distinct subsets induced by ranges on \( S \) is at most \( \sum_{i=0}^d \binom{r+t}{i} \leq (r+t)^d \) by the Sauer-Shelah lemma.
\[ \frac{\binom{r+t - \varepsilon t/2}{r}}{\binom{r+t}{r}} \leq \left(1 - \frac{\varepsilon t/2}{r+t}\right)^r. \]Taking \( t = 2r \) and choosing \( c \) large enough, the union bound over all \( O((3r)^d) \) ranges gives a total failure probability less than 1. Thus an \( \varepsilon \)-net of size \( r = O\left(\frac{d}{\varepsilon}\log\frac{d}{\varepsilon}\right) \) exists.
3.7 Epsilon-Approximations
An \( \varepsilon \)-approximation provides a uniform approximation of the measure of all ranges simultaneously. Every \( \varepsilon \)-approximation is a \( 2\varepsilon \)-net, but the converse is far from true. For range spaces of VC dimension \( d \), \( \varepsilon \)-approximations of size \( O(d/\varepsilon^2 \cdot \log(d/\varepsilon)) \) exist, by a more refined probabilistic argument.
4. Geometric Algorithms
Computational geometry concerns the design and analysis of algorithms for geometric problems. This chapter covers the foundational algorithms: convex hull computation, Voronoi diagrams, Delaunay triangulations, range searching, point location, and line segment intersection.
4.1 Convex Hull Algorithms
Computing the convex hull of a set of points is one of the most fundamental problems in computational geometry. We present several classical algorithms.
4.1.1 Graham Scan
The algorithm works as follows:
- Find the point \( p_0 \) with the lowest \( y \)-coordinate (breaking ties by \( x \)-coordinate).
- Sort the remaining points by polar angle with respect to \( p_0 \).
- Process points in sorted order, maintaining a stack representing the vertices of the current convex hull. For each new point, pop points from the stack while the new point makes a right turn (or goes straight) with the top two stack elements, then push the new point.
The correctness follows from the invariant that the stack always forms a convex chain. The running time is dominated by the sorting step: \( O(n \log n) \). The stack operations take \( O(n) \) amortized time (each point is pushed and popped at most once).
4.1.2 Gift Wrapping (Jarvis March)
Starting from the lowest point, the algorithm repeatedly selects the point that makes the smallest counter-clockwise angle with the current edge direction. Each step takes \( O(n) \) time, and there are \( h \) steps. This is output-sensitive: when \( h \) is small, it outperforms Graham scan.
4.1.3 Divide and Conquer
Sort the points by \( x \)-coordinate. Recursively compute the convex hulls of the left and right halves. Merge the two hulls by finding the upper and lower tangent lines. The merge step takes \( O(n) \) time, giving the recurrence \( T(n) = 2T(n/2) + O(n) \), which solves to \( O(n \log n) \).
4.2 Voronoi Diagrams
The Voronoi diagram encodes proximity information: the cell of \( p_i \) consists of all points closer to \( p_i \) than to any other site. It is a fundamental data structure with applications in nearest-neighbor queries, facility location, and many other domains.
The complexity bounds follow from Euler’s formula (the Voronoi diagram is a planar graph). The \( O(n \log n) \) algorithm can be obtained by Fortune’s sweep (Fortune, 1987) or by divide and conquer (Shamos and Hoey, 1975).
4.3 Delaunay Triangulations
This characterization gives an equivalent definition: a triangulation of \( S \) is Delaunay if and only if every triangle has an empty circumscribed circle. The Delaunay triangulation maximizes the minimum angle among all triangulations of the same point set, making it ideal for numerical methods.
There is a beautiful connection between Delaunay triangulations and convex hulls.
This duality provides a conceptually clean way to compute Delaunay triangulations via three-dimensional convex hull algorithms.
4.4 Range Searching
Range searching is a fundamental problem in computational geometry: preprocess a set of \( n \) points so that for a given query range \( R \) (e.g., a rectangle, half-plane, or simplex), one can efficiently count or report the points in \( R \).
For non-orthogonal ranges (e.g., half-planes or simplices), different techniques are needed, often involving partition trees and cuttings.
4.5 Point Location
Kirkpatrick’s algorithm builds a hierarchy of progressively coarser triangulations. At each level, an independent set of vertices is removed and the resulting hole is re-triangulated. The hierarchy has \( O(\log n) \) levels, and a query descends from the coarsest level to the finest.
4.6 Line Segment Intersection: The Bentley-Ottmann Algorithm
The algorithm uses the plane sweep paradigm: a vertical line sweeps from left to right, maintaining a balanced binary search tree of segments ordered by their \( y \)-coordinate at the sweep line. Event points (segment endpoints and intersection points) are processed in left-to-right order. When two segments become adjacent in the sweep structure, their intersection (if any) is predicted and added to the event queue.
The key insight is that two segments can intersect only if they are adjacent in the sweep line order at some point during the sweep. This keeps the event queue manageable: at any time, it contains at most \( O(n+k) \) events.
4.7 Randomized Incremental Construction
Many geometric algorithms benefit from randomization. The paradigm of randomized incremental construction builds a geometric structure by inserting objects in random order and updating the structure after each insertion.
The algorithm inserts points one at a time in random order. After inserting a point \( p \), it finds the triangle containing \( p \), splits it, and restores the Delaunay property by edge flips. Using the history graph (or conflict graph) for point location, each insertion takes \( O(\log n) \) expected amortized time.
5. Packing, Covering, and Tiling
The study of packing, covering, and tiling problems has ancient roots — Kepler speculated about sphere packing in 1611 — and connects geometry to number theory, coding theory, and optimization. This chapter surveys the main results and open problems.
5.1 Sphere Packing
5.2 The Kepler Conjecture
This conjecture, posed by Johannes Kepler in 1611, was one of the oldest open problems in geometry. Thomas Hales announced a proof in 1998, which was published in 2005 after extensive peer review. The proof is a monumental achievement, combining theoretical reductions with extensive computer calculations. It was later verified formally using the Flyspeck project (completed 2014) and the Lean proof assistant.
5.3 The Minkowski-Hlawka Theorem
While finding the densest packing in a given dimension is extremely hard, we can prove the existence of reasonably dense packings.
The proof uses Minkowski’s theorem on lattice points in convex bodies (or, in its modern formulation, a probabilistic averaging argument over random lattices due to Siegel). Despite its simplicity, the \( 2^{-d} \) order has not been substantially improved for general \( d \) — this is one of the major open problems in the geometry of numbers.
5.4 Rogers’ Bound and High-Dimensional Packing
The gap between the lower bound (Minkowski-Hlawka, \( \Delta \geq 2^{-d+1} \)) and the upper bound (Kabatiansky-Levenshtein, \( \Delta \leq 2^{-0.599d} \)) remains wide. Understanding the true exponential rate of the optimal packing density is a deep open problem.
5.5 Connections to Coding Theory
There is a profound analogy between sphere packings and error-correcting codes. A binary code of length \( n \), minimum distance \( d \), and size \( M \) corresponds to a packing of \( M \) Hamming balls of radius \( \lfloor (d-1)/2 \rfloor \) in \( \{0,1\}^n \). The rate of the code, \( \frac{1}{n}\log_2 M \), is analogous to the packing density.
The Gilbert-Varshamov bound (existence of codes by greedy/probabilistic arguments) is the coding-theoretic analogue of the Minkowski-Hlawka theorem, while the Hamming bound (sphere-packing bound for codes) is analogous to Rogers’ bound.
5.6 Sphere Covering
Covering is the dual problem to packing: we want to cover space with as few balls as possible. The covering density satisfies \( \theta \geq 1 \) with equality only in trivial cases. Rogers (1959) showed that the covering density of the thinnest lattice covering in \( \mathbb{R}^d \) is at most \( d \ln d + d \ln \ln d + 5d \) — roughly exponential in dimension, in contrast to the packing problem.
5.7 Tiling Problems
A central question is: which convex bodies tile \( \mathbb{R}^d \)? In \( \mathbb{R}^2 \), the complete list of convex tiles is known: all triangles and quadrilaterals tile (by suitable arrangements), along with exactly 15 families of convex pentagons (the list was completed in 2017) and exactly 3 families of convex hexagons. No convex polygon with 7 or more sides tiles the plane.
5.8 The Honeycomb Conjecture
This result, conjectured since antiquity (Pappus attributed it to the wisdom of bees), was proved by Thomas Hales in 1999. The proof uses the isoperimetric inequality and careful analysis of the possible local structures.
6. Topological Methods in Combinatorial Geometry
Topology provides surprisingly powerful tools for combinatorial and geometric problems. This chapter centers on the Borsuk-Ulam theorem and its many applications, including the ham-sandwich theorem, Lovász’s proof of Kneser’s conjecture, and the topological Tverberg theorem.
6.1 The Borsuk-Ulam Theorem
The Borsuk-Ulam theorem, first proved by Karol Borsuk in 1933, is one of the most beautiful and applicable results in topology. It has an extraordinary range of consequences in combinatorics, geometry, and beyond.
Equivalently: every continuous odd map \( g: S^n \to \mathbb{R}^n \) (satisfying \( g(-x) = -g(x) \)) has a zero.
Base case \( n = 1 \). Let \( f: S^1 \to \mathbb{R} \) be continuous. Define \( g: S^1 \to \mathbb{R} \) by \( g(x) = f(x) - f(-x) \). Then \( g(-x) = f(-x) - f(x) = -g(x) \). Since \( g \) is continuous and odd, and \( S^1 \) is connected, the intermediate value theorem applies: if \( g(x_0) > 0 \) for some \( x_0 \), then \( g(-x_0) < 0 \), so \( g \) has a zero on any path from \( x_0 \) to \( -x_0 \). At this zero, \( f(x) = f(-x) \).
Inductive step. Assume the theorem holds for \( n-1 \). Let \( f: S^n \to \mathbb{R}^n \) be continuous. Define \( g: S^n \to \mathbb{R}^n \) by \( g(x) = f(x) - f(-x) \), which is continuous and odd. Suppose for contradiction that \( g(x) \neq 0 \) for all \( x \). Then \( h(x) = g(x)/\|g(x)\| \) is a continuous odd map \( h: S^n \to S^{n-1} \).
Restrict \( h \) to the equator \( S^{n-1} = \{ x \in S^n : x_{n+1} = 0 \} \). This gives a continuous odd map \( h|_{S^{n-1}}: S^{n-1} \to S^{n-1} \). By the Borsuk-Ulam theorem applied to the identity, any continuous odd map from \( S^{n-1} \) to \( S^{n-1} \) has odd degree (this is the key topological fact, provable via homology or by the theory of the degree of equivariant maps). In particular, \( h|_{S^{n-1}} \) is not null-homotopic.
But \( h \) extends to the upper hemisphere \( S^n_+ = \{x \in S^n : x_{n+1} \geq 0\} \), which is contractible (homeomorphic to the closed \( n \)-disk). So \( h|_{S^{n-1}} \) extends to a map from the disk \( D^n \to S^{n-1} \), meaning \( h|_{S^{n-1}} \) is null-homotopic — a contradiction.
Thus \( g \) has a zero, meaning \( f(x) = f(-x) \) for some \( x \in S^n \).
An immediate and beautiful consequence:
Another classical consequence:
6.2 The Ham-Sandwich Theorem Revisited
We already proved the ham-sandwich theorem in Chapter 3 using Borsuk-Ulam. Let us note some generalizations.
For \( k = 1 \), this recovers the classical ham-sandwich theorem (a hyperplane bisects \( d \) measures).
6.3 Lovász’s Proof of Kneser’s Conjecture
One of the most celebrated applications of the Borsuk-Ulam theorem in combinatorics is Lovász’s 1978 proof of Kneser’s conjecture, which determined the chromatic number of the Kneser graph.
Martin Kneser conjectured in 1955 that \( \chi(KG(n,k)) = n - 2k + 2 \). The upper bound is easy: color a \( k \)-set \( A \) by \( \min(A) \) if \( \min(A) \leq n - 2k + 1 \), and by \( n - 2k + 2 \) otherwise. This uses \( n - 2k + 2 \) colors and is a proper coloring.
Identify the ground set with \( [n] = \{1, 2, \ldots, n\} \), and place the elements of \( [n] \) on the sphere \( S^{n-2} \) in general position: choose points \( v_1, \ldots, v_n \in S^{n-2} \) such that no \( n-1 \) of them lie in a closed hemisphere (this is possible for \( n \geq 2 \) points in \( S^{n-2} \) by a dimension argument).
For each point \( x \in S^{n-2} \), consider the open hemisphere \( H_x = \{ y \in S^{n-2} : x \cdot y > 0 \} \). By our general position assumption, \( H_x \) contains at least one of the \( v_i \). Define \( A(x) \subseteq [n] \) to be the indices of points in \( H_x \), i.e., \( A(x) = \{ i \in [n] : x \cdot v_i > 0 \} \). By general position, \( |A(x)| \geq 1 \) for all \( x \).
\[ f(x) = e_{c(B(x))}, \]the \( c(B(x)) \)-th standard basis vector in \( \mathbb{R}^{n-2k+1} \). If \( |A(x)| < k \), set \( f(x) = 0 \).
\[ f_j(x) = \max\left(0, \; \min_{i \in B(x)} (x \cdot v_i)\right) \quad \text{times} \quad [\![c(B(x)) = j]\!], \]summed over appropriate subsets, and made continuous and antipodal by construction. The precise technical details (using a partition of unity based on the inner products \( x \cdot v_i \)) show that one can construct a continuous map \( f: S^{n-2} \to \mathbb{R}^{n-2k+1} \) with the property that if \( f(x) = f(-x) \), then the \( k \)-sets associated to \( x \) and \( -x \) receive the same color yet are disjoint.
By the Borsuk-Ulam theorem, there exists \( x^* \) with \( f(x^*) = f(-x^*) \). The sets \( A(x^*) \) and \( A(-x^*) \) are disjoint (since \( H_{x^*} \) and \( H_{-x^*} \) are complementary open hemispheres), so the \( k \)-subsets \( B(x^*) \subseteq A(x^*) \) and \( B(-x^*) \subseteq A(-x^*) \) are disjoint. But \( f(x^*) = f(-x^*) \) forces \( c(B(x^*)) = c(B(-x^*)) \), contradicting the properness of the coloring.
Hence \( \chi(KG(n,k)) \geq n - 2k + 2 \).
6.4 The Topological Tverberg Theorem
Tverberg’s theorem generalizes Radon’s theorem: while Radon partitions \( d+2 \) points into 2 parts with intersecting hulls, Tverberg considers \( r \) parts.
The topological version, conjectured by Bárány, Shlosman, and Szücs, strengthens this by replacing “points” with “images of vertices of a simplex.”
The proof uses equivariant topology — specifically, the theory of \( \mathbb{Z}/p \)-equivariant maps and the Borsuk-Ulam theorem for cyclic groups. For non-prime-power \( r \), counterexamples were found by Mabillard and Wagner (2015) for \( d \geq 3r + 1 \), and subsequently by Frick, Avvakumov-Karasev-Skopenkov, and Blagojević-Frick-Ziegler for all non-prime-power \( r \geq 6 \).
6.5 The Necklace Splitting Problem
The case \( k = 2 \) follows directly from the Borsuk-Ulam theorem: we need at most \( t \) cuts to divide a necklace with \( t \) colors fairly between two thieves. The proof maps cutting positions to points on \( S^t \) and applies Borsuk-Ulam to show a fair partition exists.
6.6 Borsuk’s Conjecture and Its Disproof
In 1933, Borsuk asked whether every bounded set in \( \mathbb{R}^d \) can be partitioned into \( d+1 \) sets of smaller diameter. This became known as Borsuk’s conjecture.
The construction uses the vertex set of a cleverly chosen subset of \( \{0,1\}^n \), exploiting the “concentration of measure” phenomenon on the Boolean cube. Specifically, they use a certain union of binary codes and show, using Frankl-Wilson-type intersection theorems, that any partition into pieces of smaller diameter must use exponentially many pieces.
6.7 Equivariant Topology
The common thread running through the applications in this chapter is equivariant topology: the study of topological spaces with group actions and maps that respect those actions.
The Borsuk-Ulam theorem can be restated as: there is no \( \mathbb{Z}/2 \)-equivariant map \( S^n \to S^{n-1} \), where \( \mathbb{Z}/2 \) acts by the antipodal map. This viewpoint generalizes to actions of \( \mathbb{Z}/p \), the symmetric group \( S_r \), and other groups, leading to deeper results like the topological Tverberg theorem.
The index (or genus) of a \( \mathbb{Z}/2 \)-space \( X \) is the smallest \( n \) such that there exists a \( \mathbb{Z}/2 \)-equivariant map \( X \to S^n \). The Borsuk-Ulam theorem says the index of \( S^n \) is \( n \). Computing or bounding the index of specific spaces is the key technique for deriving combinatorial applications.
7. Algebraic Methods and Modern Developments
The last two decades have seen a revolution in combinatorial geometry driven by algebraic methods — particularly the polynomial method. This chapter presents the key results: the joints problem, the Guth-Katz polynomial partitioning, Dvir’s proof of the finite field Kakeya conjecture, and connections to additive combinatorics.
7.1 The Polynomial Method
The polynomial method, in its modern incarnation, exploits the fact that low-degree polynomials cannot vanish on too many “independent” points. The key algebraic ingredients are:
7.2 The Joints Problem
A joint formed by a set of lines in \( \mathbb{R}^3 \) is a point where three or more non-coplanar lines meet. The joints problem, raised by Chazelle et al. in 1992, asks: given \( n \) lines in \( \mathbb{R}^3 \), how many joints can they form?
Suppose \( |J| > C n^{3/2} \) for a large constant \( C \) to be determined. Choose \( D \) such that \( \binom{D+3}{3} = \frac{(D+1)(D+2)(D+3)}{6} > |J| \) but \( \binom{D+2}{3} \leq |J| \). This gives \( D = \Theta(|J|^{1/3}) = \Theta(C^{1/3} n^{1/2}) \).
By parameter counting, there exists a non-zero polynomial \( f \) of degree at most \( D \) vanishing on all points of \( J \). We now derive a contradiction.
Claim: Every line \( \ell \in \mathcal{L} \) is contained in the zero set \( Z(f) = \{x : f(x) = 0\} \).
Proof of claim: The polynomial \( f \) restricted to \( \ell \) is a univariate polynomial of degree at most \( D \). The line \( \ell \) contains joints, and at each joint at least two other non-coplanar lines of \( \mathcal{L} \) pass through. The number of joints on \( \ell \) might be more than \( D \) if \( C \) is large enough (we will verify this), in which case \( f|_\ell = 0 \) identically, meaning \( \ell \subseteq Z(f) \).
To verify: the average number of joints per line is \( |J| \cdot (\text{average number of lines through a joint}) / n \geq 3|J|/n \), since each joint involves at least 3 lines. Actually, we argue differently: the number of incidences between lines and joints is at least \( 3|J| \). By averaging, some line contains at least \( 3|J|/n > 3C\sqrt{n} \) joints. But we need all lines to be in \( Z(f) \).
Consider a line \( \ell \) with \( k \) joints on it. If \( k > D \), then \( \ell \subseteq Z(f) \). For the remaining lines with \( k \leq D \) joints, we consider what happens if they are not in \( Z(f) \). Let \( J' \) be the set of joints on lines not contained in \( Z(f) \). Each such line contributes at most \( D \) joints to \( J' \), so \( |J'| \leq nD \).
Now, at each joint \( p \in J \setminus J' \), all lines through \( p \) are contained in \( Z(f) \). Since at least 3 non-coplanar lines through \( p \) lie in \( Z(f) \), the gradient \( \nabla f(p) \) is perpendicular to all tangent directions of these lines at \( p \). But 3 non-coplanar directions span \( \mathbb{R}^3 \), so \( \nabla f(p) = 0 \). Thus \( p \) is a singular point of \( f \).
Consider \( \nabla f = (\partial_1 f, \partial_2 f, \partial_3 f) \). Each \( \partial_i f \) has degree at most \( D - 1 \). The line \( \ell \subseteq Z(f) \) lies in \( Z(\partial_i f) \) as well only if \( \partial_i f \) vanishes on \( \ell \). If some \( \partial_i f \) is non-zero and does not vanish on \( \ell \), then \( \partial_i f|_\ell \) is a non-zero polynomial of degree at most \( D-1 \), so it has at most \( D-1 \) zeros on \( \ell \). Thus at most \( D-1 \) points of \( \ell \) have \( \nabla f = 0 \).
Iterate: replace \( f \) by a component of \( \nabla f \), and repeat the argument. At each stage, the degree decreases by 1. After at most \( D \) iterations, we reach a constant, and the conclusion is that the number of joints is at most \( O(nD) = O(n \cdot C^{1/3} n^{1/2}) = O(C^{1/3} n^{3/2}) \). Choosing \( C \) large enough yields a contradiction with \( |J| > Cn^{3/2} \).
More carefully: we prove by induction on degree that if \( f \) is a non-zero polynomial of degree \( D \) vanishing on all joints, and every line of \( \mathcal{L} \) lies in \( Z(f) \), then the number of joints where \( \nabla f = 0 \) is the number of “real” joints, and these satisfy \( |J| \leq n(D-1) + |J'| \) where \( |J'| \) counts joints of the partial derivatives. Unraveling gives \( |J| \leq nD + n(D-1) + \cdots \leq nD^2/2 \). With \( D = \Theta(n^{1/2}) \), this gives \( |J| = O(n^{3/2}) \).
7.3 Polynomial Partitioning
The polynomial partitioning theorem is the main technical engine behind the Guth-Katz proof of the distinct distances theorem.
The proof uses the Stone-Tukey “polynomial ham-sandwich theorem” (itself a consequence of Borsuk-Ulam) iteratively: bisect the point set with a hyperplane, then bisect each half with another hyperplane, etc. After \( k \) rounds, we have \( 2^k \) cells. Translating to polynomial partitions via the product of the hyperplane equations gives a polynomial of degree \( k \) partitioning the points into \( 2^k \) roughly equal parts. Generalizing from hyperplanes to polynomials of degree \( D \) gives cells of \( \mathbb{R}^d \setminus Z(f) \), and the number of cells is \( O(D^d) \) by a result of Warren (or the Milnor-Thom bound on real algebraic varieties).
7.4 Dvir’s Proof of the Finite Field Kakeya Conjecture
The Kakeya problem asks about the minimum size of a set containing a unit line segment in every direction. In finite fields, the analogous question was resolved by Zeev Dvir in 2009 with a stunningly short proof.
Let \( D = \deg(f) \leq q - 1 \), and let \( f_D \) denote the homogeneous component of \( f \) of degree \( D \) (the leading form). We show \( f_D \) vanishes on all of \( \mathbb{F}_q^d \setminus \{0\} \).
Fix any direction \( v \in \mathbb{F}_q^d \setminus \{0\} \). Since \( K \) is a Kakeya set, there exists \( a \in \mathbb{F}_q^d \) such that \( f(a + tv) = 0 \) for all \( t \in \mathbb{F}_q \). But \( g(t) = f(a + tv) \) is a univariate polynomial of degree at most \( D \leq q-1 \) that vanishes at all \( q \) elements of \( \mathbb{F}_q \). Since \( \deg(g) \leq q - 1 < q \) yet \( g \) has \( q \) roots, we must have \( g \equiv 0 \) (or more precisely, the leading coefficient of \( g \) is zero if \( \deg(g) < q \)).
The leading coefficient of \( g(t) = f(a + tv) \) is \( f_D(v) \) (the top-degree part evaluated at \( v \)). Since \( g \) has \( q > D \) roots in \( \mathbb{F}_q \), and its degree is at most \( D \), we must have \( g \equiv 0 \). In particular, the coefficient of \( t^D \) is zero: \( f_D(v) = 0 \).
Since this holds for every \( v \in \mathbb{F}_q^d \setminus \{0\} \), the polynomial \( f_D \) vanishes on all of \( \mathbb{F}_q^d \setminus \{0\} \). But \( f_D \) is homogeneous and vanishes at \( 0 \) as well (since \( f_D(0) = 0 \) for any homogeneous polynomial of positive degree). So \( f_D \) vanishes on all of \( \mathbb{F}_q^d \).
Now, the Schwartz-Zippel lemma states that a non-zero polynomial of degree \( D \leq q-1 \) can vanish on at most \( D \cdot q^{d-1} < q^d \) points of \( \mathbb{F}_q^d \). Since \( f_D \) vanishes on all \( q^d \) points, \( f_D \equiv 0 \), contradicting the assumption that \( f_D \) is the leading homogeneous part of \( f \) (which is non-zero by definition of degree).
Therefore \( |K| \geq \binom{q-1+d}{d} \geq q^d / d! \).
7.5 The Polynomial Ham-Sandwich Theorem
This result follows from the Borsuk-Ulam theorem applied to the space of polynomials of degree at most \( D \), viewed as a linear space with the \( \mathbb{Z}/2 \)-action \( f \mapsto -f \).
7.6 The Elekes-Rónyai Problem
The original conjecture of Elekes and Rónyai (2000) was that the bound should be \( \Omega(n^2) \) (matching the trivial upper bound from \( |A \times B| = n^2 \)) minus lower-order terms, but the \( n^{3/2} \) bound proved by Raz, Sharir, and Solymosi uses the Szemerédi-Trotter theorem and is the current state of the art.
7.7 Sum-Product Estimates via Geometry
The sum-product problem, first studied by Erdős and Szemerédi (1983), lies at the heart of additive combinatorics and has deep connections to incidence geometry.
which implies \( \max(|A+A|, |A \cdot A|) = \Omega(|A|^{5/4}) \). This was the first super-linear bound and demonstrated the power of geometric methods in additive combinatorics.
The best current bound is \( \max(|A+A|, |A \cdot A|) = \Omega(|A|^{4/3 + c}) \) for a small constant \( c > 0 \), due to Rudnev, Shkredov, and Stevens (2020). The full conjecture remains open and is one of the central problems in combinatorics.
7.8 Geometric Ramsey Theory
Ramsey theory in a geometric setting asks: what geometric configurations must appear in sufficiently large point sets?
The original proof by Erdős and Szekeres gave \( \operatorname{ES}(k) \leq \binom{2k-4}{k-2} + 1 = O(4^k / \sqrt{k}) \). This was improved by Suk (2017) to \( \operatorname{ES}(k) \leq 2^{k + o(k)} \), nearly matching the conjectured lower bound of \( 2^{k-2} + 1 \).
Another important result in geometric Ramsey theory concerns monochromatic distances:
The chromatic number of the plane — the minimum number of colors needed to color \( \mathbb{R}^2 \) so that no two points at distance 1 share a color — is known to satisfy \( 5 \leq \chi(\mathbb{R}^2) \leq 7 \). The lower bound of 5 was proved by de Grey (2018), improving the previous bound of 4, and the upper bound of 7 comes from a hexagonal tiling coloring. Determining the exact value remains a famous open problem.
7.9 Further Directions
The polynomial method and its interactions with topology, combinatorics, and analysis continue to drive rapid progress. Notable recent developments include:
The interplay between algebra, topology, probability, and geometry exemplified throughout these notes is one of the most vibrant areas of modern mathematics, and the tools developed here — from Helly’s theorem to polynomial partitioning — form an essential foundation for further study.