PMATH 867: Geometric Group Theory
Estimated study time: 1 hr 58 min
Table of contents
These notes synthesize material from M. Bridson and A. Haefliger’s Metric Spaces of Non-Positive Curvature, C. Drutu and M. Kapovich’s Geometric Group Theory, C. Löh’s Geometric Group Theory: An Introduction, and P. de la Harpe’s Topics in Geometric Group Theory, enriched with material from B. Bowditch’s course notes and M. Bestvina’s lecture notes.
Chapter 1: Groups as Geometric Objects
The central insight of geometric group theory is that finitely generated groups are themselves geometric objects. Rather than studying groups purely through their algebraic structure — subgroups, quotients, homomorphisms — we equip them with metrics arising from their generators and study them through the lens of large-scale geometry. This perspective, crystallized by Gromov in his landmark 1987 paper on hyperbolic groups, transformed the landscape of combinatorial and geometric group theory by revealing that coarse geometric properties of groups carry deep algebraic information.
1.1 Cayley Graphs and Word Metrics
To view a group as a geometric object, we need to assign it a metric space structure. The classical construction achieving this is the Cayley graph.
The Cayley graph is a connected graph (since \(S\) generates \(G\)), and it admits a natural left action of \(G\) by graph automorphisms: for each \(h \in G\), the map \(g \mapsto hg\) sends edges to edges. This action is free and transitive on vertices, so the Cayley graph is vertex-transitive.
We metrize the Cayley graph by declaring each edge to have length one and taking the induced path metric. Restricting this metric to the vertex set gives us the word metric on the group.
The word metric is left-invariant: \(d_S(hg, hg') = d_S(g, g')\) for all \(h, g, g' \in G\). This follows immediately from the fact that \(h\) acts by isometries on the Cayley graph.
A natural concern is that the Cayley graph and word metric depend on the choice of generating set. Different generating sets produce genuinely different metric spaces. However, as we shall see, the large-scale geometry remains the same — and this is made precise by the notion of quasi-isometry.
1.2 Quasi-Isometries
The appropriate notion of equivalence for the large-scale geometry of metric spaces is not isometry but something much coarser: quasi-isometry. This is the key notion that allows us to speak of geometric properties of a group independent of the choice of generators.
Quasi-isometry is an equivalence relation on metric spaces (given appropriate conditions on the spaces). A quasi-inverse of a quasi-isometry \(f: X \to Y\) is a quasi-isometry \(\bar{f}: Y \to X\) such that \(d_X(\bar{f} \circ f(x), x)\) and \(d_Y(f \circ \bar{f}(y), y)\) are uniformly bounded.
This proposition justifies speaking of “the large-scale geometry of \(G\)” without specifying a generating set. Two groups are quasi-isometric if and only if their Cayley graphs (for any choice of finite generating sets) are quasi-isometric as metric spaces.
1.3 The Milnor-Schwarz Lemma
The Milnor-Schwarz lemma (also called the Schwarz-Milnor lemma or the fundamental observation of geometric group theory) is the bridge between the geometry of spaces and the geometry of groups acting on them. It was proved independently by Milnor (1968) and Schwarz (1955).
Indeed, let \(g \in G\). Since \(X\) is geodesic, let \(\gamma\) be a geodesic from \(x_0\) to \(g \cdot x_0\). Choose points \(x_0 = y_0, y_1, \ldots, y_n = g \cdot x_0\) along \(\gamma\) with \(d(y_i, y_{i+1}) \le R\). By cocompactness, for each \(y_i\) there exists \(g_i \in G\) with \(d(g_i \cdot x_0, y_i) \le R\). Setting \(g_0 = e\) and \(g_n = g\), we see that \(d(g_i \cdot x_0, g_{i+1} \cdot x_0) \le 3R\), so \(g_i^{-1} g_{i+1} \in S \cup \{e\}\) (after possibly enlarging \(R\)). Thus \(g = (g_0^{-1} g_1)(g_1^{-1} g_2) \cdots (g_{n-1}^{-1} g_n)\) is a product of at most \(n\) elements of \(S \cup S^{-1}\), showing \(S\) generates and \(|g|_S \le n \le d(x_0, g \cdot x_0)/R + 1\).
\[ d(x_0, g \cdot x_0) \le \sum_{i=1}^n d(s_1 \cdots s_{i-1} \cdot x_0, s_1 \cdots s_i \cdot x_0) = \sum_{i=1}^n d(x_0, s_i \cdot x_0) \le 3Rn. \]Combining both bounds shows the orbit map \(g \mapsto g \cdot x_0\) is a quasi-isometric embedding. Coarse surjectivity follows from cocompactness: every point of \(X\) lies within distance \(R\) of the orbit \(G \cdot x_0\). \(\blacksquare\)
The Milnor-Schwarz lemma has far-reaching consequences. It tells us that the quasi-isometry class of a group captures not just algebraic information, but also the large-scale geometry of any space on which the group acts nicely.
1.4 Quasi-Isometry Invariants
A property or quantity associated to metric spaces (or groups) is a quasi-isometry invariant if it is preserved under quasi-isometries. Identifying quasi-isometry invariants is one of the central goals of geometric group theory, as they allow us to distinguish groups up to quasi-isometry and to transfer geometric information between a group and the spaces on which it acts.
- Being finitely presented.
- Being virtually nilpotent, virtually abelian, virtually free, or virtually cyclic.
- The number of ends.
- Hyperbolicity (in the sense of Gromov).
- The growth type (polynomial, exponential, or intermediate).
- Amenability.
- The isoperimetric function (Dehn function), up to the natural equivalence relation.
We will encounter most of these invariants in subsequent chapters. For now, let us focus on the growth of groups.
1.5 Growth of Groups
The growth function of a finitely generated group measures how the sizes of metric balls in the Cayley graph increase with radius. This notion was introduced by Schwarz (1955) and studied systematically by Milnor (1968) and Wolf (1968).
Although \(\beta_S\) depends on the generating set, its asymptotic behavior does not. Two functions \(f, g: \mathbb{N} \to \mathbb{R}_+\) have the same growth type if there exists \(C > 0\) such that \(f(n) \le g(Cn)\) and \(g(n) \le f(Cn)\) for all \(n\). This is an equivalence relation, and the growth type of \(\beta_S\) is independent of \(S\) — indeed, it is a quasi-isometry invariant.
- Polynomial growth (of degree \(d\)) if \(\beta_S(n) \preceq n^d\) for some \(d\) (i.e., there exists \(C > 0\) with \(\beta_S(n) \le C n^d\) for all \(n\)).
- Exponential growth if \(\beta_S(n) \succeq e^n\) (i.e., there exist \(c, a > 0\) with \(\beta_S(n) \ge c \cdot a^n\) for all \(n\), where \(a > 1\)).
- Intermediate growth if it has neither polynomial nor exponential growth.
The classification of growth types is intimately related to the algebraic structure of the group. Milnor and Wolf proved in the late 1960s:
In particular, solvable groups have no intermediate growth. This result was dramatically generalized by Gromov:
Gromov’s proof is a tour de force, using the Milnor-Schwarz lemma to pass from the group to an asymptotic cone, showing the asymptotic cone is locally compact and finite-dimensional, appealing to the solution of Hilbert’s fifth problem (Montgomery-Zippin) to conclude the asymptotic cone is a Lie group, and then using structural results for Lie groups. A simpler proof was later given by Kleiner (2010) using harmonic functions on groups.
An outstanding question in the 1960s and 1970s was whether intermediate growth actually occurs. This was answered affirmatively by Grigorchuk:
The Grigorchuk group is defined as a group of automorphisms of the infinite rooted binary tree. Its existence settled several open problems simultaneously: it provided the first example of intermediate growth, a finitely generated infinite torsion group (answering the Burnside problem in the finitely generated setting), and an amenable group that is not elementary amenable.
Chapter 2: Free Groups and Presentations
Free groups are the most fundamental building blocks in combinatorial group theory. They play a role analogous to that of vector spaces in linear algebra or polynomial rings in commutative algebra: every group is a quotient of a free group, and the study of how groups are presented as quotients of free groups leads directly to the word problem and its geometric manifestations through van Kampen diagrams and Dehn functions.
2.1 Free Groups
The existence of free groups can be established via reduced words. Let \(X^{-1} = \{x^{-1} : x \in X\}\) be a formal set of inverses, and set \(X^{\pm 1} = X \cup X^{-1}\). A word \(w = a_1 a_2 \cdots a_n\) in the alphabet \(X^{\pm 1}\) is reduced if it contains no subword of the form \(x x^{-1}\) or \(x^{-1} x\). The set of reduced words forms a group under concatenation followed by free reduction, and this group satisfies the universal property above.
The algebraic structure of free groups reflects their geometric nature: \(F_n\) acts freely and cocompactly on a tree (its Cayley graph), and this tree structure is responsible for many of the remarkable properties of free groups.
This formula has the initially surprising consequence that subgroups of free groups can have higher rank than the ambient group: for instance, the commutator subgroup \([F_2, F_2]\) has infinite rank.
2.2 Group Presentations
- \(\mathbb{Z} = \langle a \mid \rangle\), the infinite cyclic group.
- \(\mathbb{Z}/n\mathbb{Z} = \langle a \mid a^n \rangle\).
- \(\mathbb{Z}^2 = \langle a, b \mid aba^{-1}b^{-1} \rangle\).
- The fundamental group of a closed orientable surface of genus \(g\): \(\pi_1(\Sigma_g) = \langle a_1, b_1, \ldots, a_g, b_g \mid [a_1, b_1] \cdots [a_g, b_g] \rangle\).
- Adding a relator that is a consequence of the existing relators.
- Removing a relator that is a consequence of the remaining relators.
- Adding a new generator \(y\) and a relator \(y = w\) where \(w\) is a word in the existing generators.
- Removing a generator \(y\) and a relator \(y = w\), substituting \(w\) for \(y\) in all other relators.
2.3 Dehn’s Problems
In 1911, Max Dehn posed three fundamental decision problems for finitely presented groups:
- The word problem: Given a word \(w\) in \(X^{\pm 1}\), determine whether \(w =_G e\).
- The conjugacy problem: Given words \(u, v\) in \(X^{\pm 1}\), determine whether \(u\) and \(v\) are conjugate in \(G\).
- The isomorphism problem: Given two finite presentations, determine whether they define isomorphic groups.
These problems are of central importance in both algebra and topology. The word problem for \(\pi_1(M)\) is equivalent to determining whether a closed loop in \(M\) is null-homotopic; the isomorphism problem for fundamental groups of manifolds is closely related to the homeomorphism problem.
A landmark result of Novikov (1955) and Boone (1959) shows that the word problem is in general undecidable:
Similarly, Adyan and Rabin showed that the isomorphism problem is unsolvable in general. However, for many geometrically defined classes of groups (hyperbolic groups, CAT(0) groups, automatic groups), the word problem is solvable, and geometric group theory provides the tools to prove this.
2.4 Van Kampen Diagrams
Van Kampen diagrams are the bridge between the combinatorial word problem and geometry. They provide a geometric way to certify that a word represents the identity in a group.
- For each 2-cell (face) \(f\) of \(D\), the boundary label of \(f\) (read starting from some vertex and going around the boundary) is a cyclic permutation of a word in \(R \cup R^{-1}\).
- Reading the boundary label of the outer boundary of \(D\) gives a word \(w\).
The number of 2-cells in a van Kampen diagram for \(w\) is called its area. The minimal area over all diagrams for \(w\) is denoted \(\mathrm{Area}(w)\), and it equals the minimal number of conjugates of relators needed to express \(w\) as a product in \(F(X)\).
2.5 Dehn Functions and Isoperimetric Inequalities
The Dehn function measures the worst-case difficulty of the word problem: to certify that a word of length \(n\) represents the identity, one needs at most \(\delta(n)\) applications of relations. Two Dehn functions \(\delta_1, \delta_2\) are equivalent (written \(\delta_1 \simeq \delta_2\)) if each is bounded above by a linear function of the other (with an appropriate rescaling of the argument). The equivalence class of the Dehn function is independent of the presentation and is a quasi-isometry invariant.
The terminology “isoperimetric inequality” comes from the analogy with Riemannian geometry: the Dehn function bounds the area of a filling disk in terms of the length of the boundary curve, just as an isoperimetric inequality in a Riemannian manifold bounds the area of a minimal surface in terms of the length of its boundary.
2.6 Small Cancellation Theory
Small cancellation theory, developed by Tartakovskii, Greendlinger, Lyndon, and Schupp in the 1960s, provides geometric conditions on presentations that guarantee good algorithmic and structural properties. The basic idea is that if the relators in a presentation have little overlap (small cancellation), then van Kampen diagrams have special combinatorial properties that can be exploited.
- The \(C'(\lambda)\) condition if every piece \(p\) that is a subword of a relator \(r \in R\) satisfies \(|p| < \lambda |r|\).
- The \(T(q)\) condition if for any \(3 \le k < q\) and any relators \(r_1, \ldots, r_k \in R\) with \(r_i \neq r_{i+1}^{-1}\), at least one of the consecutive products \(r_i r_{i+1}\) is freely reduced as written.
The classical \(C'(1/6)\) condition (or more generally \(C'(1/6)\)-\(T(3)\)) is particularly powerful:
- The group \(G\) is torsion-free (if all relators have length \(\ge 2\)).
- The Dehn function of \(G\) is linear.
- The word and conjugacy problems for \(G\) are solvable.
Greendlinger’s lemma implies that \(C'(1/6)\) groups satisfy a linear isoperimetric inequality. Moreover, it provides Dehn’s algorithm for the word problem: given a word \(w\), search for a subword \(s\) of \(w\) that is more than half of some relator; if found, replace \(s\) by the complementary shorter half of the relator, reducing the length of \(w\). Repeat. If no such subword exists and \(w\) is non-empty, then \(w \neq_G e\). This algorithm terminates in at most \(|w|\) steps.
As we shall see in Chapter 4, \(C'(1/6)\) groups are hyperbolic, and Dehn’s algorithm generalizes to all hyperbolic groups.
Chapter 3: Bass-Serre Theory
Bass-Serre theory, developed by Jean-Pierre Serre in his 1977 book Arbres, amalgames, \(\mathrm{SL}_2\) (with algebraic formalization by Hyman Bass), provides a systematic framework for studying groups acting on trees. The theory gives a complete dictionary between group actions on trees and algebraic decompositions of groups as fundamental groups of graphs of groups. It unifies and generalizes the classical constructions of free products with amalgamation and HNN extensions.
3.1 Group Actions on Trees
A tree is a connected graph with no cycles. Equivalently, a connected graph \(T\) is a tree if and only if any two vertices are connected by a unique reduced edge path. Trees are the simplest examples of CAT(0) spaces (as we will see in Chapter 5), and they play a distinguished role in geometric group theory.
- The action is trivial if \(G\) fixes a vertex.
- An element \(g \in G\) is elliptic if it fixes a vertex.
- An element \(g \in G\) is hyperbolic if it fixes no vertex; in this case, \(g\) translates along a unique bi-infinite geodesic line \(\ell_g \subseteq T\) (its axis).
3.2 Free Products with Amalgamation
The structure of amalgamated products is revealed by normal forms:
3.3 HNN Extensions
The second fundamental construction in Bass-Serre theory is the HNN extension, named after Higman, Neumann, and Neumann who introduced it in 1949.
3.4 Graphs of Groups
Both amalgamated free products and HNN extensions are special cases of a more general construction: the fundamental group of a graph of groups.
- An underlying graph \(\Gamma\) (possibly with loops and multiple edges).
- For each vertex \(v\) of \(\Gamma\), a vertex group \(G_v\).
- For each oriented edge \(e\) of \(\Gamma\) (from \(\alpha(e)\) to \(\omega(e)\)), an edge group \(G_e\) together with injective homomorphisms \(\varphi_e^-: G_e \hookrightarrow G_{\alpha(e)}\) and \(\varphi_e^+: G_e \hookrightarrow G_{\omega(e)}\).
- All relations within each vertex group \(G_v\).
- \(\varphi_e^-(c) = \varphi_e^+(c)\) for each edge \(e\) in \(T\) and \(c \in G_e\).
- \(t_e \varphi_e^-(c) t_e^{-1} = \varphi_e^+(c)\) for each edge \(e\) not in \(T\) and \(c \in G_e\).
The case where \(\Gamma\) is a single edge gives an amalgamated free product, and the case where \(\Gamma\) has one vertex and one loop edge gives an HNN extension.
3.5 The Bass-Serre Tree
The fundamental theorem of Bass-Serre theory establishes a tight correspondence between graphs of groups and group actions on trees.
Conversely, if a group \(G\) acts on a tree \(T\) without inversions, then \(G\) is the fundamental group of the graph of groups \(\mathcal{G}\) whose underlying graph is \(G \backslash T\), with vertex and edge groups being the stabilizers of lifts of vertices and edges.
This correspondence is enormously powerful: it translates questions about the algebraic structure of group splittings into questions about group actions on trees.
3.6 Stallings’ Theorem on Ends of Groups
The number of ends of a group is a fundamental quasi-isometry invariant. Informally, it measures how many “directions to infinity” the Cayley graph has.
- \(e(G) = 0\) if and only if \(G\) is finite.
- \(e(G) = 2\) if and only if \(G\) is virtually \(\mathbb{Z}\).
- \(e(G) = 1\) or \(e(G) = \infty\) in all other cases.
Stallings’ theorem characterizes groups with more than one end, establishing a profound connection between this topological invariant and the algebraic structure:
- \(G\) splits as an amalgamated free product \(A *_C B\) where \(C\) is finite and \(A \neq C \neq B\).
- \(G\) splits as an HNN extension \(A *_\varphi\) where the associated subgroups are finite.
Stallings’ original proof used topological methods. A more modern proof proceeds by constructing an action of \(G\) on a tree from the combinatorics of the Cayley graph: the ends structure gives rise to a “structure tree” on which \(G\) acts with finite edge stabilizers, and Bass-Serre theory converts this into a splitting.
Chapter 4: Hyperbolic Groups
Hyperbolic groups, introduced by Gromov in his seminal 1987 essay Hyperbolic groups, are perhaps the most important class of groups in geometric group theory. They capture the coarse geometry of negatively curved spaces — including classical hyperbolic space, fundamental groups of negatively curved manifolds, and many combinatorially defined groups — in a purely metric framework that is robust under quasi-isometry.
4.1 Gromov Hyperbolicity
There are several equivalent formulations of hyperbolicity. One useful characterization uses the Gromov product:
The Gromov product measures how long geodesics from \(w\) to \(x\) and from \(w\) to \(y\) “fellow travel” before diverging. In a tree, \((x \cdot y)_w\) is exactly the distance from \(w\) to the branch point of the geodesics \([w,x]\) and \([w,y]\).
Since hyperbolicity is a quasi-isometry invariant of geodesic metric spaces, this definition is independent of the choice of generating set.
4.2 Examples of Hyperbolic Groups
- Finite groups: trivially \(0\)-hyperbolic.
- Free groups: The Cayley graph of \(F_n\) is a tree, which is \(0\)-hyperbolic (geodesic triangles are tripods).
- Surface groups: The fundamental group \(\pi_1(\Sigma_g)\) for \(g \ge 2\) is hyperbolic, since it acts properly cocompactly on the hyperbolic plane \(\mathbb{H}^2\), which is \(\delta\)-hyperbolic.
- Fundamental groups of closed negatively curved manifolds: by the Milnor-Schwarz lemma and the fact that simply connected manifolds of pinched negative curvature are Gromov hyperbolic.
- Small cancellation groups satisfying \(C'(1/6)\): as proved by Greendlinger's lemma and its consequences.
- Random groups: In the Gromov density model, a random group at density \(d < 1/2\) is hyperbolic with overwhelming probability (Gromov, Ollivier).
- \(\mathbb{Z}^2\) (and any group containing \(\mathbb{Z}^2\) as a subgroup), since its Cayley graph contains quasi-isometric copies of \(\mathbb{R}^2\), which has geodesic triangles that are not uniformly thin.
- The Baumslag-Solitar groups \(\mathrm{BS}(m,n)\) for \(|m|, |n| \ge 1\) and \((m,n) \neq (\pm 1, \pm 1)\), since they contain either \(\mathbb{Z}^2\) (when \(|m| = |n|\)) or have exponential Dehn function (when \(|m| \neq |n|\)).
- Any group with an unsolvable word problem.
4.3 The Boundary at Infinity
One of the most important features of hyperbolic spaces is the existence of a well-defined boundary at infinity, which carries a rich topological and metric structure.
The boundary \(\partial_\infty X\) can be topologized so that the compactification \(\overline{X} = X \cup \partial_\infty X\) is compact and metrizable. The topology is characterized by the property that a sequence of points \(x_n \in X\) converges to \(\xi \in \partial_\infty X\) if and only if \((x_n \cdot \xi)_w \to \infty\) for some (equivalently, any) basepoint \(w\).
- For a tree \(T\), the boundary \(\partial_\infty T\) is a Cantor set (if \(T\) is regular of valence \(\ge 3\)) or consists of two points (if \(T\) is a line).
- For the hyperbolic plane \(\mathbb{H}^2\), the boundary \(\partial_\infty \mathbb{H}^2 \cong S^1\) is the circle at infinity.
- For hyperbolic \(n\)-space \(\mathbb{H}^n\), the boundary is \(\partial_\infty \mathbb{H}^n \cong S^{n-1}\).
For a hyperbolic group \(G\), we define \(\partial_\infty G = \partial_\infty \mathrm{Cay}(G, S)\), which is well-defined up to homeomorphism. The group \(G\) acts on \(\partial_\infty G\) by homeomorphisms, and this action encodes significant algebraic information.
- \(\partial_\infty G = \emptyset\) if and only if \(G\) is finite.
- \(\partial_\infty G\) consists of exactly two points if and only if \(G\) is virtually \(\mathbb{Z}\).
- If \(G\) is one-ended (and infinite, non-virtually-cyclic), then \(\partial_\infty G\) is connected and locally connected.
4.4 Dehn’s Algorithm and the Linear Isoperimetric Inequality
The algebraic and algorithmic consequences of hyperbolicity are striking. The following theorem characterizes hyperbolic groups via their Dehn functions.
Conversely, if \(G\) satisfies a linear isoperimetric inequality, one shows that geodesic triangles in the Cayley graph are uniformly thin by analyzing the geometry of van Kampen diagrams with linear area. \(\blacksquare\)
The linear isoperimetric inequality has an immediate algorithmic consequence:
The proof of the conjugacy problem is considerably more involved and uses the classification of isometries of hyperbolic spaces into elliptic and hyperbolic types, along with the structure of centralizers.
4.5 The Rips Complex and Subgroups
This has strong consequences for the cohomological properties of hyperbolic groups. Regarding subgroups:
- Every abelian subgroup of \(G\) is virtually cyclic.
- Every solvable subgroup of \(G\) is virtually cyclic.
- \(G\) contains no subgroup isomorphic to \(\mathbb{Z}^2\).
- If \(G\) is torsion-free, then every non-trivial abelian subgroup is infinite cyclic.
4.6 Quasiconvex Subgroups
Quasiconvex subgroups are the “geometrically well-behaved” subgroups of hyperbolic groups. They enjoy many of the same properties as the ambient group.
- \(H\) is finitely generated and hyperbolic.
- The inclusion \(H \hookrightarrow G\) is a quasi-isometric embedding.
- The word problem for \(H\) is solvable (in the generating set inherited from \(G\)).
- The intersection of two quasiconvex subgroups is quasiconvex.
However, not all subgroups of hyperbolic groups are quasiconvex. The subgroup structure of hyperbolic groups can be extremely complex:
This remarkable result, proved by a sophisticated small cancellation construction, shows that while hyperbolic groups themselves have very good algorithmic properties, their subgroups can be arbitrarily complicated.
Chapter 5: CAT(0) Spaces and Groups
While hyperbolic groups capture the coarse geometry of negative curvature, CAT(0) spaces and groups provide a framework for non-positive curvature — a broader and in many ways richer theory. The notion of CAT(0) geometry, named by Gromov in honor of Cartan, Alexandrov, and Toponogov, axiomatizes the global geometry of simply connected Riemannian manifolds of non-positive sectional curvature through comparison geometry.
5.1 CAT(\(\kappa\)) Comparison Geometry
The key idea is to compare geodesic triangles in a given metric space with triangles in model spaces of constant curvature.
- \(M^2_0 = \mathbb{R}^2\) (Euclidean plane), or more generally \(\mathbb{R}^n\).
- \(M^2_\kappa = S^2(1/\sqrt{\kappa})\) for \(\kappa > 0\) (sphere of radius \(1/\sqrt{\kappa}\)).
- \(M^2_\kappa = \mathbb{H}^2(-1/\sqrt{-\kappa})\) for \(\kappa < 0\) (hyperbolic plane of curvature \(\kappa\)).
For a point \(p\) on a side \([x,y]\) of \(\Delta\), the comparison point \(\bar{p}\) is the point on \([\bar{x}, \bar{y}]\) with \(d(\bar{x}, \bar{p}) = d(x, p)\).
\[ d(p, q) \le d_{M^2_\kappa}(\bar{p}, \bar{q}). \]In words: triangles in a CAT(\(\kappa\)) space are “thinner” than triangles of the same side lengths in the model space \(M^2_\kappa\). For \(\kappa = 0\), this means triangles are thinner than Euclidean triangles — a global non-positive curvature condition.
5.2 CAT(0) Spaces (Hadamard Spaces)
CAT(0) spaces are the most important case for geometric group theory. They are also known as Hadamard spaces when they are complete.
- Unique geodesics: For any two points \(x, y \in X\), there is a unique geodesic segment \([x,y]\).
- Convexity of the distance function: If \(\gamma, \sigma: [0,1] \to X\) are geodesics, then the function \(t \mapsto d(\gamma(t), \sigma(t))\) is convex.
- Projection to convex subsets: For any non-empty closed convex subset \(C \subseteq X\) and any point \(x \in X\), there exists a unique point \(\pi_C(x) \in C\) nearest to \(x\), and the projection \(\pi_C: X \to C\) is a distance-non-increasing retraction.
- Fixed point theorem (Bruhat-Tits)): If a group \(G\) acts on \(X\) by isometries and has a bounded orbit, then \(G\) has a fixed point.
- Euclidean space \(\mathbb{R}^n\) is CAT(0) (and CAT(\(\kappa\)) for all \(\kappa \ge 0\)).
- Hyperbolic space \(\mathbb{H}^n\) is CAT(\(\kappa\)) for all \(\kappa \le 0\), hence also CAT(0).
- Trees are CAT(0) (in fact, they are CAT(\(\kappa\)) for all \(\kappa\)).
- Simply connected Riemannian manifolds of sectional curvature \(\le 0\) are CAT(0) (Cartan-Hadamard theorem).
- Products of CAT(0) spaces are CAT(0).
A fundamental tool for constructing CAT(0) spaces from smaller pieces is:
This gives a local-to-global principle: to show a space is CAT(0), it suffices to verify the CAT(0) condition locally (provided the space is simply connected).
5.3 The Flat Torus Theorem
The flat torus theorem has important consequences for understanding the algebraic structure of CAT(0) groups. For instance, it implies that every abelian subgroup of a CAT(0) group that acts properly and cocompactly is finitely generated (of rank at most the dimension of the CAT(0) space).
5.4 CAT(0) Cube Complexes
CAT(0) cube complexes have emerged as one of the most important and versatile tools in modern geometric group theory, playing a central role in Agol’s proof of the virtual Haken conjecture and Wise’s work on residual finiteness.
This combinatorial criterion for the CAT(0) condition is remarkably useful in practice, as it reduces a metric condition to a purely combinatorial one.
5.5 Hyperplanes and Sageev’s Construction
The geometry of CAT(0) cube complexes is governed by their hyperplanes, a structure with no direct analogue in general CAT(0) spaces.
- \(H\) separates \(X\) into exactly two connected components (halfspaces).
- \(H\) is convex in \(X\).
- Two hyperplanes either are disjoint, cross (intersect transversally), or osculate (share a cube but do not cross).
- The combinatorial distance between two vertices equals the number of hyperplanes separating them.
Sageev’s construction (1995) provides a powerful machine for building CAT(0) cube complexes from group actions:
This construction has been spectacularly successful. Wise and Agol used it to prove:
5.6 CAT(0) Groups and Their Properties
- \(G\) is finitely presented.
- \(G\) has solvable word problem.
- \(G\) has a finite-dimensional classifying space \(BG\) (in particular, \(G\) is of type \(FP_\infty\)).
- Every abelian subgroup of \(G\) is finitely generated.
- \(G\) satisfies a quadratic isoperimetric inequality (Dehn function \(\delta(n) \preceq n^2\)).
- \(G\) has finitely many conjugacy classes of finite subgroups.
5.7 The Visual Boundary of CAT(0) Spaces
Unlike in the hyperbolic case, the visual boundary of a CAT(0) space is not a quasi-isometry invariant — there exist quasi-isometric CAT(0) spaces with non-homeomorphic boundaries. This is one of the key differences between CAT(0) and hyperbolic geometry.
5.8 Right-Angled Artin Groups and Coxeter Groups
Two important families of CAT(0) groups arise from combinatorial data encoded in graphs.
RAAGs interpolate between free groups (when \(\Gamma\) has no edges) and free abelian groups (when \(\Gamma\) is complete). They act properly and cocompactly on CAT(0) cube complexes (the Salvetti complex), so they are CAT(0) groups.
Coxeter groups act on the Davis complex, a CAT(0) cube complex (or more generally a CAT(0) piecewise Euclidean cell complex), making them CAT(0) groups. Finite Coxeter groups are precisely the finite reflection groups, and the infinite ones include fundamental groups of important geometric objects like right-angled polyhedra in hyperbolic space.
Chapter 6: Amenability and Property (T)
The notions of amenability and Kazhdan’s property (T) stand at opposite ends of a spectrum. Amenable groups — those admitting an invariant mean or satisfying the Følner condition — are characterized by having “little resistance” to averaging; they include all abelian, solvable, and finitely generated groups of subexponential growth. At the other extreme, groups with property (T) have the strongest possible rigidity: every affine isometric action on a Hilbert space has a fixed point. The interplay between these two notions has produced some of the deepest results in group theory and its applications.
6.1 Amenable Groups
The theory of amenable groups was initiated by John von Neumann in 1929, motivated by the Banach-Tarski paradox. Von Neumann identified amenability as the key property separating the groups for which paradoxical decompositions are possible from those for which they are not.
The existence of an invariant mean is a very strong condition, but it is most useful when combined with more geometric characterizations:
The Følner condition says that amenable groups admit “almost-invariant” finite subsets — sets whose boundary is small relative to their volume. This is an isoperimetric condition, dual to the notion of expansion.
- Finite groups (take \(A = G\)).
- \(\mathbb{Z}^n\) (take \(A = [-N, N]^n\), then \(|\partial A|/|A| \to 0\)).
- All abelian groups.
- All solvable groups.
- All finitely generated groups of subexponential growth.
- Extensions and direct limits of amenable groups.
The class of amenable groups is closed under taking subgroups, quotients, extensions, and directed unions. The smallest class of groups containing all finite and abelian groups and closed under these operations is the class of elementary amenable groups.
6.2 The Banach-Tarski Paradox and Non-Amenable Groups
The Banach-Tarski paradox is possible because the rotation group \(\mathrm{SO}(3)\) contains a free subgroup of rank 2, and free groups are not amenable. More precisely:
6.3 Von Neumann’s Conjecture and Its Resolution
Von Neumann conjectured (or at least his work suggested) that a group is non-amenable if and only if it contains a non-abelian free subgroup. This became known as the von Neumann conjecture (or the von Neumann-Day problem, since Day explicitly formulated it).
Olshanskii constructed such groups using his theory of graded small cancellation (Tarski monsters — infinite groups in which every proper non-trivial subgroup is cyclic of fixed prime order). A more geometric counterexample was later given by Olshanskii and Sapir.
However, a variant of the von Neumann conjecture holds in a measure-theoretic sense:
6.4 Kazhdan’s Property (T)
Property (T) was introduced by Kazhdan in 1967 as a tool for showing that certain lattices in Lie groups are finitely generated. It has since found deep applications in ergodic theory, operator algebras, combinatorics (construction of expander graphs), and theoretical computer science.
The group \(G\) has Kazhdan’s property (T) if every unitary representation of \(G\) that almost has invariant vectors actually has a non-zero invariant vector.
For finitely generated groups, property (T) can be characterized in terms of a generating set:
This characterization, due to Delorme and Guichardet, makes clear the rigidity aspect of property (T): the group cannot act on a Hilbert space without a fixed point. It stands in stark contrast to amenability, which (by the Følner condition) means the group has “almost fixed points” in a certain sense.
6.5 Examples and Properties
- Compact groups have property (T) (by the Peter-Weyl theorem, every unitary representation decomposes into finite-dimensional irreducible representations, and almost-invariant vectors project to invariant vectors).
- \(\mathrm{SL}(n, \mathbb{Z})\) for \(n \ge 3\) has property (T). This was Kazhdan's motivating example. The proof uses the embedding of \(\mathrm{SL}(n, \mathbb{Z})\) as a lattice in \(\mathrm{SL}(n, \mathbb{R})\) and the fact that the latter has property (T) for \(n \ge 3\).
- \(\mathrm{Sp}(2n, \mathbb{Z})\) for \(n \ge 2\) has property (T).
- Lattices in simple Lie groups of rank \(\ge 2\) have property (T).
- \(\mathbb{Z}\) does not have property (T): the representation \(\pi(n) = e^{in\theta}\) on \(\mathbb{C}\) almost has invariant vectors (for small \(\theta\)) but has no non-zero invariant vector (for \(\theta \neq 0\)).
- Free groups \(F_n\) (\(n \ge 1\)) do not have property (T).
- Amenable infinite groups do not have property (T) (unless they are compact).
- Groups that split as amalgamated free products or HNN extensions over amenable subgroups do not have property (T) (by a result of Watatani).
- If \(G\) has property (T) and \(N \trianglelefteq G\) is a closed normal subgroup, then \(G/N\) has property (T).
- If \(G\) has property (T) and \(H \le G\) is a lattice (discrete subgroup of finite covolume), then \(H\) has property (T).
- Property (T) is inherited by quotients but not by subgroups in general.
- If \(G\) is finitely generated and has property (T), then \(G\) is finitely generated (in fact, compactly generated in the locally compact case) and the abelianization \(G/[G,G]\) is finite.
6.6 Expander Graphs from Property (T)
One of the most striking applications of property (T) is the construction of expander graphs — highly connected sparse graphs with applications throughout computer science and mathematics.
6.7 The Tension Between Amenability and Property (T)
This theorem beautifully illustrates the diametrically opposed natures of amenability and property (T): amenability is a “softness” condition (almost-invariant vectors exist), while property (T) is a “rigidity” condition (almost-invariant vectors must be close to truly invariant ones). The only groups satisfying both are compact, which are simultaneously “soft” (finite measure) and “rigid” (complete reducibility of representations).
Chapter 7: Boundaries, Rigidity, and Modern Directions
The final chapter surveys some of the most powerful results and active research directions in geometric group theory. We begin with the Gromov boundary of hyperbolic groups and the Poisson boundary, proceed to the celebrated Mostow rigidity theorem and its generalizations, and conclude with an overview of modern developments including relatively hyperbolic groups, acylindrical hyperbolicity, mapping class groups, and \(\mathrm{Out}(F_n)\).
7.1 The Gromov Boundary of Hyperbolic Groups
We have already defined the boundary at infinity of a hyperbolic group in Chapter 4. Here we explore its richer structure. The Gromov boundary \(\partial_\infty G\) admits a family of visual metrics:
Visual metrics exist for all sufficiently small parameters \(a > 1\) (specifically, for \(a\) close to 1 depending on \(\delta\)). Different visual metrics (and different parameters) give quasisymmetrically equivalent metrics on \(\partial_\infty G\).
The Bestvina-Mess formula has important consequences for the topology of hyperbolic groups. Combined with work of Bestvina and others, it leads to:
The Cannon conjecture asserts the converse: if \(G\) is a hyperbolic group with \(\partial_\infty G \cong S^2\), then \(G\) acts properly cocompactly on \(\mathbb{H}^3\) (equivalently, \(G\) is virtually a closed hyperbolic 3-manifold group). This remains one of the most important open problems in geometric group theory.
7.2 The Poisson Boundary
The Poisson boundary provides a measure-theoretic analogue of the topological boundary, with deep connections to harmonic analysis, random walks, and rigidity.
- The space of bounded \(\mu\)-harmonic functions on \(G\) is isometrically isomorphic to \(L^\infty(B, \nu)\).
- For \(\mu\)-almost every sample path \((w_n)_{n \ge 0}\) of the random walk on \(G\) (where \(w_n = g_1 g_2 \cdots g_n\) with \(g_i\) independent samples from \(\mu\)), the sequence \(w_n\) converges to a point in \(B\).
7.3 Mostow Rigidity
The Mostow rigidity theorem is one of the most profound results in geometry and group theory. It states that the geometry of a closed hyperbolic manifold of dimension \(\ge 3\) is completely determined by its fundamental group — a striking rigidity phenomenon that has no analogue in dimension 2 (where the Teichmüller space of a surface parameterizes a continuous family of hyperbolic structures).
The proof proceeds through several remarkable steps:
The key analytical step is showing that \(\partial_\infty \tilde{f}\) is quasiconformal. In dimension \(n \ge 3\), one then applies the following rigidity result: a quasiconformal homeomorphism of \(S^{n-1}\) that conjugates a cocompact lattice action to itself must be conformal (this is where the dimension hypothesis \(n \ge 3\) is essential — it fails in dimension 2). A conformal map of \(S^{n-1}\) extends to an isometry of \(\mathbb{H}^n\), giving the desired rigidity. \(\blacksquare\)
7.4 Quasi-Isometric Rigidity
Mostow rigidity can be viewed as a special case of a more general phenomenon: quasi-isometric rigidity, which asks when the quasi-isometry class of a group determines the group up to virtual isomorphism.
For \(n = 2\), this is the Casson-Jungreis and Gabai theorem (proved independently around 1992): a group quasi-isometric to \(\mathbb{H}^2\) is virtually a surface group. For \(n \ge 3\), it follows from the work of Sullivan and Tukia on quasiconformal groups.
These results illustrate a general principle: for many “geometrically rigid” groups, the quasi-isometry class essentially determines the group up to virtual isomorphism.
7.5 Relatively Hyperbolic Groups
Relatively hyperbolic groups generalize hyperbolic groups by allowing for a controlled set of “non-hyperbolic” subgroups (called peripheral subgroups), analogous to how a finite-volume hyperbolic manifold is hyperbolic away from its cusps.
There are several equivalent formulations due to Gromov, Farb, Bowditch, Osin, and Drutu-Sapir. The most geometric is Bowditch’s:
- The action is cofinite on the set of points of \(\partial_\infty X\) that are not parabolic fixed points.
- The parabolic fixed points of the action are precisely the limit points of the conjugates of the \(H_i\), and each such point has stabilizer conjugate to some \(H_i\).
- Fundamental groups of finite-volume hyperbolic manifolds with cusps are hyperbolic relative to their cusp subgroups (which are virtually nilpotent).
- Free products \(G = A * B\) are hyperbolic relative to \(\{A, B\}\).
- Limit groups (finitely generated fully residually free groups) are hyperbolic relative to their maximal non-cyclic abelian subgroups.
7.6 Acylindrically Hyperbolic Groups
Acylindrical hyperbolicity is a much broader generalization of hyperbolicity that has become central to modern geometric group theory. It was systematically developed by Osin (2016), building on earlier work of Bowditch, Bestvina-Fujiwara, and Dahmani-Guirardel-Osin.
- \(G\) is acylindrically hyperbolic.
- \(G\) admits a non-elementary acylindrical action on a hyperbolic space.
- \(G\) contains a proper infinite hyperbolically embedded subgroup.
- \(G\) is not virtually cyclic and admits an action on a hyperbolic space with a WPD (weak proper discontinuity) element.
- All non-elementary hyperbolic groups.
- All non-elementary relatively hyperbolic groups.
- Mapping class groups \(\mathrm{Mod}(\Sigma_g)\) for \(g \ge 1\) (except for a few small-complexity cases), acting on the curve complex.
- \(\mathrm{Out}(F_n)\) for \(n \ge 2\), acting on the free factor complex.
- Non-virtually-cyclic groups acting properly on proper CAT(0) spaces and containing a rank-one isometry.
- Most 3-manifold groups (via the Bestvina-Bromberg-Fujiwara machinery).
- Right-angled Artin groups whose defining graph is connected and not a complete graph.
Acylindrically hyperbolic groups share many of the strong properties of hyperbolic groups:
- \(G\) contains a non-degenerate hyperbolically embedded subgroup, and hence a non-abelian free subgroup (so \(G\) is non-amenable).
- \(G\) is SQ-universal: every countable group embeds in a quotient of \(G\).
- The bounded cohomology \(H^2_b(G; \mathbb{R})\) is infinite-dimensional.
- \(G\) has no non-trivial finite normal subgroups if it has trivial finite radical.
7.7 Mapping Class Groups
Mapping class groups are a central topic at the intersection of geometric group theory, low-dimensional topology, and algebraic geometry.
The study of mapping class groups draws deep analogies with both lattices in Lie groups and with the group \(\mathrm{Out}(F_n)\). The key geometric tool is the curve complex:
This landmark theorem opened the door to applying the machinery of hyperbolic geometry and boundaries to the study of mapping class groups. The mapping class group acts on \(\mathcal{C}(\Sigma)\) by simplicial automorphisms, and this action is acylindrical (Bowditch, 2008).
- Periodic: \(\phi^n = \mathrm{id}\) for some \(n\).
- Reducible: \(\phi\) preserves (up to isotopy) a non-empty collection of pairwise disjoint essential simple closed curves.
- Pseudo-Anosov: \(\phi\) preserves a pair of transverse measured foliations, stretching one and compressing the other by a factor \(\lambda > 1\) (the stretch factor or dilatation).
Pseudo-Anosov elements are the generic and most interesting type. They act as hyperbolic (loxodromic) isometries of the curve complex and of the Teichmüller space (with the Teichmüller metric).
7.8 \(\mathrm{Out}(F_n)\) and Culler-Vogtmann Outer Space
The group of outer automorphisms of the free group, \(\mathrm{Out}(F_n) = \mathrm{Aut}(F_n) / \mathrm{Inn}(F_n)\), is the algebraic analogue of the mapping class group, with the role of the surface played by a graph.
Outer space is a contractible space on which \(\mathrm{Out}(F_n)\) acts properly (but not cocompactly). It plays a role for \(\mathrm{Out}(F_n)\) analogous to the role of Teichmüller space for mapping class groups, and of symmetric spaces for arithmetic groups.
The analogy between \(\mathrm{Out}(F_n)\) and mapping class groups extends to the classification of elements (analogous to the Nielsen-Thurston classification) and to the construction of hyperbolic complexes:
7.9 Connections to Low-Dimensional Topology
Geometric group theory has deep connections to 3-manifold topology, catalyzed by Thurston’s geometrization program and its completion by Perelman (2003).
From the viewpoint of geometric group theory, the most interesting case is the hyperbolic one: closed hyperbolic 3-manifolds. Their fundamental groups are hyperbolic groups, and the interplay between group-theoretic and 3-manifold properties has driven much of the field.
Agol’s theorem resolved several long-standing conjectures (the virtual Haken conjecture, the virtual fibering conjecture, and the LERF conjecture for hyperbolic 3-manifold groups). The proof relies crucially on the theory of CAT(0) cube complexes and special groups developed by Wise, together with Kahn and Markovic’s construction of essential surfaces via almost-geodesic immersed surfaces.
7.10 Open Problems
We conclude with a selection of major open problems that continue to drive research in geometric group theory.
These problems reflect the vitality and depth of geometric group theory as a field. From its origins in the work of Dehn on surface groups and the combinatorial methods of the early 20th century, through the revolutionary insights of Gromov, Thurston, and their contemporaries, to the modern synthesis involving cube complexes, boundaries, and acylindrical actions, geometric group theory continues to reveal profound connections between algebra, geometry, and topology.