CO 444/644: Algebraic Graph Theory

Jane Gao

Estimated study time: 2 hr 18 min

Table of contents

These notes are based on Felix Zhou’s course notes. Additional explanations, visualizations, and examples have been added for clarity.

Algebraic graph theory is the study of graphs through the lens of algebra — primarily group theory and linear algebra. The central questions are: what symmetries does a graph possess, how can we classify graphs up to these symmetries, and what do the eigenvalues of natural matrix representations reveal about a graph’s structure? This course develops the machinery to answer all three.

Chapter 1: Graphs and Morphisms

1.1 Graphs and Isomorphisms

We work primarily with finite simple graphs. A graph $X = (V, E)$ consists of a vertex set $V$ and an edge set $E \subseteq \binom{V}{2}$. Important special cases include complete graphs $K_n$, cycles $C_n$, bipartite graphs, and the null graph ($V = \emptyset$). We will occasionally need multigraphs when the combinatorics demands it, but simplicity is the default.

Before we can study graph symmetry, we need a precise notion of when two graphs are “the same” — the same combinatorial object drawn with different labels. This is the role of isomorphism.

Definition (Isomorphism). An isomorphism between graphs $X$ and $Y$ is a bijection $f : V(X) \to V(Y)$ such that $uv \in E(X) \iff f(u)f(v) \in E(Y)$. If an isomorphism exists, we write $X \cong Y$.

Isomorphism is the equivalence relation for graphs: it captures when two graphs have identical combinatorial structure, regardless of labelling. Thinking of an isomorphism as a “relabelling” of the vertices makes clear why the edge relation must be preserved in both directions — we are not just mapping edges forward, but asserting that the entire adjacency structure is identical up to renaming. A subgraph $Y$ of $X$ satisfies $V(Y) \subseteq V(X)$ and $E(Y) \subseteq E(X)$. A spanning subgraph keeps all vertices; an induced subgraph $X[S]$ on $S \subseteq V$ keeps all edges with both endpoints in $S$.

1.2 Automorphisms

An automorphism is a self-isomorphism — a permutation of the vertices that preserves adjacency. Understanding the automorphism group of a graph is the central object of study for most of this course.

The key step from isomorphism to automorphism is to restrict attention to a graph’s symmetries with itself. Where isomorphisms compare two graphs, automorphisms reveal the internal symmetry of a single graph — they are the graph’s “motions.”

Definition (Automorphism). An automorphism of a graph $X$ is an isomorphism $f : X \to X$. The set of all automorphisms of $X$ is denoted $\mathrm{Aut}(X)$.

The collection $\mathrm{Aut}(X)$ is not just a set of permutations; it carries the structure of a group under composition, and this algebraic structure encodes the graph’s symmetry in a precise sense.

Proposition. $\mathrm{Aut}(X)$ is a subgroup of $\mathrm{Sym}(V(X))$, the symmetric group on the vertex set.

This proposition, while easy to verify, is the conceptual foundation of algebraic graph theory: every graph has a symmetry group, and studying that group reveals deep structural information about the graph.

We adopt the notation $v^g := g(v)$ for the image of vertex $v$ under $g \in \mathrm{Aut}(X)$, and extend this to subgraphs: if $Y$ is a subgraph of $X$, then $Y^g$ has $V(Y^g) = V(Y)^g$ and $E(Y^g) = \{u^g v^g : uv \in E(Y)\}$. Crucially, $Y^g \cong Y$.

Petersen graph with rotation automorphism arrows

The Petersen graph, with arrows indicating one automorphism: a cyclic rotation of the outer ring.

The next two lemmas make precise the intuition that automorphisms are “invisible” symmetries — they cannot be detected by local measurements like degree or distance.

Lemma. For $g \in \mathrm{Aut}(X)$ and $v \in V(X)$, $\deg_X(v) = \deg_{X^g}(v^g)$. Thus automorphisms permute vertices of the same degree.

Proof. Let $Y$ be the subgraph induced by $N(v) \cup \{v\}$. The result follows because $Y^g \cong Y$. $\square$

The proof works by transferring the local neighbourhood structure through the isomorphism; the same argument applies to any graph invariant that is determined locally around a vertex.

Lemma. For $g \in \mathrm{Aut}(X)$ and $u, v \in V(X)$, $d_X(u,v) = d_{X^g}(u^g, v^g)$. Automorphisms preserve distances.

Proof. Using $g$ and $g^{-1}$, there is a $uv$-path of length $\ell$ in $X$ if and only if there is a $u^g v^g$-path of length $\ell$ in $X^g$. $\square$

The proof proceeds by a clean bijection between paths: $g$ maps every $uv$-walk of length $\ell$ to a $u^g v^g$-walk of the same length, and $g^{-1}$ reverses this bijection.

These two lemmas encode a powerful constraint: automorphisms act as symmetries that are invisible to local graph invariants like degree and distance. Any graph property invariant under relabelling is automatically preserved by automorphisms.

1.3 Homomorphisms

A homomorphism relaxes the isomorphism condition from a two-way implication to a one-way implication: edges must map to edges, but non-edges may also map to edges. This makes homomorphisms a vastly richer class of maps.

Homomorphisms arise naturally in graph colouring: assigning colours to vertices so that no two adjacent vertices share a colour is precisely a homomorphism into a complete graph. By relaxing the bijectivity and edge-reflection requirements of isomorphism, we obtain a framework that connects graph structure to chromatic properties.

Definition (Homomorphism). A homomorphism from $X$ to $Y$ is a function $f : V(X) \to V(Y)$ such that $xy \in E(X) \implies f(x)f(y) \in E(Y)$. We write $X \to Y$ if a homomorphism exists.

The notation $X \to Y$ defines a preorder on graphs: $X \to X$ (via the identity), and if $X \to Y$ and $Y \to Z$ then $X \to Z$ (by composition). This preorder organizes the entire universe of graphs according to their “homomorphic complexity.”

Example 1.4.2. Any subgraph is homomorphic to its supergraph through the identity map.

Graph homomorphism: triangle maps to single edge

Left: triangle $K_3$. Right: single edge $K_2$. The map $a \mapsto x, b \mapsto y, c \mapsto x$ is a valid homomorphism, sending a triangle to an edge — a non-injective map.

Homomorphisms are not required to be injective or surjective, and they need not reflect the edge relation. This flexibility makes them the right notion for graph colouring theory.

Example. Given any bipartite graph $X$ with bipartition $A \cup B$, the map sending all of $A$ to one endpoint of $K_2$ and all of $B$ to the other is a homomorphism $X \to K_2$.

The following lemma makes the connection to colouring precise and is one of the most useful restatements of a classical concept in all of combinatorics.

Lemma. The chromatic number $\chi(X)$ equals the minimum integer $r$ such that $X \to K_r$.

Proof. A proper $\chi(X)$-colouring $c : V(X) \to [\chi(X)]$ is exactly a homomorphism $X \to K_{\chi(X)}$, so $\chi(X) \geq r$. Conversely, any homomorphism $f : X \to K_r$ gives an $r$-colouring by $v \mapsto f(v)$, so $\chi(X) \leq r$. $\square$

This lemma is fundamental: graph colourings are exactly graph homomorphisms into complete graphs. The minimum such target is the chromatic number. Among homomorphisms, one particularly structured type plays a starring role in the theory of retracts and in algorithmic graph theory.

Definition (Retraction). A retraction is a homomorphism $f : X \to Y$ where $Y$ is a subgraph of $X$ and $f$ restricts to the identity on $Y$. If such $f$ exists, $Y$ is a retract of $X$.

Intuitively, a retract is a subgraph that is “topologically inert” — the whole graph can be continuously collapsed onto it without tearing any edges. Retracts arise naturally in the study of core graphs and in the categorical theory of graph homomorphisms.

Example 1.4.4. Any clique $Y$ of $X$ whose size equals $\chi(X)$ is a retract of $X$.

Chapter 2: Families of Graphs

This chapter surveys the three principal families of graphs that appear throughout algebraic graph theory: circulant graphs, Johnson graphs, and line graphs. Each arises naturally from algebraic or combinatorial constructions and possesses rich symmetry structure.

2.1 Circulant Graphs

The cycle $C_n$ is the simplest example of a vertex-transitive graph, with automorphism group equal to the dihedral group $D_{2n}$. Circulant graphs generalise cycles by allowing additional edge connections dictated by an arithmetic structure on $\mathbb{Z}_n$.

To build a graph with a built-in cyclic symmetry, we need only decide which “steps” along $\mathbb{Z}_n$ constitute edges — the arithmetic structure of the group does the rest.

Definition (Circulant Graph). Let $C \subseteq \mathbb{Z}_n \setminus \{0\}$ be closed under inverses (i.e., $c \in C \implies -c \in C$). The circulant graph $X(\mathbb{Z}_n, C)$ has vertex set $\mathbb{Z}_n$ and edge set $E = \{ij : i - j \in C\}$.

The inverse-closure condition is precisely what makes the edge relation symmetric: if $i$ is connected to $j$ because $i - j \in C$, then $j - i = -(i-j) \in C$, so $j$ is connected back to $i$. The set $C$ is called the connection set of the circulant.

Circulant graph C_8(1,3)

The circulant graph $C_8(1,3)$: an 8-cycle (solid edges) augmented with “chord” connections of jump 3 (dashed red edges). The symmetry group acts by cyclic shifts and the reflection $i \mapsto -i$.

The cyclic shift $g : i \mapsto i+1$ and the reflection $h : i \mapsto -i$ both belong to $\mathrm{Aut}(X(\mathbb{Z}_n, C))$. The rotation subgroup $R = \langle g \rangle$ and the coset $hR$ are distinct, giving $|\mathrm{Aut}(X(\mathbb{Z}_n, C))| \geq 2n$.

2.2 Johnson Graphs

Johnson graphs arise from set intersections and constitute a fundamental family in algebraic combinatorics.

If circulant graphs encode group arithmetic, Johnson graphs encode combinatorial arithmetic: here, the “vertices” are subsets of a ground set, and adjacency measures how much two subsets overlap. This shift from groups to sets yields a rich family that includes the Kneser graphs and the Petersen graph as key special cases.

Definition (Johnson Graph). For integers $v \geq k \geq i \geq 0$, the Johnson graph $J(v,k,i)$ has vertex set $\{S \subseteq [v] : |S| = k\}$ and edge set $\{ST : |S \cap T| = i\}$.

The three parameters $(v, k, i)$ control the density and structure of the graph: increasing $i$ requires more overlap for adjacency, making the graph sparser; decreasing $i$ allows less overlap, making it denser. The entire Johnson scheme — the collection of all $J(v,k,i)$ for fixed $v,k$ — forms the basis of association scheme theory.

Johnson graph J(4,2)

The Johnson graph $J(4,2)$: vertices are 2-element subsets of $\{1,2,3,4\}$, and two vertices are adjacent when their intersection has exactly 1 element. This is the octahedron graph.

Since every $k$-subset of $[v]$ plays the same role, we expect the graph to be regular — and it is.

Proposition. $J(v,k,i)$ is $d$-regular for $d = \binom{k}{i}\binom{v-k}{k-i}$.

Proof. Fix a vertex $S$. Choose a subset $S' \subseteq S$ of size $i$: there are $\binom{k}{i}$ such choices. For each, there are $\binom{v-k}{k-i}$ sets $T$ with $S \cap T = S'$. The product counts all neighbours of $S$. $\square$

The proof uses a clean double-counting argument: first fix the intersection, then count the ways to complete the neighbour. This regularity confirms that the graph looks the same from every vertex — an early hint of the vertex-transitivity we will prove shortly.

Lemma. $J(v,k,i) \cong J(v, v-k, v-2k+i)$.

The isomorphism is given by complementation: $S \mapsto \bar{S} = [v] \setminus S$. An important special case: the Kneser graphs $J(v,k,0)$. In particular, $J(5,2,0)$ is the Petersen graph, one of the most important graphs in combinatorics.

The automorphism group of the Johnson graph is at least as large as the symmetric group on $[v]$, since any permutation of the ground set induces a graph automorphism.

Lemma. $\mathrm{Aut}(J(v,k,i))$ contains a subgroup isomorphic to $\mathrm{Sym}(v)$.

Proof. For $g \in \mathrm{Sym}(v)$, the map $\sigma_g : S \mapsto S^g$ is an automorphism of $J(v,k,i)$, since $|S \cap T| = |S^g \cap T^g|$. The family $\{\sigma_g : g \in \mathrm{Sym}(v)\}$ is a subgroup isomorphic to $\mathrm{Sym}(v)$. $\square$

The key observation is that permutations of the ground set preserve intersection sizes — a clean example of a symmetry acting at one level (the ground set) inducing a symmetry at another level (the subsets). Typically $\mathrm{Aut}(J(v,k,i)) \cong \mathrm{Sym}(v)$, but there are exceptions.

2.3 Line Graphs

The line graph construction replaces edges by vertices and adjacency by shared endpoints.

The line graph construction shifts perspective from vertices to edges: instead of asking how vertices relate to each other, we ask how edges interact. This “dual” viewpoint often simplifies spectral analysis and appears naturally in network flow and matching theory.

Definition (Line Graph). The line graph $L(X)$ of $X$ has vertex set $V(L(X)) = E(X)$ and edge set $E(L(X)) = \{ef : e, f \text{ share an endpoint in } X\}$.

The definition encodes a simple geometric idea: two edges in a graph “interact” if they share a vertex — just as two roads intersect at a junction. The line graph makes this interaction structure into a new graph, amenable to the same algebraic techniques as any other graph.

Graph G and its line graph L(G)

Left: a triangle with a pendant edge. Right: the corresponding line graph. Each edge of $G$ becomes a vertex in $L(G)$, with adjacencies reflecting shared endpoints.

If $X \cong Y$ then $L(X) \cong L(Y)$, but the converse fails: $L(C_3) \cong L(K_{1,3})$ yet $C_3 \not\cong K_{1,3}$. The converse does hold when both graphs have minimum degree at least 4. Not every graph is a line graph: for instance, $K_{1,3}$ is not, since any preimage would need an edge incident to three others at a common endpoint, leading to a contradiction.

Line graphs appear prominently in spectral theory: if $B = B(X)$ is the incidence matrix, then $B^T B = 2I + A(L(X))$, and the positive semidefiniteness of $B^T B$ immediately implies $\lambda_{\min}(L(X)) \geq -2$.

Chapter 3: Group Actions on Graphs

Group actions provide the algebraic framework to study graph symmetry systematically. Instead of studying automorphisms of a specific graph in isolation, we study the action of abstract groups on sets and graphs.

3.1 Group Actions

To study graph symmetry systematically, we need the language of group actions. An abstract group acting on a set captures the idea of “symmetry” in a way that is both flexible and algebraically tractable.

Definition 2.1.1 (Group Homomorphism). Let $G, H$ be groups. A map $f : G \to H$ is a homomorphism if $f(xy) = f(x)f(y)$ for all $x, y \in G$. The kernel of $f$ is $\ker f := f^{-1}(1)$.

With this in place, a group action is simply a homomorphism into a symmetric group.

Definition (Group Action). Let $G$ be a group and $V$ a set. A (left) action of $G$ on $V$ is a homomorphism $f : G \to \mathrm{Sym}(V)$. If $f$ is injective (i.e., $\ker f = \{1\}$), the action is faithful.

A group action is faithful when no non-identity element acts trivially — every element of the group “does something” to the set. Faithfulness is the natural condition when we want to identify the group with a genuine set of symmetries, rather than a group that partially acts on the set.

We write $v^g$ for $f(g)(v)$, the image of $v \in V$ under $g \in G$. For $S \subseteq V$, $S^g = \{v^g : v \in S\}$. A subset $S$ is $G$-invariant if $S^g = S$ for all $g \in G$.

Two fundamental concepts attach to every element of $V$ under a group action: the orbit (all elements reachable from it) and the stabilizer (all group elements that fix it).

Definition (Orbit and Stabilizer). The orbit of $x \in V$ under $G$ is $x^G = \{x^g : g \in G\}$. The stabilizer of $x$ is $G_x = \{g \in G : x^g = x\}$.

Orbits and stabilizers are complementary: the orbit measures how far the group can move $x$, while the stabilizer measures how much of the group leaves $x$ fixed. Together they partition the group action into structured pieces.

Group action: orbits on a 6-element set

Three orbits of a group action on $V = \{1,2,3,4,5,6\}$. The group permutes elements within each coloured oval but never moves elements between ovals.

The orbits partition $V$ into disjoint $G$-invariant subsets. The stabilizer $G_x$ is always a subgroup of $G$.

3.2 The Orbit-Stabilizer Theorem

The Orbit-Stabilizer theorem is the cornerstone of enumeration in group theory. It relates the size of a group to the size of its orbits and stabilizers. The key preliminary observation is that the set of group elements sending $x$ to any fixed $y$ is not arbitrary — it forms an entire coset of the stabilizer.

Lemma. Let $G$ act on $V$ and $x \in V$. For any $y \in x^G$, the set $\{h \in G : x^h = y\}$ is a left coset of $G_x$.

This lemma reveals a clean partition of $G$: each element of the orbit $x^G$ corresponds to a unique coset of $G_x$, giving the orbit-stabilizer theorem.

Lemma (Orbit-Stabilizer). Let $G$ act on $V$ and $x \in V$. Then \[|G_x| \cdot |x^G| = |G|.\]

Proof. Define $f : x^G \to \{\text{left cosets of } G_x\}$ by $f(y) = \{g \in G : x^g = y\}$. By the previous lemma, $f$ is a bijection, so $|x^G| = [G : G_x] = |G|/|G_x|$. $\square$

The proof constructs a bijection between orbit elements and cosets, then invokes Lagrange’s theorem. This is both elegant and powerful: it implies that the size of every orbit divides $|G|$, a constraint that severely limits which subsets of $V$ can be orbits.

Orbit-Stabilizer theorem illustrated

The Orbit-Stabilizer equation $|G| = |G_x| \cdot |x^G|$: the group splits into cosets of the stabilizer, each coset corresponding to exactly one element of the orbit.

A key consequence involves the notion of conjugation.

Definition 2.2.1 (Conjugate). Let $g, h \in G$. We say $g$ is conjugate to $h$ if there exists $\sigma \in G$ with $g = \sigma^{-1} h \sigma$. Conjugacy is an equivalence relation, partitioning $G$ into conjugacy classes. For a subgroup $H \leq G$, the subgroup $gHg^{-1}$ is conjugate to $H$.

The stabilizers along an orbit are all conjugate to each other, as the following lemma makes precise.

Lemma 2.2.1. Let $G$ act on $V$ and $x \in V$. Then for all $g \in G$, $gG_x g^{-1} = G_{x^g}$. In particular, for any $y \in x^G$, the stabilizers $G_x$ and $G_y$ are conjugate.

Proof. Let $y = x^g$. For any $ghg^{-1} \in gG_x g^{-1}$, we compute $y^{ghg^{-1}} = x^{ghg^{-1}} \cdot\ldots = x^g = y$, so $gG_x g^{-1} \subseteq G_{x^g}$. The reverse inclusion follows by applying the same argument to $g^{-1}$. $\square$

3.3 Burnside’s Lemma

Having established that orbits partition $V$, the natural enumeration problem is: how many orbits are there? Burnside’s lemma gives a beautiful answer by converting the orbit-count into an average fixed-point count.

Lemma (Burnside). Let $G$ act on $V$. Write $\mathrm{fix}(g) = \{v \in V : v^g = v\}$. Then the number of orbits of $G$ is \[\frac{1}{|G|} \sum_{g \in G} |\mathrm{fix}(g)|.\]

Proof. Let $\Lambda = \{(g,x) : x \in \mathrm{fix}(g)\}$. Double-counting $\Lambda$: summing over $g$ gives $\sum_g |\mathrm{fix}(g)|$. Summing over $x$ gives $\sum_x |G_x|$. By the Orbit-Stabilizer theorem, $|G_x| = |G|/|x^G|$, so $\sum_x |G_x| = |G| \cdot \#\text{orbits}$. $\square$

The proof is an elegant double-counting of the incidence set $\Lambda$: the same set counted two ways yields, after dividing by $|G|$, the number of orbits. This technique — picking a convenient auxiliary set and counting it in two ways — is one of the most versatile tools in combinatorics.

Burnside’s lemma: 4 distinct 2-colorings of C₄ up to rotation

Four rotationally distinct 2-colourings of $C_4$. Burnside’s lemma computes: $(16+0+4+0+4+0+16+0+\ldots)/8 = 6$ for necklaces, or $4$ if only rotations are allowed.

Application to counting graph isomorphism classes. Let $G = \mathrm{Sym}(n)$ act on $\mathcal{G}_n$ (all graphs on $[n]$) by $X \mapsto X^g$. The orbits are exactly the isomorphism classes. Write $\mathrm{Iso}(X) = \{Y \in \mathcal{G}_n : Y \cong X\}$ for the isomorphism class of $X$.

Lemma 2.3.1. For all $X \in \mathcal{G}_n$, $\displaystyle|\mathrm{Iso}(X)| = \frac{n!}{|\mathrm{Aut}(X)|}.$

Proof. Let $H = \{\tau_g : g \in \mathrm{Sym}(n)\}$ act on $\mathcal{G}_n$ by $\tau_g(X) = X^g$. By the Orbit-Stabilizer lemma, $n! = |H| = |\mathrm{Iso}(X)| \cdot |H_X|$. But $H_X \cong \mathrm{Aut}(X)$, so the result follows. $\square$

Burnside then gives:

Lemma. The number of isomorphism classes of graphs on $[n]$ satisfies \[|H| = (1+o(1)) \frac{2^{\binom{n}{2}}}{n!}.\]

The key insight is that the identity permutation dominates the Burnside sum, contributing $2^{\binom{n}{2}}$, while all other permutations contribute a negligible $o(1)$ fraction.

3.4 Asymmetric Graphs

Burnside’s lemma gives us the tools to ask a fundamental probabilistic question: how symmetric is a “typical” graph? The answer is perhaps surprising.

Definition (Asymmetric). A graph $X$ is asymmetric if $\mathrm{Aut}(X) = \{1\}$ is trivial.

Asymmetric graphs are those with no symmetry whatsoever — the only automorphism is the identity. The following theorem, while counterintuitive (highly structured objects seem interesting, yet typical objects have no structure), is one of the foundational results in random graph theory.

Theorem. Asymptotically, almost all graphs are asymmetric: if $X$ is chosen uniformly at random from $\mathcal{G}_n$, then $\Pr[X \text{ is asymmetric}] \to 1$ as $n \to \infty$.

Proof. Among isomorphism classes in $H$, let $\rho$ be the proportion corresponding to asymmetric graphs. For an asymmetric class $C$, $|C| = n!$. For a non-asymmetric class, $|C| \leq n!/2$. Using the formula for $|H|$ and summing over classes: \[2^{\binom{n}{2}} \leq \rho|H| \cdot n! + (1-\rho)|H| \cdot \frac{n!}{2} = (1+o(1))2^{\binom{n}{2}} \cdot \frac{1+\rho}{2},\] which forces $\rho \to 1$. The proportion of asymmetric graphs in $\mathcal{G}_n$ is then $n! \cdot \rho|H| / 2^{\binom{n}{2}} \to 1$. $\square$

The proof exploits the asymmetry of counting: an asymmetric isomorphism class is exactly $n!$ times larger than its representative, while a symmetric class is smaller by at least a factor of 2. This size advantage forces asymmetric classes to dominate.

3.5 Primitivity

When a group acts transitively on a set, it may or may not respect additional structure. This leads to the notion of primitive versus imprimitive actions, which is the analogue for group actions of the distinction between a prime and a composite number.

Definition (Block of Imprimitivity). Let $G$ act transitively on $V$ and $S \subseteq V$. Then $S$ is a block of imprimitivity if for all $g \in G$, either $S^g = S$ or $S^g \cap S = \emptyset$.

A block of imprimitivity is a subset that the group can only either entirely preserve or entirely move away from — a kind of indivisible “chunk” with respect to the action. The trivial blocks are singletons $\{v\}$ and all of $V$.

Example 2.4.1 (Trivial Blocks). For any transitive action, $S = \{u\}$ for a single vertex $u \in V$, and $S = V$, are both trivial blocks of imprimitivity.

Definition 2.4.2 (Primitive). A transitive group $G$ acting on $V$ is primitive if it has no non-trivial blocks of imprimitivity. Otherwise it is imprimitive.

Primitivity is to group actions what primality is to integers: it identifies actions with no intermediate structure.

Example 2.4.2. $\mathrm{Aut}(K_n)$ has no non-trivial blocks of imprimitivity, so it acts primitively on the vertices of $K_n$.

Example. The cube $Q_3$ has antipodal pairs as blocks of imprimitivity (automorphisms preserve distances, so opposite vertices always map to opposite vertices). The cycle $C_n$ for composite $n = xy$ has blocks $\{0, x, 2x, \ldots, (y-1)x\}$.

Primitive vs imprimitive graph

Left: $C_5$ is primitive (prime $n$, no non-trivial block structure). Right: $C_6$ is imprimitive — the three coloured pairs of antipodal vertices form the blocks of imprimitivity.

The characterization of primitivity in terms of maximal subgroups is one of the deepest results of this chapter, connecting the geometric notion of a block structure to the algebraic notion of a maximal subgroup.

Definition 2.4.3 (Maximal Subgroup). A proper subgroup $H < G$ is a maximal subgroup if there is no subgroup $K$ with $H < K < G$.

Lemma 2.4.4. Let $G$ act transitively on $V$ and fix $x \in V$. Then $G$ is primitive if and only if $G_x$ is a maximal subgroup of $G$.

Proof. We prove both contrapositives.

($\Rightarrow$) Suppose $B$ is a non-trivial block with $x \in B$. Let $G_B = \{g \in G : B^g = B\}$. Then $G_x < G_B < G$: since $|B| \geq 2$ there is some $h$ with $x^h \neq x$ but $x^h \in B$, so $h \in G_B \setminus G_x$. Since $B \subsetneq V$, transitivity gives $G_B \neq G$.

($\Leftarrow$) If $G_x < H < G$, let $B$ be the orbit of $x$ under $H$. Then $|B| \geq 2$ (since $H \supsetneq G_x$) and $|B| < |V|$ by Orbit-Stabilizer. One checks that $B$ is a non-trivial block. $\square$

Both directions of the proof construct a bijection between the two types of intermediate structure — blocks on one side, intermediate subgroups on the other. This is a recurring pattern in group theory: geometric structure and algebraic structure shadow each other precisely.

Chapter 4: Transitive and Primitive Graphs

Transitivity conditions on graphs — vertex-transitivity, edge-transitivity, arc-transitivity — measure how “symmetric” a graph is. These properties have deep implications for graph structure, connectivity, and matchings.

4.1 Vertex-Transitive Graphs

With the group-theoretic machinery from Chapter 3 in place, we can now define and study transitivity conditions on graphs. These conditions measure how “symmetric” a graph is in a precise algebraic sense.

Definition (Vertex-Transitive). A graph $X$ is vertex-transitive if $\mathrm{Aut}(X)$ acts transitively on $V(X)$: for any $u, v \in V(X)$, there exists $g \in \mathrm{Aut}(X)$ with $u^g = v$.

Intuitively, a vertex-transitive graph “looks the same from every vertex.” All such graphs are regular, since automorphisms preserve degree and can move any vertex to any other.

Definition 3.1.2 (k-Cube). Let $H(\alpha, \beta)$ denote the Hamming distance between strings $\alpha, \beta \in \mathbb{Z}_2^k$. The k-cube $Q_k$ is the graph with $V(Q_k) = \mathbb{Z}_2^k$ and $E(Q_k) = \{\alpha\beta : H(\alpha,\beta) = 1\}$.

Lemma 3.1.1. $Q_k$ is vertex-transitive for all $k \geq 1$.

Proof. Fix $v \in V(Q_k)$. For each $x$, let $P_v : x \mapsto x + v$. Since $H(x,y) = 1 \iff H(x+v, y+v) = 1$, each $P_v \in \mathrm{Aut}(Q_k)$. The subgroup $P = \{P_v : v \in \mathbb{Z}_2^k\}$ acts transitively since $P_{w-v}$ maps $v \mapsto w$ for any $v, w$. $\square$

Proposition 3.1.2. $|\mathrm{Aut}(Q_k)| \geq 2^k k!$.

Proof. Let $P' = \{\tau_g : g \in \mathrm{Sym}(k)\}$ where $\tau_g$ permutes the coordinates of $x \in \mathbb{Z}_2^k$ according to $g$. Then $P' \leq \mathrm{Aut}(Q_k)$ and $P \cap P' = \{\mathrm{id}\}$. By elementary group theory, $|PP'| = |P| \cdot |P'|/|P \cap P'| = 2^k k!$. $\square$

The $k$-cube is also a Cayley graph: $Q_k = X(\mathbb{Z}_2^k, \{e_i : i \in [k]\})$, where $e_i$ is the $i$-th standard basis vector. By the same left-multiplication argument, circulant graphs $X(\mathbb{Z}_n, C)$ are vertex-transitive.

Proposition 3.1.4. For $v \geq k \geq i$, the Johnson graph $J(v,k,i)$ is vertex-transitive.

Proof. For each $g \in \mathrm{Sym}(v)$, the map $\sigma_g : S \mapsto S^g$ is an automorphism of $J(v,k,i)$. The subgroup $\{\sigma_g : g \in \mathrm{Sym}(v)\} \leq \mathrm{Aut}(J(v,k,i))$ acts transitively on the $k$-subsets of $[v]$. $\square$

In general, $J(v,k,i)$ are not Cayley graphs.

4.2 Cayley Graphs

Cayley graphs are the canonical examples of vertex-transitive graphs. They arise directly from the multiplication structure of a group — making explicit the connection between group theory and graph symmetry.

The idea is simple: take a group $G$ and a generating set $C$, and declare two group elements adjacent whenever one can be obtained from the other by multiplying by an element of $C$. The resulting graph automatically inherits the group’s symmetry.

Definition (Cayley Graph). Let $G$ be a group and $C \subseteq G \setminus \{1\}$ be inverse-closed. The Cayley graph $X(G,C)$ has $V = G$ and $E = \{gh : g^{-1}h \in C\}$.

The condition $1 \notin C$ ensures no loops; inverse-closure ensures undirectedness. The Cayley graph gives a visual and combinatorial representation of the group: its structure encodes how the generators interact and generate the whole group.

$Cayley graph of Z_6 with generators {1,5}$

The Cayley graph $X(\mathbb{Z}_6, \{1,5\}) \cong C_6$: the group $\mathbb{Z}_6$ with generators $\pm 1$ yields the 6-cycle. The left-multiplication maps $\tau_g : x \mapsto gx$ act transitively on vertices.

The left-multiplication structure of the group immediately provides the required automorphisms.

Theorem. Every Cayley graph $X(G,C)$ is vertex-transitive.

Proof. For each $g \in G$, define $\tau_g : V(X) \to V(X)$ by $x \mapsto gx$. Then $\tau_g \in \mathrm{Aut}(X)$ since $xy \in E \iff y^{-1}x \in C \iff (gy)^{-1}(gx) \in C \iff \tau_g(x)\tau_g(y) \in E$. The set $\{\tau_g : g \in G\}$ acts transitively since $\tau_{wv^{-1}}$ maps $v \mapsto w$ for any $v, w$. $\square$

This tells us that the group itself acts as a transitive subgroup of the automorphism group, via left multiplication. The transitive action of $G$ on itself is called the regular representation, and it is the algebraic backbone of all Cayley graph constructions.

Lemma. $X(G,C)$ is connected if and only if $C$ generates $G$.

Proof. ($\Rightarrow$) Any path from $e$ to $g$ gives $g$ as a product of elements from $C$. ($\Leftarrow$) If $g = c_1 \cdots c_k$ with $c_i \in C$, then the sequence $e, c_1, c_1c_2, \ldots, g$ is a path in $X(G,C)$. $\square$

This clean equivalence means that connectivity of the Cayley graph reflects the generating property of $C$: the graph is connected precisely when $C$ is rich enough to “reach” the whole group from the identity by iterated multiplication.

Not every vertex-transitive graph is a Cayley graph. The Petersen graph $J(5,2,0)$ is vertex-transitive but is not a Cayley graph, since neither $\mathbb{Z}_{10}$ nor the dihedral group $D_{10}$ produces a Cayley graph isomorphic to it.

To characterise exactly which vertex-transitive graphs are Cayley graphs, we use the notion of regularity for group actions.

Definition 3.5.1 (Semiregular). A permutation group $G$ acting on $V$ is semiregular if $G_x = \{1\}$ for all $x \in V$ (no non-identity element fixes any point).

Proposition 3.5.1. If $G$ is semiregular, then $|G| = |x^G|$ for all $x \in V$.

Proof. By the Orbit-Stabilizer lemma, $|G| = |G_x| \cdot |x^G| = 1 \cdot |x^G|$. $\square$

Definition 3.5.2 (Regular Action). If $G$ is semiregular and transitive, then $G$ is regular.

Proposition 3.5.2. If $G$ acting on $V$ is regular, then $|G| = |V|$.

Lemma 3.5.3. Let $G$ be a group and $C \subseteq G \setminus \{1\}$ be inverse-closed. Then $\mathrm{Aut}(X(G,C))$ contains a regular subgroup isomorphic to $G$.

Proof. The left-multiplication maps $T = \{\tau_g : g \in G\}$ form a subgroup of $\mathrm{Aut}(X(G,C))$ isomorphic to $G$. $T$ acts transitively, and any non-identity $\tau_g$ satisfies $\tau_g(x) = gx \neq x$ for all $x$, so $T$ is semiregular. Hence $T$ is regular. $\square$

Lemma 3.5.4. If $G \leq \mathrm{Aut}(X)$ acts regularly on $V(X)$, then $X \cong X(G,C)$ for some inverse-closed $C \subseteq G \setminus \{1\}$.

This gives a clean characterisation: vertex-transitive graphs are precisely those admitting a regular subgroup in their automorphism group — which is exactly a Cayley graph structure.

Two Cayley graphs on the same group are isomorphic when their connection sets are related by a group automorphism.

Definition 3.5.3 (Group Automorphism). A bijection $\theta : G \to G$ is a group automorphism if $\theta(gh) = \theta(g)\theta(h)$ for all $g, h \in G$.

Lemma 3.5.5. Let $\theta$ be an automorphism of $G$. Then $X(G,C) \cong X(G, \theta(C))$.

Proof. The map $\theta : V(X(G,C)) \to V(X(G,\theta(C)))$ is an isomorphism since $g^{-1}h \in C \iff \theta(g)^{-1}\theta(h) = \theta(g^{-1}h) \in \theta(C)$. $\square$

Note that $X(G,C) \cong X(G,\theta(C))$ does not require $\theta$ to be an automorphism — the converse is false in general.

Furthermore:

Theorem. Every connected vertex-transitive graph is isomorphic to a retract of a Cayley graph.

4.3 Edge-Transitive and Arc-Transitive Graphs

Beyond vertex-transitivity, we can ask whether the automorphism group acts transitively on edges or on directed edges (arcs). These finer conditions capture increasingly strong symmetry requirements.

Definition (Edge-Transitive, Arc-Transitive). A graph $X$ is edge-transitive if $\mathrm{Aut}(X)$ acts transitively on $E(X)$. It is arc-transitive (or symmetric) if $\mathrm{Aut}(X)$ acts transitively on the arcs $\{(x,y) : xy \in E(X)\}$.

Arc-transitivity is strictly stronger than edge-transitivity: an arc-transitive automorphism can not only carry any edge to any other, but can also reverse the direction of any edge. Equivalently, for any two arcs $(u,v)$ and $(x,y)$, there is an automorphism sending $u \mapsto x$ and $v \mapsto y$. Arc-transitivity implies both vertex and edge transitivity, but the converses fail. The triangular prism is vertex-transitive but not edge-transitive; $K_{n,m}$ for $n \neq m$ is edge-transitive but not vertex-transitive.

Lemma. If $X$ is edge-transitive and has no isolated vertices but is not vertex-transitive, then $\mathrm{Aut}(X)$ has exactly two orbits, forming a bipartition of $X$.

Lemma. If $X$ is both vertex and edge transitive but not arc-transitive, then every vertex has even degree.

Proof. The arc set splits into two orbits $\Omega_1 = (x,y)^G$ and $\Omega_2 = (y,x)^G$. Since $X$ is not arc-transitive, these are disjoint, and the graph of arcs in $\Omega_1$ is a regular oriented graph. Each vertex $u$ satisfies $\deg_X(u) = d^+_{\Omega_1}(u) + d^-_{\Omega_1}(u) = 2d^+_{\Omega_1}(u) \equiv 0 \pmod 2$. $\square$

The proof uses the concept of an oriented graph.

Definition 3.2.3 (Oriented Graph). An oriented graph is a directed graph $X$ where $xy \in A(X) \implies yx \notin A(X)$. It is obtained by orienting each edge of an undirected graph.

The proof exploits the parity of the two arc-orbits: if they are distinct, the indegree equals the outdegree in the oriented graph $\Omega_1$, forcing the total degree to be even.

Corollary 3.2.4.1. If $X$ is vertex and edge transitive with odd vertex degrees, then $X$ is arc-transitive.

4.4 Connectivity and Matchings of Vertex-Transitive Graphs

The symmetry of vertex-transitive graphs constrains their connectivity and matching properties in remarkable ways. These results show that high symmetry forces high connectivity — a striking instance of algebraic structure controlling combinatorial properties.

Definition (Edge Connectivity). The edge connectivity $\kappa_1(X)$ is the minimum number of edges whose deletion increases the component count. An edge atom is a minimum-size $S \subseteq V(X)$ with $|\partial S| = \kappa_1(X)$.

For general graphs, the edge connectivity can be as small as the minimum degree. For vertex-transitive graphs, vertex symmetry forces a much tighter result. Two preparatory lemmas are needed.

Lemma 3.3.1. Let $X$ be connected and vertex-transitive. Any two distinct edge atoms are vertex-disjoint.

Proof. Let $\kappa = \kappa_1(X)$. Suppose $A \neq B$ are edge atoms. By minimality $|A|, |B| \leq |V|/2$. If $A \cup B = V$ then $A \cap B = \emptyset$ and we are done. Otherwise $A \cup B \subsetneq V$, so $|\partial(A \cup B)| \geq \kappa$. But $A \cap B \subsetneq A$ gives $|\partial(A \cap B)| > \kappa$. By submodularity, $2\kappa < |\partial(A \cup B)| + |\partial(A \cap B)| \leq |\partial A| + |\partial B| = 2\kappa$, a contradiction. $\square$

Lemma 3.3.2. Let $X$ be connected and vertex-transitive, and let $S$ be a block of imprimitivity for $\mathrm{Aut}(X)$. Then $X[S]$ is regular.

Proof. Let $u \neq v \in S$ and $Y = X[S]$. Since $X$ is vertex-transitive, pick $g \in \mathrm{Aut}(X)$ with $u^g = v$. Since $S$ is a block, $S^g = S$. Then $N_Y(v) = N_Y(u)^g$, so $\deg_Y(u) = \deg_Y(v)$. $\square$

Theorem. If $X$ is connected and vertex-transitive, then $\kappa_1(X)$ equals the degree of each vertex.

Proof. Clearly $\kappa_1(X) \leq k$. Let $A$ be an edge atom. For any $g \in \mathrm{Aut}(X)$, $A^g$ is also an edge atom, so either $A = A^g$ or $A \cap A^g = \emptyset$. Thus $A$ is a block of imprimitivity for $\mathrm{Aut}(X)$, and $X[A]$ is $\ell$-regular for some $\ell < k$. Computing: $|\partial A| = |A|(k-\ell) \geq |A|(k+1-|A|)$. Since $|A| \leq k$, we get $|\partial A| \geq k$. $\square$

The proof is a beautiful interplay between graph theory and group theory: edge atoms are used to construct blocks of imprimitivity, and the block structure then gives the lower bound on connectivity by a direct counting argument.

Theorem 3.3.4. A $k$-regular vertex-transitive graph has vertex connectivity at least $\frac{2}{3}(k+1)$.

This bound is tight for certain families of graphs.

The matching theory of vertex-transitive graphs rests on two structural lemmas. A vertex is critical if it is saturated in every maximum matching.

Lemma 3.4.1. Let $z_1, z_2 \in V(X)$ be vertices such that no maximum matching misses both. Suppose $M_{z_1}$ and $M_{z_2}$ are maximum matchings missing $z_1$ and $z_2$ respectively. Then $M_{z_1} \triangle M_{z_2}$ contains an even alternating $z_1 z_2$-path.

Proof. Both $z_1$ and $z_2$ lie on some even alternating path in $M_{z_1} \triangle M_{z_2}$. Let $P$ be the path containing $z_1$. If it does not contain $z_2$, then $M_{z_2} \triangle P$ is a maximum matching missing both $z_1$ and $z_2$, a contradiction. $\square$

Lemma 3.4.2. Let $u, v \in V(X)$ be vertices in a connected graph $X$ with no critical vertices. Then $X$ contains a matching missing at most 1 vertex.

Proof. We show that for every $u, v \in V(X)$, no maximum matching misses both. By induction on the length of a $uv$-path: the base case $|P| = 1$ is trivial. For $|P| \geq 2$, pick $x \notin \{u,v\}$ on $P$. Since $x$ is not critical, there is a maximum matching $M_x$ missing $x$, which saturates both $u$ and $v$. If some $M$ misses both $u$ and $v$, Lemma 3.4.1 applied to $M_x \triangle M$ yields a contradiction via the path structure. $\square$

Proposition 3.4.3. Let $X$ be connected and vertex-transitive. Then $X$ has a matching missing at most one vertex, and every edge lies in some maximum matching.

Theorem 3.4.5. Let $X$ be connected and vertex-transitive. Then $X$ has a matching missing at most 1 vertex and each edge is contained in some maximum matching.

This is proved by a careful induction on vertices and edges, using the vertex-transitivity to propagate matching properties from subgraphs back to $X$.

Vertex-transitive circulant graph on 12 vertices

The circulant $C_{12}(1,5)$: a vertex-transitive graph on 12 vertices. Every vertex has degree 4, and the automorphism group acts transitively — every vertex looks identical.

Chapter 5: Eigenvalues and the Adjacency Matrix

The second major algebraic tool in graph theory is linear algebra: the adjacency matrix and the spectrum of its eigenvalues encode surprising information about graph structure.

5.1 The Adjacency Matrix and Spectrum

The adjacency matrix is the bridge between combinatorics and linear algebra. Once we encode a graph as a matrix, the full machinery of linear algebra — eigenvalues, eigenvectors, spectral decompositions — becomes available.

Definition (Adjacency Matrix). The adjacency matrix $A = A(X) \in \{0,1\}^{V \times V}$ has $A_{u,v} = 1$ if $u \sim v$ and 0 otherwise. Since $X$ is simple and undirected, $A$ is symmetric with zero diagonal.

The symmetry of $A$ is not incidental — it is the matrix-theoretic reflection of the graph’s undirectedness, and it guarantees that all eigenvalues are real and that the matrix is orthogonally diagonalizable. This makes spectral graph theory possible.

Small graph with its adjacency matrix

A 5-vertex graph alongside its adjacency matrix. The symmetry $A = A^T$ reflects the undirectedness of the graph; the zero diagonal reflects the absence of loops.

Definition (Spectrum). The spectrum of $X$ is the multiset of eigenvalues of $A(X)$. The characteristic polynomial is $\phi(A,x) = \det(xI - A(X))$.

Since $A(X)$ is real and symmetric, all its eigenvalues are real and it has an orthonormal eigenvector basis. Isomorphic graphs have the same spectrum — but the converse fails (non-isomorphic graphs can be cospectral). The existence of cospectral non-isomorphic graphs is a fundamental limitation of the spectral approach: eigenvalues do not fully classify graphs, and the study of graph invariants that go beyond the spectrum is an active research area.

Lemma. $(A^r)_{u,v}$ equals the number of walks from $u$ to $v$ of length $r$.

Corollary. If $X$ has $m$ edges and $t$ triangles and $A = A(X)$, then $\mathrm{tr}\, A = 0$, $\mathrm{tr}\, A^2 = 2m$, and $\mathrm{tr}\, A^3 = 6t$. In general, $\mathrm{tr}\, A^r = \sum_{i=1}^n \lambda_i^r$.

Computing eigenvalues. If $f : V(X) \to \mathbb{R}$ satisfies $\sum_{v \sim u} f(v) = \lambda f(u)$ for all $u$, then $\lambda$ is an eigenvalue of $A(X)$ with eigenvector $f$.

Example (Regular Graphs). For a $k$-regular graph, the constant function $f \equiv 1$ satisfies $\sum_{v \sim u} 1 = k = k \cdot f(u)$, so $k$ is always an eigenvalue. Conversely, $\mathbf{1}$ is an eigenvector if and only if $X$ is regular.

Example (Cycles). For $C_n$, let $\tau = e^{2\pi i j/n}$ be an $n$-th root of unity and $f(u) = \tau^u$. Then $\sum_{v \sim u} f(v) = \tau^{u-1} + \tau^{u+1} = (\tau^{-1}+\tau)\tau^u$, so $\lambda_j = 2\cos(2\pi j/n)$ for $j = 0, 1, \ldots, n-1$.

Lemma. If $X$ is $k$-regular, then $X$ and its complement $\bar{X}$ share the same eigenvectors. The eigenvalues of $\bar{X}$ are $(n-k-1)$ (with eigenvector $\mathbf{1}$) and $-\theta_j - 1$ for each non-principal eigenvalue $\theta_j$ of $X$.

5.2 Incidence Matrix and the Laplacian

The adjacency matrix is not the only natural matrix associated to a graph. The incidence matrix encodes how vertices relate to edges, and it leads to the Laplacian — perhaps the most analytically rich matrix in all of spectral graph theory.

Definition (Incidence Matrix). The incidence matrix $B = B(X) \in \{0,1\}^{V \times E}$ has $B_{u,e} = 1$ if $u \in e$.

The incidence matrix records vertex-edge membership rather than vertex-vertex adjacency. Its rank captures how many “independent” ways to traverse the graph — a fact made precise by the following theorem.

Theorem. If $X$ has $n$ vertices and $c$ bipartite components, then $\mathrm{rank}\, B = n - c$.

Proof. We show $\mathrm{null}(B^T) = c$. A vector $z$ with $B^T z = 0$ satisfies $z_u + z_v = 0$ for all $uv \in E$. Hence $z_u = 0$ on non-bipartite components and takes opposite values on bipartition classes in bipartite components. So the null space has dimension $c$. $\square$

The proof characterizes the null space of $B^T$ directly: the only “harmonic” signed vertex-labellings are those that alternate $\pm 1$ on bipartition classes, one degree of freedom per bipartite component.

The key matrix identity $BB^T = D(X) + A(X)$ (where $D$ is the diagonal degree matrix) and $B^T B = 2I + A(L(X))$ connect the incidence matrix to both the adjacency matrix and the line graph.

Definition (Laplacian). The Laplacian matrix is $L(X) = D(X) - A(X)$.

The Laplacian might initially look like a slight variation on the adjacency matrix, but it is far more — it encodes the graph’s connectivity, flow, and expansion properties in a single matrix, and its eigenvalues govern everything from random walks to network robustness.

Proposition. $L(X)$ is positive semidefinite, with \[x^T L(X) x = \sum_{uv \in E} (x_u - x_v)^2 \geq 0.\]

The Laplacian measures the “roughness” of a function on the graph: $x^T L x$ is large when adjacent vertices have very different $x$-values. The smallest eigenvalue of $L$ is always 0 (with eigenvector $\mathbf{1}$), and:

Proposition. The second smallest Laplacian eigenvalue $\mu_2 > 0$ if and only if $X$ is connected. For $k$-regular $X$, $\mu_i = k - \lambda_{n+1-i}$ where $\lambda_1 \geq \cdots \geq \lambda_n$ are the adjacency eigenvalues.

5.3 Symmetric Matrices and the Spectral Theorem

Since $A(X)$ is real symmetric, the spectral theorem applies in full. The key properties are:

Eigenvectors for distinct eigenvalues are orthogonal.
All eigenvalues are real.
$\mathbb{R}^n$ has an orthonormal eigenvector basis.
$A = P D P^T$ for orthogonal $P$, giving $A = \sum_{i=1}^n \lambda_i v_i v_i^T$.

5.4 The Courant-Fisher Theorem

The Courant-Fisher theorem characterises eigenvalues as solutions to min-max optimisation problems over the Rayleigh quotient — a perspective that connects spectral theory to variational calculus and turns eigenvalue bounds into optimization problems.

Definition (Rayleigh Quotient). For $x \in \mathbb{R}^n \setminus \{0\}$ and symmetric $A$, the Rayleigh quotient is $R(x,A) = x^T A x / x^T x$.

The Rayleigh quotient measures, for a given direction $x$, how much the matrix $A$ “stretches” in that direction. Maximizing it gives the largest eigenvalue; more generally, a min-max characterization gives every eigenvalue.

Theorem (Courant-Fisher). Let $A$ be a real symmetric $n \times n$ matrix with eigenvalues $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_n$. For all $k \in [n]$: \[\lambda_k = \max_{S \subseteq \mathbb{R}^n,\, \dim S = k} \min_{x \in S,\, x \neq 0} \frac{x^T A x}{x^T x}.\]

Proof. Let $f_1, \ldots, f_n$ be an orthonormal eigenvector basis. For $S = \mathrm{span}\{f_1,\ldots,f_k\}$, the minimum of the Rayleigh quotient over $S$ is $\lambda_k$ (attained at $f_k$), giving $\geq \lambda_k$. For any other $k$-dimensional $S$, intersecting with $\mathrm{span}\{f_k,\ldots,f_n\}$ (dimension $n-k+1$) gives a nonzero vector with Rayleigh quotient $\leq \lambda_k$, so the minimum is $\leq \lambda_k$. $\square$

The proof uses a dimension-counting argument: any $k$-dimensional subspace must intersect the bottom $(n-k+1)$-dimensional eigenspace nontrivially, providing a vector with Rayleigh quotient at most $\lambda_k$. This intersection argument is the core of many spectral bounds in combinatorics.

5.5 The Maximum Eigenvalue

Among all eigenvalues, the largest — called the spectral radius — has the most direct combinatorial interpretation: it measures how well-connected the graph is on average, and bounds the chromatic number from above.

Theorem. Let $\bar{d}$ be the average degree and $\Delta$ the maximum degree of $X$. Then $\bar{d} \leq \lambda_1 \leq \Delta$.

Proof. The lower bound follows by evaluating the Rayleigh quotient at $x = \mathbf{1}$: $\lambda_1 \geq \mathbf{1}^T A \mathbf{1}/n = 2|E|/n = \bar{d}$. For the upper bound, if $f$ is the principal eigenvector and $u$ attains the maximum of $f$, then $\lambda_1 f(u) = (Af)(u) = \sum_{v \sim u} f(v) \leq \deg(u) f(u) \leq \Delta f(u)$. $\square$

The lower bound uses the constant vector as a test vector in the Rayleigh quotient; the upper bound localizes at the vertex of maximum eigenvector weight. Together, they sandwich $\lambda_1$ between two natural degree-based quantities.

Theorem (Wilf, 1967). The chromatic number satisfies $\chi(X) \leq \lfloor \lambda_1 \rfloor + 1$.

Proof. By induction on $n$. Since the average degree is $\leq \lambda_1$, some vertex $u$ has degree $\leq \lfloor \lambda_1 \rfloor$. Removing $u$, the induced subgraph has maximum eigenvalue $\leq \lambda_1$ (by interlacing), so by induction is $(\lfloor\lambda_1\rfloor+1)$-colourable. Reintroduce $u$: it has at most $\lfloor\lambda_1\rfloor$ neighbours, so one colour remains available. $\square$

This is a spectacular result: a purely spectral quantity (the largest eigenvalue) controls the chromatic number — a property that in general requires exponential computation to determine. The proof is an inductive vertex-deletion argument, with interlacing providing the key hereditary property.

5.6 Eigenvalue Interlacing

Interlacing theorems are the spectral analogue of the inclusion-exclusion principle: when a graph is reduced by deleting a vertex or restricting to an induced subgraph, the eigenvalues of the smaller graph are interleaved with those of the larger graph. This is one of the most powerful techniques in spectral graph theory.

Theorem (Cauchy Interlacing). Let $A$ be symmetric $n \times n$ and $B$ an $(n-1)\times(n-1)$ principal submatrix. If $\alpha_1 \geq \cdots \geq \alpha_n$ and $\beta_1 \geq \cdots \geq \beta_{n-1}$ are their eigenvalues, then \[\alpha_1 \geq \beta_1 \geq \alpha_2 \geq \beta_2 \geq \cdots \geq \alpha_{n-1} \geq \beta_{n-1} \geq \alpha_n.\]

Interlacing shows that removing a vertex from a graph cannot decrease the minimum eigenvalue or increase the maximum eigenvalue. In other words, the extreme eigenvalues are “monotone” under induced subgraph taking — a fact used constantly in spectral arguments. Combined with the positive semidefinite matrix argument, this gives:

Lemma. If $Y$ is an induced subgraph of $X$, then $\lambda_{\min}(X) \leq \lambda_{\min}(Y) \leq \lambda_{\max}(Y) \leq \lambda_{\max}(X)$.

5.7 Perron-Frobenius Theorem

The Perron-Frobenius theorem is the spectral theorem of nonnegative matrix theory, and for connected graphs it reveals a striking rigidity: the largest eigenvalue is simple, achieved by a strictly positive eigenvector, and its magnitude controls the symmetry of the spectrum.

Theorem (Perron-Frobenius). Let $X$ be connected and $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_n$ be the eigenvalues of $A(X)$. Then:

$\lambda_1$ has a strictly positive eigenvector.
$\lambda_1 > \lambda_2$ (the largest eigenvalue is simple).
$\lambda_1 \geq -\lambda_n$, with equality if and only if $X$ is bipartite.

Proof. (a) Let $f$ be a unit eigenvector of $\lambda_1$ and $f^+ = |f|$. By the Courant-Fisher theorem, $\lambda_1 = f^T A f \leq (f^+)^T A f^+ \leq \lambda_1$, forcing $f^+$ to also be a maximiser — hence an eigenvector of $\lambda_1$. If $f^+(u) = 0$ for some $u$, connectivity yields an edge $vw$ with $f^+(v) = 0 < f^+(w)$, contradicting $\sum_{w \sim v} f^+(w) = \lambda_1 f^+(v) = 0$.

(b) If $\lambda_2 = \lambda_1$, then any eigenvector $g$ of $\lambda_2$ would also have $|g|$ strictly positive. But $g \perp f^+$ forces $g$ to change sign, and connectivity gives an edge where $g(u)g(v) < 0 \neq |g(u)||g(v)|$, contradicting the equality case.

(c) The inequality $|\lambda_n| \leq \lambda_1$ follows from the triangle inequality on the Rayleigh quotient. For bipartite $X$ with bipartition $S, T$: if $f$ is an eigenvector of $\lambda$, define $\bar{f}(u) = f(u)$ for $u \in S$ and $\bar{f}(u) = -f(u)$ for $u \in T$. Then $A\bar{f} = -\lambda \bar{f}$, so $-\lambda$ is an eigenvalue, giving symmetry of the spectrum around 0. The converse is similar. $\square$

Eigenvalue spectrum of the Petersen graph

Eigenvalue spectrum of the Petersen graph: eigenvalue 3 (multiplicity 1), eigenvalue 1 (multiplicity 5), eigenvalue $-2$ (multiplicity 4). The Petersen graph is 3-regular, and $\lambda_1 = 3$ confirms regularity.

Eigenvalue spectra of K₄, C₆, and Petersen graph compared

Comparing spectra of $K_4$, $C_6$, and the Petersen graph. $K_4$ has one large eigenvalue; $C_6$’s spectrum is symmetric (bipartite complement); the Petersen graph has a characteristic gap between $\lambda_1=3$ and $\lambda_2=1$.

5.8 Strongly Regular Graphs

The spectrum of a graph encodes a great deal of information, but for a very special class of graphs — the strongly regular graphs — the spectrum is essentially determined by the combinatorial parameters and vice versa. This makes strongly regular graphs both highly structured and highly constrained.

Why are strongly regular graphs (SRGs) so interesting? They occupy a remarkable middle ground in the landscape of graphs. A random graph looks different from every vertex and every pair of vertices; a complete graph looks the same but is trivially uniform. Strongly regular graphs are neither: they are highly non-trivial, yet they impose an iron uniformity on every pair of vertices simultaneously. This tension between structure and non-triviality makes them the central objects of algebraic combinatorics.

They arise naturally in many areas of mathematics. In finite geometry, the points of a projective or polar space form an SRG. In coding theory, the minimum-weight codewords of a linear code often form the vertices of an SRG, with the graph encoding the combinatorial structure of the code. In design theory, every symmetric 2-design gives rise to an SRG on its points. In group theory, the orbital graphs of rank-3 permutation groups are always strongly regular — this is one of the central reasons that the classification of rank-3 groups (a chapter in the classification of finite simple groups) is intimately tied to the theory of SRGs. And in spectral graph theory, SRGs provide the cleanest examples of the three-eigenvalue phenomenon we are about to prove.

The parameters $(n, k, a, c)$ are simple to state but severely constraining. Most tuples $(n, k, a, c)$ satisfying even obvious necessary conditions — the parameter equation, the integrality of eigenvalue multiplicities — turn out not to be realizable by any actual graph. Determining whether a given parameter set is realizable, and if so, how many non-isomorphic SRGs have those parameters, is one of the central open problems in algebraic combinatorics. Some parameter sets admit a unique graph (up to isomorphism); others admit thousands; some admit none at all. This unpredictability is part of what makes the subject so rich and difficult.

Definition (Strongly Regular). A graph $X$ is strongly regular with parameters $(n,k,a,c)$ if: $|V(X)| = n$, $X$ is $k$-regular, every adjacent pair shares exactly $a$ common neighbours, and every non-adjacent pair shares exactly $c$ common neighbours.

The definition imposes a remarkable uniformity: not only is the graph regular (every vertex has the same degree), but every pair of vertices has the same number of common neighbours — a property depending only on whether they are adjacent. This is a dramatically stronger condition than regularity alone.

Example 7.1.1. $C_5$ is $(5,2,0,1)$-strongly regular. The Petersen graph is $(10,3,0,1)$-strongly regular.

Example 7.1.4. The line graph of $K_n$ is $\bigl(\binom{n}{2}, 2n-4, n-2, 4\bigr)$-strongly regular. The line graph of $K_{n,n}$ is $(n^2, 2n-2, n-2, 2)$-strongly regular.

The complement of a strongly regular graph is also strongly regular.

Proposition 7.1.2. Suppose $X$ is $(n,k,a,c)$ strongly regular. Then $\bar{X}$ is $(n, \bar{k}, \bar{a}, \bar{c})$ strongly regular, where $\bar{k} = n-1-k$, $\bar{a} = n-2-2k+c$, and $\bar{c} = n-2k+a$.

A strongly regular graph is primitive if both it and its complement are connected.

Definition 7.1.2 (Primitive SRG). A strongly regular graph $X$ is primitive if both $X$ and $\bar{X}$ are connected. Otherwise it is imprimitive.

Lemma 7.1.3. Let $X$ be $(n,k,a,c)$ strongly regular. The following are equivalent: (a) $X$ is disconnected; (b) $c = 0$; (c) $a = k-1$; (d) $X \cong mK_{k+1}$ for some $m$.

Proof. (a)⟹(b): Vertices in different components share no common neighbours. (b)⟹(c): If $c = 0$ and $v, w \in N(u)$, then $u \in N(v) \cap N(w)$, so by $c = 0$ we need $v \sim w$, giving $a = k-1$. (c)⟹(d) and (d)⟹(a) are immediate. $\square$

The complement of a strongly regular graph is also strongly regular, which immediately doubles our supply of examples and tells us that SRGs come in complementary pairs.

The four parameters $(n,k,a,c)$ are not independent — they satisfy a necessary integrality and consistency condition. Think of this as a bookkeeping identity: fixing a vertex $u$ and counting the edges between its neighbourhood $N(u)$ and its non-neighbourhood $\bar{N}(u) = V \setminus (N(u) \cup \{u\})$ in two ways must give the same answer. This kind of double-counting argument is one of the foundational techniques of combinatorics, and here it immediately yields a strong constraint.

Proposition (Parameter Relation). A $(n,k,a,c)$ strongly regular graph satisfies $c(n-k-1) = k(k-a-1)$.

Proof. Fix $u \in V$. Count edges between $N(u)$ and $V \setminus (N(u) \cup \{u\})$ in two ways: each non-neighbour contributes $c$ edges, giving $c(n-k-1)$; each neighbour has $k-1-a$ neighbours in $V \setminus (N(u) \cup \{u\})$, giving $k(k-1-a)$. $\square$

The proof is a clean double-counting of a bipartite graph between the neighbourhood and non-neighbourhood of a fixed vertex. This relation is a useful first check: any putative parameter set $(n,k,a,c)$ failing this identity cannot be realized by a strongly regular graph.

Now we arrive at the spectral core of the theory. The key algebraic identity for SRGs is that $A^2 = kI + aA + c(J-I-A)$, which comes directly from the definition: the $(u,v)$-entry of $A^2$ counts walks of length 2 from $u$ to $v$, and for an SRG this count depends only on whether $u \sim v$ (giving $a$) or $u \not\sim v$ (giving $c$). This matrix equation constrains the minimal polynomial of $A$, and that constraint forces there to be at most three eigenvalues.

Lemma. For a non-complete strongly regular graph with parameters $(n,k,a,c)$, the adjacency matrix has exactly three distinct eigenvalues: $k$, and the two roots $\theta, \tau$ of $\lambda^2 - (a-c)\lambda - (k-c) = 0$. Their multiplicities are: \[m_\theta = \tfrac{1}{2}\!\left[(n-1) - \frac{2k+(n-1)(a-c)}{\sqrt{\Delta}}\right], \quad m_\tau = \tfrac{1}{2}\!\left[(n-1) + \frac{2k+(n-1)(a-c)}{\sqrt{\Delta}}\right],\] where $\Delta = (a-c)^2 + 4(k-c)$.

Proof. The key identity is $A^2 = kI + aA + c(J - I - A)$, giving $A^2 - (a-c)A - (k-c)I = cJ$. Any eigenvector $v$ orthogonal to $\mathbf{1}$ satisfies $\lambda^2 - (a-c)\lambda - (k-c) = 0$, so $\lambda \in \{\theta, \tau\}$. The multiplicities follow from $m_\theta + m_\tau = n-1$ and $k + m_\theta \theta + m_\tau \tau = \mathrm{tr}\, A = 0$. $\square$

Conversely, a connected regular graph with exactly three distinct eigenvalues is strongly regular — and this converse is also true. This is a beautiful rigidity result: the spectral data (number of distinct eigenvalues) forces the combinatorial data (exact common-neighbour counts). The three eigenvalues and the SRG parameters completely determine each other, making SRGs one of the most tightly controlled families in all of combinatorics.

Lemma 7.2.2. A connected regular graph with exactly 3 distinct eigenvalues is strongly regular.

Proof. Suppose $X$ is $k$-regular and connected with eigenvalues $k, \theta, \tau$. Set $M = (k-\theta)^{-1}(k-\tau)^{-1}(A-\theta I)(A-\tau I)$. By Perron-Frobenius, $M$ has eigenvalue 1 with eigenvector $\mathbf{1}$ and eigenvalue 0 with multiplicity $n-1$, so $M = \frac{1}{n}J$. Expanding: $A^2 = (\theta+\tau)A - \theta\tau I + (k-\theta)(k-\tau)\frac{1}{n}J$. The entries of $A^2$ then depend only on adjacency, so $X$ is strongly regular. $\square$

Theorem 7.2.3. A connected regular graph that is not complete has exactly 3 distinct eigenvalues if and only if it is strongly regular.

The requirement that $m_\theta, m_\tau$ be positive integers is a strong integrality constraint on the parameters $(n,k,a,c)$. It rules out vast families of parameter tuples that otherwise satisfy the parameter relation. When $\Delta = (a-c)^2 + 4(k-c)$ is a perfect square, the multiplicities are automatically integers. When it is not, the multiplicities can still be integers but the constraint becomes more subtle — this is the case of “irrational” SRGs, which are particularly hard to construct or rule out, and where the interplay between number theory and combinatorics is most vivid.

5.9 Paley Graphs

The Paley graphs are a remarkable family that sits at the intersection of number theory, algebra, and combinatorics. They are constructed from finite fields and provide concrete examples of strongly regular, arc-transitive, self-complementary graphs — a rare combination of desirable properties.

Definition (Paley Graph). For a prime power $q \equiv 1 \pmod 4$, the Paley graph $P(q)$ has vertex set $\mathrm{GF}(q)$ and edge set $\{uv : u - v \text{ is a nonzero square in } \mathrm{GF}(q)\}$.

The condition $q \equiv 1 \pmod 4$ ensures $-1$ is a square in $\mathrm{GF}(q)$, which is precisely what makes the edge relation symmetric: $u - v$ is a square if and only if $v - u = -(u-v)$ is a square. Without this condition, the graph would be a tournament rather than an undirected graph.

The condition $q \equiv 1 \pmod 4$ ensures that $-1$ is a square in $\mathrm{GF}(q)$, making the edge relation symmetric. Paley graphs are $\frac{q-1}{2}$-regular and arc-transitive.

Theorem 7.3.3. $P(q)$ is self-complementary.

Proof. Let $a \in \mathrm{GF}(q)$ be a non-square. The map $\sigma : x \mapsto ax$ is an isomorphism from $P(q)$ to $\overline{P(q)}$: since $a$ is a non-square, $a(u-v)$ is a square iff $u - v$ is a non-square, so $\sigma$ maps edges to non-edges and vice versa. $\square$

Theorem 7.3.4. $P(q)$ is strongly regular with parameters $\bigl(q,\, \frac{q-1}{2},\, \frac{q-5}{4},\, \frac{q-1}{4}\bigr)$.

Beyond Paley graphs, there are several other important families of strongly regular graphs.

Definition 7.4.1 (Lattice Graph). The lattice graph $L_n$ has $V(L_n) = [n] \times [n]$ and edges $(a,b)(a',b')$ whenever $a = a'$ or $b = b'$. It is $(n^2, 2(n-1), n-2, 2)$-strongly regular.

Definition 7.4.2 (Latin Square). An $n \times n$ Latin square is an $n \times n$ matrix whose rows and columns are each permutations of $[n]$.

Definition 7.4.3 (Latin Square Graph). Let $A$ be an $n \times n$ Latin square. The Latin square graph $X(A)$ has $V = [n] \times [n]$ and $E = \{(a,b)(a',b') : a = a' \text{ or } b = b' \text{ or } A_{ab} = A_{a'b'}\}$. It is $(n^2, 3(n-1), n, 6)$-strongly regular.

The lattice graph $L_n$ is the special case where the Latin square is the addition table of $\mathbb{Z}_n$.

5.10 Expanders and the Second Eigenvalue

The second eigenvalue of the Laplacian, $\mu_2$, measures how well-connected a graph is beyond mere connectivity. A graph with large $\mu_2$ is a good expander: every small set of vertices has many edges leaving it.

Expanders are among the most important objects in theoretical computer science and combinatorics. The motivation is practical: in communication networks, you want a graph that is simultaneously sparse (cheap to build, few connections per node) and highly connected (messages can propagate quickly, with no “bottlenecks” where traffic is forced through a tiny cut). These two desiderata are in apparent tension — sparse graphs should have poor connectivity — but expanders achieve both simultaneously. The spectral gap $\mu_2$ is the mathematical measure of expansion: a large gap means no small cut exists relative to the set’s size. The Cheeger inequality below makes this precise, relating a purely algebraic quantity (an eigenvalue) to a purely combinatorial one (the minimum boundary-to-volume ratio).

Definition (Isoperimetric Ratio). For $S \subseteq V(X)$, the isoperimetric ratio is $\theta(S) = e(S, \bar S)/|S|$. The isoperimetric number of $X$ is $\theta(X) = \min_{|S| \leq |V|/2} \theta(S)$. $X$ is an $\alpha$-expander if $\theta(X) \geq \alpha$.

Theorem. $\theta(X) \geq \mu_2/2$.

Proof. For $S \subseteq V$ with $|S| \leq n/2$, set $\alpha = |S|/n$ and $x = \chi_S - \alpha \mathbf{1}_n$. Then $x \perp \mathbf{1}$ and $\|x\|^2 = \alpha n(1-\alpha)$. By the Courant-Fisher theorem: \[\mu_2 \leq \frac{x^T L x}{x^T x} = \frac{1}{\alpha n(1-\alpha)} \sum_{uv \in E} (x_u - x_v)^2 = \frac{e(S,\bar{S})}{\alpha n(1-\alpha)} = \frac{\theta(S)}{1-\alpha} \leq 2\theta(S).\] Taking the minimum over $S$ gives $\theta(X) \geq \mu_2/2$. $\square$

This inequality — sometimes called the Cheeger inequality for graphs — is one of the most important results in spectral graph theory. It says that a graph with a large spectral gap $\mu_2$ is necessarily a good expander. The proof is a clean application of Courant-Fisher: choose a test vector that is the characteristic function of $S$, shifted to be orthogonal to $\mathbf{1}$, and evaluate the Rayleigh quotient for the Laplacian.

The converse direction — that expanders have large $\mu_2$ — is also true (and slightly harder): $\mu_2 \geq \theta(X)^2 / 2k$ for $k$-regular graphs. Together, the two directions say that $\mu_2$ and $\theta(X)$ are within a polynomial factor of each other, making the spectral gap and the combinatorial expansion equivalent up to polynomial losses.

The study of graph families with simultaneously large degree and large $\mu_2$ is the theory of Ramanujan graphs. A $k$-regular Ramanujan graph achieves the optimal spectral gap: $\lambda_2 \leq 2\sqrt{k-1}$. This bound, proved by the Alon-Boppana theorem, is optimal — no sequence of $k$-regular graphs can do better. Explicit constructions of Ramanujan graphs (by Lubotzky-Phillips-Sarnak and Margulis) use deep number theory (the Ramanujan conjecture, now a theorem of Deligne) and show that the worlds of spectral graph theory, finite group theory, and analytic number theory are intimately connected.

Chapter 6: Generalized Polygons

Incidence geometry provides a rich source of highly symmetric graphs. In this chapter we study incidence structures, projective planes, and their associated bipartite graphs (incidence graphs). The key insight is that projective planes correspond exactly to bipartite graphs of diameter 3 and girth 6.

Up to now, our graphs have come from groups (Cayley graphs, circulants), from set combinatorics (Johnson graphs), or from matrix spectra (strongly regular graphs). This chapter opens a completely different door: classical incidence geometry. The objects here — projective planes, generalized polygons — were studied by geometers long before graph theory existed, and yet they produce some of the most symmetric, most structurally rigid graphs known.

The central idea is to convert a geometric system (points and lines with an incidence relation) into a graph, called the incidence graph or Levi graph. This translation is lossless: every property of the geometry can be read off from the graph, and vice versa. The payoff is that powerful graph-theoretic and algebraic tools — automorphism groups, eigenvalue bounds, girth-diameter relations — all become available to study geometric structures.

Projective planes in particular sit at a remarkable intersection: they arise from finite fields (the construction $PG(2,q)$ below), they are characterized by a beautifully clean diameter-girth condition on their incidence graphs, and their automorphism groups turn out to be linear groups over the field. Understanding this chapter means understanding why the geometry of finite fields and the combinatorics of graphs are two faces of the same mathematical object.

6.1 Incidence Structures and Incidence Graphs

Definition 4.1.1 (Incidence Structure). Let $P$ be a set of points, $\mathcal{L}$ a set of lines, and $I \subseteq P \times \mathcal{L}$ an incidence relation. The triple $\mathcal{I} = (P, \mathcal{L}, I)$ defines an incidence structure. If $(p, L) \in I$, we say point $p$ and line $L$ are incident.

The dual $\mathcal{I}^* = (\mathcal{L}, P, I^*)$ where $I^* = \{(L, p) : (p,L) \in I\}$ swaps the roles of points and lines. Many important incidence structures are self-dual. Duality is a deep organizing principle: a theorem about points and lines automatically yields a theorem about lines and points, at no extra cost.

To bring graph-theoretic tools to bear on incidence structures, we encode the geometry as a bipartite graph.

Definition 4.1.2 (Incidence Graph). Given an incidence structure $\mathcal{I} = (P, \mathcal{L}, I)$, the incidence graph $X(\mathcal{I})$ is the bipartite graph with vertex set $P \cup \mathcal{L}$ and edge set $\{pL : (p,L) \in I\}$. We have $X(\mathcal{I}) \cong X(\mathcal{I}^*)$.

The incidence graph is the Rosetta Stone of this chapter: geometric properties of $\mathcal{I}$ (how many points lie on a line, how many lines pass through a point, when two lines meet) translate directly into graph-theoretic properties (regularity, degree, girth, diameter). The bipartiteness is automatic: edges always run between the point-side and the line-side, never within either side. An automorphism of $\mathcal{I}$ that preserves point-and line-types becomes exactly a graph automorphism that preserves the bipartition. Translating incidence geometry into graph theory allows us to apply spectral and automorphism-group techniques.

Definition 4.1.3 (Automorphism of Incidence Structure). An automorphism of $\mathcal{I} = (P, \mathcal{L}, I)$ is a permutation $\sigma$ of $P \cup \mathcal{L}$ with $P^\sigma = P$ and $\mathcal{L}^\sigma = \mathcal{L}$ such that $(p^\sigma, L^\sigma) \in I \iff (p,L) \in I$.

Notice that the automorphism group of $\mathcal{I}$ embeds naturally into the automorphism group of its incidence graph $X(\mathcal{I})$. This is one of the key tools we will exploit: by studying the incidence graph’s symmetry, we learn about the symmetry of the underlying geometry.

There is a subtle point here that is easy to miss: the automorphism group of $X(\mathcal{I})$ may be strictly larger than the automorphism group of $\mathcal{I}$. The incidence graph is a bipartite graph, and an automorphism of the graph might swap the two sides — swapping all points with all lines. This corresponds to a duality of the incidence structure, a concept that is meaningful only for self-dual geometries. For projective planes, which are self-dual, these “improper” automorphisms (dualities) are geometrically meaningful and important. For non-self-dual geometries, they are graph automorphisms that have no geometric interpretation. This distinction will matter when we analyze the automorphism group of $PG(2,q)$.

6.2 Partial Linear Spaces

Before reaching the full definition of a projective plane, we introduce the key regularity axiom that geometries must satisfy. In a “nice” incidence structure, any two points should determine at most one line — just as in the Euclidean plane, two points determine a unique line. This is the defining property of a partial linear space.

Definition 4.1.4 (Partial Linear Space). An incidence structure $\mathcal{I} = (P, \mathcal{L}, I)$ is a partial linear space if for any two distinct points $x, y \in P$, there exists at most one line $L \in \mathcal{L}$ incident with both.

Lemma 4.1.1. In a partial linear space, any two distinct lines are incident with at most one common point.

This lemma says that the “at most one” condition is symmetric in points and lines — it holds for lines as well as points. This is the first hint that duality is a deep feature of the theory.

Two lines incident with a common point are called concurrent. The girth constraint on the incidence graph captures the partial linearity condition exactly. This is the graph-theoretic translation of the geometric axiom, and it is the key lemma that allows us to pass freely between the geometric and graph-theoretic perspectives.

Lemma 4.1.2. If $\mathcal{I}$ is a partial linear space, then the incidence graph $X(\mathcal{I})$ has girth at least 6.

Proof. $X(\mathcal{I})$ is bipartite, so it has no odd cycles. A 4-cycle would correspond to two points both incident with two lines, contradicting the partial linearity condition. Hence the girth is at least 6. $\square$

The proof is a one-sentence geometric-to-combinatorial translation: a 4-cycle in the incidence graph says precisely that two points lie on two common lines, which is forbidden. This establishes a clean lower bound on the girth of incidence graphs of “reasonable” geometries — and projective planes will turn out to achieve the girth exactly as small as possible while also being as “connected” as possible.

This girth-6 lower bound is part of a general phenomenon in extremal graph theory: the girth of an incidence graph is tightly controlled by the “complexity” of the geometry. Affine planes (projective planes with one line removed) have incidence graphs of girth 6 and diameter 4; projective planes achieve diameter 3, which is optimal. This optimality — girth 6 and diameter 3 simultaneously — is precisely the Moore bound for bipartite graphs of girth 6, and it is one of the most striking manifestations of the connection between geometry and extremal combinatorics.

6.3 Projective Planes

We now arrive at the central geometric object of this chapter. A projective plane is what you get when you take the ordinary Euclidean plane and add “points at infinity” to ensure that every pair of lines meets — eliminating the special case of parallel lines. The result is a geometry so symmetric that it is, in a precise sense, the simplest non-trivial geometry.

Why should algebraic graph theorists care about projective planes? There are several reasons. First, their incidence graphs are among the most highly symmetric bipartite graphs known — they are arc-transitive, and their automorphism groups are classical linear groups. Second, these incidence graphs achieve an extremal property: for a given vertex degree (corresponding to the order of the plane), they simultaneously maximize girth and minimize diameter among all bipartite graphs — a combinatorial miracle called the Moore bound. Third, the eigenvalues of the incidence graph can be computed explicitly from the order of the plane, and these eigenvalues control expansion and mixing properties that have applications to coding theory and computer science.

Definition 4.2.1 (Projective Plane). A projective plane is a partial linear space $\mathcal{I} = (P, \mathcal{L}, I)$ satisfying:

(C1) Every two distinct lines meet in a unique point.
(C2) Every two distinct points lie on a unique line.
(C3) There exist 3 noncollinear points (a triangle).

Conditions (C1) and (C2) are exact duals of each other — swapping “points” and “lines” converts one into the other. This is the principle of projective duality: every theorem about a projective plane yields a dual theorem for free, by swapping points and lines throughout. Condition (C3) is a nondegeneracy condition that rules out trivial “planes” consisting of a single line with all points on it.

Let us pause to appreciate why axiom (C1) — any two distinct lines meet in a unique point — represents such a clean conceptual move from the Euclidean setting. In the ordinary Euclidean plane, two lines either meet in a unique point or are parallel. The existence of parallel lines is an anomaly — a special case that forces geometric arguments to split into cases. Projective geometry eliminates this anomaly by postulating that “points at infinity” exist, one for each direction, and that parallel lines meet at the corresponding point at infinity. The projective plane is the completion of the Euclidean plane under this extension, and it is dramatically more symmetric: there are no special lines, no special points, and every pair of lines meets exactly once.

The finite case captures the same spirit: in $PG(2,q)$, every two lines meet in exactly one point. The classical Euclidean messiness of parallelism simply disappears. What remains is a perfectly symmetric geometry on $q^2+q+1$ points, where every line contains exactly $q+1$ points and every point lies on exactly $q+1$ lines. This symmetry is so complete that the geometry looks identical from the perspective of any incident point-line pair — the defining property of arc-transitivity, which we will prove in Section 6.4.

A useful consequence: if a projective plane has any line with $n+1$ points on it, then every line has exactly $n+1$ points and every point lies on exactly $n+1$ lines. The integer $n$ is called the order of the projective plane, and the total number of points (and lines) is $n^2 + n + 1$. For $n=2$, this gives the Fano plane — 7 points and 7 lines, every line containing 3 points, every point on 3 lines. It is the smallest projective plane and one of the most important combinatorial objects in mathematics.

The following theorem gives a clean graph-theoretic characterisation of projective planes, connecting combinatorial geometry to spectral graph theory. It is the payoff of the entire incidence-graph translation program: geometric axioms on points and lines become conditions on diameter and girth.

Theorem 4.2.1. Let $\mathcal{I}$ be a partial linear space containing a triangle. Then $\mathcal{I}$ is a projective plane if and only if $X(\mathcal{I})$ has diameter 3 and girth 6.

Proof. $(\Rightarrow)$ In a projective plane: any two points are at distance 2 in $X(\mathcal{I})$, and any two lines are at distance 2. For a point $p$ and line $L$ not incident, there is a unique line through $p$ meeting $L$, so $d(p,L) = 3$. The triangle gives a 6-cycle (girth exactly 6) and the diameter is exactly 3. $(\Leftarrow)$ Diameter 3 and girth 6 in a bipartite graph force any two vertices of the same part to be at distance exactly 2, giving (C1) and (C2). The 6-cycle gives a triangle, yielding (C3). $\square$

This theorem is remarkable: it says that the axioms of projective geometry — expressed in terms of points, lines, and incidence — are equivalent to two simple numerical conditions on a bipartite graph. The girth condition (at least 6) encodes the partial linearity; the diameter condition (exactly 3) encodes that any two elements are “connected” in the strongest possible sense. Graphs achieving both simultaneously are called cages in this context, and projective planes give the unique family of regular bipartite cages of diameter 3 — the so-called (q+1,6)-cages. This connects incidence geometry to the famous cage problem in extremal graph theory.

The theorem also reveals why the incidence graph is the right object to study, rather than some other graph derived from the geometry. The two key axioms of a projective plane — (C1) uniqueness of line through two points, (C2) uniqueness of intersection point of two lines — become the two key numerical conditions on the graph. The translation is lossless, and working with the graph gives us access to the entire toolkit of algebraic graph theory: degree sequences, girth bounds, spectral methods, automorphism groups. All of these will be deployed in what follows.

A word on what is not yet known: projective planes need not arise from finite fields. There exist projective planes of order $n$ for all prime powers $n$, but it is an open problem whether any projective plane of non-prime-power order exists. The order-10 case was settled by an exhaustive computer search (Lam, 1991) — no projective plane of order 10 exists. For other non-prime-power orders, the question remains wide open. This is one of the most tantalizing open problems in finite combinatorics, and it illustrates how constrained these geometric structures really are.

6.4 A Family of Projective Planes: $PG(2,q)$

We have seen that projective planes exist (at least in the abstract sense of the axioms), but do concrete, explicit families exist? The answer is yes, and the construction uses finite fields in a beautiful way.

Fix a prime power $q$ and the finite field $\mathbb{F}_q$. Let $V = \mathbb{F}_q^3$. The projective plane $PG(2,q)$ is the incidence structure where $P$ consists of all 1-dimensional subspaces of $V$, $\mathcal{L}$ consists of all 2-dimensional subspaces, and incidence is subspace containment.

The notation “$PG$” stands for “projective geometry.” The construction is the finite analogue of the real projective plane $\mathbb{RP}^2$: instead of lines through the origin in $\mathbb{R}^3$, we take lines through the origin in $\mathbb{F}_q^3$. The finite field replaces the real numbers, and the result is a finite combinatorial object with all the essential properties of a projective plane.

Since $|\mathbb{F}_q^3 \setminus \{0\}| = q^3 - 1$ and each 1-dimensional subspace contains $q-1$ nonzero vectors, we get $|P| = |{\mathcal{L}}| = 1 + q + q^2$. Each line contains $1 + q$ points and each point lies on $1 + q$ lines, so $X(PG(2,q))$ has $2(q^2+q+1)$ vertices and is $(q+1)$-regular. The case $q = 2$ gives the Fano plane.

The counting here is a clean exercise in linear algebra over finite fields: the number of nonzero vectors in $\mathbb{F}_q^3$ is $q^3-1$, and dividing by $q-1$ (the size of each nonzero scalar multiple class) gives the number of 1-dimensional subspaces. The same argument for 2-dimensional subspaces — via their annihilators in the dual space — shows that $|P| = |\mathcal{L}|$, confirming the self-duality of $PG(2,q)$.

This self-duality is not a coincidence: in $PG(2,q)$, there is a canonical bijection between points and lines given by orthogonal complementation (using the standard inner product on $\mathbb{F}_q^3$). The point $\mathrm{span}\{u\}$ is dual to the line $\{v : u \cdot v = 0\} = u^\perp$. This duality is a fundamental symmetry of the construction and explains why so many results about $PG(2,q)$ come in symmetric pairs.

Now let us verify that this construction actually satisfies the axioms of a projective plane. The proof is a direct exercise in linear algebra over $\mathbb{F}_q$ — the three axioms translate into three standard linear-algebraic facts about subspaces of a 3-dimensional vector space over a finite field.

Lemma 4.3.1. $PG(2,q)$ is a projective plane.

Proof. (C1): For two distinct 2-dimensional subspaces $L_1, L_2$, we have $\dim(L_1 \cap L_2) = 1$ (since $\dim V = 3$), giving a unique intersection point. (C2): Two distinct 1-dimensional subspaces $\mathrm{span}\{u\}$ and $\mathrm{span}\{v\}$ (with $v \notin \mathrm{span}\{u\}$) lie in a unique 2-dimensional subspace $\mathrm{span}\{u,v\}$. (C3): Three linearly independent vectors span all of $V$, so their spans are noncollinear. $\square$

The automorphism group of $PG(2,q)$ is extraordinarily rich, and understanding it is one of the central goals of the classical theory of projective geometry. The general linear group $GL(3,q)$ acts on $V$ and induces automorphisms of the projective plane. But since scalar multiples of the identity act trivially on subspaces, the “effective” group of linear automorphisms is the projective linear group $PGL(3,q) = GL(3,q)/Z$, where $Z$ is the scalar subgroup. In addition, field automorphisms of $\mathbb{F}_q$ also induce automorphisms of $PG(2,q)$, giving the full automorphism group $P\Gamma L(3,q)$. This group is a finite simple group (up to small modifications), and its structure is one of the crown jewels of the classification of finite simple groups.

The key observation is that $GL(3,q)$ preserves subspace containment: if $U \subseteq W$ then $A(U) \subseteq A(W)$ for any invertible linear map $A$. Since incidence in $PG(2,q)$ is defined by subspace containment, invertible linear maps automatically preserve incidence. The next two lemmas make this precise.

Lemma 4.3.2. Every $A \in GL(3,q)$ induces a permutation on $V(X(PG(2,q)))$, sending 1-dimensional subspaces to 1-dimensional subspaces and 2-dimensional subspaces to 2-dimensional subspaces.

Lemma 4.3.3. $GL(3,q) \leq \mathrm{Aut}(PG(2,q))$.

Proof. Let $A \in GL(3,q)$. If $p = \mathrm{span}\{u\} \subseteq L = \mathrm{span}\{u,v\}$, then $p^A = \mathrm{span}\{Au\} \subseteq \mathrm{span}\{Au, Av\} = L^A$. So incidence is preserved. $\square$

The fact that $GL(3,q)$ acts as automorphisms is powerful, but the group is large and redundant: scalar matrices $\lambda I$ act trivially on every subspace (since $\mathrm{span}\{\lambda u\} = \mathrm{span}\{u\}$). The “effective” group is thus $PGL(3,q)$, which has order $q^3(q^3-1)(q^2-1)/(q-1)$. For $q = 2$ (the Fano plane), this gives $|\mathrm{Aut}(PG(2,2))| = 168$, which is the well-known simple group $PSL(2,7)$, the smallest non-abelian simple group after $A_5$.

The incidence graph of $PG(2,q)$ has the strongest possible symmetry. This is the culmination of the chapter: we started with a geometric object defined by simple linear-algebra axioms, translated it into a bipartite graph, and now discover that this graph is arc-transitive — the automorphism group acts transitively on directed edges, meaning the graph “looks the same” from every incident point-line pair.

To establish arc-transitivity, we need to show two things: (1) for any two incident pairs $(p_1, L_1)$ and $(p_2, L_2)$, there is an automorphism mapping one to the other; and (2) for any incident pair $(p, L)$, there is an automorphism mapping the arc $(p, L)$ to $(L, p)$ — i.e., reversing the direction. The first follows from the transitivity of $GL(3,q)$ on ordered bases; the second follows from the self-duality of $PG(2,q)$.

Lemma 4.3.4. $X(PG(2,q))$ is arc-transitive.

Proof. For any two incident pairs $p_1 \subset L_1$ and $p_2 \subset L_2$, write $p_i = \mathrm{span}\{u_i\}$, $L_i = \mathrm{span}\{u_i, v_i\}$. Elementary linear algebra gives $A \in GL(3,q)$ with $Au_1 = u_2$ and $Av_1 = v_2$, so $A$ maps $(p_1, L_1)$ to $(p_2, L_2)$. To see that the automorphism group also acts transitively on arcs $(L, p)$ (line to point), use the map $\pi : \mathrm{span}\{u\} \mapsto \mathrm{span}\{u\}^\perp$, which swaps points and lines while preserving incidence. $\square$

The proof illustrates the power of the linear-algebraic construction: arc-transitivity reduces to the ability of $GL(3,q)$ to map any ordered basis to any other — which is precisely the defining property of invertible linear maps. The geometry does the heavy lifting for us.

Arc-transitivity means the incidence graph of $PG(2,q)$ is, in a strong sense, the most uniform possible bipartite graph at its parameters. Combined with the girth-diameter characterization of Theorem 4.2.1, this makes $X(PG(2,q))$ an extremal object in multiple senses simultaneously: extremal in girth, extremal in diameter, extremal in symmetry. These extremal properties make it a central example in algebraic graph theory and a testing ground for conjectures about symmetric graphs.

To summarize what we have built: starting from the finite field $\mathbb{F}_q$, we construct the vector space $\mathbb{F}_q^3$ and form the projective plane $PG(2,q)$ by taking 1-dimensional and 2-dimensional subspaces as points and lines. The incidence graph $X(PG(2,q))$ is a $(q+1)$-regular bipartite graph on $2(q^2+q+1)$ vertices with girth 6 and diameter 3. It is arc-transitive with automorphism group containing $PGL(3,q)$. It achieves the Moore bound for bipartite graphs of degree $q+1$ and girth 6. It is the most symmetric, most extremal graph we have encountered in this course, and it connects finite geometry, linear algebra over finite fields, and the theory of symmetric graphs in a single beautiful object.

Chapter 7: Homomorphisms, Cores, and Products

The theory of graph homomorphisms develops a rich categorical structure on the class of all graphs. Two central notions emerge: the core of a graph (its minimal homomorphic image) and the direct product (which represents homomorphisms universally). Together they give a lattice structure on the set of graph isomorphism classes.

We encountered homomorphisms in Chapter 1, where the key example was graph coloring: a proper $k$-coloring of $X$ is precisely a homomorphism $X \to K_k$. At the time, homomorphisms were a convenient language for coloring. In this chapter, we develop them as a subject in their own right, revealing a rich algebraic structure on the entire class of graphs.

The central question is: what does the homomorphism preorder $X \to Y$ look like? Which graphs can be mapped to which? The answer involves two key ideas. First, the core of a graph: every graph $X$ can be “compressed” to a unique minimal subgraph $X^\bullet$ that captures all the homomorphic information of $X$. Two graphs are homomorphically equivalent precisely when they have isomorphic cores. This makes cores the “canonical representatives” of equivalence classes — think of them as the simplest graph in each equivalence class, the one with no redundant structure.

Second, the direct product $X \times Y$: this construction satisfies a universal property exactly analogous to the categorical product, making the class of graphs (modulo homomorphic equivalence) into a lattice. This connects graph theory to category theory in a precise way: graphs form a category, homomorphisms are morphisms, and products and coproducts exist with the expected universal properties.

The upshot is that graph homomorphisms are not merely a convenience for coloring theory — they organize all graphs into a structured hierarchy, with cores as the minimal elements of each class and the direct product giving the operation that “combines” two graphs in the most efficient way.

7.1 Basic Results on Homomorphisms

We begin by establishing the basic algebraic properties of the homomorphism relation. These are simple to state and verify, but they determine the overall structure of the theory.

Before diving into the lemmas, let us ground ourselves in the category-theoretic perspective. Graphs form a category $\mathbf{Graph}$: objects are graphs, morphisms are homomorphisms, the identity morphism is the identity map, and composition is function composition. In this language, the homomorphism relation $X \to Y$ says precisely that there is a morphism from $X$ to $Y$. The preorder structure of $\to$ is then the coarse structure of the category: which objects can be mapped to which.

Category theory tells us that this preorder structure is deeply connected to the existence of categorical products and coproducts — and indeed, we will see in Section 7.3 that the direct product of graphs is exactly the categorical product in $\mathbf{Graph}$. The coproduct is the disjoint union. This is not a formal observation but a theorem with content: it gives the direct product a universal property, which will be our main tool for analyzing the lattice of cores.

Lemma 5.1.1. The relation $\to$ is reflexive and transitive, but neither symmetric nor antisymmetric.

This means $\to$ is a preorder, not a partial order, on the class of all graphs. Reflexivity is the identity map; transitivity is composition of homomorphisms. The failure of antisymmetry is the interesting part: there exist non-isomorphic graphs $X$ and $Y$ with $X \to Y$ and $Y \to X$. This is precisely why we need the notion of homomorphic equivalence to get a partial order.

Definition 5.1.1 (Homomorphically Equivalent). Graphs $X, Y$ are homomorphically equivalent if $X \to Y$ and $Y \to X$. This is an equivalence relation.

Homomorphic equivalence is the right notion of “sameness” for homomorphism theory, coarser than isomorphism. Two graphs are homomorphically equivalent when they can be mapped to each other — they are “the same” in the homomorphism preorder. For example, any graph is homomorphically equivalent to its core (since we can map the graph to the core and include the core back). In contrast, isomorphic graphs are of course homomorphically equivalent, but the converse is wildly false: a triangle $K_3$ and any odd cycle $C_{2k+1}$ are all homomorphically equivalent (they can all map to each other), but they are not isomorphic. This is what makes the core — the unique minimal representative of each equivalence class — such a valuable object.

Lemma 5.1.2. If surjective homomorphisms exist from $X$ to $Y$ and from $Y$ to $X$, then $X \cong Y$.

Surjectivity is the key hypothesis here. Without it, $X \to Y \to X$ could “fold” $X$ down and then “expand” it back, arriving at a different graph. The lemma says that surjective homomorphisms in both directions are so constraining that they force isomorphism — a rigidity result that will be central in our analysis of cores.

One of the most powerful tools for showing that a homomorphism does not exist is to use invariants that are monotone with respect to $\to$. Two such invariants are the chromatic number and the odd girth. The general strategy is: to prove $X \not\to Y$, find a monotone invariant $\phi$ and show $\phi(X) > \phi(Y)$. The chromatic number and odd girth are the two most useful such invariants in practice, because they are computable and reflect genuinely different aspects of graph structure.

Definition 5.1.2 (Odd Girth). The odd girth of a graph $X$ is the length of the shortest odd cycle. If $X$ is bipartite, its odd girth is $\infty$.

Lemma 5.1.3. Suppose $X \to Y$. Then: (a) $\chi(X) \leq \chi(Y)$; (b) the odd girth of $X$ is at least the odd girth of $Y$.

Proof. (a) A proper colouring of $Y$ composed with $f : X \to Y$ gives a proper colouring of $X$. (b) Any odd cycle in $X$ maps to a closed walk in $Y$ of the same odd length, which must contain an odd cycle. $\square$

Lemma 5.1.3 is our main obstruction tool: to show $X \not\to Y$, it suffices to exhibit a chromatic or odd-girth gap. Observe that (a) says homomorphisms can only decrease the chromatic number — mapping to a “simpler” graph — and (b) says odd cycles can only shrink, never lengthen. These two monotone invariants organize the homomorphism preorder in a coarse but powerful way.

Corollary 5.1.3.1. (a) There is no homomorphism from $C_{2n+1}$ to $K_2$. (b) There is no homomorphism from the Petersen graph to $C_4$. (c) There is no homomorphism from the Petersen graph to any of its proper subgraphs.

Part (c) of the corollary is particularly striking: it says the Petersen graph cannot be “compressed” into any smaller subgraph via a homomorphism. This is exactly the property that makes a graph a core, and we are now ready to develop that theory.

7.2 Cores

What does it mean for a graph to be “minimal” in the homomorphism preorder? There are many notions of minimality one might consider, but the right one for our purposes is: a graph $X$ is minimal if no proper subgraph of $X$ is a homomorphic image of $X$. Equivalently, every endomorphism of $X$ — every self-homomorphism — must be a bijection, i.e., an automorphism. This is the definition of a core.

Think of cores as the “irreducible” graphs in the homomorphism preorder. Just as every rational number can be uniquely expressed as a fraction in lowest terms, every graph has a unique “lowest-terms representative” in its homomorphic equivalence class — its core. Two graphs are homomorphically equivalent precisely when they share the same core (up to isomorphism). The core is thus a canonical invariant of the equivalence class, and it is the best notion of “the simplest graph in the class.”

The connection to graph colouring is immediate: the chromatic number of $X$ is the size of the clique core that $X$ maps to. More precisely, $\chi(X) = |X^\bullet|$ when $X^\bullet$ is a complete graph — which happens for “chromatic” graphs whose chromatic structure is tight. For other graphs, the core may be more complex than a clique, encoding richer homomorphic structure than mere colourability.

Definition 5.2.1 (Core). A graph $X$ is a core if every endomorphism (homomorphism from $X$ to itself) is an automorphism.

Definition 5.2.2 (Core of a Graph). A subgraph $Y \subseteq X$ is a core of $X$ if $Y$ is a core and $X \to Y$. The cores of $X$ are the minimal subgraphs that are homomorphic images of $X$.

Example 5.2.1. $K_n$ is a core, since any homomorphism $K_n \to K_n$ must be injective (no two adjacent vertices can map to the same vertex), hence bijective.

The complete graph example tells us that every chromatic number is “realized” by a core: the core $K_n$ represents the equivalence class of all $n$-chromatic graphs that are homomorphically equivalent to $K_n$ (i.e., all graphs whose chromatic number is exactly $n$ and which admit a surjective homomorphism to $K_n$). The existence of cores that are not complete graphs is precisely what makes the homomorphism preorder richer than the chromatic number alone.

Example 5.2.2. A graph is critical if every proper subgraph has strictly smaller chromatic number. A critical graph $X$ satisfies $X \not\to Y$ for every proper subgraph $Y$, so every critical graph is a core.

Lemma 5.2.3. If $Y$ is a core of $X$, then $Y$ is a retract of $X$.

Proof. Let $f : X \to Y$ be a homomorphism. Since $Y$ is a core, $g := f\big|_Y : Y \to Y$ is an automorphism. Then $g^{-1} \circ f : X \to Y$ is a retraction. $\square$

This lemma is the key structural result: cores are not just abstract minimal objects, they are concretely embedded in the graph as retracts. This means there is a “projection” from $X$ to its core, a map that collapses the redundant parts of $X$ and fixes the core pointwise. Topologically, this is analogous to a deformation retraction — the graph can be “continuously collapsed” onto its core. This geometric intuition is helpful, though the graph-theoretic definition is purely combinatorial.

Lemma 5.2.4. Let $Y_1, Y_2$ be cores. Then $Y_1$ and $Y_2$ are homomorphically equivalent if and only if they are isomorphic.

Proof. Let $f : Y_1 \to Y_2$ and $g : Y_2 \to Y_1$. Then $g \circ f$ and $f \circ g$ are endomorphisms of cores, hence automorphisms, so $f$ and $g$ are both bijective, giving isomorphisms. $\square$

Lemma 5.2.4 is crucial: it says that cores are the “canonical representatives” of homomorphic equivalence classes. If we restrict attention to cores, the preorder $\to$ becomes a genuine partial order — there is a unique representative for each equivalence class, and two distinct cores can never be equivalent. This is the analog of reducing a fraction to lowest terms: each equivalence class has a unique minimal representative (the core), and that representative is determined up to isomorphism.

The following lemma is one of the most fundamental results in homomorphism theory: every graph has a unique (up to isomorphism) minimal homomorphic image that is itself a core.

Lemma 5.2.5. Every graph $X$ has a core, which is an induced subgraph of $X$ and is unique up to isomorphism. We denote it $X^\bullet$.

Proof. Existence: Let $Y$ be a minimum-size subgraph of $X$ that is a homomorphic image of $X$. By minimality, every endomorphism of $Y$ is an automorphism, so $Y$ is a core. Since $Y$ is a retract of $X$, it is induced. Uniqueness: If $Y_1, Y_2$ are both cores of $X$, then $Y_1 \to Y_2$ and $Y_2 \to Y_1$ (via the homomorphisms $X \to Y_i$), so they are homomorphically equivalent cores, hence isomorphic. $\square$

Lemma 5.2.6. Two graphs $X$ and $Y$ are homomorphically equivalent if and only if $X^\bullet \cong Y^\bullet$.

Proof. If $X \to Y$ and $Y \to X$, then $X^\bullet \to X \to Y \to Y^\bullet$ and $Y^\bullet \to Y \to X \to X^\bullet$, so $X^\bullet$ and $Y^\bullet$ are homomorphically equivalent cores, hence isomorphic. The converse is immediate. $\square$

Corollary 5.2.6.1. The relation $\to$ is a partial order on the set of isomorphism classes of cores (it is antisymmetric since two homomorphically equivalent cores are isomorphic).

We now have a complete picture of the homomorphism preorder’s structure at the level of equivalence classes: it is a genuine partial order on isomorphism classes of cores. This partial order is rich and complex — it has the complete graphs at its “top” (in terms of chromatic number), with odd cycles below, and the bipartite graphs (with core $K_2$ or $K_1$) at the bottom. The study of this poset is one of the central themes of graph homomorphism theory, connecting to topological methods (the Lovász theta function, hom complexes) and algorithmic questions (the complexity of deciding $X \to H$ for fixed $H$).

The next lemma gives a sufficient condition for a connected graph to be a core, based on the geometry of its shortest odd cycles. It is a beautiful application of the odd-girth monotonicity of Lemma 5.1.3.

Lemma 5.2.7. Let $X$ be connected. If every path of length 2 lies in a shortest odd cycle of $X$, then $X$ is a core.

Proof. Suppose towards a contradiction that $X^\bullet \neq X$ and $f : X \to X^\bullet$ is a retraction. By connectedness, there exist $u \sim v$ with $v \in V(X^\bullet)$ and $u \notin V(X^\bullet)$. Let $w = f(u)$. Then $w \sim v$ in $X^\bullet$, and $uvw$ lies in a shortest odd cycle $C$. The image $f(C)$ is a closed odd walk with $f(u) = f(w) = w$, giving a strictly shorter odd cycle in $X^\bullet$, contradicting Lemma 5.1.3(b). $\square$

Corollary 5.2.7.1. The Petersen graph is a core.

The Petersen graph corollary is one of the most cited results about this graph. It explains why the Petersen graph is a “hard” graph to color: it cannot be reduced to any simpler graph via homomorphisms, and its chromatic structure is irreducible. Any graph that maps to the Petersen graph must have odd girth at least 5, and any graph to which the Petersen graph maps must have chromatic number at least 3. The Petersen graph sits in its own “stratum” of the homomorphism order, between $K_3$ and $K_4$, and this isolation is part of what makes it so ubiquitous as a counterexample in graph theory.

7.3 Direct Products and the Lattice of Cores

Having established that homomorphic equivalence classes are indexed by cores, we now ask: what algebraic structure do these classes form? The answer is a lattice — and the key operation is the direct product, which turns out to be the categorical product in $\mathbf{Graph}$.

The direct product construction has a beautiful combinatorial interpretation: vertices of $Y \times Z$ are pairs, and two pairs are adjacent when both coordinates are simultaneously adjacent. This is the most “restrictive” product: an edge in the product requires edges in both factors simultaneously. The result is that the product is in some sense the “smallest common object” below both $Y$ and $Z$ in the homomorphism order — and this is exactly the definition of the categorical product.

Definition 5.3.1 (Direct Product). For graphs $Y$ and $Z$, their direct product $Y \times Z$ has vertex set $V(Y) \times V(Z)$ and edge set $\{(y,z)(y',z') : y \sim y' \text{ in } Y \text{ and } z \sim z' \text{ in } Z\}$.

Lemma 5.3.1 (Product Properties). (a) If $Y$ and $Z$ are both connected, then $Y \times Z$ is disconnected if and only if both $Y$ and $Z$ are bipartite. (b) $(Y_1 + Y_2) \times Z = Y_1 \times Z + Y_2 \times Z$ (where $+$ denotes disjoint union). (c) $Y \times Z \cong Z \times Y$ and $(Y_1 \times Y_2) \times Y_3 \cong Y_1 \times (Y_2 \times Y_3)$. (d) The projection maps $p_X : (x,y) \mapsto x$ and $p_Y : (x,y) \mapsto y$ are homomorphisms.

Property (a) deserves a moment’s pause. In the category of sets, the product of two connected objects is connected. In the category of graphs with the direct product, bipartiteness creates a parity obstruction: a walk in $Y \times Z$ must simultaneously traverse edges in both factors, so parity in both factors must match. When both $Y$ and $Z$ are bipartite, this parity clash splits the product into two components. This is a fundamentally graph-theoretic phenomenon — the direct product is more subtle than the set-theoretic product precisely because of the parity constraint in bipartite graphs. Property (d) is the homomorphism version of the universal projection: the direct product “forgets” one coordinate while mapping correctly.

The direct product satisfies a universal property: it is the categorical product in the category of graphs.

Theorem 5.3.2 (Universal Property of Product). Let $X, Y, Z$ be graphs. If $f : Z \to X$ and $g : Z \to Y$ are homomorphisms, there is a unique homomorphism $\varphi : Z \to X \times Y$ such that $f = p_X \circ \varphi$ and $g = p_Y \circ \varphi$.

Proof. Define $\varphi(z) = (f(z), g(z))$. If $u \sim v$ in $Z$, then $f(u) \sim f(v)$ and $g(u) \sim g(v)$, so $(f(u),g(u)) \sim (f(v),g(v))$ in $X \times Y$. Uniqueness: any $\varphi'$ satisfying the equations must satisfy $\varphi'(z) = (f(z), g(z)) = \varphi(z)$. $\square$

Corollary 5.3.2.1. $|\mathrm{Hom}(Z, X \times Y)| = |\mathrm{Hom}(Z,X)| \cdot |\mathrm{Hom}(Z,Y)|$.

Proof. The universal property gives a bijection $\mathrm{Hom}(Z, X \times Y) \leftrightarrow \mathrm{Hom}(Z,X) \times \mathrm{Hom}(Z,Y)$ via $\varphi \mapsto (p_X \circ \varphi, p_Y \circ \varphi)$. $\square$

This product formula for counting homomorphisms is a powerful computational tool. It says that homomorphisms into a product decompose uniquely into a pair of independent homomorphisms into each factor. In coding theory, this corresponds to a product of codes; in probability, it corresponds to independence. The formula makes the direct product a “generating function” object for homomorphism counting, and it is one of the reasons the product plays such a central role in the theory.

We are now in a position to establish the main structural theorem of this chapter: the lattice structure on cores.

The partial order $\to$ on isomorphism classes of cores forms a complete lattice.

Definition 5.3.2 (Lattice). A partially ordered set $\Lambda$ is a lattice if for every $x, y \in \Lambda$ there is a least upper bound $x \vee y$ and a greatest lower bound $x \wedge y$.

Lemma 5.3.3. The set of isomorphism classes of cores with partial order $\to$ is a lattice.

Proof. Least upper bound: For cores $X, Y$, set $X \vee Y = (X + Y)^\bullet$. Both $X \to (X+Y)^\bullet$ and $Y \to (X+Y)^\bullet$. If $Z$ is a core with $X \to Z$ and $Y \to Z$, then $X + Y \to Z$ and $(X+Y)^\bullet \to Z$. Greatest lower bound: Set $X \wedge Y = (X \times Y)^\bullet$. Then $(X \times Y)^\bullet \to X \times Y \to X$ (via $p_X$) and similarly to $Y$. If $Z \to X$ and $Z \to Y$, the universal property gives $Z \to X \times Y \to (X \times Y)^\bullet$. $\square$

The lattice structure is the crowning result of this chapter and, in many ways, of the entire course. Let us appreciate what it says. The class of all graphs, modulo homomorphic equivalence, is not just a poset — it is a lattice, meaning every pair of equivalence classes has both a least common “upper bound” (the join, given by the core of the disjoint union) and a greatest common “lower bound” (the meet, given by the core of the direct product). This is the algebraic structure of a complete lattice, analogous to the lattice of subgroups of a group or the lattice of subspaces of a vector space.

The meet operation $X \wedge Y = (X \times Y)^\bullet$ has a beautiful interpretation: it is the “smallest” core that can be mapped to by both $X$ and $Y$ simultaneously. When $X$ and $Y$ are complete graphs $K_m$ and $K_n$, the direct product $K_m \times K_n$ is the complete graph $K_{\min(m,n)}$ (since adjacency in the product requires adjacency in both factors, which forces a proper coloring with at most $\min(m,n)$ colors). So the lattice meet of $K_m$ and $K_n$ is $K_{\min(m,n)}$ — the meet operation recovers the minimum of chromatic numbers, extending it to a lattice operation on all graphs.

The join $X \vee Y = (X + Y)^\bullet$ is the “largest” core that maps to both $X$ and $Y$. For complete graphs, $(K_m + K_n)^\bullet = K_{\max(m,n)}$, so the join recovers the maximum. The lattice of cores thus generalizes the lattice of natural numbers ordered by divisibility or the lattice of chromatic numbers — but it is vastly richer, encoding all the homomorphic structure of the graph universe in a single algebraic object.

This lattice perspective also connects to deep questions in computational complexity. For a fixed graph $H$, the problem $\mathrm{CSP}(H)$ — determining whether a given graph $X$ admits a homomorphism to $H$ — has complexity determined entirely by $H^\bullet$: $\mathrm{CSP}(H)$ is polynomial if and only if $\mathrm{CSP}(H^\bullet)$ is. This is the dichotomy theorem of Feder and Vardi (conjectured) and Hell and Nešetřil (proved for graphs): for every fixed graph $H$, either $H$ is bipartite (and $X \to H$ is polynomial to decide) or $H$ contains an odd cycle (and $X \to H$ is NP-complete). The core is the canonical object that determines which regime we are in.