PMATH 445: Representations of Finite Groups
Faisal Al-Faisal
Estimated study time: 3 hr 55 min
Table of contents
Representation theory asks a deceptively simple question: given a group \(G\), how can it act linearly on a vector space? The power of the question lies in its answer — by studying these linear actions, called representations, we can bring the full machinery of linear algebra to bear on problems in group theory, number theory, combinatorics, and mathematical physics. This is the subject of PMATH 445.
The course follows a double arc. The first half develops the theory from scratch using direct linear-algebraic methods, culminating in the orthogonality of characters and the classification of irreducible representations by their character tables. The second half recasts everything in the language of modules over group algebras, with the Artin-Wedderburn theorem as its centrepiece. Both arcs illuminate the same landscape from different altitudes.
Chapter 1: What Is Representation Theory?
The Idea Behind Representations
A group \(G\) is an abstract object — a set with a multiplication satisfying certain axioms. To study \(G\), one of the most powerful strategies is to make \(G\) act on something more concrete. Groups act on sets (this is the notion of a group action), on geometric objects (symmetries), on other groups (conjugation), and — most relevantly for us — on vector spaces. An action of \(G\) on a vector space \(V\) by linear maps is called a representation of \(G\).
Why focus on vector spaces in particular? Because linear algebra gives us an enormous toolkit: eigenvalues, diagonalization, inner products, traces, determinants. A representation “represents” abstract group elements as concrete, computable matrices, allowing us to bring these tools to bear.
The reach of representation theory is remarkable. A few illustrations:
- If \(E\) is an elliptic curve over \(\mathbb{Q}\), there is a representation of the Galois group \(\mathrm{Gal}(\overline{\mathbb{Q}}/\mathbb{Q})\) on the Tate module of \(E\); this representation encodes the arithmetic of \(E\), and its study lies at the heart of Wiles’s proof of Fermat’s Last Theorem.
- The representation theory of the symmetric group \(S_n\) is interwoven with Young tableaux and the combinatorial structure of symmetric functions — a nexus of connections known as combinatorial representation theory.
- In quantum physics, the symmetry group of a physical system acts on the Hilbert space of quantum states. The irreducible representations of \(SU(3)\) led Gell-Mann and Ne’eman to postulate the existence of quarks before any experimental evidence: mathematics preceded physics.
- Dirichlet’s proof of his theorem on primes in arithmetic progressions used, in modern language, Fourier analysis on the abelian group \((\mathbb{Z}/n\mathbb{Z})^\times\) — which is precisely the representation theory of abelian groups.
The condition that \(\rho\) is a homomorphism means \(\rho(gh) = \rho(g)\rho(h)\) for all \(g, h \in G\), and \(\rho(e) = \mathrm{id}_V\). Notice that we do not require \(\rho\) to be injective — the added flexibility is essential. If \(\rho\) is injective, we call the representation faithful.
When \(V\) is finite-dimensional of dimension \(n\), a choice of basis identifies \(\mathrm{GL}(V)\) with \(\mathrm{GL}_n(F)\), giving a matrix representation \(\rho : G \to \mathrm{GL}_n(F)\). A different basis produces a conjugate matrix representation \(r'(g) = Ar(g)A^{-1}\) for some fixed invertible \(A\). The choice of basis is inessential, and we study representations up to this equivalence.
It is worth pausing to appreciate how many structures are “representations in disguise.” A group acting on a graph (edge-colourings, adjacency eigenvalues) is representation theory. The Laplacian of a symmetric graph decomposes into irreducible blocks indexed by representations. Character sums over finite fields — Gauss sums, Kloosterman sums — are trace functions of Galois representations. Even the Fast Fourier Transform is an efficient algorithm for decomposing a function on a cyclic group into irreducible components.
Group Actions and the Path to Representations
The conceptually cleanest route to the definition of a representation passes through group actions.
If \(G\) acts on \(X\), each element \(g \in G\) defines a bijection \(a_g : X \to X\) by \(a_g(x) = g \cdot x\). The map \(g \mapsto a_g\) is a group homomorphism \(G \to \mathrm{Sym}(X)\). Conversely, any such homomorphism defines an action. This equivalence is the content of:
A representation is simply a group action on a vector space that respects the linear structure. If \(G\) acts on an \(F\)-vector space \(V\) and each \(g\) acts by a linear map, then the map \(G \to \mathrm{Sym}(V)\) has image in the subgroup \(\mathrm{GL}(V)\) of linear bijections. This is precisely a representation.
The key examples of group actions are the foundations on which all representation theory is built:
- Trivial action: \(g \cdot x = x\) for all \(g, x\). Boring but indispensable.
- Left multiplication: \(G\) acts on itself by \(g \cdot x = gx\). This gives, after linearization, the regular representation.
- Conjugation: \(G\) acts on a normal subgroup \(H\) by \(g \cdot h = ghg^{-1}\).
- Left cosets: \(G\) acts on \(G/H\) by \(g \cdot (xH) = (gx)H\). After linearization, this is a permutation representation.
The Orbit-Stabilizer Formula \(|Gx| = [G : G_x]\) and Burnside’s Lemma (the number of orbits equals the average number of fixed points) are fundamental tools that will reappear throughout the course.
Chapter 2: The Language of Representations
Definitions and Equivalence
In what follows, unless otherwise stated, \(G\) is a group, \(F\) is a field, and all vector spaces are \(F\)-vector spaces. A representation \((V, \rho)\) has degree \(\deg \rho = \dim_F V\).
Rather than always writing pairs \((V, \rho)\), we often speak of “the \(G\)-module \(V\),” treating \(V\) as a vector space on which \(G\) acts by \(g \cdot v := \rho(g)v\). The linearity condition \(g \cdot (cv + w) = c(g \cdot v) + g \cdot w\) makes \(V\) into what is called an \(FG\)-module. The two terminologies — representation and \(G\)-module — are synonyms.
Two representations are essentially the same if one can be obtained from the other by a change of basis. Formally:
The set of all \(G\)-linear maps from \(V\) to \(W\) is denoted \(\mathrm{Hom}_G(V, W)\). The fundamental problem of representation theory is to classify all representations of \(G\) up to isomorphism.
This is a meaningful problem: for a finite group \(G\) over \(\mathbb{C}\), there are only finitely many isomorphism classes of irreducible representations (we will prove this), and the character table is a finite matrix encoding them all. The classification is therefore completable in principle — and in practice for many important groups.
Fundamental Examples
The sign representation is not faithful for \(n \geq 2\): the alternating group \(A_n = \ker(\mathrm{sgn})\) contains more than just the identity. For example, \(\mathrm{sgn}((123)) = +1\) even though \((123) \neq e\). The kernel of the representation is \(A_n\), so the representation “cannot see” any element of \(A_n\) — it treats all even permutations identically.
By contrast, the standard representation of \(S_n\) on \(V_\mathrm{std} = \{(z_1,\ldots,z_n) : \sum z_i = 0\} \subset \mathbb{C}^n\) is faithful for \(n \geq 3\): if \(\sigma \neq e\) then there exists some vector in \(V_\mathrm{std}\) that \(\sigma\) moves (one can check this directly). Thus faithfulness depends delicately on the representation.
The key point: whether or not \(\rho\) is faithful, the representation is still a perfectly good linear-algebraic object. Unfaithful representations arise naturally — for instance, every representation of \(G\) that factors through a quotient \(G/N\) is unfaithful (its kernel contains \(N\)). When we classify all representations of \(G\), we must include both faithful and unfaithful ones.
This example also illustrates the key conceptual point: to define a representation of a group given by generators and relations, it suffices to specify matrices for the generators and verify that all relations hold. We did not need to specify all 8 matrices in advance; once \(\rho(r)\) and \(\rho(s)\) are chosen satisfying the three defining relations, the entire representation is determined.
The alternating (sign) representation of \(S_n\) is the one-dimensional representation \(\mathrm{sgn} : S_n \to F^\times\) defined by \(\mathrm{sgn}(\pi) = \pm 1\) according to the parity of \(\pi\). Together with the trivial representation, these are the only one-dimensional representations of \(S_n\) for \(n \geq 2\).
One-Dimensional Representations and the Abelianization
One-dimensional representations \(\rho : G \to F^\times\) are the simplest and most transparent. Since \(F^\times\) is abelian, any such homomorphism must kill the commutator subgroup \([G, G]\) — the subgroup generated by all commutators \([g, h] = ghg^{-1}h^{-1}\). Therefore every one-dimensional representation of \(G\) factors through the abelianization \(G^{\mathrm{ab}} = G / [G, G]\).
Chapter 3: Building Representations
Permutation Representations
\[g(a_1 x_1 + \cdots + a_n x_n) = a_1(gx_1) + \cdots + a_n(gx_n).\]This is the permutation representation induced by \(X\). The associated matrix representation consists of permutation matrices (matrices obtained by permuting columns of the identity).
The central example is the regular representation: let \(G\) act on itself by left multiplication. The resulting representation \(V_\mathrm{reg} = F\langle G \rangle\) has degree \(|G|\), and its decomposition into irreducible pieces encodes the entire representation theory of \(G\). We will return to it repeatedly.
A useful complementary model replaces \(F\langle X \rangle\) with the function space \(F(X, F)\) of \(F\)-valued functions on \(X\), with action \((g \cdot f)(x) = f(g^{-1}x)\). The two models are isomorphic via \(x \mapsto e_x\) (the indicator function of \(x\)), but the function-space model generalizes more cleanly to infinite groups.
The Character of the Regular Representation
\[\chi_\mathrm{reg}(e) = |G|, \qquad \chi_\mathrm{reg}(g) = 0 \text{ for all } g \neq e.\]This single formula will later give us enormous leverage: by decomposing the regular representation into irreducibles and using the character formula, we will prove that every irreducible representation of \(G\) appears in the regular representation with multiplicity equal to its dimension. This yields the fundamental dimension formula \(|G| = \sum_i (\dim V_i)^2\).
Constructing New Representations from Old
Given \(G\)-modules \(V\) and \(W\), we can build new ones:
Subrepresentations. A \(G\)-submodule of \(V\) is a subspace \(U \subseteq V\) that is \(G\)-invariant: \(gu \in U\) for all \(g \in G\), \(u \in U\). A \(G\)-invariant subspace \(U\) is itself a representation; in a matrix basis adapted to \(U \subseteq V\), the matrix of \(\rho(g)\) is block upper triangular with \(\rho|_U(g)\) in the upper-left block.
Direct sum. The direct sum \(V \oplus W\) becomes a \(G\)-module under \(g(v,w) = (gv, gw)\), with block-diagonal matrix representation \(\rho(g) \oplus \sigma(g)\).
Dual representation. The dual space \(V^* = \mathrm{Hom}(V, F)\) becomes a \(G\)-module under \((g \cdot f)(v) = f(g^{-1}v)\), called the dual or contragredient of \(V\). In matrix terms, the dual representation is the inverse transpose: \(r^*(g) = r(g^{-1})^T\).
Quotient representation. If \(U \subseteq V\) is a \(G\)-submodule, then \(V/U\) becomes a \(G\)-module via \(g(v + U) = gv + U\). The First Isomorphism Theorem holds in this setting: if \(T : V \to W\) is \(G\)-linear then \(V/\ker T \cong \mathrm{im}\, T\) as \(G\)-modules.
Fixed points. For any \(G\)-module \(V\), the subspace \(V^G = \{v \in V : gv = v \text{ for all } g \in G\}\) of \(G\)-fixed points is a \(G\)-submodule isomorphic to a direct sum of copies of the trivial representation.
Tensor Products of Vector Spaces
The tensor product construction is central to representation theory. The idea is to define a “product” \(V \otimes W\) that mimics bilinear operations.
\[(v, w_1+w_2) - (v,w_1) - (v,w_2), \quad (v_1+v_2,w) - (v_1,w) - (v_2,w), \quad (cv,w) - c(v,w), \quad (v,cw) - c(v,w).\]The coset of \((v,w)\) is denoted \(v \otimes w\) and called a pure tensor. By construction, \(\otimes\) is bilinear, and every element of \(V \otimes W\) is a finite sum of pure tensors — though not every element is itself a pure tensor.
If \(\{e_i\}\) is a basis for \(V\) and \(\{f_j\}\) is a basis for \(W\), then \(\{e_i \otimes f_j\}\) is a basis for \(V \otimes W\), so \(\dim(V \otimes W) = (\dim V)(\dim W)\).
Tensor product of \(G\)-modules. If \(V\) and \(W\) are \(G\)-modules, their inner tensor product is \(V \otimes W\) with action \(g(v \otimes w) = gv \otimes gw\). Even if \(V\) and \(W\) are irreducible, their tensor product can be reducible — this creates an interesting decomposition problem called the Clebsch-Gordan problem.
One can also form the outer tensor product of a \(G_1\)-module \(V\) and a \(G_2\)-module \(W\): this is a representation of \(G_1 \times G_2\) on \(V \otimes W\) via \((g_1, g_2)(v \otimes w) = g_1v \otimes g_2w\). A fundamental result:
This theorem is extremely useful: it tells us that classifying representations of \(G_1 \times G_2\) reduces entirely to classifying representations of the two factors separately. For example, the irreducible representations of \(C_2 \times C_2\) are in bijection with pairs of irreducibles of \(C_2\), of which there are \(2 \times 2 = 4\) — and indeed \(C_2 \times C_2\) has 4 conjugacy classes, consistent with the count.
The Hom Space and the Trace
There is a beautiful interplay between \(\mathrm{Hom}\) spaces and tensor products. For \(G\)-modules \(V\) and \(W\):
This perspective makes the trace of a linear operator more conceptual. The evaluation map \(\tau : V^* \otimes V \to F\) defined by \(\tau(f \otimes v) = f(v)\) corresponds, under the isomorphism \(V^* \otimes V \cong \mathrm{Hom}(V,V)\), precisely to the trace functional. This is the “natural” linear functional on \(\mathrm{Hom}(V,V)\).
The identification \(\mathrm{Hom}_G(V, W) = (V^* \otimes W)^G\) lets us compute dimensions using the averaging formula: \(\dim \mathrm{Hom}_G(V, W) = \dim (V^* \otimes W)^G = \frac{1}{|G|} \sum_{g \in G} \chi_{V^*}(g) \chi_W(g) = \frac{1}{|G|} \sum_{g \in G} \overline{\chi_V(g)} \chi_W(g) = \langle \chi_V, \chi_W \rangle.\) This is a preview of the fundamental orthogonality of characters.
Chapter 4: Irreducibility and Maschke’s Theorem
Irreducible Representations
Given that we want to classify all representations of \(G\), it makes sense to first identify the “atoms” — representations that cannot be broken into smaller pieces.
Irreducible implies indecomposable, but the converse fails in general (it holds for finite groups in characteristic zero, as we will see). A one-dimensional representation is always irreducible.
To get a feel for what irreducibility means in practice: the two-dimensional standard representation \(V_\mathrm{std}\) of \(S_3\) is irreducible because there is no line in \(\mathbb{C}^2\) fixed by all rotations and reflections of the equilateral triangle. On the other hand, the three-dimensional defining representation of \(S_3\) (permuting the standard basis of \(\mathbb{C}^3\)) is not irreducible: the line spanned by \(e_1 + e_2 + e_3\) is \(S_3\)-invariant.
- The trivial representation \(V_\mathrm{triv}\) (dimension 1): \(\rho(\pi) = 1\) for all \(\pi\).
- The sign representation \(V_\mathrm{sgn}\) (dimension 1): \(\rho(\pi) = \mathrm{sgn}(\pi)\).
- The standard representation \(V_\mathrm{std}\) (dimension 2): the action on the plane described earlier.
Maschke’s Theorem
The fundamental theorem of this chapter tells us when complete reducibility holds.
The hypothesis \(\mathrm{char}\, F \nmid |G|\) is sharp: if \(p = \mathrm{char}\, F\) divides \(|G|\), there exist finite-dimensional representations that are not completely reducible. This is the starting point of modular representation theory.
Before proving Maschke’s theorem, let us understand precisely why the characteristic hypothesis is necessary. This requires a concrete counterexample.
Now, the subspace \(U = \{(x, 0) : x \in \mathbb{F}_p\} = \mathrm{span}\{e_1\}\) is \(G\)-invariant: \(\rho(a)(x, 0)^T = (x, 0)^T\). So \(U \cong V_\mathrm{triv}\) is a \(G\)-submodule of \(V = \mathbb{F}_p^2\).
\[\rho(a)\begin{pmatrix}b\\c\end{pmatrix} = \begin{pmatrix}1&1\\0&1\end{pmatrix}\begin{pmatrix}b\\c\end{pmatrix} = \begin{pmatrix}b+c\\c\end{pmatrix}.\]For this to equal \(\lambda(b, c)^T\), we need \(b + c = \lambda b\) and \(c = \lambda c\). The second equation gives \(\lambda = 1\) (since \(c \neq 0\)), and then the first gives \(b + c = b\), i.e., \(c = 0\) — contradiction. So no such complement exists, and \(V\) is indecomposable but not irreducible.
\[\widetilde{P} = \frac{1}{p}\sum_{k=0}^{p-1} \rho(a^k) P_0 \rho(a^k)^{-1},\]but \(1/p\) does not exist in \(\mathbb{F}_p\). The averaging trick literally cannot be performed.
The key ingredient in the proof of Maschke’s theorem is the averaging trick (or Weyl’s unitary trick in its continuous analogue):
The factor \(1/|G|\) requires \(|G| \neq 0\) in \(F\), which is our hypothesis.
Step 2 (\(\widetilde{P}\) is a projection onto \(U\)). For any \(v \in V\), \(\widetilde{P}(v)\) is in \(U\): each term \(\rho(g)P_0\rho(g)^{-1}v\) lies in \(\rho(g)(U)\), and since \(U\) is \(G\)-invariant, \(\rho(g)(U) \subseteq U\). So \(\widetilde{P}(v) \in U\) for all \(v\).
For \(u \in U\): \(\rho(g)^{-1}u \in U\) (by \(G\)-invariance of \(U\)), then \(P_0(\rho(g)^{-1}u) = \rho(g)^{-1}u\) (since \(P_0\) is the identity on \(U\)), then \(\rho(g)(\rho(g)^{-1}u) = u\). So each term in the sum equals \(u\), giving \(\widetilde{P}(u) = \frac{1}{|G|} \cdot |G| \cdot u = u\). Thus \(\widetilde{P}\) fixes \(U\) pointwise and maps \(V\) into \(U\), confirming it is a projection.
\[\rho(h) \circ \widetilde{P} = \frac{1}{|G|}\sum_{g \in G} \rho(h)\rho(g) P_0 \rho(g)^{-1} = \frac{1}{|G|}\sum_{g \in G} \rho(hg) P_0 \rho(hg)^{-1} \cdot \rho(h).\]Substituting \(g' = hg\) (a bijection on \(G\)) gives \(\rho(h)\circ\widetilde{P} = \widetilde{P} \circ \rho(h)\).
\[\widetilde{P}(\rho(g)w) = \rho(g)(\widetilde{P}(w)) = \rho(g)(0) = 0.\]So \(\rho(g)w \in \ker\widetilde{P} = W\). Thus \(W\) is \(G\)-invariant. \(\square\)
where we substituted \(g' = hg\) and used that the sum over \(G\) is invariant under left translation. So \(\widetilde{P}\) commutes with all \(\rho(h)\), meaning \(\ker \widetilde{P}\) is \(G\)-invariant.
The factor \(1/|G|\) requires that \(|G|\) is invertible in \(F\), hence the characteristic hypothesis. As a corollary:
This averaged inner product is \(G\)-invariant, making \(\rho\) unitary with respect to it.
Chapter 5: Schur’s Lemma and Isotypic Decompositions
Schur’s Lemma
Schur’s Lemma is perhaps the single most-used tool in representation theory. It says that irreducible representations are rigid: the only \(G\)-linear maps between two irreducibles are either zero or isomorphisms.
- Any nonzero map in \(\mathrm{Hom}_G(V, W)\) is an isomorphism.
- If \(F\) is algebraically closed and \(V\) is finite-dimensional, then \(\mathrm{Hom}_G(V,V) = \{\lambda\, \mathrm{id} : \lambda \in F\} \cong F\).
For (2): any \(T \in \mathrm{Hom}_G(V,V)\) has an eigenvalue \(\lambda \in F\) (since \(F\) is algebraically closed and \(V\) is finite-dimensional, the characteristic polynomial of \(T\) has a root in \(F\)). Then \(T - \lambda\, \mathrm{id} \in \mathrm{Hom}_G(V,V)\) is not invertible (since \(\lambda\) is an eigenvalue), hence by (1) it must be zero. So \(T = \lambda\, \mathrm{id}\). \(\square\)
Let us unpack why Part (1) works so cleanly. The kernel and image of a \(G\)-linear map are automatically \(G\)-invariant: if \(T(v) = 0\) and \(T\) is \(G\)-linear, then \(T(gv) = gT(v) = g \cdot 0 = 0\), so \(gv \in \ker T\). Similarly, \(T(gv) = g T(v) \in g \cdot \mathrm{im}(T)\), so the image is \(G\)-stable. Irreducibility then forces these subspaces to be trivial, leaving no room for \(T\) to be anything but an isomorphism.
Part (2) fails if \(F\) is not algebraically closed. For instance, over \(\mathbb{R}\), the two-dimensional rotation representation of \(C_4\) is irreducible, but multiplication by \(i\) (rotation by 90°) is a \(\mathbb{R}C_4\)-linear endomorphism that is not a scalar multiple of the identity.
The consequence of Part (2) is often called “Schur’s magical fact”: if \(V\) is an irreducible \(\mathbb{C}G\)-module and \(T : V \to V\) commutes with every \(\rho(g)\), then \(T\) is multiplication by a scalar. This is magical because it is a purely algebraic statement with no continuity or topological assumptions — and it is the key reason why irreducible representations over algebraically closed fields are “rigid.”
Schur’s Lemma for Abelian Groups
The following corollary is one of the most important consequences of Schur’s Lemma, and it is the bridge from representation theory to Fourier analysis.
This corollary is the starting point of Fourier analysis on abelian groups. For \(G = \mathbb{Z}/n\mathbb{Z}\), it says every irreducible complex representation is a character \(\rho_k : j \mapsto e^{2\pi ijk/n}\) — exactly the “frequencies” of discrete Fourier analysis.
Isotypic Decompositions
Maschke’s theorem guarantees that every finite-dimensional \(FG\)-module (over a field of suitable characteristic) is a direct sum of irreducibles. But such a decomposition is not unique — different choices of basis can split things differently. What is canonical is how many copies of each irreducible appear.
The multiplicity formula \(m_i = \dim \mathrm{Hom}_G(V_i, V)\) is powerful. It says: to find how many times \(V_i\) appears in \(V\), count the \(G\)-linear maps from \(V_i\) into \(V\).
Chapter 6: Fourier Analysis and the Regular Representation
Fourier Analysis on Finite Abelian Groups
The representation theory of finite abelian groups is Fourier analysis in disguise. This is one of the oldest and most beautiful connections in mathematics.
Let \(G\) be a finite abelian group. By the structure theorem, \(G \cong C_{n_1} \times \cdots \times C_{n_k}\). Since \(G\) is abelian, Schur’s Lemma forces all irreducible complex representations to be one-dimensional. A one-dimensional complex representation of \(G\) with image in \(U_1(\mathbb{C}) = \{z \in \mathbb{C} : |z| = 1\}\) is called a unitary character.
- The group \(\widehat{G}\) of unitary characters equals \(\mathrm{Irr}_\mathbb{C}(G)\) and satisfies \(|\widehat{G}| = |G|\).
- (Orthogonality) The characters form an orthonormal set in \(\ell^2(G)\) with inner product \(\langle f, g \rangle = \frac{1}{|G|}\sum_{x \in G} f(x)\overline{g(x)}\).
- (Completeness) The characters form a basis for \(\ell^2(G)\).
The classical Fourier analysis on \(\mathbb{R}/2\pi\mathbb{Z}\) is the limiting case of this theorem as the group becomes continuous. The functions \(e_n(x) = e^{inx}\) are precisely the unitary characters of \(\mathbb{R}/2\pi\mathbb{Z}\), and the Fourier series expansion is the analogue of the finite group case with \(\ell^2(G)\) replaced by \(L^2(\mathbb{R}/2\pi\mathbb{Z})\).
The Isotypic Decomposition of the Regular Representation
The following theorem is the centrepiece of the whole theory.
- (Isotypic decomposition) \(F\langle G \rangle \cong V_1^{\oplus \dim V_1} \oplus \cdots \oplus V_r^{\oplus \dim V_r}\).
- (Dimension formula) \(|G| = (\dim V_1)^2 + \cdots + (\dim V_r)^2\).
- There are only finitely many irreducible representations, and \(r \leq |G|\).
The key step is Lemma: the map \(\mathrm{Hom}_G(F\langle G \rangle, U) \to U\) given by \(f \mapsto f(1)\) is an isomorphism of \(F\)-vector spaces. This shows that the multiplicity of \(V_i\) in \(F\langle G \rangle\) is \(\dim \mathrm{Hom}_G(F\langle G \rangle, V_i) = \dim V_i\), giving the isotypic decomposition. The dimension formula follows by computing \(\dim F\langle G \rangle = |G|\) on both sides.
Similarly, the multiplicity of each one-dimensional representation is 1. ✓
The dimension formula \(|G| = \sum_i (\dim V_i)^2\) is one of the most useful equations in finite group theory. It severely constrains the possible dimensions of irreducible representations. For instance, if \(|G| = p\) is prime, then the only solution with all \(d_i^2 \leq p\) and \(\sum d_i^2 = p\) forces all \(d_i = 1\) and \(r = p\), implying \(G\) is abelian (actually \(G \cong C_p\)). More generally, from \(\sum d_i^2 = |G|\) and \(d_1 = 1\) (the trivial representation), we get \(d_i \leq \sqrt{|G|}\) for all \(i\), and in fact \(d_i \mid |G|\) (a result we will prove).
Counting Irreducibles
A critical structural fact:
This will be proved using character theory in the next chapter. For now, the dimension formula and the count of one-dimensional representations (which equal the number of irreducibles of \(G^{\mathrm{ab}} = G/[G,G]\)) give useful constraints on the degrees.
| \(\{e\}\) | \(\{a^2\}\) | \(\{a,a^3\}\) | \(\{b,a^2b\}\) | \(\{ab,a^3b\}\) | |
|---|---|---|---|---|---|
| \(\chi_1\) | 1 | 1 | 1 | 1 | 1 |
| \(\chi_2\) | 1 | 1 | 1 | \(-1\) | \(-1\) |
| \(\chi_3\) | 1 | 1 | \(-1\) | 1 | \(-1\) |
| \(\chi_4\) | 1 | 1 | \(-1\) | \(-1\) | 1 |
| \(\chi_5\) | 2 | \(-2\) | 0 | 0 | 0 |
Verification: \(1+1+1+1+4 = 8 = |D_8|\). ✓ Row orthogonality: \(\langle \chi_5, \chi_5 \rangle = \frac{1}{8}(4 + 4 + 0 + 0 + 0) = 1\). ✓
Chapter 7: Characters: Definition and Properties
The Character of a Representation
Characters are the central computational and theoretical tool of representation theory. The miracle is that a function as coarse as the trace completely determines a representation.
Why should the trace be the right invariant? Among the standard similarity invariants — the characteristic polynomial, determinant, trace — the determinant is too coarse (non-isomorphic representations can have identical determinants). The trace manages, remarkably, to distinguish all representations over \(\mathbb{C}\).
The key property making this work is that the trace is conjugation-invariant: \(\mathrm{tr}(ABA^{-1}) = \mathrm{tr}(B)\). Changing a basis amounts to conjugating each matrix, so the trace is independent of basis choice. Moreover, conjugate group elements \(g\) and \(hgh^{-1}\) give rise to conjugate matrices \(\rho(g)\) and \(\rho(h)\rho(g)\rho(h)^{-1}\), so their traces are equal. This means characters are class functions.
- \(\chi_V(hgh^{-1}) = \chi_V(g)\) for all \(g, h \in G\). (Characters are class functions.)
- If \(V \cong W\) then \(\chi_V = \chi_W\). (The converse is also true and will be proved below.)
- \(\chi_V(e) = \dim V\).
- \(\chi_{V \oplus W}(g) = \chi_V(g) + \chi_W(g)\).
- \(\chi_{V \otimes W}(g) = \chi_V(g)\chi_W(g)\).
- \(\chi_{V^*}(g) = \chi_V(g^{-1}) = \overline{\chi_V(g)}\). (The last equality uses the fact that eigenvalues of \(\rho(g)\) are roots of unity.)
- \(|\chi_V(g)| \leq \dim V\) for all \(g \in G\).
Let us prove the key properties (5) and (6) in detail.
Property (4) says the character “linearizes” direct sums into ordinary addition. Property (5) says it “multiplicates” tensor products into ordinary multiplication. Together, characters form a ring under pointwise addition and multiplication — the character ring of \(G\).
Class Functions and the Inner Product
A class function on \(G\) is a function \(f : G \to \mathbb{C}\) that is constant on conjugacy classes. Every character is a class function. The space \(\mathcal{C}(G)\) of class functions has dimension equal to \(h(G)\), the number of conjugacy classes.
\[\langle f, g \rangle = \frac{1}{|G|} \sum_{x \in G} f(x)\overline{g(x)}.\]This inner product encodes deep information about representations via:
The proof reveals why the inner product has its particular form: it is literally counting \(G\)-equivariant maps.
Chapter 8: Orthogonality of Characters and Character Tables
The Orthogonality Relations
This is the nonabelian generalization of the orthogonality of Fourier characters of abelian groups. It follows immediately from the proposition in the previous chapter combined with Schur’s Lemma (which says \(\dim \mathrm{Hom}_G(V,W) = 1\) if \(V \cong W\) and \(= 0\) otherwise).
Let us be explicit: Schur’s Lemma tells us that any \(G\)-linear map from an irreducible \(V\) to an irreducible \(W\) is either 0 (if \(V \not\cong W\)) or a scalar multiple of a fixed isomorphism (if \(V \cong W\)). So \(\dim \mathrm{Hom}_G(V, W) = 0\) or \(1\), and the theorem follows from \(\langle \chi_V, \chi_W \rangle = \dim \mathrm{Hom}_G(V, W)\).
The orthogonality relations have immediate and powerful consequences:
These are the tools that make character theory computationally powerful. To determine the isotypic decomposition of any representation \(V\), one simply computes the inner products \(\langle \chi_{V_i}, \chi_V \rangle\) with each irreducible character \(\chi_{V_i}\).
Completeness and the Character Table
This theorem completes the “Fourier analysis” picture for nonabelian groups: just as the unitary characters of a finite abelian group are an orthonormal basis for \(\ell^2(G)\), the irreducible characters of a finite group are an orthonormal basis for the subspace \(\mathcal{C}(G)\) of \(\ell^2(G)\).
Suppose \(f \in \mathcal{C}(G)\) satisfies \(\langle f, \chi_V \rangle = 0\) for all \(V \in \mathrm{Irr}(G)\). Define the operator \(T_f = \sum_{g \in G} f(g)\rho(g)\) on a \(G\)-module \(V\). One checks that \(T_f\) is \(G\)-linear. If \(V\) is irreducible, Schur’s lemma implies \(T_f = \lambda \cdot \mathrm{id}_V\). Taking traces: \(\lambda \dim V = \mathrm{tr}(T_f) = \sum_{g} f(g) \chi_V(g) = |G| \langle f, \chi_V \rangle = 0\). So \(T_f = 0\) on every irreducible \(V\).
\[T_f(e) = \sum_{g \in G} f(g) \rho_\mathrm{reg}(g)(e) = \sum_{g \in G} f(g) \cdot g = 0.\]Since \(\{g : g \in G\}\) is a basis for \(\mathbb{C}\langle G \rangle\), this implies \(f(g) = 0\) for all \(g\). So \(f = 0\). \(\square\)
The geometric picture is striking: the vector space \(\mathcal{C}(G)\) has dimension \(h(G)\) (one dimension per conjugacy class, since a class function is determined by its values on the \(h(G)\) conjugacy classes). The irreducible characters are an orthonormal basis for this space. The number of irreducibles equals the number of conjugacy classes: this is not a coincidence, it is a theorem.
The character table of \(G\) is the \(r \times r\) matrix (where \(r = h(G)\)) whose rows are indexed by irreducible representations and whose columns are indexed by conjugacy classes, with entry \(\chi_{V_i}(C_j)\) for conjugacy class representative \(C_j\):
| \(C_1\) | \(C_2\) | \(\cdots\) | \(C_r\) | |
|---|---|---|---|---|
| \(\chi_1\) | \(\chi_1(C_1)\) | \(\chi_1(C_2)\) | \(\cdots\) | \(\chi_1(C_r)\) |
| \(\chi_2\) | \(\chi_2(C_1)\) | \(\chi_2(C_2)\) | \(\cdots\) | \(\chi_2(C_r)\) |
| \(\vdots\) | \(\vdots\) | \(\vdots\) | \(\ddots\) | \(\vdots\) |
| \(\chi_r\) | \(\chi_r(C_1)\) | \(\chi_r(C_2)\) | \(\cdots\) | \(\chi_r(C_r)\) |
The character table of \(G\) encodes an enormous amount of information about \(G\). For instance:
- The first column gives the degrees: \(\chi_i(e) = \dim V_i\).
- The row sums \(\sum_j |C_j| \chi_i(C_j)\) equal zero for non-trivial irreducibles (by the dimension formula applied to \(\chi_\mathrm{reg} = 0\) off the identity).
- Rows are orthogonal with weight \(|C_j|/|G|\); columns are orthogonal with weight 1 (column orthogonality).
The column orthogonality deserves its own statement:
This column orthogonality is often as useful as row orthogonality: if the character table is partially filled in, column orthogonality gives additional equations that constrain the unknown entries.
Full Character Table for \(C_4\)
Let us work out the character table of \(C_4 = \{0, 1, 2, 3\}\) (addition mod 4) in complete detail, as a prototype for all cyclic groups.
Step 1: Conjugacy classes. Since \(C_4\) is abelian, every element is its own conjugacy class. There are 4 conjugacy classes: \(\{0\}, \{1\}, \{2\}, \{3\}\). Sizes: each has size 1.
Step 2: Number and dimensions of irreducibles. We need 4 irreducible representations (one per conjugacy class). Since \(\sum d_i^2 = |C_4| = 4\) and all \(d_i \geq 1\), the unique solution is \(d_1 = d_2 = d_3 = d_4 = 1\). All irreducibles are 1-dimensional, as predicted by the corollary to Schur’s Lemma.
Step 3: Write down the representations. With \(\zeta = i = e^{2\pi i/4}\), the four representations are \(\rho_k(j) = i^{jk}\) for \(k = 0, 1, 2, 3\):
| \(\{0\}\) | \(\{1\}\) | \(\{2\}\) | \(\{3\}\) | |
|---|---|---|---|---|
| \(\chi_0\) | \(1\) | \(1\) | \(1\) | \(1\) |
| \(\chi_1\) | \(1\) | \(i\) | \(-1\) | \(-i\) |
| \(\chi_2\) | \(1\) | \(-1\) | \(1\) | \(-1\) |
| \(\chi_3\) | \(1\) | \(-i\) | \(-1\) | \(i\) |
Dimension formula: \(1^2 + 1^2 + 1^2 + 1^2 = 4 = |C_4|\). \(\checkmark\)
Connection to DFT. The character table of \(C_n\) is precisely the discrete Fourier transform (DFT) matrix \(F_n\) with entries \((F_n)_{jk} = \zeta^{jk}\) where \(\zeta = e^{2\pi i/n}\). The orthogonality of rows is the statement that \(\frac{1}{n}F_n^* F_n = I_n\), which is the unitarity of the DFT. For \(C_4\) this gives us the 4-point DFT, the simplest nontrivial instance.
Full Character Table for \(S_3\)
| \(\{e\}\) | \(\{(1\,2)\}\) | \(\{(1\,2\,3)\}\) | |
|---|---|---|---|
| \(\chi_\mathrm{triv}\) | 1 | 1 | 1 |
| \(\chi_\mathrm{sgn}\) | 1 | \(-1\) | 1 |
| \(\chi_\mathrm{std}\) | 2 | 0 | \(-1\) |
Let us verify this table systematically from scratch.
Step 1. \(S_3\) has conjugacy classes of sizes 1, 3, 2 as above. So 3 irreducibles.
Step 2. Dimension formula: \(d_1^2 + d_2^2 + d_3^2 = 6\). The only solution (up to ordering) with all \(d_i \geq 1\) is \((1, 1, 2)\).
Step 3. The trivial rep gives row \((1, 1, 1)\). The sign rep gives row \((1, -1, 1)\) since 2-cycles are odd permutations and 3-cycles are even.
Step 4. The 2-dimensional irrep has \(\chi(e) = 2\). Using row orthogonality with \(\chi_\mathrm{triv}\): \(\frac{1}{6}(1 \cdot 2 + 3 \cdot b + 2 \cdot c) = 0 \Rightarrow 2 + 3b + 2c = 0\). Using row orthogonality with \(\chi_\mathrm{sgn}\): \(\frac{1}{6}(1 \cdot 2 + 3 \cdot (-1) \cdot b + 2 \cdot 1 \cdot c) = 0 \Rightarrow 2 - 3b + 2c = 0\). Subtracting: \(6b = 0 \Rightarrow b = 0\). Then \(2 + 2c = 0 \Rightarrow c = -1\).
Step 5. Verify \(\langle \chi_\mathrm{std}, \chi_\mathrm{std} \rangle = 1\): \(\frac{1}{6}(4 + 0 + 2) = 1\). ✓
One verifies: \(1^2 + 1^2 + 2^2 = 6 = |S_3|\). Row orthogonality: \(\langle \chi_\mathrm{std}, \chi_\mathrm{std} \rangle = \frac{1}{6}(4 + 0 + 2) = 1\). ✓
Column orthogonality for columns 1 and 2: \(1 \cdot 1 + 1 \cdot (-1) + 2 \cdot 0 = 0\). ✓ For column 2 with itself: \(1^2 + (-1)^2 + 0^2 = 2 = |G|/|C_2| = 6/3\). ✓
Full Character Table for \(Z/4Z\)
The complete character table for \(\mathbb{Z}/4\mathbb{Z}\) was given above. Let us now use it to see orthogonality of columns in action.
\[\sum_k \chi_k(1)\overline{\chi_k(3)} = 1 \cdot 1 + i \cdot \overline{(-i)} + (-1)\cdot \overline{(-1)} + (-i)\cdot \overline{i} = 1 \cdot 1 + i \cdot i + (-1)(-1) + (-i)(-i) = 1 - 1 + 1 - 1 = 0. \checkmark\](Note: since \(C_4\) is abelian, column \(\{j\}\) corresponds to the element \(j\), and \(\overline{\chi_k(j)} = \overline{i^{jk}} = i^{-jk} = \chi_k(-j)\). So the column orthogonality amounts to \(\sum_k i^{(j_1 - j_2)k} = 0\) for \(j_1 \neq j_2\) — the standard fact about geometric sums of roots of unity.)
Constructing the Character Table Systematically
It is instructive to see how one constructs a character table from scratch, without knowing the representations in advance. We illustrate with \(S_3\), the simplest nonabelian case.
Step 1. The group \(S_3\) has conjugacy classes \(\{e\}\), \(\{(12),(13),(23)\}\), \(\{(123),(132)\}\) of sizes 1, 3, 2. So there are 3 irreducible representations.
Step 2. The dimension formula \(\sum d_i^2 = 6\) with \(d_i \geq 1\) has (up to ordering) unique solution \((1, 1, 2)\).
Step 3. The trivial and sign representations give the first two rows: \((1,1,1)\) and \((1,-1,1)\).
Step 4. By column orthogonality applied to column 1 versus itself: \(1^2 + 1^2 + d_3^2 = |G|/|C_1| = 6\), confirming \(d_3 = 2\). ✓
Step 5. Use row orthogonality of \(\chi_3\) with \(\chi_\mathrm{triv}\): \(\frac{1}{6}(2a + 3b + 2c) = 0\), where \((a,b,c)\) are the unknown values.
Step 6. Use \(\langle \chi_3, \chi_3 \rangle = 1\): \(\frac{1}{6}(4 + 3b^2 + 2c^2) = 1\).
Step 7. Use column orthogonality of columns 2 and 3 with each other: \(1 \cdot 1 + 1 \cdot 1 + bc = 0\), so \(bc = -2\).
Step 8. Solve: from the constraint equations we get \(b = 0\) (since \(2a + 3b + 2c = 0\) and \(a = 2\)), and then \(c = -1\) and the equations all check out. The character table is complete.
| \(e\) | \((12)\) | \((12)(34)\) | \((123)\) | \((1234)\) | |
|---|---|---|---|---|---|
| \(\chi_\mathrm{triv}\) | 1 | 1 | 1 | 1 | 1 |
| \(\chi_\mathrm{sgn}\) | 1 | \(-1\) | 1 | 1 | \(-1\) |
| \(\chi_2\) | 2 | 0 | 2 | \(-1\) | 0 |
| \(\chi_3\) | 3 | 1 | \(-1\) | 0 | \(-1\) |
| \(\chi_3'\) | 3 | \(-1\) | \(-1\) | 0 | 1 |
Here \(\chi_2\) is the unique 2-dimensional irreducible (coming from \(S_4 \to S_3 \to GL_2\) via the sign map on the Klein four-group), \(\chi_3\) is the standard representation restricted from \(S_4\), and \(\chi_3' = \chi_3 \otimes \chi_\mathrm{sgn}\). Verification: \(1+1+4+9+9 = 24 = |S_4|\). ✓
Chapter 9: Inflation, Kernels, and Symmetric Powers
Inflation and the Kernel of a Character
If \(\phi : G \to H\) is a group homomorphism and \((W, \sigma)\) is an \(H\)-module, we can define a \(G\)-module \(\phi^* W\) by pulling back: \(\rho(g) = \sigma(\phi(g))\). This is called inflation when \(\phi : G \twoheadrightarrow G/N\) is a quotient map.
The kernel of a character is indeed a normal subgroup: it equals \(\ker \rho\), the kernel of the representation homomorphism. One can characterize it using character values: since \(|\chi_V(g)| \leq \dim V = \chi_V(e)\) with equality if and only if all eigenvalues of \(\rho(g)\) are equal (i.e., \(\rho(g)\) is scalar), the kernel consists precisely of the elements \(g\) where \(\rho(g)\) is a scalar matrix. For irreducible \(V\), this means \(\rho(g) = \lambda I\), and since \(g^{|G|} = e\), we get \(\lambda^{|G|} = 1\).
The one-dimensional representations of \(G\) are exactly the inflations of the irreducible representations of the abelianization \(G^\mathrm{ab} = G/[G,G]\). More generally, if \(N\) is a normal subgroup contained in the kernel of some representation, then that representation factors through \(G/N\).
Symmetric and Alternating Powers
Given a \(G\)-module \(V\), the tensor power \(V^{\otimes n}\) carries a natural action of both \(G\) (acting on each tensor factor) and \(S_n\) (permuting the factors). These two actions commute, and their interaction is controlled by Schur-Weyl duality — one of the deep organizing principles of representation theory.
The symmetric power \(\mathrm{Sym}^n(V)\) is the quotient of \(V^{\otimes n}\) by the subspace generated by \(v_1 \otimes \cdots \otimes v_n - v_{\sigma(1)} \otimes \cdots \otimes v_{\sigma(n)}\) for all \(\sigma \in S_n\), and the alternating power \(\mathrm{Alt}^n(V)\) is the quotient by \(v_1 \otimes \cdots \otimes v_n - \mathrm{sgn}(\sigma) v_{\sigma(1)} \otimes \cdots \otimes v_{\sigma(n)}\). Both are \(G\)-modules.
For a two-dimensional representation \(V\), one has \(\dim \mathrm{Sym}^n(V) = n+1\) and \(\dim \mathrm{Alt}^n(V) = \binom{2}{n}\) (so \(\mathrm{Alt}^n(V) = 0\) for \(n > 2\) and \(\mathrm{Alt}^2(V)\) is one-dimensional). The standard representation of \(S_n\) satisfies \(\mathrm{Alt}^{n-1}(V_\mathrm{std}) \cong V_\mathrm{sgn}\).
\[\chi_{\mathrm{Sym}^2 V}(g) = \frac{\chi_V(g)^2 + \chi_V(g^2)}{2}, \qquad \chi_{\mathrm{Alt}^2 V}(g) = \frac{\chi_V(g)^2 - \chi_V(g^2)}{2}.\]These formulas are useful for computing character tables and for determining the Frobenius-Schur indicator.
Similarly, \(\chi_{\mathrm{Alt}^2 V_\mathrm{std}}(e) = \frac{4-2}{2} = 1\), \(\chi_{\mathrm{Alt}^2 V_\mathrm{std}}((12)) = \frac{0-2}{2} = -1\), \(\chi_{\mathrm{Alt}^2 V_\mathrm{std}}((123)) = \frac{1+1}{2} = 1\). This is the character of \(V_\mathrm{sgn}\)! So \(\mathrm{Alt}^2(V_\mathrm{std}) \cong V_\mathrm{sgn}\), consistent with \(V_\mathrm{std} \cong V_\mathrm{std}^*\) (since \(V_\mathrm{std} \otimes V_\mathrm{sgn} \cong V_\mathrm{std}\)) and \(\mathrm{Alt}^2 V \cong \det V\).
Chapter 10: The Representation Theory of \(S_n\)
Young Diagrams
The representation theory of the symmetric group is one of the most beautiful chapters in all of mathematics. It establishes a remarkable bijection between the set of irreducible complex representations of \(S_n\) and the combinatorial objects called Young diagrams.
\[\mathrm{Irr}_\mathbb{C}(S_n) \longleftrightarrow \{\text{conjugacy classes in } S_n\} \longleftrightarrow \{\text{partitions of } n\}.\]The conjugacy class of a permutation is determined by its cycle type, and cycle types correspond to ways of writing \(n\) as an ordered sum of positive integers. A partition of \(n\) is a tuple \(\lambda = (\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_k > 0)\) with \(\sum \lambda_i = n\), written \(\lambda \vdash n\).
A partition \(\lambda\) is displayed as a Young diagram: a left-justified array of boxes with \(\lambda_i\) boxes in row \(i\). For example, the Young diagram of \(\lambda = (3,2,2,1)\) consists of 8 boxes arranged in rows of lengths 3, 2, 2, 1 from top to bottom.
Young Tableaux and Specht Modules
A Young tableau of shape \(\lambda\) is a filling of the Young diagram of \(\lambda\) with the numbers \(1, \ldots, n\), each appearing exactly once. The row group \(R(T)\) is the subgroup of \(S_n\) that permutes entries within each row; the column group \(C(T)\) permutes entries within each column.
\[a_T = \sum_{\sigma \in R(T)} \sigma, \quad b_T = \sum_{\tau \in C(T)} \mathrm{sgn}(\tau) \tau, \quad c_T = b_T a_T \quad \text{(the \textbf{Young symmetrizer})}.\]The hook length formula is a beautiful combinatorial result: despite the intricate algebraic construction, the dimension turns out to be a simple product formula. For instance, \(S^{(n)} \cong V_\mathrm{triv}\), \(S^{(1,1,\ldots,1)} \cong V_\mathrm{sgn}\), and \(S^{(n-1,1)} \cong V_\mathrm{std}\) (the standard representation).
Branching rules describe how Specht modules restrict and induce between \(S_n\) and \(S_{n+1}\): the restriction of \(S^\mu\) (\(\mu \vdash n+1\)) to \(S_n\) decomposes as \(\bigoplus S^{\mu^-}\) over all \(\mu^-\) obtained by removing one box from \(\mu\), and the induction of \(S^\lambda\) (\(\lambda \vdash n\)) to \(S_{n+1}\) decomposes as \(\bigoplus S^{\lambda^+}\) over all \(\lambda^+\) obtained by adding one box.
The Robinson-Schensted Correspondence
One of the most striking combinatorial results connecting the representation theory of \(S_n\) to combinatorics is the Robinson-Schensted correspondence:
where \(f^\lambda = \dim S^\lambda\) is the number of standard Young tableaux of shape \(\lambda\) (equal to the dimension of the Specht module). This is yet another proof of the dimension formula \(|S_n| = \sum_\lambda (\dim S^\lambda)^2\), but now with a combinatorial flavour.
Chapter 11: Module Theory and the Artin-Wedderburn Theorem
Algebras and the Group Algebra
The second arc of the course recasts representation theory in terms of modules over rings. This perspective is more powerful in some respects and provides natural connections to noncommutative algebra.
The central example is the group algebra \(FG\): as a vector space, \(FG = F\langle G \rangle\) with basis \(G\); multiplication is extended from \(G\) by linearity. An \(FG\)-module is exactly an \(F\)-representation of \(G\).
Other key examples:
- \(F[x]\)-modules are vector spaces equipped with a chosen linear operator (the action of \(x\)). By the structure theorem for modules over a PID, the theory of \(F[x]\)-modules recaptures the Jordan normal form theorem.
- \(\mathbb{Z}\)-modules are abelian groups.
- \(M_n(F)\)-modules: the only simple \(M_n(F)\)-module is \(F^n\) (column vectors with matrix multiplication).
The quaternion algebra \(\mathbb{H} = \mathbb{R} \cdot 1 \oplus \mathbb{R} \cdot i \oplus \mathbb{R} \cdot j \oplus \mathbb{R} \cdot k\) (with \(i^2 = j^2 = k^2 = -1\) and \(ij = k\)) is a noncommutative division ring — a “skew field.” It arises naturally in the representation theory of groups with quaternionic (real) irreducibles.
The Center of the Group Algebra
The center \(Z(FG)\) of the group algebra has a beautiful explicit description that connects algebra to combinatorics.
This fact has an important consequence: under the Artin-Wedderburn decomposition \(FG \cong \prod_i M_{n_i}(F)\), the center decomposes as \(Z(FG) \cong \prod_i Z(M_{n_i}(F)) \cong \prod_i F\). So \(\dim Z(FG) = r\) (the number of irreducible representations). Combining: \(r = h(G)\).
Simple and Semisimple Modules
Simple \(FG\)-modules are precisely irreducible \(G\)-representations. Maschke’s theorem, in module-theoretic language, says that \(FG\) is a semisimple ring when \(\mathrm{char}\, F \nmid |G|\).
A remarkably elegant characterization of semisimplicity:
- \(M\) is semisimple.
- \(M\) is a sum (not necessarily direct) of simple submodules.
- Every submodule \(N \subseteq M\) has a complement: there exists \(N'\) with \(M = N \oplus N'\).
- Every submodule of \(M\) is semisimple.
Key consequences:
- Submodules and quotients of semisimple modules are semisimple.
- Every simple submodule of a semisimple module \(M = \bigoplus_i S_i\) is isomorphic to some \(S_i\).
The theorem that \(F^n\) is the unique simple \(M_n(F)\)-module shows how matrix algebras over fields are perfectly “tame” from the module-theory perspective. The Artin-Wedderburn theorem generalizes this.
The Artin-Wedderburn Theorem
This is one of the great theorems of algebra. Wedderburn proved the theorem for finite-dimensional algebras over a field in 1907; Artin extended it to abstract semisimple rings in 1927. The footnote in E. Artin’s 1950 survey captures the community’s reaction: “This extraordinary result has excited the fantasy of every algebraist and still does so in our day.”
Let us explain why this theorem is so powerful. It says that a semisimple ring is — up to isomorphism — nothing but a product of matrix algebras over division rings. There are no other examples. The structure is completely rigid.
The proof strategy is:
- Decompose \(R R = \bigoplus S_i^{\oplus m_i}\) into simple submodules using semisimplicity.
- Each simple submodule is a minimal left ideal; the sum of all copies of a fixed simple \(S_j\) is a two-sided ideal (the “block”).
- Each block is isomorphic to \(M_{m_j}(D_j)\) where \(D_j = \mathrm{End}_R(S_j)^{op}\) is a division ring (by Schur’s Lemma, since \(S_j\) is simple).
- The product decomposition follows from the orthogonality of distinct blocks.
Application to group algebras. When \(F\) is algebraically closed and \(\mathrm{char}\, F \nmid |G|\), Maschke’s theorem ensures that \(FG\) is semisimple. By Schur’s Lemma (part 2), every \(D_i\) in the Artin-Wedderburn decomposition is just \(F\). Therefore:
The dimension formula \(|G| = \sum n_i^2\) follows immediately: \(\dim FG = |G|\) and \(\dim M_{n_i}(F) = n_i^2\). The character table theorem (number of irreducibles = number of conjugacy classes) follows from the theory of the center: \(\dim Z(FG) = h(G)\) (center of the group algebra has basis the conjugacy class sums), and \(Z(M_{n_i}(F)) \cong F\), so \(\dim Z(FG) = r\).
Primitive Idempotents and the Artin-Wedderburn Decomposition Explicitly
The Artin-Wedderburn decomposition is not just an abstract isomorphism — it can be made explicit using primitive central idempotents.
The formula \(e_i = \frac{n_i}{|G|}\sum_g \overline{\chi_i(g)} g\) is the representation-theoretic analogue of the projection formula in Fourier analysis: for a finite abelian group, the projection onto the \(\chi_k\)-eigenspace of a function is \(\hat{f}(k) = \frac{1}{|G|}\sum_g \overline{\chi_k(g)} f(g)\). The extra factor of \(n_i = \dim V_i\) accounts for the non-commutativity.
Over \(\mathbb{R}\), the division rings \(D_i\) can be \(\mathbb{R}\), \(\mathbb{C}\), or \(\mathbb{H}\). This trichotomy is classified by the Frobenius-Schur indicator, discussed in the next chapter.
Chapter 12: Real Representations
The Frobenius-Schur Indicator
Over \(\mathbb{C}\), every representation decomposes into complex irreducibles. Over \(\mathbb{R}\), the story is richer: a complex irreducible \(V\) can behave in three fundamentally different ways when we look at its underlying real structure.
In terms of bilinear forms: \(\iota(\chi_V) = 1\) if and only if \(V\) has a nonzero \(G\)-invariant symmetric bilinear form; \(\iota(\chi_V) = -1\) if and only if \(V\) has a nonzero \(G\)-invariant skew-symmetric bilinear form.
The indicator also arises in the decomposition of symmetric and alternating squares: if \(\chi_{\mathrm{Sym}^2}(g) = \frac{1}{2}(\chi(g)^2 + \chi(g^2))\) and \(\chi_{\mathrm{Alt}^2}(g) = \frac{1}{2}(\chi(g)^2 - \chi(g^2))\), then \(\iota(\chi_V) = \langle \chi_{\mathrm{Sym}^2 V}, \chi_\mathrm{triv} \rangle - \langle \chi_{\mathrm{Alt}^2 V}, \chi_\mathrm{triv} \rangle\).
The Artin-Wedderburn decomposition of \(\mathbb{R}G\) corresponds precisely to this trichotomy: real-type irreducibles contribute \(M_n(\mathbb{R})\) blocks, quaternionic-type contribute \(M_m(\mathbb{H})\) blocks, and complex-type contribute \(M_k(\mathbb{C})\) blocks (with the corresponding \(\mathbb{C}\)-irreducible and its conjugate merging into one \(\mathbb{R}\)-irreducible).
The Quaternion Group \(Q_8\) as an Example
The quaternion group \(Q_8 = \{\pm 1, \pm i, \pm j, \pm k\}\) with the multiplication rules \(i^2 = j^2 = k^2 = -1\) and \(ij = k\), \(jk = i\), \(ki = j\) is one of the most important examples in the theory of real representations.
\(Q_8\) has order 8 and 5 conjugacy classes: \(\{1\}\), \(\{-1\}\), \(\{\pm i\}\), \(\{\pm j\}\), \(\{\pm k\}\). The dimension formula gives \(8 = \sum d_i^2\) with 5 terms. The abelianization is \(Q_8^{\mathrm{ab}} \cong C_2 \times C_2\) (since \([Q_8, Q_8] = \{1, -1\}\)), so there are 4 one-dimensional representations. The unique solution is \(d_i = (1, 1, 1, 1, 2)\).
The character table of \(Q_8\):
| \(\{1\}\) | \(\{-1\}\) | \(\{\pm i\}\) | \(\{\pm j\}\) | \(\{\pm k\}\) | |
|---|---|---|---|---|---|
| \(\chi_0\) | 1 | 1 | 1 | 1 | 1 |
| \(\chi_1\) | 1 | 1 | 1 | \(-1\) | \(-1\) |
| \(\chi_2\) | 1 | 1 | \(-1\) | 1 | \(-1\) |
| \(\chi_3\) | 1 | 1 | \(-1\) | \(-1\) | 1 |
| \(\chi_4\) | 2 | \(-2\) | 0 | 0 | 0 |
Wait, more carefully: \(g^2\) for each of the 8 elements: \(1^2 = 1, (-1)^2 = 1, i^2 = -1, (-i)^2 = -1, j^2 = -1, (-j)^2 = -1, k^2 = -1, (-k)^2 = -1\). So \(\iota(\chi_4) = \frac{1}{8}(\chi_4(1) + \chi_4(1) + \chi_4(-1) + \chi_4(-1) + \chi_4(-1) + \chi_4(-1) + \chi_4(-1) + \chi_4(-1)) = \frac{1}{8}(2 + 2 + (-2) \cdot 6) = \frac{1}{8}(4 - 12) = \frac{-8}{8} = -1\). So \(\chi_4\) is of quaternionic type. This means that while \(\chi_4\) is a 2-dimensional complex irreducible, it does not come from a real representation; instead, it comes from the natural 1-dimensional representation of \(Q_8\) over the quaternions \(\mathbb{H}\).
Chapter 13: Applications to Group Theory
Burnside’s \(p^a q^b\) Theorem
One of the most striking applications of representation theory to pure group theory is Burnside’s theorem, proved in 1904 using character theory — and for which no elementary proof was found until the 1970s.
The historical significance of this theorem is hard to overstate. In 1904, Burnside could prove it only using characters; the result sat with no group-theoretic proof for 70 years, until Goldschmidt (1970) and Matsuyama (1973) gave elementary proofs. That character theory could prove something apparently purely group-theoretic, and that nothing else could prove it for so long, is a testament to the power of the representation-theoretic approach.
The key character-theoretic ingredient is:
The proof proceeds via algebraic number theory: character values are algebraic integers, and so are \(\chi(g)/\chi(e)\). If \(|C|\) and \(\dim V\) are coprime, a Bezout argument shows \(\frac{|C|}{\chi(e)}\chi(g)\) is an algebraic integer, and combining with the absolute value bound \(|\chi(g)| \leq \chi(e)\) forces either \(\chi(g) = 0\) or \(|\chi(g)| = \chi(e)\) (all eigenvalues equal, making \(\rho(g)\) scalar).
The theorem follows: if \(G\) has a conjugacy class of size \(p^a\) (with \(a > 0\)), one analyzes the irreducible characters to show that \(G\) has a proper nontrivial normal subgroup, then proceeds by induction on \(|G|\). The existence of such a conjugacy class is guaranteed by Sylow theory.
Dimension Divisibility
Another application of character theory to group structure:
The proof uses the fact that \(\frac{|G|}{\dim V}\chi_V(g)\) is an algebraic integer for all \(g\) (proved using character inner products and the fact that character values are algebraic integers), combined with the formula \(\frac{|G|}{(\dim V)^2} = \sum_g \frac{\chi_V(g)\overline{\chi_V(g)}}{\dim V}\).
More precisely: one proves \(\frac{|G|}{\dim V} \in \mathbb{Z}\) by showing that \(\frac{|G|}{\dim V} = \sum_{g \in G} \frac{\chi_V(g) \overline{\chi_V(g)}}{\dim V}\) is a sum of algebraic integers (hence an algebraic integer) that also lies in \(\mathbb{Q}\) (hence in \(\mathbb{Z}\)).
Additional Character-Theoretic Applications
Normal Subgroups via Characters
Representation theory gives a clean detection of normal subgroups:
In other words: a subgroup is normal if and only if it is a union of conjugacy classes, and a normal subgroup is “seen” by the character table as a set of conjugacy classes where certain characters take special values (equal to their degree at the identity).
Chapter 14: Induction of Representations
Induced Representations
If \(H \leq G\) and \(W\) is an \(H\)-module, we want to build a \(G\)-module “from” \(W\). This is the process of induction, dual to restriction.
(with \(k = [G:H]\) summands), and the action of \(g \in G\) sends \(t_i \otimes w\) to \(t_j \otimes h \cdot w\), where \(gt_i = t_j h\) for the unique \(t_j\) and \(h \in H\). In particular, \(\dim \mathrm{Ind}_H^G(W) = [G:H] \cdot \dim W\).
We need \(t_i^{-1}(12)t_i \in H = \{e, (12)\}\). Check: \(e^{-1}(12)e = (12) \in H\), contributing \(\chi_-((12)) = -1\); \((13)^{-1}(12)(13) = (23) \notin H\); \((23)^{-1}(12)(23) = (13) \notin H\). So \(\chi_{\mathrm{Ind}\chi_-}((12)) = -1\).
Similarly: \(\chi_{\mathrm{Ind}\chi_-}(e) = 3 \cdot \chi_-(e) = 3 \cdot 1 = 3\); for \((123)\): \(e^{-1}(123)e = (123) \notin H\), and the other coset representatives also give conjugates not in \(H\), so \(\chi_{\mathrm{Ind}\chi_-}((123)) = 0\). The character values are \((3, -1, 0)\) on the three conjugacy classes — this is the character of \(V_\mathrm{triv} \oplus V_\mathrm{std}\)! (Indeed \(1 + 2 = 3\), \(1 + 0 = 1\), \(1 + (-1) = 0\) — wait, \(1 \cdot 1 + (-1) \cdot 1 = 0 \neq -1\). Let me recalculate: \(\chi_\mathrm{triv}((12)) + \chi_\mathrm{std}((12)) = 1 + 0 = 1 \neq -1\). So \(\mathrm{Ind}_H^{S_3}(\chi_-) \ncong V_\mathrm{triv} \oplus V_\mathrm{std}\). By character inner products: \(\langle (3,-1,0), \chi_\mathrm{triv} \rangle = \frac{1}{6}(3 \cdot 1 + 3 \cdot (-1) \cdot 1 + 2 \cdot 0) = 0\); \(\langle (3,-1,0), \chi_\mathrm{sgn} \rangle = \frac{1}{6}(3 - 3(-1)(-1) + 0) = \frac{1}{6}(3-3) = 0\). That gives multiplicity 0 for both one-dimensional irreducibles, but \(\langle (3,-1,0), \chi_\mathrm{std} \rangle = \frac{1}{6}(3\cdot 2 + 3\cdot(-1)(0) + 2\cdot 0\cdot(-1)) = \frac{6}{6} = 1\)… but \(\dim = 3 \neq 2\). Something is wrong. Rechecking: for \(g = e\), \(\chi_{\uparrow}(e) = \sum_i \chi_-(t_i^{-1} e t_i) = 3 \cdot \chi_-(e) = 3\). For \(g = (12)\), only \(i = 1\) contributes (as computed), giving \(-1\). For \(g = (123)\), \(t_1^{-1}(123)t_1 = (123) \notin H\); \((13)(123)(13) = (132) \cdot\)… actually \((13)^{-1}(123)(13)\): let us compute directly. \((13)(123) = (23)\) (as permutations), then \((23)(13) = (123)\). So \((13)^{-1}(123)(13) = (123) \notin H\). Similarly for \(t_3\). So \(\chi_\uparrow((123)) = 0\). Character values: \((3, -1, 0)\) on classes of sizes \(1, 3, 2\). Inner products: \(\langle \chi_\uparrow, \chi_\mathrm{triv} \rangle = \frac{1}{6}(3 - 3 + 0) = 0\), \(\langle \chi_\uparrow, \chi_\mathrm{sgn} \rangle = \frac{1}{6}(3 + 3 + 0) = 1\), \(\langle \chi_\uparrow, \chi_\mathrm{std} \rangle = \frac{1}{6}(6 + 0 + 0) = 1\). So \(\mathrm{Ind}_H^{S_3}(\chi_-) \cong V_\mathrm{sgn} \oplus V_\mathrm{std}\). ✓ Dimension: \(1 + 2 = 3 = [S_3:H] \cdot \dim \chi_- = 3 \cdot 1\). ✓
Frobenius Reciprocity
The most fundamental property of induction is its adjointness to restriction:
This says: the multiplicity of \(W\) in the restriction of \(V\) equals the multiplicity of \(V\) in the induction of \(W\). Frobenius reciprocity is an extremely efficient computational tool: to determine the decomposition of induced representations, one can instead analyze restrictions, which are often easier.
This corollary provides a strategy for determining \(\mathrm{Irr}(G)\): start from irreducibles of subgroups and induce up. This strategy is particularly powerful for groups with a rich subgroup structure, like \(GL_n(\mathbb{F}_q)\).
The Character Formula for Induced Representations
Let us verify \(\phi_1 \uparrow \cong \chi_\mathrm{std}\) explicitly. Take coset representatives \(\{e, (12)\}\). The character formula for \(g = (12)\): we need \(t_i^{-1}(12)t_i \in A_3\). For \(t_1 = e\): \((12) \notin A_3\). For \(t_2 = (12)\): \((12)^{-1}(12)(12) = (12) \notin A_3\). So \(\phi_1\uparrow((12)) = 0\). For \(g = (123)\): \(e^{-1}(123)e = (123) \in A_3\), contributing \(\phi_1((123)) = \omega\); \((12)^{-1}(123)(12) = (132) \in A_3\), contributing \(\phi_1((132)) = \omega^{-1} = \bar\omega\). Sum: \(\omega + \bar\omega = 2\cos(2\pi/3) = -1\). So \(\phi_1\uparrow((123)) = -1\). Character values: \((2, 0, -1)\) = character of \(\chi_\mathrm{std}\). ✓
Mackey’s Irreducibility Criterion
A natural question is when an induced representation is irreducible. The answer involves the notion of Mackey’s double coset formula. If \(H, K \leq G\), then \(G\) decomposes into double cosets \(HsK\), and restricting the induced module \(\mathrm{Ind}_H^G(W)\) to \(K\) decomposes it into pieces indexed by these double cosets:
- \(W \in \mathrm{Irr}(H)\), and
- For every \(s \in G \setminus H\), the representations \(\mathrm{Res}_{H \cap s^{-1}Hs}^H(W)\) and \(\mathrm{Res}_{H \cap sHs^{-1}}^H({}^s W)\) have no common irreducible constituent.
In the special case \(H \trianglelefteq G\), the double cosets are just cosets, and the criterion simplifies: \(\mathrm{Ind}_H^G(W)\) is irreducible if and only if \(W\) is irreducible and no two conjugates \({}^s W\) and \(W\) (\(s \notin H\)) are isomorphic.
| \(1 \cdot e\) | \(20 \cdot (1\,2\,3)\) | \(15 \cdot (1\,2)(3\,4)\) | \(12 \cdot (1\,2\,3\,4\,5)\) | \(12 \cdot (1\,3\,4\,5\,2)\) | |
|---|---|---|---|---|---|
| \(\chi_0\) | 1 | 1 | 1 | 1 | 1 |
| \(\chi_1\) | 3 | 0 | \(-1\) | \(\frac{1+\sqrt{5}}{2}\) | \(\frac{1-\sqrt{5}}{2}\) |
| \(\chi_2\) | 3 | 0 | \(-1\) | \(\frac{1-\sqrt{5}}{2}\) | \(\frac{1+\sqrt{5}}{2}\) |
| \(\chi_3\) | 4 | 1 | 0 | \(-1\) | \(-1\) |
| \(\chi_4\) | 5 | \(-1\) | 1 | 0 | 0 |
The golden ratio \(\varphi = \frac{1+\sqrt{5}}{2}\) appearing in the character values of \(A_5\) reflects the group’s connection to the icosahedron and its fivefold symmetry. Note that \(1^2 + 3^2 + 3^2 + 4^2 + 5^2 = 60 = |A_5|\). ✓
The two 3-dimensional irreducibles \(\chi_1, \chi_2\) arise from the two 3-dimensional geometric representations of \(A_5\): the group \(A_5 \cong \mathrm{Isom}^+(S^2_\mathrm{icosa})\) acts on \(\mathbb{R}^3\) in two ways related by an outer automorphism (since \(A_5\) is simple, any two faithful permutation representations of degree 5 are related by such an automorphism). The representations \(\chi_1\) and \(\chi_2\) are complex conjugates of each other — their character values involve \(\varphi\) and \(\bar\varphi = \frac{1-\sqrt{5}}{2} = 1 - \varphi\). The 4-dimensional irreducible \(\chi_3\) arises as the standard representation of \(A_5\) inside \(S_5\), and \(\chi_4\) (dimension 5) comes from inducing from the two-element subgroup \(C_2 \leq A_5\).
Chapter 15: The Structure of the Group Algebra and Further Topics
Conjugacy Class Sums and the Multiplication of Blocks
\[C_i^+ \cdot C_j^+ = \sum_k a_{ij}^k C_k^+\]\[a_{ij}^k = \frac{|C_i||C_j|}{|G|}\sum_{\ell=1}^r \frac{\chi_\ell(C_i)\chi_\ell(C_j)\overline{\chi_\ell(C_k)}}{\chi_\ell(e)}.\]These structure constants are always non-negative integers (since \(a_{ij}^k\) counts the number of ways to write a fixed element of \(C_k\) as a product of an element from \(C_i\) and an element from \(C_j\)). The formula above is a consequence of the character theory developed earlier and provides another “check” on character tables.
Representation Theory of \(GL_2(\mathbb{F}_q)\)
As a glimpse of more advanced representation theory, we briefly describe the representation theory of \(GL_2(\mathbb{F}_q)\), the group of invertible \(2 \times 2\) matrices over the field with \(q\) elements (where \(q = p^n\) for a prime \(p\)).
The group \(GL_2(\mathbb{F}_q)\) has order \((q^2-1)(q^2-q) = q(q-1)^2(q+1)\). Its representation theory over \(\mathbb{C}\) is rich and serves as a prototype for the representation theory of \(GL_n(\mathbb{F}_q)\) in general. The irreducible representations fall into several families:
- Principal series representations (\((q-1)^2/2\) of them, roughly): induced from characters of the Borel subgroup (upper triangular matrices).
- Special representations (\(q-1\) of them): twisted versions of the Steinberg representation.
- Cuspidal representations (\(q(q-1)/2\) of them): “new” representations that do not appear in the principal series; they are harder to construct and require working with characters of non-split tori.
using the dimensions of the irreducibles above, providing a non-trivial check.
This example illustrates how the representation theory of “groups of Lie type” — finite groups arising as points of algebraic groups over finite fields — is an active and deep area of current mathematics, with connections to the Langlands program, modular forms, and automorphic representations.
Summary Table: Key Facts
The following table summarizes the main structural facts proved in this course.
| Theorem | Statement | Key Tool |
|---|---|---|
| Maschke (1898) | Every rep in char 0 (or (\mathrm{char}\nmid | G |
| Schur’s Lemma | \(\mathrm{Hom}_G(V,W) = 0\) or isomorphisms only; End\(_G(V) \cong F\) | Kernel/image are \(G\)-invariant |
| 1st Orthogonality | \(\langle \chi_i, \chi_j \rangle = \delta_{ij}\) | Schur + averaging |
| 2nd Orthogonality | (\sum_i \chi_i(g)\overline{\chi_i(h)} = | C_G(g) |
| Dimension formula | \(\vert G\vert = \sum_i (\dim V_i)^2\) | Regular rep decomposition |
| \(\#\mathrm{Irr} = \#\mathrm{conj. classes}\) | \(r = h(G)\) | \(\dim Z(\mathbb{C}G) = r = h(G)\) |
| Artin-Wedderburn | \(\mathbb{C}G \cong \prod_i M_{n_i}(\mathbb{C})\) | Semisimplicity + Schur |
| Frobenius reciprocity | \(\langle \chi_W\uparrow, \chi_V \rangle_G = \langle \chi_W, \chi_V\downarrow\rangle_H\) | Adjunction of Ind/Res |
| Burnside \(p^aq^b\) | Groups of order \(p^aq^b\) are solvable | Algebraic integers + characters |
| Dim divisibility | (\dim V \mid | G |
| Hook length formula | \(\dim S^\lambda = n!/\prod h(i,j)\) | Young symmetrizers |
A Final Word: Why Character Tables Are So Powerful
A finite group \(G\) can be an utterly intractable combinatorial object. The symmetric group \(S_{20}\) has more than \(2.4 \times 10^{18}\) elements; a direct attack on its structure is hopeless. But its character table — a \(p(20) \times p(20) = 627 \times 627\) matrix, where \(p(20)\) is the number of partitions of 20 — encodes all the essential information about its representation theory in a finite, computable form.
The character table knows:
- All normal subgroups (as intersections of kernels of characters).
- Whether the group is abelian, solvable, simple, etc.
- How representations tensor with each other (via the Clebsch-Gordan coefficients).
- The structure of the group algebra via Artin-Wedderburn.
And yet it does not know everything: there exist non-isomorphic groups with the same character table (they are called Brauer pairs or simply groups with the same character degrees). The character table is a very powerful invariant, but not a complete invariant.
The fact that such a finite object captures so much information about an infinite-looking problem is, in the end, the deepest miracle of representation theory. It arises because group theory (non-commutative, combinatorial) and linear algebra (commutative, continuous) happen to resonate in exactly the right way — and the character theory we have developed in this course is the mathematical expression of that resonance.
Chapter 16: Deeper Dives — Worked Computations and Extensions
The \(D_4\) Character Table: A Detailed Construction
We have stated the character table of \(D_4 = D_8\) (the dihedral group of order 8) earlier. Let us now redo the construction completely from scratch, using only the structure of \(D_4\) and the general theory, without any foreknowledge of the representations.
Step 1: Group Structure and Conjugacy Classes
The dihedral group of order 8 is \(D_4 = \langle r, s \mid r^4 = s^2 = e,\ srs^{-1} = r^{-1} \rangle\). Its 8 elements are \(\{e, r, r^2, r^3, s, rs, r^2s, r^3s\}\). To find the conjugacy classes, we compute conjugates:
- \(e\) is always alone: \(C_1 = \{e\}\).
- Since \(srs^{-1} = r^{-1} = r^3\), we have \(r \sim r^3\). Is \(r \sim r^2\)? Computing \(r^k r r^{-k} = r\) for all \(k\), and \(sr^k r (sr^k)^{-1} = sr^{k+1}r^{-k}s^{-1} = s r s^{-1} = r^{-1} = r^3\). So \(r\) and \(r^3\) are conjugate but \(r^2\) is not conjugate to \(r\). Thus \(C_3 = \{r, r^3\}\).
- \(r^2\) generates \(\{e, r^2\}\) in the center: \(r \cdot r^2 \cdot r^{-1} = r^2\) and \(s r^2 s^{-1} = r^{-2} = r^2\). So \(C_2 = \{r^2\}\).
- For the reflections: \(r s r^{-1} = r s r^3 = r (r^3 s) \cdot \text{(using } rs = sr^{-1}) \). Alternatively: \(r \cdot s \cdot r^{-1} = r s r^3\). Using \(rs = sr^{-1} = sr^3\): \(r(sr^3) = (rs)r^3 = sr^{-1}r^3 = sr^2 = r^2s \cdot \text{?}\) Let me be more careful. In \(D_4\), the rule is \(sr = r^{-1}s = r^3 s\). So \(r s r^{-1} = r s r^3\). First, \(sr^3 = r^{-3}s = rs\). So \(r(sr^3) = r \cdot rs \cdot\)… Actually let me use a direct computation: \(r s r^{-1}\). We have \(rs = sr^{-1}\), so \(r = sr^{-1}s^{-1}\)… this is getting complicated. Let me just tabulate:
So \(r s r^{-1} = r^2 s\). Similarly, \(r^2 s r^{-2} = s\) (can be verified). And \(r^3 s r^{-3} = r^2 s\) by similar computation. Wait: \(r(r^2 s)r^{-1} = r^3 s r^{-1} = r^3 s r^3 = \ldots\). Actually one can check that the four reflections \(\{s, rs, r^2s, r^3s\}\) split into two conjugacy classes:
- \(C_4 = \{s, r^2 s\}\): since \(r s r^{-1} = r^2 s\) (computed above), and \(r^2 s (r^2)^{-1} = r^2 s r^2 = s\) (since \(r^2\) is central), so \(s\) and \(r^2 s\) are conjugate.
- \(C_5 = \{rs, r^3 s\}\): since \(r(rs)r^{-1} = r^2 s r^3 = r^{-1}s = r^3 s\).
So the 5 conjugacy classes are: \(C_1 = \{e\}\), \(C_2 = \{r^2\}\), \(C_3 = \{r, r^3\}\), \(C_4 = \{s, r^2s\}\), \(C_5 = \{rs, r^3s\}\), with sizes 1, 1, 2, 2, 2.
Step 2: Determine Dimension Types
\(|D_4| = 8 = \sum d_i^2\) with 5 terms: unique solution \((1,1,1,1,2)\). The abelianization is \(D_4 / [D_4, D_4]\). The commutator subgroup \([D_4, D_4]\) is generated by \([r, s] = rsr^{-1}s^{-1} = r \cdot r^{-2}s \cdot s = r^{-1}\)… wait, let me compute \([r,s] = rsr^{-1}s^{-1}\). We showed \(rsr^{-1} = r^2s\), so \([r,s] = r^2s \cdot s^{-1} = r^2\). Thus \([D_4, D_4] = \langle r^2 \rangle = \{e, r^2\}\) and \(D_4^{\mathrm{ab}} = D_4/\{e, r^2\} \cong C_2 \times C_2\). This has 4 elements, confirming 4 one-dimensional representations.
Step 3: Write Down the One-Dimensional Representations
The one-dimensional representations factor through \(D_4^{\mathrm{ab}} \cong C_2 \times C_2\). In this quotient, \(r\) maps to an element of order dividing 2 and \(s\) maps to an element of order dividing 2. So \(\rho(r) \in \{+1, -1\}\) and \(\rho(s) \in \{+1, -1\}\) with the only constraint \(\rho(r)^4 = \rho(s)^2 = 1\) (automatic) and \(\rho(s)\rho(r)\rho(s)^{-1} = \rho(r)^{-1}\), which forces \(\rho(r)^{-1} = \rho(r)\) (since \(\rho(s)^2 = 1\)), i.e., \(\rho(r)^2 = 1\). So \(\rho(r) = \pm 1\) and \(\rho(s) = \pm 1\), giving 4 choices:
| \(e\) | \(r^2\) | \(r, r^3\) | \(s, r^2s\) | \(rs, r^3s\) | |
|---|---|---|---|---|---|
| \(\chi_1\): \((\rho(r), \rho(s)) = (1,1)\) | 1 | 1 | 1 | 1 | 1 |
| \(\chi_2\): \((\rho(r), \rho(s)) = (1,-1)\) | 1 | 1 | 1 | \(-1\) | \(-1\) |
| \(\chi_3\): \((\rho(r), \rho(s)) = (-1,1)\) | 1 | 1 | \(-1\) | 1 | \(-1\) |
| \(\chi_4\): \((\rho(r), \rho(s)) = (-1,-1)\) | 1 | 1 | \(-1\) | \(-1\) | 1 |
Step 4: Find the 2-Dimensional Irreducible by Orthogonality
Let \(\chi_5 = (a, b, c, d, e_5)\) be the unknown row. We know \(a = \chi_5(e) = 2\).
\[\frac{1}{8}(1 \cdot 2 + 1 \cdot b + 2 \cdot c + 2 \cdot d + 2 \cdot e_5) = 0 \implies 2 + b + 2c + 2d + 2e_5 = 0.\]\[\frac{1}{8}(2 + b + 2c - 2d - 2e_5) = 0 \implies 2 + b + 2c - 2d - 2e_5 = 0.\]Subtracting: \(4d + 4e_5 = 0 \implies d = -e_5\).
\[\frac{1}{8}(2 + b - 2c + 2d - 2e_5) = 0 \implies 2 + b - 2c + 2d - 2e_5 = 0.\]\[\frac{1}{8}(2 + b - 2c - 2d + 2e_5) = 0 \implies 2 + b - 2c - 2d + 2e_5 = 0.\]From the \(\chi_3\) and \(\chi_4\) equations, adding: \(4 + 2b - 4c = 0 \implies b = 2c - 2\). Subtracting: \(4d - 4e_5 = 0 \implies d = e_5\). Combined with \(d = -e_5\): \(d = e_5 = 0\).
Now from \(\chi_1\) equation: \(2 + b + 2c = 0\), and \(b = 2c - 2\): \(2 + (2c-2) + 2c = 0 \implies 4c = 0 \implies c = 0\). Then \(b = -2\).
Finally, check normality: \(\langle \chi_5, \chi_5 \rangle = \frac{1}{8}(4 + 4 + 0 + 0 + 0) = 1\). ✓
The full character table of \(D_4\):
| \(\{e\}\) | \(\{r^2\}\) | \(\{r,r^3\}\) | \(\{s,r^2s\}\) | \(\{rs,r^3s\}\) | |
|---|---|---|---|---|---|
| \(\chi_1\) | 1 | 1 | 1 | 1 | 1 |
| \(\chi_2\) | 1 | 1 | 1 | \(-1\) | \(-1\) |
| \(\chi_3\) | 1 | 1 | \(-1\) | 1 | \(-1\) |
| \(\chi_4\) | 1 | 1 | \(-1\) | \(-1\) | 1 |
| \(\chi_5\) | 2 | \(-2\) | 0 | 0 | 0 |
Verification: \(1 + 1 + 1 + 1 + 4 = 8 = |D_4|\). ✓
\[1 \cdot 1 + 1 \cdot 1 + 1 \cdot (-1) + 1 \cdot (-1) + (-2) \cdot 0 = 1 + 1 - 1 - 1 + 0 = 0. \checkmark\]The 2-Dimensional Representation Explicitly
We already wrote down the 2-dimensional representation via matrices. Let us verify that the character values match. The conjugacy class \(C_3 = \{r, r^3\}\) has \(\chi_5(r) = \mathrm{tr}\bigl(\begin{smallmatrix}0&-1\\1&0\end{smallmatrix}\bigr) = 0 = \chi_5(r^3)\). The class \(C_2 = \{r^2\}\) has \(\chi_5(r^2) = \mathrm{tr}(-I) = -2\). All consistent with the table. ✓
Tensor Products of \(D_4\) Representations
\[(1 \cdot 1, 1 \cdot 1, 1 \cdot (-1), (-1) \cdot 1, (-1)(-1)) = (1, 1, -1, -1, 1) = \chi_4.\]\[\chi_5 \otimes \chi_2: (2 \cdot 1, (-2) \cdot 1, 0 \cdot 1, 0 \cdot (-1), 0 \cdot (-1)) = (2, -2, 0, 0, 0) = \chi_5.\]So \(\chi_5 \otimes \chi_2 \cong \chi_5\): tensoring the 2-dimensional irreducible with \(\chi_2\) leaves it unchanged! This is the \(D_4\) analogue of the \(S_3\) result \(V_\mathrm{std} \otimes V_\mathrm{sgn} \cong V_\mathrm{std}\).
\[(4, 4, 0, 0, 0).\]Decomposing: \(\langle (4,4,0,0,0), \chi_i \rangle\) for each \(i\). The sizes are 1, 1, 2, 2, 2, so the inner product is \(\frac{1}{8}\) times the weighted sum. For \(\chi_1\): \(\frac{1}{8}(4 + 4) = 1\). For \(\chi_2\): \(\frac{1}{8}(4 + 4) = 1\). For \(\chi_3\): \(\frac{1}{8}(4 + 4) = 1\). For \(\chi_4\): \(\frac{1}{8}(4 + 4) = 1\). For \(\chi_5\): \(\frac{1}{8}(8 - 8) = 0\). So \(\chi_5 \otimes \chi_5 \cong \chi_1 \oplus \chi_2 \oplus \chi_3 \oplus \chi_4\). Dimension check: \(4 = 1 + 1 + 1 + 1\). ✓
Restriction and Induction: Detailed Examples
Restricting from \(D_4\) to \(C_4\)
The rotation subgroup \(C_4 = \langle r \rangle \cong C_4\) is a normal subgroup of \(D_4\) of index 2. The irreducible representations of \(C_4\) over \(\mathbb{C}\) are \(\psi_k : r^j \mapsto i^{jk}\) for \(k = 0, 1, 2, 3\) (where \(i = e^{2\pi i/4}\)).
What are the restrictions of \(D_4\)-irreducibles to \(C_4\)?
For the one-dimensional reps: \(\mathrm{Res}_{C_4}^{D_4}(\chi_j)\) is a 1-dimensional rep of \(C_4\) determined by \(\chi_j(r)\). From the table: \(\chi_1(r) = 1\), \(\chi_2(r) = 1\), \(\chi_3(r) = -1\), \(\chi_4(r) = -1\). So \(\chi_1\!\downarrow = \chi_2\!\downarrow = \psi_0\) (trivial) and \(\chi_3\!\downarrow = \chi_4\!\downarrow = \psi_2\) (the order-2 character, since \(\psi_2(r) = -1\)).
\[\langle \chi_5\!\downarrow, \psi_1 \rangle_{C_4} = \tfrac{1}{4}(2 \cdot 1 + 0 \cdot \bar i + (-2)(-1) + 0 \cdot \bar{i}^3) = \tfrac{1}{4}(2 + 2) = 1.\]Similarly \(\langle \chi_5\!\downarrow, \psi_3 \rangle_{C_4} = 1\). And \(\langle \chi_5\!\downarrow, \psi_0\rangle = \frac{1}{4}(2+0-2+0) = 0\), \(\langle \chi_5\!\downarrow, \psi_2\rangle = \frac{1}{4}(2+0-2+0) = 0\). So \(\chi_5\!\downarrow \cong \psi_1 \oplus \psi_3\).
By Frobenius reciprocity, \(\mathrm{mult}(\chi_5, \psi_1\!\uparrow) = \mathrm{mult}(\psi_1, \chi_5\!\downarrow) = 1\). So \(\chi_5\) appears in \(\psi_1\!\uparrow\). Since \(\dim(\psi_1\!\uparrow) = [D_4:C_4] \cdot 1 = 2 = \dim\chi_5\), we have \(\psi_1\!\uparrow \cong \chi_5\)! The 2-dimensional irreducible of \(D_4\) is the induced representation of the primitive 4th-root-of-unity character of \(C_4\).
The Regular Representation of \(D_4\) Decomposed
\[\mathbb{C}D_4 \cong \chi_1^{\oplus 1} \oplus \chi_2^{\oplus 1} \oplus \chi_3^{\oplus 1} \oplus \chi_4^{\oplus 1} \oplus \chi_5^{\oplus 2}.\]Dimension check: \(1 + 1 + 1 + 1 + 4 = 8 = |D_4|\). ✓
This can be verified using the character of the regular representation: \(\chi_\mathrm{reg}(e) = 8\), \(\chi_\mathrm{reg}(g) = 0\) for \(g \neq e\). The multiplicity of \(\chi_5\) is \(\langle \chi_5, \chi_\mathrm{reg}\rangle = \frac{1}{8}(2 \cdot 8) = 2\). ✓
\[\mathbb{C}D_4 \cong \mathbb{C} \times \mathbb{C} \times \mathbb{C} \times \mathbb{C} \times M_2(\mathbb{C}).\]Dimension check: \(1 + 1 + 1 + 1 + 4 = 8\). ✓
Character Theory Applied to Group Structure: More Examples
Detecting Simplicity
A finite group \(G\) is simple if its only normal subgroups are \(\{e\}\) and \(G\). Since normal subgroups correspond to intersections of kernels of characters, \(G\) is simple if and only if no proper subset of conjugacy classes (other than just \(\{e\}\)) forms a union that is the kernel of some character.
Equivalently, \(G\) is simple if and only if every irreducible character other than the trivial character has trivial kernel.
This argument does not use any structural properties of \(A_5\) — just the numerical data in its character table. It is a remarkable fact that simplicity (a structural property) is detectable from the purely numerical character table.
The Class Equation via Characters
\[|C_G(g)| = \sum_{i=1}^r |\chi_i(g)|^2.\]This is because the second orthogonality relation gives \(\sum_i |\chi_i(g)|^2 = |G|/|C_G(g)| \cdot |C_G(g)| = |C_G(g)|\)… wait, let me be careful. The second orthogonality says \(\sum_i \chi_i(g)\overline{\chi_i(h)} = |C_G(g)| \delta_{[g],[h]}\). Setting \(g = h\): \(\sum_i |\chi_i(g)|^2 = |C_G(g)|\). So the centralizer size is the sum of squares of character values in a column of the character table!
The Induced Character Formula in Detail
We state and carefully prove the formula for the character of an induced representation, as it is used constantly in computations.
The Tensor Product and Frobenius Reciprocity Together
Here is a worked example that uses both the tensor product of characters and Frobenius reciprocity simultaneously.
The projection formula is a very efficient computational tool when combined with Frobenius reciprocity.
Chapter 17: Representation Theory and Number Theory
Characters and Dirichlet \(L\)-functions
One of the most beautiful applications of representation theory is the proof of Dirichlet’s theorem on primes in arithmetic progressions. Let us explain the connection.
Dirichlet’s Theorem. For any \(a, n\) with \(\gcd(a, n) = 1\), there are infinitely many primes \(p \equiv a \pmod{n}\).
The proof uses Dirichlet characters: group homomorphisms \(\chi : (\mathbb{Z}/n\mathbb{Z})^\times \to \mathbb{C}^\times\). These are exactly the one-dimensional complex representations of the group \((\mathbb{Z}/n\mathbb{Z})^\times\).
\[L(s, \chi) = \sum_{n=1}^\infty \frac{\chi(n)}{n^s} = \prod_p \frac{1}{1 - \chi(p)p^{-s}},\]where the product is over primes and \(\chi(n)\) is extended to all positive integers by setting \(\chi(n) = 0\) if \(\gcd(n, N) > 1\).
\[\frac{1}{\phi(n)} \sum_{\chi} \chi(a^{-1})\chi(p) = \begin{cases} 1 & \text{if } p \equiv a \pmod{n} \\ 0 & \text{otherwise} \end{cases}\]is precisely the character orthogonality for abelian groups, applied to detect the arithmetic progression \(p \equiv a \pmod{n}\). The proof of Dirichlet’s theorem then reduces to showing that each \(L(1, \chi) \neq 0\) for non-trivial \(\chi\) — a non-vanishing result at the boundary of the region of convergence.
Thus Dirichlet’s theorem is, at its heart, a result about the representation theory of the abelian group \((\mathbb{Z}/n\mathbb{Z})^\times\). The “Fourier analysis” on this group (decomposing functions into irreducible components indexed by characters) is the mechanism by which the arithmetic information is extracted.
Gauss Sums and Character Theory
\[\tau(\chi) = \sum_{a=0}^{n-1} \chi(a) e^{2\pi i a/n}.\]These sums are fundamental in analytic number theory and appear in the functional equation for \(L(s, \chi)\).
\[|\tau(\chi)|^2 = \begin{cases} n & \text{if } \chi \text{ is primitive} \\ 0 & \text{if } \chi \text{ is imprimitive} \end{cases}\]is a consequence of the character orthogonality relations for the group \(\mathbb{Z}/n\mathbb{Z}\).
\[\tau(\chi)^2 = (-1)^{(p-1)/2} p = \begin{cases} p & \text{if } p \equiv 1 \pmod{4} \\ -p & \text{if } p \equiv 3 \pmod{4}. \end{cases}\]This is Gauss’s famous theorem on quadratic Gauss sums, and the proof is essentially an exercise in character orthogonality.
Artin \(L\)-functions and the Nonabelian Case
\[L(s, \rho) = \prod_p \det\left(I - \rho(\mathrm{Frob}_p) p^{-s}\right)^{-1},\]where \(\mathrm{Frob}_p\) is the Frobenius element at a prime \(p\). The key properties:
- If \(\rho = \mathbf{1}\) is trivial, \(L(s, \mathbf{1}) = \zeta(s)\) is the Riemann zeta function.
- If \(G\) is abelian and \(\rho\) is one-dimensional, \(L(s, \rho)\) is a Dirichlet \(L\)-function.
- For any two representations \(\rho_1, \rho_2\): \(L(s, \rho_1 \oplus \rho_2) = L(s, \rho_1) L(s, \rho_2)\).
- Artin’s conjecture (largely open): \(L(s, \rho)\) extends to an entire function for any nontrivial irreducible \(\rho\).
which is Artin’s induction theorem. This connects the representation theory of \(G\) to the arithmetic of the field extension.
Chapter 18: Schur-Weyl Duality
Setup and Statement
Schur-Weyl duality is one of the most striking applications of representation theory and connects the representation theories of \(GL_n\) and \(S_d\) in a tight, mutually determining way.
Let \(V = \mathbb{C}^n\) be the standard representation of \(GL_n(\mathbb{C})\). The tensor power \(V^{\otimes d} = V \otimes \cdots \otimes V\) (\(d\) times) carries two commuting actions:
- \(GL_n(\mathbb{C})\) acts diagonally: \(g \cdot (v_1 \otimes \cdots \otimes v_d) = gv_1 \otimes \cdots \otimes gv_d\).
- \(S_d\) acts by permuting factors: \(\sigma \cdot (v_1 \otimes \cdots \otimes v_d) = v_{\sigma^{-1}(1)} \otimes \cdots \otimes v_{\sigma^{-1}(d)}\).
These two actions commute: \(g \sigma v = \sigma g v\) for \(g \in GL_n\), \(\sigma \in S_d\). Define:
- \(A = \mathbb{C}[GL_n]\)-endomorphisms of \(V^{\otimes d}\) = image of the \(GL_n\) action in \(\mathrm{End}(V^{\otimes d})\).
- \(B = \mathbb{C}[S_d]\)-endomorphisms = image of the \(S_d\) action.
The key word is “multiplicity-free”: each irreducible of \(S_d\) (occurring at all) appears with multiplicity exactly 1, and the corresponding \(GL_n\)-module is the “partner” \(W^\lambda\). The two representation theories are linked in a perfect duality: knowing which \(S^\lambda\) appear determines which \(W^\lambda\) appear, and vice versa.
Consequences
For \(GL_n\)-representation theory. The irreducible polynomial representations of \(GL_n(\mathbb{C})\) in degree \(d\) are precisely the \(W^\lambda\) for \(\lambda \vdash d\) with at most \(n\) rows. Their characters are the Schur polynomials \(s_\lambda(x_1, \ldots, x_n)\) — the most important basis for the ring of symmetric functions.
For symmetric function theory. The ring of symmetric functions \(\Lambda = \mathbb{Z}[x_1, x_2, \ldots]^{S_\infty}\) has several natural bases (monomial, elementary, complete, power sum, Schur). Schur-Weyl duality gives a representation-theoretic interpretation of all these bases and the relations between them.
For combinatorics. The dimension of \(W^\lambda\) (for \(GL_n\) acting on degree-\(d\) part of \(\mathbb{C}^n\)) is the number of semistandard Young tableaux of shape \(\lambda\) with entries in \(\{1, \ldots, n\}\) — a formula known as the hook-content formula. Combined with the hook length formula for \(\dim S^\lambda\), Schur-Weyl duality gives deep combinatorial identities.
Chapter 19: Further Directions
Modular Representation Theory
When the characteristic of the field \(F\) divides \(|G|\), Maschke’s theorem fails and the theory becomes much richer and more complicated. This is modular representation theory, developed by Richard Brauer.
The key new features:
- Brauer characters: Analogues of ordinary characters, defined using a lift from characteristic \(p\) to characteristic 0 via the ring of Witt vectors.
- Decomposition matrix: The matrix \(D\) with \(D_{\chi, \phi}\) = multiplicity of the modular irreducible \(\phi\) as a composition factor of the reduction mod \(p\) of the ordinary irreducible \(\chi\).
- Blocks: The indecomposable two-sided ideals of \(FG\). Each block is associated to a conjugacy class of \(p\)-subgroups (its defect groups).
- Projective indecomposable modules (PIMs): The indecomposable direct summands of \(FG\) as a left \(FG\)-module. These replace the simple summands of Artin-Wedderburn.
The discrepancy: the standard 2-dimensional complex rep \(V_\mathrm{std}\) reduces mod 3 to an irreducible \(\mathbb{F}_3\)-module, and so does the sign rep. The trivial rep also reduces to an irreducible. But there are 3 ordinary irreducibles and only 2 modular ones — the decomposition matrix \(D\) encodes how ordinary irreducibles break up into modular composition factors.
The McKay Correspondence
A deep and mysterious connection between representation theory and geometry is the McKay correspondence, discovered by John McKay in 1980.
Let \(\Gamma \leq SU(2)\) be a finite subgroup. By the classification, these are cyclic groups \(C_n\), dihedral groups \(D_n\), and the exceptional groups corresponding to the tetrahedron (\(A_4\)), octahedron/cube (\(S_4\)), and icosahedron/dodecahedron (\(A_5\)).
The McKay graph of \(\Gamma\) is defined as follows: vertices are the irreducible representations \(\{V_0, V_1, \ldots, V_r\}\) of \(\Gamma\) (with \(V_0\) the trivial rep), and the number of edges from \(V_i\) to \(V_j\) equals the multiplicity of \(V_j\) in \(V \otimes V_i\), where \(V\) is the 2-dimensional representation of \(\Gamma\) coming from its inclusion in \(SU(2)\).
McKay’s Theorem: The McKay graph of \(\Gamma\) is an affine Dynkin diagram of type \(\widetilde{A}_n\), \(\widetilde{D}_n\), \(\widetilde{E}_6\), \(\widetilde{E}_7\), or \(\widetilde{E}_8\) — exactly the same ADE classification as that of simple Lie algebras!
This is a striking “coincidence” with no elementary explanation: why should finite subgroups of \(SU(2)\) (classified by the ADE types via their McKay graphs) correspond to simple Lie algebras (also classified by ADE Dynkin diagrams)? The explanation ultimately lies in algebraic geometry (the minimal resolutions of \(\mathbb{C}^2/\Gamma\) are ADE surface singularities) and string theory (which provides a physical interpretation of the duality). But the McKay correspondence, discovered simply by computing McKay graphs of finite groups, is one of the most beautiful unexpected discoveries in representation theory.
Representation Stability
A modern direction in representation theory is the notion of representation stability, introduced by Church, Ellenberg, and Farb around 2010.
The idea is that many natural sequences of representations \(V_n\) of \(S_n\) (or of other groups in a compatible family) stabilize in a precise sense as \(n \to \infty\). For instance, the cohomology groups \(H^k(\mathrm{Conf}_n(\mathbb{C}), \mathbb{Q})\) (configuration spaces of \(n\) distinct points in \(\mathbb{C}\)) are representations of \(S_n\), and they are representation-stable for large enough \(n\).
This notion of stability is formalized using the language of FI-modules (modules over the category FI of finite sets and injections), where an FI-module encodes a compatible system of representations of all symmetric groups simultaneously. The theory has applications to statistics, algebraic topology, and algebraic geometry.
Geometric Representation Theory: A Glimpse
The most active frontier of modern representation theory is its interaction with geometry. A few highlights:
Geometric Satake. The irreducible representations of a reductive algebraic group \(G\) (like \(GL_n\), \(SO_n\), etc.) over \(\mathbb{C}\) are in natural bijection with the \(G^\vee\)-equivariant perverse sheaves on the affine Grassmannian \(\mathrm{Gr}_G\) (where \(G^\vee\) is the Langlands dual group). This is the geometric Satake correspondence.
Categorification. The irreducible representations of \(S_n\) (Specht modules \(S^\lambda\)) can be “categorified”: one constructs a 2-category (a category of categories) whose Grothendieck group (the free abelian group on isomorphism classes of objects) recovers the usual representation ring. The morphisms in the 2-category then give extra structure invisible at the level of characters.
Kazhdan-Lusztig theory. For Hecke algebras (deformations of group algebras of Weyl groups), the representation theory involves mysterious polynomials \(P_{y,w}(q)\) (Kazhdan-Lusztig polynomials) whose coefficients count intersection cohomology of Schubert varieties. These polynomials are always non-negative — a fact that was conjectured on representation-theoretic grounds and proved geometrically.
The thread connecting all these developments is this: the linear-algebraic structures we have studied — characters, orthogonality, induction, modules — are the algebraic shadows of much richer geometric and categorical structures. Understanding those richer structures is one of the driving forces of contemporary mathematics.
These notes cover the content of PMATH 445 at the University of Waterloo as taught by Faisal Al-Faisal, enriched with detailed worked examples, proofs, historical context, and connections to broader mathematics. The core theory (Maschke, Schur, orthogonality, Artin-Wedderburn, induction, Frobenius reciprocity) is developed with full proofs; the later chapters indicate the depth and breadth of the subject beyond the course.