PMATH 940: Modular Forms

C. L. Stewart

Estimated study time: 1 hr 18 min

Table of contents

Sources and References

  • Diamond, F. and Shurman, J. A First Course in Modular Forms. Springer GTM 228, 2005.
  • Serre, J.-P. A Course in Arithmetic. Springer GTM 7, 1973. (Part II: Modular Forms.)
  • Koblitz, N. Introduction to Elliptic Curves and Modular Forms. Springer GTM 97, 2nd ed. 1993.
  • Zagier, D. “Modular Forms and Their Applications.” In The 1-2-3 of Modular Forms. Springer Universitext, 2008.
  • Apostol, T. M. Modular Functions and Dirichlet Series in Number Theory. Springer GTM 41, 2nd ed. 1990.
  • Silverman, J. H. The Arithmetic of Elliptic Curves. Springer GTM 106, 2009.

Chapter 1: The Upper Half-Plane and the Modular Group

1.1 The Complex Upper Half-Plane

Modular forms live on the upper half-plane, the open subset of the complex plane defined by

\[ \mathfrak{H} = \{ \tau \in \mathbb{C} : \operatorname{Im}(\tau) > 0 \}. \]

This is a rich geometric object. Equipped with the hyperbolic (Poincaré) metric

\[ ds^2 = \frac{dx^2 + dy^2}{y^2}, \quad \tau = x + iy, \]

\(\mathfrak{H}\) becomes a model of the hyperbolic plane with constant Gaussian curvature \(-1\). The group of orientation-preserving isometries of this metric turns out to be precisely the group of Möbius transformations with real coefficients and positive determinant, which brings us immediately to the modular group.

1.2 The Modular Group and Its Action

The modular group is the group of two-by-two integer matrices with determinant one:

\[ \operatorname{SL}(2,\mathbb{Z}) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} : a,b,c,d \in \mathbb{Z},\; ad - bc = 1 \right\}. \]

This group acts on \(\mathfrak{H}\) by Möbius transformations (also called fractional linear transformations): for \(\gamma = \begin{pmatrix} a & b \\ c & d \end{pmatrix}\) and \(\tau \in \mathfrak{H}\),

\[ \gamma \cdot \tau = \frac{a\tau + b}{c\tau + d}. \]

One must check that this actually maps \(\mathfrak{H}\) to itself. A direct computation shows

\[ \operatorname{Im}(\gamma \cdot \tau) = \frac{\operatorname{Im}(\tau)}{|c\tau + d|^2}, \]

which is positive whenever \(\operatorname{Im}(\tau) > 0\). The kernel of the action consists of the matrices \(\pm I\), so the effective group of symmetries is the projective special linear group

\[ \operatorname{PSL}(2,\mathbb{Z}) = \operatorname{SL}(2,\mathbb{Z}) / \{ \pm I \}. \]

The modular group is generated by the two elements

\[ S = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \quad T = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}, \]

whose actions are \(S \cdot \tau = -1/\tau\) and \(T \cdot \tau = \tau + 1\). These satisfy the relations \(S^2 = (ST)^3 = -I\) in \(\operatorname{SL}(2,\mathbb{Z})\), equivalently \(S^2 = (ST)^3 = I\) in \(\operatorname{PSL}(2,\mathbb{Z})\). The modular group is therefore a free product \(\mathbb{Z}/2\mathbb{Z} * \mathbb{Z}/3\mathbb{Z}\) in its projective form — a fact that plays an important role in computing the structure of spaces of modular forms.

1.3 Congruence Subgroups

Many important families of modular forms are naturally associated not with the full modular group but with finite-index subgroups. The most important are the congruence subgroups defined by congruence conditions on the matrix entries. Fix a positive integer \(N\), called the level.

The *principal congruence subgroup of level \(N\)* is \[ \Gamma(N) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \operatorname{SL}(2,\mathbb{Z}) : \begin{pmatrix} a & b \\ c & d \end{pmatrix} \equiv \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \pmod{N} \right\}. \]

A subgroup \(\Gamma \leq \operatorname{SL}(2,\mathbb{Z})\) is called a congruence subgroup if it contains \(\Gamma(N)\) for some \(N\). The smallest such \(N\) is the level of \(\Gamma\).

The two most commonly encountered congruence subgroups between \(\Gamma(N)\) and \(\operatorname{SL}(2,\mathbb{Z})\) are

\[ \Gamma_0(N) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \operatorname{SL}(2,\mathbb{Z}) : c \equiv 0 \pmod{N} \right\}, \]\[ \Gamma_1(N) = \left\{ \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \operatorname{SL}(2,\mathbb{Z}) : c \equiv 0,\; a \equiv d \equiv 1 \pmod{N} \right\}. \]

The inclusions \(\Gamma(N) \subseteq \Gamma_1(N) \subseteq \Gamma_0(N) \subseteq \operatorname{SL}(2,\mathbb{Z})\) are all of finite index. The index \([\operatorname{SL}(2,\mathbb{Z}) : \Gamma_0(N)]\) equals \(N \prod_{p \mid N}(1 + 1/p)\), where the product runs over primes \(p\) dividing \(N\).

1.4 The Fundamental Domain

A fundamental domain for the action of \(\operatorname{SL}(2,\mathbb{Z})\) on \(\mathfrak{H}\) is a closed region \(\mathcal{F} \subset \mathfrak{H}\) whose translates under \(\operatorname{SL}(2,\mathbb{Z})\) tile \(\mathfrak{H}\) with at most boundary overlaps.

The standard fundamental domain for \(\operatorname{SL}(2,\mathbb{Z})\) is \[ \mathcal{F} = \left\{ \tau \in \mathfrak{H} : |\operatorname{Re}(\tau)| \leq \tfrac{1}{2},\; |\tau| \geq 1 \right\}. \]

Every \(\tau \in \mathfrak{H}\) is \(\operatorname{SL}(2,\mathbb{Z})\)-equivalent to some point in \(\mathcal{F}\), and two interior points of \(\mathcal{F}\) are equivalent only if they coincide.

The proof proceeds in two steps. For existence, given \(\tau \in \mathfrak{H}\), the set \(\{|c\tau + d|^2 : c,d \in \mathbb{Z},\,(c,d) = 1\}\) is bounded below by a positive constant (depending on \(\operatorname{Im}(\tau)\)), so there exists a matrix \(\gamma\) maximizing \(\operatorname{Im}(\gamma \cdot \tau)\). This maximum is attained in \(\mathcal{F}\) after applying a suitable power of \(T\) to control the real part. Uniqueness requires checking the boundary identifications: the left and right edges \(\operatorname{Re}(\tau) = \pm 1/2\) are identified by \(T\), and the arc \(|\tau| = 1\) maps to itself under \(S\).

The boundary of \(\mathcal{F}\) contains two special points worth singling out. The point \(\tau = i\) is fixed by \(S\) (since \(S \cdot i = -1/i = i\)), and the points \(\tau = e^{2\pi i/3} = -1/2 + i\sqrt{3}/2\) and \(\tau = e^{\pi i/3} = 1/2 + i\sqrt{3}/2\) are both fixed by \(ST\) and its conjugates.

1.5 Elliptic Points and Cusps

Points in \(\mathfrak{H}\) whose stabilizer in \(\operatorname{PSL}(2,\mathbb{Z})\) is nontrivial are called elliptic points. For the full modular group there are exactly two \(\operatorname{SL}(2,\mathbb{Z})\)-orbits of elliptic points: the orbit of \(\tau = i\) (stabilizer of order 2, generated by \(S\)) and the orbit of \(\tau = \rho = e^{2\pi i/3}\) (stabilizer of order 3, generated by \(ST\)).

The cusps are the points in \(\mathbb{P}^1(\mathbb{Q}) = \mathbb{Q} \cup \{\infty\}\), which form the boundary at infinity of \(\mathfrak{H}\). The modular group acts transitively on \(\mathbb{P}^1(\mathbb{Q})\), so there is a single cusp up to equivalence, which may be taken to be \(\infty\). For a congruence subgroup \(\Gamma\) of level \(N\), there are finitely many \(\Gamma\)-orbits of cusps, and their number can be computed from the index and level.

The compactified upper half-plane \(\mathfrak{H}^* = \mathfrak{H} \cup \mathbb{P}^1(\mathbb{Q})\) is given a topology by adjoining, for each cusp \(s = \gamma \cdot \infty\), “horoball” neighborhoods. The quotient \(Y(\Gamma) = \Gamma \backslash \mathfrak{H}\) is a Riemann surface (non-compact), and its compactification \(X(\Gamma) = \Gamma \backslash \mathfrak{H}^*\) is a compact Riemann surface, hence an algebraic curve. For \(\Gamma = \operatorname{SL}(2,\mathbb{Z})\), the compactified quotient \(X(1) = \operatorname{SL}(2,\mathbb{Z}) \backslash \mathfrak{H}^*\) has genus 0, consistent with its being parametrized by the \(j\)-function.


Chapter 2: Modular Functions

2.1 Automorphy and Modular Functions

We now formalize the notion of a function on \(\mathfrak{H}\) that “transforms nicely” under the modular group. Let \(\Gamma\) be a congruence subgroup and \(k\) an integer.

A meromorphic function \(f : \mathfrak{H} \to \mathbb{C}\) satisfies the *weight-\(k\) automorphy condition* for \(\Gamma\) if \[ f\!\left(\frac{a\tau+b}{c\tau+d}\right) = (c\tau+d)^k f(\tau) \quad \text{for all } \begin{pmatrix}a&b\\c&d\end{pmatrix} \in \Gamma. \]

A modular function of weight 0 for \(\Gamma\) is a meromorphic function on \(\mathfrak{H}^*\) that is invariant (weight \(k=0\)) under \(\Gamma\).

For weight 0, the automorphy condition says simply \(f(\gamma \cdot \tau) = f(\tau)\), i.e., \(f\) descends to a meromorphic function on the compact Riemann surface \(X(\Gamma)\). Since \(X(\Gamma)\) is compact, any holomorphic function on it is constant; only meromorphic (or rational) functions are interesting.

The field of modular functions for \(\operatorname{SL}(2,\mathbb{Z})\) (i.e., weight-0 invariants, meromorphic on \(X(1)\)) is generated over \(\mathbb{C}\) by a single function, the celebrated \(j\)-function.

2.2 The \(j\)-Function

Since \(T = \begin{pmatrix}1&1\\0&1\end{pmatrix} \in \operatorname{SL}(2,\mathbb{Z})\), any automorphic function must satisfy \(f(\tau+1) = f(\tau)\), i.e., it is periodic in \(\tau\) with period 1. Such a function admits a Fourier expansion (also called a \(q\)-expansion) in the local parameter

\[ q = e^{2\pi i \tau}. \]

As \(\operatorname{Im}(\tau) \to \infty\), we have \(q \to 0\), so the cusp \(\infty\) corresponds to \(q = 0\) under this local coordinate. A periodic meromorphic function on \(\mathfrak{H}\) that is also meromorphic at the cusp \(\infty\) has an expansion

\[ f(\tau) = \sum_{n = n_0}^{\infty} a_n q^n, \quad q = e^{2\pi i\tau}, \]

for some \(n_0 \in \mathbb{Z}\).

The *\(j\)-function* (or *Klein \(j\)-invariant*) is the unique modular function for \(\operatorname{SL}(2,\mathbb{Z})\) whose \(q\)-expansion has the form \[ j(\tau) = q^{-1} + 744 + 196884\,q + 21493760\,q^2 + \cdots \]

It is holomorphic on \(\mathfrak{H}\), has a simple pole at the cusp \(\infty\), and every modular function for \(\operatorname{SL}(2,\mathbb{Z})\) is a rational function of \(j\).

The coefficients of \(j(\tau)\) are positive integers with remarkable arithmetic significance, as we will see in the discussion of Moonshine.

2.3 Eisenstein Series

The cleanest way to produce modular forms is via averaging over the group. Let \(k \geq 2\) be an integer. Consider the Eisenstein series

\[ G_k(\tau) = \sum_{(c,d) \in \mathbb{Z}^2 \setminus \{(0,0)\}} \frac{1}{(c\tau + d)^k}. \]

For \(k \geq 3\), this sum converges absolutely and uniformly on compact subsets of \(\mathfrak{H}\), defining a holomorphic function. One verifies directly that \(G_k(\gamma \cdot \tau) = (c\tau+d)^k G_k(\tau)\) for all \(\gamma = \begin{pmatrix}a&b\\c&d\end{pmatrix} \in \operatorname{SL}(2,\mathbb{Z})\) — the weight-\(k\) automorphy condition. By Fourier analysis, the \(q\)-expansion of \(G_k\) can be made explicit.

For odd \(k\), the sum vanishes identically because the term for \((c,d)\) and \((-c,-d)\) cancel. So the interesting Eisenstein series are for even \(k \geq 4\).

For even \(k \geq 4\), the \(q\)-expansion of \(G_k\) is \[ G_k(\tau) = 2\zeta(k) + 2 \frac{(2\pi i)^k}{(k-1)!} \sum_{n=1}^{\infty} \sigma_{k-1}(n) q^n, \]

where \(\zeta(k) = \sum_{n=1}^\infty n^{-k}\) is the Riemann zeta function and \(\sigma_{k-1}(n) = \sum_{d \mid n} d^{k-1}\) is the sum-of-divisors function.

It is customary to normalize by dividing out the leading constant. Define the normalized Eisenstein series

\[ E_k(\tau) = \frac{G_k(\tau)}{2\zeta(k)} = 1 - \frac{2k}{B_k} \sum_{n=1}^{\infty} \sigma_{k-1}(n) q^n, \]

where \(B_k\) denotes the \(k\)-th Bernoulli number. Explicitly:

\[ E_4(\tau) = 1 + 240\sum_{n=1}^{\infty} \sigma_3(n) q^n = 1 + 240q + 2160q^2 + 6720q^3 + \cdots, \]\[ E_6(\tau) = 1 - 504\sum_{n=1}^{\infty} \sigma_5(n) q^n = 1 - 504q - 16632q^2 - \cdots. \]

These two functions generate the ring of modular forms for \(\operatorname{SL}(2,\mathbb{Z})\), a fact we will prove once we have the dimension formula in hand.

2.4 The Discriminant and Ramanujan’s \(\tau\)-Function

One of the most important functions in the theory is the modular discriminant

\[ \Delta(\tau) = \frac{E_4(\tau)^3 - E_6(\tau)^2}{1728}. \]

The normalization is chosen so that \(\Delta\) has a simple zero at the cusp and its \(q\)-expansion begins with \(q\):

\[ \Delta(\tau) = q \prod_{n=1}^{\infty} (1 - q^n)^{24} = \sum_{n=1}^{\infty} \tau(n) q^n, \]

where the arithmetic function \(n \mapsto \tau(n)\) is Ramanujan’s tau-function. The first few values are

\[ \tau(1) = 1,\quad \tau(2) = -24,\quad \tau(3) = 252,\quad \tau(4) = -1472,\quad \tau(5) = 4830. \]

Ramanujan observed (and Mordell proved) that \(\tau\) is multiplicative: if \(\gcd(m,n) = 1\) then \(\tau(mn) = \tau(m)\tau(n)\). Ramanujan also conjectured — and this is far deeper — that \(|\tau(p)| \leq 2p^{11/2}\) for all primes \(p\). This became the Ramanujan conjecture, proved by Deligne in 1974 as a consequence of his proof of the Weil conjectures.

The function \(\Delta\) has weight 12. It is remarkable in being a cusp form — holomorphic on \(\mathfrak{H}\) and vanishing at the cusp — and the unique cusp form of weight 12 up to scalar multiples.


Chapter 3: Modular Forms

3.1 Definition and Basic Examples

We now define the principal objects of study.

Let \(\Gamma\) be a congruence subgroup and \(k\) a non-negative integer. A *modular form of weight \(k\) for \(\Gamma\)* is a holomorphic function \(f : \mathfrak{H} \to \mathbb{C}\) such that:
  1. (Automorphy) \(f(\gamma \cdot \tau) = (c\tau + d)^k f(\tau)\) for all \(\gamma = \begin{pmatrix}a&b\\c&d\end{pmatrix} \in \Gamma\).
  2. (Holomorphic at cusps) For each cusp \(s\) of \(\Gamma\), the function \(f|_k[\sigma_s]\) (where \(\sigma_s\) is any element of \(\operatorname{SL}(2,\mathbb{Z})\) sending \(\infty\) to \(s\)) has a Fourier expansion in the local coordinate at the cusp with only non-negative powers of \(q\).

A modular form is a cusp form if its Fourier expansion at every cusp has no constant term (i.e., it vanishes at every cusp).

The space of all modular forms of weight \(k\) for \(\Gamma\) is denoted \(M_k(\Gamma)\), and the subspace of cusp forms is \(S_k(\Gamma)\). Both are finite-dimensional complex vector spaces.

For the full modular group \(\Gamma = \operatorname{SL}(2,\mathbb{Z})\), there is only one cusp (at \(\infty\)), and the holomorphic-at-cusp condition simply says that the \(q\)-expansion \(f(\tau) = \sum_{n=0}^{\infty} a_n q^n\) starts from \(n = 0\). Being a cusp form means \(a_0 = 0\), i.e., \(f\) vanishes at \(\infty\).

3.2 The Valence Formula

The key tool for determining the dimensions \(\dim M_k(\Gamma)\) is the valence formula, which constrains the total zero-pole count of a meromorphic automorphic form via the Gauss-Bonnet theorem for the Riemann surface \(X(\Gamma)\).

For the full modular group, the formula reads:

*(Valence formula for \(\operatorname{SL}(2,\mathbb{Z})\))* If \(f\) is a nonzero meromorphic modular form of weight \(k\) for \(\operatorname{SL}(2,\mathbb{Z})\), then \[ v_\infty(f) + \frac{1}{2} v_i(f) + \frac{1}{3} v_\rho(f) + \sum_{\substack{\tau \in \mathcal{F} \\ \tau \neq i, \rho}} v_\tau(f) = \frac{k}{12}, \]

where \(v_\tau(f)\) denotes the order of vanishing of \(f\) at \(\tau\), and \(\rho = e^{2\pi i/3}\).

The fractional contributions at \(i\) and \(\rho\) arise because these are elliptic fixed points with stabilizers of order 2 and 3 respectively; the formula must account for ramification.

One integrates \(\frac{d\log f}{2\pi i}\) around a suitably truncated version of the fundamental domain, using the Cauchy residue theorem. The boundary terms on the identified edges cancel by the automorphy condition, leaving a contribution only from the arc at the top (giving \(v_\infty(f)\)) and from the elliptic points (giving the fractional parts).

3.3 Dimension Formulas

The valence formula immediately gives the dimension of \(M_k(\Gamma)\) for small weights and for the full modular group.

For the full modular group \(\operatorname{SL}(2,\mathbb{Z})\):
  • \(M_k(\operatorname{SL}(2,\mathbb{Z})) = 0\) if \(k < 0\) or \(k\) is odd.
  • \(\dim M_k(\operatorname{SL}(2,\mathbb{Z})) = \lfloor k/12 \rfloor\) if \(k \equiv 2 \pmod{12}\), and \(\lfloor k/12 \rfloor + 1\) otherwise (for even \(k \geq 0\)).
  • \(\dim S_k(\operatorname{SL}(2,\mathbb{Z})) = \dim M_k - 1\) for \(k \geq 2\) (one cusp), and \(S_k = 0\) for \(k \leq 10\), \(k \neq 12\).

From these formulas one deduces:

\[ \dim M_0 = 1, \quad \dim M_2 = 0, \quad \dim M_4 = 1, \quad \dim M_6 = 1, \quad \dim M_8 = 1, \quad \dim M_{10} = 1, \quad \dim M_{12} = 2. \]

Since \(M_4\) is 1-dimensional and contains \(E_4\), every weight-4 modular form is a multiple of \(E_4\). Similarly \(M_6 = \mathbb{C} \cdot E_6\). The first nontrivial cusp form occurs at weight 12: \(S_{12} = \mathbb{C} \cdot \Delta\). The space \(M_{12}\) is spanned by \(E_4^3 = E_6^2 \cdot (\text{something})\) and \(\Delta\), or equivalently by \(E_{12}\) and \(\Delta\).

The ring of modular forms for \(\operatorname{SL}(2,\mathbb{Z})\) is the polynomial ring \[ M_* = \bigoplus_{k \geq 0} M_k(\operatorname{SL}(2,\mathbb{Z})) = \mathbb{C}\left[ E_4, E_6 \right], \]

a free commutative graded ring with generators in degrees 4 and 6.

For a general congruence subgroup \(\Gamma\), the dimension formula comes from the Riemann-Roch theorem on the compact Riemann surface \(X(\Gamma)\). The genus \(g\) of \(X(\Gamma)\) and the number of cusps and elliptic points of various orders enter the formula, which takes the form

\[ \dim M_k(\Gamma) = (k-1)(g-1) + \left\lfloor \frac{k}{4} \right\rfloor \nu_2 + \left\lfloor \frac{k}{3} \right\rfloor \nu_3 + \frac{k}{2} \nu_\infty \]

for even \(k \geq 2\), where \(\nu_2, \nu_3\) count elliptic points of orders 2 and 3 respectively and \(\nu_\infty\) counts cusps. The precise formula involves further correction terms depending on the parity of \(k\) and the specific subgroup.

3.4 Low-Weight Examples and the Dedekind Eta Function

The Dedekind eta function is

\[ \eta(\tau) = q^{1/24} \prod_{n=1}^{\infty}(1-q^n), \quad q = e^{2\pi i\tau}. \]

This is not quite a modular form because of the fractional power of \(q\); rather, it is a modular form of weight \(1/2\) with a character, hence a half-integral weight form. Nevertheless, its 24th power is related to \(\Delta\) via

\[ \Delta(\tau) = \eta(\tau)^{24}. \]

The eta function satisfies the transformation formulas

\[ \eta(\tau+1) = e^{\pi i/12} \eta(\tau), \quad \eta(-1/\tau) = \sqrt{-i\tau}\,\eta(\tau). \]

These encode the modular transformation behavior of \(\Delta\) since \(\Delta(\tau+1) = \Delta(\tau)\) and \(\Delta(-1/\tau) = \tau^{12} \Delta(\tau)\), consistent with weight 12.

3.5 The \(q\)-Expansion Principle

The \(q\)-expansion is a fundamental tool, but it is more than just a convenient Fourier expansion — it carries the arithmetic structure of modular forms.

*(q-Expansion Principle)* A modular form \(f \in M_k(\Gamma)\) is uniquely determined by its \(q\)-expansion at the cusp \(\infty\). Moreover, if \(f\) has \(q\)-expansion \(\sum_{n=0}^\infty a_n q^n\), then \(f\) is a cusp form if and only if \(a_0 = 0\).

A much deeper result (requiring algebraic geometry) shows that if \(f \in M_k(\Gamma_0(N))\) has all Fourier coefficients \(a_n\) lying in a subring \(R \subset \mathbb{C}\) and if \(a_0 \in R\), then the \(q\)-expansion at every cusp also has coefficients in a ring closely related to \(R\). This “arithmetic” aspect of \(q\)-expansions is essential for the theory of Hecke operators and the modularity theorem.


Chapter 4: Doubly Periodic Functions and Elliptic Curves

4.1 Lattices and the Weierstrass \(\wp\)-Function

The connection between modular forms and elliptic curves runs deep, and understanding it requires a brief excursion into the theory of doubly periodic meromorphic functions on \(\mathbb{C}\).

A *lattice* in \(\mathbb{C}\) is a subgroup \(\Lambda = \mathbb{Z}\omega_1 + \mathbb{Z}\omega_2\) where \(\omega_1, \omega_2 \in \mathbb{C}\) are \(\mathbb{R}\)-linearly independent. We may (and do) assume that \(\tau = \omega_1/\omega_2\) satisfies \(\operatorname{Im}(\tau) > 0\), so \(\tau \in \mathfrak{H}\).

A meromorphic function \(f : \mathbb{C} \to \mathbb{C} \cup \{\infty\}\) is doubly periodic (or elliptic) with respect to \(\Lambda\) if \(f(z + \omega) = f(z)\) for all \(\omega \in \Lambda\). The study of such functions is essentially equivalent to the study of meromorphic functions on the torus \(\mathbb{C}/\Lambda\).

The fundamental example is the Weierstrass \(\wp\)-function:

\[ \wp(z;\Lambda) = \frac{1}{z^2} + \sum_{\omega \in \Lambda \setminus \{0\}} \left( \frac{1}{(z-\omega)^2} - \frac{1}{\omega^2} \right). \]

The subtraction of \(1/\omega^2\) ensures absolute convergence. The function \(\wp\) is clearly doubly periodic (as shifting \(z\) by a period permutes the terms of the sum) and has a double pole at each lattice point.

The Weierstrass \(\wp\)-function satisfies the differential equation \[ (\wp'(z))^2 = 4\wp(z)^3 - g_2(\Lambda)\wp(z) - g_3(\Lambda), \]

where the lattice invariants are

\[ g_2(\Lambda) = 60 G_4(\Lambda) = 60 \sum_{\omega \in \Lambda \setminus \{0\}} \omega^{-4}, \quad g_3(\Lambda) = 140 G_6(\Lambda) = 140 \sum_{\omega \in \Lambda \setminus \{0\}} \omega^{-6}. \]
One computes the Laurent expansion of \(\wp(z)\) near \(z = 0\): \[ \wp(z) = z^{-2} + \sum_{k=1}^{\infty}(2k+1)G_{2k+2}(\Lambda) z^{2k}, \]

where \(G_{2k}(\Lambda) = \sum_{\omega \neq 0} \omega^{-2k}\) are the Eisenstein sums of the lattice. Differentiating and substituting, one verifies that \((\wp')^2 - 4\wp^3 + g_2\wp + g_3\) is a doubly periodic function with no poles, hence a constant. Computing the Laurent expansion shows the constant is zero.

4.2 Eisenstein Series as Lattice Invariants

The relationship between the Eisenstein series \(G_k(\tau)\) from Chapter 2 and the lattice sums \(G_k(\Lambda)\) is completely transparent: for the lattice \(\Lambda = \mathbb{Z} + \mathbb{Z}\tau\), one has

\[ G_k(\Lambda) = G_k(\tau), \]

since every nonzero element of \(\Lambda\) has the form \(c\tau + d\) for \((c,d) \in \mathbb{Z}^2 \setminus \{(0,0)\}\).

This identification is the bridge between the analytic theory of lattices and the theory of modular forms. The function \(\tau \mapsto G_k(\tau)\) is not just a modular form — it is the natural function encoding the geometry of the lattice \(\mathbb{Z} + \mathbb{Z}\tau\).

4.3 Elliptic Curves as Complex Tori

The differential equation for \(\wp\) says that the map \(z \mapsto (\wp(z), \wp'(z))\) sends \(\mathbb{C}/\Lambda\) to the affine curve

\[ E : y^2 = 4x^3 - g_2 x - g_3, \]

which is an elliptic curve (in Weierstrass form) over \(\mathbb{C}\). This map is actually a group isomorphism between the torus \(\mathbb{C}/\Lambda\) (with its natural abelian group structure) and the points of \(E\) (with the group law defined geometrically by the chord-tangent construction).

The discriminant of the cubic on the right is \(\Delta(\Lambda) = g_2(\Lambda)^3 - 27 g_3(\Lambda)^2\), and the curve is nonsingular precisely when \(\Delta(\Lambda) \neq 0\). One computes that \(\Delta(\Lambda) = (2\pi)^{12} \Delta(\tau)\) (up to a power of 12), confirming that \(\Delta(\tau) \neq 0\) for all \(\tau \in \mathfrak{H}\) — a fact we already knew from the product formula \(\Delta(\tau) = q \prod(1-q^n)^{24}\) with \(|q| < 1\).

The *\(j\)-invariant* of a lattice \(\Lambda\) (or the corresponding elliptic curve) is \[ j(\Lambda) = 1728 \frac{g_2(\Lambda)^3}{g_2(\Lambda)^3 - 27 g_3(\Lambda)^2} = 1728 \frac{E_4(\tau)^3}{\Delta(\tau) \cdot \text{const}}, \]

which equals the \(j\)-function \(j(\tau)\) evaluated at \(\tau = \omega_1/\omega_2\).

Two elliptic curves over \(\mathbb{C}\) are isomorphic (as complex Lie groups) if and only if their \(j\)-invariants agree, and every element of \(\mathbb{C}\) is the \(j\)-invariant of some elliptic curve. Thus the \(j\)-function provides a bijection

\[ j : \operatorname{SL}(2,\mathbb{Z}) \backslash \mathfrak{H}^* \xrightarrow{\;\sim\;} \mathbb{P}^1(\mathbb{C}), \]

identifying the modular curve \(X(1)\) with the Riemann sphere and confirming the genus-0 nature of \(X(1)\).


Chapter 5: Hecke Operators and Newforms

5.1 Hecke Operators

The vector spaces \(M_k(\Gamma)\) carry a commuting family of linear operators — the Hecke operators — that endow them with additional arithmetic structure. For the full modular group and a positive integer \(n\), the Hecke operator \(T_n\) acts on \(M_k(\operatorname{SL}(2,\mathbb{Z}))\) by

\[ (T_n f)(\tau) = n^{k-1} \sum_{ad = n,\, a,d > 0} \sum_{b=0}^{d-1} d^{-k} f\!\left(\frac{a\tau + b}{d}\right). \]

Equivalently, if \(f(\tau) = \sum_{m=0}^\infty a_m q^m\), then

\[ (T_n f)(\tau) = \sum_{m=0}^{\infty} \left( \sum_{d \mid \gcd(m,n)} d^{k-1} a_{mn/d^2} \right) q^m. \]

The key algebraic properties of the Hecke operators are:

1. \(T_m T_n = T_{mn}\) if \(\gcd(m,n) = 1\). 2. For a prime \(p\) and \(r \geq 1\): \(T_{p^r} = T_p T_{p^{r-1}} - p^{k-1} T_{p^{r-2}}\). 3. The operators \(T_n\) are *self-adjoint* with respect to the Petersson inner product (defined in Section 5.3). 4. All Hecke operators mutually commute: \(T_m T_n = T_n T_m\) for all \(m, n\).

Relations (1) and (2) show that the Hecke operators are “almost multiplicative”: they satisfy the same multiplicativity relations as Dirichlet series coefficients of an Euler product. This foreshadows the connection with \(L\)-functions.

5.2 Eigenforms and Their \(L\)-Functions

Since the Hecke operators mutually commute and are self-adjoint, they can be simultaneously diagonalized on \(S_k(\Gamma)\).

A nonzero \(f \in M_k(\Gamma)\) is a *Hecke eigenform* (or simply an *eigenform*) if \(T_n f = \lambda_n f\) for all \(n \geq 1\). If additionally \(f\) is normalized so that \(a_1(f) = 1\), it is called a *normalized eigenform* or *Hecke newform*.

For a normalized eigenform \(f = \sum_{n=1}^\infty a_n q^n\), the eigenvalue relation gives \(\lambda_n = a_n\): the eigenvalues are just the Fourier coefficients. The multiplicativity of the \(T_n\) then translates into multiplicativity of the \(a_n\): \(a_{mn} = a_m a_n\) when \(\gcd(m,n) = 1\), and \(a_{p^r} = a_p a_{p^{r-1}} - p^{k-1} a_{p^{r-2}}\) for primes \(p\).

These relations mean that the \(L\)-function

\[ L(s, f) = \sum_{n=1}^{\infty} \frac{a_n}{n^s} \]

has an Euler product decomposition

\[ L(s, f) = \prod_p \frac{1}{1 - a_p p^{-s} + p^{k-1-2s}}, \]

a striking structural property analogous to the Riemann zeta function and Dirichlet \(L\)-functions.

For \(\Delta\), the unique normalized cusp form of weight 12, this gives

\[ L(s, \Delta) = \prod_p \frac{1}{1 - \tau(p) p^{-s} + p^{11-2s}}. \]

5.3 The Petersson Inner Product

For \(f, g \in S_k(\Gamma)\), the *Petersson inner product* is \[ \langle f, g \rangle = \int_{\Gamma \backslash \mathfrak{H}} f(\tau) \overline{g(\tau)} \, y^{k-2} \, dx \, dy, \quad \tau = x + iy, \]

where integration is over any fundamental domain for \(\Gamma\).

The measure \(y^{k-2} \, dx \, dy = y^k \, d\mu\) where \(d\mu = y^{-2} \, dx \, dy\) is the hyperbolic area measure, making the integrand automorphic of weight 0 and the integral well-defined. The restriction to cusp forms (rather than all modular forms) ensures convergence.

The Petersson inner product makes \(S_k(\Gamma)\) into a Hilbert space (of finite dimension). The Hecke operators \(T_n\) (for \(\gcd(n, N) = 1\) when \(\Gamma = \Gamma_0(N)\)) are normal with respect to this inner product, which is why they are simultaneously diagonalizable.

5.4 Old Forms, New Forms, and the Multiplicity-One Theorem

For congruence subgroups \(\Gamma_0(N)\), the space \(S_k(\Gamma_0(N))\) contains old forms: these are cusp forms coming from lower levels, i.e., of the form \(f(\tau) \in S_k(\Gamma_0(M))\) for some \(M \mid N\), \(M < N\), possibly composed with \(\tau \mapsto d\tau\) for \(d \mid N/M\). The orthogonal complement of the space of old forms in \(S_k(\Gamma_0(N))\) is the space of new forms \(S_k^{\text{new}}(\Gamma_0(N))\).

*(Atkin-Lehner, Multiplicity-One)* The space \(S_k^{\text{new}}(\Gamma_0(N))\) has a basis of normalized newforms. Each normalized newform is a simultaneous eigenform for all Hecke operators \(T_n\), and two distinct normalized newforms in \(S_k^{\text{new}}(\Gamma_0(N))\) cannot have the same eigenvalue for all \(T_p\) with \(p \nmid N\).

The multiplicity-one theorem is a cornerstone of the theory: it says that a newform is uniquely determined (up to scalar) by its Hecke eigenvalues. This rigidity is what makes it possible to match modular forms with Galois representations and elliptic curves — the content of the modularity theorem.


Chapter 6: \(L\)-Functions of Modular Forms

6.1 The Mellin Transform and Dirichlet Series

Let \(f(\tau) = \sum_{n=1}^\infty a_n q^n\) be a cusp form of weight \(k\) for \(\operatorname{SL}(2,\mathbb{Z})\). The connection between \(f\) and its \(L\)-function is mediated by the Mellin transform: substituting \(\tau = it\) (so \(q = e^{-2\pi t}\)), one obtains

\[ f(it) = \sum_{n=1}^\infty a_n e^{-2\pi n t}. \]

The Mellin transform of this is

\[ \int_0^\infty f(it) t^s \frac{dt}{t} = \sum_{n=1}^\infty a_n \int_0^\infty e^{-2\pi n t} t^s \frac{dt}{t} = \frac{\Gamma(s)}{(2\pi)^s} \sum_{n=1}^\infty \frac{a_n}{n^s} = \frac{\Gamma(s)}{(2\pi)^s} L(s, f). \]

This motivates defining the completed \(L\)-function

\[ \Lambda(s, f) = (2\pi)^{-s} \Gamma(s) L(s, f) = \int_0^\infty f(it) t^s \frac{dt}{t}. \]

6.2 Functional Equation and Analytic Continuation

The transformation formula \(f(-1/\tau) = \tau^k f(\tau)\) (which holds when \(f\) has weight \(k\) under \(S\)) translates, after the substitution \(t \mapsto 1/t\), into a functional equation for \(\Lambda(s,f)\).

*(Hecke)* Let \(f \in S_k(\operatorname{SL}(2,\mathbb{Z}))\) be a cusp form of weight \(k\). The completed \(L\)-function \(\Lambda(s, f)\) extends to an entire function on \(\mathbb{C}\) and satisfies the functional equation \[ \Lambda(k - s, f) = (-1)^{k/2} \Lambda(s, f). \]

In particular, \(L(s, f)\) has an analytic continuation to all of \(\mathbb{C}\) and satisfies a functional equation relating \(s\) to \(k - s\).

Split the integral \(\int_0^\infty = \int_0^1 + \int_1^\infty\) and substitute \(t \mapsto 1/t\) in the first piece. Using the modular transformation \(f(i/t) = (it)^k f(it)\), one obtains \[ \int_0^1 f(it) t^s \frac{dt}{t} = \int_1^\infty f(i/t) t^{-s} \frac{dt}{t} = i^k \int_1^\infty f(it) t^{k-s} \frac{dt}{t}. \]

Adding the two pieces gives \(\Lambda(s) = i^k \Lambda(k-s)\), which is the functional equation with \((-1)^{k/2} = i^k\) for even \(k\).

For congruence subgroups \(\Gamma_0(N)\), the situation is analogous but requires additional care. A newform \(f\) of level \(N\) satisfies \(f|_k w_N = \epsilon \bar{f}\) where \(w_N\) is the Atkin-Lehner involution and \(\epsilon\) is a root of unity (the sign of the functional equation), giving

\[ \Lambda(s, f) = \epsilon \cdot N^{k/2 - s} \Lambda(k-s, f). \]

The sign \(\epsilon \in \{+1, -1\}\) determines whether \(L(k/2, f)\) can be nonzero — the central value \(L(k/2, f)\) vanishes when \(\epsilon = -1\) (the functional equation forces the zero).

6.3 Analytic Properties and the Ramanujan Conjecture

The Fourier coefficients \(a_n\) of a cusp form satisfy the Ramanujan-Petersson bound. For \(f = \sum a_n q^n \in S_k(\Gamma_0(N))\) a normalized newform:

*(Deligne, 1974)* For all primes \(p \nmid N\): \[ |a_p| \leq 2 p^{(k-1)/2}. \]

Equivalently, writing \(a_p = p^{(k-1)/2} (\alpha_p + \bar\alpha_p)\) with \(|\alpha_p| = 1\), the local \(L\)-factor at \(p\) is

\[ (1 - \alpha_p p^{-s})(1 - \bar\alpha_p p^{-s})^{-1}, \]

and Deligne’s bound says both roots \(\alpha_p, \bar\alpha_p\) lie on the unit circle. Deligne proved this by interpreting the Fourier coefficients as traces of Frobenius on certain \(\ell\)-adic cohomology groups and applying his proof of the Weil conjectures for varieties over finite fields.

For weight 2, the bound \(|a_p| \leq 2\sqrt{p}\) is the Hasse bound on the number of points of an elliptic curve over \(\mathbb{F}_p\), reflecting the deep connection between weight-2 cusp forms and elliptic curves.


Chapter 7: Applications

7.1 Sums of Squares

One of the earliest triumphs of the theory of modular forms is the derivation of exact formulas for representation numbers. The sum-of-four-squares problem asks: in how many ways can a positive integer \(n\) be written as a sum of four perfect squares?

Let \(r_4(n) = \#\{(a,b,c,d) \in \mathbb{Z}^4 : a^2+b^2+c^2+d^2 = n\}\). Jacobi proved:

*(Jacobi, 1829)* \[ r_4(n) = 8 \sum_{\substack{d \mid n \\ 4 \nmid d}} d. \]

In particular, \(r_4(n) > 0\) for all \(n \geq 1\), proving Lagrange’s four-square theorem.

The proof using modular forms proceeds by recognizing the generating function

\[ \theta(q)^4 = \left(\sum_{n=-\infty}^{\infty} q^{n^2}\right)^4 = \sum_{n=0}^\infty r_4(n) q^n \]

as a modular form. The function \(\theta(\tau) = \sum_n q^{n^2}\) (with \(q = e^{2\pi i \tau}\)) satisfies transformation formulas under the Jacobi theta transformation \(\tau \mapsto -1/\tau\) (with \(\sqrt{-i\tau}\) factors), making \(\theta(\tau)^4\) a modular form of weight 2 for \(\Gamma_0(4)\). One then uses the fact that the space of such modular forms is spanned by Eisenstein series and cusp forms, and the Eisenstein part gives Jacobi’s formula exactly.

7.2 Kloosterman Sums and Weil Bounds

The Kloosterman sum is defined for integers \(m, n\) and a modulus \(c > 0\) by

\[ S(m, n; c) = \sum_{\substack{d \pmod{c} \\ \gcd(d,c)=1}} e^{2\pi i(md + n\bar{d})/c}, \]

where \(\bar{d}\) denotes the inverse of \(d\pmod{c}\). These sums appear naturally in the Fourier coefficients of Poincaré series and in the Kuznetsov trace formula.

*(Weil, 1948)* For \(\gcd(mn, c) = 1\): \[ |S(m,n;c)| \leq \tau(c) \sqrt{c} \cdot \gcd(m,n,c)^{1/2}, \]

where \(\tau(c)\) is the number of divisors of \(c\). In particular, \(|S(m,n;c)| \leq \tau(c)\sqrt{c}\) when \(\gcd(mn,c)=1\).

Weil’s proof uses the Riemann hypothesis for curves over finite fields. The connection to modular forms arises via the theory of Poincaré series: a Poincaré series is defined by

\[ P_m(\tau) = \sum_{\gamma \in \Gamma_\infty \backslash \Gamma} (c\tau+d)^{-k} e^{2\pi i m \gamma\tau}, \]

and its Fourier coefficients involve Kloosterman sums via the Petersson formula, making the Weil bounds applicable in questions about modular forms.

7.3 The Congruent Number Problem and Tunnell’s Theorem

A positive integer \(n\) is a congruent number if there exists a right triangle with all three sides rational and area equal to \(n\). For example, 5 is congruent (sides 3/2, 20/3, 41/6 with area \((1/2)(3/2)(20/3) = 5\)), while 1 is conjectured not to be congruent.

The connection to elliptic curves is classical: \(n\) is congruent if and only if the elliptic curve

\[ E_n : y^2 = x^3 - n^2 x \]

has a rational point of infinite order, i.e., its Mordell-Weil group \(E_n(\mathbb{Q})\) is infinite.

The Birch and Swinnerton-Dyer conjecture (BSD) for \(E_n\) asserts that \(E_n(\mathbb{Q})\) is infinite if and only if \(L(1, E_n) = 0\). The curve \(E_n\) is an elliptic curve of conductor \(32n^2\) (or related), and its \(L\)-function is the \(L\)-function of an associated weight-2 modular form by the modularity theorem.

*(Tunnell, 1983)* Suppose \(n\) is squarefree. Define \[ f(n) = \#\{(x,y,z)\in\mathbb{Z}^3 : 2x^2+y^2+8z^2 = n\} - 2\cdot\#\{(x,y,z) : 2x^2+y^2+32z^2=n\}, \]

and a similar expression \(g(n)\) for even \(n\). Then: if \(n\) is an odd congruent number, then \(f(n) = 0\), and if \(n\) is an even congruent number, then \(g(n) = 0\). Conversely, if BSD holds for \(E_n\), then \(f(n) = 0\) (resp.\ \(g(n) = 0\)) implies \(n\) is congruent.

The proof uses the theory of half-integral weight modular forms and the Shimura correspondence, which relates forms of weight \(k + 1/2\) to forms of weight \(2k\). Tunnell’s criterion reduces the congruent number problem to counting integer representations of quadratic forms — a computationally verifiable condition.

7.4 Monstrous Moonshine

In 1979, John McKay observed a remarkable numerological coincidence: the coefficient of \(q\) in the \(j\)-function is

\[ j(\tau) = q^{-1} + 744 + 196884\,q + \cdots \]

and \(196884 = 196883 + 1\), where \(196883\) is the dimension of the smallest nontrivial complex representation of the Monster group \(\mathbb{M}\), the largest sporadic simple group of order approximately \(8 \times 10^{53}\). Thompson extended this observation: all coefficients of \(j\) are non-negative integer linear combinations of the dimensions of irreducible representations of \(\mathbb{M}\).

Conway and Norton formalized this in their 1979 paper “Monstrous Moonshine”: they conjectured that for each element \(g \in \mathbb{M}\), the McKay-Thompson series

\[ T_g(\tau) = q^{-1} + \sum_{n=0}^\infty \text{tr}(g \mid V_n) q^n, \]

where \(V = \bigoplus V_n\) is a graded Monster-module with \(\dim V_n = c_n\) (the \(q^n\) coefficient of \(j\)), is a Hauptmodul (generator of the function field) for some genus-zero congruence subgroup \(\Gamma_g \leq \operatorname{SL}(2,\mathbb{R})\).

*(Borcherds, 1992)* The Conway-Norton Moonshine conjecture is true. The Monster group acts on the *Monster vertex operator algebra* (Moonshine module) \(V^\natural = \bigoplus_{n=-1}^\infty V^\natural_n\), and for each \(g \in \mathbb{M}\), the corresponding McKay-Thompson series is a Hauptmodul for a genus-zero group.

Borcherds’ proof introduced the theory of vertex operator algebras and generalized Kac-Moody algebras, winning him the 1998 Fields Medal. The proof constructs the Monster Lie algebra (a generalized Kac-Moody algebra) from the Monster vertex algebra and uses the denominator formula for this Lie algebra to establish the Hauptmodul property of the McKay-Thompson series via the theory of modular forms.

The deeper structural reason for Moonshine remains an active area of research, with connections to string theory (the Moonshine module \(V^\natural\) arises naturally as the Hilbert space of a particular conformal field theory), algebraic geometry, and number theory.

7.5 The Taniyama-Shimura-Weil Theorem and Fermat’s Last Theorem

The modularity theorem — conjectured independently by Taniyama (1955), Shimura (1958), and Weil (1967) — is one of the deepest results in twentieth-century mathematics.

*(Wiles 1995; Taylor-Wiles 1995; Breuil-Conrad-Diamond-Taylor 2001)* Every elliptic curve \(E\) over \(\mathbb{Q}\) is *modular*: there exists a normalized newform \(f \in S_2(\Gamma_0(N))\) (where \(N\) is the conductor of \(E\)) such that \[ L(s, E) = L(s, f), \]

i.e., the Hasse-Weil \(L\)-function of \(E\) equals the \(L\)-function of the modular form \(f\).

The theorem means that every elliptic curve over \(\mathbb{Q}\) is a quotient of the Jacobian of some modular curve \(X_0(N)\) — the curve “comes from” a modular form. In terms of Galois representations: the 2-dimensional \(\ell\)-adic representation of \(\operatorname{Gal}(\bar{\mathbb{Q}}/\mathbb{Q})\) on the Tate module of \(E\) is isomorphic to the representation attached to \(f\) by Shimura.

The connection to Fermat’s Last Theorem goes through the Frey curve. Suppose for contradiction that there exist positive integers \(a, b, c, p\) with \(p \geq 5\) prime and \(a^p + b^p = c^p\). Frey (1985) proposed and Serre (1987) formalized: the elliptic curve

\[ E_{a,b,c} : y^2 = x(x - a^p)(x + b^p) \]

would have such exotic properties (semistable of conductor \(N_{abc}\) with minimal discriminant \(\Delta = (abc)^{2p}/16\)) that it could not be modular. Ribet (1990) proved the epsilon conjecture (Serre’s conjecture in a special case): if \(E_{a,b,c}\) were modular of level \(N_{abc}\), one could “level-lower” to obtain a modular form of level 2. But there are no nonzero cusp forms in \(S_2(\Gamma_0(2))\), a contradiction.

Therefore, if the modularity theorem holds for semistable elliptic curves, Fermat’s Last Theorem follows. Wiles proved modularity for all semistable elliptic curves in his 1995 Annals paper (with the crucial “3-5 trick” completed in the Taylor-Wiles companion paper), thereby proving Fermat’s Last Theorem after 350 years.

The proof strategy of Wiles introduces the framework of Galois deformation theory (following Mazur), reduces the modularity theorem to a statement about the equality of certain Hecke rings and deformation rings (\(R = \mathbb{T}\)), and verifies this equality using a numerical criterion of Wiles and Taylor. The full modularity theorem for all elliptic curves over \(\mathbb{Q}\) was completed by Breuil, Conrad, Diamond, and Taylor in 2001, extending Wiles’ methods to handle the non-semistable cases.


Chapter 8: Supplementary Topics

8.1 Half-Integral Weight Forms

The theory extends naturally to forms of half-integral weight \(k/2\) where \(k\) is an odd integer. The automorphy condition requires care because \((c\tau+d)^{k/2}\) is not well-defined without a choice of branch. One uses the metaplectic group \(\widetilde{\operatorname{SL}}(2,\mathbb{Z})\), a double cover of \(\operatorname{SL}(2,\mathbb{Z})\), to define half-integral weight forms properly.

The Shimura correspondence links weight-\((k+1/2)\) forms to weight-\(2k\) forms via a lift that essentially squares the theta series. This was used crucially by Tunnell in his theorem on congruent numbers.

The Dedekind eta function \(\eta(\tau)\), which transforms as a weight-\(1/2\) form with a character, provides the prototypical example. The theta series

\[ \theta(\tau) = \sum_{n=-\infty}^\infty q^{n^2} \]

is a modular form of weight \(1/2\) for \(\Gamma_0(4)\). More generally, theta series associated to positive-definite quadratic forms \(Q\) in \(m\) variables are modular forms of weight \(m/2\).

8.2 Modular Forms and Galois Representations

A central theme of modern number theory is the Langlands program, which predicts deep correspondences between automorphic forms and Galois representations. In the case of modular forms, the relevant construction is:

*(Deligne, 1971)* For every normalized newform \(f = \sum a_n q^n \in S_k(\Gamma_0(N))\) and every prime \(\ell\), there exists a continuous irreducible representation \[ \rho_{f,\ell} : \operatorname{Gal}(\bar{\mathbb{Q}}/\mathbb{Q}) \to \operatorname{GL}_2(\mathbb{Z}_\ell) \]

such that for all primes \(p \nmid N\ell\), \(\rho_{f,\ell}\) is unramified at \(p\), and the characteristic polynomial of \(\operatorname{Frob}_p\) is \(X^2 - a_p X + p^{k-1}\).

The Hecke eigenvalues \(a_p\) thus appear as traces of Frobenius in the associated Galois representation — a profound arithmetic interpretation of the Fourier coefficients. The Ramanujan-Petersson bound \(|a_p| \leq 2p^{(k-1)/2}\) then becomes the assertion that the eigenvalues of Frobenius have absolute value \(p^{(k-1)/2}\), which is Deligne’s theorem on the Weil conjectures.

8.3 The Eichler-Shimura Isomorphism

For weight \(k = 2\), the connection between modular forms and geometry is particularly direct. The space \(S_2(\Gamma)\) of weight-2 cusp forms is isomorphic (as a Hecke module) to the space of holomorphic differentials \(H^0(X(\Gamma), \Omega^1)\) on the compact Riemann surface \(X(\Gamma)\). This is because a cusp form \(f(\tau) \in S_2(\Gamma)\) defines a holomorphic differential \(f(\tau) d\tau\) on \(\mathfrak{H}\), which descends to \(X(\Gamma)\) by the weight-2 automorphy.

The dimension of \(S_2(\Gamma)\) equals the genus of \(X(\Gamma)\), consistent with the Riemann surface theory. This geometric perspective provides a powerful tool: the Jacobian of \(X(\Gamma)\) is an abelian variety whose endomorphism algebra contains the Hecke algebra, and the Eichler-Shimura theorem identifies the \(L\)-function of this abelian variety with the product of \(L\)-functions of the constituent newforms.

8.4 Modular Forms of Level 1: Structural Summary

To synthesize the material, we summarize the picture for the full modular group. The graded ring

\[ M_* = \bigoplus_{k \geq 0} M_k(\operatorname{SL}(2,\mathbb{Z})) = \mathbb{C}[E_4, E_6] \]

has the following key features:

  • Generators: \(E_4\) (weight 4, Eisenstein) and \(E_6\) (weight 6, Eisenstein).
  • First cusp form: \(\Delta = (E_4^3 - E_6^2)/1728\) in weight 12.
  • Decomposition: \(M_k = \mathbb{C} \cdot E_k \oplus S_k\) for \(k \geq 4\) (one-dimensional Eisenstein part plus cusp forms).
  • Basis for \(S_k\): Spanned by products \(E_4^a E_6^b \Delta^c\) with \(4a + 6b + 12c = k\), \(c \geq 1\).
  • Hecke theory: \(M_k\) is semisimple as a module over the Hecke algebra; the Eisenstein series are eigenforms (for the non-cusp part) and \(S_k\) splits into Hecke eigenspaces.

This clean structure at level 1 serves as a model for understanding the more complicated structure at higher levels, where the multiplicity-one theorem ensures a canonical basis of newforms and old forms.


Chapter 9: Further Directions

9.1 Automorphic Forms

Modular forms are the \(\operatorname{GL}_2\) case of the general theory of automorphic forms, which considers functions on quotients \(\Gamma \backslash G\) for a reductive algebraic group \(G\) over \(\mathbb{Q}\) and arithmetic subgroup \(\Gamma\). The Langlands program predicts vast generalizations of the modularity theorem: every motivic \(L\)-function should equal an automorphic \(L\)-function.

Modular forms of higher rank groups — Siegel modular forms (for \(\operatorname{Sp}_{2g}\)), Hilbert modular forms (for \(\operatorname{GL}_2\) over a totally real field), Maass forms (non-holomorphic eigenfunctions of the Laplacian) — all play important roles in current research.

9.2 \(p\)-adic Modular Forms

Serre observed that the \(q\)-expansions of Eisenstein series satisfy congruences modulo powers of primes. For instance, the weight-\(k\) Eisenstein series \(E_k\) satisfies \(E_k \equiv E_{k'} \pmod{p}\) whenever \(k \equiv k' \pmod{p-1}\). This suggests defining \(p\)-adic modular forms as \(p\)-adic limits of sequences of classical modular forms whose weights converge in the \(p\)-adic topology.

Hida theory systematizes this into the theory of \(p\)-adic families of modular forms: for ordinary primes \(p\) (where \(|a_p| = p^{(k-1)/2}\), i.e., \(a_p \not\equiv 0 \pmod{p}\)), there exist analytic families of eigenforms parametrized by the weight space — a \(p\)-adic analytic space. These families are central to modern work on the Iwasawa main conjecture and the Bloch-Kato conjecture.

9.3 Mock Modular Forms

The theory of modular forms has been further extended to encompass mock theta functions, introduced by Ramanujan in his 1920 deathbed letter. These functions have \(q\)-expansions similar to modular forms but transform “almost” like modular forms — they are holomorphic but their transformation formula involves a non-holomorphic correction term called the shadow.

Zwegers (2002) put Ramanujan’s mock theta functions on a rigorous footing as the holomorphic parts of certain harmonic Maass forms — generalizations of modular forms that are annihilated by the weight-\(k\) hyperbolic Laplacian rather than being holomorphic. This development has had profound applications in combinatorics (the circle method, ranks of partitions), string theory (counting BPS states), and the arithmetic of elliptic curves.

Back to top