PMATH 434: Set Theory
Andrew Zucker
Estimated study time: 4 hr 25 min
Table of contents
Sources and References
Primary textbook — Thomas Jech, Set Theory, 3rd ed., Springer, 2003. Supplementary texts — Kenneth Kunen, Set Theory, 2nd ed., College Publications, 2011; Azriel Lévy, Basic Set Theory, Springer, 1979. Online resources — Timothy Chow, A Beginner’s Guide to Forcing, AMS, 2008; Itay Neeman, UCLA Set Theory notes; Asger Törnquist, Copenhagen set theory notes; Yiannis Moschovakis, Notes on Set Theory, Springer.
Prologue: The Story of Set Theory
I. Cantor’s Paradise
In the 1870s, Georg Cantor set out to understand the structure of the real line — specifically, whether different infinite subsets of \(\mathbb{R}\) could have genuinely different sizes. What he discovered was not just a theorem about the reals but a wholly new mathematical universe. By 1874 he had shown that the real numbers are uncountable, meaning no list can exhaust them; by 1891 he had proved his diagonal argument, establishing that the power set of any set strictly exceeds it in size. Cantor defined “sets” naively: a set is any collection of objects we can think of as a whole. He called this the “paradise” of the infinite, and David Hilbert would later declare that no one should be able to expel mathematicians from this paradise. The theory developed rapidly — transfinite ordinals, cardinal arithmetic, the continuum hypothesis — building an intricate edifice on this naive foundation.
II. Russell’s Paradox and the Crisis of Foundations
In 1901, Bertrand Russell discovered a fatal crack in Cantor’s paradise. Consider the collection \(R = \{x : x \notin x\}\) — the set of all sets that do not contain themselves. Does \(R\) contain itself? If \(R \in R\), then by definition \(R \notin R\). If \(R \notin R\), then by definition \(R \in R\). Either way, we have a contradiction. The paradox is not a marginal puzzle: it shows that unrestricted set formation — the principle that any property defines a set — is simply inconsistent. Similar paradoxes followed: Burali-Forti’s paradox showed that the collection of all ordinals cannot be a set, and Cantor himself found the paradox of the set of all sets. Frege, who had just finished a grand logicist program to build arithmetic from pure logic, received Russell’s letter and wrote in response that the foundations of his edifice had been shaken. The paradise needed walls.
III. Zermelo and Fraenkel: Building the Walls
Ernst Zermelo responded in 1908 with the first axiomatic system for set theory. Rather than allowing any collection to be a set, Zermelo restricted set-formation: new sets can only be carved out of existing sets (Separation), or assembled from existing sets by explicit construction rules (Pairing, Union, Power Set, Infinity). The key insight was that Russell’s paradox requires forming a set from nothing — and Zermelo’s axioms prevent this. Abraham Fraenkel and Thoralf Skolem later strengthened Zermelo’s system by adding the Replacement schema, which allows the image of any definable function on a set to itself be a set. The resulting system — Zermelo–Fraenkel set theory with the Axiom of Choice, abbreviated ZFC — became the universally adopted foundation. It was strong enough to formalize all of ordinary mathematics, yet specific enough to be consistent (as far as anyone can tell).
IV. Independence: Gödel, Cohen, and the Undecidable
The axiomatic cure raised a new question: which mathematical statements are actually decided by ZFC? Kurt Gödel showed in 1938 that the Axiom of Choice and the Generalized Continuum Hypothesis are both consistent with ZFC — he constructed the constructible universe \(L\), a minimal inner model in which both hold. This showed ZFC cannot disprove CH. Then in 1963, Paul Cohen invented an entirely new technique — forcing — to show that CH is also unprovable from ZFC. By adding “generic” reals to a model of ZFC, Cohen built extensions where \(2^{\aleph_0}\) takes any prescribed value. Together, Gödel and Cohen showed that CH is independent of ZFC: it can neither be proved nor disproved. This was not an anomaly. Hundreds of natural mathematical statements — Suslin’s hypothesis, the existence of Whitehead groups, the behavior of cardinal arithmetic at singular cardinals — turned out to be similarly independent. Set theory was revealed not as a fixed truth but as a landscape of possible mathematical universes, with ZFC marking the shared terrain and the independence results charting the regions where different choices lead to different worlds. This course explores that landscape.
Chapter 1: Axioms of Set Theory (ZFC)
Set theory serves as a universal foundation for mathematics. Every mathematical object — numbers, functions, topological spaces, groups — can be encoded as a set, and all of mathematics can in principle be derived from a small list of axioms about sets. The standard axiom system is Zermelo–Fraenkel set theory with the Axiom of Choice, abbreviated ZFC. It was assembled in its modern form during the first three decades of the twentieth century, with contributions from Ernst Zermelo (1908), Abraham Fraenkel, Thoralf Skolem, and John von Neumann.
The language of ZFC is first-order logic with a single binary predicate \(\in\) (membership). All variables range over sets. There are no “urelements” (non-set atoms) in the standard formulation. We write \(x \in y\) to mean “x is an element of y.”
The Axioms of ZFC
Extensionality
Two sets are purely extensional objects — they are determined entirely by what they contain, not by how they are described or named. A list and the same list in a different order are the same set; a set defined by one formula and the same set defined by another formula are identical. This is the most fundamental structural principle of set theory, making sets radically different from intensional objects like properties or concepts.
This axiom makes sets purely extensional objects: a set is determined entirely by its members, not by how it is described or constructed.
Empty Set and Pairing
We need a starting point — a set that exists without requiring any prior set to carve it from. The empty set \(\emptyset\) is this ground floor. From it, via Pairing and Union, we can construct any finite set. Without an empty set axiom, we could not even begin: the existential quantifier in all other axioms would be vacuously true in the empty model.
By Extensionality, this set is unique; we denote it \(\emptyset\).
Taking \(a = b\) yields the singleton \(\{a\}\).
Union and Power Set
Union lets us “flatten” a collection of sets: from the family \(\{\{1,2\}, \{2,3\}\}\) we form \(\{1,2,3\}\). This is essential for constructing infinite sets by taking limits along a sequence of approximations. Power Set is the engine of cardinality growth: starting from \(\omega\), a single application gives \(\mathcal{P}(\omega)\), which by Cantor’s theorem is strictly larger. Iterating produces a hierarchy of ever-larger infinities that far outpaces any constructive procedure.
Separation (Comprehension Schema)
Separation is Zermelo’s direct response to Russell’s paradox. Rather than allowing any property to define a set outright, Separation only permits us to filter an existing set. We cannot form \(\{x : \varphi(x)\}\) from nothing — we can only form \(\{x \in X : \varphi(x)\}\) from a set \(X\) we already possess. This single restriction defuses all the classical paradoxes.
This is a schema — one axiom for each formula \(\varphi\). It avoids Russell’s paradox: we cannot form \(\{x : x \notin x\}\) without already having a set \(X\) to draw elements from.
This is a perfectly legitimate set in ZFC. Notice that we never tried to form a set of “all even numbers” out of nothing — we always worked inside the pre-existing set \(\omega\). The Russell paradox would require us to form \(R = \{x : x \notin x\}\) where \(x\) ranges over all sets — but no such universal set exists in ZFC. The moment we restrict to \(\{x \in X : x \notin x\}\) for any fixed set \(X\), we merely get a subset of \(X\), and the argument \(R \in R \leftrightarrow R \notin R\) no longer fires: the resulting set is simply \(X \setminus \{x \in X : x \in x\}\), a perfectly harmless set.
Replacement (Fraenkel’s Axiom Schema)
Fraenkel observed that Zermelo’s original axioms could not form the set \(\{\omega, \mathcal{P}(\omega), \mathcal{P}(\mathcal{P}(\omega)), \ldots\}\) — an infinite “tower” built by iterating the power set operation starting from \(\omega\). The problem is that such a set requires applying a function (\(x \mapsto \mathcal{P}(x)\)) to the set \(\omega\) and collecting all the resulting values. Separation alone cannot do this: it can only filter what is already there. Replacement adds the power to define new sets by applying functions to existing sets, as long as the function is definable by a formula.
Replacement is strictly stronger than Separation and is needed to construct, for example, the set \(\{\omega, \mathcal{P}(\omega), \mathcal{P}(\mathcal{P}(\omega)), \ldots\}\).
Infinity
The smallest inductive set is \(\omega = \{0, 1, 2, \ldots\}\) where \(0 = \emptyset\), \(1 = \{\emptyset\}\), \(2 = \{\emptyset, \{\emptyset\}\}\), etc. — the von Neumann natural numbers.
Regularity (Foundation)
Regularity forbids infinite descending \(\in\)-chains and, in particular, \(x \in x\). It implies the universe of sets is well-founded and stratified into the von Neumann hierarchy \(V = \bigcup_\alpha V_\alpha\).
Axiom of Choice
AC is independent of ZF (Zermelo–Fraenkel without choice), as shown by Cohen. It is equivalent to Zorn’s Lemma, the Well-Ordering Theorem, and Tychonoff’s Theorem for Hausdorff spaces.
Classes and the Cumulative Hierarchy
In ZFC, everything is a set, but it is useful to speak of proper classes — collections too large to be sets (like the class of all sets, \(V\), or the class of all ordinals, \(\mathbf{Ord}\)). These are formal abbreviations for formulas.
- \(V_0 = \emptyset\)
- \(V_{\alpha+1} = \mathcal{P}(V_\alpha)\)
- \(V_\lambda = \bigcup_{\alpha < \lambda} V_\alpha\) for limit ordinals \(\lambda\)
Chapter 2: Ordinal Numbers
Ordinals are the backbone of transfinite mathematics. They generalize the natural numbers to the infinite, providing canonical representatives for well-ordered sets and a framework for transfinite induction and recursion.
Think of ordinals as measuring positions rather than sizes: first, second, third, and then, after all finite positions are exhausted, a new position \(\omega\) that comes after all of them — followed by \(\omega+1\), \(\omega+2\), and eventually \(\omega \cdot 2\), \(\omega^2\), \(\omega^\omega\), and so on. The ordinals form an absolute backbone for all of set theory: every well-ordered set is isomorphic to a unique ordinal, so ordinals are the canonical “names” for the order-types of well-ordered sets.
Well-Orderings and Order Types
Von Neumann Ordinals
The key insight of von Neumann’s definition is that each ordinal is the set of all smaller ordinals: \(\alpha = \{\beta : \beta < \alpha\}\).
- \(0 = \emptyset\)
- \(1 = \{0\} = \{\emptyset\}\)
- \(2 = \{0,1\} = \{\emptyset, \{\emptyset\}\}\)
- \(\omega = \{0, 1, 2, 3, \ldots\}\) — the first infinite ordinal
- \(\omega + 1 = \omega \cup \{\omega\} = \{0, 1, 2, \ldots, \omega\}\)
- \(\omega \cdot 2 = \omega + \omega\), \(\omega^2\), \(\omega^\omega\), \(\epsilon_0 = \omega^{\omega^{\omega^{\cdots}}}\) — the first fixed point of \(\alpha \mapsto \omega^\alpha\)
Successor and Limit Ordinals
The ordinals \(0, \omega, \omega \cdot 2, \omega^2, \ldots\) are limit ordinals (with \(0\) sometimes treated separately). Every natural number \(n \geq 1\) is a successor ordinal.
Transfinite Induction and Recursion
This theorem justifies defining functions by recursion over the ordinals, such as the von Neumann hierarchy, ordinal arithmetic, and the constructible universe.
Ordinal Arithmetic
Ordinal arithmetic is defined by transfinite recursion and differs from cardinal arithmetic in being non-commutative.
- Addition: \(\alpha + 0 = \alpha\); \(\alpha + (\beta+1) = (\alpha + \beta) + 1\); \(\alpha + \lambda = \sup_{\beta < \lambda}(\alpha + \beta)\) for limit \(\lambda\).
- Multiplication: \(\alpha \cdot 0 = 0\); \(\alpha \cdot (\beta+1) = \alpha \cdot \beta + \alpha\); \(\alpha \cdot \lambda = \sup_{\beta < \lambda}(\alpha \cdot \beta)\).
- Exponentiation: \(\alpha^0 = 1\); \(\alpha^{\beta+1} = \alpha^\beta \cdot \alpha\); \(\alpha^\lambda = \sup_{\beta < \lambda} \alpha^\beta\).
By definition, \(1 + \omega = \sup_{n < \omega}(1 + n) = \sup_{n < \omega}(n+1) = \omega\). The sequence \(1+0, 1+1, 1+2, \ldots = 1, 2, 3, \ldots\) has supremum \(\omega\). So \(1 + \omega = \omega\).
On the other hand, \(\omega + 1 = \omega \cup \{\omega\} = \{0, 1, 2, \ldots, \omega\}\), the first ordinal beyond \(\omega\). Its order type is \(\omega + 1\): it has a last element (namely \(\omega\)), while \(\omega\) does not. So \(\omega + 1 \neq \omega = 1 + \omega\).
Intuitively: \(1 + \omega\) means “place a single point before an \(\omega\)-sequence” — but that single point gets absorbed into the sequence, giving an \(\omega\)-sequence again. But \(\omega + 1\) means “place a single point after an \(\omega\)-sequence” — this adds a genuine last element, changing the order type.
By definition, \(\omega \cdot \omega = \sup_{n < \omega} \omega \cdot n\). The sequence is:
\[ \omega \cdot 0 = 0, \quad \omega \cdot 1 = \omega, \quad \omega \cdot 2 = \omega + \omega, \quad \omega \cdot 3 = \omega + \omega + \omega, \quad \ldots \]The supremum is \(\omega^2 = \omega \cdot \omega\), whose elements are all ordinals of the form \(\omega \cdot m + k\) for \(m, k < \omega\). This is the order type of \(\omega \times \omega\) under the lexicographic order. Each “row” \(\{m\} \times \omega\) is an \(\omega\)-sequence, and there are \(\omega\) many rows — giving \(\omega^2\) in total.
The ordinal \(\omega^\omega = \sup_{n < \omega} \omega^n\) is the order type of all finite sequences of natural numbers ordered lexicographically. Above \(\omega^\omega\) comes \(\omega^{\omega^\omega}\), and iterating gives a sequence \(\omega^\omega, \omega^{\omega^\omega}, \ldots\) whose supremum is \(\varepsilon_0 = \omega^{\omega^{\omega^{\cdots}}}\), the first ordinal \(\alpha\) satisfying \(\omega^\alpha = \alpha\). Every countable ordinal can be uniquely written in Cantor Normal Form (described below), and \(\varepsilon_0\) is the “boundary” of that form — all ordinals below it have normal form with base \(\omega\), while \(\varepsilon_0\) itself is a fixed point.
Cantor Normal Form
where \(\beta_1 > \beta_2 > \cdots > \beta_n \geq 0\) are ordinals and \(k_1, \ldots, k_n \geq 1\) are natural numbers.
Bridge remark: Cantor Normal Form will reappear in the proof of termination for Goodstein sequences and in the study of ordinal notations used in proof theory. In PCF theory (Chapter 21), a similar decomposition governs the generators of \(\mathrm{pcf}(A)\).
Chapter 3: Cardinal Numbers
While ordinals measure the type of a well-ordering, cardinals measure size. Two sets have the same cardinality if there is a bijection between them, regardless of any ordering.
Cardinals are the answer to the question “how many elements does this set have?” For finite sets, the answer is a natural number. For infinite sets, the answer is one of the alephs \(\aleph_0, \aleph_1, \aleph_2, \ldots\) — a new hierarchy of infinite quantities that Cantor first discovered and that set theory now systematically studies.
Cardinality and the Schröder–Bernstein Theorem
We construct explicit injections in both directions and apply Schröder–Bernstein.
Injection \(f: (0,1) \hookrightarrow [0,1]\): Simply take \(f(x) = x\). This is an injection from the open interval into the closed interval.
Injection \(g: [0,1] \hookrightarrow (0,1)\): We need to send \([0,1]\) injectively into \((0,1)\). Define \(g(x) = \frac{1}{4} + \frac{x}{2}\). Then \(g\) maps \([0,1]\) bijectively onto \([\frac{1}{4}, \frac{3}{4}] \subset (0,1)\). In particular \(g\) is an injection from \([0,1]\) into \((0,1)\).
By Schröder–Bernstein, \(|(0,1)| = |[0,1]|\). Concretely, the bijection \(h\) constructed in the proof sends the “problematic” points \(\{0, 1, g(0), g(1), g(g(0)), g(g(1)), \ldots\} = \{0, 1, 1/4, 3/4, 3/8, 5/8, \ldots\}\) via \(f\) (i.e., \(h(x) = x\) on this countable set) and all other points via \(g^{-1}\) (i.e., \(h(x) = 2x - 1/2\) off this set). The result is a genuine bijection \([0,1] \to (0,1)\).
Infinite Cardinals and Alephs
The infinite cardinals are the alephs: \(\aleph_0 = \omega\), \(\aleph_1 = \omega_1\), \(\aleph_2 = \omega_2\), and so on. We also write \(\omega_\alpha\) for the \(\alpha\)-th infinite cardinal.
- \(\aleph_{\alpha+1} = \omega_{\alpha+1}\) is the least cardinal greater than \(\aleph_\alpha\).
- \(\aleph_\lambda = \bigcup_{\alpha < \lambda} \aleph_\alpha\) for limit ordinals \(\lambda\).
Cantor’s Theorem
Cardinal Arithmetic
- \(\kappa + \lambda = |\kappa \sqcup \lambda|\) (disjoint union)
- \(\kappa \cdot \lambda = |\kappa \times \lambda|\) (Cartesian product)
- \(\kappa^\lambda = |\kappa^\lambda|\) (set of all functions \(\lambda \to \kappa\))
We claim \(|\mathbb{R}| = |\mathcal{P}(\omega)| = 2^{\aleph_0}\).
Step 1: \(|\mathbb{R}| = |(0,1)|\). The map \(x \mapsto \frac{1}{\pi}\arctan(x) + \frac{1}{2}\) is a bijection \(\mathbb{R} \to (0,1)\).
Step 2: \(|(0,1)| = |\mathcal{P}(\omega)|\). Every real in \((0,1)\) has a binary expansion \(0.b_0 b_1 b_2 \ldots\) with \(b_i \in \{0,1\}\). Ignoring the ambiguity at dyadic rationals (which form a countable set), this gives a bijection between \((0,1)\) and \(2^\omega = \{f : \omega \to 2\}\). And \(|2^\omega| = |\mathcal{P}(\omega)|\) via \(f \leftrightarrow \{n : f(n) = 1\}\).
Step 3: \(|\mathbb{R}^2| = |\mathbb{R}|\). We need an injection \(\mathbb{R}^2 \hookrightarrow \mathbb{R}\). Given two reals \(x = 0.a_0 a_1 a_2 \ldots\) and \(y = 0.b_0 b_1 b_2 \ldots\) in \((0,1)\), interleave their binary expansions to get \(0.a_0 b_0 a_1 b_1 a_2 b_2 \ldots \in (0,1)\). This is an injection \((0,1)^2 \hookrightarrow (0,1)\), and the reverse injection is trivial (embed via first coordinate). By Schröder–Bernstein, \(|(0,1)^2| = |(0,1)|\), so \(|\mathbb{R}^2| = |\mathbb{R}|\). By induction, \(|\mathbb{R}^n| = |\mathbb{R}|\) for all finite \(n\), and even \(|\mathbb{R}^\omega| = |\mathbb{R}|\) since \((2^{\aleph_0})^{\aleph_0} = 2^{\aleph_0 \cdot \aleph_0} = 2^{\aleph_0}\).
Cofinality
- \(\mathrm{cf}(\omega) = \omega\) — \(\omega\) is regular.
- \(\mathrm{cf}(\omega_1) = \omega_1\) — \(\omega_1\) is regular.
- \(\mathrm{cf}(\aleph_\omega) = \omega\) — \(\aleph_\omega\) is singular (it is the supremum of the sequence \(\aleph_0, \aleph_1, \aleph_2, \ldots\)).
In particular, \(\kappa < \kappa^{\mathrm{cf}(\kappa)}\) for any infinite cardinal \(\kappa\).
König’s Theorem immediately implies \(2^{\aleph_0} \neq \aleph_\omega\), since \(\mathrm{cf}(\aleph_\omega) = \omega\) and \(2^{\aleph_0} = (2^{\aleph_0})^{\aleph_0} \geq \aleph_\omega^{\aleph_0} > \aleph_\omega\).
Chapter 4: Real Numbers
Set theory provides a rigorous construction of the real number system from the natural numbers. The key cardinality result is that \(|\mathbb{R}| = |\mathcal{P}(\omega)| = 2^{\aleph_0}\), the cardinality of the continuum.
The Continuum
The Continuum Hypothesis (CH) asserts that \(2^{\aleph_0} = \aleph_1\), i.e., there is no set of reals with cardinality strictly between \(\aleph_0\) and \(\mathfrak{c}\). The Generalized Continuum Hypothesis (GCH) asserts \(2^{\aleph_\alpha} = \aleph_{\alpha+1}\) for all \(\alpha\).
Cardinality of the Cantor Set and Descriptive Set Theory Preview
The Cantor set \(C \subseteq [0,1]\) has \(|C| = 2^{\aleph_0} = \mathfrak{c}\). Every closed subset of \(\mathbb{R}\) is either countable or of size \(\mathfrak{c}\) — this is the Cantor–Bendixson theorem. Chapter 11 will develop the descriptive set-theoretic hierarchy that extends this analysis.
Chapter 5: The Axiom of Choice and Cardinal Arithmetic
Equivalents of the Axiom of Choice
- The Axiom of Choice.
- Zorn's Lemma: Every partially ordered set in which every chain has an upper bound has a maximal element.
- Well-Ordering Theorem: Every set can be well-ordered.
- Tychonoff's Theorem: Any product of compact Hausdorff spaces is compact.
- Every vector space has a basis (Hamel basis).
- Every surjection has a right inverse.
Cardinal Exponentiation and the GCH
Under GCH, all infinite cardinal arithmetic reduces to simple rules:
Without GCH, cardinal exponentiation is much harder to determine and is the subject of Shelah’s pcf theory.
Singular Cardinal Hypothesis
SCH follows from GCH and holds in many canonical inner models. Its failure (which requires large cardinal hypotheses) was demonstrated by Silver, Magidor, and others.
Chapter 6: The Axiom of Regularity
The Axiom of Regularity (Foundation) asserts that every non-empty set has an \(\in\)-minimal element. This has deep structural consequences.
Well-Foundedness
- The Axiom of Regularity.
- There is no infinite descending \(\in\)-chain \(x_0 \ni x_1 \ni x_2 \ni \cdots\)
- The cumulative hierarchy \(V = \bigcup_\alpha V_\alpha\) covers all sets.
Consequences
In ordinary mathematics, Regularity is often used implicitly: for instance, to show that \(\in\)-induction is valid on the universe.
Chapter 7: Filters, Ultrafilters, and Boolean Algebras
Filters and ultrafilters are central to model theory, topology, and large cardinal theory. They provide the mechanism for the ultrapower construction of measurable cardinals and Cohen’s forcing via Boolean-valued models.
Filters
- \(I \in \mathcal{F}\) and \(\emptyset \notin \mathcal{F}\),
- If \(A \in \mathcal{F}\) and \(A \subseteq B\), then \(B \in \mathcal{F}\) (upward closure),
- If \(A, B \in \mathcal{F}\), then \(A \cap B \in \mathcal{F}\) (finite intersection property).
Ultrafilters and the Ultrapower
This theorem is fundamental: the ultrapower satisfies the same first-order sentences as \(M\), and is used to construct elementary extensions and measurable cardinals.
Boolean Algebras
Complete Boolean algebras provide the algebraic foundation for forcing: a generic filter over a complete Boolean algebra \(\mathbb{B}\) gives rise to a generic extension \(V^\mathbb{B}\).
Chapter 8: Stationary Sets
Stationary sets form one of the most powerful tools in combinatorial set theory. They arise naturally in the analysis of uncountable cardinals and feature prominently in results about cardinal arithmetic, partition relations, and inner models.
Club Sets
- Unbounded: \(\sup C = \kappa\) (i.e., for every \(\alpha < \kappa\) there exists \(\gamma \in C\) with \(\gamma > \alpha\)),
- Closed: For every limit ordinal \(\alpha < \kappa\), if \(C \cap \alpha\) is unbounded in \(\alpha\), then \(\alpha \in C\).
- The set of all limit ordinals below \(\omega_1\) is a club in \(\omega_1\).
- For any increasing function \(f: \omega_1 \to \omega_1\), the set of fixed points \(\{\alpha < \omega_1 : f(\alpha) = \alpha\}\) is a club.
- The intersection of two clubs is again a club.
The Club Filter
Stationary Sets
- Every club is stationary.
- The set \(\{\alpha < \omega_1 : \alpha \text{ is a successor ordinal}\}\) is stationary in \(\omega_1\).
- The set \(\{\alpha < \omega_1 : \mathrm{cf}(\alpha) = \omega\}\) is a club (it is all limit ordinals \(< \omega_1\), since all limit ordinals \(< \omega_1\) have cofinality \(\omega\)).
- For a regular cardinal \(\kappa > \omega_1\), the set \(E^\kappa_\omega = \{\alpha < \kappa : \mathrm{cf}(\alpha) = \omega\}\) is stationary in \(\kappa\) but contains no club.
The Pressing-Down (Fodor) Lemma
More carefully, we handle the \(\kappa = \omega_1\) case by induction: assume all fibers are non-stationary. Then for each \(\gamma < \kappa\), pick a club \(C_\gamma\) disjoint from \(f^{-1}(\gamma)\). By transfinite induction, build a “diagonal intersection”: the set \(\Delta_{\gamma < \kappa} C_\gamma = \{\alpha < \kappa : \alpha \in C_\gamma \text{ for all } \gamma < \alpha\}\) is a club. Any \(\alpha\) in \(S \cap \Delta C_\gamma\) satisfies \(\alpha \in C_{f(\alpha)}\) (since \(f(\alpha) < \alpha\)), but \(C_{f(\alpha)} \cap f^{-1}(f(\alpha)) = \emptyset\), meaning \(\alpha \notin f^{-1}(f(\alpha))\), i.e., \(f(\alpha) \neq f(\alpha)\), contradiction.
- It implies the Silver Dichotomy: any stationary subset of \(\omega_1\) either contains a club or is partitioned into \(\omega_1\) disjoint stationary sets.
- It is used to prove that the club filter on \(\omega_1\) is normal.
- It underlies the proof of the Solovay Splitting Theorem: any stationary subset of \(\kappa\) can be split into \(\kappa\) disjoint stationary subsets.
Normal Filters and Normality
Chapter 9: Combinatorial Set Theory
Combinatorial set theory studies partition properties, trees, and combinatorial principles that hold (or consistently fail) at various uncountable cardinals.
Ramsey Theory: Partition Calculus
Trees
The existence of a Suslin tree is independent of ZFC: it follows from the Diamond Principle \(\Diamond\) (which holds in Gödel’s \(L\)) but is refuted by Martin’s Axiom plus \(\neg\)CH.
The Diamond Principle
Martin’s Axiom
\(\mathsf{MA} + \neg\mathsf{CH}\) is consistent with ZFC (proved by Solovay and Tennenbaum). Under \(\mathsf{MA} + \neg\mathsf{CH}\), all Aronszajn trees are special (contain no Suslin subtree), and \(2^{\aleph_0}\) can be any regular cardinal.
Chapter 10: Measurable Cardinals
Large cardinal axioms assert the existence of cardinals with strong combinatorial or model-theoretic properties. They form a natural hierarchy extending beyond ZFC and serve as consistency strength benchmarks.
Measurable Cardinals via Ultrafilters
The Ultrapower Construction and Scott’s Theorem
Given a measurable cardinal \(\kappa\) with a \(\kappa\)-complete ultrafilter \(\mathcal{U}\), we form the ultrapower \(\text{Ult}(V, \mathcal{U})\):
The canonical embedding \(j: V \to \text{Ult}(V,\mathcal{U})\) defined by \(j(x) = [c_x]_\mathcal{U}\) (constant function) is an elementary embedding. This embedding is non-trivial: \(j(\kappa) > \kappa\). The least ordinal moved by \(j\) is the critical point of \(j\), which equals \(\kappa\).
0-Sharp (\(0^\#\))
Scott’s theorem initiates a fundamental dichotomy: either \(V = L\) (no large cardinals of measurable strength) or \(V \neq L\) (large cardinals exist). The canonical witness for \(V \neq L\) at the measurable cardinal level is \(0^\#\) (zero-sharp):
Chapter 11: Borel and Analytic Sets
Descriptive set theory studies the definability and structural properties of subsets of Polish spaces (complete separable metric spaces), chief among them \(\mathbb{R}\) and \(\omega^\omega\) (Baire space).
The Borel Hierarchy
- \(\mathbf{\Sigma}^0_1\) = open sets
- \(\mathbf{\Pi}^0_1\) = closed sets
- \(\mathbf{\Sigma}^0_{\alpha+1}\) = countable unions of \(\mathbf{\Pi}^0_\alpha\) sets
- \(\mathbf{\Pi}^0_{\alpha+1}\) = complements of \(\mathbf{\Sigma}^0_{\alpha+1}\) sets
- \(\mathbf{\Delta}^0_\alpha = \mathbf{\Sigma}^0_\alpha \cap \mathbf{\Pi}^0_\alpha\)
Analytic and Coanalytic Sets
The Perfect Set Property and the Cantor–Bendixson Theorem
- \(\mathbf{\Sigma}^1_2\) (projections of coanalytic) = PCA sets
- \(\mathbf{\Pi}^1_2\) = CPCA sets
Chapter 12: Models of Set Theory
To understand the independence of CH and other statements, we must understand what it means for a set or class to be a model of ZFC, and how to build new models from existing ones.
Structures and Satisfaction
Relativization and Absoluteness
Elementary Substructures and the Löwenheim–Skolem Theorem
Reflection Principle
The Reflection Principle is a fundamental metatheorem of ZFC, showing that any finite list of axioms is “reflected” down to a set-sized structure within \(V\).
- ZFC cannot prove its own consistency (Gödel's incompleteness), but ZFC does prove each finite fragment of ZFC is consistent (by reflecting to a \(V_\beta\)).
- It implies that \(\mathbf{Ord}\) is "indescribable" from below: any property true of \(V\) is true of some \(V_\alpha\).
- It provides an internal justification for inaccessible cardinals: the reflection of "all of ZFC" to a \(V_\kappa\) gives a \(\kappa\) that looks like a strong inaccessible.
Chapter 13: Constructible Sets (L)
Gödel’s constructible universe \(L\) is the smallest inner model of ZFC. Working in \(L\) allows proofs of the consistency of GCH and AC relative to ZF.
Definition of the Constructible Universe
The constructible hierarchy \(L\) is defined by:
- \(L_0 = \emptyset\)
- \(L_{\alpha+1} = \mathrm{Def}(L_\alpha)\)
- \(L_\lambda = \bigcup_{\alpha < \lambda} L_\alpha\) for limit ordinals \(\lambda\)
Properties of L
The Condensation Lemma is the key to proving GCH in \(L\):
Ordinal Definable Sets (OD) and HOD
Beyond \(L\), there are other important inner models defined by definability conditions.
Jensen’s Fine Structure Theory
Jensen developed a systematic “fine structure” analysis of \(L\), introducing the Jense hierarchy \(J_\alpha\) (a variant of \(L_\alpha\)) and the \(\Sigma_n\) projecta and master codes. This machinery yields the combinatorial principles:
- \(\Diamond(\kappa)\): There exists a \(\Diamond\)-sequence at \(\kappa\) (holds for all regular uncountable \(\kappa\) in \(L\)).
- \(\square_\kappa\): There exists a coherent sequence of clubs (a global square). Also holds in \(L\).
These principles have major consequences: \(\Diamond \Rightarrow\) Suslin tree exists; \(\square_\kappa\) has implications for cardinal arithmetic and the structure of singular cardinals.
Chapter 14: Forcing and the Independence of CH
Forcing, invented by Paul Cohen in 1963, is the technique for proving that mathematical statements are unprovable from ZFC. It works by constructing new models of set theory — “generic extensions” — by adjoining new sets to an existing model.
Intuitive Introduction to Forcing
Cohen’s method of forcing is the most powerful technique in modern set theory, and it deserves an extended intuitive preparation before the technical machinery. The basic question is: how do we build a model of ZFC in which \(2^{\aleph_0} = \aleph_2\), given that we are living inside some existing model \(M\) where perhaps \(2^{\aleph_0} = \aleph_1\)?
We cannot simply “add” new real numbers to \(M\) from inside \(M\) — if the new reals existed in \(M\), they would already be there. Instead, Cohen’s idea is to reason about a hypothetical extension \(M[G]\) from within \(M\), by specifying what properties the new object \(G\) should have, without actually constructing \(G\).
Think of forcing conditions as finite approximations to the new object we want to add. A forcing poset \(\mathbb{P}\) consists of these approximations, ordered so that stronger conditions (lower in the order) provide more information. A generic filter \(G\) is a coherent collection of conditions — one from each “dense” set of conditions in \(\mathbb{P}\) — whose union describes the new object completely.
The key insight is: \(G\) need not exist in \(M\). If \(M\) is countable, then \(\mathbb{P}\) has countably many dense sets (as elements of \(M\)), and by a combinatorial argument (Rasiowa–Sikorski), a filter meeting all of them exists — but lives outside \(M\). We adjoin \(G\) to \(M\) to form \(M[G]\), and the forcing relation \(p \Vdash \varphi\) (defined inside \(M\)) tells us exactly which sentences will be true in \(M[G]\): \(\varphi\) holds in \(M[G]\) if and only if some condition \(p\) in \(G\) forces \(\varphi\).
consisting of all finite binary strings \(s: n \to 2\) (for \(n \in \omega\)), ordered by reverse extension: \(s \leq t\) (i.e., \(s\) is stronger) if and only if \(s \supseteq t\) as functions. Intuitively, longer strings provide more information.
A generic filter \(G\) picks one finite string from each dense set. The dense sets ensure that:
- For each \(n \in \omega\), some string in \(G\) has length \(> n\) (so \(\bigcup G\) is a total function \(\omega \to 2\)).
- For each condition \(s \in \mathbb{P}\) and each \(k \in \{0,1\}\), there is a condition in \(G\) extending \(s^\frown k\) (so \(\bigcup G\) genuinely branches).
The new real is \(g = \bigcup G \in 2^\omega\). This \(g\) is not in the ground model \(M\): for each real \(r \in M \cap 2^\omega\), the set \(D_r = \{s \in \mathbb{P} : s \not\subseteq r\}\) is dense (since any finite approximation can be extended to disagree with \(r\) somewhere), so \(G\) meets \(D_r\), meaning \(g \neq r\). The generic real \(g\) is thus genuinely new — not just a re-description of something already in \(M\).
This forcing does not collapse cardinals (it is ccc: any antichain in \(2^{<\omega}\) is countable, since two compatible strings can always be extended to a common string). The extension \(M[G]\) satisfies all the same cardinal arithmetic as \(M\), but now contains a new real \(g \notin M\).
The Basic Setup
Let \(M\) be a countable transitive model of ZFC (a ground model). A partial order (or forcing notion) is a set \(\mathbb{P} = (P, \leq)\) with a maximum element \(\mathbf{1}\). Elements of \(\mathbb{P}\) are conditions; we think of stronger conditions as providing more information.
Since \(M\) is countable, \(M\) contains only countably many dense sets, so a generic filter \(G \notin M\) always exists (the generic filter is chosen “outside” \(M\)).
Names and the Generic Extension
- \(M \subseteq M[G]\) and \(G \in M[G]\).
- \(M[G]\) has the same ordinals as \(M\).
- \(M[G]\) is the smallest transitive model of ZFC containing \(M\) and \(G\).
The Forcing Relation
- \(p \Vdash \tau \in \sigma\) iff for some \((\rho, q) \in \sigma\) with \(p \leq q\), \(p \Vdash \tau = \rho\).
- \(p \Vdash \varphi \land \psi\) iff \(p \Vdash \varphi\) and \(p \Vdash \psi\).
- \(p \Vdash \lnot\varphi\) iff no extension \(q \leq p\) forces \(\varphi\).
- \(p \Vdash \exists x \, \varphi(x)\) iff for some name \(\tau\) and some \(q \leq p\), \(q \Vdash \varphi(\tau)\).
Moreover, the forcing relation \(\Vdash\) is definable in \(M\).
Cohen Forcing and the Independence of CH
Cohen’s original forcing adds \(\aleph_2\) (in \(M\)) many new subsets of \(\omega\), making \(2^{\aleph_0} \geq \aleph_2\) in the extension.
\(\mathbb{P}\) is ccc: Any antichain in \(\mathbb{P}\) is countable. (Two conditions sharing the same finite domain part are compatible.)
Cardinals are preserved: Since \(\mathbb{P}\) is ccc, all cardinals and cofinalities of \(M\) are preserved in \(M[G]\). In particular, \(\aleph_1^{M[G]} = \aleph_1^M\) and \(\aleph_2^{M[G]} = \aleph_2^M\).
\(2^{\aleph_0} \geq \aleph_2\) in \(M[G]\): For each \(\alpha < \aleph_2^M\), the generic adds a new real \(r_\alpha = \{n \in \omega : G(\alpha, n) = 1\}\). These are distinct reals (they differ on the dense set of conditions forcing distinctness). So \(|{\{r_\alpha : \alpha < \aleph_2^M\}}| = \aleph_2\) in \(M[G]\).
Therefore \(M[G] \models \lnot\mathsf{CH}\).
Preserving Cardinals: ccc Forcing
Boolean-Valued Models
An alternative approach to forcing uses Boolean-valued models, which avoids the need for a countable ground model.
Other Independence Results
Forcing is used to establish the independence of many statements:
- Suslin’s Hypothesis (SH): Consistent with ZFC (MA + ¬CH implies SH) and its negation is consistent (follows from \(\Diamond\), which holds in \(L\)).
- Souslin’s Problem: The question whether every dense, complete linear order without endpoints satisfying ccc is isomorphic to \(\mathbb{R}\). Independent of ZFC.
- Whitehead Problem: Is every Whitehead group free? Shelah showed it is independent of ZFC.
- The proper forcing axiom (PFA): A strong strengthening of MA, consistent relative to a supercompact cardinal.
Chapter 17: Large Cardinals
Large cardinal axioms assert the existence of cardinals with properties so strong that their existence cannot be proved (or even shown consistent) in ZFC alone. They form a natural well-ordered hierarchy of consistency strength, calibrating the “strength” of mathematical theories.
Large cardinal axioms extend the ZFC axioms upward — they assert the existence of very large sets whose existence cannot be proved in ZFC but whose consistency we nonetheless believe (based on the coherence of the resulting theory and its fruitful consequences). Each large cardinal axiom is a bet on the richness of the set-theoretic universe, and the remarkable empirical fact is that all such axioms studied so far are linearly ordered by consistency strength: for any two large cardinal axioms \(\Phi\) and \(\Psi\), either ZFC + \(\Phi\) proves the consistency of ZFC + \(\Psi\), or vice versa.
Inaccessible Cardinals
- \(\kappa\) is regular: \(\mathrm{cf}(\kappa) = \kappa\),
- \(\kappa\) is a strong limit: for all \(\lambda < \kappa\), \(2^\lambda < \kappa\).
Inaccessible cardinals are significant because they are precisely the cardinals \(\kappa\) such that \(V_\kappa\) is a model of ZFC. In this sense, they are “ZFC-absolute” — they look like the entire set-theoretic universe from within their own level.
Mahlo Cardinals
Weakly Compact Cardinals
- \(\kappa\) is weakly compact.
- \(\kappa\) has the tree property: every \(\kappa\)-tree has a branch of length \(\kappa\).
- \(\kappa\) is \(\Pi^1_1\)-indescribable: for any \(R \subseteq V_\kappa\) and any \(\Pi^1_1\)-sentence \(\varphi\), if \((V_\kappa, \in, R) \models \varphi\), then there exists \(\alpha < \kappa\) with \((V_\alpha, \in, R \cap V_\alpha) \models \varphi\).
- There exists a \(\kappa\)-complete ultrafilter on each set of cardinality \(\kappa\) (the "extension property").
Measurable Cardinals (Revisited)
Ramsey and Erdős Cardinals
Supercompact Cardinals
Woodin Cardinals
Each “\(\prec\)” means “strictly weaker in consistency strength.” Woodin cardinals sit between measurable and supercompact in a deep sense: they are strong enough to imply projective determinacy (which requires infinitely many Woodin cardinals) but not strong enough to imply the existence of a supercompact. Above supercompact in the hierarchy come extendible, huge, and rank-into-rank cardinals — with Kunen’s Inconsistency placing an absolute upper bound by showing there is no elementary embedding \(j: V \to V\) in ZFC.
The Large Cardinal Hierarchy
The following schematic shows the hierarchy of large cardinals in increasing order of consistency strength:
| Level | Large Cardinal |
|---|---|
| 1 | Inaccessible (\(V_\kappa \models \mathsf{ZFC}\)) |
| 2 | Mahlo (stationary many inaccessibles below) |
| 3 | Weakly Compact (\(\kappa \to (\kappa)^2_2\)) |
| 4 | Measurable (non-trivial \(\kappa\)-complete ultrafilter) |
| 5 | Strong (elementary embedding with critical point \(\kappa\)) |
| 6 | Woodin (for every function, many witnesses below \(\delta\)) |
| 7 | Supercompact (\(M^\lambda \subseteq M\)) |
| 8 | Extendible (\(V_{\kappa+\eta} \prec V_{j(\kappa)+\eta}\)) |
| 9 | Huge (\(M^{j(\kappa)} \subseteq M\)) |
| 10 | Rank-into-rank (\(j: V_\lambda \to V_\lambda\)) |
Each level is strictly stronger in consistency strength than all previous levels. At the top, Kunen showed (in ZFC) that there is no elementary embedding \(j: V \to V\) (Kunen’s Inconsistency), placing an absolute upper bound on the large cardinal hierarchy.
Determinacy and Inner Model Theory
The inner model program seeks to construct, for each large cardinal notion, a canonical model \(L[\vec{E}]\) (built from an extender sequence \(\vec{E}\)) that contains and in some sense “explains” that large cardinal. The program has been carried out through Woodin cardinals and is one of the central programs in contemporary set theory.
Summary and Interconnections
The major themes of graduate set theory are deeply intertwined:
Foundations and Axiomatics (Chapters 1, 6): ZFC provides a rigorous but rich foundation. Regularity stratifies the universe into the cumulative hierarchy.
Ordinals and Cardinals (Chapters 2, 3): The backbone of transfinite mathematics. Ordinals measure order types; cardinals measure sizes. Cofinality and König’s theorem govern cardinal arithmetic.
Combinatorics (Chapters 8, 9): Stationary sets, the club filter, Fodor’s Lemma, and the Diamond and Square principles are the central tools of infinitary combinatorics. They interact with forcing (which can add or destroy stationary sets) and large cardinals (which impose reflection principles that limit combinatorial pathology).
Descriptive Set Theory (Chapters 4, 11): The structure of subsets of the reals is governed by the Borel hierarchy and the projective hierarchy. Definability interacts with large cardinals: under projective determinacy (a consequence of Woodin cardinals), all projective sets enjoy regularity properties.
Inner Models (Chapter 13): \(L\) is the canonical inner model. HOD is the definability-based inner model. The inner model program extends these to accommodate large cardinals.
Forcing (Chapter 14): Forcing constructs outer models, proving independence results. Cohen’s forcing adds new reals, making \(\neg\)CH consistent. The interaction of forcing with stationary sets (preserving or collapsing them) and large cardinals is a central technical concern.
Large Cardinals (Chapters 10, 17): They calibrate consistency strength and impose structure on \(V\). They imply \(V \neq L\) and yield regularity properties for definable sets of reals. The large cardinal hierarchy is linearly ordered by consistency strength, a deep empirical fact about mathematics.
The independence results of Gödel (GCH consistent) and Cohen (CH independent), together with the large cardinal program, show that ZFC is both a powerful and genuinely incomplete foundation for mathematics — a theme that continues to drive research in set theory.
Appendix: Key Formulas and Results Reference
| Result | Statement |
|---|---|
| Cantor’s Theorem | \(\vert X \vert < \vert \mathcal{P}(X) \vert\) for any set \(X\) |
| Schröder–Bernstein | \(\vert A \vert \leq \vert B \vert\) and \(\vert B \vert \leq \vert A \vert\) implies \(\vert A \vert = \vert B \vert\) |
| König’s Theorem | \(\sum_i \kappa_i < \prod_i \lambda_i\) when \(\kappa_i < \lambda_i\) |
| Fodor’s Lemma | Regressive function on stationary set has stationary fiber |
| Łoś’s Theorem | Ultrapower satisfies same first-order sentences |
| Condensation Lemma | Elementary substructures of \(L_\alpha\) collapse to some \(L_\beta\) |
| Reflection Principle | Every formula true in \(V\) reflects to some \(V_\alpha\) |
| Forcing Theorem | \(M[G] \models \varphi\) iff some \(p \in G\) forces \(\varphi\) |
| Scott’s Theorem | Measurable cardinal implies \(V \neq L\) |
| GCH in L | \(L \models 2^{\aleph_\alpha} = \aleph_{\alpha+1}\) |
Chapter 18: Forcing Axioms
Forcing axioms arise from a simple but powerful idea: rather than adding a single generic object to the universe, we want a principle that guarantees the existence of generic filters for many forcings simultaneously. They are thus “axioms of richness,” asserting that the universe is in some sense saturated. The study of forcing axioms reveals deep connections between combinatorics, topology, and large cardinal strength.
Section 18.1: Martin’s Axiom
Motivation. Recall the Baire Category Theorem: if \(X\) is a compact Hausdorff space and \(\{D_n : n \in \omega\}\) is a countable family of dense open sets, then \(\bigcap_n D_n \neq \emptyset\). Martin’s Axiom (MA) generalizes this from countably many dense sets to fewer than \(2^{\aleph_0}\) many, while restricting to a class of forcing notions called ccc (the countable chain condition).
Note that MA at \(\aleph_0\) (i.e., when \(|\mathcal{D}| \leq \aleph_0\)) is provable in ZFC — this is the content of the Rasiowa–Sikorski lemma. The axiom becomes interesting and independent when we demand this for families of size \(\aleph_1, \aleph_2, \ldots\) up to \(2^{\aleph_0}\). We write \(\text{MA}_\kappa\) for the statement restricted to \(|\mathcal{D}| \leq \kappa\); then MA is \(\text{MA}_{<2^{\aleph_0}}\).
- Under MA + ¬CH, every set of reals of cardinality less than \(2^{\aleph_0}\) has Lebesgue measure zero and is meager.
- MA implies Suslin's Hypothesis: there is no Suslin line.
- MA implies that any product of ccc topological spaces is ccc (a generalization of Knaster's lemma).
- MA + ¬CH is consistent relative to ZFC.
Section 18.2: The Proper Forcing Axiom
MA can be seen as a forcing axiom for ccc forcings. The natural question is: can we extend it to larger classes? The answer requires restricting to forcings that preserve stationary sets in a controlled way.
Every ccc forcing is proper, and so is every countably closed forcing. Proper forcings do not collapse \(\aleph_1\), making them the natural setting for extending MA.
PFA is consistent relative to the existence of a supercompact cardinal, via an iteration due to Baumgartner.
Section 18.3: Martin’s Maximum
Martin’s Maximum is the strongest possible forcing axiom in a precise sense.
Every proper forcing is SSP, so MM implies PFA. Foreman, Magidor, and Shelah proved that MM is consistent relative to a supercompact cardinal (1988), and Woodin showed that a supercompact cardinal directly implies MM after a suitable forcing.
MM is “maximal” in the sense that no forcing axiom for a strictly larger class of forcings is consistent: if \(\mathbb{P}\) does not preserve all stationary subsets of \(\omega_1\), then the forcing axiom for \(\{\mathbb{P}\}\) and \(\aleph_1\) many dense sets is already inconsistent.
Section 18.4: Comparison Table
| Axiom | Class of forcings | Implies \(2^{\aleph_0} = \aleph_2\)? | Consistency strength |
|---|---|---|---|
| MA | ccc | No (any regular \(> \aleph_1\) possible) | ZFC |
| PFA | Proper | Yes | Supercompact cardinal |
| MM | Stationary-set-preserving | Yes | Supercompact cardinal |
Chapter 19: Determinacy and Descriptive Set Theory
Descriptive set theory studies definable sets of reals — those arising through explicit operations starting from open sets. The projective hierarchy organizes these sets by complexity, and the theory of games provides an unexpected tool for their analysis.
Section 19.1: The Projective Hierarchy
- A set \(A \subseteq \mathbb{R}\) is analytic (\(\Sigma^1_1\)) if it is a continuous image of a Borel set (equivalently, a Borel image of \(\mathbb{N}^\mathbb{N}\)).
- A set is coanalytic (\(\Pi^1_1\)) if its complement is analytic.
- For \(n \geq 1\): \(\Sigma^1_{n+1}\) is the class of continuous images of \(\Pi^1_n\) sets; \(\Pi^1_{n+1}\) is the class of complements of \(\Sigma^1_{n+1}\) sets; \(\Delta^1_n = \Sigma^1_n \cap \Pi^1_n\).
This elegant characterization shows that Borel sets are precisely those that are both analytic and coanalytic. Every analytic set is Lebesgue measurable, has the Baire property, and has the perfect set property (Suslin, Luzin). The situation for \(\Sigma^1_2\) and beyond is undecidable in ZFC alone.
Section 19.2: Games and Determinacy
Borel Determinacy is provable in ZFC. The axiom AD, however, contradicts the Axiom of Choice.
Section 19.3: Projective Determinacy
While AD is inconsistent with AC (since AC implies the existence of a non-determined game), PD is consistent with ZFC.
- Every projective set is Lebesgue measurable.
- Every projective set has the Baire property.
- Every projective set has the perfect set property (hence is either countable or has cardinality \(2^{\aleph_0}\)).
- The projective hierarchy does not collapse: \(\Sigma^1_n \neq \Pi^1_n\) for all \(n \geq 1\).
Thus, under PD, the projective sets behave as well as the Borel sets — they are measurable and structured. Without large cardinals, the behavior of \(\Sigma^1_2\) sets is already undecidable.
Section 19.4: The Full Axiom of Determinacy
- Every set of reals is Lebesgue measurable and has the Baire property.
- \(\aleph_1\) is a measurable cardinal.
- AD implies DC (Dependent Choice) but not full AC.
- The club filter on \(\omega_1\) is an ultrafilter.
Chapter 20: Cardinal Characteristics of the Continuum
Section 20.1: Motivation
We know that \(2^{\aleph_0}\) can be many things — \(\aleph_1\), \(\aleph_2\), \(\aleph_{\omega_1}\), and so on. But beyond asking what \(2^{\aleph_0}\) equals, we can ask finer questions: how complex is the combinatorial structure of \(\mathbb{R}\)? Cardinal characteristics of the continuum are cardinal numbers, definable from the reals, that measure specific aspects of this complexity. Each lies between \(\aleph_1\) and \(2^{\aleph_0}\) inclusive, and they can be separated from each other by forcing.
Section 20.2: Characteristics of Baire Space
We work in \(\omega^\omega\), ordered by eventual domination: \(f \leq^* g\) iff \(f(n) \leq g(n)\) for all but finitely many \(n\).
- The bounding number \(\mathfrak{b}\) is the smallest cardinality of a family \(\mathcal{F} \subseteq \omega^\omega\) that is unbounded: no single \(g \in \omega^\omega\) satisfies \(f \leq^* g\) for all \(f \in \mathcal{F}\).
- The dominating number \(\mathfrak{d}\) is the smallest cardinality of a dominating family \(\mathcal{F}\): for every \(g \in \omega^\omega\), there exists \(f \in \mathcal{F}\) with \(g \leq^* f\).
- The splitting number \(\mathfrak{s}\) is the smallest cardinality of a family \(\mathcal{S} \subseteq [\omega]^\omega\) such that for every \(A \in [\omega]^\omega\), some \(S \in \mathcal{S}\) splits \(A\): both \(A \cap S\) and \(A \setminus S\) are infinite.
- The reaping number \(\mathfrak{r}\) is the smallest cardinality of a family \(\mathcal{R} \subseteq [\omega]^\omega\) such that no single \(A \in [\omega]^\omega\) splits every member of \(\mathcal{R}\).
One can show \(\aleph_1 \leq \mathfrak{b} \leq \mathfrak{d} \leq 2^{\aleph_0}\) and \(\mathfrak{b} \leq \mathfrak{s}\) in ZFC.
- A family \(\mathcal{F} \subseteq [\omega]^\omega\) has the strong finite intersection property (sfip) if every finite subfamily has infinite intersection.
- A pseudo-intersection of \(\mathcal{F}\) is an \(A \in [\omega]^\omega\) with \(A \subseteq^* F\) for all \(F \in \mathcal{F}\).
- \(\mathfrak{p}\) is the smallest cardinality of a family with sfip but no pseudo-intersection.
- A tower is a \(\subseteq^*\)-decreasing sequence in \([\omega]^\omega\) with no pseudo-intersection. \(\mathfrak{t}\) is the smallest length of a tower.
Section 20.3: Characteristics of the Null and Meager Ideals
Let \(\mathcal{N}\) be the ideal of Lebesgue null sets and \(\mathcal{M}\) the ideal of meager sets on \(\mathbb{R}\).
- \(\text{add}(\mathcal{I})\): smallest number of sets in \(\mathcal{I}\) whose union is not in \(\mathcal{I}\).
- \(\text{cov}(\mathcal{I})\): smallest number of sets in \(\mathcal{I}\) whose union is all of \(X\).
- \(\text{non}(\mathcal{I})\): smallest cardinality of a set not in \(\mathcal{I}\).
- \(\text{cof}(\mathcal{I})\): cofinality of \(\mathcal{I}\) under \(\subseteq\) (smallest cardinality of a cofinal subfamily).
Section 20.4: The Cichoń Diagram
The ten characteristics are related by the following inequalities, all provable in ZFC:
\[ \begin{array}{ccccc} \text{add}(\mathcal{N}) & \leq & \text{cov}(\mathcal{N}) & \leq & \text{non}(\mathcal{M}) \\ \downarrow & & & & \downarrow \\ \mathfrak{b} & & & & \mathfrak{d} \\ \downarrow & & & & \downarrow \\ \text{add}(\mathcal{M}) & \leq & \text{cov}(\mathcal{M}) & \leq & \text{non}(\mathcal{N}) \\ \end{array} \qquad \leq \quad \text{cof}(\mathcal{N}) \leq 2^{\aleph_0} \]More precisely, with \(\aleph_1\) on the left and \(2^{\aleph_0}\) on the right, all arrows go left-to-right or top-to-bottom, and in every model of ZFC all displayed inequalities hold. A key result is that \(\text{add}(\mathcal{M}) = \min(\mathfrak{b}, \text{cov}(\mathcal{M}))\) and \(\text{cof}(\mathcal{M}) = \max(\mathfrak{d}, \text{non}(\mathcal{M}))\).
- \(\mathfrak{b} < \mathfrak{d}\) (e.g., using a finite support iteration of Hechler forcing).
- \(\text{cov}(\mathcal{N}) < \text{non}(\mathcal{N})\) (random real forcing).
- All ten characteristics are pairwise distinct (Goldstern–Kellner–Shelah).
Chapter 21: PCF Theory and Singular Cardinal Arithmetic
Section 21.1: Motivation — Silver’s Theorem and Beyond
Cardinal arithmetic is wildly undecidable for regular cardinals: by Easton’s theorem, \(2^{\aleph_\alpha}\) can be almost anything for regular \(\aleph_\alpha\). But singular cardinals are different. A remarkable theorem of Silver shows that GCH cannot first fail at a singular cardinal of uncountable cofinality.
The analogous result for cofinality \(\omega\) is false: Magidor showed that starting from a supercompact cardinal, GCH can fail at \(\aleph_\omega\). But even then, how large can \(2^{\aleph_\omega}\) be?
Section 21.2: Shelah’s Bound
This is a ZFC result — no large cardinals. It shows that even though \(2^{\aleph_\omega}\) can exceed \(\aleph_{\omega+1}\), it cannot be arbitrarily large: it is bounded below \(\aleph_{\omega_4}\). The proof uses Shelah’s PCF theory.
Let \(A = \{\aleph_1, \aleph_2, \aleph_3\}\). We want to determine the possible cofinalities of \(\prod A / D\) as \(D\) ranges over ultrafilters on \(A\).
Since \(A\) is a finite set of three elements, the ultrafilters on \(A\) are exactly the three principal ultrafilters \(D_1, D_2, D_3\) concentrated on \(\aleph_1, \aleph_2, \aleph_3\) respectively. (There are no non-principal ultrafilters on a finite set.)
For a principal ultrafilter \(D_i\) concentrated on \(\aleph_i\):
\[ \prod A / D_i \cong \aleph_i \]since every function \(f \in \prod A\) is identified with its value \(f(\aleph_i)\), giving a structure of order type \(\aleph_i\).
Therefore \(\mathrm{cf}(\prod A / D_i) = \mathrm{cf}(\aleph_i)\). Under GCH, all \(\aleph_i\) for \(i \geq 1\) are regular (since \(\aleph_i = \omega_i\) and successor cardinals are regular), so:
\[ \mathrm{pcf}(\{\aleph_1, \aleph_2, \aleph_3\}) = \{\aleph_1, \aleph_2, \aleph_3\} \subseteq [\aleph_1, \aleph_3] \subseteq [\aleph_1, \aleph_4]. \]In the general infinite case, for \(A = \{\aleph_1, \aleph_2, \ldots, \aleph_n, \ldots\}_{n < \omega}\) (the first \(\omega\) uncountable cardinals), the ultrafilters on this countable set include non-principal ones, and the pcf can extend up to \(\aleph_{\omega_4}\) — this is the content of Shelah’s theorem. The pcf generators for this larger set are the subsets \(B_\lambda \subseteq A\) that “generate” each possible cofinality \(\lambda\), and their existence is guaranteed by the pcf structure theorem.
Assume \(2^{\aleph_n} < \aleph_\omega\) for all \(n < \omega\) (i.e., GCH holds cofinally below \(\aleph_\omega\)). We want to bound \(\aleph_\omega^{\aleph_0}\).
Step 1: Represent the product. We have \(\aleph_\omega^{\aleph_0} = |\prod_{n < \omega} \aleph_\omega| \leq |\prod_{n < \omega} \aleph_{n+1}|\) (since \(\aleph_\omega = \sup_{n} \aleph_n\) and replacing each factor by a larger cardinal). The key object is \(\prod_{n<\omega} \aleph_{n+1}\) with the eventual domination order.
Step 2: PCF bound. By Shelah’s pcf theorem, the exact upper bound of this product (the maximum possible cofinality of \(\prod_{n<\omega} \aleph_{n+1} / D\) over all ultrafilters \(D\)) equals the true cofinality of the product. Shelah shows this true cofinality is in \(\mathrm{pcf}(\{\aleph_1, \aleph_2, \ldots\})\).
Step 3: The \(\aleph_{\omega_4}\) bound. Using the pcf generators \(B_\lambda\) for the set \(\{\aleph_n : n \geq 1\}\), Shelah proves that \(\max(\mathrm{pcf}(\{\aleph_n : n \geq 1\})) < \aleph_{\omega_4}\). The argument uses a covering argument: if some \(\lambda \geq \aleph_{\omega_4}\) were in \(\mathrm{pcf}\), one could derive a contradiction with the assumption \(2^{\aleph_n} < \aleph_\omega\) for all \(n\) via the pcf structure theorem and König’s theorem applied to cofinalities.
The conclusion is \(\aleph_\omega^{\aleph_0} \leq \max(\mathrm{pcf}(\{\aleph_n\})) < \aleph_{\omega_4}\), which is Shelah’s celebrated bound — a ZFC theorem with no large cardinal hypotheses.
Section 21.3: PCF Theory
Let \(A\) be a set of regular cardinals with \(|A| < \min(A)\). The key object is the product \(\prod A = \prod_{\lambda \in A} \lambda\) ordered by eventual domination: \(f <^* g\) iff \(f(\lambda) < g(\lambda)\) for all but finitely many (or more generally, all but boundedly many) \(\lambda \in A\).
Thus \(\text{pcf}(A)\) is the set of all regular cardinals that appear as the cofinality of some ultrapower of \(\prod A\).
- \(\min(A) \leq \text{pcf}(A) \subseteq \left[\min(A), \max\text{pcf}(A)\right]\) and \(\max\text{pcf}(A) = \text{cf}(\prod A, <^*)\).
- \(|\text{pcf}(A)| \leq 2^{|A|}\).
- For each \(\lambda \in \text{pcf}(A)\), there is a generator \(B_\lambda \subseteq A\) such that \(\lambda \in \text{pcf}(B_\lambda)\) and \(\lambda \notin \text{pcf}(B_\lambda \setminus \{b\})\) for all \(b \in B_\lambda\). The generators \(\{B_\lambda : \lambda \in \text{pcf}(A)\}\) form a basis for \(\text{pcf}(A)\).
The key open question is whether \(|\text{pcf}(\{\aleph_1, \ldots, \aleph_\omega\})| \leq \aleph_4\). Shelah conjectures this is provable in ZFC; it follows from his bound on \(2^{\aleph_\omega}\).
Section 21.4: The Galvin–Hajnal Theorem and SCH
This bounds cardinal exponentiation at singular cardinals of uncountable cofinality in terms of smaller cardinals, generalizing Silver’s theorem.
SCH follows from GCH, and also from the existence of large cardinals in certain senses. However:
Appendix A: Timeline of Set Theory
The following chronological table traces the development of set theory from its origins to modern research frontiers.
| Year | Event |
|---|---|
| 1874 | Cantor proves the reals are uncountable (first uncountability proof, using a nested interval argument) |
| 1878 | Cantor conjectures the Continuum Hypothesis in a letter to Dedekind |
| 1879–1884 | Cantor develops transfinite ordinals and cardinals in a series of papers |
| 1891 | Cantor’s diagonal argument; proves \(\vert X\vert < \vert \mathcal{P}(X)\vert \) for any set \(X\) |
| 1895–1897 | Cantor publishes the Beiträge, his systematic treatment of cardinals and ordinals |
| 1897 | Burali-Forti paradox: the collection of all ordinals is not a set |
| 1899 | Cantor discovers the paradox of the set of all sets (in correspondence with Dedekind) |
| 1900 | Hilbert lists the Continuum Hypothesis as the first of his 23 problems |
| 1901 | Russell’s paradox: \(R = \{x : x \notin x\}\) leads to contradiction |
| 1903 | Russell publishes The Principles of Mathematics; paradox becomes widely known |
| 1905 | König attempts to prove \(2^{\aleph_0} \neq \aleph_\omega\); error found by Zermelo |
| 1906 | Schröder–Bernstein theorem proved independently by Bernstein and Schröder |
| 1908 | Zermelo publishes the first axiomatization of set theory (Z); proves Well-Ordering Theorem using AC |
| 1910–1913 | Whitehead and Russell publish Principia Mathematica; type theory as alternative foundation |
| 1914 | Hausdorff introduces partially ordered sets, study of order types |
| 1922 | Fraenkel adds Replacement schema to Zermelo’s axioms (ZF) |
| 1922 | Skolem argues for first-order formulation; notes Löwenheim–Skolem theorem |
| 1923 | von Neumann introduces ordinals as sets; the “von Neumann ordinals” |
| 1925 | von Neumann proves the Well-Ordering Theorem using only ZF + AC |
| 1930 | Zermelo publishes revised axioms including Regularity |
| 1930 | Ulam asks the measure problem: which cardinals admit \(\sigma\)-additive measures on all subsets? |
| 1935 | Gödel introduces the constructible universe \(L\) |
| 1938 | Gödel proves \(\mathsf{Con}(\mathsf{ZF}) \Rightarrow \mathsf{Con}(\mathsf{ZFC} + \mathsf{GCH})\) using \(L\) |
| 1939 | Gödel proves AC holds in \(L\) |
| 1943 | Gödel conjectures independence of CH from ZFC |
| 1950 | Erdős and Rado develop infinite Ramsey theory; partition calculus |
| 1953 | Kleene introduces the arithmetical and analytical hierarchies (descriptive set theory) |
| 1957 | Scott shows measurable cardinals imply \(V \neq L\) |
| 1961 | Hanf and Scott develop large cardinal theory; Hanf numbers |
| 1963 | Cohen invents forcing; proves \(\mathsf{Con}(\mathsf{ZFC}) \Rightarrow \mathsf{Con}(\mathsf{ZFC} + \lnot\mathsf{CH})\) |
| 1963 | Cohen proves independence of AC from ZF (using forcing and symmetric models) |
| 1964 | Lévy and Solovay: forcing preserves measurability if the forcing is small |
| 1965 | Solovay proves every \(\Sigma^1_2\) set of reals is Lebesgue measurable if a measurable cardinal exists |
| 1967 | Jensen’s fine structure theory of \(L\); introduction of \(\Diamond\) and \(\square\) |
| 1967 | Solovay and Tennenbaum prove consistency of Suslin’s Hypothesis via iterated forcing |
| 1970 | Easton’s theorem: \(2^\kappa\) can be almost anything for regular \(\kappa\) |
| 1971 | Jensen proves \(\Diamond\) holds in \(L\); Suslin trees exist in \(L\) |
| 1971 | Silver proves GCH cannot first fail at a singular of uncountable cofinality |
| 1974 | Martin’s Axiom + ¬CH is proved consistent (Solovay–Tennenbaum) |
| 1975 | Martin proves Borel Determinacy in ZFC |
| 1976 | Silver proves \(0^\#\) exists iff there are uncountably many Silver indiscernibles |
| 1978 | Magidor proves SCH can fail at \(\aleph_\omega\) from supercompact |
| 1980 | Foreman, Magidor, Shelah prove consistency of Martin’s Maximum (MM) |
| 1984 | Shelah introduces PCF theory; bounds \(\aleph_\omega^{\aleph_0} < \aleph_{\omega_4}\) |
| 1985 | Shelah’s Cardinal Arithmetic develops the full pcf machinery |
| 1987 | Woodin introduces Woodin cardinals |
| 1988 | Martin–Steel theorem: infinitely many Woodin cardinals imply Projective Determinacy |
| 1988 | Foreman–Magidor–Shelah: Martin’s Maximum consistent from supercompact |
| 1992 | Shelah–Woodin: failure of SCH implies large cardinal strength |
| 1994 | Woodin: \(\text{AD}^{L(\mathbb{R})}\) equiconsistent with infinitely many Woodin cardinals |
| 1999 | Woodin announces \(\Omega\)-logic and the case for \(V = \text{Ultimate-}L\) |
| 2000 | Shelah: many ZFC non-equivalent combinatorial principles at \(\aleph_1\) |
| 2003 | Viale proves the singular compactness theorem for SCH |
| 2005 | Goldstern–Shelah: many separation results in the Cichoń diagram |
| 2010 | Magidor–Shelah: structure of pcf under failure of SCH |
| 2013 | Malliaris–Shelah prove \(\mathfrak{p} = \mathfrak{t}\) using Keisler’s order |
| 2014 | Goldstern–Kellner–Mejía–Shelah: all Cichoń characteristics consistently distinct |
| 2016 | Woodin’s Ultimate-\(L\) program: conjecture that a canonical inner model satisfying GCH exists at the level of supercompact cardinals |
| 2017 | Asperó–Schindler: Martin’s Maximum\(^{++}\) implies Woodin’s axiom \((*)\) |
| 2020 | Recent advances in descriptive inner model theory: connection between HOD and canonical models under determinacy |
| 2024 | Active research: inner model theory at the supercompact level, fine structure beyond Woodin cardinals, Ultimate-\(L\) program |
Appendix B: Problem Sets
The following problems are organized by topic and difficulty. Problems in Part A and B are accessible after Chapters 1–5. Problems in Part C require the forcing machinery of Chapter 14. Problems in Part D are research-adjacent and require the full course.
Part A: ZFC and Ordinal Arithmetic (5 problems)
- \(\omega \cdot 3 + \omega^2 \cdot 2 + 1\) (write in decreasing exponent order)
- \((\omega + 1) \cdot \omega\) — show this equals \(\omega^2\)
- \(\omega^\omega + \omega^3 \cdot 5 + \omega \cdot 7 + 3\)
- \(|V_\alpha| = \beth_\alpha\), where \(\beth_0 = \aleph_0\), \(\beth_{\alpha+1} = 2^{\beth_\alpha}\), and \(\beth_\lambda = \sup_{\alpha < \lambda} \beth_\alpha\) for limit \(\lambda\).
- \(V_\omega\) is a model of all ZFC axioms except Infinity.
- If \(\kappa\) is strongly inaccessible, then \(|V_\kappa| = \kappa\).
Part B: Cardinals and the Continuum Hypothesis (5 problems)
- \(|\mathbb{Q}| = \aleph_0\)
- \(|\mathbb{R} \setminus \mathbb{Q}| = 2^{\aleph_0}\)
- \(|\mathcal{P}(\mathbb{N})| = |\mathbb{R}|\)
- The set of all continuous functions \(f: \mathbb{R} \to \mathbb{R}\) has cardinality \(2^{\aleph_0}\)
- \(2^{\aleph_0} \neq \aleph_\omega\)
- \(\aleph_\omega^{\aleph_0} > \aleph_\omega\)
- For any infinite cardinal \(\kappa\), \(\kappa^{\mathrm{cf}(\kappa)} > \kappa\). (This shows the continuum function is "discontinuous" at singular cardinals.)
- \(\aleph_{\omega}^{\aleph_1} = \aleph_{\omega+1}\) (Hint: \(\mathrm{cf}(\aleph_\omega) = \omega < \aleph_1\), so use the GCH cardinal exponentiation formula.)
- For any infinite cardinal \(\kappa\) and \(\lambda \leq \kappa\): \(\kappa^\lambda = \kappa^+\) if \(\lambda\) is cofinal in \(\kappa\), and \(\kappa^\lambda = \kappa\) otherwise.
- Under GCH, the cardinal characteristics of the continuum all equal \(\aleph_1\): \(\mathfrak{b} = \mathfrak{d} = \mathfrak{p} = \mathfrak{t} = \aleph_1\).
- Prove that \(\mathrm{cf}(\aleph_{\omega_1}) = \omega_1\). Is \(\aleph_{\omega_1}\) regular or singular?
- Show that for any limit cardinal \(\kappa\), \(\mathrm{cf}(\kappa) \leq \kappa\).
- Prove that if \(\kappa\) is a singular cardinal, then \(\kappa\) is not measurable. (Hint: a measurable cardinal must be regular.)
- Show that CH is equivalent to: every uncountable subset of \(\mathbb{R}\) has cardinality \(\mathfrak{c}\). (Hint: use the Cantor–Bendixson theorem and properties of closed sets.)
- Show that MA implies \(\neg\)CH. (Hint: under MA, apply MA to the poset of finite partial functions \(2^{<\aleph_1} \to 2\); derive that \(2^{\aleph_0} \geq \aleph_2\).)
- State (without proof) why the existence of an inaccessible cardinal implies \(\mathrm{Con}(\mathsf{ZFC})\), and why this means ZFC alone cannot prove the existence of an inaccessible.
Part C: Forcing and Generic Extensions (5 problems)
- Show that \(\mathbb{P}\) is ccc: any antichain in \(\mathbb{P}\) is countable.
- For each \(n \in \omega\), show that the set \(D_n = \{s \in 2^{<\omega} : |s| > n\}\) is dense in \(\mathbb{P}\).
- For each \(t \in 2^{<\omega}\), show that \(E_t = \{s \in 2^{<\omega} : s \supseteq t\}\) is dense in \(\mathbb{P}\).
- If \(G\) is an \(M\)-generic filter over \(\mathbb{P}\), show that \(g = \bigcup G\) is a total function \(\omega \to 2\).
- Show that \(g \notin M\).
- If \(p \Vdash \varphi\) and \(q \leq p\), then \(q \Vdash \varphi\). (Monotonicity)
- No condition forces both \(\varphi\) and \(\lnot \varphi\). (Consistency)
- For every formula \(\varphi\) with names as parameters, the set \(\{p \in \mathbb{P} : p \Vdash \varphi \text{ or } p \Vdash \lnot\varphi\}\) is dense. (Decidability)
- Show that if \(\kappa\) is a regular cardinal in \(M\) and \(f: \omega \to \kappa\) is in \(M[G]\), then \(f \in M\). (Hint: use ccc to "predict" \(f\) from a name with countably many possibilities.)
- Conclude that ccc forcings do not collapse \(\omega_1\).
- Show that \(\omega_1^{M[G]} = \omega_1^M\) for any ccc forcing \(\mathbb{P}\).
- Show that for each \(\alpha < \aleph_2^M\), the set \(r_\alpha = \{n \in \omega : G(\alpha, n) = 1\}\) is a real in \(M[G]\).
- Show that the map \(\alpha \mapsto r_\alpha\) is injective in \(M[G]\). (Hint: for \(\alpha \neq \beta\), show the set of conditions where \(r_\alpha\) and \(r_\beta\) differ is dense.)
- Conclude that \(M[G] \models 2^{\aleph_0} \geq \aleph_2\).
- Since \(\mathbb{P}\) is ccc, we have \(\aleph_2^{M[G]} = \aleph_2^M\), so \(M[G] \models 2^{\aleph_0} \geq \aleph_2 = \aleph_2^{M[G]}\), giving \(\neg\mathsf{CH}\) in \(M[G]\).
- Show that \(\mathbb{P}\) is not ccc. (Hint: find an uncountable antichain.)
- Show that if \(G\) is \(M\)-generic over \(\mathbb{P}\), then \(\bigcup G: \omega \to \omega_1^M\) is a surjection, hence \(\omega_1^M\) is countable in \(M[G]\).
- Conclude that \(\omega_1^{M[G]} < \omega_1^M\): the forcing collapses \(\omega_1\).
- Explain why this does not contradict the ZFC theorem "\(\omega_1\) is uncountable": the statement "\(\omega_1^M\) is uncountable" is a statement about \(M\), while \(M[G]\) sees additional countable sequences not in \(M\).
Part D: Large Cardinals — Challenge Problems (5 problems)
- Verify that \(V_\kappa\) satisfies each of the ZFC axioms when quantifiers are restricted to \(V_\kappa\). Be explicit about which properties of \(\kappa\) are used for each axiom.
- Show that the statement "\(\kappa\) is inaccessible" is a \(\Pi_1\) statement (it can be expressed as a universal statement about sets of rank less than \(\kappa\)), and use this to argue that any elementary embedding \(j: V \to M\) with critical point \(\lambda\) satisfies \(j(\kappa)\) is inaccessible in \(M\).
- Using Łoś's Theorem, show that the ultrapower \(j: V \to \mathrm{Ult}(V, \mathcal{U})\) is a well-defined elementary embedding.
- Show that the critical point of \(j\) is exactly \(\kappa\): that is, \(j(\alpha) = \alpha\) for all \(\alpha < \kappa\), and \(j(\kappa) > \kappa\).
- Show that \(j(\kappa)\) is inaccessible in \(\mathrm{Ult}(V, \mathcal{U})\), hence measurable cardinals are limits of inaccessible cardinals.
- Show that every Woodin cardinal is a limit of measurable cardinals. (Hint: for each function \(f: \delta \to \delta\), find a measurable \(\kappa < \delta\) closed under \(f\).)
- Explain the key step of Martin–Steel: why the existence of infinitely many Woodin cardinals implies all projective sets are determined. (Sketch the "generic absoluteness" argument.)
- Show that \(\mathrm{cf}(\aleph_\omega^{\aleph_0}) > \aleph_\omega\) using König's theorem. (This shows \(\aleph_\omega^{\aleph_0} \neq \aleph_\omega\).)
- Using the definition of \(\mathrm{pcf}(A)\) for \(A = \{\aleph_1, \aleph_2, \ldots\}\), explain why \(\aleph_\omega^{\aleph_0} \leq \max(\mathrm{pcf}(A))\).
- State Shelah's theorem that \(\max(\mathrm{pcf}(\{\aleph_n : 1 \leq n < \omega\})) < \aleph_{\omega_4}\), and explain the role of the pcf generators in this bound.
- State Gödel's program: he hoped that large cardinal axioms would eventually decide CH. Explain why the independence of CH from ZFC + measurable cardinal (shown by Cohen's method) complicated this program.
- Woodin's \(\Omega\)-logic is a semantic notion of logical consequence that is invariant under set-forcing. State (informally) the conjecture that \(\mathsf{CH}\) is \(\Omega\)-provable from large cardinal axioms, and what this would mean philosophically.
- State the conjecture (Asperó–Schindler 2017, partially resolved) that Martin's Maximum\(^{++}\) is equivalent to Woodin's axiom \((*)\), and explain what this suggests about the "right" value of \(2^{\aleph_0}\) from the perspective of forcing axioms.
Chapter 22: Ordinal and Cardinal Arithmetic — Deep Dives
This chapter consolidates and extends the arithmetic introduced in Chapters 2 and 3, providing additional worked examples, proofs of key structural theorems, and connections to the later forcing and large-cardinal material.
Section 22.1: Non-Commutativity of Ordinal Multiplication
Ordinal multiplication, like ordinal addition, fails to commute in general. The failure is perhaps even more striking than for addition: whereas \(1 + \omega = \omega\) says “prepending a single point is absorbed into an \(\omega\)-sequence,” the failure \(2 \cdot \omega \neq \omega \cdot 2\) says something deeper about the structure of infinite repeated concatenations.
Recall the definition: \(\alpha \cdot \beta\) is defined by transfinite recursion on \(\beta\):
\[ \alpha \cdot 0 = 0, \quad \alpha \cdot (\beta + 1) = \alpha \cdot \beta + \alpha, \quad \alpha \cdot \lambda = \sup_{\xi < \lambda} \alpha \cdot \xi \text{ for limit } \lambda. \]Computing \(\omega \cdot 2\):
\[ \omega \cdot 2 = \omega \cdot (1 + 1) = \omega \cdot 1 + \omega = \omega + \omega. \]Concretely, \(\omega \cdot 2 = \omega + \omega = \{0, 1, 2, \ldots, \omega, \omega+1, \omega+2, \ldots\}\). This is the order type of two disjoint copies of \(\omega\) placed one after the other: the first copy \(\{0, 1, 2, \ldots\}\) followed by a second copy \(\{\omega, \omega+1, \omega+2, \ldots\}\). It has no largest element and is a limit ordinal.
Computing \(2 \cdot \omega\):
\[ 2 \cdot \omega = \sup_{n < \omega} 2 \cdot n. \]Now \(2 \cdot n = n + n = 2n\) (a finite even number), so the sequence is \(0, 2, 4, 6, \ldots\). The supremum is \(\omega\). Thus \(2 \cdot \omega = \omega\).
Summary:
\[ \omega \cdot 2 = \omega + \omega \neq \omega = 2 \cdot \omega. \]Intuition: \(\omega \cdot 2\) means “two copies of \(\omega\)” — a genuine \(\omega + \omega\). But \(2 \cdot \omega\) means “\(\omega\) copies of 2” — and \(\omega\) pairs \(\{0,1\}, \{2,3\}, \{4,5\}, \ldots\) placed end-to-end produce an order of type \(\omega\), not \(\omega \cdot 2\). In general, \(\alpha \cdot \beta\) is the order type of \(\beta\) disjoint copies of \(\alpha\) placed in sequence; since we take the \(\beta\)-many copies in the \(\beta\)-order, it is the \(\beta\)-fold concatenation of \(\alpha\). The key asymmetry is that multiplication is “left-repeated addition”: \(\omega \cdot n\) means \(n\) copies of \(\omega\), while \(n \cdot \omega\) means \(\omega\) copies of the \(n\)-element ordinal — which is just \(\omega\) itself.
For any \(n \geq 1\):
\[ n \cdot \omega = \sup_{k < \omega} n \cdot k = \sup_{k < \omega} nk = \omega, \]since the finite products \(n \cdot 0, n \cdot 1, n \cdot 2, \ldots = 0, n, 2n, \ldots\) are cofinal in \(\omega\). Thus “any finite positive ordinal times \(\omega\) equals \(\omega\).” By contrast, \(\omega \cdot n = \omega + \omega + \cdots + \omega\) (\(n\) times), which grows without bound as \(n \to \omega\). This illustrates why the left factor governs the “block size” and the right factor governs the “number of blocks.”
Section 22.2: The Diagonal Argument and \(2^{\aleph_0} > \aleph_0\)
Cantor’s diagonal argument is the most elegant proof in all of mathematics. It deserves a fully explicit presentation.
That is, \(d\) disagrees with \(f_n\) at position \(n\) for every \(n\). Then \(d \in 2^\omega\), but \(d \neq f_n\) for any \(n\) (since \(d(n) \neq f_n(n)\)). This contradicts the assumption that \(f_0, f_1, f_2, \ldots\) is an enumeration of all of \(2^\omega\). Therefore \(2^\omega\) is uncountable.
Imagine the following (hypothetical) list of binary sequences:
\[ \begin{array}{c|cccccc} n & 0 & 1 & 2 & 3 & 4 & \cdots \\ \hline f_0 & \mathbf{0} & 1 & 1 & 0 & 1 & \cdots \\ f_1 & 1 & \mathbf{1} & 0 & 1 & 0 & \cdots \\ f_2 & 0 & 0 & \mathbf{1} & 1 & 1 & \cdots \\ f_3 & 1 & 1 & 0 & \mathbf{0} & 0 & \cdots \\ f_4 & 0 & 1 & 1 & 1 & \mathbf{1} & \cdots \\ \vdots & & & & & & \ddots \end{array} \]The diagonal entries (bolded) are \(f_0(0) = 0, f_1(1) = 1, f_2(2) = 1, f_3(3) = 0, f_4(4) = 1, \ldots\) The diagonal sequence \(d\) flips each: \(d = 1, 0, 0, 1, 0, \ldots\) This sequence \(d\) differs from every \(f_n\) in the list, showing the list was incomplete.
Section 22.3: Ordinal Exponentiation and \(\varepsilon_0\)
By definition, \(\omega^\omega = \sup_{n < \omega} \omega^n\). We compute the first few values:
\[ \omega^0 = 1, \quad \omega^1 = \omega, \quad \omega^2 = \omega \cdot \omega, \quad \omega^3 = \omega^2 \cdot \omega, \quad \ldots \]Each \(\omega^n\) is a limit ordinal strictly greater than all \(\omega^k\) for \(k < n\). The ordinal \(\omega^\omega\) is then the supremum of the sequence \(1, \omega, \omega^2, \omega^3, \ldots\) — the smallest ordinal greater than each \(\omega^n\).
To visualize: \(\omega^n\) is the order type of \(n\)-tuples of natural numbers under the lexicographic order. So \(\omega^2\) consists of all pairs \((a, b)\) ordered lexicographically, \(\omega^3\) consists of all triples, etc. The ordinal \(\omega^\omega\) consists of all finite sequences of natural numbers, ordered lexicographically: first by length, then lexicographically within each length class. Every countable ordinal below \(\omega^\omega\) has a Cantor Normal Form using only finitely many \(\omega^n\) terms — \(\omega^\omega\) is the first ordinal whose normal form requires arbitrarily large exponents.
Define a sequence \(\alpha_0 = \omega\) and \(\alpha_{n+1} = \omega^{\alpha_n}\):
\[ \alpha_0 = \omega, \quad \alpha_1 = \omega^\omega, \quad \alpha_2 = \omega^{\omega^\omega}, \quad \alpha_3 = \omega^{\omega^{\omega^\omega}}, \quad \ldots \]Set \(\varepsilon_0 = \sup_{n < \omega} \alpha_n\). Then:
\[ \omega^{\varepsilon_0} = \omega^{\sup_n \alpha_n} = \sup_n \omega^{\alpha_n} = \sup_n \alpha_{n+1} = \varepsilon_0. \]So \(\varepsilon_0\) is a fixed point of \(\alpha \mapsto \omega^\alpha\): it satisfies \(\omega^{\varepsilon_0} = \varepsilon_0\). It is the least such fixed point, since any fixed point \(\beta\) of \(\alpha \mapsto \omega^\alpha\) must satisfy \(\beta \geq \omega^0 = 1\), \(\beta = \omega^\beta \geq \omega\), \(\beta = \omega^\beta \geq \omega^\omega\), etc., giving \(\beta \geq \varepsilon_0\) by induction.
In proof theory, \(\varepsilon_0\) is the “proof-theoretic ordinal” of Peano Arithmetic: transfinite induction up to \(\varepsilon_0\) is exactly what PA cannot prove, but ZFC can. This gives a precise measure of the strength of first-order arithmetic relative to set theory.
Section 22.4: The Hartogs Number and Well-Orderability
Chapter 23: Set-Theoretic Topology and the Real Line
The real line is the primary testing ground for set-theoretic questions. Many combinatorial principles — CH, MA, PFA — have beautiful and surprising consequences for the topology of \(\mathbb{R}\) and related spaces. This chapter explores those connections.
Section 23.1: Suslin’s Problem
Suslin’s Problem asks whether every complete dense linear order without endpoints that satisfies the countable chain condition (every family of disjoint open intervals is countable) must be isomorphic to \(\mathbb{R}\). The real line itself satisfies all these hypotheses — but are there “Suslin lines” that look locally like \(\mathbb{R}\) but are not \(\mathbb{R}\)?
- A Suslin line exists if and only if a Suslin tree exists.
- Suslin trees exist in Gödel's \(L\) (indeed, \(\Diamond \Rightarrow\) Suslin tree).
- Martin's Axiom + \(\neg\)CH implies there are no Suslin lines (Solovay–Tennenbaum, 1971).
Section 23.2: The Baire Category Theorem and Cardinal Characteristics
The Baire Category Theorem states that in a complete metric space, any countable intersection of dense open sets is dense. Cardinal characteristics of the continuum measure how far this result can be extended.
The cardinal \(\text{cov}(\mathcal{M})\) (covering number of the meager ideal) measures the minimum number of meager sets needed to cover \(\mathbb{R}\). Under ZFC alone, \(\aleph_1 \leq \text{cov}(\mathcal{M}) \leq 2^{\aleph_0}\), and MA implies \(\text{cov}(\mathcal{M}) = 2^{\aleph_0}\). This is one face of the Cichoń diagram.
Section 23.3: Measure Theory and the Null Ideal
- \(\text{add}(\mathcal{N})\): the least \(\kappa\) such that \(\kappa\) null sets can cover a non-null set.
- \(\text{cov}(\mathcal{N})\): the least \(\kappa\) such that \(\kappa\) null sets cover \(\mathbb{R}\).
- \(\text{non}(\mathcal{N})\): the least cardinality of a non-null set (i.e., the least size of a set of positive outer measure).
- \(\text{cof}(\mathcal{N})\): the cofinality of \(\mathcal{N}\) (the least size of a cofinal subfamily).
Section 23.4: The Continuum Hypothesis and Topology
Chapter 24: Inner Models and the Fine Structure of L
Section 24.1: The Condensation Lemma — Full Proof
The Condensation Lemma is the key tool for proving GCH in \(L\). We give a more detailed presentation than in Chapter 13.
Section 24.2: The Global Square \(\square_\kappa\)
- Each \(C_\alpha\) is a club in \(\alpha\).
- If \(\text{cf}(\alpha) < \kappa\), then \(|C_\alpha| < \kappa\).
- Coherence: if \(\beta\) is a limit point of \(C_\alpha\), then \(C_\beta = C_\alpha \cap \beta\).
Section 24.3: Core Model Theory (Overview)
The core model \(K\) (or Dodd–Jensen core model) is a canonical inner model that extends \(L\) to accommodate measurable cardinals and beyond.
The core model has several key properties:
- \(L \subseteq K \subseteq V\).
- \(K\) satisfies GCH.
- Covering theorem: if there is no inner model with a measurable cardinal, then for every uncountable set \(X\) of ordinals, there exists \(Y \in K\) with \(X \subseteq Y\) and \(|Y| = |X|\).
- The \(\square_\kappa\) holds in \(K\) for all uncountable \(\kappa\).
Chapter 25: Advanced Forcing Techniques
Section 25.1: Iterated Forcing
Single forcing extensions are powerful, but many independence results require adding many generic objects simultaneously or sequentially. Iterated forcing formalizes the process of forcing repeatedly.
Section 25.2: Proper Forcing in Detail
Every ccc forcing is proper (trivially), and every countably closed forcing (\(\omega\)-closed: any countable descending chain has a lower bound) is proper. The class of proper forcings is closed under countable support iterations (by Shelah’s theorem), making it the natural setting for long iterations without collapsing \(\omega_1\).
Section 25.3: Forcing and Stationary Sets
Section 25.4: Large Cardinals and Forcing
Chapter 26: Descriptive Set Theory — Advanced Topics
Section 26.1: Uniformization and the Axiom of Choice
Section 26.2: The Perfect Set Theorem for Analytic Sets
Section 26.3: Wadge Degrees
Chapter 27: The Set-Theoretic Multiverse
Section 27.1: The Multiverse View of Set Theory
Traditional set theory takes a “universist” view: there is one true universe \(V\) of all sets, and the axioms of ZFC plus large cardinals describe it. But the profusion of independence results — CH, GCH, Suslin’s hypothesis, the Whitehead problem, and hundreds more — has led some set theorists to advocate for a “multiverse” view.
The set-theoretic multiverse is the collection of all possible universes of set theory — all models of ZFC, all forcing extensions, all inner models, and all their combinations. Rather than asking “what is true?” we ask “what is true in which universes?” and study the structure of the multiverse itself.
- Forcing extensions: if \(M \in \mathcal{M}\) and \(G\) is \(M\)-generic for some \(\mathbb{P} \in M\), then \(M[G] \in \mathcal{M}\).
- Inner models: if \(M \in \mathcal{M}\), then \(L^M\) (the constructible universe as computed in \(M\)) is in \(\mathcal{M}\).
- Ground models: if \(M[G] \in \mathcal{M}\) for some generic \(G\), then \(M \in \mathcal{M}\).
Section 27.2: Hamkins’s Multiverse Axioms
Joel David Hamkins proposed a formal theory of the set-theoretic multiverse, characterized by “multiverse axioms” describing the richness and variety of the multiverse. Key axioms include:
- Well-foundedness mirage: Every universe appears ill-founded from the perspective of some other universe.
- Forcing extension axiom: For every universe \(M\) and every forcing \(\mathbb{P} \in M\), there is a forcing extension \(M[G] \in \mathcal{M}\).
- Ground model axiom: Every universe is a forcing extension of some inner model.
- Countability: Every universe appears countable from the perspective of some larger universe.
Section 27.3: Woodin’s \(\Omega\)-Logic and the Search for New Axioms
- The set of \(\Omega\)-provable sentences is recursively enumerable (has an explicit enumeration).
- Every \(\Sigma^2_1\) sentence that is \(\Omega\)-consistent is \(\Omega\)-provable from ZFC + large cardinals.
- CH is \(\Omega\)-independent of ZFC: both CH and \(\neg\)CH are \(\Omega\)-consistent.
Appendix C: Hints and Selected Solutions to Problem Sets
Part A Hints
A1. For the second part, assume \(R_X \in X\). Then ask: is \(R_X \in R_X\)? If \(R_X \in R_X\), then by definition of \(R_X\) (as \(\{x \in X : x \notin x\}\)), we get \(R_X \notin R_X\) — contradiction. If \(R_X \notin R_X\), then \(R_X \in X\) and \(R_X \notin R_X\), so by definition \(R_X \in R_X\) — contradiction. Either way contradicts the assumption, so \(R_X \notin X\).
A2. For part (2): compute \((\omega+1) \cdot n\) by induction. \((\omega+1) \cdot 1 = \omega + 1\). \((\omega+1) \cdot 2 = (\omega+1) \cdot 1 + (\omega+1) = \omega + 1 + \omega + 1 = \omega \cdot 2 + 1\) (since \(1 + \omega = \omega\)). In general, \((\omega+1) \cdot n = \omega \cdot n + 1\). Then \((\omega+1) \cdot \omega = \sup_{n < \omega} (\omega \cdot n + 1) = \sup_{n < \omega} \omega \cdot n = \omega^2\), since \(\omega \cdot n + 1 < \omega \cdot (n+1)\) and \(\sup_n \omega \cdot n = \omega^2\).
A3. The base cases \(\gamma = 0\): \((\alpha + \beta) + 0 = \alpha + \beta = \alpha + (\beta + 0)\). Successor: \((\alpha + \beta) + (\gamma + 1) = ((\alpha + \beta) + \gamma) + 1 = (\alpha + (\beta + \gamma)) + 1 = \alpha + ((\beta + \gamma) + 1) = \alpha + (\beta + (\gamma + 1))\). Limit: \((\alpha + \beta) + \lambda = \sup_{\xi < \lambda} ((\alpha + \beta) + \xi) = \sup_{\xi < \lambda} (\alpha + (\beta + \xi)) = \alpha + \sup_{\xi < \lambda} (\beta + \xi) = \alpha + (\beta + \lambda)\).
A4. For (1): use induction. \(|V_0| = |\emptyset| = 0 = \beth_0\)… wait, \(\beth_0 = \aleph_0\) and \(|V_\omega| = \aleph_0\). Adjust: \(|V_n| = n\) for finite \(n\), \(|V_\omega| = \aleph_0 = \beth_0\), \(|V_{\omega+1}| = |\mathcal{P}(V_\omega)| = 2^{\aleph_0} = \beth_1\), etc. So \(|V_{\omega+\alpha}| = \beth_\alpha\) for all \(\alpha \geq 0\).
A5. Replacement gives \(\text{Im}(f) = \{n^2 : n \in \omega\}\) as a set. Separation with \(\varphi(m) = \exists n \in \omega\,(m = n \cdot n)\) applied to \(\omega\) gives \(\{m \in \omega : \exists n \in \omega\,(m = n \cdot n)\}\). These are the same set: a natural number \(m\) is in the image of \(n \mapsto n^2\) iff there exists \(n\) with \(m = n^2\), i.e., iff \(m\) is a perfect square.
Part B Hints
B1. For (3): use the binary expansion bijection \(\mathcal{P}(\mathbb{N}) \to [0,1]\) via \(A \mapsto \sum_{n \in A} 2^{-(n+1)}\). This is not quite a bijection (dyadic rationals have two representations), but the set of exceptions is countable, so Schröder–Bernstein applies. For (4): a continuous function is determined by its values on \(\mathbb{Q}\), giving an injection \(C(\mathbb{R}, \mathbb{R}) \hookrightarrow \mathbb{R}^\mathbb{Q}\). Since \(|\mathbb{R}^\mathbb{Q}| = (2^{\aleph_0})^{\aleph_0} = 2^{\aleph_0}\) and the constant functions give an injection \(\mathbb{R} \hookrightarrow C(\mathbb{R},\mathbb{R})\), Schröder–Bernstein gives \(|C(\mathbb{R},\mathbb{R})| = 2^{\aleph_0}\).
B2. For (1): König’s theorem says \(\sum_{n < \omega} \aleph_n < \prod_{n < \omega} \aleph_{n+1}\), i.e., \(\aleph_\omega < \aleph_\omega^{\aleph_0}\) (a weaker form). For the specific claim that \(2^{\aleph_0} \neq \aleph_\omega\): suppose \(2^{\aleph_0} = \aleph_\omega\). Then \(2^{\aleph_0} = \aleph_\omega\), but \(\text{cf}(\aleph_\omega) = \omega\), so by König, \(\aleph_\omega^{\text{cf}(\aleph_\omega)} = \aleph_\omega^\omega > \aleph_\omega\). But \(\aleph_\omega^\omega = (2^{\aleph_0})^\omega = 2^{\aleph_0 \cdot \omega} = 2^{\aleph_0} = \aleph_\omega\), contradiction.
B3. For (1): by the GCH exponentiation formula, since \(\text{cf}(\aleph_\omega) = \omega \leq \aleph_1\), we are in the case \(\text{cf}(\lambda) \leq \kappa \leq \lambda\) with \(\kappa = \aleph_1\) and \(\lambda = \aleph_\omega\). Wait — the formula gives \(\kappa^\lambda = \lambda^+\) if \(\kappa \leq \text{cf}(\lambda)\) and \(\kappa^\lambda = \lambda\) if \(\text{cf}(\lambda) < \kappa \leq \lambda\). Here \(\aleph_\omega^{\aleph_1}\): \(\kappa = \aleph_\omega\), \(\lambda = \aleph_1\). Since \(\lambda = \aleph_1 < \aleph_\omega = \kappa\), this falls under the case of exponentiating a larger cardinal to a smaller one. Under GCH: \(\kappa^\lambda = \kappa^+\) if \(\text{cf}(\kappa) \leq \lambda\), i.e., \(\omega \leq \aleph_1\) — true. So \(\aleph_\omega^{\aleph_1} = \aleph_{\omega+1}\). ✓
Part C Hints
C1. For (1) (ccc): any antichain in \(2^{<\omega}\) consists of pairwise incompatible strings. Two strings are incompatible iff neither extends the other. Any antichain of binary strings is countable because: map each string \(s\) of length \(n\) to the interval \([0.\underbrace{s}_n, 0.\underbrace{s}_n + 2^{-n})\) in \([0,1]\). Pairwise incompatible strings map to disjoint intervals, and \([0,1]\) contains only countably many disjoint sub-intervals (since \(\mathbb{R}\) is second-countable). For (5) (\(g \notin M\)): for each \(r \in M \cap 2^\omega\), the set \(D_r = \{s \in 2^{<\omega} : \exists n\,(s(n) \neq r(n))\}\) is dense (any string can be extended to disagree with \(r\) somewhere). Since \(G\) is \(M\)-generic, \(G \cap D_r \neq \emptyset\), so \(g = \bigcup G\) disagrees with \(r\). Since this holds for all \(r \in M\), \(g \notin M\).
C2. For (1): if \(p \Vdash \varphi\) and \(q \leq p\), then any generic \(G\) containing \(q\) also contains \(p\) (since \(G\) is upward closed under \(\geq\)). Since \(p \in G\) and \(p \Vdash \varphi\), the Truth Lemma gives \(M[G] \models \varphi\). Since \(G\) was arbitrary, \(q \Vdash \varphi\). For (3): given \(\varphi\), the set \(\{p : p \Vdash \varphi \text{ or } p \Vdash \neg\varphi\}\) is dense because: if \(p\) does not force \(\neg\varphi\), then there exists \(q \leq p\) forcing \(\varphi\) (by definition of forcing for negation).
Part D Hints
D1. For (1): for each ZFC axiom, identify which property of \(\kappa\) is needed. Infinity: \(\omega \in V_\kappa\) since \(\kappa > \omega\). Replacement: if \(\varphi\) defines a function on a set \(x \in V_\kappa\), the image has rank \(\leq \sup_{y \in x} (\text{rank}(\varphi(y))+1)\); regularity of \(\kappa\) ensures this sup is \(< \kappa\) since it is the union of \(|x| < \kappa\) many ordinals each \(< \kappa\). Power set: for \(x \in V_\kappa\), \(\mathcal{P}(x) \in V_\kappa\) because \(|\mathcal{P}(x)| = 2^{|x|} < \kappa\) (strong limit property).
D2. For (2): \(j(\alpha) = \alpha\) for \(\alpha < \kappa\) because the equivalence class of the constant function \(c_\alpha: \kappa \to \{\alpha\}\) satisfies \([c_\alpha] = j(\alpha)\), and \([c_\alpha]\) computes to \(\alpha\) by Łoś’s theorem and the fact that \(\alpha < \kappa\). For \(j(\kappa) > \kappa\): consider the equivalence class \([\text{id}]\) where \(\text{id}: \kappa \to \kappa\) is the identity. By Łoś, \([\text{id}] < j(\kappa) = [c_\kappa]\)… actually \(j(\kappa)\) is the equivalence class of the constant function \(c_\kappa\), and \([\text{id}] < [c_\kappa]\) because \(\{\alpha < \kappa : \text{id}(\alpha) < \kappa\} = \kappa \in \mathcal{U}\). Moreover \(\kappa = [\hat{\kappa}]\) where \(\hat{\kappa}\) is the constant-\(\kappa\) function. The actual critical point calculation uses the fact that \(j(\kappa) = [\text{id}_\kappa]\) in many standard ultrapower constructions, giving \(j(\kappa) > \kappa\).
Appendix D: Notation and Conventions
The following table collects the principal notation used throughout these notes.
| Symbol | Meaning |
|---|---|
| \(\in\) | Set membership |
| \(\subseteq\) | Subset (not necessarily proper) |
| \(\subsetneq\) | Proper subset |
| \(\emptyset\) | Empty set |
| \(\mathcal{P}(X)\) | Power set of \(X\) |
| \(\bigcup X\) | Union of all elements of \(X\) |
| \(\bigcap X\) | Intersection of all elements of \(X\) (for \(X \neq \emptyset\)) |
| \(\omega\) | The first infinite ordinal (= set of natural numbers) |
| \(\omega_\alpha, \aleph_\alpha\) | The \(\alpha\)-th infinite cardinal |
| \(\mathbf{Ord}\) | The proper class of all ordinals |
| \(\mathbf{Card}\) | The proper class of all cardinals |
| \(V_\alpha\) | The \(\alpha\)-th level of the von Neumann hierarchy |
| \(L_\alpha\) | The \(\alpha\)-th level of the constructible hierarchy |
| \(\text{cf}(\alpha)\) | Cofinality of the ordinal/cardinal \(\alpha\) |
| \(\vert X\vert \) | Cardinality of \(X\) |
| \(\kappa^+\) | Successor cardinal of \(\kappa\) |
| \(\beth_\alpha\) | The \(\alpha\)-th beth number (\(\beth_0 = \aleph_0\), \(\beth_{\alpha+1} = 2^{\beth_\alpha}\)) |
| \(\mathfrak{c}\) | Cardinality of the continuum \(= 2^{\aleph_0}\) |
| \(\mathfrak{b}, \mathfrak{d}, \mathfrak{p}, \mathfrak{t}, \mathfrak{s}, \mathfrak{r}\) | Cardinal characteristics of the continuum |
| \(\Diamond\) | Diamond principle (Jensen) |
| \(\square_\kappa\) | Square principle at \(\kappa\) (Jensen) |
| \(p \Vdash \varphi\) | Condition \(p\) forces formula \(\varphi\) |
| \(M[G]\) | Generic extension of \(M\) by \(G\) |
| \(\text{Ult}(V, \mathcal{U})\) | Ultrapower of \(V\) by ultrafilter \(\mathcal{U}\) |
| \(j: V \to M\) | Elementary embedding (large cardinal embedding) |
| \(\text{crit}(j)\) | Critical point of elementary embedding \(j\) |
| \(\text{pcf}(A)\) | Set of possible cofinalities of products of sets in \(A\) |
| \(\Sigma^1_n, \Pi^1_n, \Delta^1_n\) | Levels of the projective hierarchy |
| \(\mathbf{\Sigma}^0_\alpha, \mathbf{\Pi}^0_\alpha\) | Levels of the Borel hierarchy |
| AD | Axiom of Determinacy |
| PD | Projective Determinacy |
| CH | Continuum Hypothesis (\(2^{\aleph_0} = \aleph_1\)) |
| GCH | Generalized Continuum Hypothesis |
| SCH | Singular Cardinal Hypothesis |
| MA | Martin’s Axiom |
| PFA | Proper Forcing Axiom |
| MM | Martin’s Maximum |
Appendix E: Index of Major Theorems
The following index lists the major theorems, lemmas, and definitions of these notes by name and section.
| Theorem / Lemma | Section | Statement (brief) |
|---|---|---|
| Extensionality Axiom | Ch.1 | Sets equal iff same members |
| Separation Schema | Ch.1 | \(\{x \in X : \varphi(x)\}\) is a set |
| Replacement Schema | Ch.1 | Image of set under class-function is a set |
| Russell’s Paradox | Ch.1 | \(R = \{x:x\notin x\}\) is contradictory |
| Von Neumann Hierarchy | Ch.1 | \(V = \bigcup_\alpha V_\alpha\) covers all sets |
| Transfinite Induction | Ch.2 | Induction over all ordinals |
| Transfinite Recursion | Ch.2 | Definitions by recursion over ordinals |
| Cantor Normal Form | Ch.2 | Unique base-\(\omega\) expansion for ordinals |
| Schröder–Bernstein | Ch.3 | Two injections imply bijection |
| Cantor’s Theorem | Ch.3 | \(\vert X\vert < \vert \mathcal{P}(X)\vert \) for all \(X\) |
| Cantor Diagonal Argument | Ch.3 | \(2^\omega\) is uncountable |
| König’s Theorem | Ch.3 | \(\sum \kappa_i < \prod \lambda_i\) when \(\kappa_i < \lambda_i\) |
| Zorn’s Lemma | Ch.5 | Equivalent to AC: maximal elements exist |
| Well-Ordering Theorem | Ch.5 | Equivalent to AC: every set well-orderable |
| Cantor–Bendixson | Ch.11 | Every closed set = perfect + countable |
| Perfect Set Theorem | Ch.11, Ch.26 | Uncountable analytic sets contain perfect subsets |
| Suslin’s Theorem | Ch.11, Ch.19 | \(\Delta^1_1\) = Borel sets |
| Mostowski Collapse | Ch.12 | Well-founded extensional structure collapses to transitive set |
| Reflection Principle | Ch.12 | Every formula reflects to some \(V_\alpha\) |
| Löwenheim–Skolem | Ch.12 | Every ZFC model has countable elementary substructure |
| Condensation Lemma | Ch.13, Ch.24 | Elementary substructures of \(L_\alpha\) collapse to \(L_\beta\) |
| GCH in L | Ch.13 | \(L \models 2^{\aleph_\kappa} = \aleph_{\kappa+1}\) |
| Forcing Theorem | Ch.14 | \(M[G]\models\varphi\) iff some \(p\in G\) forces \(\varphi\) |
| Rasiowa–Sikorski | Ch.14 | Generic filters exist for countably many dense sets |
| Cohen’s Theorem | Ch.14 | CH is independent of ZFC |
| ccc Preservation | Ch.14, Ch.25 | ccc forcings preserve all cardinals |
| Lévy–Solovay | Ch.25 | Small forcing preserves large cardinals |
| Łoś’s Theorem | Ch.7 | Ultrapower elementarily extends base structure |
| Scott’s Theorem | Ch.10 | Measurable cardinal implies \(V \neq L\) |
| Jensen’s Covering | Ch.10 | If no \(0^\#\), then \(L\) covers \(V\) |
| Fodor’s Lemma | Ch.8 | Regressive function on stationary set has stationary fiber |
| Solovay Splitting | Ch.8 | Every stationary set splits into \(\kappa\) stationary parts |
| Martin–Steel | Ch.17, Ch.19 | \(\infty\) Woodin cardinals \(\Rightarrow\) PD |
| Silver’s Theorem | Ch.21 | GCH cannot first fail at a singular of uncountable cofinality |
| Shelah’s Bound | Ch.21 | \(\aleph_\omega^{\aleph_0} < \aleph_{\omega_4}\) |
| Malliaris–Shelah | Ch.20 | \(\mathfrak{p} = \mathfrak{t}\) |
| Borel Determinacy | Ch.19 | Every Borel game is determined (Martin, ZFC) |
| Kondo Uniformization | Ch.26 | Every \(\Pi^1_1\) set uniformizable by a \(\Pi^1_1\) function |
| Wadge’s Lemma | Ch.26 | Under AD, Wadge degrees are linearly ordered |
Chapter 28: Model-Theoretic Tools in Set Theory
Set theory and model theory are deeply intertwined. The tools of model theory — elementary embeddings, compactness, Löwenheim–Skolem — both motivate and are used in set-theoretic arguments. This chapter collects the principal model-theoretic ideas used throughout these notes and develops them further.
Section 28.1: Elementarity and the Tarski–Vaught Test
We write \(j: M \prec N\) to indicate that \(j\) is an elementary embedding.
Section 28.2: The Compactness Theorem and Its Set-Theoretic Uses
The same trick is used in set theory: add sentences asserting the existence of a transitive model of ZFC containing a specific sequence of large cardinals, and use compactness to deduce the existence of such a model. This is the basis of the “reflection argument” for large cardinals.
Section 28.3: Löwenheim–Skolem and Skolem’s Paradox
The resolution: “uncountable” is a relative notion. \(M \models \text{"}\mathbb{R}\text{ is uncountable"}\) means there is no bijection \(\omega \to \mathbb{R}^M\) in \(M\). But from outside \(M\), we can see that \(M\) is countable — we just cannot “see” any bijection \(\omega \to \mathbb{R}^M\) that is an element of \(M\). The bijection exists in \(V\), but not in \(M\), so \(M\) correctly (from its own perspective) sees \(\mathbb{R}^M\) as uncountable.
This is the foundational insight behind forcing: a Cohen generic real \(g\) is “new” relative to \(M\) because \(g \notin M\), even though from outside \(M\) (i.e., from \(V\)), \(g\) is just another element of \(2^\omega\). The model \(M[G]\) “knows about” \(g\) and is therefore a larger universe.
Section 28.4: Ehrenfeucht–Mostowski Models and Indiscernibles
Chapter 29: Applications of Set Theory in Mathematics
Section 29.1: Algebra — Whitehead Groups and Shelah’s Theorem
- Assuming V = L (or just \(\Diamond\)), every Whitehead group of cardinality \(\aleph_1\) is free.
- Assuming MA + \(\neg\)CH, there exists a non-free Whitehead group of cardinality \(\aleph_1\).
Section 29.2: Analysis — The Banach–Tarski Paradox
Section 29.3: Topology — The Normal Moore Space Conjecture
The Normal Moore Space Conjecture (NMSC) asserts that every normal Moore space is metrizable. It was a major open problem in topology from the 1930s to the 1980s.
- Under MA + \(\neg\)CH, every separable normal Moore space is metrizable (a partial result).
- Under certain forcing extensions (adding \(\aleph_2\) many Cohen reals to a model with a measurable cardinal), NMSC holds.
- Under \(V = L\), there exist non-metrizable normal Moore spaces (constructed using a \(\Diamond^+\)-sequence).
Section 29.4: Analysis — Every Set of Reals is Measurable (Solovay’s Model)
Chapter 30: Open Problems and Current Research Frontiers
Section 30.1: The Inner Model Problem
The central open problem in inner model theory is:
The Inner Model Problem for a Supercompact Cardinal: Construct a canonical inner model \(L[\vec{E}]\) that contains a supercompact cardinal, satisfies GCH, and has a fine structure theory analogous to Jensen’s fine structure for \(L\).
This is Woodin’s “Ultimate-\(L\)” program. The conjecture is:
- \(L_{\text{ult}} \models\) GCH.
- \(L_{\text{ult}}\) contains a supercompact cardinal.
- \(V = L_{\text{ult}}\) is \(\Omega\)-consistent with all large cardinal axioms (i.e., forcing over \(L_{\text{ult}}\) cannot produce large cardinals inconsistent with those already present).
- If \(V = L_{\text{ult}}\), then CH holds.
If the Ultimate-\(L\) conjecture is correct, it would resolve CH in favor of GCH — the universe of sets, at its canonical core, satisfies the Continuum Hypothesis. Woodin believes this program is on the verge of completion (as of 2020s), though the full construction remains incomplete.
Section 30.2: The \(\mathfrak{p} = \mathfrak{t}\) Theorem and Keisler’s Order
The theorem \(\mathfrak{p} = \mathfrak{t}\) (Malliaris–Shelah, 2016) was proved using an unexpected tool: Keisler’s order, a pre-order on theories based on the saturation of their ultrapowers.
The proof of \(\mathfrak{p} = \mathfrak{t}\) goes: the equality is equivalent to saying that certain ultrafilters (called “OK” ultrafilters) are exactly the regular ultrafilters. This equivalence is proved by analyzing the Keisler order position of the theory of dense linear orders, which is the minimum theory in Keisler’s order. The deep model theory of ultrafilters — specifically, the distinction between “good” and “OK” ultrafilters — turns out to capture exactly the combinatorial distinction between \(\mathfrak{p}\) and \(\mathfrak{t}\).
Section 30.3: The Cichoń Maximum
Section 30.4: The HOD Conjecture
If the HOD Conjecture holds, then HOD and \(V\) are “close” near the supercompact cardinal, and many set-theoretic questions become decidable. The conjecture would imply that assuming large cardinals, the set-theoretic universe converges toward a canonical form — contrary to the multiverse view.
Woodin has announced a proof of the HOD Conjecture from appropriate large cardinal hypotheses (as of early 2020s), but the details remain under verification by the community.
Section 30.5: Cardinal Arithmetic at Singular Cardinals
- Can \(\text{pp}(\aleph_\omega) = \aleph_{\omega_1}\)? (pp = pseudopower; consistent upper bounds of \(\aleph_{\omega_4}\) are known, but the true maximum is unclear.)
- Is it consistent that \(\aleph_\omega^{\aleph_0} = \aleph_{\omega+1}\) (i.e., GCH holds at \(\aleph_\omega\)) together with the failure of SCH at some other singular cardinal?
- What is the exact consistency strength of the failure of SCH at \(\aleph_{\omega_1}\)?
Final Remarks: The Unity of Set Theory
We close with a reflection on the remarkable unity of the material covered in this course. What appears at first glance to be a collection of unrelated topics — ordinals, cardinals, forcing, large cardinals, descriptive set theory — is in fact a tightly woven tapestry.
The ordinal hierarchy (Chapter 2) provides the backbone along which everything else is built. The cumulative hierarchy \(V = \bigcup_\alpha V_\alpha\) (Chapter 1) gives a stratification of all mathematical objects. The cardinal hierarchy (Chapter 3) measures the sizes of these levels and leads immediately to the central problem of the course: what is the relationship between \(\aleph_1\) and \(2^{\aleph_0}\)?
Forcing (Chapter 14) answers: both \(\aleph_1 = 2^{\aleph_0}\) (under GCH) and \(\aleph_2 = 2^{\aleph_0}\) (Cohen’s model) are consistent. But forcing is not just a tool for independence results — it is a universe-building machine, producing new models with rich properties.
Large cardinals (Chapters 10, 17) provide the consistency strength backbone: they are what makes forcing extensions “rich” in a controlled way, and they imply structural regularity properties (like determinacy for projective sets) that ZFC alone cannot provide.
Descriptive set theory (Chapters 11, 19, 26) makes this regularity concrete: under large cardinal axioms, all projective sets of reals are measurable, have the Baire property, and are determined. These are the precise mathematical consequences of placing the universe “high” in the large cardinal hierarchy.
Inner models (Chapter 13, 24) provide the “skeleton”: canonical minimal models that satisfy GCH and contain exactly as many large cardinals as prescribed. The core model program seeks to build, for each large cardinal, a canonical inner model that “explains” it — just as \(L\) explains the universe without large cardinals.
The forcing axioms (Chapter 18) represent another extreme: rather than restricting to minimal models, they assert that the universe is as “generic” as possible — as rich as if many generic objects had been added. MA, PFA, and MM push the universe toward a canonical “maximal” form, in opposition to the canonical “minimal” form of \(L\).
The deep conjecture of Woodin’s Ultimate-\(L\) program is that these two extremes converge: the “correct” universe satisfies both “enough large cardinals” and “GCH” — and therefore resolves all outstanding questions including CH. Whether this vision is realized will be the defining question of set theory in the coming decades.
This course has given you the tools to understand — and perhaps contribute to — this profound inquiry into the nature of mathematical infinity.
\[ \star \quad \star \quad \star \]Appendix F: Glossary of Key Terms
The following definitions provide quick reference for the major concepts introduced throughout the course.
Absolute formula. A formula \(\varphi\) is absolute for a transitive model \(M\) if \(\varphi\) holds in \(M\) exactly when it holds in \(V\). All \(\Delta_0\) formulas are absolute; \(\Sigma_1\) formulas are upward absolute; \(\Pi_1\) formulas are downward absolute.
Aleph numbers. The alephs \(\aleph_0, \aleph_1, \aleph_2, \ldots\) are the infinite cardinals in increasing order. \(\aleph_0 = \omega\) is the smallest infinite cardinal; \(\aleph_{\alpha+1} = \omega_{\alpha+1}\) is the successor of \(\aleph_\alpha\); for limit \(\lambda\), \(\aleph_\lambda = \sup_{\alpha < \lambda} \aleph_\alpha\).
Antichain. In a partial order, a set of pairwise incompatible elements. The countable chain condition (ccc) requires every antichain to be countable.
Beth numbers. The beths \(\beth_0 = \aleph_0\), \(\beth_{\alpha+1} = 2^{\beth_\alpha}\), and \(\beth_\lambda = \sup_{\alpha < \lambda} \beth_\alpha\) for limit \(\lambda\). Under GCH, \(\beth_\alpha = \aleph_\alpha\) for all \(\alpha\).
Cardinal. An ordinal \(\kappa\) not in bijection with any smaller ordinal. Every infinite cardinal is a limit ordinal. The finite cardinals are the natural numbers.
Closed unbounded (club). A subset \(C\) of a regular uncountable cardinal \(\kappa\) that is unbounded in \(\kappa\) and closed under limits below \(\kappa\).
Cofinality. The cofinality \(\text{cf}(\alpha)\) of a limit ordinal \(\alpha\) is the least ordinal \(\delta\) such that there is an unbounded function \(f: \delta \to \alpha\). A cardinal \(\kappa\) is regular if \(\text{cf}(\kappa) = \kappa\).
Condition. An element of a forcing poset. Conditions represent “finite approximations” to the generic object being added. Stronger conditions (lower in the order) provide more information.
Constructible universe (\(L\)). The smallest inner model of ZFC, defined by \(L = \bigcup_\alpha L_\alpha\) where \(L_0 = \emptyset\), \(L_{\alpha+1} = \text{Def}(L_\alpha)\) (definable subsets of \(L_\alpha\)), and \(L_\lambda = \bigcup_{\alpha < \lambda} L_\alpha\) for limits. \(L \models \text{GCH}\).
Dense set. In a forcing poset, a set \(D\) such that every condition has an extension in \(D\).
Elementary embedding. A map \(j: M \to N\) preserving all first-order formulas. An elementary embedding is a “truth-preserving” map between models. Large cardinals are characterized by the existence of non-trivial elementary embeddings.
Filter (on a poset). A non-empty upward-closed set closed under finite meets (greatest lower bounds). A generic filter meets every dense set in the ground model.
Forcing. Cohen’s technique for building new models of ZFC by adjoining a generic object \(G\) to a ground model \(M\), producing \(M[G]\). The forcing relation \(p \Vdash \varphi\) encodes what is “forced” to be true in \(M[G]\).
Generic filter. An \(M\)-generic filter for a poset \(\mathbb{P}\) is a filter \(G \subseteq \mathbb{P}\) meeting every dense set that is an element of \(M\).
HOD. Hereditarily ordinal definable sets: the inner model of all sets whose entire transitive closure consists of ordinal-definable sets. \(L \subseteq \text{HOD} \subseteq V\), and \(\text{HOD} \models \text{AC}\).
Inaccessible cardinal. An uncountable regular strong limit cardinal \(\kappa\): \(\text{cf}(\kappa) = \kappa\) and \(2^\lambda < \kappa\) for all \(\lambda < \kappa\). The level \(V_\kappa\) of the cumulative hierarchy satisfies all of ZFC.
Inner model. A transitive class \(M\) containing all ordinals and satisfying ZFC (with quantifiers restricted to \(M\)). Inner models include \(L\), \(\text{HOD}\), and models \(L[\vec{E}]\) built from extender sequences.
Large cardinal. A cardinal whose existence cannot be proved in ZFC but whose consistency is generally assumed. Large cardinals form a well-ordered hierarchy by consistency strength: inaccessible, Mahlo, weakly compact, measurable, strong, Woodin, supercompact, extendible, huge, rank-into-rank.
Measurable cardinal. A cardinal \(\kappa\) admitting a non-principal \(\kappa\)-complete ultrafilter. Equivalently, there is an elementary embedding \(j: V \to M\) with critical point \(\kappa\). Measurable cardinals imply \(V \neq L\) (Scott’s theorem).
Ordinal. A transitive set well-ordered by membership. The ordinals are \(0 = \emptyset\), \(1 = \{0\}\), \(2 = \{0,1\}\), \(\ldots, \omega, \omega+1, \ldots, \omega \cdot 2, \ldots, \omega^\omega, \ldots\). Each ordinal is the set of all smaller ordinals.
PCF (possible cofinalities). For a set \(A\) of regular cardinals with \(|A| < \min(A)\), \(\text{pcf}(A) = \{\text{cf}(\prod A / D) : D\text{ is an ultrafilter on }A\}\). PCF theory governs singular cardinal arithmetic.
Projective determinacy (PD). The assertion that every two-player game on \(\omega\) with a projective winning condition is determined (one player has a winning strategy). Follows from infinitely many Woodin cardinals; implies all projective sets are measurable and have the Baire property.
Rank. The rank of a set \(x\) is the least ordinal \(\alpha\) with \(x \in V_{\alpha+1}\). Every set has a rank by the Axiom of Regularity.
Regular cardinal. A cardinal \(\kappa\) with \(\text{cf}(\kappa) = \kappa\). Successor cardinals are always regular (under AC); limit cardinals may or may not be.
Stationary set. A subset \(S\) of a regular uncountable cardinal \(\kappa\) that intersects every club in \(\kappa\). The club filter on \(\kappa\) is dual to the non-stationary ideal.
Transitive set. A set \(M\) such that every element of an element of \(M\) is itself an element of \(M\): \(x \in y \in M \Rightarrow x \in M\). Transitive sets are the natural “building blocks” for models of set theory.
Ultrafilter. A maximal filter; equivalently, a filter \(\mathcal{U}\) on a set \(I\) such that for every \(A \subseteq I\), either \(A \in \mathcal{U}\) or \(I \setminus A \in \mathcal{U}\). Non-principal ultrafilters (whose existence requires AC) are the foundation of ultrapower constructions.
Well-ordering. A linear order in which every non-empty subset has a least element. The ordinals are canonical representatives for well-order types. By the Well-Ordering Theorem (equivalent to AC), every set can be well-ordered.
Woodin cardinal. An uncountable cardinal \(\delta\) such that for every function \(f: \delta \to \delta\), there is a cardinal \(\kappa < \delta\) closed under \(f\) and an elementary embedding \(j: V \to M\) with critical point \(\kappa\) and \(V_{j(f)(\kappa)} \subseteq M\). Infinitely many Woodin cardinals imply projective determinacy.
ZFC. Zermelo–Fraenkel set theory with the Axiom of Choice. The standard foundation for mathematics, consisting of the axioms of Extensionality, Empty Set, Pairing, Union, Power Set, Separation, Replacement, Infinity, Regularity, and Choice. All of ordinary mathematics can be formalized within ZFC.
These notes cover the material of PMATH 434 (Set Theory) at the University of Waterloo. The primary reference is Jech’s Set Theory (3rd ed., Springer, 2003). For forcing, see Kunen’s Set Theory or Chow’s A Beginner’s Guide to Forcing. For large cardinals, see Kanamori’s The Higher Infinite (2nd ed., Springer, 2003).