These notes synthesize material from L. Hörmander’s The Analysis of Linear Partial Differential Operators, A. Grigis and J. Sjöstrand’s Microlocal Analysis for Differential Operators, M. Zworski’s Semiclassical Analysis, and M.E. Taylor’s Pseudodifferential Operators, enriched with material from MIT OCW 18.157 (R. Melrose) and S. Dyatlov’s lecture notes.
Chapter 1: Distributions and Sobolev Spaces
The modern theory of partial differential equations rests on a broadened notion of “function” that permits differentiation of objects far more irregular than anything classical analysis can handle. Laurent Schwartz’s theory of distributions, developed in the late 1940s, provided the rigorous framework for this program. In this opening chapter, we set up the functional-analytic foundations: the spaces of test functions, distributions, tempered distributions, and the Sobolev spaces that measure regularity in a quantitative way. These tools are indispensable for everything that follows.
1.1 Test Functions and Distributions
We begin with the space of test functions and the distributions that act on them.
Definition 1.1.1 (Test Function Space). Let \(\Omega \subseteq \mathbb{R}^n\) be open. The space \(C_c^\infty(\Omega)\), also denoted \(\mathcal{D}(\Omega)\), consists of all infinitely differentiable functions \(\varphi : \Omega \to \mathbb{C}\) with compact support in \(\Omega\). We equip \(\mathcal{D}(\Omega)\) with the following topology: a sequence \(\varphi_j \to \varphi\) in \(\mathcal{D}(\Omega)\) if and only if there exists a compact set \(K \subset \Omega\) with \(\operatorname{supp} \varphi_j \subseteq K\) for all \(j\), and \(\partial^\alpha \varphi_j \to \partial^\alpha \varphi\) uniformly on \(K\) for every multi-index \(\alpha\).
This topology makes \(\mathcal{D}(\Omega)\) into a locally convex topological vector space, though not a Fréchet space (it is an LF-space, i.e., a strict inductive limit of Fréchet spaces). The precise topological structure, while important for the general theory, will mostly stay in the background; what matters for us is the notion of convergence described above.
Definition 1.1.2 (Distributions). A distribution on \(\Omega\) is a continuous linear functional \(u : \mathcal{D}(\Omega) \to \mathbb{C}\). The space of all distributions on \(\Omega\) is denoted \(\mathcal{D}'(\Omega)\). Continuity means that if \(\varphi_j \to \varphi\) in \(\mathcal{D}(\Omega)\), then \(\langle u, \varphi_j \rangle \to \langle u, \varphi \rangle\) in \(\mathbb{C}\).
Example 1.1.3. Every locally integrable function \(f \in L^1_{\mathrm{loc}}(\Omega)\) defines a distribution via
\[
\langle u_f, \varphi \rangle = \int_\Omega f(x) \varphi(x) \, dx, \quad \varphi \in \mathcal{D}(\Omega).
\]
The map \(f \mapsto u_f\) is injective (by the du Bois-Reymond lemma), so we identify \(L^1_{\mathrm{loc}}(\Omega)\) with a subspace of \(\mathcal{D}'(\Omega)\). Distributions that arise this way are called regular.
Example 1.1.4 (Dirac Delta). For \(x_0 \in \Omega\), the Dirac delta at \(x_0\) is the distribution defined by
\[
\langle \delta_{x_0}, \varphi \rangle = \varphi(x_0).
\]
This is not a regular distribution: there is no locally integrable function \(f\) such that \(\int f \varphi \, dx = \varphi(x_0)\) for all \(\varphi \in \mathcal{D}(\Omega)\).
Example 1.1.5 (Principal Value). The principal value distribution on \(\mathbb{R}\) is defined by
\[
\langle \mathrm{p.v.}\tfrac{1}{x}, \varphi \rangle = \lim_{\varepsilon \to 0^+} \int_{|x| > \varepsilon} \frac{\varphi(x)}{x} \, dx.
\]
One verifies that this limit exists for every \(\varphi \in \mathcal{D}(\mathbb{R})\) and defines a continuous linear functional.
1.2 Operations on Distributions
The power of distribution theory lies in the ability to extend classical operations — differentiation, multiplication by smooth functions, convolution — to this enlarged setting.
Definition 1.2.1 (Derivative of a Distribution). If \(u \in \mathcal{D}'(\Omega)\) and \(\alpha\) is a multi-index, the distributional derivative \(\partial^\alpha u\) is defined by
\[
\langle \partial^\alpha u, \varphi \rangle = (-1)^{|\alpha|} \langle u, \partial^\alpha \varphi \rangle, \quad \varphi \in \mathcal{D}(\Omega).
\]
This definition is motivated by integration by parts: if \(u\) is smooth with compact support, the formula above holds with equality. The key observation is that every distribution is infinitely differentiable in the distributional sense.
Example 1.2.2. On \(\mathbb{R}\), the Heaviside function \(H(x) = \mathbf{1}_{[0,\infty)}(x)\) has distributional derivative \(H' = \delta_0\). Indeed, for \(\varphi \in \mathcal{D}(\mathbb{R})\),
\[
\langle H', \varphi \rangle = -\langle H, \varphi' \rangle = -\int_0^\infty \varphi'(x) \, dx = \varphi(0) = \langle \delta_0, \varphi \rangle.
\]
Definition 1.2.3 (Multiplication by Smooth Functions). If \(u \in \mathcal{D}'(\Omega)\) and \(a \in C^\infty(\Omega)\), then \(au \in \mathcal{D}'(\Omega)\) is defined by
\[
\langle au, \varphi \rangle = \langle u, a\varphi \rangle, \quad \varphi \in \mathcal{D}(\Omega).
\]
Definition 1.2.5 (Convolution). If \(u \in \mathcal{D}'(\mathbb{R}^n)\) and \(\varphi \in \mathcal{D}(\mathbb{R}^n)\), the convolution \(u * \varphi\) is the smooth function
\[
(u * \varphi)(x) = \langle u, \varphi(x - \cdot) \rangle.
\]
More generally, if \(u \in \mathcal{E}'(\mathbb{R}^n)\) (compactly supported distribution) and \(v \in \mathcal{D}'(\mathbb{R}^n)\), the convolution \(u * v \in \mathcal{D}'(\mathbb{R}^n)\) is defined by
\[
\langle u * v, \varphi \rangle = \langle u_x, \langle v_y, \varphi(x+y) \rangle \rangle.
\]
Proposition 1.2.6 (Regularization). Let \(\rho \in \mathcal{D}(\mathbb{R}^n)\) with \(\int \rho = 1\), and set \(\rho_\varepsilon(x) = \varepsilon^{-n}\rho(x/\varepsilon)\). If \(u \in \mathcal{D}'(\mathbb{R}^n)\), then \(u * \rho_\varepsilon \to u\) in \(\mathcal{D}'(\mathbb{R}^n)\) as \(\varepsilon \to 0\).
To develop Fourier analysis in the distributional setting, we need a space of distributions adapted to the Fourier transform. This is the space of tempered distributions, introduced by Schwartz precisely for this purpose.
Definition 1.3.1 (Schwartz Space). The Schwartz space \(\mathcal{S}(\mathbb{R}^n)\) consists of all \(\varphi \in C^\infty(\mathbb{R}^n)\) such that
\[
\|\varphi\|_{\alpha,\beta} := \sup_{x \in \mathbb{R}^n} |x^\alpha \partial^\beta \varphi(x)| < \infty
\]
for all multi-indices \(\alpha, \beta\). The topology is generated by the family of seminorms \(\|\cdot\|_{\alpha,\beta}\), making \(\mathcal{S}(\mathbb{R}^n)\) a Fréchet space.
Definition 1.3.2 (Tempered Distributions). A tempered distribution is a continuous linear functional on \(\mathcal{S}(\mathbb{R}^n)\). The space of tempered distributions is denoted \(\mathcal{S}'(\mathbb{R}^n)\). We have the continuous inclusions
\[
\mathcal{D}(\mathbb{R}^n) \hookrightarrow \mathcal{S}(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n) \hookrightarrow \mathcal{D}'(\mathbb{R}^n).
\]
Definition 1.3.3 (Fourier Transform). For \(\varphi \in \mathcal{S}(\mathbb{R}^n)\), the Fourier transform is
\[
\hat{\varphi}(\xi) = \mathcal{F}\varphi(\xi) = \int_{\mathbb{R}^n} e^{-i x \cdot \xi} \varphi(x) \, dx.
\]
The inverse Fourier transform is
\[
\mathcal{F}^{-1}\psi(x) = (2\pi)^{-n} \int_{\mathbb{R}^n} e^{i x \cdot \xi} \psi(\xi) \, d\xi.
\]
Theorem 1.3.4 (Fourier Transform on \(\mathcal{S}\)). The Fourier transform \(\mathcal{F} : \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)\) is a topological isomorphism with inverse \(\mathcal{F}^{-1}\). Moreover:
(i) \(\mathcal{F}(\partial^\alpha \varphi)(\xi) = (i\xi)^\alpha \hat{\varphi}(\xi)\),
(ii) \(\mathcal{F}(x^\alpha \varphi)(\xi) = (i\partial_\xi)^\alpha \hat{\varphi}(\xi)\),
(iii) \(\mathcal{F}(\varphi * \psi) = \hat{\varphi} \cdot \hat{\psi}\),
(iv) (Parseval) \(\int \hat{\varphi} \overline{\hat{\psi}} \, d\xi = (2\pi)^n \int \varphi \overline{\psi} \, dx\).
Definition 1.3.5 (Fourier Transform of Tempered Distributions). For \(u \in \mathcal{S}'(\mathbb{R}^n)\), the Fourier transform \(\hat{u} \in \mathcal{S}'(\mathbb{R}^n)\) is defined by
\[
\langle \hat{u}, \varphi \rangle = \langle u, \hat{\varphi} \rangle, \quad \varphi \in \mathcal{S}(\mathbb{R}^n).
\]
Example 1.3.6. The Fourier transform of \(\delta_0\) is the constant function \(1\):
\[
\langle \hat{\delta}_0, \varphi \rangle = \langle \delta_0, \hat{\varphi} \rangle = \hat{\varphi}(0) = \int \varphi(x) \, dx = \langle 1, \varphi \rangle.
\]
Conversely, \(\hat{1} = (2\pi)^n \delta_0\).
1.4 Sobolev Spaces
Sobolev spaces provide the quantitative framework for measuring regularity of distributions. They are the natural \(L^2\)-based function spaces in which to study elliptic and more general PDE.
Definition 1.4.1 (Sobolev Spaces via Fourier Transform). For \(s \in \mathbb{R}\), the Sobolev space \(H^s(\mathbb{R}^n)\) consists of all \(u \in \mathcal{S}'(\mathbb{R}^n)\) such that
\[
\|u\|_{H^s}^2 := \int_{\mathbb{R}^n} (1 + |\xi|^2)^s |\hat{u}(\xi)|^2 \, d\xi < \infty.
\]
We write \(\langle \xi \rangle = (1 + |\xi|^2)^{1/2}\), so the norm becomes \(\|u\|_{H^s} = \|\langle \xi \rangle^s \hat{u}\|_{L^2}\).
Proposition 1.4.3 (Basic Properties).
(i) \(H^s(\mathbb{R}^n)\) is a Hilbert space with inner product \(\langle u, v \rangle_{H^s} = \int \langle \xi \rangle^{2s} \hat{u}(\xi) \overline{\hat{v}(\xi)} \, d\xi\).
(ii) If \(s > t\), then \(H^s(\mathbb{R}^n) \hookrightarrow H^t(\mathbb{R}^n)\) with continuous inclusion.
(iii) \(\mathcal{S}(\mathbb{R}^n)\) is dense in \(H^s(\mathbb{R}^n)\) for every \(s \in \mathbb{R}\).
(iv) The dual of \(H^s(\mathbb{R}^n)\) is isometrically isomorphic to \(H^{-s}(\mathbb{R}^n)\).
Definition 1.4.4 (Sobolev Spaces on Domains). For an open set \(\Omega \subseteq \mathbb{R}^n\), we define:
(i) \(H^s(\Omega) = \{ u|_\Omega : u \in H^s(\mathbb{R}^n) \}\) with the quotient norm,
(ii) \(H^s_0(\Omega)\) as the closure of \(C_c^\infty(\Omega)\) in \(H^s(\Omega)\).
1.5 Sobolev Embedding Theorems
The embedding theorems relate Sobolev regularity to classical regularity. They are among the most frequently used results in PDE theory.
Theorem 1.5.1 (Sobolev Embedding). If \(s > n/2 + k\) for a non-negative integer \(k\), then \(H^s(\mathbb{R}^n) \hookrightarrow C^k_b(\mathbb{R}^n)\) (bounded continuous functions with bounded derivatives up to order \(k\)), and the embedding is continuous:
\[
\|\partial^\alpha u\|_{L^\infty} \leq C_{s,n} \|u\|_{H^s}, \quad |\alpha| \leq k.
\]
In particular, \(\bigcap_{s \in \mathbb{R}} H^s(\mathbb{R}^n) = \mathcal{S}(\mathbb{R}^n)\) (as sets, though the topologies differ).
Proof. For \(|\alpha| \leq k\) and \(u \in \mathcal{S}(\mathbb{R}^n)\),
\[
\partial^\alpha u(x) = (2\pi)^{-n} \int e^{ix \cdot \xi} (i\xi)^\alpha \hat{u}(\xi) \, d\xi.
\]
By the Cauchy-Schwarz inequality,
\[
|\partial^\alpha u(x)| \leq (2\pi)^{-n} \int |\xi^\alpha| |\hat{u}(\xi)| \, d\xi = (2\pi)^{-n} \int \frac{|\xi^\alpha|}{\langle \xi \rangle^s} \cdot \langle \xi \rangle^s |\hat{u}(\xi)| \, d\xi
\]
\[
\leq (2\pi)^{-n} \left(\int \frac{|\xi|^{2k}}{\langle \xi \rangle^{2s}} \, d\xi\right)^{1/2} \|u\|_{H^s}.
\]
The first integral is finite precisely when \(2s - 2k > n\), i.e., \(s > n/2 + k\). The result extends from \(\mathcal{S}\) to \(H^s\) by density. \(\blacksquare\)
Theorem 1.5.2 (Rellich-Kondrachov Compactness). Let \(\Omega \subset \mathbb{R}^n\) be a bounded open set with Lipschitz boundary. If \(s > t\), then the inclusion \(H^s(\Omega) \hookrightarrow H^t(\Omega)\) is compact.
This compactness result is fundamental for spectral theory of elliptic operators and will appear repeatedly in later chapters.
1.6 Trace Theorems
When studying boundary value problems, one needs to restrict Sobolev functions to lower-dimensional submanifolds. The trace theorems make this precise.
Theorem 1.6.1 (Trace Theorem). Let \(\Omega \subset \mathbb{R}^n\) be a bounded open set with smooth boundary \(\partial\Omega\). For \(s > 1/2\), the restriction map \(\gamma_0 : C^\infty(\overline{\Omega}) \to C^\infty(\partial\Omega)\) defined by \(\gamma_0 u = u|_{\partial\Omega}\) extends to a continuous surjection
\[
\gamma_0 : H^s(\Omega) \to H^{s-1/2}(\partial\Omega),
\]
with a continuous right inverse (extension operator).
1.7 Duality of Sobolev Spaces
Theorem 1.7.1. The dual space of \(H^s_0(\Omega)\) is naturally identified with \(H^{-s}(\Omega)\), and the dual of \(H^s(\Omega)\) is identified with a space of distributions supported in \(\overline{\Omega}\). In the case \(\Omega = \mathbb{R}^n\), we have \((H^s(\mathbb{R}^n))' \cong H^{-s}(\mathbb{R}^n)\) with the duality pairing extending the \(L^2\) inner product.
These duality relations are essential for the weak formulation of PDE and for the theory of pseudodifferential operators on Sobolev spaces that we develop in Chapter 3.
Having set up the distributional framework, we now turn to the algebraic and analytic machinery that underlies the theory of pseudodifferential operators. The key idea, pioneered by Kohn-Nirenberg (1965) and Hörmander (1965), is to associate to a linear operator a function of both position and frequency — its symbol — and to study the operator through the properties of this symbol. This chapter develops the symbol calculus that makes this program precise.
2.1 Oscillatory Integrals
Many of the integrals we encounter will not converge absolutely; they must be interpreted as oscillatory integrals, regularized by the rapid oscillation of the integrand.
Definition 2.1.1 (Oscillatory Integral). Let \(a(x,\xi) \in S^m_{1,0}(\mathbb{R}^n \times \mathbb{R}^n)\) (to be defined below) and \(\Phi(x,\xi)\) be a phase function. The oscillatory integral
\[
I(\varphi) = \int\!\!\!\int e^{i\Phi(x,\xi)} a(x,\xi) \varphi(x) \, dx \, d\xi
\]
is defined as the limit
\[
I(\varphi) = \lim_{\varepsilon \to 0} \int\!\!\!\int e^{i\Phi(x,\xi)} a(x,\xi) \chi(\varepsilon\xi) \varphi(x) \, dx \, d\xi,
\]
where \(\chi \in C_c^\infty(\mathbb{R}^n)\) with \(\chi(0) = 1\). The limit is independent of the choice of \(\chi\).
\[
L = \frac{1 - i\xi \cdot \nabla_x \Phi}{|\nabla_x \Phi|^2 + |\xi|^2}(1 - \Delta_\xi)
\]
satisfies \(L^t(e^{i\Phi}) = e^{i\Phi}\) when the phase is non-degenerate, and repeated application of \(L^t\) to the amplitude \(a \cdot \chi(\varepsilon\xi) \cdot \varphi\) produces convergent integrals.
2.2 Symbol Classes
The symbol classes, introduced by Hörmander, provide the natural setting for the symbols of pseudodifferential operators.
Definition 2.2.1 (Symbol Classes \(S^m_{\rho,\delta}\)). Let \(m \in \mathbb{R}\) and \(0 \leq \delta \leq \rho \leq 1\). A smooth function \(a \in C^\infty(\mathbb{R}^n \times \mathbb{R}^n)\) belongs to the symbol class \(S^m_{\rho,\delta}(\mathbb{R}^n \times \mathbb{R}^n)\) if for all multi-indices \(\alpha, \beta\) there exists a constant \(C_{\alpha,\beta}\) such that
\[
|\partial_\xi^\alpha \partial_x^\beta a(x,\xi)| \leq C_{\alpha,\beta} \langle \xi \rangle^{m - \rho|\alpha| + \delta|\beta|}
\]
for all \(x, \xi \in \mathbb{R}^n\). We write \(S^m = S^m_{1,0}\) for the standard class and \(S^{-\infty} = \bigcap_m S^m\).
Example 2.2.3. The function \(a(x,\xi) = \langle \xi \rangle^m = (1 + |\xi|^2)^{m/2}\) belongs to \(S^m_{1,0}\). More generally, if \(p(x,\xi)\) is a polynomial of degree \(m\) in \(\xi\) with smooth coefficients bounded along with all their derivatives, then \(p \in S^m_{1,0}\).
Example 2.2.4. The symbol \(a(x,\xi) = |\xi|^2 + V(x)\), where \(V \in C^\infty_b(\mathbb{R}^n)\), belongs to \(S^2_{1,0}\). This is the symbol of the Schrödinger operator \(-\Delta + V(x)\).
2.3 Asymptotic Expansions
A central technique in the symbol calculus is the construction of symbols from asymptotic series.
Definition 2.3.1 (Asymptotic Expansion). Let \(a_j \in S^{m_j}_{\rho,\delta}\) with \(m_j \to -\infty\). We say that \(a \in S^{m_0}_{\rho,\delta}\) has the asymptotic expansion \(a \sim \sum_{j=0}^\infty a_j\) if for every \(N\),
\[
a - \sum_{j=0}^{N-1} a_j \in S^{m_N}_{\rho,\delta}.
\]
Theorem 2.3.2 (Borel's Lemma for Symbols). Given \(a_j \in S^{m_j}_{\rho,\delta}\) with \(m_j \to -\infty\), there exists \(a \in S^{m_0}_{\rho,\delta}\) with \(a \sim \sum_j a_j\). The symbol \(a\) is unique modulo \(S^{-\infty}\).
Proof. Choose a cutoff \(\chi \in C^\infty(\mathbb{R}^n)\) with \(\chi(\xi) = 0\) for \(|\xi| \leq 1\) and \(\chi(\xi) = 1\) for \(|\xi| \geq 2\). Set
\[
a(x,\xi) = \sum_{j=0}^\infty \chi(\xi/R_j) a_j(x,\xi),
\]
where \(R_j\) is a sequence increasing sufficiently rapidly to \(\infty\). For \(|\xi| \leq R_j\), the \(j\)-th term vanishes, so the sum is locally finite and defines a smooth function. One verifies that for sufficiently rapidly growing \(R_j\), the tail \(\sum_{j \geq N} \chi(\xi/R_j) a_j\) belongs to \(S^{m_N}_{\rho,\delta}\). The uniqueness modulo \(S^{-\infty}\) is immediate from the definition. \(\blacksquare\)
2.4 Classical Symbols
Many operators arising in geometric and physical applications have symbols with a particularly nice structure.
Definition 2.4.1 (Classical Symbols). A symbol \(a \in S^m_{1,0}\) is classical (or polyhomogeneous) if it admits an asymptotic expansion \(a \sim \sum_{j=0}^\infty a_{m-j}\), where each \(a_{m-j}(x,\xi)\) is positively homogeneous of degree \(m-j\) in \(\xi\) for \(|\xi| \geq 1\):
\[
a_{m-j}(x, t\xi) = t^{m-j} a_{m-j}(x,\xi), \quad t \geq 1, \; |\xi| \geq 1.
\]
The leading term \(a_m\) is called the principal symbol.
Example 2.4.2. The Laplacian \(\Delta = \sum_{j=1}^n \partial_{x_j}^2\) has symbol \(-|\xi|^2\), which is a classical symbol of order 2 with principal symbol \(\sigma_2(\Delta)(x,\xi) = -|\xi|^2\). More generally, any differential operator \(P = \sum_{|\alpha| \leq m} a_\alpha(x) D^\alpha\) (where \(D = -i\partial\)) has symbol \(p(x,\xi) = \sum_{|\alpha| \leq m} a_\alpha(x) \xi^\alpha\), which is a polynomial in \(\xi\) and hence classical.
2.5 The Schwartz Kernel Theorem
The Schwartz kernel theorem provides the bridge between operators and distributions, and is fundamental to the theory of integral operators.
Theorem 2.5.1 (Schwartz Kernel Theorem). Every continuous linear operator \(A : \mathcal{D}(\Omega_2) \to \mathcal{D}'(\Omega_1)\) has a unique distribution kernel \(K_A \in \mathcal{D}'(\Omega_1 \times \Omega_2)\) such that
\[
\langle Au, v \rangle = \langle K_A, v \otimes u \rangle
\]
for all \(u \in \mathcal{D}(\Omega_2)\), \(v \in \mathcal{D}(\Omega_1)\). Conversely, every \(K \in \mathcal{D}'(\Omega_1 \times \Omega_2)\) defines such an operator.
2.6 Fourier Integral Representation of Operators
We can now write down the general form of a pseudodifferential operator as a Fourier integral.
Definition 2.6.1. Given a symbol \(a \in S^m_{\rho,\delta}(\mathbb{R}^n \times \mathbb{R}^n)\), the associated operator \(\mathrm{Op}(a)\) (or \(a(x,D)\)) is defined by
\[
\mathrm{Op}(a)u(x) = (2\pi)^{-n} \int_{\mathbb{R}^n} e^{ix \cdot \xi} a(x,\xi) \hat{u}(\xi) \, d\xi
\]
for \(u \in \mathcal{S}(\mathbb{R}^n)\). Equivalently, using the Fourier inversion formula,
\[
\mathrm{Op}(a)u(x) = (2\pi)^{-n} \int\!\!\!\int e^{i(x-y) \cdot \xi} a(x,\xi) u(y) \, dy \, d\xi.
\]
The latter is an oscillatory integral.
Chapter 3: Pseudodifferential Operators
Pseudodifferential operators (abbreviated \(\Psi\)DOs) generalize differential operators by allowing symbols that are not polynomial in \(\xi\). They were introduced in the 1960s by Kohn-Nirenberg, Hörmander, and others to provide a flexible algebraic framework for studying elliptic PDE. The fundamental insight is that the class of pseudodifferential operators is closed under composition, taking adjoints, and — crucially — taking parametrices (approximate inverses) of elliptic operators.
3.1 Definition and Basic Properties
Definition 3.1.1 (Pseudodifferential Operator). An operator \(A : \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n)\) is a pseudodifferential operator of order \(m\) and type \((\rho,\delta)\) if it has the form
\[
Au(x) = \mathrm{Op}(a)u(x) = (2\pi)^{-n} \int\!\!\!\int e^{i(x-y)\cdot\xi} a(x,\xi) u(y) \, dy \, d\xi
\]
for some symbol \(a \in S^m_{\rho,\delta}\). We write \(A \in \Psi^m_{\rho,\delta}(\mathbb{R}^n)\), or simply \(A \in \Psi^m\) when \((\rho,\delta) = (1,0)\). The class \(\Psi^{-\infty} = \bigcap_m \Psi^m\) consists of smoothing operators.
Proposition 3.1.2. Every \(A \in \Psi^{-\infty}(\mathbb{R}^n)\) has a Schwartz kernel \(K_A \in \mathcal{S}(\mathbb{R}^n \times \mathbb{R}^n)\); in particular, \(A\) maps \(\mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)\). Conversely, any operator with Schwartz kernel in \(\mathcal{S}\) belongs to \(\Psi^{-\infty}\).
Theorem 3.1.3 (Mapping Properties). If \(A \in \Psi^m_{\rho,\delta}(\mathbb{R}^n)\) with \(\delta < 1\), then:
(i) \(A : \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)\) is continuous.
(ii) \(A\) extends to a continuous map \(A : \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n)\).
(iii) \(A : H^s(\mathbb{R}^n) \to H^{s-m}(\mathbb{R}^n)\) is continuous for every \(s \in \mathbb{R}\).
Part (iii) says that a \(\Psi\)DO of order \(m\) “costs” exactly \(m\) derivatives in the Sobolev scale, which is the quantitative embodiment of the notion of order.
3.2 The Symbol Map
The assignment \(a \mapsto \mathrm{Op}(a)\) is not quite injective: two symbols that differ by an element of \(S^{-\infty}\) give rise to operators that differ by a smoothing operator. The quotient gives a well-defined map.
Theorem 3.2.1 (Symbol Map). The map \(\sigma : \Psi^m_{\rho,\delta} / \Psi^{-\infty} \to S^m_{\rho,\delta} / S^{-\infty}\) that assigns to an operator its symbol class (modulo \(S^{-\infty}\)) is a well-defined isomorphism. The principal symbol \(\sigma_m(A) \in S^m / S^{m-(\rho-\delta)}\) is an invariantly defined function on \(T^*\mathbb{R}^n\) (or more generally on the cotangent bundle of a manifold).
3.3 Composition of Pseudodifferential Operators
The closure of the pseudodifferential calculus under composition is its most powerful algebraic property.
Theorem 3.3.1 (Composition). Let \(A \in \Psi^{m_1}_{\rho,\delta}(\mathbb{R}^n)\) with symbol \(a\) and \(B \in \Psi^{m_2}_{\rho,\delta}(\mathbb{R}^n)\) with symbol \(b\), where \(\delta < \rho\). Then \(AB \in \Psi^{m_1+m_2}_{\rho,\delta}(\mathbb{R}^n)\) with symbol \(c \in S^{m_1+m_2}_{\rho,\delta}\) given by
\[
c(x,\xi) \sim \sum_\alpha \frac{1}{\alpha!} \partial_\xi^\alpha a(x,\xi) \cdot D_x^\alpha b(x,\xi).
\]
In particular, the principal symbol of the composition is the product of the principal symbols:
\[
\sigma_{m_1+m_2}(AB) = \sigma_{m_1}(A) \cdot \sigma_{m_2}(B).
\]
Proof (Sketch). The Schwartz kernel of \(AB\) is
\[
K_{AB}(x,y) = \int K_A(x,z) K_B(z,y) \, dz.
\]
Substituting the oscillatory integral representations and performing a stationary phase expansion in the intermediate variable \(z\) yields the asymptotic formula. The key identity is
\[
e^{-iz \cdot \eta} a(x,\xi+\eta) = \sum_{|\alpha| < N} \frac{(-iz)^\alpha}{\alpha!} \partial_\xi^\alpha a(x,\xi) \cdot e^{-iz \cdot \eta} + r_N(x,z,\xi,\eta),
\]
applied via Taylor expansion. The remainder \(r_N\) contributes a term in \(S^{m_1+m_2-N(\rho-\delta)}\), which can be made arbitrarily negative. \(\blacksquare\)
3.4 Adjoint
Theorem 3.4.1 (Formal Adjoint). If \(A \in \Psi^m_{\rho,\delta}\) with symbol \(a\), then the formal \(L^2\)-adjoint \(A^*\) defined by \(\langle A^* u, v \rangle = \langle u, Av \rangle\) is a pseudodifferential operator \(A^* \in \Psi^m_{\rho,\delta}\) with symbol
\[
a^*(x,\xi) \sim \sum_\alpha \frac{1}{\alpha!} \partial_\xi^\alpha D_x^\alpha \overline{a(x,\xi)}.
\]
In particular, \(\sigma_m(A^*) = \overline{\sigma_m(A)}\).
Corollary 3.4.2. The principal symbol of a formally self-adjoint pseudodifferential operator is real-valued.
3.5 Elliptic Operators and Parametrices
Ellipticity is the key condition that allows one to “invert” a pseudodifferential operator modulo smoothing errors.
Definition 3.5.1 (Ellipticity). An operator \(A \in \Psi^m_{\rho,\delta}\) with symbol \(a\) is elliptic if there exist constants \(C, R > 0\) such that
\[
|a(x,\xi)| \geq C \langle \xi \rangle^m, \quad |\xi| \geq R.
\]
Equivalently, the principal symbol \(\sigma_m(A)(x,\xi) \neq 0\) for all \(x \in \mathbb{R}^n\) and \(\xi \neq 0\).
Example 3.5.2. The Laplacian \(\Delta\) has principal symbol \(-|\xi|^2\), which is elliptic. The operator \(I - \Delta\), with symbol \(1 + |\xi|^2\), is elliptic of order 2. The heat operator \(\partial_t - \Delta\), with symbol \(i\tau - |\xi|^2\) on \(\mathbb{R}^{n+1}\), is not elliptic (it vanishes when \(\tau = 0\) and \(\xi = 0\) in a degenerate way as a function on \(\mathbb{R}^{n+1}\)).
Theorem 3.5.3 (Parametrix Construction). Let \(A \in \Psi^m_{\rho,\delta}\) be elliptic with \(\delta < \rho\). Then there exists \(B \in \Psi^{-m}_{\rho,\delta}\) such that
\[
AB = I + R_1, \quad BA = I + R_2,
\]
where \(R_1, R_2 \in \Psi^{-\infty}\) are smoothing operators. The operator \(B\) is called a parametrix for \(A\).
Proof. We construct the symbol \(b\) of \(B\) by solving the asymptotic equation \(a \# b \sim 1\), where \(\#\) denotes the composition of symbols. Set \(b_0(x,\xi) = \chi(\xi) / a(x,\xi)\), where \(\chi \in C^\infty\) is a cutoff vanishing near the origin and equal to 1 for \(|\xi| \geq R\). Then \(b_0 \in S^{-m}_{\rho,\delta}\), and \(a \# b_0 = 1 + r_1\) with \(r_1 \in S^{-(\rho-\delta)}_{\rho,\delta}\).
We now set \(b_1 = -b_0 \cdot r_1 \in S^{-m-(\rho-\delta)}\) and iterate: having constructed \(b_0, \ldots, b_{N-1}\) such that \(a \# (b_0 + \cdots + b_{N-1}) = 1 + r_N\) with \(r_N \in S^{-N(\rho-\delta)}\), we set \(b_N = -b_0 \cdot r_N\). By Borel’s lemma (Theorem 2.3.2), there exists \(b \sim \sum_j b_j\) in \(S^{-m}\), and the corresponding operator \(B = \mathrm{Op}(b)\) satisfies \(AB = I + R_1\) with \(R_1 \in \Psi^{-\infty}\). A similar construction gives the left parametrix. \(\blacksquare\)
3.6 Elliptic Regularity
The parametrix immediately yields the fundamental regularity theorem for elliptic equations.
Theorem 3.6.1 (Elliptic Regularity). Let \(A \in \Psi^m\) be elliptic. If \(Au = f\) with \(f \in H^s_{\mathrm{loc}}\), then \(u \in H^{s+m}_{\mathrm{loc}}\). In particular, if \(f \in C^\infty\), then \(u \in C^\infty\).
Proof. Let \(B\) be a parametrix for \(A\), so \(BA = I + R\) with \(R \in \Psi^{-\infty}\). Then \(u = Bf - Ru\). Since \(f \in H^s_{\mathrm{loc}}\) and \(B \in \Psi^{-m}\), we have \(Bf \in H^{s+m}_{\mathrm{loc}}\). Since \(R\) is smoothing, \(Ru \in C^\infty\). Hence \(u = Bf - Ru \in H^{s+m}_{\mathrm{loc}}\). \(\blacksquare\)
3.7 The Calderón-Vaillancourt Theorem
Not every \(\Psi\)DO of order 0 is bounded on \(L^2\): this requires some care with the symbol class.
Theorem 3.7.1 (Calderón-Vaillancourt). If \(a \in S^0_{\rho,\rho}(\mathbb{R}^n \times \mathbb{R}^n)\) with \(0 \leq \rho < 1\), then \(\mathrm{Op}(a) : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n)\) is bounded, with
\[
\|\mathrm{Op}(a)\|_{L^2 \to L^2} \leq C \sum_{|\alpha+\beta| \leq N} \sup_{x,\xi} |\partial_\xi^\alpha \partial_x^\beta a(x,\xi)|,
\]
where \(N\) depends only on \(n\) and \(\rho\).
Proof (Sketch for \(\rho = 0\)). The case \(\rho = 0\) is the most classical. The symbol \(a \in S^0_{0,0}\) satisfies \(|\partial_\xi^\alpha \partial_x^\beta a| \leq C_{\alpha\beta}\) for all \(\alpha, \beta\). One decomposes the operator using a partition of unity in phase space (a Gabor-type decomposition) and estimates each piece. The Cotlar-Stein almost-orthogonality lemma then yields the \(L^2\)-boundedness. Specifically, if \(A = \sum_j A_j\) where the pieces satisfy
\[
\|A_i^* A_j\| \leq c(i-j)^2, \quad \|A_i A_j^*\| \leq c(i-j)^2,
\]
with \(\sum_j c(j) < \infty\), then \(\|A\| \leq \sum_j \sqrt{c(j)}\). \(\blacksquare\)
3.8 Gårding’s Inequality
Gårding’s inequality is the pseudodifferential version of coercivity, fundamental for energy estimates.
Theorem 3.8.1 (Sharp Gårding Inequality). Let \(A \in \Psi^m_{1,0}(\mathbb{R}^n)\) with symbol \(a\) satisfying \(\operatorname{Re} a(x,\xi) \geq 0\) for all \(x,\xi\). Then there exists \(C > 0\) such that
\[
\operatorname{Re} \langle Au, u \rangle \geq -C\|u\|_{H^{(m-1)/2}}^2
\]
for all \(u \in \mathcal{S}(\mathbb{R}^n)\).
Theorem 3.8.2 (Fefferman-Phong Inequality). Under the stronger hypothesis that \(a(x,\xi) \geq 0\) (real non-negative symbol of order \(m\)), one has the improved lower bound
\[
\operatorname{Re} \langle Au, u \rangle \geq -C\|u\|_{H^{(m-2)/2}}^2.
\]
This gains a full derivative over the sharp Gårding inequality and is optimal.
Chapter 4: Wavefront Sets and Microlocal Analysis
The wavefront set is the central concept of microlocal analysis. Introduced by Hörmander in 1970, it refines the classical notion of singular support by tracking not only where a distribution is singular but also in which codirections the singularity occurs. This refinement, living in the cotangent bundle rather than the base manifold, is the passage from local to microlocal analysis — and it transforms the study of PDE.
4.1 Singular Support
We begin with the classical notion, which the wavefront set refines.
Definition 4.1.1 (Singular Support). The singular support of a distribution \(u \in \mathcal{D}'(\Omega)\), denoted \(\operatorname{sing\,supp}(u)\), is the smallest closed set \(F \subseteq \Omega\) such that \(u|_{\Omega \setminus F} \in C^\infty(\Omega \setminus F)\). Equivalently,
\[
x_0 \notin \operatorname{sing\,supp}(u) \iff \exists \, \varphi \in C_c^\infty(\Omega) \text{ with } \varphi(x_0) \neq 0 \text{ and } \varphi u \in C^\infty(\Omega).
\]
Example 4.1.2. For the Dirac delta, \(\operatorname{sing\,supp}(\delta_0) = \{0\}\). For the Heaviside function \(H\), \(\operatorname{sing\,supp}(H) = \{0\}\) as well, since \(H\) is smooth away from the origin.
The singular support tells us where the singularity is, but not how it is oriented. Two distributions can have the same singular support but very different analytic behavior — for instance, \(\delta(x_1)\) (singular on the hyperplane \(\{x_1 = 0\}\)) and \(\delta(x_1) + \delta(x_2)\) have qualitatively different singularity structures that the singular support alone cannot distinguish.
4.2 The Wavefront Set
Definition 4.2.1 (Wavefront Set). Let \(u \in \mathcal{D}'(\Omega)\). A point \((x_0, \xi_0) \in \Omega \times (\mathbb{R}^n \setminus \{0\})\) is not in the wavefront set \(\mathrm{WF}(u)\) if there exist \(\varphi \in C_c^\infty(\Omega)\) with \(\varphi(x_0) \neq 0\) and an open cone \(\Gamma \subset \mathbb{R}^n \setminus \{0\}\) containing \(\xi_0\) such that for every \(N \in \mathbb{N}\),
\[
|\widehat{\varphi u}(\xi)| \leq C_N \langle \xi \rangle^{-N}, \quad \xi \in \Gamma.
\]
The wavefront set \(\mathrm{WF}(u) \subset T^*\Omega \setminus 0\) is the complement of the set of all such "microlocally regular" points.
The wavefront set is a closed conic subset of \(T^*\Omega \setminus 0\) (conic means invariant under positive scaling of the fiber variable \(\xi\)). It refines the singular support via the projection:
Proposition 4.2.2. The projection of \(\mathrm{WF}(u)\) onto the base \(\Omega\) equals \(\operatorname{sing\,supp}(u)\):
\[
\pi(\mathrm{WF}(u)) = \operatorname{sing\,supp}(u),
\]
where \(\pi : T^*\Omega \setminus 0 \to \Omega\) is the base projection.
Proof. If \(x_0 \notin \operatorname{sing\,supp}(u)\), then there exists \(\varphi\) with \(\varphi(x_0) \neq 0\) and \(\varphi u \in C^\infty_c\), so \(\widehat{\varphi u}\) is rapidly decreasing in all directions. Thus \((x_0, \xi_0) \notin \mathrm{WF}(u)\) for every \(\xi_0\), so \(x_0 \notin \pi(\mathrm{WF}(u))\). Conversely, if \(x_0 \notin \pi(\mathrm{WF}(u))\), one can cover \(S^{n-1}\) by finitely many open cones in which \(\widehat{\varphi u}\) is rapidly decreasing (after possibly shrinking the support of \(\varphi\)), yielding rapid decrease in all directions and hence \(\varphi u \in C^\infty\). \(\blacksquare\)
Example 4.2.3 (Wavefront Set of the Dirac Delta). For \(\delta_0 \in \mathcal{D}'(\mathbb{R}^n)\), we have \(\hat{\delta}_0 = 1\), which does not decay in any direction. Thus
\[
\mathrm{WF}(\delta_0) = \{(0,\xi) : \xi \in \mathbb{R}^n \setminus \{0\}\} = \{0\} \times (\mathbb{R}^n \setminus \{0\}).
\]
The singularity at the origin is "omnidirectional."
Example 4.2.4 (Wavefront Set of a Characteristic Function). Let \(\Omega \subset \mathbb{R}^n\) be a bounded open set with smooth boundary. Then
\[
\mathrm{WF}(\mathbf{1}_\Omega) = \{(x,\xi) : x \in \partial\Omega, \; \xi \perp T_x(\partial\Omega), \; \xi \neq 0\} = N^*(\partial\Omega) \setminus 0,
\]
the conormal bundle of the boundary (minus the zero section). The singularity is concentrated on the boundary and points in the normal direction.
Example 4.2.5 (Wavefront Set of \(\delta(x_1)\)). Consider \(\delta(x_1) \in \mathcal{D}'(\mathbb{R}^n)\), the distribution defined by \(\langle \delta(x_1), \varphi \rangle = \int \varphi(0, x') \, dx'\). Its Fourier transform in \(x_1\) is constant, so the singularity is in the \(\xi_1\)-direction:
\[
\mathrm{WF}(\delta(x_1)) = \{(0,x'; \xi_1, 0) : x' \in \mathbb{R}^{n-1}, \; \xi_1 \neq 0\}.
\]
This is precisely the conormal bundle of the hyperplane \(\{x_1 = 0\}\).
4.3 Microlocal Regularity and Pseudolocality
Theorem 4.3.1 (Microlocal Regularity of \(\Psi\)DOs). If \(A \in \Psi^m_{\rho,\delta}\) with \(\delta < \rho\), then
\[
\mathrm{WF}(Au) \subseteq \mathrm{WF}(u)
\]
for every \(u \in \mathcal{S}'(\mathbb{R}^n)\) (or \(u \in \mathcal{E}'(\mathbb{R}^n)\)). That is, a pseudodifferential operator does not create new singularities.
Proof (Sketch). If \((x_0, \xi_0) \notin \mathrm{WF}(u)\), we must show \((x_0, \xi_0) \notin \mathrm{WF}(Au)\). Choose \(\varphi\) localizing near \(x_0\) and a symbol \(b \in S^0\) that is 1 near \((x_0, \xi_0)\) (in a conic sense) and supported where \(u\) is microlocally regular. The key is to decompose \(A = A_1 + A_2\) where \(A_1\) has symbol supported in the region where \(u\) is microlocally smooth, so \(A_1 u\) is smooth near \((x_0,\xi_0)\), and \(A_2\) has symbol vanishing near \((x_0,\xi_0)\). A careful estimation using the symbol calculus completes the argument. \(\blacksquare\)
Corollary 4.3.2 (Pseudolocality). For \(A \in \Psi^m_{\rho,\delta}\) with \(\delta < \rho\),
\[
\operatorname{sing\,supp}(Au) \subseteq \operatorname{sing\,supp}(u).
\]
4.4 Microlocal Elliptic Regularity
The wavefront set allows a refinement of elliptic regularity to the microlocal level.
Definition 4.4.1 (Characteristic Set). For \(A \in \Psi^m\) with principal symbol \(\sigma_m(A)\), the characteristic set is
\[
\operatorname{Char}(A) = \{(x,\xi) \in T^*\mathbb{R}^n \setminus 0 : \sigma_m(A)(x,\xi) = 0\}.
\]
The operator \(A\) is elliptic precisely when \(\operatorname{Char}(A) = \emptyset\).
Theorem 4.4.2 (Microlocal Elliptic Regularity). Let \(A \in \Psi^m\) and \(Au = f\). Then
\[
\mathrm{WF}(u) \subseteq \mathrm{WF}(f) \cup \operatorname{Char}(A).
\]
In words: the wavefront set of a solution \(u\) can only contain points that are either in the wavefront set of the right-hand side or in the characteristic set of the operator.
Proof. Suppose \((x_0,\xi_0) \notin \mathrm{WF}(f) \cup \operatorname{Char}(A)\). Since \((x_0,\xi_0) \notin \operatorname{Char}(A)\), the principal symbol \(\sigma_m(A)\) is nonzero at \((x_0,\xi_0)\). We can construct a microlocal parametrix \(B \in \Psi^{-m}\) that inverts \(A\) microlocally near \((x_0,\xi_0)\): \(BA = I + R\) where \(R\) is microsupported away from \((x_0,\xi_0)\). Then
\[
u = Bf - Ru + Ru - Ru = Bf - Ru
\]
microlocally near \((x_0,\xi_0)\). Since \((x_0,\xi_0) \notin \mathrm{WF}(f)\) and \(B\) preserves wavefront sets, \((x_0,\xi_0) \notin \mathrm{WF}(Bf)\). And \(R\) is smoothing near \((x_0,\xi_0)\), so \((x_0,\xi_0) \notin \mathrm{WF}(Ru)\). Hence \((x_0,\xi_0) \notin \mathrm{WF}(u)\). \(\blacksquare\)
4.5 The Wavefront Set of a Product
One of the most important applications of the wavefront set is to the problem of multiplying distributions. As noted in Chapter 1, not all distributions can be multiplied; the wavefront set gives a precise criterion.
Theorem 4.5.1 (Hörmander's Criterion for Multiplication). Let \(u, v \in \mathcal{D}'(\Omega)\). If the following condition holds:
\[
(x, \xi) \in \mathrm{WF}(u) \text{ and } (x, \eta) \in \mathrm{WF}(v) \implies \xi + \eta \neq 0,
\]
then the product \(uv \in \mathcal{D}'(\Omega)\) is well-defined, and
\[
\mathrm{WF}(uv) \subseteq \{(x, \xi+\eta) : (x,\xi) \in \mathrm{WF}(u) \cup \{0\}, \; (x,\eta) \in \mathrm{WF}(v) \cup \{0\}, \; \xi+\eta \neq 0\}.
\]
Here \(\{0\}\) means we allow \(\xi = 0\) or \(\eta = 0\) (but not \(\xi + \eta = 0\)) in the union.
Example 4.5.3. Let \(u = \delta(x_1) \in \mathcal{D}'(\mathbb{R}^2)\) and \(v = \delta(x_2) \in \mathcal{D}'(\mathbb{R}^2)\). Then \(\mathrm{WF}(u) \subset \{(\xi_1, 0)\}\)-directions and \(\mathrm{WF}(v) \subset \{(0, \xi_2)\}\)-directions. Hörmander's condition is satisfied (these are never antipodal), and the product \(uv = \delta(x_1)\delta(x_2) = \delta_0\) is well-defined.
4.6 Propagation of Singularities
For operators that are not elliptic, singularities can propagate. The fundamental theorem describes this propagation.
Definition 4.6.1 (Bicharacteristic Flow). Let \(P \in \Psi^m\) have real principal symbol \(p = \sigma_m(P)\). The Hamilton vector field of \(p\) is
\[
H_p = \sum_{j=1}^n \left(\frac{\partial p}{\partial \xi_j} \frac{\partial}{\partial x_j} - \frac{\partial p}{\partial x_j} \frac{\partial}{\partial \xi_j}\right).
\]
A bicharacteristic is an integral curve of \(H_p\) lying in \(\operatorname{Char}(P) = \{p = 0\}\). Null bicharacteristics are bicharacteristic curves on which \(p = 0\).
Theorem 4.6.2 (Hörmander's Propagation of Singularities). Let \(P \in \Psi^m\) have real principal symbol \(p\). If \(Pu = f\), then \(\mathrm{WF}(u) \setminus \mathrm{WF}(f)\) is a union of maximally extended null bicharacteristics of \(P\) contained in \(\operatorname{Char}(P)\).
Example 4.6.4 (Wave Equation). For the wave operator \(P = \partial_t^2 - \Delta_x\) on \(\mathbb{R}^{1+n}\), the principal symbol is \(p(t,x;\tau,\xi) = -\tau^2 + |\xi|^2\). The Hamilton equations are
\[
\dot{t} = -2\tau, \quad \dot{x} = 2\xi, \quad \dot{\tau} = 0, \quad \dot{\xi} = 0.
\]
On \(\operatorname{Char}(P) = \{\tau^2 = |\xi|^2\}\), with \(\tau = \pm|\xi|\), these are straight lines in space-time with velocity \(\dot{x}/\dot{t} = -\xi/\tau = \mp \xi/|\xi|\), i.e., unit speed propagation. The propagation of singularities theorem says that the wavefront set of a solution to the wave equation propagates along these rays.
Chapter 5: Fourier Integral Operators
Fourier integral operators (FIOs) generalize pseudodifferential operators by allowing the phase function to be more general than the linear phase \((x-y) \cdot \xi\). They were introduced by Hörmander in 1971 to provide a coordinate-invariant framework for the propagation of singularities and to construct parametrices for hyperbolic equations. FIOs encode canonical transformations (symplectomorphisms) of the cotangent bundle, linking the symplectic geometry of classical mechanics to the analysis of PDEs. The theory draws heavily on the work of Maslov, Hörmander, and Duistermaat-Guillemin.
5.1 Symplectic Geometry and Lagrangian Submanifolds
The natural geometric setting for FIOs is symplectic geometry.
Definition 5.1.1 (Symplectic Structure). The cotangent bundle \(T^*\mathbb{R}^n\) carries the canonical symplectic form \(\omega = \sum_{j=1}^n d\xi_j \wedge dx_j\). A submanifold \(\Lambda \subset T^*\mathbb{R}^n\) of dimension \(n\) is Lagrangian if \(\omega|_\Lambda = 0\).
Example 5.1.2. The zero section \(\{(x,0) : x \in \mathbb{R}^n\}\) is Lagrangian. The conormal bundle \(N^*S = \{(x,\xi) : x \in S, \; \xi \perp T_xS\}\) of any smooth submanifold \(S \subset \mathbb{R}^n\) is Lagrangian. The graph of any exact 1-form \(\xi = d\phi(x)\) is Lagrangian.
Definition 5.1.3 (Canonical Relation). A canonical relation from \(T^*\mathbb{R}^{n_2}\) to \(T^*\mathbb{R}^{n_1}\) is a Lagrangian submanifold \(C \subset T^*\mathbb{R}^{n_1} \times T^*\mathbb{R}^{n_2}\) with respect to the twisted symplectic form \(\omega_1 \oplus (-\omega_2)\). In particular, the graph of any canonical transformation (symplectomorphism) \(\chi : T^*\mathbb{R}^n \to T^*\mathbb{R}^n\) is a canonical relation.
5.2 Generating Functions and Phase Functions
Definition 5.2.1 (Non-degenerate Phase Function). A phase function is a smooth real-valued function \(\Phi(x,y,\theta)\) defined on \(\mathbb{R}^{n_1} \times \mathbb{R}^{n_2} \times (\mathbb{R}^N \setminus \{0\})\) that is positively homogeneous of degree 1 in \(\theta\). It is non-degenerate if the differentials \(d(\partial\Phi/\partial\theta_1), \ldots, d(\partial\Phi/\partial\theta_N)\) are linearly independent on the critical set
\[
C_\Phi = \{(x,y,\theta) : \nabla_\theta \Phi = 0\}.
\]
Proposition 5.2.2. If \(\Phi\) is a non-degenerate phase function, then \(C_\Phi\) is a smooth manifold of dimension \(n_1 + n_2\), and the map
\[
C_\Phi \ni (x,y,\theta) \mapsto (x, \nabla_x\Phi; y, -\nabla_y\Phi) \in T^*\mathbb{R}^{n_1} \times T^*\mathbb{R}^{n_2}
\]
parametrizes a Lagrangian submanifold (with respect to the twisted symplectic form), which is the canonical relation associated to \(\Phi\).
Example 5.2.3. The phase function \(\Phi(x,y,\xi) = (x-y) \cdot \xi\) is non-degenerate, with critical set \(C_\Phi = \{x = y\}\) and associated canonical relation equal to the diagonal \(\{(x,\xi;x,\xi)\}\) — corresponding to the identity transformation. This is the phase of pseudodifferential operators.
5.3 Definition of Fourier Integral Operators
Definition 5.3.1 (Fourier Integral Operator). A Fourier integral operator of order \(m\) associated to a non-degenerate phase function \(\Phi(x,y,\theta)\) is an operator of the form
\[
Au(x) = \int\!\!\!\int e^{i\Phi(x,y,\theta)} a(x,y,\theta) u(y) \, dy \, d\theta,
\]
where \(a \in S^{m + n_1/2 - N/2}_{1,0}(\mathbb{R}^{n_1} \times \mathbb{R}^{n_2} \times \mathbb{R}^N)\) is an amplitude, interpreted as an oscillatory integral. We write \(A \in I^m(C)\), where \(C\) is the canonical relation generated by \(\Phi\).
Theorem 5.3.3 (Wavefront Set of FIOs). If \(A \in I^m(C)\) and \(u \in \mathcal{E}'(\mathbb{R}^{n_2})\), then
\[
\mathrm{WF}(Au) \subseteq C \circ \mathrm{WF}(u) := \{(x,\xi) : \exists (y,\eta) \in \mathrm{WF}(u), \; (x,\xi;y,\eta) \in C\}.
\]
That is, the FIO transforms the wavefront set according to its canonical relation.
5.4 Composition with Pseudodifferential Operators
Theorem 5.4.1. If \(A \in I^m(C)\) is an FIO and \(P \in \Psi^k\) is a pseudodifferential operator, then \(PA \in I^{m+k}(C)\) — the composition is again an FIO associated to the same canonical relation, with order increased by \(k\). Similarly, \(AP' \in I^{m+k'}(C)\) for \(P' \in \Psi^{k'}\).
This means that the class of FIOs associated to a given canonical relation forms a module over the ring of pseudodifferential operators — a crucial structural property.
5.5 Egorov’s Theorem
Egorov’s theorem is the microlocal version of the statement that conjugation by a unitary operator associated to a canonical transformation implements that transformation at the symbol level.
Theorem 5.5.1 (Egorov's Theorem). Let \(\chi : T^*\mathbb{R}^n \to T^*\mathbb{R}^n\) be a canonical transformation, and let \(U \in I^0(\operatorname{Graph}(\chi))\) be an elliptic FIO of order 0 with parametrix \(U^{-1} \in I^0(\operatorname{Graph}(\chi^{-1}))\). Then for any \(A \in \Psi^m\), the conjugated operator
\[
B = U^{-1} A U
\]
is a pseudodifferential operator \(B \in \Psi^m\), and its principal symbol satisfies
\[
\sigma_m(B) = \sigma_m(A) \circ \chi.
\]
Proof (Sketch). By Theorem 5.4.1, \(AU \in I^m(\operatorname{Graph}(\chi))\), and then \(U^{-1}(AU) \in I^m(\operatorname{Graph}(\chi^{-1}) \circ \operatorname{Graph}(\chi)) = I^m(\Delta)\), where \(\Delta\) is the diagonal canonical relation. But FIOs associated to the diagonal are precisely pseudodifferential operators, so \(B \in \Psi^m\). The formula for the principal symbol follows from tracking the symbol through the composition. \(\blacksquare\)
5.6 Application: Parametrix for the Wave Equation
We illustrate the power of FIO theory by constructing a parametrix for the wave equation.
Theorem 5.6.1 (Parametrix for the Wave Operator). Consider the Cauchy problem
\[
\begin{cases} (\partial_t^2 - \Delta) u = 0, \\ u(0,x) = f(x), \\ \partial_t u(0,x) = g(x). \end{cases}
\]
There exist Fourier integral operators \(E_0(t), E_1(t) \in I^0(\mathbb{R}^n)\) such that the solution is
\[
u(t,\cdot) = E_0(t)f + E_1(t)g + R(t)(f,g),
\]
where \(R(t)\) is a smoothing operator. The canonical relation of \(E_j(t)\) is the graph of the geodesic flow at time \(t\) (for the flat metric, this is the linear flow \((x,\xi) \mapsto (x + 2t\xi/|\xi|, \xi)\) on the characteristic set \(\{|\xi| = 1\}\), extended homogeneously).
Proof (Sketch). One constructs \(E_0(t)\) as an FIO with phase function
\[
\Phi_\pm(t,x,y,\xi) = (x - y) \cdot \xi \pm t|\xi|
\]
and amplitude \(a(t,x,y,\xi)\) determined by solving the transport equations obtained from applying the wave operator to the ansatz
\[
E_0(t)f(x) = \frac{1}{(2\pi)^n} \int\!\!\!\int \left(e^{i((x-y)\cdot\xi + t|\xi|)} a^+(t,x,\xi) + e^{i((x-y)\cdot\xi - t|\xi|)} a^-(t,x,\xi)\right) f(y) \, dy \, d\xi.
\]
The phase functions \(\Phi_\pm\) generate the forward and backward light cones. The transport equations, obtained by collecting terms of decreasing homogeneity in \(\xi\), can be solved iteratively, yielding an amplitude with an asymptotic expansion. The initial conditions \(u(0) = f\), \(\partial_t u(0) = g\) determine the split between the \(+\) and \(-\) contributions. \(\blacksquare\)
Chapter 6: Applications to PDE
With the machinery of pseudodifferential operators, wavefront sets, and Fourier integral operators in hand, we now demonstrate the power of microlocal analysis through applications to partial differential equations. These applications range from the foundational (elliptic regularity) to the sophisticated (propagation of singularities for the wave equation, analytic wavefront set). The microlocal perspective not only simplifies many classical arguments but also yields results that are inaccessible by other methods.
6.1 Elliptic Regularity Revisited
We have already proved elliptic regularity (Theorem 3.6.1) via parametrices. The microlocal viewpoint gives a sharper result.
Theorem 6.1.1 (Microlocal Elliptic Regularity, Refined). Let \(P \in \Psi^m(\mathbb{R}^n)\) be a classical pseudodifferential operator with principal symbol \(p_m\). If \(Pu = f\) and \((x_0, \xi_0) \notin \operatorname{Char}(P) \cup \mathrm{WF}(f)\), then \((x_0,\xi_0) \notin \mathrm{WF}(u)\). Furthermore, microlocally near \((x_0,\xi_0)\),
\[
u \equiv B f \pmod{C^\infty},
\]
where \(B\) is a microlocal parametrix with principal symbol \(1/p_m\).
This refined statement is immediately useful for equations that are elliptic only in some directions.
6.2 Hypoellipticity
An operator is hypoelliptic if it gains regularity, though perhaps not as much as an elliptic operator.
Definition 6.2.1. A linear differential operator \(P\) is hypoelliptic if \(\operatorname{sing\,supp}(Pu) = \operatorname{sing\,supp}(u)\) for all \(u \in \mathcal{D}'(\Omega)\). That is, \(Pu \in C^\infty(\Omega) \implies u \in C^\infty(\Omega)\).
Example 6.2.2. The heat operator \(P = \partial_t - \Delta_x\) on \(\mathbb{R}^{1+n}\) is hypoelliptic but not elliptic. Its characteristic set \(\operatorname{Char}(P) = \{\tau = 0, \xi = 0\}\) is nonempty (as a set in the projectivized cotangent bundle, it is the "direction" \(d\tau\)), yet solutions to \(Pu = 0\) are smooth. Hörmander proved that a constant-coefficient operator \(P(D)\) is hypoelliptic if and only if for every \(\alpha\), \(|P^{(\alpha)}(\xi)/P(\xi)| \to 0\) as \(|\xi| \to \infty\) (with \(\xi\) staying away from the zeros of \(P\)).
Theorem 6.2.3 (Hörmander's Sum of Squares). Let \(X_0, X_1, \ldots, X_r\) be smooth real vector fields on an open set \(\Omega \subseteq \mathbb{R}^n\), and consider the operator
\[
P = \sum_{j=1}^r X_j^2 + X_0 + c
\]
where \(c \in C^\infty(\Omega)\). If the Lie algebra generated by \(X_0, X_1, \ldots, X_r\) spans the tangent space at every point (the Hörmander bracket condition), then \(P\) is hypoelliptic.
6.3 The Heat Kernel via Pseudodifferential Methods
The pseudodifferential calculus provides an elegant construction of the heat kernel for elliptic operators.
Theorem 6.3.1. Let \(P \in \Psi^2(\mathbb{R}^n)\) be a classical, self-adjoint, elliptic operator that is positive (i.e., \(\langle Pu, u \rangle \geq c\|u\|_{H^1}^2 - C\|u\|_{L^2}^2\)). Then the heat semigroup \(e^{-tP}\) exists for \(t > 0\) as a smoothing operator, and its Schwartz kernel \(K_t(x,y)\) satisfies:
(i) \(K_t \in C^\infty(\mathbb{R}^+ \times \mathbb{R}^n \times \mathbb{R}^n)\),
(ii) \(K_t(x,y) \to \delta(x-y)\) as \(t \to 0^+\) in the sense of distributions,
(iii) For small \(t > 0\), \(K_t\) admits an asymptotic expansion on the diagonal:
\[
K_t(x,x) \sim (4\pi t)^{-n/2} \sum_{j=0}^\infty a_j(x) t^j \quad \text{as } t \to 0^+,
\]
where the coefficients \(a_j(x)\) are determined by the symbol of \(P\).
6.4 Propagation of Singularities for the Wave Equation
We now give a detailed treatment of the propagation theorem for the wave equation, using the machinery developed in Chapters 4 and 5.
Theorem 6.4.1 (Propagation of Singularities for the Wave Equation). Let \(P = D_t^2 - \sum_{j,k} \partial_{x_j}(g^{jk}(x)\partial_{x_k})\) be a wave operator with smooth metric \(g\), and suppose \(Pu = f\) on \(\mathbb{R} \times \mathbb{R}^n\). Then:
(i) \(\mathrm{WF}(u) \setminus \mathrm{WF}(f) \subset \operatorname{Char}(P) = \{\tau^2 = |\xi|_g^2\}\),
(ii) \(\mathrm{WF}(u) \setminus \mathrm{WF}(f)\) is invariant under the null bicharacteristic flow, i.e., it is a union of maximally extended null bicharacteristics.
Proof (Sketch). Part (i) is microlocal elliptic regularity (Theorem 4.4.2). For part (ii), the idea is to construct a microlocal energy estimate near the bicharacteristic. Let \(\gamma : [0,T] \to T^*(\mathbb{R}^{1+n}) \setminus 0\) be a null bicharacteristic of \(P\), and suppose \(\gamma(0) \notin \mathrm{WF}(u)\). We want to show \(\gamma(t) \notin \mathrm{WF}(u)\) for \(t \in [0,T]\).
\[
E(t) = \|B(t) D_t u\|_{L^2}^2 + \|B(t) \nabla_x u\|_{L^2}^2.
\]
One computes \(\frac{d}{dt}E(t)\) using the equation \(Pu = f\), the composition formula for \(\Psi\)DOs, and the positive commutator method (following Hörmander). The crucial point is that \([P, B(t)^*B(t)]\) has principal symbol \(H_p(|b(t)|^2)\), and by choosing \(B(t)\) to propagate along the flow, this commutator can be made non-positive (modulo lower-order terms). A Gronwall argument then yields \(E(t) = 0\) for all \(t\), hence \(\gamma(t) \notin \mathrm{WF}(u)\). \(\blacksquare\)
6.5 Duhamel’s Principle
Theorem 6.5.1 (Duhamel's Principle). Consider the inhomogeneous wave equation
\[
\begin{cases} (\partial_t^2 - \Delta) u = F(t,x), \quad t > 0, \\ u(0,x) = f(x), \quad \partial_t u(0,x) = g(x). \end{cases}
\]
If \(E_1(t)\) is the propagator for the homogeneous equation with data \(u(0) = 0\), \(\partial_t u(0) = g\) (so that \(u(t) = E_1(t)g\)), then the solution to the inhomogeneous problem is
\[
u(t) = E_0(t)f + E_1(t)g + \int_0^t E_1(t-s) F(s) \, ds.
\]
The FBI (Fourier-Bros-Iagolnitzer) transform is a tool for studying analytic regularity, bridging the gap between \(C^\infty\) microlocal analysis and analytic microlocal analysis.
Definition 6.6.1 (FBI Transform). The FBI transform of \(u \in \mathcal{E}'(\mathbb{R}^n)\) is
\[
Tu(x,\xi) = c_n \int_{\mathbb{R}^n} e^{i(x-y) \cdot \xi - |\xi|(x-y)^2/2} u(y) \, dy,
\]
where \(c_n\) is a normalization constant. This is a smooth function of \((x,\xi) \in \mathbb{R}^n \times \mathbb{R}^n\).
Theorem 6.6.2. A point \((x_0,\xi_0) \notin \mathrm{WF}(u)\) if and only if \(|Tu(x,\xi)|\) decays rapidly as \(|\xi| \to \infty\) in a neighborhood of \((x_0, \xi_0/|\xi_0|)\) with \(|\xi| \to \infty\) in the direction \(\xi_0\).
6.7 Analytic Wavefront Set
Definition 6.7.1 (Analytic Wavefront Set). The analytic wavefront set \(\mathrm{WF}_A(u)\) is defined by: \((x_0, \xi_0) \notin \mathrm{WF}_A(u)\) if there exist \(\varepsilon > 0\) and a neighborhood \(U\) of \((x_0, \xi_0/|\xi_0|)\) such that
\[
|Tu(x,\xi)| \leq C e^{-\varepsilon |\xi|}
\]
for \((x, \xi/|\xi|) \in U\) and \(|\xi|\) sufficiently large.
Chapter 7: Semiclassical Analysis
Semiclassical analysis studies the behavior of quantum-mechanical systems in the limit where Planck’s constant \(\hbar \to 0\) — or, mathematically, operators depending on a small parameter \(h\) that plays the role of \(\hbar\). This chapter introduces the semiclassical pseudodifferential calculus, where the symbol depends on \(h\) and the relevant notion of order tracks powers of \(h\) rather than growth in \(\xi\). Semiclassical analysis provides quantitative refinements of the qualitative results of microlocal analysis and has deep applications to spectral theory, scattering theory, and quantum ergodicity. The foundational contributions of Weyl, Egorov, Maslov, and more recently Zworski and Sjöstrand shape this field.
7.1 Semiclassical Pseudodifferential Operators
Definition 7.1.1 (Semiclassical Symbol Classes). For \(m \in \mathbb{R}\) and \(0 < h \leq h_0\), the semiclassical symbol class \(S^m_h(\mathbb{R}^{2n})\) consists of families \(\{a_h(x,\xi)\}_{h \in (0,h_0]}\) in \(C^\infty(\mathbb{R}^{2n})\) satisfying
\[
|\partial_x^\alpha \partial_\xi^\beta a_h(x,\xi)| \leq C_{\alpha,\beta} \langle \xi \rangle^{m - |\beta|}
\]
uniformly in \(h\). The class \(S^m_{\delta,h}\) allows \(h\)-dependent bounds:
\[
|\partial_x^\alpha \partial_\xi^\beta a_h(x,\xi)| \leq C_{\alpha,\beta} h^{-\delta(|\alpha|+|\beta|)} \langle \xi \rangle^{m - |\beta|}.
\]
Definition 7.1.2 (Semiclassical Quantization). The standard (Kohn-Nirenberg) semiclassical quantization of \(a \in S^m_h\) is
\[
\mathrm{Op}_h(a) u(x) = \frac{1}{(2\pi h)^n} \int\!\!\!\int e^{i(x-y)\cdot\xi/h} a(x,\xi) u(y) \, dy \, d\xi.
\]
This is related to the non-semiclassical quantization by the rescaling \(\xi \mapsto h\xi\): \(\mathrm{Op}_h(a) = \mathrm{Op}(a(x, h\cdot))\) after appropriate identification.
The factor of \(1/h\) in the phase reflects the fundamental semiclassical scaling: oscillations at frequency \(\sim 1/h\) in position space correspond to momenta of order 1 in the classical limit.
7.2 Weyl Quantization
The Weyl quantization, which evaluates the symbol at the midpoint, has special properties that make it the preferred quantization in semiclassical analysis.
Definition 7.2.1 (Weyl Quantization). The Weyl quantization of \(a \in S^m_h\) is
\[
\mathrm{Op}_h^w(a) u(x) = \frac{1}{(2\pi h)^n} \int\!\!\!\int e^{i(x-y)\cdot\xi/h} a\!\left(\frac{x+y}{2}, \xi\right) u(y) \, dy \, d\xi.
\]
Proposition 7.2.2 (Properties of Weyl Quantization).
(i) If \(a\) is real-valued, then \(\mathrm{Op}_h^w(a)\) is formally self-adjoint on \(L^2(\mathbb{R}^n)\).
(ii) Weyl and standard quantizations differ by lower-order terms: \(\mathrm{Op}_h^w(a) = \mathrm{Op}_h(a) + \mathcal{O}(h)\) as operators on \(L^2\).
(iii) The Weyl symbol of the composition \(\mathrm{Op}_h^w(a) \mathrm{Op}_h^w(b)\) is given by the Moyal product:
\[
(a \#_h b)(x,\xi) = e^{\frac{ih}{2}(\partial_{\xi_a} \cdot \partial_{x_b} - \partial_{x_a} \cdot \partial_{\xi_b})} a(x_a,\xi_a) b(x_b, \xi_b) \Big|_{\substack{x_a = x_b = x \\ \xi_a = \xi_b = \xi}}.
\]
In particular, \(a \#_h b = ab + \frac{h}{2i}\{a,b\} + \mathcal{O}(h^2)\), where \(\{a,b\} = \sum_j (\partial_{\xi_j}a \, \partial_{x_j}b - \partial_{x_j}a \, \partial_{\xi_j}b)\) is the Poisson bracket.
Example 7.2.4. The semiclassical Schrödinger operator \(P_h = -h^2 \Delta + V(x)\) is the Weyl quantization of the classical Hamiltonian \(p(x,\xi) = |\xi|^2 + V(x)\). The semiclassical principal symbol is thus the total energy of the classical system.
7.3 Semiclassical Wavefront Set
Definition 7.3.1 (Semiclassical Wavefront Set). Let \(\{u_h\}_{h \in (0,h_0]}\) be a bounded family in \(L^2(\mathbb{R}^n)\). The semiclassical wavefront set \(\mathrm{WF}_h(u_h) \subset T^*\mathbb{R}^n\) is the complement of the set of points \((x_0,\xi_0)\) such that there exists \(a \in C_c^\infty(T^*\mathbb{R}^n)\) with \(a(x_0,\xi_0) \neq 0\) and \(\|\mathrm{Op}_h(a) u_h\|_{L^2} = \mathcal{O}(h^\infty)\).
7.4 Egorov’s Theorem in the Semiclassical Regime
Theorem 7.4.1 (Semiclassical Egorov's Theorem). Let \(P_h = \mathrm{Op}_h^w(p)\) with real principal symbol \(p \in S^0_h\), and let \(U_h(t) = e^{-itP_h/h}\) be the semiclassical propagator. Let \(\Phi^t : T^*\mathbb{R}^n \to T^*\mathbb{R}^n\) be the Hamiltonian flow of \(p\). Then for any \(a \in S^0_h\),
\[
U_h(t)^* \mathrm{Op}_h^w(a) U_h(t) = \mathrm{Op}_h^w(a \circ \Phi^t) + \mathcal{O}(h) \quad \text{in } \mathcal{L}(L^2),
\]
for \(|t| \leq T\) (any fixed \(T\)). More precisely, the full symbol has an asymptotic expansion in powers of \(h\).
Proof (Sketch). Set \(A(t) = U_h(t)^* \mathrm{Op}_h^w(a) U_h(t)\). Then
\[
\frac{d}{dt} A(t) = \frac{i}{h} [P_h, A(t)].
\]
Writing \(A(t) = \mathrm{Op}_h^w(a_t)\), the Moyal product formula gives
\[
\frac{d}{dt} a_t = \{p, a_t\} + \mathcal{O}(h) = H_p(a_t) + \mathcal{O}(h),
\]
with initial condition \(a_0 = a\). To leading order, this is the transport equation along the Hamiltonian flow, solved by \(a_t^{(0)} = a \circ \Phi^t\). Higher-order corrections are obtained iteratively. \(\blacksquare\)
7.5 Quantum-Classical Correspondence
The semiclassical results above are manifestations of a broader principle: quantum mechanics converges to classical mechanics as \(h \to 0\).
Theorem 7.5.1 (Semiclassical Propagation of Wavefront Sets). Let \(u_h\) solve the semiclassical Schrödinger equation \(ih\partial_t u_h = P_h u_h\), where \(P_h = \mathrm{Op}_h^w(p)\) with real \(p\). Then the semiclassical wavefront set propagates according to the classical flow:
\[
\mathrm{WF}_h(u_h(t)) = \Phi^t(\mathrm{WF}_h(u_h(0))),
\]
where \(\Phi^t\) is the Hamiltonian flow of \(p\).
7.6 The Weyl Law
The Weyl law describes the asymptotic distribution of eigenvalues of elliptic operators and is one of the crown jewels of semiclassical analysis.
Theorem 7.6.1 (Weyl Law). Let \(P_h = -h^2\Delta + V(x)\) on a compact Riemannian manifold \((M,g)\) of dimension \(n\), where \(V \in C^\infty(M)\). Let \(N(E, h) = \#\{\lambda_j(h) \leq E\}\) count the number of eigenvalues of \(P_h\) below \(E\). If \(E\) is a regular value of the classical Hamiltonian \(p(x,\xi) = |\xi|_g^2 + V(x)\), then
\[
N(E,h) = \frac{1}{(2\pi h)^n} \operatorname{Vol}\{(x,\xi) \in T^*M : p(x,\xi) \leq E\} + \mathcal{O}(h^{1-n}).
\]
Equivalently, on a compact manifold without potential, the eigenvalue counting function \(N(\lambda) = \#\{\lambda_j \leq \lambda\}\) of \(-\Delta\) satisfies
\[
N(\lambda) = \frac{\omega_n \operatorname{Vol}(M)}{(2\pi)^n} \lambda^{n/2} + \mathcal{O}(\lambda^{(n-1)/2})
\]
as \(\lambda \to \infty\), where \(\omega_n\) is the volume of the unit ball in \(\mathbb{R}^n\).
Proof (Sketch). The proof proceeds in several steps.
Step 1: Tauberian argument. The eigenvalue counting function is related to the trace of the heat kernel or the spectral projector. One introduces a smoothed counting function \(\tilde{N}(E,h) = \operatorname{Tr}(f((P_h - E)/\delta))\) for a smooth approximation \(f\) to the characteristic function of \((-\infty, 0]\).
\[
f(P_h) = \frac{1}{\pi} \int_{\mathbb{C}} \bar{\partial}\tilde{f}(z) (z - P_h)^{-1} \, dL(z),
\]
where \(\tilde{f}\) is an almost-analytic extension of \(f\). The resolvent \((z - P_h)^{-1}\) is a semiclassical pseudodifferential operator for \(z\) away from the spectrum.
\[
\operatorname{Tr}(f(P_h)) = \frac{1}{(2\pi h)^n} \int_{T^*M} f(p(x,\xi)) \, dx \, d\xi + \mathcal{O}(h^{1-n}).
\]
The leading term is exactly the phase-space volume below \(E\), and the error is controlled by the remainder in the stationary phase expansion.
Step 4: De-smoothing. A Tauberian theorem (or direct comparison argument) allows one to pass from the smoothed counting function back to the sharp one, at the cost of a controlled error. \(\blacksquare\)
7.7 Applications to Spectral Theory
We conclude with some further applications of semiclassical analysis to spectral theory.
Theorem 7.7.1 (Quantum Ergodicity, Shnirelman-Zelditch-Colin de Verdière). Let \((M,g)\) be a compact Riemannian manifold such that the geodesic flow on the unit cosphere bundle \(S^*M\) is ergodic. Let \(\{u_j\}_{j=1}^\infty\) be an orthonormal basis of eigenfunctions of \(-\Delta_g\) with eigenvalues \(\lambda_j \to \infty\). Then there exists a subsequence \(\{u_{j_k}\}\) of density 1 (meaning \(\#\{k : j_k \leq N\}/N \to 1\) as \(N \to \infty\)) such that for every \(a \in C^\infty(S^*M)\),
\[
\langle \mathrm{Op}(a) u_{j_k}, u_{j_k} \rangle \to \frac{1}{\operatorname{Vol}(S^*M)} \int_{S^*M} a \, d\mu_L
\]
as \(k \to \infty\), where \(\mu_L\) is the Liouville measure.
Theorem 7.7.3 (Semiclassical Trace Formula). Let \(P_h = -h^2\Delta + V\) on a compact manifold, and let \(\chi \in C_c^\infty(\mathbb{R})\). Then
\[
\operatorname{Tr}(\chi(P_h)) = \frac{1}{(2\pi h)^n} \int_{T^*M} \chi(p(x,\xi)) \, dx \, d\xi + \sum_{k=1}^\infty h^k \int_{T^*M} c_k(x,\xi) \chi(p(x,\xi)) \, dx \, d\xi + \mathcal{O}(h^\infty),
\]
where the coefficients \(c_k\) are determined by the symbol of \(P_h\). The leading term is the Weyl term, and the subleading corrections encode geometric and dynamical information about the classical system.
7.8 Conclusion
The semiclassical perspective brings the entire subject full circle. The distributions and Sobolev spaces of Chapter 1 provided the analytic foundation; the symbol calculus of Chapters 2–3 gave us the algebraic tools; the wavefront set of Chapter 4 revealed the geometric structure of singularities; the FIOs of Chapter 5 encoded symplectic transformations; the PDE applications of Chapter 6 demonstrated the power of the framework; and now semiclassical analysis shows how all of these ideas fit together in the regime where analysis meets geometry and physics. The passage from local to microlocal — from base manifold to cotangent bundle — is not merely a technical refinement but a genuine change in perspective, one that has transformed the study of linear PDE and continues to drive research in spectral theory, mathematical physics, and beyond.