AMATH 350: Differential Equations for Business and Economics
Shahla Aliakbari
Estimated study time: 3 hr 7 min
Table of contents
Sources and References
Supplementary texts — W.E. Boyce & R.C. DiPrima, Elementary Differential Equations and Boundary Value Problems (11th ed., Wiley, 2017); M. Tenenbaum & H. Pollard, Ordinary Differential Equations (Dover, 1985); P. Wilmott, S. Howison & J. Dewynne, The Mathematics of Financial Derivatives (Cambridge, 1995); S.E. Shreve, Stochastic Calculus for Finance II: Continuous-Time Models (Springer, 2004)
Online resources — MIT OCW 18.03 Differential Equations; Paul’s Online Math Notes (tutorial.math.lamar.edu); Khan Academy Finance series
Chapter 1: First-Order Ordinary Differential Equations
The central object of this course is the differential equation — an equation that relates an unknown function to its own rates of change. Rather than asking “what is the value of some quantity?”, a differential equation asks “how does that quantity evolve?” This shift in perspective is enormously powerful: most natural and economic phenomena are described far more easily in terms of rates than in terms of static values. A bank account earns interest proportional to its balance. A market price moves toward equilibrium in response to excess demand. A population grows at a rate depending on how large it already is. In each case, the governing law is a differential equation, and the solution is the quantity itself as a function of time.
The simplest differential equations can be solved by direct integration. If \(\frac{dy}{dx} = f(x)\), then \(y = \int f(x)\,dx + C\). The constant \(C\) represents the freedom we have in specifying where the solution starts — it is pinned down only when we impose an initial condition \(y(x_0) = y_0\). The combination of a differential equation plus an initial condition is called an initial-value problem (IVP).
Before computing solutions, we should ask whether a solution even exists, and whether it is unique. The following foundational theorem answers both questions for first-order equations.
Proof sketch. Rewrite the IVP as the integral equation \(y(x) = y_0 + \int_{x_0}^x f(t,y(t))\,dt\). Define a sequence of approximations \(y_0(x) = y_0\), \(y_{n+1}(x) = y_0 + \int_{x_0}^x f(t,y_n(t))\,dt\). The Lipschitz condition on \(f\) (guaranteed by continuity of \(\partial f/\partial y\)) ensures this sequence converges uniformly to a unique fixed point.
The theorem tells us that pathological behaviour — solutions that blow up, split, or fail to exist — can only arise when the hypotheses fail, for instance when \(f\) is discontinuous or \(\partial f / \partial y\) is unbounded. In practice, the theorem covers nearly every equation we encounter in business and economics applications, and we can proceed to find solutions with confidence.
is differentiable everywhere and solves the IVP. This yields infinitely many distinct solutions from the same initial data, demonstrating that the Lipschitz hypothesis of Picard–Lindelöf is not a formality — it is genuinely necessary for uniqueness.
1.1 Separable Equations
Many important first-order ODEs have the form \(y' = f(x)\,g(y)\), where the right-hand side factors into a function of \(x\) alone times a function of \(y\) alone. Such equations are called separable because we can separate the variables onto opposite sides of the equation.
The separation step is formally justified by writing \(dy = g(y)\,f(x)\,dx\) and dividing through by \(g(y)\) (wherever \(g(y) \neq 0\)). Note that any value \(y = y^*\) with \(g(y^*) = 0\) is a constant (equilibrium) solution, and it should be checked separately.
This is separable. Separating variables:
\[ \frac{dP}{P(1 - P/K)} = r\,dt. \]Using partial fractions on the left side: \(\frac{1}{P(1-P/K)} = \frac{1}{P} + \frac{1/K}{1 - P/K}\). Integrating:
\[ \ln|P| - \ln\left|1 - \frac{P}{K}\right| = rt + C \implies \ln\frac{P}{K - P} = rt + C. \]Exponentiating and applying the initial condition \(P(0) = P_0\):
\[ P(t) = \frac{K}{1 + \left(\dfrac{K - P_0}{P_0}\right)e^{-rt}}. \]Numerical example: Let \(r = 0.1\,\text{yr}^{-1}\), \(K = 1000\), \(P_0 = 10\). Then the solution is
\[ P(t) = \frac{1000}{1 + 99\,e^{-0.1t}}. \]As \(t \to \infty\), \(e^{-0.1t} \to 0\), so \(P(t) \to 1000 = K\). The population converges to the carrying capacity regardless of initial size.
Time to reach 50% of carrying capacity: We want \(P(t_{1/2}) = 500\):
\[ 500 = \frac{1000}{1 + 99\,e^{-0.1t_{1/2}}} \implies 1 + 99\,e^{-0.1t_{1/2}} = 2 \implies e^{-0.1t_{1/2}} = \frac{1}{99}. \]Therefore \(t_{1/2} = \ln(99)/0.1 \approx 4.595/0.1 \approx 46\) years. The logistic model is widely used in market saturation models (a new technology reaches half its maximum market share), epidemic modelling (cumulative infections follow a logistic curve before herd immunity), and resource management.
1.2 First-Order Linear ODEs
Not all first-order equations are separable. The next most tractable class is first-order linear equations, which arise naturally in finance whenever the rate of change of a quantity depends linearly on both the quantity itself and an external forcing term.
The key technique for solving first-order linear ODEs is the integrating factor. The idea is to multiply both sides by a cleverly chosen function \(\mu(x)\) so that the left-hand side becomes a perfect derivative.
Proof sketch. Note that \(\mu'(x) = P(x)\mu(x)\). Then \(\frac{d}{dx}[\mu y] = \mu y' + \mu' y = \mu(y' + Py) = \mu Q\). The result follows by integration.
The integrating factor is \(\mu(t) = e^{-0.05t}\). Multiplying through and integrating from 0 to \(t\):
\[ e^{-0.05t}D(t) - D(0) = -100\int_0^t e^{-0.03s}\,ds = -\frac{100}{0.03}\!\left(1 - e^{-0.03t}\right). \]Solving for \(D(t)\):
\[ D(t) = D(0)\,e^{0.05t} + \frac{100}{0.03}\!\left(e^{0.05t} - e^{0.02t}\right). \]As \(t \to \infty\), since \(0.05 > 0.02\), the \(e^{0.05t}\) terms dominate and \(D(t) \to +\infty\), regardless of \(D(0)\). Despite growing revenues, debt grows without bound — the interest rate exceeds the revenue growth rate, so the debt-to-revenue ratio diverges. This is the mathematical expression of the Domar fiscal sustainability criterion: long-run debt stability requires the interest rate not to exceed the growth rate of the economy. The condition \(r > g\), made famous by Piketty, signals precisely this explosive dynamic.
1.3 Logistic Growth and Newton’s Law of Cooling
Not every important first-order ODE is linear. Two nonlinear equations that appear throughout economics, ecology, and applied science are the logistic equation and Newton’s law of cooling.
Using partial fractions: \(\frac{1}{P(1-P/K)} = \frac{1}{P} + \frac{1/K}{1 - P/K}\). Integrating both sides and applying the initial condition \(P(0) = P_0\):
\[ P(t) = \frac{K}{1 + \left(\dfrac{K - P_0}{P_0}\right)e^{-rt}}. \]The market is nearly fully saturated by year 50. Notice that the inflection point — the time of fastest growth — occurs when \(P = K/2 = 500\), i.e., when \(1 + 9e^{-0.2t^*} = 2\), giving \(t^* = \ln(9)/0.2 = 2.197/0.2 \approx 11\) years. After the inflection, growth decelerates as saturation approaches.
Economic interpretation: In innovation diffusion (Bass model), the logistic equation captures the S-shaped adoption curve observed in markets for consumer goods, telecommunications technology, and pharmaceuticals. The carrying capacity \(K\) represents potential market size; \(r\) reflects the effectiveness of word-of-mouth or marketing. Policymakers use this model to forecast when a market will approach saturation and plan capacity accordingly.
Numerical instance: A cup of coffee starts at \(T_0 = 90°\text{C}\), the room is at \(A = 20°\text{C}\), and the cooling constant is \(k = 0.1\,\text{min}^{-1}\). Then \(T(t) = 20 + 70\,e^{-0.1t}\).
At \(t = 10\) min: \(T(10) = 20 + 70e^{-1} = 20 + 70 \times 0.3679 \approx 20 + 25.8 = 45.8°\text{C}\).
Time to cool to \(50°\text{C}\): solve \(20 + 70e^{-0.1t} = 50\), so \(e^{-0.1t} = 30/70 = 3/7\), giving \(t = -10\ln(3/7) = 10\ln(7/3) \approx 10 \times 0.847 = 8.47\) minutes.
Economic analogy: The same equation governs the rate at which a price \(P(t)\) adjusts toward a new equilibrium level \(A\) following a shock — the “price adjustment speed” parameter \(k\) plays the role of the cooling constant. The exponential approach to equilibrium is the simplest model of market convergence.
1.4 Substitution Methods
Some equations are neither separable nor linear in their original form, but a change of variables transforms them into one of these tractable classes.
A Bernoulli equation has the form \(y' + P(x)y = Q(x)y^n\) for some \(n \neq 0, 1\). The substitution \(v = y^{1-n}\) linearises it: differentiating, \(v' = (1-n)y^{-n}y'\), and substituting into the Bernoulli equation yields a linear ODE for \(v\).
\[ v + xv' = f(v) \implies \int \frac{dv}{f(v) - v} = \int \frac{dx}{x}. \]Dimensional homogeneity is a useful modelling principle: every term in a physical or economic equation must have the same units. This constrains the form of equations and provides a check on algebra. For example, in an equation relating dollars, time, and rates, each additive term must have units of dollars (or dollars per unit time, consistently). Violations of dimensional homogeneity indicate errors in model formulation.
This has \(n = 2\). The substitution \(v = P^{1-2} = P^{-1} = 1/P\) linearises the equation. Differentiating: \(v' = -P^{-2}P'\). Dividing the Bernoulli equation by \(P^2\):
\[ P^{-2}P' + P^{-1} = 1 \implies -v' + v = 1 \implies v' - v = -1. \]This is a first-order linear ODE. The integrating factor is \(e^{-t}\):
\[ \frac{d}{dt}\!\left[e^{-t}v\right] = -e^{-t} \implies e^{-t}v = e^{-t} + C \implies v = 1 + Ce^t. \]Therefore \(P(t) = 1/(1 + Ce^t)\). Applying the initial condition \(P(0) = 2\): \(2 = 1/(1 + C)\), so \(1 + C = 1/2\), giving \(C = -1/2\). The solution is:
\[ P(t) = \frac{1}{1 - \tfrac{1}{2}e^t} = \frac{2}{2 - e^t}. \]Finite-time blowup: The denominator vanishes when \(e^t = 2\), i.e., at \(t^* = \ln 2 \approx 0.693\). The solution \(P(t) \to +\infty\) as \(t \to \ln 2^-\). The quadratic death term \(P^2\) (modelling e.g. competition for limited resources causing mass die-off, or runaway positive feedback) overwhelms the linear birth term, driving the population to infinity in finite time. This phenomenon — finite-time blowup — is impossible for linear ODEs but is generic in nonlinear equations with superlinear growth terms.
Chapter 2: Applications to Finance and Economics
The differential equations developed in Chapter 1 are not merely mathematical abstractions — they are the natural language for modelling how economic quantities evolve over time. This chapter develops several core applications, building intuition for how the choice of equation structure encodes economic assumptions.
2.1 Supply-Demand Dynamics
In introductory economics, supply and demand determine a static equilibrium price. But in reality, prices adjust over time in response to excess demand. A simple dynamic model captures this:
In a normal market, demand slopes downward (\(D' < 0\)) and supply slopes upward (\(S' > 0\)), so \(D'(P^*) - S'(P^*) < 0\). Setting \(\beta = -\alpha[D'(P^*) - S'(P^*)] > 0\), the linearised equation is \(dp/dt = -\beta p\), with solution \(p(t) = p(0)e^{-\beta t} \to 0\). The equilibrium is stable: prices converge exponentially to \(P^*\).
2.2 The Domar Model of Public Debt Sustainability
A classic application of first-order linear ODEs in macroeconomics is the Domar (1944) model of public debt dynamics. If the government runs a primary deficit that grows at rate \(g\) and the existing debt accumulates interest at rate \(r\), the evolution of the debt stock \(D(t)\) is:
\[ \frac{dD}{dt} = rD + G_0 e^{gt}, \]where \(G_0 e^{gt}\) is the (growing) primary deficit. This is a first-order linear ODE with \(P(t) = -r\) and \(Q(t) = G_0 e^{gt}\). The integrating factor is \(\mu(t) = e^{-rt}\).
Multiplying through: \(\frac{d}{dt}[e^{-rt}D] = G_0 e^{(g-r)t}\).
\[ e^{-rt}D = \frac{G_0}{g - r}e^{(g-r)t} + C. \]\[ D(t) = e^{rt}\left[D_0 - \frac{G_0}{g-r}\right] + \frac{G_0}{g-r}e^{gt}. \]Case 2: \(g = r\). Then \(\frac{d}{dt}[e^{-rt}D] = G_0\), giving \(D(t) = e^{rt}(D_0 + G_0 t)\) — debt grows faster than exponential.
The key quantity is the debt-to-GDP ratio \(d = D/Y\). If GDP grows at rate \(g\) (so \(Y = Y_0 e^{gt}\)), then \(d\) stabilises when the growth rate of \(D\) equals \(g\). Analysis of the solution shows:
- If \(r < g\): the term \(e^{rt}\) grows slower than \(e^{gt}\), and the deficit term dominates. The debt ratio stabilises at a finite level.
- If \(r > g\): the compounding interest on debt outpaces GDP growth, and the debt-to-GDP ratio grows without bound — debt is unsustainable.
This result — that \(r < g\) is the sustainability condition — has been central to debates about fiscal policy since Olivier Blanchard’s 2019 American Economic Association presidential lecture, in which he argued that for most advanced economies in recent decades, \(r < g\) has held, making some level of deficit spending sustainable.
2.3 The Evans Price Adjustment Model
\[ P'' + \alpha P' + \beta P = \gamma, \]this is a second-order constant-coefficient ODE, analysed in Chapter 3. The roots of the characteristic equation determine whether prices oscillate to equilibrium (complex roots with negative real part) or converge monotonically (real negative roots). The Evans model thus predicts price cycles in commodity markets as a natural consequence of inventory dynamics, not market irrationality.
2.4 The Solow Growth Model
One of the most influential models in macroeconomics is the Solow growth model, which describes how capital accumulates in an economy over time.
\[ \frac{dk}{dt} = sf(k) - (n + \delta)k. \]The steady state \(k^*\) satisfies \(sf(k^*) = (n+\delta)k^*\) — investment exactly offsets the dilution of capital from population growth and depreciation. Linearising near \(k^*\) shows the steady state is stable when \(sf'(k^*) < n + \delta\), which is guaranteed by the concavity of \(f\). This ODE is a Bernoulli equation when \(f(k) = k^\alpha\), and can be solved explicitly by the substitution \(v = k^{1-\alpha}\).
At \(t = 20\): \(k(20) = [3 - 2e^{-1}]^2 = [3 - 0.736]^2 = [2.264]^2 \approx 5.13\). At \(t = 100\): \(k(100) = [3 - 2e^{-5}]^2 \approx [3 - 0.013]^2 \approx 8.92\), close to the steady state \(k^* = 9\). The half-life of convergence is \(\ln(2)/0.05 \approx 13.9\) years — consistent with empirical estimates of convergence rates in cross-country growth regressions.
2.5 The IS-LM Model as a System of ODEs
The IS-LM model of macroeconomic equilibrium — the workhorse of undergraduate macro — can be cast as a dynamic system. Let \(Y(t)\) be national income and \(r(t)\) the interest rate. In a simple dynamic version:
\[ \frac{dY}{dt} = \alpha\left[I(r) + G - S(Y)\right], \qquad \frac{dr}{dt} = \beta\left[L(Y, r) - M\right], \]where \(I(r)\) is investment (decreasing in \(r\)), \(G\) government spending, \(S(Y)\) saving (increasing in \(Y\)), \(L(Y,r)\) money demand (increasing in \(Y\), decreasing in \(r\)), and \(M\) the money supply. Near the IS-LM equilibrium, linearising gives a 2D linear system whose eigenvalues determine whether adjustment is monotone or oscillatory. In typical parameterisations both eigenvalues are negative (stable node or spiral), so the economy converges to its IS-LM equilibrium.
Chapter 3: Higher-Order Linear ODEs
First-order equations describe systems with a single degree of freedom — one piece of initial data. Many physical and economic systems have richer dynamics: oscillations, resonance, and complex transients that require higher-order equations to capture.
A central result is that the solution set of the homogeneous equation forms a vector space of dimension \(n\): there exist \(n\) linearly independent solutions \(y_1, \ldots, y_n\), and every solution is a linear combination \(y = c_1 y_1 + \cdots + c_n y_n\). The general solution of the inhomogeneous equation is then \(y = y_h + y_p\), where \(y_h\) is the general homogeneous solution and \(y_p\) is any particular solution.
3.1 The Wronskian
The question of whether \(n\) solutions are linearly independent is answered by the Wronskian determinant.
Check that both are solutions: For \(y_1 = e^x\): \(y_1'' - 3y_1' + 2y_1 = e^x - 3e^x + 2e^x = 0\). For \(y_2 = e^{2x}\): \(y_2'' - 3y_2' + 2y_2 = 4e^{2x} - 6e^{2x} + 2e^{2x} = 0\). Both are indeed solutions.
Wronskian:
\[ W(e^x, e^{2x}) = \det \begin{pmatrix} e^x & e^{2x} \\ e^x & 2e^{2x} \end{pmatrix} = e^x \cdot 2e^{2x} - e^{2x} \cdot e^x = 2e^{3x} - e^{3x} = e^{3x}. \]Since \(W(x) = e^{3x} \neq 0\) for all \(x\), the two solutions are linearly independent. The general solution is therefore \(y = c_1 e^x + c_2 e^{2x}\).
Remark on Abel’s theorem: For a second-order ODE \(y'' + p(x)y' + q(x)y = 0\), Abel’s theorem guarantees \(W(x) = W(x_0)\exp(-\int_{x_0}^x p(t)\,dt)\). For our equation \(y'' - 3y' + 2y = 0\), \(p = -3\), so \(W(x) = W(0) e^{3x} = 1 \cdot e^{3x}\), confirming our direct calculation. Abel’s theorem shows that if \(W\) is nonzero at one point, it is nonzero everywhere on the interval — independence is all-or-nothing.
3.2 The Damped Oscillator: Three Cases
\[ y'' + 2\gamma y' + \omega_0^2 y = 0, \]\[ r = -\gamma \pm \sqrt{\gamma^2 - \omega_0^2}. \]The behaviour splits into three qualitatively different cases:
The solution decays to zero without oscillating — it is exponential decay with two time scales. The system returns to equilibrium sluggishly.
\[ y(t) = (c_1 + c_2 t)e^{-2t}. \]This decays to zero faster than the overdamped case (for the same \(\omega_0\)) and without oscillation. Critical damping is the threshold: it gives the fastest non-oscillatory return to equilibrium, and is the design target for shock absorbers and galvanometers.
\[ y(t) = e^{-t}\!\left(A\cos\!\sqrt{3}\,t + B\sin\!\sqrt{3}\,t\right). \]The solution oscillates at frequency \(\omega_d = \sqrt{\omega_0^2 - \gamma^2} = \sqrt{3} \approx 1.73\) (the damped natural frequency), with exponentially decaying amplitude \(e^{-t}\).
Economic interpretation (Cobweb model): In the cobweb model of agricultural markets, suppliers make production decisions based on last period’s price. The second-order ODE analogy captures the oscillatory price dynamics: overdamped markets have sluggish convergence (e.g., housing markets with slow supply response); underdamped markets exhibit price cycles (e.g., hog cycles in agricultural economics); critically damped markets would represent the ideal — fastest convergence without overshoot. The damping parameter \(\gamma\) is related to the relative slopes of supply and demand.
3.3 Resonance and Investment Cycles
3.5 Constant-Coefficient Homogeneous Equations
When the coefficients \(a_i\) are constants, we can solve the homogeneous equation completely by substituting the trial solution \(y = e^{rx}\).
- Distinct real roots \(r_1, \ldots, r_n\): general solution \(y = c_1 e^{r_1 x} + \cdots + c_n e^{r_n x}\).
- Repeated root \(r\) of multiplicity \(k\): contributes \((c_1 + c_2 x + \cdots + c_k x^{k-1})e^{rx}\).
- Complex conjugate pair \(\alpha \pm \beta i\): contributes \(e^{\alpha x}(A\cos\beta x + B\sin\beta x)\).
3.6 Method of Undetermined Coefficients
When the inhomogeneous term \(g(x)\) is a polynomial, exponential, or sinusoidal function (or a product of these), we can find a particular solution by guessing its form and determining the coefficients.
The guessing rules are:
- If \(g(x) = P_m(x)e^{ax}\) (polynomial of degree \(m\) times exponential), guess \(y_p = x^s Q_m(x) e^{ax}\) where \(s\) is the multiplicity of \(a\) as a root of the characteristic equation (0 if not a root).
- If \(g(x) = e^{ax}(P_m(x)\cos bx + R_k(x)\sin bx)\), guess \(y_p = x^s e^{ax}(Q_n(x)\cos bx + S_n(x)\sin bx)\) where \(n = \max(m,k)\) and \(s\) is the multiplicity of \(a+bi\) as a characteristic root.
The factor of \(x^s\) corrects for resonance — when the driving frequency matches a natural frequency of the homogeneous equation.
Computing derivatives:
\[ y_p' = C\cos(2x) + D\sin(2x) + x(-2C\sin(2x) + 2D\cos(2x)), \]\[ y_p'' = -2C\sin(2x) + 2D\cos(2x) - 2C\sin(2x) + 2D\cos(2x) + x(-4C\cos(2x) - 4D\sin(2x)). \]\[ y_p'' = -4C\sin(2x) + 4D\cos(2x) - 4x\!\left(C\cos(2x) + D\sin(2x)\right). \]Substituting into \(y_p'' + 4y_p\):
\[ y_p'' + 4y_p = -4C\sin(2x) + 4D\cos(2x) - 4x(\cdots) + 4x(\cdots) = -4C\sin(2x) + 4D\cos(2x). \]Setting this equal to \(\cos(2x)\): \(-4C = 0\) and \(4D = 1\), giving \(C = 0\), \(D = 1/4\). Therefore:
\[ y_p = \frac{x}{4}\sin(2x). \]The general solution is \(y(x) = A\cos(2x) + B\sin(2x) + \frac{x}{4}\sin(2x)\). The particular solution \(\frac{x}{4}\sin(2x)\) has amplitude growing linearly with \(x\) — there is no saturation. This is the signature of resonance: energy is pumped into the system at exactly the rate it can absorb, leading to unbounded oscillations.
3.7 Variation of Parameters
The method of undetermined coefficients works only for special forms of \(g(x)\). A more general technique, variation of parameters, works whenever the homogeneous solutions are known and \(g(x)\) is continuous.
Proof sketch. Assume \(y_p = u_1(x)y_1 + u_2(x)y_2\). Impose the constraint \(u_1' y_1 + u_2' y_2 = 0\) (to simplify \(y_p''\)). Substituting into the ODE yields \(u_1' y_1' + u_2' y_2' = g\). Solving this \(2\times 2\) system by Cramer's rule gives \(u_1' = -y_2 g/W\) and \(u_2' = y_1 g/W\). Integrating gives \(y_p\).
3.8 Reduction of Order
If one solution \(y_1\) of a homogeneous second-order ODE is known, a second linearly independent solution can always be found. The substitution \(y = y_1(x) v(x)\) reduces the ODE to a first-order equation for \(v'\). Setting \(w = v'\), we obtain a first-order linear ODE for \(w\), which can be solved by the integrating factor method. The second solution is \(y_2 = y_1 \int v'(x)\,dx\).
Chapter 4: Systems of Linear ODEs
Many economic models involve not one but several interacting quantities — prices in multiple markets, stocks of multiple goods, or components of a macroeconomic model — that evolve simultaneously. The natural framework is a system of first-order linear ODEs.
Any \(n\)-th order linear ODE can be converted to a first-order system of dimension \(n\) by introducing new variables for the successive derivatives. This means the theory of systems is fully general.
4.1 Converting a Higher-Order ODE to a System
Any \(n\)-th order linear ODE can be written as a first-order system by introducing state variables for the successive derivatives. This reduction is important both theoretically (it unifies the theory) and computationally (numerical solvers work with first-order systems).
This reduction clarifies why the three cases of the damped oscillator (Section 3.2) correspond to node, spiral, and centre/node in the phase portrait of the companion system: overdamped gives a stable node; critically damped gives a degenerate node; underdamped gives a stable spiral.
4.2 The Lotka-Volterra Trade Model
The eigenvalue method applies beyond ecological systems to economic models of international trade and comparative advantage.
Eigenvalues: \(\lambda = (-1.1 \pm \sqrt{1.21 - 0.72})/2 = (-1.1 \pm \sqrt{0.49})/2 = (-1.1 \pm 0.7)/2\). So \(\lambda_1 = -0.2\) and \(\lambda_2 = -0.9\).
Both eigenvalues are real and negative — a stable node. Both countries’ exports decay to zero in this model, reflecting a contractionary trade scenario. The eigenvector for \(\lambda_1 = -0.2\) (the slow mode) has the interpretation that a particular combination of \(X_1\) and \(X_2\) decays slowly, while the fast mode (eigenvector of \(\lambda_2 = -0.9\)) decays quickly. In a trade context, the slow mode represents a long-run structural imbalance that persists after the short-run adjustment has completed.
4.3 Homogeneous Systems: The Eigenvalue Method
\[ \lambda e^{\lambda t}\mathbf{v} = A e^{\lambda t}\mathbf{v} \implies A\mathbf{v} = \lambda\mathbf{v}. \]So \(\lambda\) must be an eigenvalue of \(A\) and \(\mathbf{v}\) a corresponding eigenvector.
For \(2 \times 2\) systems, the phase portrait gives qualitative insight: if both eigenvalues are real negative, trajectories converge to the origin (stable node); if real positive, they diverge (unstable node); if of opposite sign, the origin is a saddle (unstable); if complex with negative real part, a stable spiral; if purely imaginary, a centre (neutrally stable).
Characteristic polynomial: \(\det(A - \lambda I) = \det\begin{pmatrix} -\lambda & 1 \\ -2 & -3-\lambda \end{pmatrix} = \lambda(\lambda+3) + 2 = \lambda^2 + 3\lambda + 2 = (\lambda+1)(\lambda+2) = 0\).
Eigenvalues: \(\lambda_1 = -1\), \(\lambda_2 = -2\). Both negative — the origin is a stable node.
Eigenvectors: For \(\lambda_1 = -1\): \((A + I)\mathbf{v} = \begin{pmatrix}1&1\\-2&-2\end{pmatrix}\mathbf{v} = \mathbf{0}\), giving \(\mathbf{v}_1 = (1, -1)^T\).
For \(\lambda_2 = -2\): \((A + 2I)\mathbf{v} = \begin{pmatrix}2&1\\-2&-1\end{pmatrix}\mathbf{v} = \mathbf{0}\), giving \(\mathbf{v}_2 = (1, -2)^T\).
General solution:
\[ \mathbf{X}(t) = c_1 e^{-t}\begin{pmatrix}1\\-1\end{pmatrix} + c_2 e^{-2t}\begin{pmatrix}1\\-2\end{pmatrix}. \]Initial condition: At \(t = 0\): \(c_1 + c_2 = 1\) and \(-c_1 - 2c_2 = 0\). From the second equation \(c_1 = -2c_2\). Substituting: \(-2c_2 + c_2 = -c_2 = 1\), so \(c_2 = -1\) and \(c_1 = 2\). The solution is:
\[ x(t) = 2e^{-t} - e^{-2t}, \qquad y(t) = -2e^{-t} + 2e^{-2t}. \]Both components decay to zero. The fast mode (\(e^{-2t}\)) relaxes first, leaving the slow mode (\(e^{-t}\)) to dominate at large times. In an economic context where \(x\) and \(y\) are deviations of output and investment from equilibrium, the slow mode represents the long-run adjustment path toward equilibrium, while the fast mode represents rapid initial transients.
The characteristic polynomial is \((\lambda - 2)^2 = 0\), giving the repeated eigenvalue \(\lambda = 2\) with multiplicity 2.
Eigenvector: \((A - 2I)\mathbf{v} = \begin{pmatrix}0&1\\0&0\end{pmatrix}\mathbf{v} = \mathbf{0}\) gives \(\mathbf{v}_1 = (1, 0)^T\).
Generalised eigenvector: Solve \((A - 2I)\mathbf{w} = \mathbf{v}_1\):
\[ \begin{pmatrix}0&1\\0&0\end{pmatrix}\mathbf{w} = \begin{pmatrix}1\\0\end{pmatrix} \implies w_2 = 1. \]Taking \(w_1 = 0\): \(\mathbf{w} = (0, 1)^T\).
General solution: The two linearly independent solutions are \(e^{2t}\mathbf{v}_1\) and \(e^{2t}(t\mathbf{v}_1 + \mathbf{w})\):
\[ \mathbf{X}(t) = c_1 e^{2t}\begin{pmatrix}1\\0\end{pmatrix} + c_2 e^{2t}\!\left(t\begin{pmatrix}1\\0\end{pmatrix} + \begin{pmatrix}0\\1\end{pmatrix}\right) = e^{2t}\begin{pmatrix}c_1 + c_2 t \\ c_2\end{pmatrix}. \]The \(c_2 t\,e^{2t}\) term grows super-exponentially relative to pure exponential growth — the repeated eigenvalue introduces polynomial growth multiplying the exponential. In economics, such dynamics can model a linearly escalating instability (e.g., a hyperinflation spiral where the rate of inflation itself accelerates linearly before the eventual stabilisation).
4.4 Inhomogeneous Systems: Variation of Parameters
\[ \mathbf{X}_p(t) = \Phi(t)\int \Phi^{-1}(t)\mathbf{F}(t)\,dt. \]The general solution is \(\mathbf{X} = \Phi(t)\mathbf{c} + \mathbf{X}_p(t)\), where \(\mathbf{c}\) is a constant vector determined by initial conditions.
4.5 The Samuelson Multiplier-Accelerator Model
The Samuelson (1939) multiplier-accelerator model of business cycles generates oscillatory output through the interaction of consumption multiplier effects and investment accelerator effects. In continuous time, if \(Y(t)\) is aggregate income, \(c \in (0,1)\) the marginal propensity to consume, and \(a > 0\) the accelerator coefficient:
\[ Y'' - (c - 1 + a)Y' + (1 - c)Y = G_0, \]where \(G_0\) is autonomous government expenditure. This is a second-order constant-coefficient ODE with forcing. The characteristic roots are:
\[ r = \frac{(c - 1 + a) \pm \sqrt{(c-1+a)^2 - 4(1-c)}}{2}. \]Since the real part \(0.15 > 0\), the homogeneous solution grows: \(Y_h(t) = e^{0.15t}(A\cos(0.421t) + B\sin(0.421t))\). The output oscillates with exponentially growing amplitude — an unstable business cycle in this parameterisation. The period of oscillation is \(2\pi/0.421 \approx 14.9\) years. Samuelson showed that by tuning \(c\) and \(a\), one can obtain damped cycles (stable spiral), explosive cycles (unstable spiral), or exact periodicity (\(\text{Re}(r) = 0\)). The model thus provides a theoretical basis for the observation that capitalist economies exhibit persistent but irregular business cycles.
Chapter 5: Introduction to Partial Differential Equations
So far, all functions have depended on a single independent variable. Many phenomena in finance and physics — heat diffusion, wave propagation, option pricing — involve functions of two or more variables, and the equations governing them involve partial derivatives. These are partial differential equations (PDEs).
The new feature of PDEs, compared to ODEs, is that initial/boundary data must be specified along curves or surfaces, and the structure of the solution depends crucially on the geometry of these data sets relative to the equation’s characteristics.
5.1 First-Order Linear PDEs and the Method of Characteristics
The simplest first-order linear PDE in two variables is \(a u_x + b u_t = c\), where \(a, b, c\) may be functions of \(x\), \(t\), and \(u\). The method of characteristics converts this PDE into a system of ODEs.
5.2 Classification of Second-Order Linear PDEs
\[ A u_{xx} + B u_{xt} + C u_{tt} + D u_x + E u_t + Fu = G. \]- Elliptic (\(\Delta < 0\)): e.g., Laplace's equation \(u_{xx} + u_{yy} = 0\). Models steady-state phenomena. Solutions are smooth.
- Parabolic (\(\Delta = 0\)): e.g., the heat equation \(u_t = \kappa u_{xx}\). Models diffusion. Has a preferred time direction.
- Hyperbolic (\(\Delta > 0\)): e.g., the wave equation \(u_{tt} = c^2 u_{xx}\). Models wave propagation. Has two families of real characteristics.
This classification is not merely taxonomic — it determines which boundary conditions are appropriate, what solution methods apply, and what physical behaviours are possible. The Black-Scholes equation, as we will see, is parabolic, which is why its solution methods mirror those of the heat equation.
5.3 The Heat Equation
The heat equation \(u_t = \kappa u_{xx}\) (where \(\kappa > 0\) is the diffusivity) models the spread of heat in a rod or, more abstractly, the diffusion of any quantity — concentration of a chemical, probability distributions in stochastic processes.
\[ \frac{T'}{T\kappa} = \frac{X''}{X} = -\lambda \quad (\text{separation constant}). \]\[ u(x,t) = \sum_{n=1}^\infty b_n \sin\!\left(\frac{n\pi x}{L}\right) e^{-\kappa (n\pi/L)^2 t}, \]where the coefficients \(b_n = \frac{2}{L}\int_0^L f(x)\sin(n\pi x/L)\,dx\) are determined by the initial condition \(u(x,0) = f(x)\). The exponential decay factor ensures the solution smooths out and decays to zero as \(t \to \infty\).
Integrating by parts with \(u = x\), \(dv = \sin(nx)\,dx\):
\[ \int_0^\pi x\sin(nx)\,dx = \left[-\frac{x\cos(nx)}{n}\right]_0^\pi + \frac{1}{n}\int_0^\pi \cos(nx)\,dx = -\frac{\pi\cos(n\pi)}{n} + \frac{1}{n}\left[\frac{\sin(nx)}{n}\right]_0^\pi. \]Since \(\sin(n\pi) = 0\) for all integers \(n\):
\[ \int_0^\pi x\sin(nx)\,dx = -\frac{\pi\cos(n\pi)}{n} = \frac{\pi(-1)^{n+1}}{n}. \]Therefore:
\[ b_n = \frac{2}{\pi}\cdot\frac{\pi(-1)^{n+1}}{n} = \frac{2(-1)^{n+1}}{n}. \]First coefficient: \(b_1 = 2(-1)^2/1 = 2\). Second coefficient: \(b_2 = 2(-1)^3/2 = -1\). Third: \(b_3 = 2/3\). The full solution to the heat equation with \(f(x) = x\) is:
\[ u(x,t) = \sum_{n=1}^\infty \frac{2(-1)^{n+1}}{n}\sin(nx)\,e^{-n^2 t}. \]At \(t = 0\), this recovers the Fourier sine series of \(x\) on \([0, \pi]\). As \(t\) increases, the higher modes (large \(n\)) decay much faster than the fundamental (\(n=1\)) because the decay rate is \(e^{-n^2 t}\), so the solution rapidly becomes dominated by the first term \(2e^{-t}\sin(x)\). The “sawtooth” initial profile is smoothed into a single half-arch.
This is an exact closed-form solution — no approximation is made.
Interpretation: The first mode \(e^{-t}\sin(x)\) decays with time constant \(\tau_1 = 1\). The second mode \(0.5\,e^{-4t}\sin(2x)\) decays four times faster, with time constant \(\tau_2 = 1/4\). By \(t = 1\), the first mode has lost a factor \(e^{-1} \approx 0.37\) of its amplitude, while the second has lost a factor \(e^{-4} \approx 0.018\) — it is nearly gone. At large times, the solution is well approximated by just the first mode: \(u(x,t) \approx e^{-t}\sin(x)\).
5.4 Steady-State Solutions of the Heat Equation
\[ u_{xx} = 0. \]This result is the one-dimensional version of Laplace’s equation. The steady state represents the long-time behaviour of the heat equation: regardless of the initial condition, the solution converges to \(u_{ss}(x)\) as \(t \to \infty\) (assuming fixed, nonzero boundary conditions).
Chapter 6: The Fourier Transform
Separation of variables works beautifully on a finite interval, but many problems — including the Black-Scholes equation — are posed on the entire real line. The right tool for these problems is the Fourier Transform, which converts the spatial PDE into an ODE in time, exactly as the Laplace transform converts ODEs into algebraic equations.
The Fourier Transform has several key properties that make it ideal for solving constant-coefficient linear PDEs:
- Linearity: \(\widehat{af + bg} = a\hat{f} + b\hat{g}\).
- Derivative rule: \(\widehat{f^{(n)}}(\omega) = (i\omega)^n \hat{f}(\omega)\). In particular, differentiation in \(x\) corresponds to multiplication by \(i\omega\) in frequency space.
- Convolution theorem: if \(h(x) = \int f(x-y)g(y)\,dy\), then \(\hat{h}(\omega) = \hat{f}(\omega)\hat{g}(\omega)\). Convolution in \(x\)-space is multiplication in \(\omega\)-space.
By definition:
\[ \hat{f}(\omega) = \int_{-\infty}^{\infty} e^{-x^2} e^{-i\omega x}\,dx = \int_{-\infty}^{\infty} e^{-(x^2 + i\omega x)}\,dx. \]Complete the square in the exponent: \(x^2 + i\omega x = \left(x + \frac{i\omega}{2}\right)^2 + \frac{\omega^2}{4} - \frac{(i\omega)^2}{4}\). Wait — more carefully: \(x^2 + i\omega x = \left(x + \frac{i\omega}{2}\right)^2 - \left(\frac{i\omega}{2}\right)^2 = \left(x + \frac{i\omega}{2}\right)^2 + \frac{\omega^2}{4}\). Therefore:
\[ \hat{f}(\omega) = e^{-\omega^2/4}\int_{-\infty}^{\infty} \exp\!\left(-\left(x + \frac{i\omega}{2}\right)^2\right)dx. \]The integral \(\int_{-\infty}^{\infty} e^{-u^2}\,du = \sqrt{\pi}\) (the Gaussian integral) holds for complex shifts by analyticity (contour deformation). Therefore:
\[ \mathcal{F}\{e^{-x^2}\}(\omega) = \sqrt{\pi}\,e^{-\omega^2/4}. \]This result is fundamental: the Fourier transform of a Gaussian is a Gaussian. The Gaussian \(e^{-x^2}\) has “width” of order 1 in \(x\)-space and “width” of order 2 in \(\omega\)-space — wider in frequency space, reflecting the uncertainty principle. This is the mathematical origin of the heat kernel: in the heat equation solution \(\hat{u}(\omega, t) = \hat{f}(\omega)e^{-\kappa\omega^2 t}\), the factor \(e^{-\kappa\omega^2 t}\) is a Gaussian in \(\omega\) whose inverse transform gives the Gaussian heat kernel \(G(x,t) \propto e^{-x^2/(4\kappa t)}\).
6.1 Solving the Heat Equation on the Real Line
\[ u_t = \kappa u_{xx}, \quad x \in \mathbb{R},\; t > 0, \quad u(x,0) = f(x). \]\[ \frac{\partial \hat{u}}{\partial t} = \kappa(i\omega)^2 \hat{u} = -\kappa\omega^2 \hat{u}. \]\[ \hat{u}(\omega, t) = \hat{f}(\omega)\,e^{-\kappa\omega^2 t}. \]\[ G(x,t) = \frac{1}{\sqrt{4\pi\kappa t}}\,e^{-x^2/(4\kappa t)}. \]\[ u(x,t) = \int_{-\infty}^{\infty} f(y)\,G(x-y,t)\,dy = \frac{1}{\sqrt{4\pi\kappa t}}\int_{-\infty}^{\infty} f(y)\,\exp\!\left(-\frac{(x-y)^2}{4\kappa t}\right)dy. \]This formula is not just elegant — it is precisely the structure that underpins the solution to the Black-Scholes equation. The analogy between heat diffusion and the uncertainty in future asset prices is deep and deliberate.
The initial Gaussian spreads: its width grows as \(\sqrt{\sigma^2 + t}\). For \(t \gg \sigma^2\), the width is \(\approx \sqrt{t}\) — the characteristic diffusive scaling. The height of the Gaussian decreases as \(\sigma/\sqrt{\sigma^2+t}\) to conserve the total “heat” \(\int u\,dx = 2\sigma\sqrt{\pi} = \text{const}\). This example cleanly illustrates that the Fourier Transform diagonalises the diffusion operator: each Fourier mode evolves independently and exponentially.
6.2 The Black-Scholes Equation as a Heat Equation
\[ \frac{\partial V}{\partial t} + \frac{1}{2}\sigma^2 S^2 \frac{\partial^2 V}{\partial S^2} + rS\frac{\partial V}{\partial S} - rV = 0, \]with terminal condition \(V(S,T) = \text{Payoff}(S)\) at expiry \(T\). The variable coefficient \(S^2\) makes this look different from the heat equation, but a change of variables transforms it exactly.
Chapter 7: The Black-Scholes Equation
The previous chapters have been building toward one of the most celebrated results in financial mathematics: the Black-Scholes model for option pricing. This chapter derives the Black-Scholes PDE from first principles, solves it using the heat equation connection, and interprets the result economically. The mathematical tools developed throughout the course — ODEs, linear algebra, separation of variables, the Fourier transform — all converge here.
7.1 Stochastic Models for Asset Prices
A fundamental challenge in finance is modelling the future price of an asset. Prices are not deterministic — they are subject to random fluctuations. The standard model is Geometric Brownian Motion (GBM), which posits that the log-return of an asset over a short time interval is normally distributed and independent of the past.
The Wiener process \(W_t\) has the properties: \(W_0 = 0\); increments \(W_{t+s} - W_t \sim \mathcal{N}(0, s)\) for \(s > 0\); increments over non-overlapping intervals are independent. The \(\sigma S\,dW_t\) term introduces randomness proportional to the current price — consistent with the empirical observation that percentage changes in price, not absolute changes, are approximately normally distributed.
The solution to the GBM SDE is \(S(t) = S(0)\exp\!\left((\mu - \tfrac{1}{2}\sigma^2)t + \sigma W_t\right)\), so \(\ln S(t)\) is normally distributed — hence the name “geometric” Brownian motion.
7.2 Itô’s Lemma
To price derivatives — contracts whose payoff depends on \(S\) — we need to know how smooth functions of \(S\) evolve over time. In ordinary calculus, if \(V = V(S)\) and \(S\) evolves deterministically, then \(dV = V'(S)\,dS\). For a stochastic process, however, the ordinary chain rule is modified by a second-order correction term, because \((dW_t)^2 = dt\) in the Itô calculus (not zero as in ordinary calculus).
Heuristic derivation. Expand \(dV\) by the multivariable Taylor theorem: \(dV \approx V_t\,dt + V_S\,dS + \tfrac{1}{2}V_{SS}(dS)^2 + \cdots\). Substituting \(dS = \mu S\,dt + \sigma S\,dW_t\) and using the Itô rule \((dW_t)^2 = dt\), \((dt)^2 = 0\), \(dt\,dW_t = 0\): \[ (dS)^2 = \sigma^2 S^2 (dW_t)^2 = \sigma^2 S^2\,dt. \] Collecting \(dt\) and \(dW_t\) terms gives the formula above.
The crucial extra term \(\frac{1}{2}\sigma^2 S^2 V_{SS}\,dt\) is the Itô correction. It vanishes in the deterministic limit \(\sigma \to 0\), recovering ordinary calculus.
7.3 Delta-Hedging and the Black-Scholes PDE
\[ \Pi = V - \Delta\cdot S. \]\[ d\Pi = dV - \Delta\,dS = \left(V_t + \mu S V_S + \tfrac{1}{2}\sigma^2 S^2 V_{SS}\right)dt + \sigma S V_S\,dW_t - \Delta\left(\mu S\,dt + \sigma S\,dW_t\right). \]\[ d\Pi = \left(V_t + \tfrac{1}{2}\sigma^2 S^2 V_{SS}\right)dt. \]\[ d\Pi = r\Pi\,dt = r(V - \Delta S)\,dt = r(V - S V_S)\,dt. \]Setting the two expressions for \(d\Pi\) equal and rearranging:
For a European call option with strike \(K\): Payoff\((S) = \max(S-K, 0)\).
For a European put option: Payoff\((S) = \max(K-S, 0)\).
Notice the structure: this is a second-order linear PDE, backward in time (the terminal condition is given at \(t = T\), and we solve backward to \(t < T\)). The coefficients involve \(S\), making it variable-coefficient — but a change of variables will transform it into the heat equation.
7.4 Reduction to the Heat Equation
We transform the Black-Scholes PDE into the standard heat equation through a sequence of changes of variables.
Step 1: Let \(S = e^x\), so \(x = \ln S \in \mathbb{R}\). The log-price is a more natural variable since it ranges over all of \(\mathbb{R}\).
Step 2: Reverse time. Let \(\tau = T - t \geq 0\). The terminal condition at \(t = T\) becomes an initial condition at \(\tau = 0\).
Step 3: Remove the exponential growth by writing \(V = e^{\alpha x + \beta \tau} u(x, \tau)\) for constants \(\alpha\) and \(\beta\) to be determined.
\[ \frac{\partial u}{\partial \tau} = \frac{\sigma^2}{2}\frac{\partial^2 u}{\partial x^2}. \]This is exactly the heat equation with diffusivity \(\kappa = \sigma^2/2\)! The initial condition \(u(x, 0)\) is determined by the payoff function.
7.5 The Black-Scholes Formula
Applying the heat kernel solution (Chapter 6) to the transformed initial value problem and reversing the change of variables yields the celebrated Black-Scholes formula.
The formula has a beautiful economic interpretation. \(N(d_2)\) is the risk-neutral probability that the option expires in the money (\(S_T > K\)) — the probability, computed under a risk-neutral measure that replaces the drift \(\mu\) with the risk-free rate \(r\). The term \(Ke^{-r(T-t)}N(d_2)\) is the discounted expected strike payment. The term \(S\,N(d_1)\) accounts for the expected receipt of the stock, adjusted for the probability weighting. The option price is the present value of the expected net benefit from exercising.
The resolution is the risk-neutral measure (or equivalent martingale measure). The delta-hedging argument shows that any option risk can be perfectly replicated by a dynamic portfolio of stock and bond. The cost of this replicating portfolio — which is the option price — depends only on the statistical structure of the stock’s fluctuations (encoded in \(\sigma\)), not on the expected return (\(\mu\)), because the hedge eliminates all directional exposure. In the language of measure theory: we change from the physical (real-world) probability measure \(\mathbb{P}\) (under which the stock drifts at rate \(\mu\)) to the risk-neutral measure \(\mathbb{Q}\) (under which the stock drifts at the risk-free rate \(r\)). Under \(\mathbb{Q}\), the discounted stock price \(e^{-rt}S(t)\) is a martingale, and the option price is simply the discounted expected payoff: \(V = e^{-r(T-t)}\mathbb{E}^{\mathbb{Q}}\!\left[\text{Payoff}(S_T)\right]\).
This is the fundamental theorem of asset pricing. Its economic interpretation is: in a complete, arbitrage-free market, every agent — regardless of their risk preferences or beliefs about \(\mu\) — agrees on option prices, because those prices are set by the cost of replication, not by expected utility maximisation. This consensus is why options markets are so liquid: buyers and sellers can transact without needing to agree on the stock’s future return, only on its current volatility.
7.6 The Greeks
The partial derivatives of the option price with respect to its parameters are called the Greeks, and they quantify the sensitivities used by traders to manage risk.
- Delta: \(\Delta = \frac{\partial C}{\partial S} = N(d_1)\). Measures sensitivity to stock price. This is exactly the hedge ratio used in the Black-Scholes derivation.
- Gamma: \(\Gamma = \frac{\partial^2 C}{\partial S^2} = \frac{N'(d_1)}{S\sigma\sqrt{T-t}}\). Measures the rate of change of delta. High gamma means delta changes rapidly, requiring frequent rehedging.
- Theta: \(\Theta = \frac{\partial C}{\partial t}\). The time decay of the option — how much the option loses in value as one day passes, all else equal.
- Vega: \(\mathcal{V} = \frac{\partial C}{\partial \sigma} = S\sqrt{T-t}\,N'(d_1)\). Sensitivity to volatility. Options become more valuable when volatility increases, since the upside from large moves outweighs the downside for a call holder.
- Rho: \(\rho = \frac{\partial C}{\partial r} = K(T-t)e^{-r(T-t)}N(d_2)\). Sensitivity to the interest rate.
Note that the Black-Scholes PDE can be written in terms of the Greeks as \(\Theta + \frac{1}{2}\sigma^2 S^2 \Gamma + rS\Delta - rC = 0\). This form has the interpretation: the time decay \(\Theta\) is offset by the gain from Gamma (the curvature of the option’s value in \(S\)) and the return earned on the delta hedge.
7.7 Assumptions and Limitations
The Black-Scholes model rests on several idealising assumptions worth understanding critically:
- Constant volatility: In reality, implied volatility varies with strike and maturity (the “volatility smile” or “volatility surface”). Models such as stochastic volatility (Heston) or local volatility (Dupire) extend Black-Scholes to address this.
- Continuous trading: The delta-hedging argument requires instantaneous and costless rebalancing. In practice, transaction costs and discrete rebalancing introduce hedging error.
- Log-normal distribution: Empirical returns exhibit heavier tails than the normal distribution — so-called “fat tails” or excess kurtosis — meaning extreme events occur more often than the model predicts.
- No dividends, constant interest rate: These can be incorporated by straightforward modifications to the formula.
- Complete markets: The assumption that all risk can be perfectly hedged is an idealisation.
Despite these limitations, Black-Scholes remains the universal benchmark in options markets. Practitioners quote option prices in terms of “implied volatility” — the value of \(\sigma\) that makes the Black-Scholes formula reproduce the observed market price — as a common language, even when more sophisticated models are used for actual pricing.
7.8 Numerical Black-Scholes Example
Step 2: Normal CDF values: \(N(-0.098) \approx 0.461\) and \(N(-0.239) \approx 0.406\).
\[ C = 100 \times 0.461 - 105 \times e^{-0.025} \times 0.406 = 46.1 - 105 \times 0.9753 \times 0.406 \approx 46.1 - 41.6 = 4.50. \]The call is worth approximately $4.50. By put-call parity: \(P = C - S + Ke^{-r\tau} \approx 4.50 - 100 + 102.4 = 6.90\). The put commands a higher price because it is in-the-money while the call is out-of-the-money.
7.9 The Arc from ODEs to Black-Scholes
It is worth pausing to appreciate the mathematical arc of this course. We began with the simplest differential equations — separable ODEs describing exponential growth in a bank account — and progressively built the tools needed to handle more complex phenomena. First-order linear ODEs gave us the integrating factor and debt dynamics. Higher-order linear ODEs introduced the characteristic equation and superposition. Systems of ODEs introduced eigenvalues and phase portraits. PDEs introduced the method of characteristics and the classification of equations. The Fourier Transform gave us the heat kernel. And finally, Itô’s lemma and the delta-hedging argument produced the Black-Scholes PDE — which, through a clever change of variables, reduces to the very heat equation we solved in Chapter 6.
The Black-Scholes formula is thus the culmination of a long chain of mathematical ideas, each building on the last. The diffusion of heat and the diffusion of uncertainty in financial markets are not merely analogous — they are governed by the same equation, and the same mathematical techniques solve both.
Chapter 8: Phase Plane Analysis and Nonlinear Systems
The eigenvalue method of Chapter 4 gives exact solutions for linear systems. But most real systems — ecological interactions, epidemic spread, market dynamics — are genuinely nonlinear, and exact solutions are rarely available. The phase plane is the tool that compensates for this: it provides a complete qualitative picture of all trajectories without requiring closed-form expressions.
8.1 Phase Portraits and Equilibrium Classification
For a two-dimensional autonomous system \(\mathbf{x}' = \mathbf{f}(\mathbf{x})\), a phase portrait is the family of solution curves in the \((x_1, x_2)\) plane, with arrows indicating the direction of increasing \(t\). The key objects are the nullclines — curves where one component of \(\mathbf{f}\) is zero — and the equilibria (or fixed points) where \(\mathbf{f}(\mathbf{x}^*) = \mathbf{0}\).
- The \(x_1\)-nullcline is the set \(\{f_1(x_1,x_2) = 0\}\); on this curve, \(x_1\) is momentarily stationary.
- The \(x_2\)-nullcline is the set \(\{f_2(x_1,x_2) = 0\}\); on this curve, \(x_2\) is momentarily stationary.
- An equilibrium is a point where both nullclines intersect: \(f_1 = f_2 = 0\).
For linear systems \(\mathbf{x}' = A\mathbf{x}\), the origin is always an equilibrium. The character of all other trajectories is determined entirely by the eigenvalues of \(A\). Writing \(\tau = \mathrm{tr}(A)\) and \(\Delta = \det(A)\), the trace-determinant criterion gives a complete classification.
- Stable node: \(\Delta > 0\), \(\tau < 0\), \(\tau^2 - 4\Delta > 0\). Both eigenvalues real and negative.
- Unstable node: \(\Delta > 0\), \(\tau > 0\), \(\tau^2 - 4\Delta > 0\). Both eigenvalues real and positive.
- Saddle point: \(\Delta < 0\). Eigenvalues real with opposite signs; the equilibrium is always unstable.
- Stable spiral: \(\Delta > 0\), \(\tau < 0\), \(\tau^2 - 4\Delta < 0\). Complex eigenvalues with negative real part.
- Unstable spiral: \(\Delta > 0\), \(\tau > 0\), \(\tau^2 - 4\Delta < 0\). Complex eigenvalues with positive real part.
- Centre: \(\Delta > 0\), \(\tau = 0\). Purely imaginary eigenvalues; neutrally stable closed orbits.
- Degenerate (star/improper) node: \(\tau^2 - 4\Delta = 0\). Repeated eigenvalue.
(a) \(A = \begin{pmatrix}0&1\\-2&-3\end{pmatrix}\): \(\tau = -3\), \(\Delta = 2\). Since \(\tau < 0\), \(\Delta > 0\), and \(\tau^2 - 4\Delta = 9 - 8 = 1 > 0\): stable node. Eigenvalues \(\lambda = (-3 \pm 1)/2\), giving \(\lambda_1 = -1\), \(\lambda_2 = -2\). (This is the matrix from Section 4.3.)
(b) \(A = \begin{pmatrix}-1&2\\-2&-1\end{pmatrix}\): \(\tau = -2\), \(\Delta = 1 + 4 = 5\). \(\tau^2 - 4\Delta = 4 - 20 < 0\): stable spiral. Eigenvalues \(\lambda = -1 \pm 2i\). Trajectories spiral inward.
(c) \(A = \begin{pmatrix}1&1\\4&1\end{pmatrix}\): \(\tau = 2\), \(\Delta = 1 - 4 = -3 < 0\): saddle point. One positive, one negative eigenvalue. Trajectories approach along the stable manifold and diverge along the unstable manifold.
(d) \(A = \begin{pmatrix}0&1\\-4&0\end{pmatrix}\): \(\tau = 0\), \(\Delta = 4 > 0\): centre. Eigenvalues \(\lambda = \pm 2i\). The system exhibits closed orbits (ellipses). This is the undamped harmonic oscillator.
8.2 Competitive Species and Predator-Prey Models
The power of phase plane analysis lies in its application to genuinely nonlinear systems. A canonical example is the Lotka-Volterra competition model, which describes two species competing for the same resource.
Let \(x(t)\) and \(y(t)\) be the populations of two competing species. Each grows logistically in isolation, but each inhibits the other:
\[ \frac{dx}{dt} = r_1 x\!\left(1 - \frac{x}{K_1} - \frac{\alpha_{12} y}{K_1}\right), \qquad \frac{dy}{dt} = r_2 y\!\left(1 - \frac{y}{K_2} - \frac{\alpha_{21} x}{K_2}\right). \]The parameter \(\alpha_{12}\) measures how much a unit of species 2 “costs” species 1 in terms of carrying-capacity units; \(\alpha_{21}\) is the reciprocal effect. Equilibria occur where both nullclines intersect. The \(x\)-nullclines are \(x = 0\) and \(x + \alpha_{12} y = K_1\); the \(y\)-nullclines are \(y = 0\) and \(\alpha_{21} x + y = K_2\).
The \(x\)-nullcline (for \(x > 0\)): \(x + 0.5y = 100\), i.e., \(x = 100 - 0.5y\). The \(y\)-nullcline (for \(y > 0\)): \(0.5x + y = 100\), i.e., \(y = 100 - 0.5x\).
\[ x + 0.5y = 100, \quad 0.5x + y = 100. \]Adding: \(1.5(x + y) = 200\), so \(x + y = 133.3\). Subtracting the second from the first: \(0.5(x - y) = 0\), so \(x = y\). Then \(1.5x = 100\), giving \(x^* = y^* = 200/3 \approx 66.7\).
\[ J = \begin{pmatrix} -r_1 x^*/K_1 & -r_1\alpha_{12}x^*/K_1 \\ -r_2\alpha_{21}y^*/K_2 & -r_2 y^*/K_2 \end{pmatrix} = \begin{pmatrix} -2/3 & -1/3 \\ -1/3 & -2/3 \end{pmatrix}. \]\(\tau = -4/3 < 0\), \(\Delta = 4/9 - 1/9 = 3/9 = 1/3 > 0\), \(\tau^2 - 4\Delta = 16/9 - 4/3 = 4/9 > 0\): stable node. The two species coexist at the interior equilibrium. When \(\alpha_{12}\alpha_{21} < 1\) (interspecific competition weaker than intraspecific), coexistence is always stable. When \(\alpha_{12}\alpha_{21} > 1\), the interior equilibrium is a saddle and one species excludes the other — the initial conditions determine which.
Economic interpretation: This model applies directly to two firms competing in a market. \(K_i\) is the market share firm \(i\) could capture alone; \(\alpha_{ij}\) is the competitive impact ratio. Stable coexistence corresponds to product differentiation reducing direct competition (\(\alpha\) small); competitive exclusion corresponds to undifferentiated commodity markets where one firm drives the other out.
Chapter 9: Linearisation of Nonlinear Systems and the Hartman-Grobman Theorem
The trace-determinant analysis applies directly to linear systems. For nonlinear systems, the key insight is that near a hyperbolic equilibrium, the nonlinear system looks like its linearisation — and the phase portrait of the linearisation accurately represents that of the nonlinear system.
The Jacobian at an equilibrium is computed by differentiating \(\mathbf{f}\) and evaluating at the equilibrium point.
9.1 The SIR Epidemic Model
The SIR model (Kermack-McKendrick, 1927) is the canonical model of epidemic spread. A population of fixed size \(N\) is divided into: \(S\) (susceptibles), \(I\) (infectives), \(R\) (recovered/removed). The dynamics are:
\[ \frac{dS}{dt} = -\beta S I, \qquad \frac{dI}{dt} = \beta S I - \gamma I, \qquad \frac{dR}{dt} = \gamma I, \]where \(\beta > 0\) is the transmission rate and \(\gamma > 0\) is the recovery rate. Since \(S + I + R = N\) is conserved, \(R\) is determined by \(S\) and \(I\), so we work in the two-dimensional \((S, I)\) plane.
Equilibria: Setting \(dS/dt = 0\) and \(dI/dt = 0\):
- \(\beta SI = 0\) requires \(S = 0\) or \(I = 0\).
- \(\beta SI - \gamma I = I(\beta S - \gamma) = 0\) requires \(I = 0\) or \(S = \gamma/\beta\).
The only equilibria with \(I \geq 0\) and \(S \geq 0\) are the disease-free equilibria \((S^*, 0)\) for any \(0 \leq S^* \leq N\). (There is no interior equilibrium with \(I > 0\) that is a fixed point; the epidemic must end at some \(S^* < N\).)
\[ J = \begin{pmatrix} \frac{\partial}{\partial S}(-\beta SI) & \frac{\partial}{\partial I}(-\beta SI) \\ \frac{\partial}{\partial S}(\beta SI - \gamma I) & \frac{\partial}{\partial I}(\beta SI - \gamma I) \end{pmatrix}\Bigg|_{(S^*, 0)} = \begin{pmatrix} 0 & -\beta S^* \\ 0 & \beta S^* - \gamma \end{pmatrix}. \]The eigenvalues are \(\lambda_1 = 0\) and \(\lambda_2 = \beta S^* - \gamma\).
Stability threshold: The equilibrium \((S^*, 0)\) is stable (the infection dies out) when \(\lambda_2 < 0\), i.e., \(\beta S^* < \gamma\), i.e., \(S^* < \gamma/\beta = N/\mathcal{R}_0\).
When the population is fully susceptible (\(S^* = N\)), stability requires \(\beta N < \gamma\), i.e., \(\mathcal{R}_0 < 1\). If \(\mathcal{R}_0 > 1\), the disease-free state is unstable and an epidemic occurs. The threshold \(\mathcal{R}_0 = 1\) is the epidemic threshold: vaccination campaigns aim to reduce the effective susceptible fraction below \(1/\mathcal{R}_0\) (herd immunity threshold \(p^* = 1 - 1/\mathcal{R}_0\)).
Numerical example: Suppose \(N = 10{,}000\), \(\beta = 0.0003\), \(\gamma = 0.1\). Then \(\mathcal{R}_0 = 0.0003 \times 10{,}000 / 0.1 = 30\). This is a highly transmissible pathogen (comparable to measles). The herd immunity threshold is \(1 - 1/30 \approx 96.7\%\). The Jacobian at the disease-free equilibrium \((10{,}000, 0)\) gives \(\lambda_2 = 0.0003 \times 10{,}000 - 0.1 = 3 - 0.1 = 2.9 > 0\): the disease-free state is strongly unstable, confirming that without immunisation, an epidemic is inevitable.
Chapter 10: The IS-LM Model as a Dynamical System
The IS-LM framework introduced in Chapter 2 deserves a more careful dynamical treatment. In the static version, national income \(Y\) and the interest rate \(r\) are simultaneously determined by goods market equilibrium (the IS curve) and money market equilibrium (the LM curve). In the dynamic version, these adjust at finite speeds, generating a system of ODEs.
10.1 Dynamic IS-LM
Let income and the interest rate satisfy:
\[ \frac{dY}{dt} = \alpha\bigl[I(r) + G - sY\bigr], \qquad \frac{dr}{dt} = \beta\bigl[kY - \bar{M}/P - hr\bigr], \]where \(s\) is the marginal propensity to save, \(k > 0\) is the income sensitivity of money demand, \(h > 0\) is the interest sensitivity of money demand, \(\bar{M}/P\) is the real money supply, \(I(r) = I_0 - br\) is investment (decreasing in \(r\)), and \(\alpha, \beta > 0\) are adjustment speeds. Linearising near the IS-LM equilibrium \((Y^*, r^*)\):
\[ \frac{d}{dt}\begin{pmatrix}Y - Y^* \\ r - r^*\end{pmatrix} = \begin{pmatrix} -\alpha s & -\alpha b \\ \beta k & -\beta h \end{pmatrix}\begin{pmatrix}Y - Y^* \\ r - r^*\end{pmatrix}. \]Eigenvalues: \(\lambda = (-0.7 \pm \sqrt{-0.01})/2 = -0.35 \pm 0.05i\). The period of oscillation is \(2\pi/0.05 \approx 125.6\) time units. The decay rate is \(e^{-0.35t}\); after \(t = 10\), the amplitude has fallen by factor \(e^{-3.5} \approx 0.030\). This represents a business cycle of very long period — consistent with the observation that IS-LM adjustment in calibrated macro models takes several years to complete.
\[ \frac{\partial Y^*}{\partial G} = \frac{h}{sh + bk} = \frac{1}{s + bk/h}. \]With the numbers above: \(bk/h = 0.05\), so the multiplier is \(1/(0.2 + 0.05) = 4\). A unit increase in government spending raises equilibrium income by 4 units — this is the Keynesian multiplier, modified downward from \(1/s = 5\) by the crowding-out effect (higher \(G\) raises \(r\), reducing \(I\)). The eigenvalue ratio \(\lambda_1/\lambda_2\) encodes the relative speeds at which the goods market and money market adjust — a deeper connection between the dynamical and comparative-static analyses.
Chapter 11: The Solow Model with Human Capital
The standard Solow model of Chapter 2 explains convergence in physical capital \(k\) but leaves an uncomfortably large residual — “total factor productivity” — unexplained. Mankiw, Romer, and Weil (1992) extended the model by adding human capital \(h\) (skills, education) as a separate accumulable factor. This MRW extension resolves the empirical puzzle and is solvable as a two-dimensional linear system after log-linearisation.
11.1 The MRW Equations
With a Cobb-Douglas production function \(Y = K^\alpha H^\phi L^{1-\alpha-\phi}\) (where \(\alpha + \phi < 1\)), fractions \(s_k\) and \(s_h\) of output invested in physical and human capital, and common depreciation and population growth \((n + \delta)\), the dynamics of \(k = K/L\) and \(h = H/L\) are:
\[ \frac{dk}{dt} = s_k k^\alpha h^\phi - (n+\delta)k, \qquad \frac{dh}{dt} = s_h k^\alpha h^\phi - (n+\delta)h. \]The steady state \((k^*, h^*)\) satisfies both equations simultaneously. Taking logs and setting \(\hat{k} = \ln k - \ln k^*\) and \(\hat{h} = \ln h - \ln h^*\) (log-deviations from steady state), the linearised system is:
\[ \frac{d}{dt}\begin{pmatrix}\hat{k}\\\hat{h}\end{pmatrix} = (n+\delta)\begin{pmatrix}\alpha - 1 & \phi \\ \alpha & \phi - 1\end{pmatrix}\begin{pmatrix}\hat{k}\\\hat{h}\end{pmatrix}. \]For comparison, the standard Solow model (without human capital) gives \(\mu_{\text{Solow}} = (n+\delta)(1-\alpha) = 0.07 \times 2/3 \approx 0.0467\), corresponding to a half-life of about 14.8 years. Adding human capital roughly doubles the convergence half-life, because human capital must also adjust — the system has more “inertia”. This is consistent with cross-country data showing convergence rates of roughly 2% per year (implying half-lives around 35 years), much slower than the pure Solow prediction.
Golden rule capital stock: The golden rule maximises steady-state consumption per worker \(c^* = f(k^*, h^*) - (n+\delta)(k^* + h^*)\). Optimising over \((s_k, s_h)\) gives the golden rule condition: the marginal product of each capital type equals \(n+\delta\). With Cobb-Douglas production: \(\alpha Y^*/k^* = n + \delta\) and \(\phi Y^*/h^* = n + \delta\), so at the golden rule, \(s_k = \alpha\) and \(s_h = \phi\). If the economy saves too little (\(s_k < \alpha\)), capital is below the golden rule and increasing savings raises long-run welfare; if \(s_k > \alpha\), the economy is dynamically inefficient.
Chapter 12: Laplace Transforms
While the Fourier Transform is the right tool for problems on the entire real line, the Laplace Transform is designed for initial-value problems on the half-line \(t \geq 0\). It converts an ODE with initial conditions into an algebraic equation, solves it in “transform space,” and inverts. The power of the Laplace Transform lies especially in handling discontinuous or impulsive forcing — a sudden policy change, a tax reform implemented at \(t = t_0\), or a sharp shock to investment.
The transform is linear: \(\mathcal{L}\{af + bg\} = a\mathcal{L}\{f\} + b\mathcal{L}\{g\}\). Its defining property for ODEs is the derivative rule: \(\mathcal{L}\{f'\}(s) = sF(s) - f(0)\), and more generally \(\mathcal{L}\{f^{(n)}\}(s) = s^n F(s) - s^{n-1}f(0) - \cdots - f^{(n-1)}(0)\). Initial conditions are automatically incorporated.
12.1 Table of Key Transforms
The following table collects the transforms needed for most applications:
| \(f(t)\) | \(F(s) = \mathcal{L}\{f\}(s)\) | Condition |
|---|---|---|
| \(1\) | \(1/s\) | \(s > 0\) |
| \(e^{at}\) | \(1/(s-a)\) | \(s > a\) |
| \(t^n\) | \(n!/s^{n+1}\) | \(s > 0\) |
| \(\sin(\omega t)\) | \(\omega/(s^2+\omega^2)\) | \(s > 0\) |
| \(\cos(\omega t)\) | \(s/(s^2+\omega^2)\) | \(s > 0\) |
| \(e^{at}\sin(\omega t)\) | \(\omega/((s-a)^2+\omega^2)\) | \(s > a\) |
| \(e^{at}\cos(\omega t)\) | \((s-a)/((s-a)^2+\omega^2)\) | \(s > a\) |
| \(u_c(t) = \mathbf{1}_{t \geq c}\) | \(e^{-cs}/s\) | \(s > 0\) |
| \(u_c(t)f(t-c)\) | \(e^{-cs}F(s)\) | (Second shifting theorem) |
| \(\delta(t - c)\) | \(e^{-cs}\) | (Dirac delta) |
| \((f*g)(t) = \int_0^t f(\tau)g(t-\tau)d\tau\) | \(F(s)G(s)\) | (Convolution theorem) |
Here \(u_c(t)\) is the Heaviside step function that switches on at \(t = c\): \(u_c(t) = 0\) for \(t < c\) and \(u_c(t) = 1\) for \(t \geq c\).
- First shifting theorem (s-shift): \(\mathcal{L}\{e^{at}f(t)\}(s) = F(s-a)\). Multiplication by \(e^{at}\) in \(t\)-space shifts the transform by \(a\) in \(s\)-space.
- Second shifting theorem (t-shift): \(\mathcal{L}\{u_c(t)\,f(t-c)\}(s) = e^{-cs}F(s)\). A time-delay of \(c\) units in \(t\)-space multiplies the transform by \(e^{-cs}\).
- Convolution theorem: \(\mathcal{L}\{(f*g)(t)\}(s) = F(s)G(s)\). Convolution in \(t\)-space is multiplication in \(s\)-space.
12.2 Solving an IVP with Discontinuous Forcing
Let \(G(s) = \frac{1}{s(s^2+4)}\), so \(g(t) = \mathcal{L}^{-1}\{G\}(t) = \frac{1}{4}(1 - \cos(2t))\).
\[ y(t) = \mathcal{L}^{-1}\{e^{-2s}G(s)\}(t) = u_2(t)\,g(t-2) = u_2(t)\cdot\frac{1}{4}\bigl(1 - \cos(2(t-2))\bigr). \]\[ y(t) = \begin{cases} 0 & t < 2, \\ \dfrac{1}{4}\!\left(1 - \cos(2t - 4)\right) & t \geq 2. \end{cases} \]For \(t < 2\): no forcing has occurred, so the system remains at rest. At \(t = 2\), the forcing switches on, exciting oscillations at the natural frequency \(\omega_0 = 2\). The solution oscillates with amplitude \(1/4\) around the constant particular solution \(1/4\). Note: this is close to (but not exactly) resonance — the forcing is a step function, which has Fourier content at all frequencies, including \(\omega = 2\).
Numerical values: At \(t = 3\): \(y(3) = \frac{1}{4}(1 - \cos(2)) \approx \frac{1}{4}(1 - (-0.416)) \approx \frac{1.416}{4} \approx 0.354\). At \(t = 5\): \(y(5) = \frac{1}{4}(1 - \cos(6)) \approx \frac{1}{4}(1 - 0.960) \approx 0.010\). The oscillation persists indefinitely (no damping).
12.3 Economic Application: Policy Shock
Partial fractions: \(\frac{1}{s(s+\lambda)} = \frac{1}{\lambda s} - \frac{1}{\lambda(s+\lambda)}\). Inverting:
\[ Y(t) = Y_0 e^{-\lambda t} + \frac{\alpha G_0}{\lambda}\!\left(1 - e^{-\lambda t}\right) + \frac{\alpha\,\Delta G}{\lambda}\,u_{t_0}(t)\!\left(1 - e^{-\lambda(t-t_0)}\right). \]For \(t < t_0\): \(Y\) decays from \(Y_0\) toward \(Y^*_{\text{pre}} = \alpha G_0/\lambda\). After \(t_0\): \(Y\) makes a further transition toward \(Y^*_{\text{post}} = \alpha(G_0 + \Delta G)/\lambda\). The policy multiplier on the new long-run income is \(\alpha/\lambda = 1/(s-c)\) — the standard Keynesian multiplier restated in these units. The Laplace approach makes it mechanical to handle piecewise-constant policy paths, which would be cumbersome to analyse by the method of variation of parameters.
Chapter 13: Power Series and the Method of Frobenius
Not every ODE with variable coefficients can be solved by the previous methods. When the coefficient functions are analytic (expressible as convergent power series), we can seek solutions as power series too. This is particularly relevant for the Euler equation and its generalisations, which arise in the Solow model and in spatial economics when problems are posed in polar or spherical coordinates.
- An ordinary point if \(P\) and \(Q\) are both analytic at \(x_0\) (i.e., expandable in convergent Taylor series). Near an ordinary point, two linearly independent power series solutions \(y = \sum_{n=0}^\infty a_n(x-x_0)^n\) exist, convergent in the radius of analyticity.
- A regular singular point if \(P\) has at most a simple pole and \(Q\) at most a double pole at \(x_0\): \((x-x_0)P(x)\) and \((x-x_0)^2 Q(x)\) are both analytic at \(x_0\). The method of Frobenius applies.
- An irregular singular point otherwise.
13.1 The Euler Equation
\[ x^2 y'' + \alpha x y' + \beta y = 0, \quad x > 0. \]\[ r^2 + (\alpha - 1)r + \beta = 0. \]The substitution \(x = e^t\) (i.e., \(t = \ln x\)) converts the Euler equation into a constant-coefficient ODE in \(t\), which can be solved by the characteristic equation method.
Indicial equation: \(r^2 + (3-1)r + 1 = r^2 + 2r + 1 = (r+1)^2 = 0\). Repeated root \(r = -1\).
First solution: \(y_1 = x^{-1}\).
Second solution (logarithmic case — repeated root): \(y_2 = x^{-1}\ln x\).
\[ x^2 \cdot x^{-3}(2\ln x - 3) + 3x \cdot x^{-2}(1 - \ln x) + x^{-1}\ln x = x^{-1}(2\ln x - 3) + 3x^{-1}(1 - \ln x) + x^{-1}\ln x. \]\[ = x^{-1}(2\ln x - 3 + 3 - 3\ln x + \ln x) = x^{-1} \cdot 0 = 0. \checkmark \]General solution: \(y = c_1 x^{-1} + c_2 x^{-1}\ln x = x^{-1}(c_1 + c_2 \ln x)\).
Example with distinct roots: Solve \(x^2 y'' - 2y = 0\). Indicial equation: \(r(r-1) - 2 = r^2 - r - 2 = (r-2)(r+1) = 0\). Roots \(r_1 = 2\), \(r_2 = -1\) (differ by 3, not by 0 or 1, so no logarithm). General solution: \(y = c_1 x^2 + c_2 x^{-1}\).
Economic interpretation: Euler equations arise naturally in optimal control problems with scale-invariant technologies (constant returns to scale). In growth theory, the Ramsey-Cass-Koopmans optimal savings problem reduces to a second-order ODE for the capital stock; near the steady state, the linearised equation is approximately of Euler type, and the indicial roots determine the stable and unstable manifolds.
13.2 Power Series Solution Near an Ordinary Point
Near an ordinary point, the method of power series is simpler than Frobenius — no indicial equation is needed.
This recurrence connects even and odd coefficients separately. Starting from free constants \(a_0\) and \(a_1\):
\[ a_2 = \frac{a_0}{2},\quad a_4 = \frac{a_2}{4} = \frac{a_0}{8},\quad a_6 = \frac{a_4}{6} = \frac{a_0}{48}, \ldots \]\[ a_3 = \frac{a_1}{3},\quad a_5 = \frac{a_3}{5} = \frac{a_1}{15},\quad a_7 = \frac{a_5}{7} = \frac{a_1}{105}, \ldots \]\[ y_1(x) = a_0\sum_{k=0}^\infty \frac{x^{2k}}{2^k k!} = a_0\,e^{x^2/2}, \qquad y_2(x) = a_1\sum_{k=0}^\infty \frac{x^{2k+1}}{1 \cdot 3 \cdot 5 \cdots (2k+1)}. \]Indeed, \(y_1 = e^{x^2/2}\) can be verified directly: \(y_1' = xe^{x^2/2}\), \(y_1'' = (1+x^2)e^{x^2/2}\), and \(y_1'' - xy_1' - y_1 = (1+x^2)e^{x^2/2} - x^2 e^{x^2/2} - e^{x^2/2} = 0\). The power series method has efficiently produced both solutions, and the even-indexed one turns out to be an elementary function while the odd-indexed one is a non-elementary special function related to the error function \(\mathrm{erf}(x)\).
Chapter 14: Travelling Waves and the Fisher-KPP Equation
The final topic brings together nonlinear ODEs, phase plane analysis, and PDE theory in an application with direct epidemiological and economic interpretations: travelling wave solutions to reaction-diffusion equations.
14.1 The Fisher-KPP Equation
Consider a population \(u(x,t)\) — the density of infected individuals in a spatial domain, or the density of adopters of a new technology — that both diffuses and grows logistically:
\[ \frac{\partial u}{\partial t} = D\frac{\partial^2 u}{\partial x^2} + ru\!\left(1 - u\right), \]where we have scaled so that the carrying capacity is 1. This is the Fisher-KPP equation (Fisher 1937; Kolmogorov, Petrovsky, Piskunov 1937). The key feature is that it admits travelling wave solutions of the form \(u(x,t) = \varphi(x - ct)\) — a front that moves at constant speed \(c\) while maintaining its shape.
Substituting \(u = \varphi(\xi)\) into the PDE: \(-c\varphi' = D\varphi'' + r\varphi(1-\varphi)\), or
\[ D\varphi'' + c\varphi' + r\varphi(1-\varphi) = 0. \]This is a second-order nonlinear ODE for \(\varphi(\xi)\). Setting \(p = \varphi'\), we convert to a first-order system in the phase plane \((\varphi, p)\):
\[ \frac{d\varphi}{d\xi} = p, \qquad \frac{dp}{d\xi} = -\frac{c}{D}p - \frac{r}{D}\varphi(1-\varphi). \]The equilibria are \((\varphi, p) = (0, 0)\) and \((1, 0)\). A travelling wave corresponds to a trajectory in the \((\varphi, p)\) phase plane connecting \((1,0)\) to \((0,0)\) — a heteroclinic orbit.
This is the minimum speed at which an epidemic front advances spatially. Historical data on the Black Death (1347-1353) gives an observed wave speed of roughly 500 km/year, suggesting much higher effective diffusion (\(D \sim 40\,\text{km}^2/\text{day}\)) consistent with long-range travel by trade routes rather than pure local diffusion.
Stability of the wave speed: Initial conditions with compact support always generate the minimal-speed wave. Faster initial conditions (e.g., a function decaying algebraically rather than exponentially as \(x \to \infty\)) can generate faster waves, but minimal waves are the generic outcome. This explains why epidemic fronts typically advance at a characteristic, reproducible speed even when initial seeding conditions vary.
Economic application — Diffusion of innovation: The same equation models the spatial spread of a new technology or agricultural practice. \(u(x,t)\) is the adoption fraction at location \(x\), \(D\) is the rate of information diffusion (via trade networks, media), and \(r\) is the intrinsic adoption advantage. The wave speed \(c^* = 2\sqrt{Dr}\) predicts how fast the adoption frontier advances geographically. Ryan and Gross’s famous study of hybrid corn adoption in the 1930s US Midwest observed precisely this wave-like spread, with a speed consistent with the Fisher-KPP prediction.
Appendix: Key Formulas and Solution Methods
First-Order ODEs: Solution Map
| Equation type | Solution method | Result |
|---|---|---|
| \(y' = ry\) | Separation | \(y = y_0 e^{rt}\) |
| \(y' = ry(1-y/K)\) | Separation (logistic) | \(y = K/(1+Ae^{-rt})\), \(A = (K-y_0)/y_0\) |
| \(y' + P(x)y = Q(x)\) | Integrating factor \(\mu = e^{\int P}\) | \(y = \mu^{-1}(\int \mu Q\,dx + C)\) |
| \(y' + P(x)y = Q(x)y^n\) | Bernoulli: \(v = y^{1-n}\) | Linear ODE for \(v\) |
| Newton cooling \(T' = k(A-T)\) | Linear 1st order | \(T = A + (T_0-A)e^{-kt}\) |
Second-Order ODE: Characteristic Roots
For \(ay'' + by' + cy = 0\) with discriminant \(\Delta = b^2 - 4ac\):
- \(\Delta > 0\) (overdamped / distinct real): \(y = c_1 e^{r_1 x} + c_2 e^{r_2 x}\)
- \(\Delta = 0\) (critically damped / repeated): \(y = (c_1 + c_2 x)e^{rx}\)
- \(\Delta < 0\) (underdamped / complex): \(y = e^{\alpha x}(A\cos\beta x + B\sin\beta x)\)
Resonance: multiply particular solution guess by \(x^s\) where \(s\) = multiplicity of the forcing frequency as a characteristic root.
Economic Models Summary
| Model | ODE/System | Key result |
|---|---|---|
| Continuous compounding | \(A' = rA\) | \(A = A_0e^{rt}\) |
| Logistic adoption | \(P' = rP(1-P/K)\) | S-curve; inflection at \(P = K/2\) |
| Public debt (Domar) | \(D' = rD + G_0e^{gt}\) | Sustainable iff \(r < g\) |
| Solow growth | \(k' = sk^\alpha - (n+\delta)k\) | Stable \(k^*\) via Bernoulli substitution |
| Samuelson cycle | \(Y'' - (c-1+a)Y' + (1-c)Y = G_0\) | Oscillatory iff \((c-1+a)^2 < 4(1-c)\) |
| Price adjustment | \(P' = \alpha(D(P)-S(P))\) | Stable iff \(D'(P^*) < S'(P^*)\) |
| SIR epidemic | \(S' = -\beta SI,\; I' = \beta SI - \gamma I\) | Epidemic iff \(\mathcal{R}_0 = \beta N/\gamma > 1\) |
| IS-LM dynamic | \(Y' = \alpha[I(r)+G-sY],\; r' = \beta[kY-hr-\bar{M}/P]\) | Always stable; node/spiral by \(\tau^2 - 4\Delta\) |
| MRW (Solow + human capital) | \(dk/dt = s_k k^\alpha h^\phi - (n+\delta)k\) | Convergence rate \(\mu = (n+\delta)(1-\alpha-\phi)\) |
| Fisher-KPP (spatial spread) | \(u_t = Du_{xx} + ru(1-u)\) | Wave speed \(c^* = 2\sqrt{Dr}\) |
Black-Scholes Quick Reference
PDE: \(V_t + \frac{1}{2}\sigma^2 S^2 V_{SS} + rSV_S - rV = 0\)
\[ C = SN(d_1) - Ke^{-r\tau}N(d_2), \quad d_1 = \frac{\ln(S/K)+(r+\sigma^2/2)\tau}{\sigma\sqrt{\tau}}, \quad d_2 = d_1 - \sigma\sqrt{\tau}. \]Put-call parity: \(C - P = S - Ke^{-r\tau}\).
Delta: \(\Delta = N(d_1)\). Vega: \(\mathcal{V} = S\sqrt{\tau}N'(d_1)\).
Laplace Transform Quick Reference
Key pairs: \(\mathcal{L}\{e^{at}\} = 1/(s-a)\); \(\mathcal{L}\{\sin\omega t\} = \omega/(s^2+\omega^2)\); \(\mathcal{L}\{\cos\omega t\} = s/(s^2+\omega^2)\); \(\mathcal{L}\{t^n\} = n!/s^{n+1}\); \(\mathcal{L}\{u_c(t)f(t-c)\} = e^{-cs}F(s)\).
Derivative rule: \(\mathcal{L}\{y''\} = s^2Y - sy(0) - y'(0)\). Converts an IVP to an algebraic equation for \(Y(s)\).
Solving procedure: (1) Transform the ODE; (2) solve for \(Y(s)\) algebraically; (3) apply partial fractions; (4) invert using the table and shifting theorems.
Trace-Determinant Quick Reference
For \(\mathbf{x}' = A\mathbf{x}\) with \(\tau = \mathrm{tr}(A)\), \(\Delta = \det(A)\):
- \(\Delta < 0\): saddle (always unstable)
- \(\Delta > 0\), \(\tau < 0\): stable (\(\tau^2 > 4\Delta\): node; \(\tau^2 < 4\Delta\): spiral)
- \(\Delta > 0\), \(\tau > 0\): unstable (\(\tau^2 > 4\Delta\): node; \(\tau^2 < 4\Delta\): spiral)
- \(\Delta > 0\), \(\tau = 0\): centre
Euler Equation Quick Reference
\(x^2y'' + \alpha x y' + \beta y = 0\): substitute \(y = x^r\), indicial equation \(r^2 + (\alpha-1)r + \beta = 0\).
- Distinct real roots \(r_1 \neq r_2\): \(y = c_1 x^{r_1} + c_2 x^{r_2}\).
- Repeated root \(r\): \(y = x^r(c_1 + c_2\ln x)\).
- Complex roots \(r = \mu \pm \nu i\): \(y = x^\mu(c_1\cos(\nu\ln x) + c_2\sin(\nu\ln x))\).
Heat equation connection: \(x = \ln S\), \(\tau = T-t\), exponential substitution \(\Rightarrow\) \(u_\tau = \frac{\sigma^2}{2}u_{xx}\), solved by the heat kernel.