MUSIC 272: Counterpoint and Fugue

Estimated study time: 3 hr 16 min

Table of contents

These notes draw on Johann Joseph Fux’s Gradus ad Parnassum (trans. Mann, 1943), Kent Kennan’s Counterpoint (4th ed., 1999), Robert Gauldin’s A Practical Approach to Eighteenth-Century Counterpoint (2nd ed., 2013), Johann Sebastian Bach’s Well-Tempered Clavier and Two-Part Inventions (as primary analytical examples), and supplementary material from Yale School of Music counterpoint course materials and Eastman School of Music undergraduate theory resources.


Chapter 1: Foundations of Counterpoint

1.1 Intervals and Their Classification

Counterpoint is the art of combining independent melodic lines into a coherent and expressively unified whole. The word derives from the Latin punctus contra punctum — note against note — and in its broadest sense encompasses every discipline of polyphonic writing from the Renaissance motet to the Baroque fugue, from the Classical string quartet to the modern phasing compositions of Steve Reich. Before one can compose a single measure of two-voice counterpoint, one must understand with precision the raw material from which all polyphony is constructed: the interval.

An interval is the distance in pitch between two tones, measured by the number of diatonic steps separating them. We count inclusively: from C to E is a third (C, D, E — three steps). Intervals are named by their size (unison, second, third, fourth, fifth, sixth, seventh, octave) and their quality (perfect, major, minor, augmented, diminished). The distinction between perfect and imperfect intervals is not merely taxonomic; it reflects a hierarchy of acoustic stability that directly governs which intervals may appear, and where, in a contrapuntal texture. The theorist who understands why this hierarchy exists — rooted not in convention but in the physics of vibrating strings and the mathematics of frequency ratios — will find that the rules of counterpoint feel not arbitrary but inevitable.

Definition 1.1 (Perfect and Imperfect Consonances). A perfect consonance is an interval whose frequency ratio, in just intonation, is expressible as a ratio of small integers involving only the primes 2 and 3. The perfect intervals are: the unison \((1:1)\), the perfect fifth \((3:2)\), the perfect fourth \((4:3)\), and the octave \((2:1)\). An imperfect consonance is an interval whose ratio involves the prime 5: the major third \((5:4)\), the minor third \((6:5)\), the major sixth \((5:3)\), and the minor sixth \((8:5)\). All other diatonic intervals — seconds, sevenths, the tritone — are classified as dissonances.

This classification is not arbitrary. The ancient Greeks, particularly Pythagoras, observed that the most acoustically stable simultaneous tones are those related by the simplest ratios. Two strings vibrating in the ratio \(2:1\) produce tones an octave apart, so thoroughly consonant that medieval theorists occasionally treated them as the same pitch class. The ratio \(3:2\) yields the perfect fifth, the interval that forms the basis of the circle of fifths, the overtone series, and ultimately the entire tonal system. The ratio \(4:3\) gives the perfect fourth — acoustically a near-twin to the fifth, yet treated with considerably more ambivalence in Renaissance and Baroque practice, as we shall see.

The imperfect consonances — thirds and sixths — introduce the prime 5 into the ratio. A major third at \(5:4\) carries a brightness absent from the hollow resonance of a perfect fifth; a minor third at \(6:5\) has a mellower, more shadowed quality. These intervals, which medieval theory largely dismissed as dissonances (too complex, too far from the Pythagorean ideal of pure powers of 2 and 3), became the structural pillars of Renaissance polyphony. Composers like Josquin des Prez and Palestrina built entire movements on the careful interlocking of thirds and sixths, understanding — perhaps intuitively, perhaps from long practice — that these intervals create a richer, warmer sonority than the naked perfect consonances of the earlier Gothic style. Their richness arises precisely from the slight acoustic complexity introduced by the prime 5.

Definition 1.2 (Dissonance). An interval is a dissonance if it is not a consonance in the sense of Definition 1.1. The primary dissonances in tonal counterpoint are: the major and minor second \((9:8\) and \(16:15\) respectively), the major and minor seventh \((15:8\) and \(9:5)\), the augmented fourth and diminished fifth (the tritone, irrational in equal temperament), and the diminished seventh. Dissonances require preparation or special treatment, and in all standard cases resolve by stepwise motion to a consonance.

The tritone deserves special attention. Its frequency ratio is irrational in equal temperament — the ratio is \(2^{6/12} : 1 = \sqrt{2}:1\) — and even in just intonation it admits no clean small-integer form, the closest approximations (\(45:32\) or \(64:45\)) involving large numbers. Medieval theorists called it diabolus in musica — the devil in music — and its use was severely restricted in early polyphony. Yet it is precisely this restless instability that makes the tritone indispensable to tonal music: it creates the harmonic tension that the dominant seventh chord must resolve, and it defines the half-step voice-leading that gives authentic cadences their sense of inevitability. Without the tritone, there is no functional harmony in the modern sense.

Remark 1.1 (Just Intonation vs. Equal Temperament). The frequency ratios given in Definition 1.1 are those of just intonation, the system in which each interval is tuned to its exact small-integer ratio. In equal temperament — the tuning system used by virtually all modern instruments — intervals are approximations: the perfect fifth is \(2^{7/12}:1 \approx 1.4983\) rather than exactly \(3:2 = 1.5000\). This slight compromise (about two cents flat) permits unlimited transposition through all twelve keys while keeping every interval reasonably in tune. The study of counterpoint assumes equal temperament in practice, but the just-intonation ratios explain the psychological reality that underlies the rules: the acoustic fusion of perfect consonances, the warmth of thirds, the restlessness of the tritone.

1.2 The Overtone Series and Harmonic Justification

The interval classification of Definition 1.1 has a deeper physical justification than the mere simplicity of frequency ratios. Every pitched sound produced by a musical instrument is in fact a complex mixture of frequencies: the fundamental frequency \(f_0\) and its integer multiples \(2f_0, 3f_0, 4f_0, \ldots\), the so-called overtone or harmonic series. A string vibrating at 110 Hz (A2) simultaneously produces partial tones at 220 Hz (A3), 330 Hz (E4), 440 Hz (A4), 550 Hz (approximately C#5), 660 Hz (E5), and so on. The first six partials of any fundamental collectively span a major triad — an observation that is not a coincidence but a derivation: the major triad is the acoustically natural chord, the chord that nature produces whenever a single pitched tone is sounded.

Theorem 1.1 (The Harmonic Series and Consonance Hierarchy). Let \(f_0\) be the fundamental frequency of a tone. Its harmonic series is the sequence \(\{n f_0 : n \in \mathbb{Z}^+\}\). The intervals formed between successive partials — and between partials close in harmonic number — correspond to the most consonant intervals:
  • Partials 1 and 2: ratio \(2:1\), the octave (perfect consonance).
  • Partials 2 and 3: ratio \(3:2\), the perfect fifth.
  • Partials 3 and 4: ratio \(4:3\), the perfect fourth.
  • Partials 4 and 5: ratio \(5:4\), the major third.
  • Partials 5 and 6: ratio \(6:5\), the minor third.
  • Partials 6 and 7: ratio \(7:6\), approximately a minor third but slightly flat (the "harmonic seventh").
The consonance of an interval is thus directly proportional to the proximity of its constituent partials in the harmonic series: the more harmonics two tones share, the more consonant they sound. Dissonances — seconds and sevenths — correspond to pairs of partials that are close in frequency but not integer multiples of a common fundamental, producing the acoustic "beating" (amplitude fluctuation at frequency \(|f_1 - f_2|\)) that the ear perceives as roughness or tension.

This theorem is the physical foundation of the contrapuntal consonance hierarchy. The perfect consonances (unison, fifth, octave) share the most harmonics; the imperfect consonances (thirds, sixths) share somewhat fewer but still enough for clear fusion; dissonances share very few, producing the acoustic roughness that the mind associates with incompleteness and motion. Species counterpoint is, in this light, a system for managing the density of shared harmonics between simultaneously sounding voices — for ensuring that the texture maintains sufficient harmonic resonance on strong beats while permitting localized acoustic tension on weak beats.

1.3 Consonance and Dissonance in Polyphony

Understanding intervals in the abstract is only the first step. In a polyphonic texture, intervals acquire meaning through their context: where they appear in the measure, how they are approached, and how they resolve. Renaissance counterpoint, as codified by Fux in the Gradus ad Parnassum (1725), operates according to a set of rules that might initially appear arbitrary but in fact encode centuries of accumulated aesthetic wisdom about the perception of simultaneous sound. The student who approaches these rules as a lawyer approaches statutes — seeking loopholes, looking for exceptions, asking what is technically permitted — will produce exercises that are formally correct and aesthetically lifeless. The student who approaches them as a musician — asking what each rule is trying to achieve, what perceptual or aesthetic goal it serves — will produce exercises that are both correct and genuinely musical. The goal of this chapter is to provide the understanding that makes the second approach possible: not just the rules, but the reasons.

The historical development of the consonance hierarchy is itself a story of aesthetic evolution. The earliest medieval polyphony — the organum of the Notre Dame school (twelfth and thirteenth centuries) — admitted only unisons, fifths, and octaves as consonances, treating thirds and sixths as active, unstable intervals requiring resolution. The late medieval period (Machaut, Dufay) began to treat thirds as consonances in certain contexts, and the early Renaissance (Josquin, Obrecht) completed the transition: by the early sixteenth century, thirds and sixths had become the primary harmonic intervals of polyphony, displacing the bare fifths and octaves of the medieval style. This evolution was not arbitrary; it reflected a genuine change in the acoustic aesthetic of Western music — a shift from the hollow, resonant power of perfect consonances to the warmer, fuller sound of the imperfect consonances that the prime 5 introduced. The species rules, by accepting both perfect and imperfect consonances as valid strong-beat intervals, reflect the fully developed Renaissance aesthetic in which both types of consonance are structurally available.

Theorem 1.1 (The Consonance Principle). In strict counterpoint, every note that falls on a strong beat (the downbeat, or the first of two equal beats) must form a consonance with every other simultaneously sounding voice. Dissonances may appear on weak beats, but only under specific conditions that guarantee their perceptual resolution.

The reasoning behind this principle is psychoacoustic as much as aesthetic. When two tones sound simultaneously, the auditory system attempts to fuse them into a single percept. Consonant intervals — those with simple frequency ratios — fuse readily because their overtone series share many components, producing a sensation of repose and stability. Dissonant intervals resist fusion because their overtone series produce many near-coincident partials that beat against each other, creating the acoustic sensation of roughness or tension. On a strong beat — the moment of metric emphasis — the ear expects stability. A dissonance on the downbeat strikes the listener as a collision, something demanding immediate correction. On a weak beat, the same dissonance registers as motion, a passing moment of tension swept away before the next strong beat arrives. The rules of species counterpoint are, in this sense, a grammar of perceptual expectation, mapping the physics of acoustics onto the psychology of metric time.

Remark 1.2 (Historical Context of the Gradus ad Parnassum). Fux's Gradus ad Parnassum was published in Vienna in 1725, the same year as Bach's completion of the first book of the Well-Tempered Clavier. Although Fux was summarizing a style already one hundred years old — that of Palestrina's Roman polyphony — his pedagogical method proved extraordinarily durable. Haydn studied it under his own instruction, drawing on it throughout his career; Mozart worked through it at his father Leopold's insistence; Beethoven used it as the basis of his counterpoint studies with Haydn and Albrechtsberger. The species method Fux introduced — five progressively more complex species of counterpoint, each isolating a distinct voice-leading technique — remains the standard entry point into contrapuntal training to this day, more than three centuries after its invention.

The student must cultivate the habit of hearing intervals not as isolated sonic events but as points on a trajectory of tension and release. A major sixth leading to a perfect octave through contrary motion is not simply two consecutive intervals; it is a gesture of closure, a cadential arrival. A minor seventh dissolving to a major third through stepwise contrary motion is a suspension resolving — a moment of friction yielding to consonance, of harmonic pressure released. Developing this sense of intervallic trajectory, this ability to hear each interval in the context of where it came from and where it is going, is the most important cognitive skill in the study of counterpoint. No amount of rule-memorization substitutes for it.

1.4 Interval Inversion and Its Properties

A fundamental operation on intervals that is indispensable in counterpoint, and especially in the study of invertible counterpoint and fugue, is interval inversion: the transposition of the lower note of an interval up by an octave (or equivalently, the transposition of the upper note down by an octave), converting the interval \(i\) to the interval \(9 - i\) (using interval numbers from 1 to 8).

Theorem 1.2 (Interval Inversion and Consonance Preservation). Under interval inversion (interval number \(i \mapsto 9 - i\)):
  • Unison (1) inverts to octave (8): both are perfect consonances.
  • Second (2) inverts to seventh (7): both are dissonances.
  • Third (3) inverts to sixth (6): both are imperfect consonances.
  • Fourth (4) inverts to fifth (5): both are perfect consonances (with the caveat about the fourth in two-voice writing).
Interval inversion thus preserves the consonance/dissonance classification in all cases except the fourth/fifth pair, where both are perfect consonances but the fourth has a context-dependent dissonant function when above the bass. This asymmetry is the principal constraint on invertible counterpoint at the octave.

The quality of an interval also changes under inversion in a predictable pattern: major inverts to minor, minor to major, augmented to diminished, diminished to augmented, and perfect remains perfect. Thus the major third (M3) inverts to the minor sixth (m6), the minor third (m3) inverts to the major sixth (M6), the major sixth (M6) inverts to the minor third (m3), and so on. These quality inversions have practical consequences for the harmonic colour of inverted passages: a passage that featured warm major sixths in its original form will feature brighter minor thirds in its inversion, changing the expressive character even while the structural contrapuntal logic remains valid.

1.5 The Four Types of Motion

When two voices move simultaneously, their relative motion falls into one of four categories, each with distinct contrapuntal properties and associated rules. The classification of motion types is not merely descriptive; it is a framework for analyzing and managing the independence of voices, which is the fundamental goal of all polyphonic writing.

Definition 1.3 (Types of Melodic Motion). Let \(v_1\) and \(v_2\) be two voices at pitches \(p_1^{(t)}\) and \(p_2^{(t)}\) at time \(t\), moving to \(p_1^{(t+1)}\) and \(p_2^{(t+1)}\). Define \(\delta_i = p_i^{(t+1)} - p_i^{(t)}\) (positive upward, negative downward, measured in semitones). Then:
  • Contrary motion: \(\delta_1\) and \(\delta_2\) have opposite signs — one voice ascends while the other descends.
  • Similar motion: \(\delta_1\) and \(\delta_2\) have the same sign but \(|\delta_1| \neq |\delta_2|\) — voices move in the same direction by different intervals.
  • Oblique motion: exactly one of \(\delta_1, \delta_2\) is zero — one voice holds a sustained pitch while the other moves.
  • Parallel motion: \(\delta_1 = \delta_2 \neq 0\) — voices move in the same direction by the same interval, preserving the intervallic distance between them exactly.

Contrary motion is the gold standard of contrapuntal writing. When the voices move in opposite directions, they assert their independence most forcefully: neither voice seems to pull or follow the other, and the result is a texture of genuine polyphonic equality. The two voices trace arcs that are, in a sense, mirror images, and the resulting sonic impression is one of balance, dialogue, and genuine two-part composition. It is no accident that cadences — the moments of greatest formal weight in a contrapuntal piece — typically employ contrary motion in their approach: the soprano rising by step from the leading tone to the tonic octave while the bass descends by a fourth or fifth from the dominant to the tonic.

Similar motion is less independent than contrary but entirely permissible, provided it does not produce forbidden parallel intervals. The two voices travel in the same direction, implying a certain gravitational pull between them, but the difference in interval size prevents their complete assimilation. Oblique motion, in which one voice sustains while the other moves, provides textural contrast and is particularly useful in the creation of suspensions in fourth-species counterpoint — the stationary voice creates the suspended dissonance while the moving voice provides the harmonic change. Parallel motion — the most restricted of the four types — preserves the interval between the voices exactly, and when that interval is a perfect consonance, the result is the notorious parallel fifth or parallel octave, the most rigorously forbidden event in strict counterpoint.

In practice, the experienced contrapuntist does not consciously categorize every interval of motion as contrary, similar, oblique, or parallel; the categories are internalized through exercise to the point where the ear immediately flags parallel motion into a perfect consonance as a perceptual wrong note, requiring no conscious analysis. The goal of the species exercises is precisely this internalization — the formation of a musical instinct that operates below the level of deliberate thought, so that the conscious mind is freed to attend to the higher-level questions of melody, harmony, and expression. The motion-type classification is the analytical vocabulary; the developed ear is the goal.

Example 1.1 (Motion Types in a Two-Bar Passage). Consider two voices in C major over two measures. Soprano: C5–D5–E5–F5 (ascending step motion). Bass: G3–F3–G3–C3 (descending step, ascending step, descending fourth). At beat 1: soprano C5, bass G3 — perfect fifth. Beat 2: soprano D5 (up a second), bass F3 (down a second) — these two voices move by the same interval size in opposite directions: contrary motion, arriving at a minor sixth (D5–F3). Beat 3: soprano E5 (up a second), bass G3 (up a third) — similar motion (both ascending), arriving at a major sixth (E5–G3). Beat 4: soprano F5 (up a second), bass C3 (down a fifth) — contrary motion, arriving at a major thirteenth (= major sixth plus an octave, F5–C3). The motion types alternate throughout, with contrary motion dominating and providing the strongest sense of voice independence. The intervals are entirely consonant (fifth, sixth, sixth, thirteenth), and the passage is a valid first-species example.

1.6 Forbidden Parallels and Voice Independence

The prohibition against consecutive perfect fifths and consecutive perfect octaves (often called “parallel fifths” and “parallel octaves”) is the single most famous rule in counterpoint. Students encounter it early, violate it by instinct, and spend years developing the ear to detect and avoid it. Understanding why this prohibition exists — not merely that it does — is essential to genuine contrapuntal thinking, and it is one of the great satisfactions of a rigorous counterpoint education.

It is instructive to consider what parallel fifths actually sound like, and why the ear objects to them. Play, at the piano or at any instrument, the progression C–G in the bass against E–B in the soprano (moving from an interval of a sixth to an interval of a fifth as both voices ascend by a step). This is a hidden fifth, not a parallel fifth: the preceding interval was a sixth. Now play C–G in the bass against G–D in the soprano (moving from an interval of a fifth to another fifth as both voices ascend by a step). This is the parallel fifth. The difference is subtle but audible: the parallel fifth creates a momentary sense that the two voices have merged into a single “super-voice” harmonized at the fifth, while the hidden fifth merely approaches the fifth from a different angle, preserving the voices’ independence. The prohibition captures this perceptual reality: parallel fifths are not merely wrong by convention, they are wrong by the logic of polyphonic perception. The rule is the formalization of an auditory fact.

Theorem 1.2 (The Prohibition of Parallel Fifths and Octaves). In strict counterpoint, two voices must not move in parallel motion from one perfect fifth to another, nor from one perfect octave to another. This prohibition applies regardless of which voices are involved, regardless of whether the parallels are in the same or different octaves (so-called "hidden" or "covered" parallels at the octave are still prohibited), and whether the parallels occur on consecutive beats or across a bar line.

The acoustic justification is subtle but profound. A perfect fifth, with its ratio of \(3:2\), produces such strong resonance that the two voices fuse nearly into one acoustic entity. The fundamental of the lower voice and the first overtone of the upper voice are very close in frequency; the second overtone of the lower voice is identical to the fundamental of the upper voice. This overtone sharing creates the characteristic hollow yet powerful sound of the perfect fifth, a sound that is perceptually singular — we do not easily hear it as two independent tones. When two voices move in parallel fifths — say, from C–G to D–A — the listener hears not two independent melodic lines but a single melodic entity harmonized at the fifth. The contrapuntal goal of independent polyphonic voices is thus defeated: the texture collapses from two distinct melodic personalities into a single melody with a fixed harmonic shadow. The voices have ceased to be voices in the genuine sense and have become a single compound instrument.

The same principle applies even more forcefully to parallel octaves. The octave ratio of \(2:1\) causes the upper voice to coincide with the second partial of the lower voice; the two tones fuse so completely in perception that listeners frequently do not hear them as two separate voices at all. Consecutive octaves produce the effect of a single voice doubling at the octave — not counterpoint, but unison writing with added thickness. The prohibition on parallel octaves is therefore not so much a stylistic rule as a definition: music that proceeds in consecutive octaves is not polyphony and has no claim to being treated as counterpoint.

Definition 1.4 (Hidden and Direct Fifths and Octaves). A direct (or hidden) fifth occurs when two voices move in similar motion to a perfect fifth, with at least one voice moving by a leap rather than by step. A direct octave is the analogous event at the octave. These are treated with more leniency than true parallel fifths and octaves but are still prohibited when they occur between the outer voices (soprano and bass), both of which move by leap, in strict counterpoint. The rule is that similar motion into a perfect consonance is permissible only when the upper voice moves by step.
Remark 1.3 (Why Perfect Fourths Are Treated Differently). The perfect fourth, with ratio \(4:3\), shares the acoustic stability of the fifth yet is treated differently in two-voice counterpoint. When the fourth appears above a bass voice without an additional third below it (i.e., without completing a chord of which the fourth is not the lowest interval), it sounds as a dissonance — the bass seems to demand resolution. This context-sensitivity of the fourth is one of the most striking features of common-practice harmony: an interval that is acoustically consonant functions as a dissonance in specific registral contexts. Parallel fourths between upper voices, however, are treated very differently — they are permitted and even idiomatic, since neither voice functions as a harmonic bass for the other. The distinction reveals a crucial principle: dissonance is not solely a property of the interval itself but of the interval in its registral context.

1.7 Voice Independence as Aesthetic Ideal

Before turning to voice ranges and melodic writing, it is worth pausing to articulate the aesthetic ideal that the prohibition on parallel fifths and octaves is designed to protect: the ideal of voice independence. In a genuine polyphonic texture — as opposed to a single melody harmonized homophonically — every voice has its own melodic identity, its own rhythmic character, its own harmonic trajectory, and its own arc of tension and release. The voices coexist in the same acoustic space without merging into each other: they are, in the phenomenological sense, distinct musical subjects, each with its own “voice” in the speech-act sense as well as the musical sense.

This ideal of voice independence is not merely a technical requirement but an ethical one, in the sense that it reflects a musical value: the belief that multiple perspectives are more interesting than one, that dialogue is richer than monologue, that the simultaneous pursuit of independent melodic goals can produce a whole that is greater than the sum of its parts. The polyphonic ideal — multiple equal voices, each maintaining its identity while cooperating in a common harmonic goal — is a musical expression of an aesthetic and philosophical commitment that the Western musical tradition has upheld, with varying degrees of consistency, from the twelfth century to the present. The study of counterpoint is, at its deepest level, the study of how this ideal is realized in musical practice.

1.8 Voice Ranges and Melodic Writing

Counterpoint is not an abstract mathematical exercise; it is vocal music. Even when written for instruments, the contrapuntal tradition conceives of each voice as a human singer with a natural range, a characteristic tessitura, and physical limits on the intervals it can execute comfortably and expressively. The traditional four voices — soprano, alto, tenor, bass (SATB) — each occupy a range of roughly a tenth to a twelfth, and the rules of melodic writing in counterpoint are calibrated to what a trained singer can render naturally.

Definition 1.5 (Standard Voice Ranges). The conventional ranges for four-voice counterpoint are:
  • Soprano: C4 (middle C) to G5, with tessitura D4–E5.
  • Alto: G3 to C5, with tessitura A3–D4.
  • Tenor: C3 to G4, with tessitura D3–E4.
  • Bass: E2 to C4, with tessitura F2–D3.
These ranges are guidelines rather than absolute constraints; exceptional notes at the extremes are permissible for expressive effect but should not be sustained or approached carelessly.

Good melodic writing in counterpoint is governed by principles of singability and shape. A well-formed contrapuntal melody rises and falls with a clear arc; it reaches a single climactic high point (or occasionally a low point) and then descends (or ascends) to its conclusion. It balances stepwise motion with occasional leaps, and large leaps — sixths, sevenths, octaves — are compensated by stepwise motion in the opposite direction immediately after the leap. This principle of leap compensation reflects the physical reality of singing: after a large ascending leap, the voice naturally wants to fall back; the compensation gives the melodic line the sense of recovering its balance. A melody that leaps repeatedly in the same direction without compensation feels driven to an extreme, unstable.

Theorem 1.3 (Melodic Intervals in Strict Counterpoint). The following intervals are generally prohibited in the melodic line of strict counterpoint:
  • Augmented intervals of any size (the augmented second A2, augmented fourth A4) — difficult to sing in tune and stylistically alien to Renaissance polyphony.
  • Diminished intervals, particularly the diminished fourth and diminished seventh — similarly difficult and anguished in effect.
  • Any leap larger than an octave — beyond the comfortable range of the voice.
  • Two consecutive leaps in the same direction that together span an augmented or diminished interval, or that span a seventh or more — these create the effect of a hidden augmented or diminished interval and disrupt melodic coherence.
The seventh is generally avoided as a melodic leap; the major sixth is permitted but should be followed immediately by stepwise motion in the opposite direction. The minor sixth is similarly treated.

A melody in strict counterpoint should be comprehensible and singable on its own, independent of the other voices: it should feel like a melody, not merely a series of intervals chosen to avoid forbidden parallels. This demand for melodic integrity in each voice — the requirement that every voice be worth singing alone — is what distinguishes genuine counterpoint from chord-writing. In a Bach two-part invention, if one isolates either voice and plays it alone, one hears a complete, shapely, and interesting melody. This is not a coincidence or a by-product; it is a fundamental design criterion of contrapuntal composition.

1.9 Palestrina and the Codification of Renaissance Style

Giovanni Pierluigi da Palestrina (c. 1525–1594) stands at the apex of the Renaissance polyphonic tradition, and his style — smooth, carefully controlled, rigorously avoiding harsh dissonances, animated by flowing voice-leading — became the canonical model that Fux codified in the Gradus ad Parnassum and that the species system was designed to teach. Understanding Palestrina’s style is inseparable from understanding why the species rules take the form they do: every rule encodes an aesthetic judgment that Palestrina made, distilled from the accumulated practice of the Roman polyphonic school.

Example 1.2 (Palestrina's "Pope Marcellus Mass," Kyrie). The Kyrie of Palestrina's Missa Papae Marcelli (published 1567) is perhaps the most celebrated example of Renaissance polyphony and a perfect demonstration of the species rules in action. The six voices (SSAATB) move primarily in half and quarter notes (species 2 and 3) against each other, with occasional whole-note sustained pitches (species 1) creating moments of relative repose. Every dissonance — and there are many, mostly passing tones and 7–6 suspensions — is approached and quitted by step, and every strong-beat note is consonant with all simultaneously sounding voices. The texture is so smooth, so perfectly voice-led, that the harmonic changes seem to flow inevitably from each other with no sense of the rigorous constraint that underlies every note. This smoothness is the aesthetic ideal of Renaissance polyphony, and it is what the species rules, properly internalized, are designed to produce.

The contrast between Palestrina’s smooth style and the slightly more adventurous writing of his contemporary Orlando di Lasso (Roland de Lassus, c. 1532–1594) illustrates that the species rules codify one approach to Renaissance polyphony rather than the only possible one. Lasso’s music uses dissonance more freely and admits a wider range of melodic intervals, including occasional augmented seconds in expressive contexts. The species system, by targeting the Palestrina style specifically, teaches the most constrained and therefore most revealing form of Renaissance counterpoint, from which the student can relax the constraints in controlled ways as they progress toward free composition. Starting with the strictest style and relaxing its rules is always more productive, pedagogically, than starting from freedom and trying to impose constraints retroactively.

1.11 Modes and the Historical Context of Renaissance Polyphony

Before the full establishment of major-minor tonality in the seventeenth century, the harmonic framework of polyphonic music was modal rather than tonal. The eight church modes — Dorian, Phrygian, Lydian, Mixolydian, and their plagal variants — each defined a characteristic scale pattern, a preferred final (the modal equivalent of the tonic), and a preferred dominant (the reciting tone, or tenor). Renaissance counterpoint is fundamentally modal counterpoint, and the study of species counterpoint as Fux formulated it is, strictly speaking, a study of modal polyphony in the style of Palestrina.

Definition 1.6 (The Church Modes). The authentic modes of Renaissance polyphony, with their finals and characteristic scale patterns on the white keys of the piano, are:
  • Dorian: final D, scale D–E–F–G–A–B–C–D (minor with raised sixth).
  • Phrygian: final E, scale E–F–G–A–B–C–D–E (characterized by the semitone between final and second degree).
  • Lydian: final F, scale F–G–A–B–C–D–E–F (major with raised fourth — the tritone above the final).
  • Mixolydian: final G, scale G–A–B–C–D–E–F–G (major with lowered seventh).
  • Aeolian: final A, scale A–B–C–D–E–F–G–A (the natural minor scale).
  • Ionian: final C, scale C–D–E–F–G–A–B–C (the major scale).
Each plagal mode uses the same final as its authentic counterpart but spans the range a fourth below rather than a fifth above the final.

The modal system profoundly influences the character of Renaissance counterpoint in ways that extend beyond the simple question of which pitches are available. Each mode has its own characteristic melodic gestures, its own range of available cadential formulas, and its own emotional character as perceived by Renaissance theorists. The Phrygian mode, for instance, with its unique semitone at the bottom (E–F rather than the whole step at the bottom of all other modes) produces a characteristic cadential gesture — the Phrygian cadence, in which the bass descends by semitone (E to D#, or in transposition any descent by semitone) — that sounds strikingly different from all other cadential approaches and persists in tonal music as a special expressive device even after the modal system was supplanted by major-minor tonality.

Remark 1.3 (Musica Ficta and Chromatic Inflection). Renaissance composers did not always adhere strictly to the diatonic pitches of the mode. The practice of musica ficta — the performer's (or scribe's) insertion of accidentals not written in the score, to avoid the tritone in melodic writing or to sharpen the leading tone at cadences — introduced systematic chromatic inflection into modal polyphony. The F# frequently introduced into the Dorian mode (to avoid the tritone F–B) and the G# introduced at cadences in the Dorian mode (to create a leading-tone approach to A) gradually eroded the distinction between the Dorian mode and D minor; the systematic sharpening of the seventh at cadences in all modes contributed to the emergence of major-minor tonality from the modal system. Counterpoint, in this historical view, was one of the principal forces driving the evolution of Western harmonic language.

The practical consequence for species counterpoint exercises is that the student must be aware of the modal framework of the cantus firmus and adjust the counterpoint accordingly. A Dorian cantus firmus — ending on D, with a characteristic B-natural in ascent and B-flat in descent — demands that the counterpoint respect these inflections and employ the Phrygian-cadence gesture (E to D in the bass) at the final cadence rather than the major-mode leading-tone approach. Understanding the modal character of the cantus is inseparable from the craft of good counterpoint in the Renaissance style.


Chapter 2: First and Second Species Counterpoint

2.1 First Species: Note Against Note

First species counterpoint is the most rigorous and revealing of the five species, because it strips away all rhythmic differentiation and presents the harmonic skeleton of the polyphonic texture in the starkest possible form. In first species, the counterpoint voice adds one note for each note of the cantus firmus (the given melody), and every vertical interval between the two voices must be consonant. There is no rhythmic complexity to distract from the interval-by-interval logic; every choice is exposed and consequential.

Definition 2.1 (Cantus Firmus and First Species Counterpoint). A cantus firmus (Latin: "fixed song") is a given melody, traditionally presented in whole notes, that serves as the harmonic basis for a contrapuntal exercise. A first-species counterpoint is a voice added above or below the cantus such that: (1) exactly one note sounds in the added voice for each note of the cantus; (2) every vertical interval formed between the two voices is a consonance; (3) no consecutive perfect fifths or octaves occur between any two adjacent pairs of notes; (4) the counterpoint begins on a perfect consonance (unison, fifth, or octave); and (5) the counterpoint ends on a unison or octave, approached by contrary stepwise motion.

The opening and closing requirements are both formal and acoustic. A polyphonic piece must establish and confirm the mode (or key), and in strict counterpoint the strongest declarations of modal centre are the perfect consonances. The approach to the final octave or unison by contrary motion — typically the soprano moving up by step (the leading tone to tonic) and the bass moving down by step or fourth (dominant to tonic) — is the prototype of all cadential motion in Western music. Every perfect authentic cadence in common-practice harmony is an elaboration of this basic gesture, ornamented by harmonic rhythm, dominant seventh chords, and rhythmic displacement, but structurally reducible to this two-voice framework.

Example 2.1 (A First-Species Counterpoint in C Major). Let the cantus firmus in the tenor present the notes C4–D4–F4–E4–D4–C4 in whole notes. A soprano counterpoint might proceed: E5–F5–A4–G4–F4–E4, producing the interval sequence major tenth (= major third), minor tenth (= minor third), minor third, minor third, minor third, major third. Every interval is an imperfect consonance; contrary motion appears at the first, third, and fourth note-pairs; and the final interval is a major third — a weak but permissible close. A stronger setting aims for an octave at the close: soprano proceeding E5–D5–C5–B4–A4–C5, producing the interval sequence major tenth, minor seventh (error — dissonant!), and so on. The exercise of finding a good first-species counterpoint requires simultaneously satisfying all the conditions, which forces systematic thinking about interval sequences and motion types.

The rule against hidden fifths and octaves (Definition 1.4) adds a further constraint. A hidden fifth occurs when two voices move in similar motion to a perfect fifth, with at least one voice leaping. Even though the preceding interval was not a fifth, the sudden parallel approach to the fifth creates an acoustic impression of parallel-fifth motion. In strict first species, hidden fifths between the outer voices are permitted only when the upper voice moves by step — which guarantees that the stepwise motion distinguishes the two voices sufficiently to prevent the impression of fusion.

2.2 The Cantus Firmus and Its Role

The cantus firmus — the “fixed song” — occupies a central position in the pedagogical tradition not merely because it provides a harmonic framework for the counterpoint exercise but because it connects the student to the deepest roots of the polyphonic tradition. In the medieval organum, a plainchant melody was held in long notes in the lower voice (the tenor, from Latin tenere, to hold) while upper voices added florid elaborations above it. Fux’s cantus firmus exercises recreate this structure in miniature, connecting the student through a continuous practice to the origins of Western polyphony in the chant tradition of the medieval church.

Remark 2.1 (Selecting a Cantus Firmus). A well-chosen cantus firmus for pedagogical exercises has several properties: (1) it moves primarily by step, with occasional leaps that are musically justified and followed by compensating contrary motion; (2) it has a single climactic high point near its midpoint, with a gradual ascent and descent flanking it; (3) it is modal in character, avoiding chromaticism, and clearly implies a modal final at its last note; (4) it is neither too short (fewer than six notes) nor too long (more than twelve or fourteen notes), so that a complete contrapuntal exercise can be written against it within a single page. The classic cantus firmi in Fux's Gradus ad Parnassum — stepwise melodies of eight to twelve whole notes, moving through clearly defined modal spaces — remain the best pedagogical examples after three centuries.

The student should be aware that the cantus firmus is not merely a harmonic given but a melodic partner. Even in first species, where the added voice has the responsibility for creating all the melodic interest (since the cantus moves in whole notes at the same rate as the counterpoint), the added voice must be composed in dialogue with the cantus, not merely in avoidance of the cantus. A good first-species exercise feels like a conversation — question and answer, rise and fall, tension and resolution — not like a harmonic calculation.

2.3 Rules and Good Melodic Intervals

In first species, because every note is exposed on a strong beat and every interval is directly audible, the quality of the melodic intervals in the counterpoint voice matters enormously. A first-species counterpoint that consists entirely of stepwise motion has no melodic interest and fails as a melody even if it respects all the harmonic rules. One that contains too many leaps becomes angular and unsingable. The art is in the balance.

Theorem 2.1 (Preferred Intervals and Motion in First Species). In a well-formed first-species counterpoint:
  • Stepwise motion (seconds) and the consonant skip of a third should predominate, accounting for the majority of melodic motion.
  • Larger leaps (fourths, fifths, sixths) should be exceptional and should be approached and quitted by contrary stepwise motion.
  • The counterpoint should have a clear melodic arc — rising to a single climax and descending to the cadence — rather than oscillating without direction.
  • Contrary motion between the voices should predominate; similar, oblique, and (especially) parallel motion should be used sparingly and with awareness of their harmonic implications.

It is worth dwelling on the aesthetic rationale for these melodic preferences. The insistence on a single melodic climax reflects the Renaissance understanding of melody as a purposive arc — a gesture that begins, rises to a point of maximum energy, and falls to a point of repose. A melody that oscillates or wanders without direction lacks this purposiveness; it fails to communicate anything. The preference for contrary motion between the voices reflects the polyphonic ideal of independence: when both voices move in contrary directions, neither dominates, and the listener hears a genuine dialogue between equals. These are not merely technical rules but aesthetic principles that the great masters of counterpoint internalized as the deep grammar of musical discourse.

2.4 Approaching the Cadence

The cadence — from the Latin cadere, to fall — is the moment of formal arrival at which melodic and harmonic motion converges on a point of rest. In first species counterpoint, the cadence is structurally simple but musically decisive: it is the moment at which all the tension of the preceding intervals resolves to the stability of a perfect consonance. Understanding the cadence as a structural event — not merely a formula to be applied but a moment of formal conclusion that must be prepared by the preceding material — is essential to genuine contrapuntal thinking. A well-prepared cadence feels inevitable; the final perfect consonance is felt as the answer to a question that the entire exercise has been asking. A poorly prepared cadence feels arbitrary — the exercise simply stops, without the sense that it has reached its destination.

Remark 2.2 (Cadential Preparation). In first species, the cadence is prepared by the penultimate interval, which should create a strong sense of motion toward the final. The most effective preparation is the interval of a sixth below the final octave (e.g., the soprano on B4 and the bass on D3, forming a major sixth, immediately before the soprano rises to C5 and the bass falls to C3 for the final octave). The major sixth creates a strong tendency toward the octave because the soprano's leading tone (B4) urgently desires to resolve to C5 while the bass's D3 desires to resolve to C3. The two resolutions happen simultaneously, in contrary motion, producing the most decisive possible cadence. This sixth-to-octave (or sixth-to-unison) cadential motion is the species counterpart of the V–I harmonic resolution in tonal music — both encode the same fundamental impulse toward tonal closure, expressed at different levels of abstraction.
Theorem 2.2 (Cadential Approach in First Species). The final interval of a first-species exercise must be a unison or octave, approached by contrary stepwise motion. Specifically, if the cantus ends \(\hat{2}–\hat{1}\) (scale degrees 2 descending to 1), the counterpoint placed above the cantus should end \(\hat{7}–\hat{8}\) (leading tone ascending to the tonic octave). The resulting contrary motion produces the characteristic clausula vera, or authentic cadence in two voices.

This cadential formula encodes the fundamental logic of tonal harmonic motion. The leading tone’s ascending semitone to the tonic creates a sense of inevitable arrival — the slight dissonance of the diminished seventh between \(\hat{7}\) and the bass \(\hat{1}\) (in a tonal context where \(\hat{7}\) sits above the dominant) resolves outward. The simultaneous descent of the bass reinforces the finality of the cadence. Every formal cadence in common-practice harmony is an elaboration of this gesture; the dominant seventh chord formalizes the harmonic basis of the leading tone’s urgency; but the underlying two-voice motion remains constant from the fifteenth century to the nineteenth.

2.5 Second Species: Two Notes Against One

Second species introduces rhythmic differentiation into the contrapuntal texture. Against each whole note of the cantus firmus, the counterpoint voice adds two half notes, producing a 2:1 ratio of motion. This ratio creates a crucial distinction between strong beats (the downbeat, where the half note coincides with the whole note) and weak beats (the midpoint of the bar, where the half note sounds against the sustained cantus note). Dissonance becomes available for the first time — but only in strictly prescribed circumstances that ensure perceptual resolution.

Definition 2.2 (Second Species Counterpoint). A second-species counterpoint places two half notes against each whole note of the cantus firmus. The rules are: (1) the note on the strong beat (downbeat) must be consonant with the cantus; (2) the note on the weak beat may be a passing tone (dissonant, approached and quitted by stepwise motion in the same direction), a neighbor tone (dissonant, approached by step and returning by step to the same note), a consonant skip (leaping to a consonance on the weak beat), or a nota cambiata; (3) consecutive parallel fifths or octaves between any two strong beats, or between a strong and the immediately following weak beat, are prohibited; (4) the exercise begins on the downbeat with a perfect consonance.

The passing tone is the central device of second species. It fills in the space between two consonances by stepwise motion, and the fact that it is dissonant on the weak beat is acoustically tolerable because the ear, having registered the strong-beat consonance, perceives the dissonant passing tone as motion — a flash of tension swept aside before the next strong beat arrives. The weak beat is, in this sense, a zone of relative harmonic permission: not lawless, but operating under a more lenient code than the strong beat. The ear’s tolerance for dissonance on a weak beat is approximately inversely proportional to the metric weight of that beat; in compound meter, where some weak beats are stronger than others, the rules must be calibrated accordingly.

Example 2.2 (Passing Tones in Second Species). Suppose the cantus in the bass holds C4 as a whole note. The soprano counterpoint moves from E5 on the downbeat (major tenth — consonant) through D5 on the weak beat (major ninth — dissonant passing tone) to C5 at the opening of the next measure against the next cantus note. The passing tone D5 is dissonant against C4 (major ninth = major second expanded by an octave), but it is on the weak beat, approached by descending step from E5, and quitted by descending step to the next bar's consonance. Its dissonance is perceptually absorbed into the motion from E5 downward; it is a step in a trajectory, not a harmonic event in itself.

2.6 The Cambiata and Consonant Skip

Second species permits two additional melodic devices beyond the passing tone and neighbor tone: the nota cambiata (changing-note figure) and the consonant skip. The cambiata is a four-note figure — step down, skip down a third, step up — that was a Renaissance cliché, widely used by Palestrina and his contemporaries. Although the third note of the figure (the note arrived at by the downward skip) is dissonant and is quitted by ascending step rather than by the expected resolution, the figure was so thoroughly conventionalized by Renaissance practice that strict species pedagogy admits it as an exception. This is a notable moment in the rules of counterpoint: an irregularity tolerated because it has become a stylistic formula.

Definition 2.3 (Nota Cambiata and Consonant Skip). The nota cambiata is the four-note figure that, in second species, descends by step, then leaps down a third, then ascends by step (e.g., E–D–B–C), where D is the weak-beat dissonance quitted by the downward skip rather than resolved by step. The consonant skip is a leap on the weak beat to a consonance with the cantus, used when stepwise continuation would produce a forbidden parallel or an awkward melodic line. Both devices enrich the melodic vocabulary of second species beyond what the passing-tone and neighbor-tone rules alone would permit.

The consonant skip provides melodic flexibility when the strict alternation of strong consonances and weak passing tones would produce an uncomfortably stepwise, monotonous line. A well-formed melodic contour requires occasional leaps, and the consonant skip — always to a consonance — satisfies this need while respecting the fundamental requirement that weak-beat activity resolve perceptually before the next downbeat. The consonant skip is not a dissonance treatment at all; it is a melodic option that happens to occur on a weak beat, and its consonance ensures that no special resolution is needed.

2.7 Melodic Independence in Two-Voice Writing

A persistent danger in two-voice counterpoint — one that afflicts student exercises with disproportionate frequency — is the tendency for the added voice to lose its melodic character and become merely a harmonic adjunct to the cantus firmus. This failure is partly a matter of attention — the student, focused on the vertical interval relationships, neglects to hear their added voice as a melody — and partly a matter of conception: the student thinks of themselves as “harmonizing” the cantus rather than “conversing” with it. The corrective is a change of perspective: one must compose the added voice as if it were the primary melody, temporarily forgetting the cantus, and then check that the resulting combination respects the interval rules. A voice composed with melodic integrity will almost always produce better counterpoint than one composed from the interval outward.

Remark 2.3 (The Role of Sequences in First and Second Species). In free composition, sequences — the repetition of a melodic pattern at successive pitch levels — are indispensable for melodic development. In strict species counterpoint, sequences are formally permitted but must be used with care: a sequence that repeats a pattern involving a weak-beat dissonance will repeat the same dissonance type at each level of the sequence, which is acceptable (the dissonance treatment remains consistent) provided no sequence level produces a forbidden parallel or an impermissible dissonance. The most common sequential pattern in second species is the ascending chain of passing tones — the soprano moves stepwise upward through a third or fourth at each beat, each dissonance treated as a passing tone against the sustained cantus — which is both harmonically valid and melodically natural, creating the sense of purposive upward motion that a good second-species melody requires.
A voice that simply moves from consonance to consonance without regard for its own melodic shape is not a genuine second voice; it is a series of harmonically correct responses, devoid of musical character. The discipline of first species counterpoint, precisely because it demands consonance at every note, creates the temptation to think exclusively harmonically, choosing each note of the counterpoint for its vertical relationship to the cantus rather than for its horizontal relationship to the preceding and following notes in its own line.
Theorem 2.3 (The Priority of Melodic Logic). In species counterpoint, the melodic integrity of the added voice takes priority over the variety of intervals used. A first-species counterpoint that proceeds entirely in parallel thirds — always choosing the third above or below the cantus — is harmonically correct (thirds are consonant) but melodically inert (the added voice is simply a transposition of the cantus, with no independent identity). Such a counterpoint fails as genuine two-part writing even though it technically violates no rule except the unstated but fundamental one: the voices must be independent.

The test of melodic independence is the single-voice test: remove one voice from the texture and play or sing the other alone. A genuinely independent voice will sound like a complete and interesting melody in isolation; a dependent voice will sound like an incomplete fragment whose only reason for existence is its relationship to the other voice. Every voice in a species exercise — and every voice in a Bach invention or fugue — should pass the single-voice test. This is the standard to which the student must hold themselves, and it is a higher standard than mere correctness.

Example 2.3 (Melodic Independence vs. Harmonic Correctness). Consider two soprano counterpoints above the cantus C–D–E–F–G in the bass. The first reads E–F–G–A–B — intervals of major third, minor third, minor third, major third, major third. Every interval is consonant; there are no parallel fifths or octaves. But the soprano simply moves in parallel thirds with the bass throughout. Remove the bass and sing the soprano: it is a perfectly fine D major scale (without the first note), but it has no identity independent of the bass that generated it. The second soprano reads G–G–C–C–D — intervals of fifth, fourth, minor sixth, fifth, fifth. Again, all consonant, no parallels. But now the soprano has a profile of its own: it sustains G twice (creating a moment of oblique motion against the ascending bass), leaps down to C (a fourth, followed by two stationary notes and a step to D). Remove the bass: the soprano line G–G–C–C–D has a clear melodic character — a sustained opening followed by a descent and a step up — that is recognizable in isolation. It is the second soprano that exhibits genuine melodic independence.

Chapter 3: Third, Fourth, and Fifth Species

3.1 Third Species: Four Notes Against One

Third species places four quarter notes against each whole note of the cantus, introducing the fastest melodic motion yet encountered and demanding a corresponding increase in the number of available voice-leading devices. The shift from two quarter notes (second species) to four (third species) is not merely a doubling of melodic activity; it changes the character of the contrapuntal texture fundamentally. In second species, the melody has time to breathe — two notes per cantus note, with each note occupying a full half beat and each melodic gesture clear and deliberate. In third species, the melody flows continuously, and the ear must follow a stream of quarter notes whose harmonic implications change at every beat. The contrapuntal challenge shifts from choosing the right interval to composing a genuinely flowing melody that respects all interval and motion-type constraints while maintaining melodic interest and direction. At this density — four times the speed of the cantus — the counterpoint voice has the character of a flowing, ornamented melody, and the student must manage not only vertical intervals but melodic shape, direction, and rhythmic continuity across the entire exercise. The challenge is no longer simply to avoid forbidden parallels but to compose a genuinely beautiful melodic line that observes the harmonic rules as a matter of course.

Definition 3.1 (Third Species Counterpoint). In third species, four quarter notes are placed against each cantus whole note. The first beat of each group of four (the downbeat) must be consonant. The remaining three quarter notes may be dissonant, provided they function as passing tones or neighbor tones. The double-neighbor figure (a note surrounded by its upper and lower neighbors before returning) is also permitted. Consecutive parallel fifths or octaves between any two beats — strong or weak, within the bar or across the bar line — are prohibited.

The abundance of notes in third species does not relax the fundamental prohibition against parallel fifths and octaves; if anything, the density of motion makes their detection more challenging. One must scan not only consecutive beats within the bar but also the connection between the fourth quarter note of one bar and the first quarter of the next — the bar-line connection that receives special scrutiny because it is the moment of metric renewal. Parallel fifths that straddle the bar line are just as forbidden as those within it, and more insidious because the eye tends to treat each bar as a self-contained unit. The discipline of systematically checking every bar-line connection — placing a mental bracket at the join of every two bars and verifying the motion type and interval quality — is an essential habit for the third-species student and for any contrapuntist writing at a fast rhythmic density.

Example 3.1 (Double-Neighbor Figure in Third Species). Against a sustained C4 in the cantus, the soprano counterpoint might move E4–F4–D4–E4, forming the interval sequence major third, perfect fourth (dissonant), major second (dissonant), major third. The figure begins and ends on the consonant major third, E4 above C4; the two middle notes F4 and D4 are the upper and lower neighbors of E4 respectively. Their dissonance on beats 2 and 3 is acceptable because (1) the figure is a recognized conventional pattern, (2) the structural note E4 appears at both the beginning and end of the figure, making the neighbor notes perceptually subordinate, and (3) the consonant frame (major thirds on beats 1 and 4) ensures that no dissonance falls on a strong beat.

The melodic demands of third species are substantial. The four-quarter-note stream must maintain a clear sense of direction and arc; it should not meander aimlessly from note to note. The most common defect in student third-species exercises is excessive stepwise motion without compensating leaps, producing a melody that seems to crawl rather than flow. Conversely, too many leaps produce an angular, disjunct line that loses melodic coherence. The ideal third-species counterpoint alternates between scalar runs and melodic skips with the naturalness of a well-conceived melody, always moving purposefully toward the cadence.

3.2 Third Species: Forbidden Parallels and the Battuta

The density of third species creates a specific problem with forbidden parallels that requires explicit attention. The most dangerous parallel in third species is the one that crosses the bar line — the parallel between the fourth quarter note of one bar and the first quarter note of the next. This parallel is particularly easy to miss in the process of composition because the bar line creates a visual break that the eye tends to treat as a harmonic break. But there is no harmonic break at the bar line in species counterpoint; the harmonic obligation continues without interruption from bar to bar.

Definition 3.2 (The Battuta). In third species, a battuta ("beat") error is a parallel fifth or octave between the fourth quarter note of one bar and the first quarter note of the next. The term derives from the Italian for "beat" and refers to the moment of metric renewal at the bar line. The battuta error is treated as equivalent to a parallel between consecutive strong beats within the bar: it violates the prohibition on consecutive perfect consonances between any two beats, regardless of bar placement.

A related issue is the treatment of parallel fourths in third species. While parallel fifths between any pair of beats are absolutely prohibited, and parallel octaves are prohibited between any strong-beat notes, parallel fourths between upper voices (not involving the bass) are permissible in third species, as they are in all species. The distinction between the outer-voice rules and the inner-voice rules becomes particularly important in three- and four-voice textures, where the rules governing the relationship of each voice to the bass must be distinguished from those governing the relationships between pairs of upper voices.

3.3 Fourth Species: Suspensions

Fourth species is arguably the most musically consequential of the five species, because it introduces the suspension — the device that generates more harmonic tension and expressive power than any other single technique in tonal music. The suspension is a held note from a preceding consonance that becomes dissonant against a new harmonic context on the downbeat, then resolves downward by step to a new consonance. Its characteristic effect — the sudden arrival of a long-anticipated dissonance on the metric accent, followed by the gentle release of resolution — is the emotional heart of an enormous proportion of Western tonal music, from the Renaissance motet through the Romantic symphony.

Definition 3.2 (Suspension). A suspension is a three-phase voice-leading figure:
  1. Preparation: the note that will be suspended is first heard as a consonance on a weak beat (or on the preceding strong beat).
  2. Suspension: the note is held (tied) into the next strong beat, where it becomes dissonant against the other voice(s).
  3. Resolution: the suspended note moves downward by step to a consonance on the following weak beat.
The standard suspension types in two-voice counterpoint are named by the interval of suspension followed by the interval of resolution: 7–6 (seventh resolving to sixth), 4–3 (fourth resolving to third), 9–8 (ninth resolving to octave), and 2–3 (the bass suspension, where the bass voice is the suspended voice — a second resolving to a third below).

The psychological power of the suspension is immense and is worth analyzing carefully. The preparation establishes the suspended note as a consonance — the ear hears it in a stable harmonic context and registers the note as “belonging” to the current harmony. When the other voice changes at the next downbeat, the held note suddenly becomes dissonant against the new harmonic context; the ear registers this as a collision, a moment of unresolved tension. The resolution — the downward step to a consonance — is thus heard as a release of that tension, an answering of the question the suspension posed. The entire three-phase figure is a paradigm of expectation, frustration, and fulfillment that mirrors the most basic patterns of human emotional experience.

What makes the suspension particularly powerful as a musical device is the combination of its metric placement and its preparation. A dissonance that appears without preparation — a sudden accented dissonance on the downbeat with no held-over note — would be jarring, a true collision. The preparation converts the dissonance from a collision to a suspension: the note has been there, it is familiar, it simply finds itself in an unexpected harmonic context. The listener’s response is not shock but sympathy — a sense that the suspended note must now find its way to resolution, and a sense of satisfaction when it does. This is the emotional logic of the suspension, and it is also the emotional logic of much of human experience: the familiar thing that finds itself displaced, suspended in a new context, and seeks its natural resolution.

Theorem 3.1 (Resolution Rules for Suspensions). A suspension must resolve by stepwise downward motion in the suspending voice. Upward resolution is prohibited in strict counterpoint. The resolution must arrive on a consonance. In a chain of suspensions — where the resolution of one suspension immediately becomes the preparation for the next — the resolving note is tied into the next downbeat, where it again becomes dissonant. This chain of suspensions can sustain a single emotional arch across many measures, each resolution simultaneously a point of momentary rest and the beginning of a new tension.

The prohibition on upward resolution is not arbitrary. The psychological effect of a suspension depends on the sense that the suspended note is “held back” by the weight of the harmonic change beneath it — suspended, in the gravitational sense, above a new floor. The downward resolution is the note “falling” to its natural resting place below. An upward resolution reverses this metaphor: the note “escapes” upward rather than resolving downward, producing a sense of avoidance rather than resolution. This is dramatically appropriate in certain expressive contexts (Bach uses upward resolutions as expressive exceptions in free composition) but stylistically incongruent with the sustained, inexorable quality of strict species counterpoint.

Example 3.2 (Chain of Suspensions in D Minor). Against a cantus descending by step from A3 to D3 (A–G–F–E–D), the soprano counterpoint in fourth species might proceed: B4 (tied to) B4 (resolving to A4, tied to) A4 (resolving to G4, tied to) G4 (resolving to F4, tied to) F4 (resolving to E4). The suspensions are: 9–8 (B4 over A3), 9–8 (A4 over G3), 9–8 (G4 over F3), 9–8 (F4 over E3). Each suspended ninth resolves to an octave, and the chain produces a long descending arc of cascading resolutions — one of the most powerful and satisfying gestures in two-voice counterpoint. Bach deploys exactly this texture in many of the slow movements of his keyboard suites and cantatas.

3.3 Fifth Species: Florid Counterpoint

Fifth species, called florid counterpoint, combines all the devices of the preceding four species into a freely flowing line that moves in mixed note values according to the musical demands of the moment. Fifth species is the culmination of the species method — the point at which the systematic isolation of individual voice-leading techniques yields to their free combination in a texture that approaches actual composition. A well-written fifth-species exercise should sound like music, not like a pedagogical drill.

Definition 3.3 (Fifth Species Counterpoint). Fifth species is a combination of all previous species in a single voice. The counterpoint may move in whole notes, half notes, quarter notes, or brief eighth-note ornaments (particularly before cadences), and may include tied notes functioning as suspensions. The choice of note values is governed by musical judgment rather than fixed rules: suspensions should appear at cadences and at points of harmonic emphasis; quarter-note motion provides the primary melodic substance; half notes create moments of relative repose; and eighth notes appear as ornamental turns, anticipations, or cadential embellishments.

The transition from strict species to free composition is not a relaxation of standards but a change of the kind of discipline required. In the earlier species, the discipline is external: rules are applied one by one to each note as it is placed. In fifth species, the discipline must be internal: the rules have been so thoroughly absorbed that they operate automatically, below the level of conscious rule-checking, leaving the composer free to attend to the higher-level questions of melodic shape, rhythmic variety, and expressive trajectory. This internalization is the ultimate goal of the species system — not the production of correct exercises, but the formation of a musical mind that cannot produce forbidden parallels any more than a native speaker can produce ungrammatical sentences.

Remark 3.1 (Ornamental Figures in Fifth Species). The most idiomatic cadential ornament in fifth species is the combination of an anticipation with a suspension. Before the final cadence, the counterpoint voice may introduce an eighth-note anticipation of the resolution tone, producing a brief flicker of the arrival before the full rhythmic weight of the resolution. This decoration — heard everywhere in late-Renaissance and Baroque keyboard music — adds a quality of inevitability to the cadence, as though the voice is reaching ahead of time toward the resting point it knows is coming. The ornamental turn (\(\hat{1}–\hat{2}–\hat{7}–\hat{1}\)) before the final note of a phrase serves a similar function, energizing the approach to the cadence without delaying the arrival.

3.4 Fifth Species: Florid Counterpoint

Fifth species, called florid counterpoint, combines all the devices of the preceding four species into a freely flowing line that moves in mixed note values according to the musical demands of the moment. Fifth species is the culmination of the species method — the point at which the systematic isolation of individual voice-leading techniques yields to their free combination in a texture that approaches actual composition. A well-written fifth-species exercise should sound like music, not like a pedagogical drill.

Definition 3.4 (Fifth Species Counterpoint). Fifth species is a combination of all previous species in a single voice. The counterpoint may move in whole notes, half notes, quarter notes, or brief eighth-note ornaments (particularly before cadences), and may include tied notes functioning as suspensions. The choice of note values is governed by musical judgment rather than fixed rules: suspensions should appear at cadences and at points of harmonic emphasis; quarter-note motion provides the primary melodic substance; half notes create moments of relative repose; and eighth notes appear as ornamental turns, anticipations, or cadential embellishments.

The transition from strict species to free composition is not a relaxation of standards but a change of the kind of discipline required. In the earlier species, the discipline is external: rules are applied one by one to each note as it is placed. In fifth species, the discipline must be internal: the rules have been so thoroughly absorbed that they operate automatically, below the level of conscious rule-checking, leaving the composer free to attend to the higher-level questions of melodic shape, rhythmic variety, and expressive trajectory. This internalization is the ultimate goal of the species system — not the production of correct exercises, but the formation of a musical mind that cannot produce forbidden parallels any more than a native speaker can produce ungrammatical sentences.

Remark 3.3 (Ornamental Figures in Fifth Species). The most idiomatic cadential ornament in fifth species is the combination of a written-out trill and a resolution. Before the final cadence, the counterpoint voice introduces a trill figure — rapid alternation between the leading tone and the note above — that resolves to the tonic, creating a brief moment of rhythmic acceleration before the final arrival. This decoration — heard everywhere in late-Renaissance and Baroque keyboard music — adds a quality of inevitability to the cadence, as though the voice is eagerly anticipating the resting point it knows is coming. The ornamental turn (\(\hat{1}–\hat{2}–\hat{7}–\hat{1}\)) before the final note of a phrase serves a similar function, energizing the approach to the cadence without delaying the arrival.
Example 3.2 (Fifth Species Above a Dorian Cantus Firmus). Suppose the cantus firmus is a Dorian melody on D: D–F–E–D–G–F–A–G–F–E–D. A fifth-species counterpoint above this might proceed as follows: the first bar employs first-species half notes (A–A, a fifth above D and F); the second bar shifts to third-species quarters passing through consonances; the third bar introduces a 7–6 suspension over the cantus' D; the fourth bar uses a consonant skip to move between consonances; and the fifth and sixth bars build a chain of 4–3 suspensions leading into the final cadence, where an eighth-note anticipation of the tonic precedes the full tonic arrival on the last note. The result is a melodic line of considerable variety and expressiveness, each rhythmic gesture chosen for its musical effect rather than for any mechanical alternation of species.

3.5 Fourth Species Continued: The Bass Suspension and the 9–8

The most common suspensions in standard fourth species — 7–6, 4–3, and 9–8 — all involve the upper voice as the suspended voice, holding a note from the previous beat while the lower voice changes. The bass suspension reverses this: the lower voice holds a note from the previous beat while the upper voice changes, creating the interval of a second (or ninth) above the bass on the downbeat, resolving upward by step to a third. This 2–3 figure (the interval names referring to the bass suspension: second moving to third) is less common in strict species counterpoint but widespread in free tonal writing, where it typically arises from the stepwise descent of an inner voice (in four parts) while the bass is held.

Example 3.3 (The 9–8 Suspension and Its Resolution Direction). The 9–8 suspension is the "large" form of the 2–1 (second moving to unison): the suspended note is a ninth above the bass rather than a second, and it resolves downward by step to the octave. Its psychological effect differs subtly from the 7–6 and 4–3: the ninth, as a compound second, has a rich dissonance that is particularly associated with an atmosphere of yearning or longing in the Baroque and Classical styles. The 9–8 suspension resolving to the octave creates a momentary doubling of the bass note at the point of resolution — the two voices arrive at the same pitch class — which gives the resolution a quality of absolute arrival, of the upper voice descending to meet the bass on equal terms. Bach uses the 9–8 suspension extensively in his slow movements, where its quality of lingering dissonance and decisive resolution is especially expressive.

3.7 The Aesthetics of Dissonance Treatment

One of the most revelatory aspects of studying species counterpoint is the light it sheds on the aesthetic function of dissonance in music more broadly. Dissonance, in the species system, is not an evil to be tolerated but a resource to be cultivated — the source of harmonic tension without which the consonances would be inert, lacking the contrast that gives them their sense of arrival and rest. Every master of counterpoint from Palestrina to Bach understood this intuitively: a piece of music that contains no dissonance is a piece without forward motion, without yearning, without the sense of becoming that is the essential quality of music as a temporal art.

Remark 3.2 (Dissonance Density and Style). The frequency and type of dissonance tolerated in a polyphonic texture is one of the primary markers of historical style. Palestrina's style — the model for Fux's species system — uses dissonance with extreme economy: almost exclusively as passing tones and suspensions, on weak beats or prepared and resolved with care, and rarely more than one dissonance per measure in each voice. Bach's free polyphony, by contrast, employs a considerably wider range of dissonance types — unprepared sevenths, appoggiatura, chromatic passing tones, augmented sixth chords — and tolerates dissonance in metrically stronger positions. Schoenberg's twelve-tone counterpoint "emancipates" dissonance entirely, removing the requirement of resolution and treating all intervals as equally available. Each shift represents an expansion of the dissonance vocabulary, but in each case the underlying principle — that dissonance creates tension and must be managed in relation to consonance — remains operative. Even in a twelve-tone texture, the ear seeks resolution; the question is only what counts as resolution.

The treatment of dissonance in fifth species — the free combination of all previous species — serves as a kind of controlled laboratory for developing the student’s sense of how much dissonance, of what types, and in what contexts, produces the optimal aesthetic effect. Too little dissonance, and the music sounds bland; too much, and it becomes incoherent, the ear unable to find its bearings. The expert fifth-species exercise achieves a continuous oscillation between tension and repose that is, in miniature, the same arc that a complete musical movement must trace on a larger scale.

Theorem 3.2 (Hierarchy of Dissonance Intensity). The perceptual intensity of a dissonance in tonal counterpoint is governed by three factors: (1) the intrinsic acoustic roughness of the interval (seconds and sevenths are rougher than fourths and tritones in most orchestral timbres); (2) the metric position of the dissonance (a downbeat dissonance is stronger than a weak-beat dissonance by a factor proportional to its metric weight); and (3) the duration of the dissonance (the longer a dissonance is sustained without resolution, the greater the accumulated tension). The suspension is the most intense common dissonance because it combines metric strength (it falls on the downbeat), moderate duration (a half or full beat), and an acoustically rough interval (typically a seventh, second, or fourth) into a single concentrated moment of tension whose preparation and resolution make its intensity maximally perceptible.

Chapter 4: Two-Part Tonal Counterpoint

4.1 The Harmonic Minor Scale and Its Contrapuntal Implications

The transition from modal counterpoint (Chapter 1) to tonal counterpoint brings with it one of the most important practical complications in Western harmony: the dual nature of the minor mode. The natural minor scale (Aeolian mode) has a subtonic seventh degree (\(\hat{7} = \text{D}\) in E minor) that does not create the half-step leading tone needed for the authentic cadence. The harmonic minor scale — in which the seventh degree is raised by a semitone (\(\hat{7}^\sharp\)) — provides the leading tone but introduces an augmented second between \(\hat{6}\) and \(\hat{7}^\sharp\) (e.g., F to G# in A harmonic minor), an interval prohibited in strict melodic writing.

Definition 4.1 (Melodic Minor Scale). The melodic minor scale resolves the augmented-second problem of the harmonic minor by raising both \(\hat{6}\) and \(\hat{7}\) when ascending (creating a major sixth and major seventh above the tonic) and using the natural forms of \(\hat{6}\) and \(\hat{7}\) (lowered) when descending. The ascending form provides a smooth, stepwise ascent to the leading tone; the descending form avoids the leading tone altogether, using the subtonic and submediant of the natural minor. In tonal counterpoint, this bidirectional inflection must be carefully managed: a voice ascending toward the tonic should use the raised forms, while a voice descending away from the tonic should use the natural forms.

The practical consequence for two-voice tonal counterpoint is that the student must be aware of the directional melodic conventions of minor-mode writing. A soprano that ascends through \(\hat{6}^\natural\) and \(\hat{7}^\natural\) while approaching the tonic will produce a subtonic approach — a whole step to the tonic rather than a half step — which sounds modal and archaic in a tonal context. A soprano that uses the raised \(\hat{7}^\sharp\) in all contexts, even when descending, will produce the augmented second \(\hat{6}–\hat{7}^\sharp\) in downward motion, which is prohibited in melodic writing. These constraints — arising from the dual nature of the minor mode — add a layer of complexity to tonal counterpoint in minor that is absent in major, and they explain why minor-mode fugue subjects, chorale harmonizations, and free counterpoint require particular care in the treatment of the sixth and seventh scale degrees.

4.2 The Figured Bass Tradition and Its Connection to Counterpoint

Parallel to the species counterpoint tradition, the Baroque era developed a practical system of realizing harmony from an abbreviated notation: the figured bass, or basso continuo. In figured bass notation, the bass line is written out in full, and Arabic numerals placed below each bass note indicate the intervals to be realized above it by the keyboard player (harpsichordist or organist). The figure “6” indicates a first-inversion chord (sixth and third above the bass); “6/4” indicates a second-inversion chord; “7” indicates a seventh chord in root position; and so on. The keyboard player realizes the figured bass by filling in the upper voices according to the figured intervals, respecting the voice-leading rules of four-part counterpoint.

Definition 4.2 (Figured Bass Realization). A figured bass realization is a four-part harmonization of a given bass line in which the upper voices are determined by the Arabic numerals (figures) placed beneath each bass note, indicating the intervals to sound above the bass. An unfigured bass note receives a root-position triad (5/3) by default. Figures modify this default: "6" indicates a \(^6_3\) (first-inversion) chord; "\sharp" or "\flat" preceding a figure modifies the indicated interval by the corresponding accidental; and a line through a figure indicates that the indicated interval should be raised by a semitone. Realizing a figured bass requires simultaneous command of the harmonic vocabulary (the chord types implied by each figure) and the voice-leading rules (forbidden parallels, suspension resolution, leading-tone treatment) that govern the movement from chord to chord.

The connection between figured bass realization and species counterpoint is intimate. In both, the primary challenge is the management of independent voices above a given bass line; in both, the rules of consonance, dissonance treatment, and forbidden parallels govern every note-to-note connection. The difference is that figured bass realization requires the addition of three upper voices simultaneously (rather than the single added voice of species counterpoint), and the harmonic vocabulary is considerably richer (the full range of diatonic and chromatic chords, rather than the simple consonance framework of the species). The student who has mastered species counterpoint in two voices and four-part chorale writing has, in effect, mastered the underlying skills of figured bass realization; the additional challenge is merely the speed and fluency required by the practical performance context.

4.3 From Species to Free Counterpoint

The species method is a pedagogical scaffold, not a compositional style. No masterpiece of music consists entirely of strict first-species counterpoint, nor does any great composer work note by note through a rulebook while composing. The point of the species system is to isolate and master individual contrapuntal principles — consonance management, dissonance treatment, voice independence — so that they can be deployed naturally and flexibly in free composition. The transition from species exercises to free tonal counterpoint requires one additional conceptual step: the integration of harmonic progression.

In species counterpoint, the cantus firmus is simply a given melody, and the student adds voices whose notes are individually consonant with it. In free tonal counterpoint, both voices are composed simultaneously, and their interaction is governed not only by the interval-by-interval rules of the species system but also by the harmonic logic of functional tonality. Chords, harmonic rhythm, and the directed motion toward tonal goals shape the counterpoint at a higher structural level, creating a two-tier compositional problem: the interval-by-interval level of voice-leading and the phrase-by-phrase level of harmonic plan.

Definition 4.1 (Tonal Counterpoint). Tonal counterpoint is polyphonic writing in which the melodic and intervallic choices of individual voices are simultaneously governed by the species rules of voice-leading (consonance, dissonance treatment, forbidden parallels) and by the principles of functional harmonic progression (tonic, dominant, subdominant relationships within a key and their chromatic extensions). The harmonic progression provides the large-scale framework; the voice-leading rules govern the note-by-note execution within that framework.

The bass and soprano voices are the primary contrapuntal pair in virtually all tonal music. The bass defines the harmonic root progression and the chord inversions; the soprano provides the melodic identity of the phrase. Together they form a two-voice counterpoint that must be satisfying in itself, even before inner voices are added. Heinrich Schenker made this point central to his analytical theory: the Ursatz (fundamental structure) of a tonal composition is a two-voice framework — a soprano descending stepwise from \(\hat{3}\) (or \(\hat{5}\) or \(\hat{8}\)) to \(\hat{1}\) above a bass that moves from the tonic to the dominant and back — that underlies the entire melodic and harmonic surface. Every chord change, melodic ornament, and rhythmic figure is, in Schenker’s analysis, an elaboration of this primordial two-voice framework.

4.4 Sequence and the Harmonic Logic of Modulation

The harmonic sequence — the repetition of a harmonic or melodic pattern at a different pitch level — is the primary mechanism by which tonal counterpoint traverses harmonic distance and visits secondary key areas. Sequences are not merely convenient devices for filling space between more important harmonic events; they have their own internal logic, deriving their forward momentum from the voice-leading motion of their constituent intervals.

Definition 4.3 (Harmonic Sequence). A harmonic sequence is the repetition of a harmonic progression at a transposed pitch level, typically by a consistent interval (a fifth, a fourth, a third, or a step). In a descending-fifth sequence, each chord root descends by a fifth from the previous root: I–IV–VII–III–VI–II–V–I. In an ascending-fifth sequence, each root ascends by a fifth: I–V–II–VI–III–VII–IV–I. Sequences may be diatonic (remaining within the key throughout) or chromatic (introducing accidentals that tonicize each chord in the sequence), and they may be applied to any number of harmonies.

The descending-fifth sequence is the most common in tonal music because it follows the natural direction of harmonic tension: each chord in the sequence acts briefly as a local dominant to the following chord, so the sequence creates a continuous chain of unresolved dominant resolutions. This chain of temporary dominants produces a sense of harmonic inevitability and forward motion that is the sequence’s most valuable property. In the contrapuntal domain, the descending-fifth sequence typically appears as a chain of 7–6 suspensions in two voices: the soprano holds a suspended seventh that resolves to a sixth, immediately becoming the preparation for the next suspension. The sequential chain of suspensions is thus a fourth-species pattern elevated to the harmonic level, operating across multiple chord changes rather than a single measure.

Example 4.3 (Descending-Fifth Sequence in Two Voices). In G major, a descending-fifth sequence might proceed: G major (I) — C major (IV) — F# diminished (VII) — B minor (III) — E minor (VI) — A minor (II) — D major (V) — G major (I). In two-voice counterpoint, this sequence can be rendered as a chain of suspended seventh-sixth figures in the soprano against a descending bass: soprano G–F#–E–D, bass G–E–C–D–G (the bass skips to complete the chord roots while the soprano descends by step through suspensions). The result is a compact, powerful two-voice sequence that traverses the entire harmonic cycle of G major in eight beats, modulating through every diatonic harmony before arriving at the tonic with a decisive authentic cadence.

4.5 Figuration and Ornamentation

Once the two-voice harmonic skeleton has been established — a succession of consonant intervals governed by a coherent harmonic progression — the art of figuration takes over. Figuration is the process of animating the harmonic skeleton with the devices of the species system: passing tones, suspensions, neighbor tones, and the consonant skips that give each voice its melodic character. In free composition, figuration is not optional decoration but an essential part of musical expression; the same harmonic skeleton can be figurated in an infinite variety of ways, each producing a distinctive melodic and rhythmic character.

Definition 4.2 (Figuration). Figuration is the elaboration of a structural harmonic interval or note by the introduction of non-harmonic tones — passing tones, neighbor tones, anticipations, suspensions, escape tones, and neighboring chord tones — into the melodic surface of one or more voices. Figuration animates the harmonic rhythm by filling in the time between structural chords with melodic motion. The resulting surface melody may be considerably more complex than the underlying harmonic skeleton, yet it can always (in principle) be reduced to that skeleton by the removal of figuration.

The Baroque technique of diminution — the replacement of a long structural note with several shorter notes that elaborate the same harmonic point — is the basic tool of figuration. A structural half note C supported by a G in the bass might be figured as C–D–C (upper neighbor), C–B–C (lower neighbor and return), C–B–A–G (descending passing tones through a third), or C (sixteenth)–B (anticipation)–C (return on the beat). In each case the harmonic fact — C5 against G3, a tenth — remains constant while the melodic surface is elaborated into a characteristic rhythmic and melodic gesture. Bach’s keyboard suites are inexhaustible libraries of figurative technique; every prelude, every allemande, every sarabande demonstrates a different approach to the animation of a harmonic skeleton.

4.6 The Contrapuntal Cadence

The cadence in free tonal counterpoint is more elaborate than in strict species, but it remains grounded in the same two-voice logic. The clausula vera — the authentic cadence — involves a soprano leading tone ascending to the tonic octave while the bass descends by a fourth (or ascends by a fifth) from dominant to tonic. This two-voice cadential formula is the harmonic prototype from which all cadential elaboration proceeds; every perfect authentic cadence in common-practice harmony is a elaboration of this formula.

Theorem 4.1 (Types of Tonal Cadence). The principal cadential types in tonal counterpoint are:
  • Perfect authentic cadence (PAC): bass moves \(\hat{5}–\hat{1}\) (root position dominant to root position tonic), soprano arrives on \(\hat{1}\). The strongest form of formal closure.
  • Imperfect authentic cadence (IAC): dominant moves to tonic, but one or both chords is inverted, or the soprano ends on \(\hat{3}\) or \(\hat{5}\) rather than \(\hat{1}\). Weaker closure than the PAC.
  • Half cadence: ends on the dominant, typically with soprano motion to \(\hat{2}\), \(\hat{4}\), or \(\hat{5}\). Creates a moment of structural suspension demanding continuation in a new phrase.
  • Deceptive cadence: bass proceeds \(\hat{5}–\hat{6}\) (dominant to submediant) rather than to the expected tonic, while the soprano may arrive on \(\hat{1}\). The harmonic surprise creates a moment of withheld resolution, typically followed immediately by a repetition or elaboration of the cadential gesture.
  • Plagal cadence: bass moves \(\hat{4}–\hat{1}\) (subdominant to tonic). More commonly heard as a post-cadential extension (the "Amen" cadence) than as a primary structural arrival.

4.7 Creating Musical Tension and Release

The overarching goal of contrapuntal writing — and, indeed, of all tonal music — is the management of tension and release. Tension is created by dissonance, by melodic ascent, by harmonic distance from the tonic, by registral expansion, and by rhythmic density. Release is created by resolution of dissonance, by melodic descent, by harmonic return to the tonic, by registral contraction, and by rhythmic relaxation. A great contrapuntal composition creates a long-range trajectory of tension and release, with local fluctuations nested within a global arc of departure from and return to the tonic.

Remark 4.1 (The Role of the Tritone in Tonal Tension). The tritone — the interval \(\sqrt{2}:1\) in equal temperament — is the primary vehicle of harmonic tension in tonal music. The tritone formed between the leading tone (\(\hat{7}\)) and the subdominant (\(\hat{4}\)) in the dominant seventh chord is the engine of the dominant-to-tonic progression. When \(\hat{7}\) resolves upward by semitone to \(\hat{8}\) and \(\hat{4}\) resolves downward by step to \(\hat{3}\), the tritone resolves to a major third or a minor sixth — imperfect consonances that feel like genuine arrival after the acoustic restlessness of the tritone. Understanding the tritone as a directional vector always pointing toward the tonic, and understanding the dominant seventh chord as the harmonic crystallization of this vector, is one of the deepest insights in tonal counterpoint.

The two-voice invention, as exemplified by Bach’s fifteen Two-Part Inventions (BWV 772–786), demonstrates the management of tension and release with extraordinary economy. Each invention is built from a single motivic idea, typically a compact figure of two to four beats, that is stated, imitated, inverted, and sequenced through a carefully planned tonal journey: from the tonic to the dominant (or relative major or minor), through various intermediate key areas, and back to the tonic with a final cadence of decisive weight. The contrapuntal techniques of imitation, inversion, augmentation, and stretto provide the means of this development; the harmonic plan provides the large-scale architecture that gives the development direction and purpose. What makes Bach’s inventions so powerful despite their brevity is the absolute coherence of these two levels — the local voice-leading is impeccable, and the global harmonic plan is architecturally clear — working together to produce a sense of organic inevitability that belies the music’s apparent simplicity.

4.8 Bach’s Two-Part Inventions as Compositional Models

Bach’s fifteen Two-Part Inventions (BWV 772–786), composed in their final form around 1723 for the musical education of his son Wilhelm Friedemann, are the most widely studied examples of free two-voice tonal counterpoint in the repertoire, and for good reason. Each invention is a complete compositional study in a single contrapuntal technique or stylistic gesture, rendered with absolute economy and clarity. They serve the same pedagogical function in free counterpoint that the species exercises serve in strict counterpoint: they isolate a single principle and demonstrate it with such transparency that the principle becomes unmistakable.

Example 4.1 (Invention No. 1 in C Major, BWV 772: Invertible Counterpoint). The C major invention opens with a single-voice statement of the "subject" in the right hand — a flowing, sequential figure that descends from C to G — followed immediately by the answer in the left hand one octave lower. After two measures, both hands have stated the opening figure, and the piece proceeds through a series of imitative exchanges, sequences, and registral alternations that constitute the invention's development. At the midpoint of the piece, the materials of the right and left hands are exchanged: what was the right-hand figure now appears in the left hand, and vice versa. The inversion works without parallel fifths or octaves because the original two-measure combination was composed to be invertible at the octave. The invention thus demonstrates invertible counterpoint at the octave in its most compressed and transparent form: a single combination, stated in its original form, developed, and then stated in its inversion.
Example 4.2 (Invention No. 4 in D Minor, BWV 775: Sequential Episode Writing). The D minor invention is dominated by its episode material — a sequential figure in parallel tenths that traverses harmonic distance rapidly and smoothly. The outer voices move in parallel tenths (soprano and bass always a tenth apart), while the inner-voice figuration provides harmonic filling. This texture — parallel tenths in the outer voices — is one of the most idiomatic devices of Baroque two-voice writing; it produces a rich, full sound from only two voices by exploiting the acoustically warm quality of the tenth (an octave plus a third) and the contrary motion that parallel tenths often imply (if the soprano rises, the bass typically falls or stays, creating the parallel tenth in the other direction). The D minor invention is a masterclass in using parallel tenths as a structural device throughout an entire piece.

The student of free tonal counterpoint is advised to analyze Bach’s inventions at multiple levels simultaneously. At the surface level, one should identify the voice-leading: which notes are passing tones, which are suspensions, which are consonant skips? At the motivic level, one should identify the principal motive and trace its transformations — where it is stated literally, where it is inverted, where it appears in diminution or augmentation. At the harmonic level, one should map the key areas visited: where does the invention leave the tonic, what keys does it pass through, and how does it return? At the formal level, one should identify the sections — exposition, development, recapitulation or final cadence — and understand how they relate to each other. Only by analyzing at all four levels simultaneously can one begin to understand how the invention achieves its characteristic effect of spontaneous inevitability.


Chapter 5: Three- and Four-Part Counterpoint

5.1 Adding Inner Voices

Two-part counterpoint establishes the fundamental harmonic and melodic skeleton of a polyphonic texture. The outer voices — soprano and bass — carry the harmonic argument and define the two-voice counterpoint that underlies the entire texture. Inner voices — alto and tenor — fill in the harmony, complete the chords, and add registral warmth, but they are, in a structural sense, secondary to the outer-voice dialogue. This hierarchy is not merely theoretical; it is audible. In any four-part texture, the soprano and bass project more clearly than the inner voices: the soprano occupies the highest register, commanding attention by its acoustical prominence; the bass occupies the lowest, providing the harmonic foundation that the upper voices define themselves against. The alto and tenor fill the middle ground, their lines important for smooth voice-leading and complete harmony but rarely the primary bearers of melodic interest. Three- and four-part writing fills in this skeleton with additional voices that complete the harmony, add registral depth, and create a richer timbral environment. The addition of inner voices is not a mechanical harmonic task — the insertion of whatever pitch happens to complete the chord — but a contrapuntal challenge requiring that each added voice maintain its own melodic integrity, follow the rules of voice-leading, and avoid creating forbidden parallels not only with the outer voices but also with each other.

Definition 5.1 (Four-Part Chorale Texture). The four-part chorale texture consists of four voices (soprano, alto, tenor, bass) moving largely in homophonic rhythm, each assigned to a distinct staff and vocal register. The genre derives from the Lutheran chorale harmonizations of the Baroque era, of which Bach's approximately 370 surviving examples are the canonical models of the style. In four-part chorale texture, each beat typically presents a complete chord in four voices, and the voice-leading between consecutive chords is governed by the rules of tonal counterpoint applied simultaneously to all pairs of voices — a combinatorial challenge of considerable difficulty.

The challenge of four-part writing is combinatorial: where two-part counterpoint requires managing one pair of voices, four-part writing requires managing six pairs simultaneously (soprano–alto, soprano–tenor, soprano–bass, alto–tenor, alto–bass, tenor–bass). Forbidden parallels between any pair of these six combinations are forbidden; voice crossing and overlapping, while technically possible, introduce registral confusion that undermines the clean identification of voices. This apparent complexity is manageable in practice because many of the constraints reinforce each other: good outer-voice counterpoint (soprano and bass) tends to create conditions that simplify inner-voice management, since the harmonic framework established by the outer voices constrains the choices available to the inner voices.

A practical strategy for composing four-part counterpoint is to proceed in two stages: first, compose the outer-voice counterpoint (soprano and bass) as a complete two-voice composition in the style of the species exercises, ensuring that it forms a satisfying and correct two-voice piece in its own right. Then add the inner voices (alto and tenor) to fill in the harmony, treating each inner voice as a third-species counterpoint above the bass and checking each addition against the soprano and against the other inner voice. This two-stage approach — outer voices first, inner voices as filling — reflects the hierarchical structure of the texture and is how many composers (including Bach, by all accounts) actually worked. The outer-voice composition is the structural act; the inner-voice completion is the harmonic elaboration.

Example 5.3 (Outer-Voice-First Composition). To harmonize the chorale phrase F–E–D–C in the soprano (in C major), one first finds a satisfying bass line: a possible bass is C–C–G–C (root-position tonic, repeat, root-position dominant, root-position tonic), producing the outer-voice intervals: perfect fifth (F above C), major third (E above C), perfect fifth (D above G), perfect octave (C above C). The outer voices form a correct, if simple, two-voice counterpoint: all intervals consonant, final interval an octave approached by contrary motion (soprano descends, bass holds or descends). Adding the inner voices: the alto might proceed A–G–F–E (descending thirds and steps), and the tenor C–B–B–G (passing motion through the implied harmonies). Checking all six pairs: SA (fifth/third/third/sixth — all consonant), ST (third/second — second is dissonant! Alto A against tenor B is a major second on beat 2 — we must reconsider the tenor). Revise tenor: C–C–B–E. Now ST gives: fifth, third, third, sixth — all consonant. AB: eighth, fifth, fourth, third. AT: unison, second (C against B — still a problem). Further revision required. The exercise demonstrates that four-part writing demands the simultaneous management of all six voice pairs and that the first attempt is rarely correct.

5.2 Chord Spacing and Doubling Rules

The physical distribution of chord tones across the four voices — chord spacing — profoundly affects the sonority of the texture. The principles of spacing derive from the acoustics of the overtone series: lower intervals should be wider to avoid the acoustic muddiness created by closely spaced bass tones (whose overtones clash more severely at short intervals), while higher intervals may be narrower because the upper partials of high-register tones are weaker and less likely to produce perceived roughness.

Definition 5.2 (Open and Close Position). A four-part chord is in close position if the upper three voices (soprano, alto, tenor) are arranged within the span of an octave. It is in open position if the upper three voices span more than an octave — that is, if the alto is more than an octave below the soprano and the tenor more than an octave below the alto. Both positions are idiomatic in four-part writing; the choice of position affects texture density, registral warmth, and the ease of smooth voice-leading in the particular context.
Theorem 5.1 (Doubling Rules in Four-Part Harmony). When a triad is presented in four parts, one chord tone must be doubled. The hierarchy of preferred doublings is:
  1. Double the root: the most stable and acoustically grounded choice.
  2. Double the fifth: permissible and common when the root doubling produces awkward voice-leading or forbidden parallels.
  3. Double the third: generally avoided, though sometimes necessary; in particular, the third of a major chord in a bright register can be doubled without ill effect.
  4. Never double the leading tone: both voices would need to resolve upward by semitone to the tonic, producing parallel octaves on the most structurally important moment of the phrase.

The prohibition on doubling the leading tone is a direct consequence of the parallel-octave prohibition, applied to the specific harmonic context of the dominant chord resolving to the tonic. If the leading tone (\(\hat{7}\)) appears in two voices, both must resolve upward by semitone to the tonic (\(\hat{8}\)); the resulting parallel octaves on the cadence — the most emphatic moment of the phrase — would be a catastrophic failure of voice independence at the one moment when independence matters most. The rule against leading-tone doubling is therefore not a freestanding prohibition but a specific application of the most fundamental principle of counterpoint.

5.3 Voice Crossing and Overlapping

Voice crossing occurs when a lower voice temporarily rises above a higher voice, or vice versa. Voice overlapping occurs when a voice moves to a pitch beyond the previous position of an adjacent voice, even without literally crossing in the current chord. Both practices disrupt the registral hierarchy that gives each voice its distinct identity, and both are prohibited in strict four-part writing.

Definition 5.3 (Voice Crossing and Overlapping). Voice crossing occurs when, at a given moment in a four-part texture, a lower voice sounds a pitch higher than the immediately higher voice. Voice overlapping occurs when a voice moves to a pitch that exceeds the most recent pitch of an adjacent voice in the same direction: if the soprano was most recently on D4 and the alto now moves to E4 (above the soprano's last heard note), the alto has overlapped the soprano, creating momentary ambiguity about which is the "upper" voice.
Remark 5.1 (Perceptual Rationale for Crossing and Overlapping Rules). The prohibitions on voice crossing and overlapping are fundamentally matters of auditory stream segregation. Research by Albert Bregman and others has shown that the auditory system segregates simultaneous sounds into distinct perceptual streams primarily on the basis of frequency proximity and continuity: we expect the "soprano stream" to remain in the upper register from note to note, and the "alto stream" to remain below it. When the alto suddenly moves above the soprano's last pitch, the auditory system momentarily loses track of which stream is which. In a slow, clearly articulated four-part texture, such momentary confusion is merely confusing; in a fast, dense texture, it can render the polyphony perceptually incoherent.

5.4 The Six-Four Chord and Its Contrapuntal Treatment

The second-inversion triad — the chord in which the fifth of the chord appears in the bass — presents a special voice-leading challenge in four-part writing that has no exact equivalent in two-voice counterpoint. Because the bass note is the fifth of the triad rather than the root, the chord root appears as a fourth above the bass; in the two-voice context of Definition 1.5, we noted that the fourth above the bass functions as a dissonance. In four-part writing, the second-inversion triad — the \(^6_4\) chord — is treated as an embellishing dissonance that requires both preparation and resolution.

Definition 5.5 (The Second-Inversion Triad and Its Uses). A second-inversion triad (\(^6_4\) chord) places the fifth of the triad in the bass, creating a fourth (or eleventh) between the bass and the root, and a sixth (or thirteenth) between the bass and the third. The three standard uses of the \(^6_4\) chord in tonal counterpoint are:
  • Cadential \(^6_4\): the dominant \(^6_4\) chord (e.g., \(\text{G–C–E}\) above a G bass in C major), which appears on a strong beat immediately before the dominant triad at a cadence. The fourth (C) and sixth (E) above the bass resolve downward by step to the third (B) and fifth (D) of the dominant, creating a double suspension over the dominant bass note.
  • Passing \(^6_4\): a \(^6_4\) chord that arises from the passing motion of the bass through the fifth of a chord, connecting two positions of the same harmony. It appears on a weak beat and resolves by the continuation of the bass through passing motion.
  • Pedal \(^6_4\): a \(^6_4\) chord that arises from the motion of upper voices above a sustained or repeated bass note. The bass is the structural note; the upper voices move to and from the \(^6_4\) position as a neighbor-tone embellishment of the simpler harmony.

The cadential \(^6_4\) is by far the most important of these three uses and is one of the most powerful harmonic devices in tonal music. Its power derives from the same principle as the suspension: a strong-beat dissonance (the fourth and sixth above the dominant bass) is sustained momentarily before resolving downward to a consonance (the third and fifth of the dominant triad). The cadential \(^6_4\) is, in effect, a double suspension over the dominant, and its resolution produces the sense of harmonic arrival at the dominant that makes the subsequent motion to the tonic all the more decisive. Bach uses the cadential \(^6_4\) at virtually every formal cadence of significance in his chorale harmonizations, as does virtually every tonal composer from the Baroque through the late Romantic period.

5.5 Bach Chorales as Models

Bach’s chorale harmonizations are the supreme models of four-part contrapuntal writing, and every serious student of harmony and counterpoint must study them with the closest possible attention. In these settings — of Lutheran chorales originally composed in the sixteenth and seventeenth centuries — Bach transforms simple, often modal tunes into extraordinarily rich harmonic tapestries, using the full range of chromatic harmony, second-inversion chords, applied dominants, borrowed chords, and melodic figuration, all while maintaining impeccable four-part voice-leading throughout. A Bach chorale is simultaneously a masterpiece of harmonic imagination and a flawless example of four-part voice-leading technique, and the student can learn from it at both levels.

Example 5.1 (Analyzing a Bach Chorale: "Herzlich tut mich verlangen," BWV 244/25). In this famous chorale, which appears multiple times in the St. Matthew Passion, Bach harmonizes a melody of particular beauty and restraint. Analyzing the bass line alone reveals a masterpiece of contrapuntal bass writing: the bass moves primarily by step and by consonant skips (thirds and fourths), avoids parallel fifths and octaves with the soprano at every point, supports the harmonic rhythm with decisive root-position chords at cadences and lighter first-inversion chords at non-cadential moments, and forms with the soprano a two-voice counterpoint of considerable beauty in its own right. The inner voices fill in the harmony with minimal melodic interest — appropriately, since the soprano carries the chorale melody and must not be obscured — but each inner voice still maintains a coherent step-wise melodic motion that avoids forbidden parallels with every other voice.

A useful analytical exercise is to reduce a Bach chorale to its outer-voice counterpoint alone — stripping away alto and tenor — and to analyze the resulting soprano-bass dialogue as a two-voice contrapuntal composition. Invariably, the outer voices form a satisfying and coherent two-voice counterpoint; the inner voices serve as harmonic amplification and textural enrichment rather than as structural necessities. This reduction reveals the two-voice skeleton that Bach composed first (or at least conceived first), the framework on which the full four-voice texture is elaborated.

5.6 Non-Harmonic Tones in Four-Part Texture

The species rules for dissonance treatment — passing tones, neighbor tones, suspensions — apply with equal force in four-part writing, but the presence of additional voices creates new possibilities for dissonance that have no equivalent in two-voice writing. In particular, the pedal point — a sustained bass note over which the upper voices move through harmonic progressions that create dissonances against the held note — is exclusively a four-part (or more) device; in two-voice writing, a sustained bass note simply maintains a fixed harmonic relationship with the moving upper voice, producing whatever consonances or passing dissonances the species rules permit. With four voices, the held bass note may become dissonant against the upper-voice harmony even while those upper voices form consonances among themselves, creating a three-layer texture of considerable richness.

Definition 5.4 (Pedal Point). A pedal point (or organ point) is a sustained note in one voice — almost always the bass — over which the other voices move through harmonic progressions that include dissonances against the sustained note. The pedal note is typically the tonic or dominant scale degree. A tonic pedal confirms and sustains the sense of the home key during episodes of harmonic complexity in the upper voices. A dominant pedal creates an arch of tension in which the dominant harmony is sustained beneath increasingly dissonant upper-voice progressions until the tonic resolution, at which point the pedal note becomes again the root of the tonic chord.

The dominant pedal — sustained dominant in the bass beneath chromatic or dissonant upper-voice progressions — is one of the most powerful tension-building devices in all of tonal music. Its power derives from a paradox: the bass, the register most strongly associated with harmonic foundation, is sustaining the “wrong” note — the dominant rather than the tonic — while the upper voices are free to roam harmonically. The ear registers this as an unresolved tension of extraordinary duration, and when the resolution arrives — the dominant pedal note finally resolving upward to the tonic (or the upper voices settling onto the tonic triad above the sustained dominant pedal, creating a final dissonance before the resolution) — the effect is one of accumulated pressure finally released.

Example 5.2 (Tonic Pedal in Bach's Prelude in C Major, WTC Book I, BWV 846). The Prelude in C Major — the piece that opens the entire Well-Tempered Clavier and perhaps the most famous keyboard prelude ever written — concludes with an extended tonic pedal: the bass sustains C for the final seven measures, while the upper voices move through a V\(^7\) chord (with the dominant seventh G–B–D–F sounding above the sustained C), then resolve to the tonic triad (C–E–G). The dominant seventh above the tonic pedal creates a powerful dissonance: the F and B in the upper voices form a tritone against each other, and F forms a tritone against B in the bass — wait, no, C is in the bass, and F is a fourth against C (which is dissonant above the bass in this context), while B is a major seventh above C. The accumulated dissonance of the dominant seventh chord sounding above the sustained tonic pedal is then resolved by the step of the upper voices to the final tonic chord. This preparation — dominant seventh above tonic pedal, resolving to tonic — is Bach's supreme use of the pedal-point technique and a direct demonstration of how sustained dissonance, properly managed, creates a moment of extraordinary harmonic weight.

5.7 Chromatic Voice-Leading in Four-Part Writing

The full resources of tonal harmony include not only diatonic chords but chromatic harmonies — applied dominants (secondary dominants), Neapolitan chords, augmented sixth chords, borrowed chords from the parallel major or minor, and the full range of chromatic passing tones and neighbor tones. Incorporating chromatic harmonies into four-part counterpoint requires particular care, because chromatic notes create additional voice-leading obligations: the raised note of an applied dominant must resolve upward (as a temporary leading tone), the lowered note of a Neapolitan must resolve downward, and the augmented sixth of an augmented-sixth chord must resolve outward by semitone in both directions simultaneously.

Definition 5.6 (Applied Dominant). An applied dominant (or secondary dominant) is a dominant-function chord built on a scale degree other than \(\hat{5}\) — specifically, a major triad or dominant seventh chord built on the note a fifth above any diatonic chord other than the diminished seventh on \(\hat{7}\). The applied dominant temporarily tonicizes its target chord, treating it as a momentary local tonic. The raised third of the applied dominant functions as a temporary leading tone and must resolve upward by semitone to the target chord's root.

The voice-leading of applied dominants in four-part chorale writing requires care at two points: the approach to the applied dominant (the chromatic note must be introduced without creating a forbidden augmented second in any voice) and the resolution (the temporary leading tone must resolve upward, creating the risk of parallel octaves if the bass also moves to the same pitch class). Bach navigates these issues in his chorale harmonizations with characteristic elegance: he often introduces chromatic notes as passing tones or neighbor tones, approached by half step from a diatonic pitch, so that the chromaticism feels like melodic figuration rather than a harmonic shock.

Remark 5.2 (The Augmented Sixth Chord in Counterpoint). The augmented sixth chord — Italian, French, or German — is one of the most expressive harmonic devices in tonal music. Its characteristic interval, the augmented sixth between \(\flat\hat{6}\) and \(\sharp\hat{4}\) (e.g., Ab and F# in C major), is a doubly-directed vector: the lower note resolves downward by semitone and the upper note resolves upward by semitone, both arriving simultaneously on the dominant pitch. In four-part writing, the German augmented sixth chord (which includes an additional perfect fifth above \(\flat\hat{6}\), creating an interval that sounds enharmonically like a dominant seventh chord) requires special handling to avoid parallel octaves in the resolution, since the perfect fifth above \(\flat\hat{6}\) resolves to an octave on the dominant while the augmented sixth resolves to a unison on the dominant — producing parallel octaves unless the resolution is carefully managed by placing the German sixth in a specific voicing.

5.9 The Complete Chorale Texture: Harmonic Rhythm and Voice-Leading

In Bach’s chorale harmonizations, every note of the soprano chorale melody is harmonized by a complete four-part chord, and the harmonic rhythm — the rate at which the underlying harmony changes — varies from one chord per beat to one chord per half-measure or even one chord per measure, depending on the character of the phrase. Controlling the harmonic rhythm is one of the most important skills in four-part writing; a uniformly fast harmonic rhythm produces relentless activity without repose, while a uniformly slow one produces stasis without forward motion. The art lies in varying the harmonic rhythm to match the emotional and formal demands of the phrase.

Theorem 5.2 (Harmonic Rhythm and Phrase Structure). In four-part tonal counterpoint, harmonic rhythm should generally:
  • Slow down (fewer chord changes per unit time) at the approach to a cadence, to create a sense of gathering and arrival.
  • Speed up (more chord changes) in the middle of a phrase, to create forward momentum and harmonic interest.
  • Begin at a moderate pace to establish the metric and harmonic framework before any accelerations or decelerations.
  • Avoid changing harmony on every single beat for extended passages, as uniform harmonic rhythm eliminates the metric hierarchy that gives strong and weak beats their perceptual distinction.

Bach’s skill in harmonic rhythm is perhaps most evident in his chorale harmonizations of irregular phrase lengths. A phrase of five or seven beats — irregular because it falls outside the expected four-beat norm — demands particularly careful harmonic rhythm management: if the harmony changes too predictably, the irregularity of the phrase length is smoothed over; if the harmony changes too irregularly, the phrase loses its sense of direction. Bach invariably finds the path that honors the phrase length’s individuality while maintaining the sense of purposeful forward motion that makes each chorale a complete and satisfying musical statement.


Chapter 6: Invertible Counterpoint and Canon

6.1 Invertible Counterpoint at the Octave

Invertible counterpoint is the technique of composing two or more voices such that they can be exchanged — the upper voice placed below the lower — without producing forbidden parallels or other voice-leading errors. A passage of invertibly written counterpoint is a far more economical compositional resource than one that exists in only one configuration: the composer gains a second (or more) valid version of the material at no additional compositional cost. Bach exploited invertible counterpoint with extraordinary thoroughness; the two-part inventions are studies in the technique, and the fugues of the Well-Tempered Clavier deploy it at every structural level.

Definition 6.1 (Invertible Counterpoint at the Octave). Two voices, \(v_1\) (upper) and \(v_2\) (lower), form counterpoint invertible at the octave if the version with \(v_1\) above \(v_2\) and the version with \(v_2\) transposed an octave above \(v_1\) (or \(v_1\) transposed an octave below \(v_2\)) are both valid two-voice counterpoints. Under inversion at the octave, diatonic interval numbers transform according to \(i \mapsto 9 - i\), so that: unison (1) becomes octave (8); second (2) becomes seventh (7); third (3) becomes sixth (6); fourth (4) becomes fifth (5); fifth (5) becomes fourth (4); sixth (6) becomes third (3); seventh (7) becomes second (2); octave (8) becomes unison (1).

The interval transformation under octave inversion has a critical consequence: the perfect fifth (interval 5) inverts to a perfect fourth (interval 4). In two-voice counterpoint, the perfect fourth above the bass is treated as a dissonance requiring resolution when it appears on a strong beat without the support of additional voices. This means that in a passage intended to be invertible at the octave, each perfect fifth must be voice-led in a way that would make the resulting fourth workable in the inversion. In practice this means avoiding strong-beat fifths that would become inadmissible strong-beat fourths, and being particularly careful with parallel fifths, which would invert to parallel fourths — not technically forbidden between upper voices, but acoustically troubling.

Theorem 6.1 (Constraints on Counterpoint Invertible at the Octave). In a two-voice passage invertible at the octave, the following interval-specific constraints apply:
  • Avoid consecutive perfect fifths in the original version (they become consecutive fourths in the inversion).
  • The perfect fifth on a strong beat in the original becomes a perfect fourth on a strong beat in the inversion — use with caution; ensure the fourth is approached and quitted in a manner that would be acceptable as a dissonance treatment.
  • All other consonances invert to consonances, and all standard dissonance treatments (passing tones, neighbor tones, suspensions) remain valid after inversion, since the resolution direction (downward by step) is preserved under octave inversion.

6.2 Practical Composition of Invertibly Counterpointed Passages

The practical discipline of composing a passage of invertible counterpoint at the octave is best approached in two stages. First, compose the upper voice (dux) as a complete, melodically satisfying line that observes all the rules of the given species. Then, before writing the lower voice (comes), mentally transpose the upper voice down an octave and determine what that transposition will become in the inverted version: this is the voice that will appear in the upper register when the passage is inverted. Now compose the lower voice so that it forms correct counterpoint with the original upper voice and simultaneously, when it is displaced to the upper octave, forms correct counterpoint with the upper voice displaced to the lower octave.

Example 6.2 (Constructing Invertible Counterpoint at the Octave). Suppose the upper voice is G–A–F–G (a short melodic figure in C major). To write the lower voice invertibly, we note that in the inversion the lower voice will appear an octave higher, and the upper voice an octave lower, so what is now the lower voice becomes the top voice and vice versa. The interval between voice 1 and voice 2 in the original, when inverted, becomes \(9 - i\): if the original interval is a third (3), the inverted interval is a sixth (6); if the original is a sixth (6), the inverted is a third (3); if the original is a fifth (5), the inverted is a fourth (4). We must therefore avoid using the interval 5 in the original if we cannot tolerate a strong-beat fourth in the inversion. Testing the lower voice C–C–D–C against the upper voice G–A–F–G: intervals are fifth (5), minor seventh (7), minor third (3), fifth (5). The fifths will invert to fourths — problematic on strong beats. Better: lower voice E–F–D–E, giving intervals major third (3), minor third (3), minor third (3), minor third (3). All thirds invert to sixths — entirely consonant. The result passes the invertibility test.

6.3 Invertible Counterpoint at the Tenth and Twelfth

While invertible counterpoint at the octave is by far the most common type, the Renaissance and Baroque theorists also recognized and employed inversion at the tenth (producing an exchange of voices with a displacement of a tenth) and at the twelfth (displacement by a twelfth). These rarer inversions impose more severe constraints and are correspondingly more difficult to compose.

Definition 6.2 (Invertible Counterpoint at the Tenth and Twelfth). Counterpoint is invertible at the tenth if the voices can be displaced by a tenth (= octave + third) without producing forbidden errors. Under inversion at the tenth, intervals transform as \(i \mapsto 11 - i\): thirds become sixths, fourths become sevenths (dissonant in the inversion!), fifths become sixths, sixths become thirds. Counterpoint is invertible at the twelfth if displacement by a twelfth (= octave + fifth) is possible; under this inversion, \(i \mapsto 13 - i\): thirds become tenths (= thirds in the next octave), fourths become ninths, fifths become octaves, sixths become sevenths. Inversion at the twelfth is particularly restrictive because sixths (common and desirable) become sevenths (dissonant), forbidding their free use.

Beethoven’s use of counterpoint invertible at the twelfth in his late string quartets (particularly Op. 131 and Op. 132) produces textures of extraordinary complexity: the same motivic combination sounds in entirely different registral configurations in each of its appearances, always valid contrapuntally, yet always surprising in its effect. The intellectual discipline required to compose such passages — to envision the material in all its inversional configurations simultaneously — is among the most demanding in musical composition.

6.4 The Double Countersubject and Multiple-Countersubject Fugues

Some fugues employ not one but two regular countersubjects, each invertible with the subject and with each other. This produces a fugue with three mutually invertible elements — subject, countersubject 1, and countersubject 2 — permitting all six possible orderings of the three voices. Such a fugue is said to employ triple counterpoint (in the sense of three mutually invertible melodic elements, not to be confused with three simultaneous voices). The formal richness available to a composer who has constructed three mutually invertible elements is enormous: each of the six orderings produces a new harmonic and registral configuration of the same motivic material, and the middle section of the fugue can deploy these orderings in varied succession to create a sense of constantly renewed discovery.

Theorem 6.3 (Triple Invertible Counterpoint in Fugue). Let \(S\) be the subject, \(CS_1\) and \(CS_2\) be two regular countersubjects, all three mutually invertible at the octave. The six permutations of \(\{S, CS_1, CS_2\}\) in three registral positions (top, middle, bottom) are all permissible provided:
  • Every pair of the three elements is individually invertible at the octave (no forbidden parallels in any inversion of any pair).
  • The combination of all three simultaneously produces no forbidden parallels in any of the six ordering permutations.
Constructing three mutually invertible elements is considerably more difficult than constructing a single invertible pair, because the constraints are now applied to three pairs simultaneously: \((S, CS_1)\), \((S, CS_2)\), and \((CS_1, CS_2)\) must all satisfy the octave-inversion constraints.

Bach’s B-flat minor fugue from WTC Book I (BWV 891) is frequently cited as an example of a fugue with two regular countersubjects, both invertibly counterpointed with the subject and with each other. The exposition presents each of the three voices in turn, each new entry accompanied by the two previously stated elements in their various permutations. The economy of material — three elements generating an entire fugue’s worth of motivic content — and the formal elegance of the systematic permutation of orderings make this type of fugue a particularly satisfying demonstration of contrapuntal craft.

6.5 Invertible Counterpoint in the Well-Tempered Clavier

Bach’s systematic exploitation of invertible counterpoint in the WTC provides the clearest available demonstration of how this technique shapes the large-scale formal plan of a fugue. In a typical Bach fugue with a regular countersubject, the entry pattern follows a rotation: subject in voice A accompanied by countersubject in voice B; then subject in voice B accompanied by countersubject in voice A (the exchange of positions, possible only because the two are invertibly counterpointed). A fugue with n voices can, in principle, present the subject n times in the exposition with n different registral assignments of subject and countersubject — each permutation being a new formal event, a new combination of familiar material. In a three-voice fugue with one regular countersubject, three entries of the subject yield three permutations: (S above CS), (CS above S in middle voice, with free counterpoint in third voice), and (S in bass with CS above). Not all three permutations need be used in the exposition, but their availability shapes the formal choices available in the middle section and the return.

Example 6.3 (Invertible Counterpoint Across the Expositions of WTC Book I, No. 6 in D Minor, BWV 851). In the D minor fugue (BWV 851), Bach uses the two-voice exposition to present subject and countersubject in a single permutation: the subject in the upper voice, the countersubject (a steady quarter-note descent) in the lower voice. In the recapitulation (final entry), the positions are exchanged: the subject appears in the lower voice while the countersubject appears above. The exchange is possible because the original combination was composed invertibly at the octave. The effect of this exchange — familiar material heard in a new registral configuration — contributes powerfully to the sense of arrival and resolution at the final entry, because the subject now sounds in the resonant low register of the bass while the countersubject, previously its harmonic foundation, floats above it like a recollection. Invertible counterpoint, in this instance, is not merely a technical device but a formal and expressive strategy.

6.6 Canon: Definition and Construction

A canon is a polyphonic composition in which one or more voices strictly imitate a leading voice after a fixed time delay and (optionally) at a fixed pitch interval. It is the purest expression of the contrapuntal ideal: a single melody, sounding simultaneously in multiple temporal and possibly pitch-transposed versions, creating a complete harmonic and polyphonic texture from a single melodic line. The canon demonstrates that the contrapuntal ideal — the equality and independence of multiple voices — can be realized with maximal economy: one voice, heard against its own temporal reflection.

Definition 6.3 (Canon). A canon is a composition in which a leading voice (the dux, or antecedent) is strictly imitated by one or more following voices (the comes, or consequent), each entering after a fixed temporal interval (the time interval of imitation) and at a fixed pitch interval (the pitch interval of imitation, which may be zero for a unison canon). All notes of the dux are replicated exactly in the comes, transposed by the pitch interval if non-zero. A canon at the unison has the follower replicating the leader at the same pitch (or an octave transposition thereof); a canon at the fifth has the comes a fifth above or below the dux.

Composing a canon is an exercise in extreme compositional discipline. Every note of the dux becomes, after the time interval, a note of the comes; the composer must ensure that every vertical interval formed between the dux and the comes satisfies the voice-leading rules. Since the comes is simply a delayed copy of the dux, this imposes a strict self-consistency requirement on the dux: the melody must be consonant (or appropriately dissonant) with itself when heard against a temporally displaced version of itself. This constraint typically forces the dux melody to have a very particular intervallic structure, and discovering this structure — composing a melody that generates its own valid harmonic companion — is one of the great intellectual pleasures of contrapuntal composition.

Example 6.1 (Bach's Musical Offering, BWV 1079: Canon per Tonos). Among the ten canons in Bach's Musical Offering — composed in 1747 for Frederick the Great of Prussia — the Canon per Tonos ("canon through the keys") is perhaps the most astonishing. It is a two-voice canon at the second, meaning the comes enters a major second above the dux and a measure later. What is extraordinary is that the canon ends in a different key than it begins: each complete cycle of the canon rises by a whole step, so that after six cycles the canon has modulated through six keys and returned (at the octave) to the starting key. The canon thus has no ending — it spirals upward indefinitely. Bach appended the instruction "As the modulation rises, so may the glory of the King." The mathematical structure is elegant: the canon exploits the near-enharmonic equivalence of equal temperament to achieve a continuous modulating cycle that would be impossible in just intonation.

6.7 Canon Varieties and the Double Invention

The variety of canonical techniques available to a composer is a rich source of structural interest. Beyond the straightforward unison or fifth canon, the Baroque era explored augmentation, diminution, inversion, retrograde, and combinations thereof. Each variety imposes its own constraints on the dux melody and produces its own characteristic sonic effect.

Definition 6.4 (Canon Varieties).
  • Canon by augmentation: the comes moves at half the speed of the dux — all note values doubled. The dux must be consonant with a version of itself at double speed, a very strong constraint.
  • Canon by diminution: the comes moves at twice the speed of the dux — all note values halved.
  • Canon by inversion: every ascending interval in the dux becomes a descending interval of the same size in the comes, and vice versa. The dux must be consonant with its own mirror image.
  • Retrograde canon (crab canon, cancrizans): the comes plays the dux backward, from the last note to the first. The dux must form valid two-voice counterpoint when combined with its own retrograde — an extraordinarily severe constraint.
  • Spiral (perpetual) canon: each successive entry is at a different pitch level, and the canon modulates continuously through a cycle of keys before returning to the original pitch. Bach's Canon per Tonos is the supreme example.

The double invention is a two-voice piece in which two contrasting motives are introduced separately and then combined in invertible counterpoint. Bach’s Invention No. 9 in F minor (BWV 780) is a particularly clear example: the first motive (a leaping arpeggio figure) is presented in one voice while the second motive (a flowing chromatic descent) appears in the other; after both have been stated, they are combined, and then their registral positions are exchanged. The invertibility of the combination — the fact that the arpeggio works both above and below the chromatic descent — is the structural demonstration that the two motives were composed specifically for each other, that their harmonic profiles are complementary.

6.8 Augmentation and Diminution as Structural Devices

Beyond their use in canon, augmentation and diminution serve as structural devices in the middle sections and final entries of fugues, providing a means of varying the temporal profile of a theme without altering its pitch content. When the subject appears in augmentation — all note values doubled — it seems to expand in time, acquiring a quality of dignity or gravitas appropriate to a climactic arrival. When it appears in diminution — values halved — it moves twice as fast, creating a sense of acceleration, urgency, or fugato playfulness.

Definition 6.5 (Augmentation, Diminution, and Inversion as Thematic Transformations). Three classical thematic transformations applicable to any melodic subject are:
  • Augmentation: all note values multiplied by a constant factor \(k > 1\) (typically \(k = 2\)). The pitch sequence is preserved; the rhythm is slowed.
  • Diminution: all note values multiplied by \(k < 1\) (typically \(k = 1/2\)). The pitch sequence is preserved; the rhythm is accelerated.
  • Melodic inversion: each ascending interval of the subject becomes a descending interval of the same size, and vice versa. The subject is reflected about its first pitch, or about any chosen axis pitch. Melodic inversion creates a mirror image of the subject that may suggest a modal or expressive contrast to the original.
These transformations may be combined: augmentation in inversion, diminution in retrograde, and so on. Bach employs all combinations in the Art of Fugue (BWV 1080), his final and most systematic exploration of contrapuntal technique.

The Art of Fugue — Bach’s last major work, left unfinished at his death in 1750 — is an encyclopedic collection of contrapuntal studies on a single subject: a stepwise theme in D minor of great simplicity and motivic richness. Each of the work’s eighteen canons and fugues demonstrates a different contrapuntal technique: simple fugue, fugue in contrary motion, double and triple fugue, augmentation fugue, stretto fugue, mirror fugue (in which the entire fugue is played simultaneously with all voices inverted), and canons at various intervals and time distances. The Art of Fugue is thus the theoretical culmination of everything the present course has covered — a systematic demonstration that a single subject, properly constructed, contains the seeds of an entire contrapuntal universe.

6.9 The Goldberg Variations as Compendium of Canonic Technique

Bach’s Goldberg Variations (BWV 988, 1741) provide the richest single demonstration of canonic technique in the keyboard repertoire. The thirty variations are organized in groups of three: each group of three ends with a canon, and the canons progress systematically from a canon at the unison (Variation 3) through canons at the second, third, fourth, fifth, sixth, seventh, eighth, and ninth (Variations 6, 9, 12, 15, 18, 21, 24, 27), concluding not with a canon but with a quodlibet — a humorous combination of popular songs — before the return of the original aria. The systematic progression through canonical intervals provides a survey of the acoustical and contrapuntal properties of each interval: the canon at the unison feels intimate and self-reflective; the canon at the fifth (Variation 15, in G minor — the only minor-mode variation) has a quality of yearning and searching; the canon at the seventh (Variation 21) creates the most acute harmonic tension of any canon in the set, since the seventh is a dissonance requiring resolution and the canonic relationship constantly creates momentary seventh-suspensions between dux and comes. Together, the nine canons of the Goldberg Variations constitute a practical textbook of canonical technique at least as instructive as any theoretical exposition.

6.10 The Art of Writing a Canon

Composing a successful canon — one that sounds musical rather than mechanical — is one of the most demanding exercises in contrapuntal writing, and mastering it confers an intimate understanding of the relationship between melodic structure and harmonic implication that no other exercise can provide. The difficulty is that every note of the dux simultaneously serves two functions: it is a note in the dux melody (with certain horizontal, melodic obligations) and it is a harmonic preparation for the corresponding note of the comes (with certain vertical, harmonic obligations toward the comes’ continuation). These two functions are simultaneously operative and occasionally in tension, and the art of canon-writing lies in finding melodies that satisfy both without compromise.

Theorem 6.2 (Self-Compatibility Condition for Canon at the Unison). A melody \(m = (m_1, m_2, \ldots, m_n)\) can serve as the dux of a strict two-voice canon at the unison with time interval \(k\) if and only if, for every position \(i\), the interval between \(m_i\) (the comes' note at position \(i\)) and \(m_{i+k}\) (the dux's note at position \(i+k\)) is a permissible interval in strict counterpoint — either a consonance on the beat or an appropriately treated dissonance. That is, the melody must be consonant with a \(k\)-step-delayed version of itself.

This self-compatibility condition is the mathematical heart of canon construction. It explains why canons tend to avoid the perfect fifth as a harmonic interval (it would invert to a fourth in the comes-over-dux arrangement, and fourths over a bass are dissonant), why they prefer stepwise motion (leaps in the dux create potentially awkward intervals against the comes’ continuation), and why the most successful canon melodies have a certain internal redundancy — they tend to move in relatively predictable, scalewise patterns that remain consonant against their own temporal displacement.

Example 6.2 (Bach's Canon at the Octave from the Goldberg Variations, BWV 988). The thirty Goldberg Variations contain nine canons — one every three variations, at successively larger intervals from the unison to the ninth — providing an encyclopedic demonstration of canon technique at various pitch intervals. The Canon at the Octave (Variation 15) is particularly beautiful: a two-voice canon in G minor in which the dux is a flowing, sinuous melody in the alto, imitated at the octave below by the tenor eight beats later. The bass supports both voices with an independent harmonic line, making this technically a three-voice texture: two canonic voices plus a non-canonic bass. The canonic pair is entirely invertible (the dux works equally well above or below the comes), and the result is a texture of extraordinary elegance — two voices pursuing each other through G minor with an intimate, almost conversational quality that makes the strict canonical technique sound completely natural.

Chapter 7: Fugue: Structure and Technique

7.1 Subject and Answer

The fugue is the supreme achievement of the contrapuntal tradition — the genre in which every technique studied in the preceding chapters converges into a single, architecturally unified composition. To approach the fugue after the species exercises, the inventions, the chorale harmonizations, and the study of invertible counterpoint and canon is to experience the satisfaction of synthesis: every technique one has mastered in isolation appears here in combination, serving the unified formal goal of the fugue. The suspension that was isolated in fourth species now appears as the countersubject’s primary harmonic feature; the sequence that was practiced in the invention’s episode now serves as the modulating link between subject entries; the invertible counterpoint that was the subject of Chapter 6 now organizes the formal deployment of subject and countersubject across the exposition and middle entries. The fugue is not a new technique but the integration of all previous techniques into a single compositional form of extraordinary expressive range and formal discipline. A fugue is not merely a contrapuntal exercise of arbitrary complexity; it is a precisely articulated form with defined sections, each serving a distinct formal and expressive function. To understand the fugue is to understand the entire Baroque contrapuntal system in its most fully realized expression.

The fugue begins with its most basic unit: the subject (or dux, leader). The subject is a single melodic line, typically four to eight measures in length, that encapsulates the character and harmonic meaning of the entire fugue. A great fugue subject has a distinctive rhythmic profile, a clear harmonic implication, and a melodic contour that is both singable in isolation and capable of generating interest when combined with itself and with a countersubject. The subject is the fugue’s DNA: every subsequent event in the fugue — the countersubject, the episodes, the stretto — grows from it or is shaped by it.

Definition 7.1 (Fugue Subject and Answer). The subject of a fugue is its primary melodic theme, stated initially by a single voice without harmonic accompaniment in the tonic key. The answer (or comes) is an imitation of the subject at the interval of a dominant fifth above or a subdominant fourth below — that is, in the dominant key — presented by a second voice while the first voice continues with the countersubject. The answer is called real if it is an exact transposition of the subject to the dominant; it is called tonal if certain intervals are adjusted to maintain tonal coherence and prevent premature modulation to the dominant-of-the-dominant.

The distinction between real and tonal answers is one of the most subtle points in fugue construction and reflects a deep principle of tonal harmonic grammar. A real answer transposes the subject literally to the dominant key; a tonal answer modifies certain intervals — particularly the opening interval, if the subject begins with an ascent from \(\hat{1}\) to \(\hat{5}\) or with a prominent \(\hat{5}\) in the first measure. The rationale: if the subject opens with the tonic-to-dominant fifth, the answer that replaces it with a dominant-to-dominant-dominant fifth implies a modulation to the key of the dominant’s dominant — two steps away from the tonic — at the very outset. To preserve the tonic-dominant polarity of the opening exposition, the tonal answer replaces the opening ascending fifth with an ascending fourth (from \(\hat{5}\) to \(\hat{1}\) in the dominant key), effectively neutralizing the modulating tendency while still implying the dominant key.

Example 7.1 (Real vs. Tonal Answer: WTC Book I, C Major Fugue, BWV 846). Bach's C major fugue from Book I opens with a subject that begins on C and rises through the interval of a fifth to G (approximately), implying the move from tonic to dominant. If answered literally (real answer), the comes would open on G and ascend a fifth to D — strongly implying G major, the key of the dominant. Bach instead answers tonally: the comes opens on G but the initial ascending fifth is compressed to an ascending fourth (G to C), after which the remainder of the subject is answered literally at the fifth. The result is a comes that stays comfortably within the orbit of C major's dominant — clearly in G major, but without asserting G major's own dominant — before the third entry of the subject returns the music decisively to C major.

7.2 Subject Design and Its Harmonic Implications

The design of a fugue subject is the most consequential compositional decision in the entire fugue. Every feature of the subject — its length, its rhythm, its harmonic implication, its melodic contour — predetermines what will be possible in the subsequent development. A subject that begins on the tonic and ends on the dominant is easily answered; a subject that modulates aggressively through multiple keys before settling is problematic, because the answer must begin in the dominant but cannot simply transpose the subject’s modulations up a fifth without immediately overshooting the harmonic range of the exposition. A subject that consists entirely of repeated notes has no possibilities for sequential development, because sequences require melodic motion; a subject that moves entirely by step offers limited possibilities for harmonic sequence at the fifth, because the step motion does not span the harmonic distance that sequences typically traverse.

Theorem 7.1 (Properties of a Well-Designed Fugue Subject). A fugue subject suitable for extended development possesses the following properties:
  • Harmonic clarity: it implies a single tonal center (or a clear departure from and return to one), so that the tonal answer is unambiguous and the expositions of subject and answer are harmonically coherent.
  • Rhythmic identity: it has a distinctive rhythmic profile that is recognizable even when the subject is inverted, augmented, or heard in stretto against itself.
  • Motivic fertility: it contains internal motives — short rhythmic or melodic patterns — that can be extracted and developed independently in the episodes. A subject with no internal motivic cells offers nothing from which to build episodes.
  • Stretto potential: its harmonic structure permits it to be combined with a time-displaced version of itself without producing forbidden parallels. Not every subject has this property, and subjects that lack it cannot be used in stretto.
  • Melodic completeness: it is a satisfying melody in its own right, independent of the countersubject. If the subject is melodically inert, the fugue will feel lifeless at every moment of bare subject entry.

The relationship between subject design and stretto potential is particularly important and illustrates the degree to which the subject of a great fugue is compositionally engineered rather than merely invented. Bach’s subjects in the Well-Tempered Clavier vary widely in their stretto potential: some admit stretto at only one time interval, while others (particularly in the numerically complex multi-voice fugues like BWV 849 in C# minor) admit stretto at multiple time intervals and pitch intervals. The C# minor fugue’s subject appears to have been composed specifically to maximize stretto potential — its internal structure is such that every possible stretto configuration produces valid counterpoint — making the fugue a tour de force of contrapuntal engineering in which the formal development of the middle section seems to follow inevitably from the properties of the subject itself.

7.3 The Countersubject and Its Invertibility

As the answer enters in the second voice, the first voice does not fall silent; it continues with new material called the countersubject. The countersubject is the fugue’s second melodic personality, and in a well-constructed fugue it is composed to be not merely compatible with the subject but invertibly counterpointed with it. This invertibility — the ability of subject and countersubject to exchange registral positions — is what allows the countersubject to appear in multiple voices throughout the fugue without losing its function.

Definition 7.2 (Countersubject). The countersubject is the melodic material accompanying the answer (and subsequent entries of the subject or answer) in the voice that has just stated the subject. A regular countersubject appears consistently at each entry of the subject throughout the fugue and is invertibly counterpointed with the subject at the octave (or tenth or twelfth). An irregular countersubject (or free counterpoint) varies with each appearance of the subject. A fugue may have one, two, or (rarely) three regular countersubjects, each invertible with the subject and with each other.

A well-designed countersubject serves multiple compositional functions simultaneously. Melodically, it must be interesting enough to sustain attention during the answer — it cannot simply sustain a single pitch or repeat a single rhythm. Rhythmically, it should contrast with the subject: if the subject moves in long notes, the countersubject should move in shorter values, and vice versa, so that the two voices provide a rhythmic texture of complementary activity. Harmonically, it must be compatible with the subject at every point of their combination, producing no forbidden parallels in any of the invertible permutations in which it will appear. Motivically, it may derive some of its material from the subject — creating an internal unity that ties the two themes together — or it may be completely contrasting, providing variety.

Example 7.2 (Countersubject Analysis: C Minor Fugue, WTC Book I, BWV 847). The C minor fugue's subject is chromatic and harmonically ambiguous, beginning on C and descending by semitones before leaping. The countersubject, which enters in the soprano as the alto states the answer, moves largely in the opposite direction — ascending by step in longer note values — providing rhythmic and directional contrast. When subject and countersubject are exchanged in the third entry (soprano takes the subject while the alto takes the countersubject), the combination works without forbidden parallels, demonstrating the invertibility that Bach built into the materials from the outset. The entire three-part texture of the exposition — subject, countersubject, and free counterpoint in a third voice — is thus architecturally designed from the first measure of the fugue.

7.4 The Codetta and Its Motivic Function

Between the subject and the answer in the fugal exposition, a brief connecting passage — the codetta — may appear. The codetta is not merely a harmonic connector; it is a motivic opportunity, a place where the composer can introduce episodic material that will be developed later in the fugue’s middle section. A well-designed codetta thus serves a double function: it manages the harmonic transition from tonic to dominant (or dominant back to tonic) in preparation for the next entry, and it introduces motivic cells that will be recognizable and meaningful when they reappear in the episodes.

Definition 7.3 (Codetta). A codetta is a brief passage occurring between the subject and the answer within the fugal exposition, serving to modulate harmonically to the key of the next entry and (optionally) to introduce motivic material for subsequent episodic development. Codettas are absent in many fugues (the answer enters directly after the subject without harmonic preparation) and present in others; their length varies from a single beat to several measures. A codetta that is too long disrupts the rhythmic momentum of the exposition; one that is too short or harmonically abrupt fails to prepare the key of the answer smoothly.

The motivic relationship between codetta and episode is one of the most elegant aspects of fugal architecture. When the codetta’s motivic material reappears in the episodes of the middle section — transformed, sequenced, inverted — the listener who has been attentive to the codetta recognizes the material as familiar, as “from” the piece, even though the subject itself is absent. This creates a sense of motivic economy and coherence: the fugue generates all its material from a very small number of melodic ideas, and even the transitional passages contribute thematically to the whole.

7.5 The Exposition

The exposition is the opening section of the fugue, in which each voice enters in turn with either the subject or the answer. In a four-voice fugue, the exposition presents four entries: subject — answer — subject — answer, alternating between tonic and dominant key presentations until all voices have entered. The order of voice entries is typically from highest to lowest (soprano–alto–tenor–bass) or lowest to highest, though many other orderings occur in Bach.

Definition 7.3 (Fugal Exposition). The exposition is the opening section of a fugue in which each voice introduces either the subject (in the tonic key) or the answer (in the dominant key) for the first time. Between the subject and answer entries, a brief connecting passage called the codetta may appear, allowing modulation from tonic to dominant (or vice versa) to prepare the next entry and introducing motivic material that will reappear in the episodes. After all voices have entered, the exposition closes and the fugue proceeds to its developmental middle section.

The exposition establishes three things simultaneously: the subject’s melodic and rhythmic identity, the harmonic polarity of tonic and dominant that will structure the entire fugue, and the contrapuntal relationships (subject with countersubject, subject with free counterpoint) that will be developed in the middle section. An exposition that accomplishes all three of these with clarity and economy — that presents its material distinctly yet without over-exposition, that creates the expectation of development without yet fulfilling it — is one of the most satisfying structural achievements in all of music.

7.6 Middle Entries, Episodes, and Sequential Development

After the exposition, the fugue enters its middle section — the longest and structurally most varied part of the form. The middle section alternates between entries of the subject (in various keys and voices) and episodes — developmental passages in which the subject itself is absent but in which its motivic material is developed through sequence, imitation, and invertible counterpoint.

Definition 7.4 (Episode and Middle Entry). A middle entry is an appearance of the subject or answer in a key other than the tonic — typically the dominant, the relative major or minor, the subdominant, or other closely related keys. Each middle entry is a moment of formal weight, comparable to the structural pillars of a building, between which the episodes provide the connecting fabric. An episode is a developmental passage between subject entries in which the subject does not appear, but in which motivic material derived from the subject, countersubject, or codetta is developed, typically through harmonic sequence: the repetition of a melodic or harmonic pattern at successively different pitch levels.

Sequential episodes are the primary vehicle of harmonic development in the middle of a fugue. A two-bar motivic pattern is stated and then repeated immediately a step or third higher or lower, and the process is repeated several times, traversing harmonic territory efficiently and with a sense of logical inevitability. The sequential episode exploits the invertible counterpoint of the episodic motives: as the sequence progresses, the motives exchange registral positions, presenting the same material in successively different configurations. A well-constructed episode sequence can modulate from the relative minor to the subdominant in the space of four or six bars without a single arbitrary harmonic move — every chord follows from the sequence by voice-leading necessity.

The formal plan of a fugue’s middle section is not arbitrary but follows a characteristic tonal itinerary: from the tonic, to the dominant (or, in a minor-mode fugue, to the relative major), then to the subdominant or supertonic minor, and back to the tonic. This itinerary traces a large-scale tonic–dominant–subdominant–tonic progression at the level of the fugue’s key areas, mirroring the tonic–dominant–subdominant–tonic chord progressions that operate at the local harmonic level within each phrase. The fugue’s tonal plan is, in a sense, a macrocosm of its harmonic syntax — the same formal logic operating at a vastly expanded time scale. This self-similarity between harmonic gesture at the note level and harmonic gesture at the sectional level is one of the most intellectually satisfying features of Baroque fugal architecture.

Example 7.3 (Stretto in the C# Minor Fugue, WTC Book I, BWV 849). The C# minor fugue — the longest and most architecturally ambitious fugue in Book I — employs five voices and a systematic use of stretto. The subject is constructed in such a way that it can be overlapped with itself at several different time intervals: entries two beats apart, one beat apart, and even simultaneous (at the same pitch and octave). Bach explores these stretto possibilities methodically in the middle section, beginning with wide time intervals (easy, because the harmonic conflicts are minimal) and progressively narrowing them until the final stretto, in which all five voices pile up entry upon entry in rapid succession. The cumulative effect is one of mounting contrapuntal intensity, the subject seeming to double back upon itself in increasingly urgent self-pursuit, until the final tonic entry in augmentation in the bass provides the definitive, spacious resolution of the tension.

7.7 Stretto, Final Entry, and Coda

Stretto — the overlapping of subject entries in quick succession — is the most powerful device of fugal development. Its effect is one of mounting urgency: the subject seems to pursue itself, entries crowding upon each other in rhythmic and motivic compression until the texture reaches a point of maximum density that demands the release of the final entry.

Definition 7.5 (Stretto and Augmentation). Stretto (Italian: "tight," "narrow") is the technique in which each successive entry of the subject begins before the previous entry has concluded. Stretto requires that the subject be consonant with itself when heard against a time-displaced version of itself — a constraint that, like the constraint on canon, is encoded in the specific melodic and harmonic structure of the subject. Augmentation is the presentation of the subject in note values twice as long as the original; it is often employed in the final entry of a fugue, where the expanded time scale creates a sense of arrival, spaciousness, and formal finality after the compressed activity of the stretto passages.

The final entry of a fugue — the tonic return of the subject in (typically) the bass voice, often in augmentation, sometimes accompanied by a tonic pedal point — is one of the great moments of musical architecture. Its arrival is felt as homecoming: the subject, which has been developed, inverted, subjected to stretto, and heard in remote key areas, returns at last to its original key and in its most emphatic registral location. Often Bach reinforces this arrival with a pedal point — a sustained bass note on the tonic — over which the upper voices continue their contrapuntal activity. The subject sounds above this pedal not as a modulating theme but as a tonic proclamation, its harmonic implications now resolved into a single, unambiguous tonal center.

Remark 7.1 (Subject as Seed of Structure). The deepest achievement of the great fugues in the Well-Tempered Clavier is the sense that every structural feature of the fugue — the tonal answer, the countersubject, the episode motives, the stretto intervals — was implicit in the subject from the very beginning. A great fugue subject is not merely a melody that the composer then elaborates; it is a precisely engineered object whose melodic and harmonic properties predetermine the entire compositional plan. When one hears the final entry after a complex development, the sense of return is not merely formal but psychological: the subject has been the same throughout, but the journey through the middle section has revealed aspects of its harmonic and structural meaning that were latent but invisible at the opening. This is the fugue's unique contribution to musical form — the demonstration that logical, rigorous, mathematically constrained development and deep aesthetic beauty are not merely compatible but identical.

7.8 Analyzing a Complete Fugue: The D Minor Fugue, WTC Book I, BWV 851

Walking through a complete fugue analysis — not just identifying its sections by name but tracing the logic of every structural decision from the first note to the last — is the most effective way to internalize the fugue’s formal logic. The D minor fugue from Book I of the Well-Tempered Clavier (BWV 851) is ideal for this purpose: it is a two-voice fugue, the simplest possible texture, yet it exhibits every essential feature of fugue construction with textbook clarity.

Example 7.4 (Full Analysis: D Minor Fugue, WTC Book I, BWV 851). The subject — four measures in length — opens on D4, ascends by step to F4, leaps up a minor sixth to D5, then descends stepwise to A4. The ascent by step followed by the large leap creates an arch of considerable tension; the subsequent stepwise descent releases that tension while maintaining forward motion through the phrase. The subject ends on A4, the dominant scale degree, implying a half-cadence and making the tonal answer begin on A — the dominant key — entirely natural.

The answer (tonal) begins on A4 (the dominant of D minor is A minor), answers the ascending step figure, and descends stepwise back toward E. The countersubject — entering in the original voice as the answer sounds — moves primarily in contrary motion to the answer: while the answer descends, the countersubject tends to ascend, creating the contrary-motion independence of genuine two-voice counterpoint.

The exposition occupies measures 1–8 (approximately): subject in the right hand, answer (with countersubject below) in measures 1–4; then the answer continues while the right hand answers with the subject in the dominant — wait: in a two-voice fugue, there are only two voices, so the exposition consists of two entries only: subject in voice 1 (measures 1–4), answer in voice 2 (measures 5–8) with the countersubject in voice 1.

After the exposition, episodes constructed from the subject’s opening leap and stepwise descent motif traverse F major, A minor, and C major before the final entry returns the subject to D minor — in the low register of the left hand, with the right hand providing the countersubject above it in its inverted position. The final entry is preceded by a dominant pedal (two measures of sustained A in the bass), which creates the tension whose release is the formal goal of the entire fugue. The cadence — a full authentic cadence in D minor, the bass moving from A to D while the soprano descends from E to D — arrives with the force of a physical impact, made all the more decisive by the sustained dominant pedal that preceded it.

This analysis reveals a structural principle that holds throughout the Well-Tempered Clavier: the exposition presents the subject in its “natural” harmonic context (tonic and dominant); the middle section develops the subject in related key areas (relative major, subdominant, supertonic minor); and the final entry restores the subject to the tonic with an additional degree of harmonic weight — the pedal point, or the additional voices entering above or below — that makes the return feel like more than a mere repetition. The fugue has not simply returned to its starting point; it has arrived at its destination.

Remark 7.2 (Two-Voice vs. Four-Voice Fugue). Two-voice fugues — of which there are several in the Well-Tempered Clavier — present the fugal logic in its most exposed form: there is no harmonic filling provided by inner voices, and the two-voice contrapuntal writing must carry the entire harmonic and formal weight of the piece. Every interval counts; there is nowhere to hide. Four-voice fugues, by contrast, benefit from the harmonic support of inner voices and can sustain more complex harmonic progressions and more elaborate decorative figuration. The formal logic of the fugue is the same in both cases; only the density and complexity of the texture differs. The student who can write a satisfying two-voice fugue has demonstrated the fundamentals; the student who can write a satisfying four-voice fugue has demonstrated mastery.

Chapter 8: Counterpoint After Bach

8.1 The Well-Tempered Clavier as Compendium

Before examining counterpoint after Bach, it is worth pausing to appreciate the extraordinary scope of the Well-Tempered Clavier as a compendium of fugal and contrapuntal technique. The forty-eight preludes and fugues of Books I and II (composed and revised between 1722 and approximately 1742) are not merely a collection of pedagogical exercises but an encyclopedic survey of the possibilities of the fugue genre at a single historical moment. Each fugue demonstrates a different combination of formal features — different subject types, different numbers of voices (two, three, and four), different degrees of motivic complexity, different relationships between subject and countersubject, different approaches to the middle-section episodes and the final return — and the variety is so complete that the entire repertoire can be read as a systematic demonstration that the fugue form is not a rigid template but a flexible architecture capable of infinite variety within a shared structural logic.

Remark 8.1 (Numerical Symbolism in the WTC). Several scholars have noted numerical relationships in the WTC that suggest Bach conceived of the collection as a unified whole rather than a collection of independent pieces. The forty-eight fugues divide symmetrically between the two books; the keys are arranged in ascending chromatic order through all twelve major and minor keys in each book; and some fugues appear to have been composed or selected to demonstrate specific structural principles (e.g., a two-voice fugue followed by a three-voice fugue in the same key area, or a fugue with a chromatic subject followed by one with a diatonic subject). Whether or not one accepts the most elaborate numerical interpretations, the WTC clearly represents a compositional program of extraordinary ambition: the demonstration, in forty-eight individual pieces, that the fugue form can be made to serve the full range of expressive and structural possibilities available within the tonal system.

8.2 Counterpoint in the Classical Era

Bach died in 1750, and the style he brought to its supreme expression — the High Baroque polyphonic manner of interlocking contrapuntal lines governed by species rules and harmonic sequence — was already beginning to seem archaic to his contemporaries. The Classical era, dominated by Haydn and Mozart, adopted a predominantly homophonic texture in which melody and accompaniment replaced the polyphonic equality of voices. Yet counterpoint did not disappear; it was absorbed, transformed, and deployed with calculated expressive effect within the new formal framework of sonata form, the string quartet, and the symphony.

Remark 8.1 (The Classical Fugue as Learned Style). For Haydn and Mozart, the fugue was a marker of the stile antico — the learned, ecclesiastical style associated with church music and the tradition of Fux. When a Classical composer wrote a fugue, they were self-consciously invoking this tradition, signaling seriousness, technical command, and connection to a musical past of authority and gravity. The fugal finales of Haydn's Op. 20 string quartets (Nos. 2, 5, and 6, all with fugal finales), the fugues in Mozart's Requiem (K. 626), and the fugal development sections of Beethoven's late piano sonatas and string quartets all partake of this deliberate archaism, deploying contrapuntal complexity as an expression of elevated musical discourse — a mode of speaking that the composer reserves for moments of highest formal weight.

Mozart’s study of counterpoint was intensive and deeply felt. Under his father Leopold’s direction he worked through Fux’s Gradus as a child; later, in Vienna in 1782, he encountered the music of J.S. Bach through Baron van Swieten’s private concerts and was electrified. He immediately began composing his own fugues for string quartet (the Fugue in C minor, K. 426, later arranged for strings as K. 546) and arranging Bach’s keyboard fugues for string quartet as pedagogical exercises. The counterpoint in his late works — the Kyrie of the Requiem, the overture to The Magic Flute with its Bachian three-voice fugue, the finale of the Jupiter Symphony — reflects a mature assimilation of the Baroque contrapuntal tradition into the Classical formal language.

Example 8.1 (The Jupiter Symphony Finale: Five-Voice Combination). The finale of Mozart's Symphony No. 41 in C major (K. 551, the "Jupiter") is one of the most remarkable structural achievements in the symphonic literature. It is in sonata form, with a development section that employs fugal technique, and it closes with a coda in which five distinct themes — all introduced separately in the course of the movement — are combined simultaneously in invertible counterpoint. The five themes fit together in multiple configurations, each valid harmonically and contrapuntally, producing a polyphonic texture of joyful abundance. The achievement is not merely technical: it gives the movement a sense of culminating synthesis, as though the movement's entire thematic material has been striving toward this moment of simultaneous combination. The five-voice combination is the fugal stretto of the symphony, compressed into a final peroration.

8.3 Romantic Counterpoint: Brahms and Wagner

The Romantic era brought a paradoxical relationship with counterpoint. On the surface, the massive orchestral and operatic textures of Berlioz, Liszt, and early Wagner emphasize harmonic colour, tonal ambiguity, and expressive extremity over contrapuntal logic. Yet the greatest composers of the period were deeply learned contrapuntists whose surface effects rested on rigorous polyphonic foundations. The contradiction resolves when one understands that Romantic counterpoint works not by the strict interval-by-interval logic of the Baroque but by harmonic voice-leading: the voices are guided by the logic of functional harmony operating at the level of the phrase rather than the beat.

Example 8.2 (Brahms's Passacaglia: Symphony No. 4, Finale). Johannes Brahms was the most profoundly learned contrapuntist of the nineteenth century. His study of Renaissance polyphony (he edited collections of early music and corresponded with musicologists about modal counterpoint), his meticulous mastery of the species system, and his intimate knowledge of Bach made him uniquely equipped to deploy historical contrapuntal forms in a Romantic harmonic language. The finale of his Fourth Symphony (op. 98) is an extended passacaglia: a thirty-note bass ostinato, stated thirty-one times, with thirty-one variations above it. Each variation is a distinct contrapuntal and orchestral study; together they form an arch of tension and release over the fixed bass — species counterpoint operating at the formal rather than the note-to-note level. The bass is the cantus firmus; the variations are the species; and the entire movement is a demonstration that the contrapuntal ideal of ordered polyphony above a fixed line transcends the historical style of its original formulation.

Wagner’s relationship with counterpoint is more idiosyncratic and more rhetorical. His enormous orchestral textures are frequently imitative without being strictly fugal; the leitmotif system of his later operas creates a kind of associative counterpoint in which several dramatically charged themes are combined simultaneously, their combination generating new dramatic meaning beyond what any single theme carries alone. This is not species counterpoint; it is not even invertible in the strict sense; but it is profoundly contrapuntal in spirit, governed by the principle that simultaneous voices create meaning through their relationship that neither could create alone.

Definition 8.1 (Triple Counterpoint). Triple counterpoint is a three-voice texture in which all three voices are mutually invertible — any of the six possible orderings of the three voices (positions 1, 2, 3 in any arrangement) produces valid counterpoint without forbidden parallels. Composing triple counterpoint requires that every interval formed between each pair of voices be valid in all inversional combinations, imposing severe restrictions on the intervals that may be used and their voice-leading. Wagner's combination of three leitmotifs in the Die Meistersinger prelude approximates triple counterpoint, though Wagner allows voice-leading liberties that strict Baroque counterpoint would prohibit.

8.4 Counterpoint and the String Quartet

The classical string quartet — two violins, viola, and cello — is the instrumental medium that most closely approximates the four-voice SATB texture of the chorale, and from Haydn through Bartók it has been the primary laboratory for instrumental counterpoint outside the keyboard. The four instruments of the string quartet each occupy a distinct registral stratum (first violin as soprano, second violin as alto, viola as tenor, cello as bass), and the independence of each instrument — the fact that each can sustain a melodic line of its own, that none is relegated to mere accompaniment by the nature of the instrument itself — makes the quartet the ideal medium for genuine four-voice polyphonic writing.

Remark 8.3 (Haydn's Op. 20 Quartets and the String Quartet Fugue). Haydn's six string quartets Op. 20 (published 1772) mark a decisive moment in the history of the string quartet as a medium for serious counterpoint. Three of the six quartets (Nos. 2, 5, and 6) end with full fugues — not fugal passages or fugato textures, but complete, architecturally developed fugues with subjects, answers, countersubjects, episodes, and stretto. Haydn's quartet fugues are technically rigorous: the voice-leading is careful, the invertible counterpoint is properly constructed, and the formal architecture of exposition, development, and return is intact. They represent the application of the keyboard fugue tradition — specifically the Well-Tempered Clavier tradition — to the four-instrument medium, and they demonstrate that the fugue form is not instrument-dependent: it is a compositional logic that can be realized by any ensemble capable of sustained independent voices.

The development of the string quartet in the early Classical era thus runs parallel to the growing importance of counterpoint as a compositional value in the learned style. As the quartet became increasingly important as a chamber genre, composers invested more contrapuntal sophistication in it: Haydn’s later quartets (Op. 33, Op. 64, Op. 76) contain increasingly rich imitative textures; Mozart’s “Haydn” quartets (K. 387, K. 421, K. 428, K. 458, K. 464, K. 465) include fugal finales and extensively imitative development sections; and Beethoven’s late quartets (Op. 127, Op. 130, Op. 131, Op. 132, Op. 135) push the string quartet to the limits of what four independent voices can achieve, with movements in strict fugal form, sections of invertible counterpoint at the twelfth, and a pervasive contrapuntal treatment of even the slow movement textures that goes far beyond what any earlier quartet had attempted.

8.5 Twentieth-Century Neoclassical Counterpoint

The early twentieth century saw a deliberate reaction against the chromatic excess of late Romanticism in the form of the neoclassical movement, which sought to reclaim the clarity, formal discipline, and contrapuntal rigour of the eighteenth century. The principal figures of this movement — Stravinsky, Hindemith, and Bartók — approached counterpoint from very different aesthetic starting points, yet all three revitalized the polyphonic tradition by subjecting it to the harmonic and rhythmic innovations of their era. The result was a counterpoint that retained the formal architecture of the Baroque while speaking an unmistakably modern harmonic language.

The neoclassical moment raises an interesting question: why should the fugue form, or the canon, or the species rules, survive the transition to a completely different harmonic language? What is it about the formal architecture of the fugue that makes it applicable to Hindemith’s “tonality” based on the overtone series, or to Bartók’s folk-inflected modality, or to Stravinsky’s bitonally inflected diatonicism? The answer, one comes to appreciate through the study of the species system, is that the fugue’s architecture is not dependent on the specific harmonic language that fills it. The fugue form is a temporal organization — subject entries, episodes, stretto, final entry — that organizes motivic and harmonic material in time according to a logic of departure and return, tension and resolution, that is not specific to any particular harmonic system. It is, in this sense, a meta-musical form: an architecture for the organization of whatever musical language one chooses to deploy within it. The neoclassicists understood this intuitively, and their fugues demonstrate it practically: the form remains recognizable and structurally coherent even when the harmonic vocabulary inside it is entirely non-Baroque.

Example 8.3 (Hindemith's Ludus Tonalis). Paul Hindemith's Ludus Tonalis (1942) is an explicit homage to the Well-Tempered Clavier: twelve three-voice fugues, connected by interludes, one for each of the twelve pitch classes — though Hindemith's ordering of the twelve tonics is determined by his own theoretical "Series I," derived from the acoustic properties of the overtone series rather than the chromatic scale. Each fugue employs the formal apparatus of the Baroque — subject, answer, exposition, episode, stretto, final entry — within a harmonic language defined by Hindemith's Craft of Musical Composition (1937). Hindemith's theory replaces consonance and dissonance with a continuous spectrum of "harmonic tension" derived from the overtone series, and his fugues deploy dissonances with full structural awareness of their place on this spectrum. The result demonstrates that the fugue as a formal type is independent of any particular harmonic language: the container remains when the contents change.
Example 8.4 (Bartók's Mikrokosmos and the String Quartets). Bartók's six volumes of Mikrokosmos (1926–1939, for solo piano) constitute a pedagogical progression analogous to the species system: early volumes present simple melodic and rhythmic problems; later volumes engage advanced contrapuntal techniques including two-voice imitation, canon, mirror fugue, and inversion. Volume VI contains a "Six Dances in Bulgarian Rhythm" and several pieces in which imitative counterpoint operates within an asymmetric folk-derived rhythmic framework — a modal melodic language subjected to Baroque formal discipline. The six string quartets extend this synthesis to four voices: the fourth quartet opens with a highly compressed imitative texture in which the four instruments present related chromatic motives in extremely close imitation, creating a harmonic density that the species rules would prohibit but that the ear accepts as characteristic of the style.

Stravinsky’s neoclassical counterpoint — particularly in the Symphony of Psalms (1930), the Octet for wind instruments (1923), and the Piano Concerto (1924) — is characterized by a deliberate angularity and rhythmic irregularity that reflects both his Russian background and his engagement with jazz and popular music. The Symphony of Psalms contains a double fugue in which two subjects — one angular and chromatic, one flowing and diatonic — are first presented separately and then combined in invertible counterpoint. The combination is a tour de force: the two subjects, apparently heterogeneous in character, fit together with a harmonic precision that reveals how carefully their interval structures were engineered for mutual compatibility.

The neoclassical return to counterpoint in the early twentieth century was not merely nostalgic; it was programmatic. The composers of the neoclassical movement understood — or at any rate believed — that the clarity, economy, and formal precision of the Baroque polyphonic style offered a corrective to what they perceived as the excess, bombast, and structural incoherence of late Romanticism. Counterpoint, in this view, was a form of musical discipline — a set of structural constraints that forced the composer to think carefully about every note, to make every melodic choice consequential, to achieve expressive power through economy rather than excess. The neoclassical movement is, in this sense, a musical manifesto for the ethical virtues of constraint: the belief that rules, properly understood and rigorously applied, liberate rather than restrict the musical imagination.

8.5 Serial Counterpoint and the Second Viennese School

The Second Viennese School — Schoenberg, Webern, and Berg — developed a contrapuntal system in which the organizing principle is not the consonance and dissonance hierarchy of species counterpoint but the twelve-tone row and its transformations. This serial counterpoint preserves the formal architecture of polyphony — multiple independent voices, motivic development, imitative entries — while replacing the tonal harmonic framework with a serial one.

Definition 8.2 (Serial Counterpoint). In serial counterpoint, as practiced by Schoenberg, Webern, and Berg, the melodic and harmonic content of all voices is derived from a single twelve-tone row and its transformations: the prime form (P), retrograde (R), inversion (I), and retrograde-inversion (RI), each transposable to twelve pitch levels, yielding 48 possible row forms. The row serves an analogous function to the cantus firmus in species counterpoint — it is the given material from which all other voices are derived — but instead of individual pitches, it provides a sequence of pitch-class intervals that governs the intervallic structure of each voice.

Webern’s serial counterpoint achieves an extraordinary density of structural organization within astonishing brevity. His Symphony Op. 21 — which lasts approximately ten minutes — is constructed from a twelve-tone row of remarkable symmetry (the second hexachord is the retrograde of the first, so that the row sounds like itself backward), and every detail of the two-movement work is derived from this row and its transformations through canonic imitation. The second movement is a set of variations, each of which is a different canonic arrangement of the row — at different time intervals, different pitch intervals, and with different voice assignments. The result is a microscopic fugal universe, every note structurally determined, yet the surface impression is of crystalline sonic beauty without apparent effort.

Remark 8.2 (Elliott Carter and Metric Counterpoint). Beyond pitch counterpoint, Elliott Carter developed the concept of metric modulation — a system in which different voices simultaneously move in different tempos, creating a kind of rhythmic counterpoint that is entirely independent of the pitch domain. In his String Quartet No. 2 (1959), each of the four instruments is assigned a distinct rhythmic character and range of tempos, and the four instruments pursue their independent temporal paths throughout the work. The result is a polyphony of tempos — a counterpoint of rhythmic identities — that extends the definition of contrapuntal independence from the pitch domain into the domain of time itself. Carter's metric counterpoint is thus the logical generalization of species counterpoint: where species counterpoint manages the independence of pitch trajectories, metric counterpoint manages the independence of temporal trajectories.

8.6 Minimalism, Spectralism, and Contemporary Polyphony

The second half of the twentieth century and the early twenty-first century have seen a remarkable diversification of approaches to polyphonic writing. Minimalism, spectralism, and the various post-tonal and post-serial practices of the late twentieth century all engage, in their different ways, with the fundamental contrapuntal questions of voice independence, melodic identity, and the management of tension and release.

Example 8.5 (Steve Reich's Phasing and the Generalization of Canon). Steve Reich's tape pieces and early ensemble works — Piano Phase (1967), Violin Phase (1967), Clapping Music (1972), Music for 18 Musicians (1976) — employ a form of counterpoint that has no historical precedent yet is deeply rooted in the perceptual logic of polyphonic listening. In Reich's phasing technique, two identical voices begin in unison and gradually fall out of synchronization as one voice repeats a short pattern slightly faster than the other. The result is a continuously evolving two-voice texture — every possible temporal displacement of the melody against itself is heard in succession — that is simultaneously hypnotic and structurally fascinating. The comes is the dux displaced by a continuously varying time interval; the canon modulates not in pitch but in temporal phase. This is the deepest generalization of the canonical principle: a melody heard simultaneously against every possible temporal reflection of itself.
Example 8.6 (Spectral Counterpoint: Gérard Grisey). The spectral composers of the 1970s and 1980s — Gérard Grisey, Tristan Murail, Hugues Dufourt — derived their harmonic and melodic materials from the physical analysis of sound spectra, treating the overtone series not as a background theoretical principle (as in Hindemith's theory) but as the direct source of compositional material. In Grisey's Partiels (1975), the orchestral texture is a slow temporal unfolding of the overtone spectrum of a low E, each partial entering in sequence from the fundamental upward. The counterpoint between the partials — some tuned in just intonation, others at equal-tempered approximations — creates a shimmering texture in which the "voices" are the individual harmonics of a single composite sound. This is not species counterpoint in any conventional sense, but it is profoundly contrapuntal in its principle: multiple simultaneous voices, each with its own pitch identity and temporal trajectory, combined according to a rigorous structural logic derived from the physics of acoustics.

The study of counterpoint is never finished. Every major development in Western music from the Renaissance motet through the spectral orchestra has engaged with the fundamental questions that counterpoint asks: How do independent melodic voices coexist? How is tension created and released between simultaneous voices? How can a single melodic idea generate an entire polyphonic universe through repetition, imitation, and transformation? The answers differ radically across style, era, and aesthetic philosophy; the questions remain constant. To study counterpoint is not merely to learn a historical technique but to engage with the deepest questions about the nature of musical time, musical memory, and the relationship between logic and beauty in organized sound.

Remark 8.3 (Counterpoint and Auditory Cognition). Recent research in music cognition has illuminated the perceptual basis of many contrapuntal rules formulated empirically by Renaissance theorists. The prohibition on parallel fifths, the requirement that dissonances resolve by step, the preference for contrary motion — all find their explanations in the auditory stream-segregation processes studied by Albert Bregman: the cognitive mechanisms by which the auditory system separates simultaneous sounds into distinct perceptual streams. A parallel fifth collapses two auditory streams into one because the acoustic similarity of perfect-fifth-related pitches causes them to fuse; a resolving suspension is heard as motion within a single stream because stepwise resolution maintains the stream's continuity. The ancient rules of counterpoint, in this light, are not arbitrary conventions but a systematic mapping of auditory cognitive structure, accumulated over centuries of careful listening. The species system is a grammar of auditory stream formation — a set of compositional rules for creating music that the auditory system can parse into distinct, simultaneously perceivable melodic identities. This is why counterpoint, despite its antiquity, remains the essential foundation of serious musical training: it teaches composers not merely how to write notes but how sound is perceived, and how the structures of musical perception can be engaged, directed, and ultimately fulfilled.

8.7 Imitative Polyphony in Sacred and Secular Choral Music

One domain of counterpoint that the present course has touched on only briefly deserves more explicit attention: the choral polyphony of the seventeenth and eighteenth centuries, in which the full resources of the polyphonic tradition were deployed for large vocal forces in both sacred and secular contexts. Handel’s oratorio choruses — particularly those of Messiah (1741) — represent an extraordinary synthesis of Baroque counterpoint and dramatic choral writing. The famous “Hallelujah” chorus deploys a fugal subject (“King of Kings and Lord of Lords”) in combination with homophonic choral outbursts (“Hallelujah”), alternating between contrapuntal and chordal textures to create a formal arch of mounting intensity. The fugal passages follow the species rules precisely; the homophonic passages provide rhythmic and harmonic contrast; and the alternation between the two textures is the primary formal device of the movement.

Example 8.10 (Handel's "And He Shall Purify," Messiah, HWV 56). The chorus "And He Shall Purify" from Part I of Messiah is a fugue for four-voice choir (SATB) with continuo accompaniment. The subject — a flowing, scalar figure in D minor — is stated by the soprano alone, then answered by the alto, then restated by the tenor, then answered by the bass, completing the exposition in standard fugal fashion. The countersubject is a descending step figure in longer note values, invertibly counterpointed with the subject. The middle section develops the subject through the relative major (F major) and the dominant minor (A minor) before returning to D minor for the final entry in the bass with the soprano's countersubject above. The formal structure is textbook-correct species counterpoint extended to a four-voice choral texture; the idiom is Baroque; the medium is the full choral apparatus of the oratorio. This chorus demonstrates that species counterpoint is not merely a keyboard discipline but a universal compositional logic applicable to any vocal or instrumental medium.

8.8 Counterpoint as Analytical Tool

Beyond its role as a compositional discipline, counterpoint provides an indispensable analytical lens for understanding music of any style. The analyst who can identify the contrapuntal skeleton of a complex orchestral passage — tracing the fundamental two-voice framework beneath layers of orchestration and harmonic elaboration — understands the music at a structural depth that harmonic analysis alone cannot reach. Heinrich Schenker’s analytical method, the most influential analytical theory of the twentieth century, is fundamentally a theory of counterpoint: it reads tonal music as an elaborated two-voice framework (the Ursatz) in which the soprano descends from an initial scale degree (\(\hat{3}\), \(\hat{5}\), or \(\hat{8}\)) to the tonic while the bass moves from tonic to dominant and back.

Definition 8.3 (Schenkerian Ursatz). In Heinrich Schenker's theory of tonal music, the Ursatz (fundamental structure) is the two-voice contrapuntal skeleton underlying an entire tonal composition. It consists of:
  • The Urlinie (fundamental melodic line): a stepwise descent in the soprano from a structural upper voice note — the Kopfton (head tone), which is \(\hat{3}\), \(\hat{5}\), or \(\hat{8}\) — to \(\hat{1}\). This descent is the soprano's contribution to the Ursatz.
  • The Bassbrechung (bass arpeggiation): a bass motion from \(\hat{1}\) (tonic) to \(\hat{5}\) (dominant) and back to \(\hat{1}\) (tonic), outlining the fundamental harmonic progression of the composition.
The Ursatz is the deepest structural level of a tonal composition; all other features of the music — themes, harmonies, rhythms, textures — are elaborations of this primordial two-voice counterpoint.

Schenker’s theory is controversial in its details but profound in its central claim: that tonal music is, at its deepest level, two-voice counterpoint. Every tonal composition from Bach to Brahms, in this view, is an elaboration of the two-voice framework that the species system teaches. To study counterpoint is to study the grammar of which tonal composition is the literature — the rules of the language in which the masterworks of Western music are written.

Example 8.7 (Schenkerian Analysis of a Bach Chorale). Consider the Bach chorale harmonization of "O Haupt voll Blut und Wunden" (BWV 244/54). The soprano presents the chorale melody, descending from E5 through D5, C5, B4 to A4 over the course of the first phrase. The bass moves from A (tonic) to E (dominant) and back to A (tonic). In Schenkerian terms, the Urlinie descent \(\hat{5}–\hat{4}–\hat{3}–\hat{2}–\hat{1}\) (E–D–C–B–A in A major / A minor) is present in the soprano, and the Bassbrechung A–E–A is present in the bass. Everything else — the inner-voice harmonies, the passing chords, the chromatic inflections — is an elaboration of this fundamental two-voice framework. The Schenker analysis reveals that the harmonic richness of Bach's setting is an ornamental layer laid over a structure of extreme simplicity, and that this simplicity is what gives the harmonization its sense of architectural solidity.

8.9 Counterpoint in the Twentieth-Century Orchestra

The orchestral writing of the twentieth century — in composers as different as Bartók, Shostakovich, Prokofiev, and Ligeti — frequently deploys contrapuntal textures that trace their lineage directly to the Renaissance and Baroque tradition even while using an entirely different harmonic language. Understanding these textures requires the same analytical tools as understanding a Bach fugue: the ability to isolate individual voices, trace their melodic lines, identify their harmonic relationships, and understand how tension and release are managed across time.

Example 8.8 (Shostakovich's Contrapuntal Writing). Dmitri Shostakovich's symphonies and string quartets contain passages of stringent two- and three-voice counterpoint that, while harmonically far from Baroque practice, obey the species rules of voice independence and dissonance treatment with scrupulous care. The finale of the Tenth Symphony (op. 93) contains a fugato — a fugue-like passage that is not a complete fugue but employs fugal exposition technique — in which the principal theme is presented by each orchestral section in turn, creating a massive textural crescendo through cumulative imitative entry. The technique is identical in principle to the exposition of a Bach fugue; only the harmonic language and orchestral forces differ. Shostakovich, who studied Bach intensively throughout his career, understood that the fugal technique of gradually accumulating voices is one of the most powerful tools for building large-scale tension in any musical style.
Example 8.9 (Ligeti's Micropolyphony). György Ligeti's orchestral works of the 1960s — Atmosphères (1961), Lontano (1967) — employ a technique Ligeti called micropolyphony: extremely dense canonic textures in which dozens of orchestral voices each pursue an independent melodic line, entering at fractionally different times and moving at slightly different speeds, so that the individual voices are inaudible but their mass creates a smooth, slowly shifting sonic cloud. Micropolyphony is, in a sense, the limiting case of invertible counterpoint: taken to its extreme, where so many independently moving voices are present that no individual voice can be tracked, the contrapuntal texture dissolves into a new kind of sound — a texture without surface, a field of sonic activity rather than a collection of identifiable melodies. Yet the principle is the same as in any counterpoint: independent voices, moving according to internal melodic logic, create a sonic result richer than any single voice could achieve.

The relationship between Ligeti’s micropolyphony and species counterpoint is not merely historical. Ligeti himself acknowledged his debt to Renaissance polyphony, and the analytical tools of species counterpoint — interval classification, dissonance treatment, voice independence — apply, in modified form, even to micropolyphonic textures. In a dense canonic mass of twenty-four voices, the “interval” between voices must be understood statistically (what is the distribution of intervals between any randomly chosen pair of voices at any moment?) rather than individually, but the principle that the distribution of intervals affects the perceived acoustic roughness or smoothness of the texture is directly traceable to the interval-consonance hierarchy of Definition 1.1.

The student who has worked through the species exercises, composed two-voice inventions, analyzed fugues from the Well-Tempered Clavier, and studied the counterpoint of Bach’s successors possesses something more valuable than a set of historical techniques. They possess a training in musical perception — a cultivated sensitivity to the independence of simultaneous voices, to the direction of harmonic tension, to the logic of melodic gesture — that is applicable to any musical style they will encounter or compose. This sensitivity, once developed, does not fade; it persists as a permanent change in the way one listens to music, hears harmonic motion, and thinks about the relationship between melody and harmony, between individual voice and collective texture, between the note-by-note and the phrase-by-phrase and the movement-by-movement. It is, in the fullest sense, a musical education — not an education in a style, or a period, or a technique, but an education in musical thought itself. The rules of strict counterpoint are eventually internalized and transcended; the sensitivity they develop is permanent. As Fux wrote at the conclusion of the Gradus ad Parnassum, addressing his fictional student Josephus: you have now arrived at the summit; but remember that the rules are not ends in themselves, they are the discipline through which musical imagination acquires the precision and clarity it needs to speak with power. Whether one writes a Renaissance motet, a Baroque fugue, a Romantic symphony, or a minimalist phase piece, this discipline remains the foundation. The punctus contra punctum — note against note, voice against voice, musical idea against musical idea — is the irreducible core of the compositional art, and its study is the study of musical thought itself.

8.9 Writing the Fugue Exposition: A Step-by-Step Method

Composing a fugue exposition — the first major compositional task in the study of fugue — is best approached through a systematic procedure that addresses the interrelated decisions of subject design, answer type, and countersubject construction in a logical order. The following method reflects standard practice at major conservatories and music schools.

First, design the subject with stretto potential and motivic fertility in mind. Write the subject without any harmonic accompaniment and sing it aloud repeatedly until its melodic character is vivid and distinctive. Ask: does this subject have a rhythmic profile that will be recognizable in inversion and augmentation? Does it begin and end in a way that makes the tonal answer straightforward? Does it contain an internal motivic cell — a short rhythmic or melodic figure of two or three notes — that can be sequenced in an episode?

Second, determine whether the answer should be real or tonal. If the subject begins on \(\hat{1}\) and ascends to \(\hat{5}\), or if \(\hat{5}\) appears prominently in the first measure, the answer should be tonal: compress the opening ascending fifth to an ascending fourth, then continue with the literal transposition. If the subject begins on \(\hat{3}\) or \(\hat{5}\) and does not prominently feature the \(\hat{1}–\hat{5}\) ascent, a real answer (literal transposition to the dominant) is likely appropriate.

Theorem 7.2 (Criteria for Tonal Answer). A tonal answer (as opposed to a real answer) is required when: (1) the subject begins on \(\hat{1}\) and the first melodic interval ascends to \(\hat{5}\); or (2) the subject begins on \(\hat{5}\); or (3) the subject modulates to the dominant key within its own body (a relatively rare situation in Baroque practice, but common in some later fugal writing). In all other cases, a real answer is the default. The specific melodic adjustment in the tonal answer replaces the ascending \(\hat{1}–\hat{5}\) fifth in the subject with an ascending \(\hat{5}–\hat{1}\) fourth in the answer, and adjusts any subsequent notes that would imply the dominant-of-the-dominant.

Third, compose the countersubject: write it simultaneously with the answer, treating the combination of answer and countersubject as a first-species exercise (ensure all strong-beat intervals are consonant) with added passing tones and suspensions as the musical character demands. The countersubject should be rhythmically complementary to the subject — if the subject moves primarily in eighth notes, the countersubject should have some longer notes; if the subject opens with a long note, the countersubject may begin with faster motion.

Fourth, verify the invertibility of subject and countersubject by transposing the countersubject up an octave and the subject down an octave, mentally, and checking that the combination still forms correct counterpoint. Any interval of a fifth between them in the original will become a fourth in the inversion; check that this fourth is not on a strong beat in an unresolved position.

Finally, write the full exposition: subject alone (measure 1–subject length), answer with countersubject (overlapping the subject’s final bars), subject with countersubject in the inversion (if three voices; second countersubject in first voice if four voices), and final answer with all material combined. Verify all parallel fifths and octaves between every pair of voices at every beat.

8.10 Conclusion: Counterpoint as the Grammar of Musical Thought

The eight chapters of this course have traced the development of contrapuntal technique from its most elementary form — the interval classification of Definition 1.1 — through the species system, two-voice tonal counterpoint, four-part chorale writing, invertible counterpoint and canon, and the fugue, into the twentieth and twenty-first century’s transformations and extensions of the polyphonic tradition. At each stage, the same fundamental principles have recurred: voice independence, dissonance as controlled tension requiring resolution, consonance as stability and arrival, the long-range management of harmonic tension from opening to cadence.

The remarkable persistence of these principles across five centuries of Western music is not a historical accident. It reflects the fact that the principles of counterpoint are grounded in the psychology of auditory perception — in the way the human auditory system organizes simultaneous sounds into streams, detects patterns of tension and release, and experiences musical time as a structured unfolding rather than a formless succession of sounds. As long as humans hear music with the same ears and the same auditory cortex that Renaissance singers and Baroque keyboard players used, the principles of counterpoint will remain relevant — not as historical curiosities, but as the grammar of musical thought itself.

8.12 Practical Counterpoint Exercises for the Advanced Student

The following exercises, drawn from the pedagogical traditions of the Yale School of Music and the Eastman School of Music, represent a progressive curriculum for the student who has completed the species exercises and is ready to engage with free tonal counterpoint and fugue composition.

Remark 8.4 (Recommended Sequence of Exercises). The following sequence is recommended for students proceeding from species counterpoint into free composition:
  1. Analysis of Bach Two-Part Inventions: Analyze all fifteen inventions, identifying the subject, its imitative entries, the episodes and their sequence technique, the modulation plan, and the cadences. Write a one-page formal diagram for each invention showing the formal sections, the key areas, and the structural events.
  2. Composition of a Two-Part Invention: Compose a complete two-part invention in the style of Bach, approximately 30–50 measures, in a major or minor key, using imitation, sequence, invertible counterpoint, and a clear three-part formal structure (tonic, development, return).
  3. Analysis of Bach Chorale Harmonizations: Analyze twenty chorale harmonizations from the Riemenschneider collection, identifying all non-harmonic tones, all cadential formulas, and all examples of chromaticism. Pay particular attention to the outer-voice counterpoint and verify that it passes the two-voice consonance test in each case.
  4. Composition of a Four-Part Chorale: Harmonize a given chorale melody in the style of Bach, using four-part SATB texture, following all the rules of voice-leading and doubling, and employing the full range of non-harmonic tones available in the species system.
  5. Analysis of WTC Fugues: Analyze five fugues from the Well-Tempered Clavier (at least two from each book), identifying subject, answer (real or tonal, with explanation), countersubject(s), exposition, middle entries, episodes, stretto (if present), and final entry/coda. Explain the harmonic plan of each fugue and trace the motivic development of the episode material.
  6. Composition of a Two-Voice Fugue: Compose a complete two-voice fugue exposition (subject, answer, and countersubject) and one episode, approximately 20–30 measures total. The subject should be designed for stretto at at least one time interval.

These exercises are not merely technical drills; they are the mechanism by which abstract knowledge becomes compositional intuition. The distinction between knowing the rules of counterpoint and being able to compose in counterpoint is the distinction between reading a grammar book and speaking a language fluently. The exercises — particularly the composition exercises — are the practice that converts knowledge into fluency, theory into instinct, rules into music.

A student who has completed all six exercises in the sequence described above will have composed a two-part invention, a four-voice chorale, and a fugue exposition. These three compositions, however modest in scale, represent genuine exercises of the compositional art. They are not transcriptions or arrangements; they are original compositions in historical styles, and the process of composing them — the struggle to satisfy multiple simultaneous constraints, the discovery of solutions that are both correct and musical, the experience of hearing one’s own counterpoint and judging it as music rather than as a rule-following exercise — is irreplaceable as a mode of musical education. A student who has analyzed twenty Bach chorales and composed one in the style of Bach understands those chorales at a depth that no amount of passive listening or theoretical reading can achieve. A student who has composed a fugue subject — who has struggled with the constraints that stretto potential places on melodic freedom, who has discovered that not every melody one might write will yield an interesting fugue — understands the Well-Tempered Clavier as Bach’s achievement in a way that analysis alone cannot convey. Counterpoint, finally, is learned by doing. The notes on the page are a starting point; the music in the hand is the destination.

Example 8.10 (A Sample Fugue Subject and Its Properties). Consider the subject: D–E–F–G–A–G–F#–G in D minor (approximately two measures in 4/4 at a moderate tempo). This subject has the following properties: (1) it begins on the tonic D and ends on G — the dominant scale degree — implying a half-cadence and making the tonal answer straightforward (beginning on A, the dominant, and ending on C or D); (2) it contains a chromatic inflection — the raised sixth degree F# appearing at the end — which gives it a distinctive harmonic colour and implies the shift toward the harmonic minor at the cadential approach; (3) its rhythmic profile — primarily eighth notes with a passing dotted rhythm — is distinctive and recognizable in augmentation; (4) its ascending scalar motion in the first half and descending motion in the second half creates a clear arch shape that will be recognizable even when the subject is inverted; (5) the F–G–A–G figure in the middle is extractable as an episode motive. As a stretto candidate: the subject can be combined with a version entering two beats later at the fifth above (beginning on A) without parallel fifths or octaves, provided the countersubject respects the interval constraints at the point of overlap. This subject therefore has good stretto potential at the two-beat interval and would serve as the basis for a satisfying three-voice fugue.

The exploration of counterpoint, from the most elementary species exercise to the most complex fugue analysis, is ultimately an exploration of the musical mind — of how it organizes simultaneous sounds, creates expectation, builds and releases tension, and achieves the sense of inevitability in what is in fact a complex, multiply constrained compositional object. Every rule of counterpoint encodes a fact about musical perception; every exercise develops a capacity for musical thought that no other discipline can provide in quite the same way. The punctus contra punctum is not a historical artifact; it is the living grammar of musical intelligence.

The influence of the contrapuntal tradition is not confined to the classical canon. Jazz improvisation, at its highest levels, involves real-time species counterpoint: the improviser must simultaneously manage the horizontal melodic line, the vertical harmonic implication against the chord changes, the rhythmic interaction with the rhythm section, and the motivic development of the improvisation’s own material. Charlie Parker’s bebop lines — fast, harmonically dense, rhythmically intricate — are the product of an internalized contrapuntal training that is informal but no less rigorous than Fux’s species exercises. Parker’s lines move through chord changes the way a fifth-species counterpoint moves through a cantus firmus: always consonant (or appropriately dissonant) against the underlying harmony, always melodically coherent in themselves, always pointing purposefully toward the next structural downbeat.

Remark 8.4 (The Fugue in Rock and Popular Music). Instances of genuine fugal technique in popular music are rare but not absent. The Beatles' "Because" (from Abbey Road, 1969) is a three-voice canon whose three parts — recorded by Lennon, McCartney, and Harrison singing the same melody at staggered time intervals — create a canonic texture of considerable delicacy. Bach's influence on the Beatles is explicit: John Lennon stated that "Because" was inspired by hearing Yoko Ono play the Moonlight Sonata, and the connection to the Baroque harmonic tradition is audible in the song's unusual harmonic progression. More directly, the progressive rock genre of the 1970s — Emerson, Lake and Palmer, Yes, Genesis in their earlier albums — frequently employed fugal passages and imitative counterpoint as markers of compositional seriousness, bridging the gap between classical and popular practice in ways that, whatever their aesthetic merits, demonstrate the fugue's remarkable cultural resilience.

The influence of the contrapuntal tradition on popular and jazz music also extends to the technique of countermelody — a second melody added to a primary melody, each independent enough to be satisfying alone, yet combined to create a richer whole. The countermelody is essentially first-species counterpoint applied to popular song form: the primary melody serves as the cantus firmus, and the countermelody is the added voice. Great arrangers — Gil Evans’s arrangements for Miles Davis, Oliver Nelson’s orchestrations for jazz ensembles, Brian Wilson’s Beach Boys arrangements — consistently employ countermelodies that respect the interval-independence principles of species counterpoint: moving in contrary motion to the primary melody, avoiding parallel octaves and fifths, and maintaining their own melodic coherence when heard alone. The species principles persist, unnamed and untheorized, in every domain of sophisticated musical practice.

The broader lesson is that contrapuntal thinking — the coordination of independent voices, the management of tension and release, the use of imitation to create coherence from melodic fragments — is not a style but a cognitive mode. It appears wherever music of genuine complexity is created, regardless of cultural context, and its principles, once internalized, are applicable to any musical situation. The student who has completed the exercises of this course is not equipped merely to compose in the style of Palestrina or Bach; they are equipped to think musically in the deepest sense, to hear any music — whether it is a Renaissance motet, a Bach fugue, a Brahms symphony, a Charlie Parker improvisation, or a Steve Reich phase piece — as a play of independent voices in temporal space, each with its own identity, each contributing to a whole that none could achieve alone. This is what counterpoint teaches, and it is, as Fux’s student Josephus learned, both an art and a discipline: the highest discipline, in the service of the most human of all arts.

Back to top