MUSIC 272: Counterpoint and Fugue
Estimated study time: 3 hr 16 min
Table of contents
These notes draw on Johann Joseph Fux’s Gradus ad Parnassum (trans. Mann, 1943), Kent Kennan’s Counterpoint (4th ed., 1999), Robert Gauldin’s A Practical Approach to Eighteenth-Century Counterpoint (2nd ed., 2013), Johann Sebastian Bach’s Well-Tempered Clavier and Two-Part Inventions (as primary analytical examples), and supplementary material from Yale School of Music counterpoint course materials and Eastman School of Music undergraduate theory resources.
Chapter 1: Foundations of Counterpoint
1.1 Intervals and Their Classification
Counterpoint is the art of combining independent melodic lines into a coherent and expressively unified whole. The word derives from the Latin punctus contra punctum — note against note — and in its broadest sense encompasses every discipline of polyphonic writing from the Renaissance motet to the Baroque fugue, from the Classical string quartet to the modern phasing compositions of Steve Reich. Before one can compose a single measure of two-voice counterpoint, one must understand with precision the raw material from which all polyphony is constructed: the interval.
An interval is the distance in pitch between two tones, measured by the number of diatonic steps separating them. We count inclusively: from C to E is a third (C, D, E — three steps). Intervals are named by their size (unison, second, third, fourth, fifth, sixth, seventh, octave) and their quality (perfect, major, minor, augmented, diminished). The distinction between perfect and imperfect intervals is not merely taxonomic; it reflects a hierarchy of acoustic stability that directly governs which intervals may appear, and where, in a contrapuntal texture. The theorist who understands why this hierarchy exists — rooted not in convention but in the physics of vibrating strings and the mathematics of frequency ratios — will find that the rules of counterpoint feel not arbitrary but inevitable.
This classification is not arbitrary. The ancient Greeks, particularly Pythagoras, observed that the most acoustically stable simultaneous tones are those related by the simplest ratios. Two strings vibrating in the ratio \(2:1\) produce tones an octave apart, so thoroughly consonant that medieval theorists occasionally treated them as the same pitch class. The ratio \(3:2\) yields the perfect fifth, the interval that forms the basis of the circle of fifths, the overtone series, and ultimately the entire tonal system. The ratio \(4:3\) gives the perfect fourth — acoustically a near-twin to the fifth, yet treated with considerably more ambivalence in Renaissance and Baroque practice, as we shall see.
The imperfect consonances — thirds and sixths — introduce the prime 5 into the ratio. A major third at \(5:4\) carries a brightness absent from the hollow resonance of a perfect fifth; a minor third at \(6:5\) has a mellower, more shadowed quality. These intervals, which medieval theory largely dismissed as dissonances (too complex, too far from the Pythagorean ideal of pure powers of 2 and 3), became the structural pillars of Renaissance polyphony. Composers like Josquin des Prez and Palestrina built entire movements on the careful interlocking of thirds and sixths, understanding — perhaps intuitively, perhaps from long practice — that these intervals create a richer, warmer sonority than the naked perfect consonances of the earlier Gothic style. Their richness arises precisely from the slight acoustic complexity introduced by the prime 5.
The tritone deserves special attention. Its frequency ratio is irrational in equal temperament — the ratio is \(2^{6/12} : 1 = \sqrt{2}:1\) — and even in just intonation it admits no clean small-integer form, the closest approximations (\(45:32\) or \(64:45\)) involving large numbers. Medieval theorists called it diabolus in musica — the devil in music — and its use was severely restricted in early polyphony. Yet it is precisely this restless instability that makes the tritone indispensable to tonal music: it creates the harmonic tension that the dominant seventh chord must resolve, and it defines the half-step voice-leading that gives authentic cadences their sense of inevitability. Without the tritone, there is no functional harmony in the modern sense.
1.2 The Overtone Series and Harmonic Justification
The interval classification of Definition 1.1 has a deeper physical justification than the mere simplicity of frequency ratios. Every pitched sound produced by a musical instrument is in fact a complex mixture of frequencies: the fundamental frequency \(f_0\) and its integer multiples \(2f_0, 3f_0, 4f_0, \ldots\), the so-called overtone or harmonic series. A string vibrating at 110 Hz (A2) simultaneously produces partial tones at 220 Hz (A3), 330 Hz (E4), 440 Hz (A4), 550 Hz (approximately C#5), 660 Hz (E5), and so on. The first six partials of any fundamental collectively span a major triad — an observation that is not a coincidence but a derivation: the major triad is the acoustically natural chord, the chord that nature produces whenever a single pitched tone is sounded.
- Partials 1 and 2: ratio \(2:1\), the octave (perfect consonance).
- Partials 2 and 3: ratio \(3:2\), the perfect fifth.
- Partials 3 and 4: ratio \(4:3\), the perfect fourth.
- Partials 4 and 5: ratio \(5:4\), the major third.
- Partials 5 and 6: ratio \(6:5\), the minor third.
- Partials 6 and 7: ratio \(7:6\), approximately a minor third but slightly flat (the "harmonic seventh").
This theorem is the physical foundation of the contrapuntal consonance hierarchy. The perfect consonances (unison, fifth, octave) share the most harmonics; the imperfect consonances (thirds, sixths) share somewhat fewer but still enough for clear fusion; dissonances share very few, producing the acoustic roughness that the mind associates with incompleteness and motion. Species counterpoint is, in this light, a system for managing the density of shared harmonics between simultaneously sounding voices — for ensuring that the texture maintains sufficient harmonic resonance on strong beats while permitting localized acoustic tension on weak beats.
1.3 Consonance and Dissonance in Polyphony
Understanding intervals in the abstract is only the first step. In a polyphonic texture, intervals acquire meaning through their context: where they appear in the measure, how they are approached, and how they resolve. Renaissance counterpoint, as codified by Fux in the Gradus ad Parnassum (1725), operates according to a set of rules that might initially appear arbitrary but in fact encode centuries of accumulated aesthetic wisdom about the perception of simultaneous sound. The student who approaches these rules as a lawyer approaches statutes — seeking loopholes, looking for exceptions, asking what is technically permitted — will produce exercises that are formally correct and aesthetically lifeless. The student who approaches them as a musician — asking what each rule is trying to achieve, what perceptual or aesthetic goal it serves — will produce exercises that are both correct and genuinely musical. The goal of this chapter is to provide the understanding that makes the second approach possible: not just the rules, but the reasons.
The historical development of the consonance hierarchy is itself a story of aesthetic evolution. The earliest medieval polyphony — the organum of the Notre Dame school (twelfth and thirteenth centuries) — admitted only unisons, fifths, and octaves as consonances, treating thirds and sixths as active, unstable intervals requiring resolution. The late medieval period (Machaut, Dufay) began to treat thirds as consonances in certain contexts, and the early Renaissance (Josquin, Obrecht) completed the transition: by the early sixteenth century, thirds and sixths had become the primary harmonic intervals of polyphony, displacing the bare fifths and octaves of the medieval style. This evolution was not arbitrary; it reflected a genuine change in the acoustic aesthetic of Western music — a shift from the hollow, resonant power of perfect consonances to the warmer, fuller sound of the imperfect consonances that the prime 5 introduced. The species rules, by accepting both perfect and imperfect consonances as valid strong-beat intervals, reflect the fully developed Renaissance aesthetic in which both types of consonance are structurally available.
The reasoning behind this principle is psychoacoustic as much as aesthetic. When two tones sound simultaneously, the auditory system attempts to fuse them into a single percept. Consonant intervals — those with simple frequency ratios — fuse readily because their overtone series share many components, producing a sensation of repose and stability. Dissonant intervals resist fusion because their overtone series produce many near-coincident partials that beat against each other, creating the acoustic sensation of roughness or tension. On a strong beat — the moment of metric emphasis — the ear expects stability. A dissonance on the downbeat strikes the listener as a collision, something demanding immediate correction. On a weak beat, the same dissonance registers as motion, a passing moment of tension swept away before the next strong beat arrives. The rules of species counterpoint are, in this sense, a grammar of perceptual expectation, mapping the physics of acoustics onto the psychology of metric time.
The student must cultivate the habit of hearing intervals not as isolated sonic events but as points on a trajectory of tension and release. A major sixth leading to a perfect octave through contrary motion is not simply two consecutive intervals; it is a gesture of closure, a cadential arrival. A minor seventh dissolving to a major third through stepwise contrary motion is a suspension resolving — a moment of friction yielding to consonance, of harmonic pressure released. Developing this sense of intervallic trajectory, this ability to hear each interval in the context of where it came from and where it is going, is the most important cognitive skill in the study of counterpoint. No amount of rule-memorization substitutes for it.
1.4 Interval Inversion and Its Properties
A fundamental operation on intervals that is indispensable in counterpoint, and especially in the study of invertible counterpoint and fugue, is interval inversion: the transposition of the lower note of an interval up by an octave (or equivalently, the transposition of the upper note down by an octave), converting the interval \(i\) to the interval \(9 - i\) (using interval numbers from 1 to 8).
- Unison (1) inverts to octave (8): both are perfect consonances.
- Second (2) inverts to seventh (7): both are dissonances.
- Third (3) inverts to sixth (6): both are imperfect consonances.
- Fourth (4) inverts to fifth (5): both are perfect consonances (with the caveat about the fourth in two-voice writing).
The quality of an interval also changes under inversion in a predictable pattern: major inverts to minor, minor to major, augmented to diminished, diminished to augmented, and perfect remains perfect. Thus the major third (M3) inverts to the minor sixth (m6), the minor third (m3) inverts to the major sixth (M6), the major sixth (M6) inverts to the minor third (m3), and so on. These quality inversions have practical consequences for the harmonic colour of inverted passages: a passage that featured warm major sixths in its original form will feature brighter minor thirds in its inversion, changing the expressive character even while the structural contrapuntal logic remains valid.
1.5 The Four Types of Motion
When two voices move simultaneously, their relative motion falls into one of four categories, each with distinct contrapuntal properties and associated rules. The classification of motion types is not merely descriptive; it is a framework for analyzing and managing the independence of voices, which is the fundamental goal of all polyphonic writing.
- Contrary motion: \(\delta_1\) and \(\delta_2\) have opposite signs — one voice ascends while the other descends.
- Similar motion: \(\delta_1\) and \(\delta_2\) have the same sign but \(|\delta_1| \neq |\delta_2|\) — voices move in the same direction by different intervals.
- Oblique motion: exactly one of \(\delta_1, \delta_2\) is zero — one voice holds a sustained pitch while the other moves.
- Parallel motion: \(\delta_1 = \delta_2 \neq 0\) — voices move in the same direction by the same interval, preserving the intervallic distance between them exactly.
Contrary motion is the gold standard of contrapuntal writing. When the voices move in opposite directions, they assert their independence most forcefully: neither voice seems to pull or follow the other, and the result is a texture of genuine polyphonic equality. The two voices trace arcs that are, in a sense, mirror images, and the resulting sonic impression is one of balance, dialogue, and genuine two-part composition. It is no accident that cadences — the moments of greatest formal weight in a contrapuntal piece — typically employ contrary motion in their approach: the soprano rising by step from the leading tone to the tonic octave while the bass descends by a fourth or fifth from the dominant to the tonic.
Similar motion is less independent than contrary but entirely permissible, provided it does not produce forbidden parallel intervals. The two voices travel in the same direction, implying a certain gravitational pull between them, but the difference in interval size prevents their complete assimilation. Oblique motion, in which one voice sustains while the other moves, provides textural contrast and is particularly useful in the creation of suspensions in fourth-species counterpoint — the stationary voice creates the suspended dissonance while the moving voice provides the harmonic change. Parallel motion — the most restricted of the four types — preserves the interval between the voices exactly, and when that interval is a perfect consonance, the result is the notorious parallel fifth or parallel octave, the most rigorously forbidden event in strict counterpoint.
In practice, the experienced contrapuntist does not consciously categorize every interval of motion as contrary, similar, oblique, or parallel; the categories are internalized through exercise to the point where the ear immediately flags parallel motion into a perfect consonance as a perceptual wrong note, requiring no conscious analysis. The goal of the species exercises is precisely this internalization — the formation of a musical instinct that operates below the level of deliberate thought, so that the conscious mind is freed to attend to the higher-level questions of melody, harmony, and expression. The motion-type classification is the analytical vocabulary; the developed ear is the goal.
1.6 Forbidden Parallels and Voice Independence
The prohibition against consecutive perfect fifths and consecutive perfect octaves (often called “parallel fifths” and “parallel octaves”) is the single most famous rule in counterpoint. Students encounter it early, violate it by instinct, and spend years developing the ear to detect and avoid it. Understanding why this prohibition exists — not merely that it does — is essential to genuine contrapuntal thinking, and it is one of the great satisfactions of a rigorous counterpoint education.
It is instructive to consider what parallel fifths actually sound like, and why the ear objects to them. Play, at the piano or at any instrument, the progression C–G in the bass against E–B in the soprano (moving from an interval of a sixth to an interval of a fifth as both voices ascend by a step). This is a hidden fifth, not a parallel fifth: the preceding interval was a sixth. Now play C–G in the bass against G–D in the soprano (moving from an interval of a fifth to another fifth as both voices ascend by a step). This is the parallel fifth. The difference is subtle but audible: the parallel fifth creates a momentary sense that the two voices have merged into a single “super-voice” harmonized at the fifth, while the hidden fifth merely approaches the fifth from a different angle, preserving the voices’ independence. The prohibition captures this perceptual reality: parallel fifths are not merely wrong by convention, they are wrong by the logic of polyphonic perception. The rule is the formalization of an auditory fact.
The acoustic justification is subtle but profound. A perfect fifth, with its ratio of \(3:2\), produces such strong resonance that the two voices fuse nearly into one acoustic entity. The fundamental of the lower voice and the first overtone of the upper voice are very close in frequency; the second overtone of the lower voice is identical to the fundamental of the upper voice. This overtone sharing creates the characteristic hollow yet powerful sound of the perfect fifth, a sound that is perceptually singular — we do not easily hear it as two independent tones. When two voices move in parallel fifths — say, from C–G to D–A — the listener hears not two independent melodic lines but a single melodic entity harmonized at the fifth. The contrapuntal goal of independent polyphonic voices is thus defeated: the texture collapses from two distinct melodic personalities into a single melody with a fixed harmonic shadow. The voices have ceased to be voices in the genuine sense and have become a single compound instrument.
The same principle applies even more forcefully to parallel octaves. The octave ratio of \(2:1\) causes the upper voice to coincide with the second partial of the lower voice; the two tones fuse so completely in perception that listeners frequently do not hear them as two separate voices at all. Consecutive octaves produce the effect of a single voice doubling at the octave — not counterpoint, but unison writing with added thickness. The prohibition on parallel octaves is therefore not so much a stylistic rule as a definition: music that proceeds in consecutive octaves is not polyphony and has no claim to being treated as counterpoint.
1.7 Voice Independence as Aesthetic Ideal
Before turning to voice ranges and melodic writing, it is worth pausing to articulate the aesthetic ideal that the prohibition on parallel fifths and octaves is designed to protect: the ideal of voice independence. In a genuine polyphonic texture — as opposed to a single melody harmonized homophonically — every voice has its own melodic identity, its own rhythmic character, its own harmonic trajectory, and its own arc of tension and release. The voices coexist in the same acoustic space without merging into each other: they are, in the phenomenological sense, distinct musical subjects, each with its own “voice” in the speech-act sense as well as the musical sense.
This ideal of voice independence is not merely a technical requirement but an ethical one, in the sense that it reflects a musical value: the belief that multiple perspectives are more interesting than one, that dialogue is richer than monologue, that the simultaneous pursuit of independent melodic goals can produce a whole that is greater than the sum of its parts. The polyphonic ideal — multiple equal voices, each maintaining its identity while cooperating in a common harmonic goal — is a musical expression of an aesthetic and philosophical commitment that the Western musical tradition has upheld, with varying degrees of consistency, from the twelfth century to the present. The study of counterpoint is, at its deepest level, the study of how this ideal is realized in musical practice.
1.8 Voice Ranges and Melodic Writing
Counterpoint is not an abstract mathematical exercise; it is vocal music. Even when written for instruments, the contrapuntal tradition conceives of each voice as a human singer with a natural range, a characteristic tessitura, and physical limits on the intervals it can execute comfortably and expressively. The traditional four voices — soprano, alto, tenor, bass (SATB) — each occupy a range of roughly a tenth to a twelfth, and the rules of melodic writing in counterpoint are calibrated to what a trained singer can render naturally.
- Soprano: C4 (middle C) to G5, with tessitura D4–E5.
- Alto: G3 to C5, with tessitura A3–D4.
- Tenor: C3 to G4, with tessitura D3–E4.
- Bass: E2 to C4, with tessitura F2–D3.
Good melodic writing in counterpoint is governed by principles of singability and shape. A well-formed contrapuntal melody rises and falls with a clear arc; it reaches a single climactic high point (or occasionally a low point) and then descends (or ascends) to its conclusion. It balances stepwise motion with occasional leaps, and large leaps — sixths, sevenths, octaves — are compensated by stepwise motion in the opposite direction immediately after the leap. This principle of leap compensation reflects the physical reality of singing: after a large ascending leap, the voice naturally wants to fall back; the compensation gives the melodic line the sense of recovering its balance. A melody that leaps repeatedly in the same direction without compensation feels driven to an extreme, unstable.
- Augmented intervals of any size (the augmented second A2, augmented fourth A4) — difficult to sing in tune and stylistically alien to Renaissance polyphony.
- Diminished intervals, particularly the diminished fourth and diminished seventh — similarly difficult and anguished in effect.
- Any leap larger than an octave — beyond the comfortable range of the voice.
- Two consecutive leaps in the same direction that together span an augmented or diminished interval, or that span a seventh or more — these create the effect of a hidden augmented or diminished interval and disrupt melodic coherence.
A melody in strict counterpoint should be comprehensible and singable on its own, independent of the other voices: it should feel like a melody, not merely a series of intervals chosen to avoid forbidden parallels. This demand for melodic integrity in each voice — the requirement that every voice be worth singing alone — is what distinguishes genuine counterpoint from chord-writing. In a Bach two-part invention, if one isolates either voice and plays it alone, one hears a complete, shapely, and interesting melody. This is not a coincidence or a by-product; it is a fundamental design criterion of contrapuntal composition.
1.9 Palestrina and the Codification of Renaissance Style
Giovanni Pierluigi da Palestrina (c. 1525–1594) stands at the apex of the Renaissance polyphonic tradition, and his style — smooth, carefully controlled, rigorously avoiding harsh dissonances, animated by flowing voice-leading — became the canonical model that Fux codified in the Gradus ad Parnassum and that the species system was designed to teach. Understanding Palestrina’s style is inseparable from understanding why the species rules take the form they do: every rule encodes an aesthetic judgment that Palestrina made, distilled from the accumulated practice of the Roman polyphonic school.
The contrast between Palestrina’s smooth style and the slightly more adventurous writing of his contemporary Orlando di Lasso (Roland de Lassus, c. 1532–1594) illustrates that the species rules codify one approach to Renaissance polyphony rather than the only possible one. Lasso’s music uses dissonance more freely and admits a wider range of melodic intervals, including occasional augmented seconds in expressive contexts. The species system, by targeting the Palestrina style specifically, teaches the most constrained and therefore most revealing form of Renaissance counterpoint, from which the student can relax the constraints in controlled ways as they progress toward free composition. Starting with the strictest style and relaxing its rules is always more productive, pedagogically, than starting from freedom and trying to impose constraints retroactively.
1.11 Modes and the Historical Context of Renaissance Polyphony
Before the full establishment of major-minor tonality in the seventeenth century, the harmonic framework of polyphonic music was modal rather than tonal. The eight church modes — Dorian, Phrygian, Lydian, Mixolydian, and their plagal variants — each defined a characteristic scale pattern, a preferred final (the modal equivalent of the tonic), and a preferred dominant (the reciting tone, or tenor). Renaissance counterpoint is fundamentally modal counterpoint, and the study of species counterpoint as Fux formulated it is, strictly speaking, a study of modal polyphony in the style of Palestrina.
- Dorian: final D, scale D–E–F–G–A–B–C–D (minor with raised sixth).
- Phrygian: final E, scale E–F–G–A–B–C–D–E (characterized by the semitone between final and second degree).
- Lydian: final F, scale F–G–A–B–C–D–E–F (major with raised fourth — the tritone above the final).
- Mixolydian: final G, scale G–A–B–C–D–E–F–G (major with lowered seventh).
- Aeolian: final A, scale A–B–C–D–E–F–G–A (the natural minor scale).
- Ionian: final C, scale C–D–E–F–G–A–B–C (the major scale).
The modal system profoundly influences the character of Renaissance counterpoint in ways that extend beyond the simple question of which pitches are available. Each mode has its own characteristic melodic gestures, its own range of available cadential formulas, and its own emotional character as perceived by Renaissance theorists. The Phrygian mode, for instance, with its unique semitone at the bottom (E–F rather than the whole step at the bottom of all other modes) produces a characteristic cadential gesture — the Phrygian cadence, in which the bass descends by semitone (E to D#, or in transposition any descent by semitone) — that sounds strikingly different from all other cadential approaches and persists in tonal music as a special expressive device even after the modal system was supplanted by major-minor tonality.
The practical consequence for species counterpoint exercises is that the student must be aware of the modal framework of the cantus firmus and adjust the counterpoint accordingly. A Dorian cantus firmus — ending on D, with a characteristic B-natural in ascent and B-flat in descent — demands that the counterpoint respect these inflections and employ the Phrygian-cadence gesture (E to D in the bass) at the final cadence rather than the major-mode leading-tone approach. Understanding the modal character of the cantus is inseparable from the craft of good counterpoint in the Renaissance style.
Chapter 2: First and Second Species Counterpoint
2.1 First Species: Note Against Note
First species counterpoint is the most rigorous and revealing of the five species, because it strips away all rhythmic differentiation and presents the harmonic skeleton of the polyphonic texture in the starkest possible form. In first species, the counterpoint voice adds one note for each note of the cantus firmus (the given melody), and every vertical interval between the two voices must be consonant. There is no rhythmic complexity to distract from the interval-by-interval logic; every choice is exposed and consequential.
The opening and closing requirements are both formal and acoustic. A polyphonic piece must establish and confirm the mode (or key), and in strict counterpoint the strongest declarations of modal centre are the perfect consonances. The approach to the final octave or unison by contrary motion — typically the soprano moving up by step (the leading tone to tonic) and the bass moving down by step or fourth (dominant to tonic) — is the prototype of all cadential motion in Western music. Every perfect authentic cadence in common-practice harmony is an elaboration of this basic gesture, ornamented by harmonic rhythm, dominant seventh chords, and rhythmic displacement, but structurally reducible to this two-voice framework.
The rule against hidden fifths and octaves (Definition 1.4) adds a further constraint. A hidden fifth occurs when two voices move in similar motion to a perfect fifth, with at least one voice leaping. Even though the preceding interval was not a fifth, the sudden parallel approach to the fifth creates an acoustic impression of parallel-fifth motion. In strict first species, hidden fifths between the outer voices are permitted only when the upper voice moves by step — which guarantees that the stepwise motion distinguishes the two voices sufficiently to prevent the impression of fusion.
2.2 The Cantus Firmus and Its Role
The cantus firmus — the “fixed song” — occupies a central position in the pedagogical tradition not merely because it provides a harmonic framework for the counterpoint exercise but because it connects the student to the deepest roots of the polyphonic tradition. In the medieval organum, a plainchant melody was held in long notes in the lower voice (the tenor, from Latin tenere, to hold) while upper voices added florid elaborations above it. Fux’s cantus firmus exercises recreate this structure in miniature, connecting the student through a continuous practice to the origins of Western polyphony in the chant tradition of the medieval church.
The student should be aware that the cantus firmus is not merely a harmonic given but a melodic partner. Even in first species, where the added voice has the responsibility for creating all the melodic interest (since the cantus moves in whole notes at the same rate as the counterpoint), the added voice must be composed in dialogue with the cantus, not merely in avoidance of the cantus. A good first-species exercise feels like a conversation — question and answer, rise and fall, tension and resolution — not like a harmonic calculation.
2.3 Rules and Good Melodic Intervals
In first species, because every note is exposed on a strong beat and every interval is directly audible, the quality of the melodic intervals in the counterpoint voice matters enormously. A first-species counterpoint that consists entirely of stepwise motion has no melodic interest and fails as a melody even if it respects all the harmonic rules. One that contains too many leaps becomes angular and unsingable. The art is in the balance.
- Stepwise motion (seconds) and the consonant skip of a third should predominate, accounting for the majority of melodic motion.
- Larger leaps (fourths, fifths, sixths) should be exceptional and should be approached and quitted by contrary stepwise motion.
- The counterpoint should have a clear melodic arc — rising to a single climax and descending to the cadence — rather than oscillating without direction.
- Contrary motion between the voices should predominate; similar, oblique, and (especially) parallel motion should be used sparingly and with awareness of their harmonic implications.
It is worth dwelling on the aesthetic rationale for these melodic preferences. The insistence on a single melodic climax reflects the Renaissance understanding of melody as a purposive arc — a gesture that begins, rises to a point of maximum energy, and falls to a point of repose. A melody that oscillates or wanders without direction lacks this purposiveness; it fails to communicate anything. The preference for contrary motion between the voices reflects the polyphonic ideal of independence: when both voices move in contrary directions, neither dominates, and the listener hears a genuine dialogue between equals. These are not merely technical rules but aesthetic principles that the great masters of counterpoint internalized as the deep grammar of musical discourse.
2.4 Approaching the Cadence
The cadence — from the Latin cadere, to fall — is the moment of formal arrival at which melodic and harmonic motion converges on a point of rest. In first species counterpoint, the cadence is structurally simple but musically decisive: it is the moment at which all the tension of the preceding intervals resolves to the stability of a perfect consonance. Understanding the cadence as a structural event — not merely a formula to be applied but a moment of formal conclusion that must be prepared by the preceding material — is essential to genuine contrapuntal thinking. A well-prepared cadence feels inevitable; the final perfect consonance is felt as the answer to a question that the entire exercise has been asking. A poorly prepared cadence feels arbitrary — the exercise simply stops, without the sense that it has reached its destination.
This cadential formula encodes the fundamental logic of tonal harmonic motion. The leading tone’s ascending semitone to the tonic creates a sense of inevitable arrival — the slight dissonance of the diminished seventh between \(\hat{7}\) and the bass \(\hat{1}\) (in a tonal context where \(\hat{7}\) sits above the dominant) resolves outward. The simultaneous descent of the bass reinforces the finality of the cadence. Every formal cadence in common-practice harmony is an elaboration of this gesture; the dominant seventh chord formalizes the harmonic basis of the leading tone’s urgency; but the underlying two-voice motion remains constant from the fifteenth century to the nineteenth.
2.5 Second Species: Two Notes Against One
Second species introduces rhythmic differentiation into the contrapuntal texture. Against each whole note of the cantus firmus, the counterpoint voice adds two half notes, producing a 2:1 ratio of motion. This ratio creates a crucial distinction between strong beats (the downbeat, where the half note coincides with the whole note) and weak beats (the midpoint of the bar, where the half note sounds against the sustained cantus note). Dissonance becomes available for the first time — but only in strictly prescribed circumstances that ensure perceptual resolution.
The passing tone is the central device of second species. It fills in the space between two consonances by stepwise motion, and the fact that it is dissonant on the weak beat is acoustically tolerable because the ear, having registered the strong-beat consonance, perceives the dissonant passing tone as motion — a flash of tension swept aside before the next strong beat arrives. The weak beat is, in this sense, a zone of relative harmonic permission: not lawless, but operating under a more lenient code than the strong beat. The ear’s tolerance for dissonance on a weak beat is approximately inversely proportional to the metric weight of that beat; in compound meter, where some weak beats are stronger than others, the rules must be calibrated accordingly.
2.6 The Cambiata and Consonant Skip
Second species permits two additional melodic devices beyond the passing tone and neighbor tone: the nota cambiata (changing-note figure) and the consonant skip. The cambiata is a four-note figure — step down, skip down a third, step up — that was a Renaissance cliché, widely used by Palestrina and his contemporaries. Although the third note of the figure (the note arrived at by the downward skip) is dissonant and is quitted by ascending step rather than by the expected resolution, the figure was so thoroughly conventionalized by Renaissance practice that strict species pedagogy admits it as an exception. This is a notable moment in the rules of counterpoint: an irregularity tolerated because it has become a stylistic formula.
The consonant skip provides melodic flexibility when the strict alternation of strong consonances and weak passing tones would produce an uncomfortably stepwise, monotonous line. A well-formed melodic contour requires occasional leaps, and the consonant skip — always to a consonance — satisfies this need while respecting the fundamental requirement that weak-beat activity resolve perceptually before the next downbeat. The consonant skip is not a dissonance treatment at all; it is a melodic option that happens to occur on a weak beat, and its consonance ensures that no special resolution is needed.
2.7 Melodic Independence in Two-Voice Writing
A persistent danger in two-voice counterpoint — one that afflicts student exercises with disproportionate frequency — is the tendency for the added voice to lose its melodic character and become merely a harmonic adjunct to the cantus firmus. This failure is partly a matter of attention — the student, focused on the vertical interval relationships, neglects to hear their added voice as a melody — and partly a matter of conception: the student thinks of themselves as “harmonizing” the cantus rather than “conversing” with it. The corrective is a change of perspective: one must compose the added voice as if it were the primary melody, temporarily forgetting the cantus, and then check that the resulting combination respects the interval rules. A voice composed with melodic integrity will almost always produce better counterpoint than one composed from the interval outward.
The test of melodic independence is the single-voice test: remove one voice from the texture and play or sing the other alone. A genuinely independent voice will sound like a complete and interesting melody in isolation; a dependent voice will sound like an incomplete fragment whose only reason for existence is its relationship to the other voice. Every voice in a species exercise — and every voice in a Bach invention or fugue — should pass the single-voice test. This is the standard to which the student must hold themselves, and it is a higher standard than mere correctness.
Chapter 3: Third, Fourth, and Fifth Species
3.1 Third Species: Four Notes Against One
Third species places four quarter notes against each whole note of the cantus, introducing the fastest melodic motion yet encountered and demanding a corresponding increase in the number of available voice-leading devices. The shift from two quarter notes (second species) to four (third species) is not merely a doubling of melodic activity; it changes the character of the contrapuntal texture fundamentally. In second species, the melody has time to breathe — two notes per cantus note, with each note occupying a full half beat and each melodic gesture clear and deliberate. In third species, the melody flows continuously, and the ear must follow a stream of quarter notes whose harmonic implications change at every beat. The contrapuntal challenge shifts from choosing the right interval to composing a genuinely flowing melody that respects all interval and motion-type constraints while maintaining melodic interest and direction. At this density — four times the speed of the cantus — the counterpoint voice has the character of a flowing, ornamented melody, and the student must manage not only vertical intervals but melodic shape, direction, and rhythmic continuity across the entire exercise. The challenge is no longer simply to avoid forbidden parallels but to compose a genuinely beautiful melodic line that observes the harmonic rules as a matter of course.
The abundance of notes in third species does not relax the fundamental prohibition against parallel fifths and octaves; if anything, the density of motion makes their detection more challenging. One must scan not only consecutive beats within the bar but also the connection between the fourth quarter note of one bar and the first quarter of the next — the bar-line connection that receives special scrutiny because it is the moment of metric renewal. Parallel fifths that straddle the bar line are just as forbidden as those within it, and more insidious because the eye tends to treat each bar as a self-contained unit. The discipline of systematically checking every bar-line connection — placing a mental bracket at the join of every two bars and verifying the motion type and interval quality — is an essential habit for the third-species student and for any contrapuntist writing at a fast rhythmic density.
The melodic demands of third species are substantial. The four-quarter-note stream must maintain a clear sense of direction and arc; it should not meander aimlessly from note to note. The most common defect in student third-species exercises is excessive stepwise motion without compensating leaps, producing a melody that seems to crawl rather than flow. Conversely, too many leaps produce an angular, disjunct line that loses melodic coherence. The ideal third-species counterpoint alternates between scalar runs and melodic skips with the naturalness of a well-conceived melody, always moving purposefully toward the cadence.
3.2 Third Species: Forbidden Parallels and the Battuta
The density of third species creates a specific problem with forbidden parallels that requires explicit attention. The most dangerous parallel in third species is the one that crosses the bar line — the parallel between the fourth quarter note of one bar and the first quarter note of the next. This parallel is particularly easy to miss in the process of composition because the bar line creates a visual break that the eye tends to treat as a harmonic break. But there is no harmonic break at the bar line in species counterpoint; the harmonic obligation continues without interruption from bar to bar.
A related issue is the treatment of parallel fourths in third species. While parallel fifths between any pair of beats are absolutely prohibited, and parallel octaves are prohibited between any strong-beat notes, parallel fourths between upper voices (not involving the bass) are permissible in third species, as they are in all species. The distinction between the outer-voice rules and the inner-voice rules becomes particularly important in three- and four-voice textures, where the rules governing the relationship of each voice to the bass must be distinguished from those governing the relationships between pairs of upper voices.
3.3 Fourth Species: Suspensions
Fourth species is arguably the most musically consequential of the five species, because it introduces the suspension — the device that generates more harmonic tension and expressive power than any other single technique in tonal music. The suspension is a held note from a preceding consonance that becomes dissonant against a new harmonic context on the downbeat, then resolves downward by step to a new consonance. Its characteristic effect — the sudden arrival of a long-anticipated dissonance on the metric accent, followed by the gentle release of resolution — is the emotional heart of an enormous proportion of Western tonal music, from the Renaissance motet through the Romantic symphony.
- Preparation: the note that will be suspended is first heard as a consonance on a weak beat (or on the preceding strong beat).
- Suspension: the note is held (tied) into the next strong beat, where it becomes dissonant against the other voice(s).
- Resolution: the suspended note moves downward by step to a consonance on the following weak beat.
The psychological power of the suspension is immense and is worth analyzing carefully. The preparation establishes the suspended note as a consonance — the ear hears it in a stable harmonic context and registers the note as “belonging” to the current harmony. When the other voice changes at the next downbeat, the held note suddenly becomes dissonant against the new harmonic context; the ear registers this as a collision, a moment of unresolved tension. The resolution — the downward step to a consonance — is thus heard as a release of that tension, an answering of the question the suspension posed. The entire three-phase figure is a paradigm of expectation, frustration, and fulfillment that mirrors the most basic patterns of human emotional experience.
What makes the suspension particularly powerful as a musical device is the combination of its metric placement and its preparation. A dissonance that appears without preparation — a sudden accented dissonance on the downbeat with no held-over note — would be jarring, a true collision. The preparation converts the dissonance from a collision to a suspension: the note has been there, it is familiar, it simply finds itself in an unexpected harmonic context. The listener’s response is not shock but sympathy — a sense that the suspended note must now find its way to resolution, and a sense of satisfaction when it does. This is the emotional logic of the suspension, and it is also the emotional logic of much of human experience: the familiar thing that finds itself displaced, suspended in a new context, and seeks its natural resolution.
The prohibition on upward resolution is not arbitrary. The psychological effect of a suspension depends on the sense that the suspended note is “held back” by the weight of the harmonic change beneath it — suspended, in the gravitational sense, above a new floor. The downward resolution is the note “falling” to its natural resting place below. An upward resolution reverses this metaphor: the note “escapes” upward rather than resolving downward, producing a sense of avoidance rather than resolution. This is dramatically appropriate in certain expressive contexts (Bach uses upward resolutions as expressive exceptions in free composition) but stylistically incongruent with the sustained, inexorable quality of strict species counterpoint.
3.3 Fifth Species: Florid Counterpoint
Fifth species, called florid counterpoint, combines all the devices of the preceding four species into a freely flowing line that moves in mixed note values according to the musical demands of the moment. Fifth species is the culmination of the species method — the point at which the systematic isolation of individual voice-leading techniques yields to their free combination in a texture that approaches actual composition. A well-written fifth-species exercise should sound like music, not like a pedagogical drill.
The transition from strict species to free composition is not a relaxation of standards but a change of the kind of discipline required. In the earlier species, the discipline is external: rules are applied one by one to each note as it is placed. In fifth species, the discipline must be internal: the rules have been so thoroughly absorbed that they operate automatically, below the level of conscious rule-checking, leaving the composer free to attend to the higher-level questions of melodic shape, rhythmic variety, and expressive trajectory. This internalization is the ultimate goal of the species system — not the production of correct exercises, but the formation of a musical mind that cannot produce forbidden parallels any more than a native speaker can produce ungrammatical sentences.
3.4 Fifth Species: Florid Counterpoint
Fifth species, called florid counterpoint, combines all the devices of the preceding four species into a freely flowing line that moves in mixed note values according to the musical demands of the moment. Fifth species is the culmination of the species method — the point at which the systematic isolation of individual voice-leading techniques yields to their free combination in a texture that approaches actual composition. A well-written fifth-species exercise should sound like music, not like a pedagogical drill.
The transition from strict species to free composition is not a relaxation of standards but a change of the kind of discipline required. In the earlier species, the discipline is external: rules are applied one by one to each note as it is placed. In fifth species, the discipline must be internal: the rules have been so thoroughly absorbed that they operate automatically, below the level of conscious rule-checking, leaving the composer free to attend to the higher-level questions of melodic shape, rhythmic variety, and expressive trajectory. This internalization is the ultimate goal of the species system — not the production of correct exercises, but the formation of a musical mind that cannot produce forbidden parallels any more than a native speaker can produce ungrammatical sentences.
3.5 Fourth Species Continued: The Bass Suspension and the 9–8
The most common suspensions in standard fourth species — 7–6, 4–3, and 9–8 — all involve the upper voice as the suspended voice, holding a note from the previous beat while the lower voice changes. The bass suspension reverses this: the lower voice holds a note from the previous beat while the upper voice changes, creating the interval of a second (or ninth) above the bass on the downbeat, resolving upward by step to a third. This 2–3 figure (the interval names referring to the bass suspension: second moving to third) is less common in strict species counterpoint but widespread in free tonal writing, where it typically arises from the stepwise descent of an inner voice (in four parts) while the bass is held.
3.7 The Aesthetics of Dissonance Treatment
One of the most revelatory aspects of studying species counterpoint is the light it sheds on the aesthetic function of dissonance in music more broadly. Dissonance, in the species system, is not an evil to be tolerated but a resource to be cultivated — the source of harmonic tension without which the consonances would be inert, lacking the contrast that gives them their sense of arrival and rest. Every master of counterpoint from Palestrina to Bach understood this intuitively: a piece of music that contains no dissonance is a piece without forward motion, without yearning, without the sense of becoming that is the essential quality of music as a temporal art.
The treatment of dissonance in fifth species — the free combination of all previous species — serves as a kind of controlled laboratory for developing the student’s sense of how much dissonance, of what types, and in what contexts, produces the optimal aesthetic effect. Too little dissonance, and the music sounds bland; too much, and it becomes incoherent, the ear unable to find its bearings. The expert fifth-species exercise achieves a continuous oscillation between tension and repose that is, in miniature, the same arc that a complete musical movement must trace on a larger scale.
Chapter 4: Two-Part Tonal Counterpoint
4.1 The Harmonic Minor Scale and Its Contrapuntal Implications
The transition from modal counterpoint (Chapter 1) to tonal counterpoint brings with it one of the most important practical complications in Western harmony: the dual nature of the minor mode. The natural minor scale (Aeolian mode) has a subtonic seventh degree (\(\hat{7} = \text{D}\) in E minor) that does not create the half-step leading tone needed for the authentic cadence. The harmonic minor scale — in which the seventh degree is raised by a semitone (\(\hat{7}^\sharp\)) — provides the leading tone but introduces an augmented second between \(\hat{6}\) and \(\hat{7}^\sharp\) (e.g., F to G# in A harmonic minor), an interval prohibited in strict melodic writing.
The practical consequence for two-voice tonal counterpoint is that the student must be aware of the directional melodic conventions of minor-mode writing. A soprano that ascends through \(\hat{6}^\natural\) and \(\hat{7}^\natural\) while approaching the tonic will produce a subtonic approach — a whole step to the tonic rather than a half step — which sounds modal and archaic in a tonal context. A soprano that uses the raised \(\hat{7}^\sharp\) in all contexts, even when descending, will produce the augmented second \(\hat{6}–\hat{7}^\sharp\) in downward motion, which is prohibited in melodic writing. These constraints — arising from the dual nature of the minor mode — add a layer of complexity to tonal counterpoint in minor that is absent in major, and they explain why minor-mode fugue subjects, chorale harmonizations, and free counterpoint require particular care in the treatment of the sixth and seventh scale degrees.
4.2 The Figured Bass Tradition and Its Connection to Counterpoint
Parallel to the species counterpoint tradition, the Baroque era developed a practical system of realizing harmony from an abbreviated notation: the figured bass, or basso continuo. In figured bass notation, the bass line is written out in full, and Arabic numerals placed below each bass note indicate the intervals to be realized above it by the keyboard player (harpsichordist or organist). The figure “6” indicates a first-inversion chord (sixth and third above the bass); “6/4” indicates a second-inversion chord; “7” indicates a seventh chord in root position; and so on. The keyboard player realizes the figured bass by filling in the upper voices according to the figured intervals, respecting the voice-leading rules of four-part counterpoint.
The connection between figured bass realization and species counterpoint is intimate. In both, the primary challenge is the management of independent voices above a given bass line; in both, the rules of consonance, dissonance treatment, and forbidden parallels govern every note-to-note connection. The difference is that figured bass realization requires the addition of three upper voices simultaneously (rather than the single added voice of species counterpoint), and the harmonic vocabulary is considerably richer (the full range of diatonic and chromatic chords, rather than the simple consonance framework of the species). The student who has mastered species counterpoint in two voices and four-part chorale writing has, in effect, mastered the underlying skills of figured bass realization; the additional challenge is merely the speed and fluency required by the practical performance context.
4.3 From Species to Free Counterpoint
The species method is a pedagogical scaffold, not a compositional style. No masterpiece of music consists entirely of strict first-species counterpoint, nor does any great composer work note by note through a rulebook while composing. The point of the species system is to isolate and master individual contrapuntal principles — consonance management, dissonance treatment, voice independence — so that they can be deployed naturally and flexibly in free composition. The transition from species exercises to free tonal counterpoint requires one additional conceptual step: the integration of harmonic progression.
In species counterpoint, the cantus firmus is simply a given melody, and the student adds voices whose notes are individually consonant with it. In free tonal counterpoint, both voices are composed simultaneously, and their interaction is governed not only by the interval-by-interval rules of the species system but also by the harmonic logic of functional tonality. Chords, harmonic rhythm, and the directed motion toward tonal goals shape the counterpoint at a higher structural level, creating a two-tier compositional problem: the interval-by-interval level of voice-leading and the phrase-by-phrase level of harmonic plan.
The bass and soprano voices are the primary contrapuntal pair in virtually all tonal music. The bass defines the harmonic root progression and the chord inversions; the soprano provides the melodic identity of the phrase. Together they form a two-voice counterpoint that must be satisfying in itself, even before inner voices are added. Heinrich Schenker made this point central to his analytical theory: the Ursatz (fundamental structure) of a tonal composition is a two-voice framework — a soprano descending stepwise from \(\hat{3}\) (or \(\hat{5}\) or \(\hat{8}\)) to \(\hat{1}\) above a bass that moves from the tonic to the dominant and back — that underlies the entire melodic and harmonic surface. Every chord change, melodic ornament, and rhythmic figure is, in Schenker’s analysis, an elaboration of this primordial two-voice framework.
4.4 Sequence and the Harmonic Logic of Modulation
The harmonic sequence — the repetition of a harmonic or melodic pattern at a different pitch level — is the primary mechanism by which tonal counterpoint traverses harmonic distance and visits secondary key areas. Sequences are not merely convenient devices for filling space between more important harmonic events; they have their own internal logic, deriving their forward momentum from the voice-leading motion of their constituent intervals.
The descending-fifth sequence is the most common in tonal music because it follows the natural direction of harmonic tension: each chord in the sequence acts briefly as a local dominant to the following chord, so the sequence creates a continuous chain of unresolved dominant resolutions. This chain of temporary dominants produces a sense of harmonic inevitability and forward motion that is the sequence’s most valuable property. In the contrapuntal domain, the descending-fifth sequence typically appears as a chain of 7–6 suspensions in two voices: the soprano holds a suspended seventh that resolves to a sixth, immediately becoming the preparation for the next suspension. The sequential chain of suspensions is thus a fourth-species pattern elevated to the harmonic level, operating across multiple chord changes rather than a single measure.
4.5 Figuration and Ornamentation
Once the two-voice harmonic skeleton has been established — a succession of consonant intervals governed by a coherent harmonic progression — the art of figuration takes over. Figuration is the process of animating the harmonic skeleton with the devices of the species system: passing tones, suspensions, neighbor tones, and the consonant skips that give each voice its melodic character. In free composition, figuration is not optional decoration but an essential part of musical expression; the same harmonic skeleton can be figurated in an infinite variety of ways, each producing a distinctive melodic and rhythmic character.
The Baroque technique of diminution — the replacement of a long structural note with several shorter notes that elaborate the same harmonic point — is the basic tool of figuration. A structural half note C supported by a G in the bass might be figured as C–D–C (upper neighbor), C–B–C (lower neighbor and return), C–B–A–G (descending passing tones through a third), or C (sixteenth)–B (anticipation)–C (return on the beat). In each case the harmonic fact — C5 against G3, a tenth — remains constant while the melodic surface is elaborated into a characteristic rhythmic and melodic gesture. Bach’s keyboard suites are inexhaustible libraries of figurative technique; every prelude, every allemande, every sarabande demonstrates a different approach to the animation of a harmonic skeleton.
4.6 The Contrapuntal Cadence
The cadence in free tonal counterpoint is more elaborate than in strict species, but it remains grounded in the same two-voice logic. The clausula vera — the authentic cadence — involves a soprano leading tone ascending to the tonic octave while the bass descends by a fourth (or ascends by a fifth) from dominant to tonic. This two-voice cadential formula is the harmonic prototype from which all cadential elaboration proceeds; every perfect authentic cadence in common-practice harmony is a elaboration of this formula.
- Perfect authentic cadence (PAC): bass moves \(\hat{5}–\hat{1}\) (root position dominant to root position tonic), soprano arrives on \(\hat{1}\). The strongest form of formal closure.
- Imperfect authentic cadence (IAC): dominant moves to tonic, but one or both chords is inverted, or the soprano ends on \(\hat{3}\) or \(\hat{5}\) rather than \(\hat{1}\). Weaker closure than the PAC.
- Half cadence: ends on the dominant, typically with soprano motion to \(\hat{2}\), \(\hat{4}\), or \(\hat{5}\). Creates a moment of structural suspension demanding continuation in a new phrase.
- Deceptive cadence: bass proceeds \(\hat{5}–\hat{6}\) (dominant to submediant) rather than to the expected tonic, while the soprano may arrive on \(\hat{1}\). The harmonic surprise creates a moment of withheld resolution, typically followed immediately by a repetition or elaboration of the cadential gesture.
- Plagal cadence: bass moves \(\hat{4}–\hat{1}\) (subdominant to tonic). More commonly heard as a post-cadential extension (the "Amen" cadence) than as a primary structural arrival.
4.7 Creating Musical Tension and Release
The overarching goal of contrapuntal writing — and, indeed, of all tonal music — is the management of tension and release. Tension is created by dissonance, by melodic ascent, by harmonic distance from the tonic, by registral expansion, and by rhythmic density. Release is created by resolution of dissonance, by melodic descent, by harmonic return to the tonic, by registral contraction, and by rhythmic relaxation. A great contrapuntal composition creates a long-range trajectory of tension and release, with local fluctuations nested within a global arc of departure from and return to the tonic.
The two-voice invention, as exemplified by Bach’s fifteen Two-Part Inventions (BWV 772–786), demonstrates the management of tension and release with extraordinary economy. Each invention is built from a single motivic idea, typically a compact figure of two to four beats, that is stated, imitated, inverted, and sequenced through a carefully planned tonal journey: from the tonic to the dominant (or relative major or minor), through various intermediate key areas, and back to the tonic with a final cadence of decisive weight. The contrapuntal techniques of imitation, inversion, augmentation, and stretto provide the means of this development; the harmonic plan provides the large-scale architecture that gives the development direction and purpose. What makes Bach’s inventions so powerful despite their brevity is the absolute coherence of these two levels — the local voice-leading is impeccable, and the global harmonic plan is architecturally clear — working together to produce a sense of organic inevitability that belies the music’s apparent simplicity.
4.8 Bach’s Two-Part Inventions as Compositional Models
Bach’s fifteen Two-Part Inventions (BWV 772–786), composed in their final form around 1723 for the musical education of his son Wilhelm Friedemann, are the most widely studied examples of free two-voice tonal counterpoint in the repertoire, and for good reason. Each invention is a complete compositional study in a single contrapuntal technique or stylistic gesture, rendered with absolute economy and clarity. They serve the same pedagogical function in free counterpoint that the species exercises serve in strict counterpoint: they isolate a single principle and demonstrate it with such transparency that the principle becomes unmistakable.
The student of free tonal counterpoint is advised to analyze Bach’s inventions at multiple levels simultaneously. At the surface level, one should identify the voice-leading: which notes are passing tones, which are suspensions, which are consonant skips? At the motivic level, one should identify the principal motive and trace its transformations — where it is stated literally, where it is inverted, where it appears in diminution or augmentation. At the harmonic level, one should map the key areas visited: where does the invention leave the tonic, what keys does it pass through, and how does it return? At the formal level, one should identify the sections — exposition, development, recapitulation or final cadence — and understand how they relate to each other. Only by analyzing at all four levels simultaneously can one begin to understand how the invention achieves its characteristic effect of spontaneous inevitability.
Chapter 5: Three- and Four-Part Counterpoint
5.1 Adding Inner Voices
Two-part counterpoint establishes the fundamental harmonic and melodic skeleton of a polyphonic texture. The outer voices — soprano and bass — carry the harmonic argument and define the two-voice counterpoint that underlies the entire texture. Inner voices — alto and tenor — fill in the harmony, complete the chords, and add registral warmth, but they are, in a structural sense, secondary to the outer-voice dialogue. This hierarchy is not merely theoretical; it is audible. In any four-part texture, the soprano and bass project more clearly than the inner voices: the soprano occupies the highest register, commanding attention by its acoustical prominence; the bass occupies the lowest, providing the harmonic foundation that the upper voices define themselves against. The alto and tenor fill the middle ground, their lines important for smooth voice-leading and complete harmony but rarely the primary bearers of melodic interest. Three- and four-part writing fills in this skeleton with additional voices that complete the harmony, add registral depth, and create a richer timbral environment. The addition of inner voices is not a mechanical harmonic task — the insertion of whatever pitch happens to complete the chord — but a contrapuntal challenge requiring that each added voice maintain its own melodic integrity, follow the rules of voice-leading, and avoid creating forbidden parallels not only with the outer voices but also with each other.
The challenge of four-part writing is combinatorial: where two-part counterpoint requires managing one pair of voices, four-part writing requires managing six pairs simultaneously (soprano–alto, soprano–tenor, soprano–bass, alto–tenor, alto–bass, tenor–bass). Forbidden parallels between any pair of these six combinations are forbidden; voice crossing and overlapping, while technically possible, introduce registral confusion that undermines the clean identification of voices. This apparent complexity is manageable in practice because many of the constraints reinforce each other: good outer-voice counterpoint (soprano and bass) tends to create conditions that simplify inner-voice management, since the harmonic framework established by the outer voices constrains the choices available to the inner voices.
A practical strategy for composing four-part counterpoint is to proceed in two stages: first, compose the outer-voice counterpoint (soprano and bass) as a complete two-voice composition in the style of the species exercises, ensuring that it forms a satisfying and correct two-voice piece in its own right. Then add the inner voices (alto and tenor) to fill in the harmony, treating each inner voice as a third-species counterpoint above the bass and checking each addition against the soprano and against the other inner voice. This two-stage approach — outer voices first, inner voices as filling — reflects the hierarchical structure of the texture and is how many composers (including Bach, by all accounts) actually worked. The outer-voice composition is the structural act; the inner-voice completion is the harmonic elaboration.
5.2 Chord Spacing and Doubling Rules
The physical distribution of chord tones across the four voices — chord spacing — profoundly affects the sonority of the texture. The principles of spacing derive from the acoustics of the overtone series: lower intervals should be wider to avoid the acoustic muddiness created by closely spaced bass tones (whose overtones clash more severely at short intervals), while higher intervals may be narrower because the upper partials of high-register tones are weaker and less likely to produce perceived roughness.
- Double the root: the most stable and acoustically grounded choice.
- Double the fifth: permissible and common when the root doubling produces awkward voice-leading or forbidden parallels.
- Double the third: generally avoided, though sometimes necessary; in particular, the third of a major chord in a bright register can be doubled without ill effect.
- Never double the leading tone: both voices would need to resolve upward by semitone to the tonic, producing parallel octaves on the most structurally important moment of the phrase.
The prohibition on doubling the leading tone is a direct consequence of the parallel-octave prohibition, applied to the specific harmonic context of the dominant chord resolving to the tonic. If the leading tone (\(\hat{7}\)) appears in two voices, both must resolve upward by semitone to the tonic (\(\hat{8}\)); the resulting parallel octaves on the cadence — the most emphatic moment of the phrase — would be a catastrophic failure of voice independence at the one moment when independence matters most. The rule against leading-tone doubling is therefore not a freestanding prohibition but a specific application of the most fundamental principle of counterpoint.
5.3 Voice Crossing and Overlapping
Voice crossing occurs when a lower voice temporarily rises above a higher voice, or vice versa. Voice overlapping occurs when a voice moves to a pitch beyond the previous position of an adjacent voice, even without literally crossing in the current chord. Both practices disrupt the registral hierarchy that gives each voice its distinct identity, and both are prohibited in strict four-part writing.
5.4 The Six-Four Chord and Its Contrapuntal Treatment
The second-inversion triad — the chord in which the fifth of the chord appears in the bass — presents a special voice-leading challenge in four-part writing that has no exact equivalent in two-voice counterpoint. Because the bass note is the fifth of the triad rather than the root, the chord root appears as a fourth above the bass; in the two-voice context of Definition 1.5, we noted that the fourth above the bass functions as a dissonance. In four-part writing, the second-inversion triad — the \(^6_4\) chord — is treated as an embellishing dissonance that requires both preparation and resolution.
- Cadential \(^6_4\): the dominant \(^6_4\) chord (e.g., \(\text{G–C–E}\) above a G bass in C major), which appears on a strong beat immediately before the dominant triad at a cadence. The fourth (C) and sixth (E) above the bass resolve downward by step to the third (B) and fifth (D) of the dominant, creating a double suspension over the dominant bass note.
- Passing \(^6_4\): a \(^6_4\) chord that arises from the passing motion of the bass through the fifth of a chord, connecting two positions of the same harmony. It appears on a weak beat and resolves by the continuation of the bass through passing motion.
- Pedal \(^6_4\): a \(^6_4\) chord that arises from the motion of upper voices above a sustained or repeated bass note. The bass is the structural note; the upper voices move to and from the \(^6_4\) position as a neighbor-tone embellishment of the simpler harmony.
The cadential \(^6_4\) is by far the most important of these three uses and is one of the most powerful harmonic devices in tonal music. Its power derives from the same principle as the suspension: a strong-beat dissonance (the fourth and sixth above the dominant bass) is sustained momentarily before resolving downward to a consonance (the third and fifth of the dominant triad). The cadential \(^6_4\) is, in effect, a double suspension over the dominant, and its resolution produces the sense of harmonic arrival at the dominant that makes the subsequent motion to the tonic all the more decisive. Bach uses the cadential \(^6_4\) at virtually every formal cadence of significance in his chorale harmonizations, as does virtually every tonal composer from the Baroque through the late Romantic period.
5.5 Bach Chorales as Models
Bach’s chorale harmonizations are the supreme models of four-part contrapuntal writing, and every serious student of harmony and counterpoint must study them with the closest possible attention. In these settings — of Lutheran chorales originally composed in the sixteenth and seventeenth centuries — Bach transforms simple, often modal tunes into extraordinarily rich harmonic tapestries, using the full range of chromatic harmony, second-inversion chords, applied dominants, borrowed chords, and melodic figuration, all while maintaining impeccable four-part voice-leading throughout. A Bach chorale is simultaneously a masterpiece of harmonic imagination and a flawless example of four-part voice-leading technique, and the student can learn from it at both levels.
A useful analytical exercise is to reduce a Bach chorale to its outer-voice counterpoint alone — stripping away alto and tenor — and to analyze the resulting soprano-bass dialogue as a two-voice contrapuntal composition. Invariably, the outer voices form a satisfying and coherent two-voice counterpoint; the inner voices serve as harmonic amplification and textural enrichment rather than as structural necessities. This reduction reveals the two-voice skeleton that Bach composed first (or at least conceived first), the framework on which the full four-voice texture is elaborated.
5.6 Non-Harmonic Tones in Four-Part Texture
The species rules for dissonance treatment — passing tones, neighbor tones, suspensions — apply with equal force in four-part writing, but the presence of additional voices creates new possibilities for dissonance that have no equivalent in two-voice writing. In particular, the pedal point — a sustained bass note over which the upper voices move through harmonic progressions that create dissonances against the held note — is exclusively a four-part (or more) device; in two-voice writing, a sustained bass note simply maintains a fixed harmonic relationship with the moving upper voice, producing whatever consonances or passing dissonances the species rules permit. With four voices, the held bass note may become dissonant against the upper-voice harmony even while those upper voices form consonances among themselves, creating a three-layer texture of considerable richness.
The dominant pedal — sustained dominant in the bass beneath chromatic or dissonant upper-voice progressions — is one of the most powerful tension-building devices in all of tonal music. Its power derives from a paradox: the bass, the register most strongly associated with harmonic foundation, is sustaining the “wrong” note — the dominant rather than the tonic — while the upper voices are free to roam harmonically. The ear registers this as an unresolved tension of extraordinary duration, and when the resolution arrives — the dominant pedal note finally resolving upward to the tonic (or the upper voices settling onto the tonic triad above the sustained dominant pedal, creating a final dissonance before the resolution) — the effect is one of accumulated pressure finally released.
5.7 Chromatic Voice-Leading in Four-Part Writing
The full resources of tonal harmony include not only diatonic chords but chromatic harmonies — applied dominants (secondary dominants), Neapolitan chords, augmented sixth chords, borrowed chords from the parallel major or minor, and the full range of chromatic passing tones and neighbor tones. Incorporating chromatic harmonies into four-part counterpoint requires particular care, because chromatic notes create additional voice-leading obligations: the raised note of an applied dominant must resolve upward (as a temporary leading tone), the lowered note of a Neapolitan must resolve downward, and the augmented sixth of an augmented-sixth chord must resolve outward by semitone in both directions simultaneously.
The voice-leading of applied dominants in four-part chorale writing requires care at two points: the approach to the applied dominant (the chromatic note must be introduced without creating a forbidden augmented second in any voice) and the resolution (the temporary leading tone must resolve upward, creating the risk of parallel octaves if the bass also moves to the same pitch class). Bach navigates these issues in his chorale harmonizations with characteristic elegance: he often introduces chromatic notes as passing tones or neighbor tones, approached by half step from a diatonic pitch, so that the chromaticism feels like melodic figuration rather than a harmonic shock.
5.9 The Complete Chorale Texture: Harmonic Rhythm and Voice-Leading
In Bach’s chorale harmonizations, every note of the soprano chorale melody is harmonized by a complete four-part chord, and the harmonic rhythm — the rate at which the underlying harmony changes — varies from one chord per beat to one chord per half-measure or even one chord per measure, depending on the character of the phrase. Controlling the harmonic rhythm is one of the most important skills in four-part writing; a uniformly fast harmonic rhythm produces relentless activity without repose, while a uniformly slow one produces stasis without forward motion. The art lies in varying the harmonic rhythm to match the emotional and formal demands of the phrase.
- Slow down (fewer chord changes per unit time) at the approach to a cadence, to create a sense of gathering and arrival.
- Speed up (more chord changes) in the middle of a phrase, to create forward momentum and harmonic interest.
- Begin at a moderate pace to establish the metric and harmonic framework before any accelerations or decelerations.
- Avoid changing harmony on every single beat for extended passages, as uniform harmonic rhythm eliminates the metric hierarchy that gives strong and weak beats their perceptual distinction.
Bach’s skill in harmonic rhythm is perhaps most evident in his chorale harmonizations of irregular phrase lengths. A phrase of five or seven beats — irregular because it falls outside the expected four-beat norm — demands particularly careful harmonic rhythm management: if the harmony changes too predictably, the irregularity of the phrase length is smoothed over; if the harmony changes too irregularly, the phrase loses its sense of direction. Bach invariably finds the path that honors the phrase length’s individuality while maintaining the sense of purposeful forward motion that makes each chorale a complete and satisfying musical statement.
Chapter 6: Invertible Counterpoint and Canon
6.1 Invertible Counterpoint at the Octave
Invertible counterpoint is the technique of composing two or more voices such that they can be exchanged — the upper voice placed below the lower — without producing forbidden parallels or other voice-leading errors. A passage of invertibly written counterpoint is a far more economical compositional resource than one that exists in only one configuration: the composer gains a second (or more) valid version of the material at no additional compositional cost. Bach exploited invertible counterpoint with extraordinary thoroughness; the two-part inventions are studies in the technique, and the fugues of the Well-Tempered Clavier deploy it at every structural level.
The interval transformation under octave inversion has a critical consequence: the perfect fifth (interval 5) inverts to a perfect fourth (interval 4). In two-voice counterpoint, the perfect fourth above the bass is treated as a dissonance requiring resolution when it appears on a strong beat without the support of additional voices. This means that in a passage intended to be invertible at the octave, each perfect fifth must be voice-led in a way that would make the resulting fourth workable in the inversion. In practice this means avoiding strong-beat fifths that would become inadmissible strong-beat fourths, and being particularly careful with parallel fifths, which would invert to parallel fourths — not technically forbidden between upper voices, but acoustically troubling.
- Avoid consecutive perfect fifths in the original version (they become consecutive fourths in the inversion).
- The perfect fifth on a strong beat in the original becomes a perfect fourth on a strong beat in the inversion — use with caution; ensure the fourth is approached and quitted in a manner that would be acceptable as a dissonance treatment.
- All other consonances invert to consonances, and all standard dissonance treatments (passing tones, neighbor tones, suspensions) remain valid after inversion, since the resolution direction (downward by step) is preserved under octave inversion.
6.2 Practical Composition of Invertibly Counterpointed Passages
The practical discipline of composing a passage of invertible counterpoint at the octave is best approached in two stages. First, compose the upper voice (dux) as a complete, melodically satisfying line that observes all the rules of the given species. Then, before writing the lower voice (comes), mentally transpose the upper voice down an octave and determine what that transposition will become in the inverted version: this is the voice that will appear in the upper register when the passage is inverted. Now compose the lower voice so that it forms correct counterpoint with the original upper voice and simultaneously, when it is displaced to the upper octave, forms correct counterpoint with the upper voice displaced to the lower octave.
6.3 Invertible Counterpoint at the Tenth and Twelfth
While invertible counterpoint at the octave is by far the most common type, the Renaissance and Baroque theorists also recognized and employed inversion at the tenth (producing an exchange of voices with a displacement of a tenth) and at the twelfth (displacement by a twelfth). These rarer inversions impose more severe constraints and are correspondingly more difficult to compose.
Beethoven’s use of counterpoint invertible at the twelfth in his late string quartets (particularly Op. 131 and Op. 132) produces textures of extraordinary complexity: the same motivic combination sounds in entirely different registral configurations in each of its appearances, always valid contrapuntally, yet always surprising in its effect. The intellectual discipline required to compose such passages — to envision the material in all its inversional configurations simultaneously — is among the most demanding in musical composition.
6.4 The Double Countersubject and Multiple-Countersubject Fugues
Some fugues employ not one but two regular countersubjects, each invertible with the subject and with each other. This produces a fugue with three mutually invertible elements — subject, countersubject 1, and countersubject 2 — permitting all six possible orderings of the three voices. Such a fugue is said to employ triple counterpoint (in the sense of three mutually invertible melodic elements, not to be confused with three simultaneous voices). The formal richness available to a composer who has constructed three mutually invertible elements is enormous: each of the six orderings produces a new harmonic and registral configuration of the same motivic material, and the middle section of the fugue can deploy these orderings in varied succession to create a sense of constantly renewed discovery.
- Every pair of the three elements is individually invertible at the octave (no forbidden parallels in any inversion of any pair).
- The combination of all three simultaneously produces no forbidden parallels in any of the six ordering permutations.
Bach’s B-flat minor fugue from WTC Book I (BWV 891) is frequently cited as an example of a fugue with two regular countersubjects, both invertibly counterpointed with the subject and with each other. The exposition presents each of the three voices in turn, each new entry accompanied by the two previously stated elements in their various permutations. The economy of material — three elements generating an entire fugue’s worth of motivic content — and the formal elegance of the systematic permutation of orderings make this type of fugue a particularly satisfying demonstration of contrapuntal craft.
6.5 Invertible Counterpoint in the Well-Tempered Clavier
Bach’s systematic exploitation of invertible counterpoint in the WTC provides the clearest available demonstration of how this technique shapes the large-scale formal plan of a fugue. In a typical Bach fugue with a regular countersubject, the entry pattern follows a rotation: subject in voice A accompanied by countersubject in voice B; then subject in voice B accompanied by countersubject in voice A (the exchange of positions, possible only because the two are invertibly counterpointed). A fugue with n voices can, in principle, present the subject n times in the exposition with n different registral assignments of subject and countersubject — each permutation being a new formal event, a new combination of familiar material. In a three-voice fugue with one regular countersubject, three entries of the subject yield three permutations: (S above CS), (CS above S in middle voice, with free counterpoint in third voice), and (S in bass with CS above). Not all three permutations need be used in the exposition, but their availability shapes the formal choices available in the middle section and the return.
6.6 Canon: Definition and Construction
A canon is a polyphonic composition in which one or more voices strictly imitate a leading voice after a fixed time delay and (optionally) at a fixed pitch interval. It is the purest expression of the contrapuntal ideal: a single melody, sounding simultaneously in multiple temporal and possibly pitch-transposed versions, creating a complete harmonic and polyphonic texture from a single melodic line. The canon demonstrates that the contrapuntal ideal — the equality and independence of multiple voices — can be realized with maximal economy: one voice, heard against its own temporal reflection.
Composing a canon is an exercise in extreme compositional discipline. Every note of the dux becomes, after the time interval, a note of the comes; the composer must ensure that every vertical interval formed between the dux and the comes satisfies the voice-leading rules. Since the comes is simply a delayed copy of the dux, this imposes a strict self-consistency requirement on the dux: the melody must be consonant (or appropriately dissonant) with itself when heard against a temporally displaced version of itself. This constraint typically forces the dux melody to have a very particular intervallic structure, and discovering this structure — composing a melody that generates its own valid harmonic companion — is one of the great intellectual pleasures of contrapuntal composition.
6.7 Canon Varieties and the Double Invention
The variety of canonical techniques available to a composer is a rich source of structural interest. Beyond the straightforward unison or fifth canon, the Baroque era explored augmentation, diminution, inversion, retrograde, and combinations thereof. Each variety imposes its own constraints on the dux melody and produces its own characteristic sonic effect.
- Canon by augmentation: the comes moves at half the speed of the dux — all note values doubled. The dux must be consonant with a version of itself at double speed, a very strong constraint.
- Canon by diminution: the comes moves at twice the speed of the dux — all note values halved.
- Canon by inversion: every ascending interval in the dux becomes a descending interval of the same size in the comes, and vice versa. The dux must be consonant with its own mirror image.
- Retrograde canon (crab canon, cancrizans): the comes plays the dux backward, from the last note to the first. The dux must form valid two-voice counterpoint when combined with its own retrograde — an extraordinarily severe constraint.
- Spiral (perpetual) canon: each successive entry is at a different pitch level, and the canon modulates continuously through a cycle of keys before returning to the original pitch. Bach's Canon per Tonos is the supreme example.
The double invention is a two-voice piece in which two contrasting motives are introduced separately and then combined in invertible counterpoint. Bach’s Invention No. 9 in F minor (BWV 780) is a particularly clear example: the first motive (a leaping arpeggio figure) is presented in one voice while the second motive (a flowing chromatic descent) appears in the other; after both have been stated, they are combined, and then their registral positions are exchanged. The invertibility of the combination — the fact that the arpeggio works both above and below the chromatic descent — is the structural demonstration that the two motives were composed specifically for each other, that their harmonic profiles are complementary.
6.8 Augmentation and Diminution as Structural Devices
Beyond their use in canon, augmentation and diminution serve as structural devices in the middle sections and final entries of fugues, providing a means of varying the temporal profile of a theme without altering its pitch content. When the subject appears in augmentation — all note values doubled — it seems to expand in time, acquiring a quality of dignity or gravitas appropriate to a climactic arrival. When it appears in diminution — values halved — it moves twice as fast, creating a sense of acceleration, urgency, or fugato playfulness.
- Augmentation: all note values multiplied by a constant factor \(k > 1\) (typically \(k = 2\)). The pitch sequence is preserved; the rhythm is slowed.
- Diminution: all note values multiplied by \(k < 1\) (typically \(k = 1/2\)). The pitch sequence is preserved; the rhythm is accelerated.
- Melodic inversion: each ascending interval of the subject becomes a descending interval of the same size, and vice versa. The subject is reflected about its first pitch, or about any chosen axis pitch. Melodic inversion creates a mirror image of the subject that may suggest a modal or expressive contrast to the original.
The Art of Fugue — Bach’s last major work, left unfinished at his death in 1750 — is an encyclopedic collection of contrapuntal studies on a single subject: a stepwise theme in D minor of great simplicity and motivic richness. Each of the work’s eighteen canons and fugues demonstrates a different contrapuntal technique: simple fugue, fugue in contrary motion, double and triple fugue, augmentation fugue, stretto fugue, mirror fugue (in which the entire fugue is played simultaneously with all voices inverted), and canons at various intervals and time distances. The Art of Fugue is thus the theoretical culmination of everything the present course has covered — a systematic demonstration that a single subject, properly constructed, contains the seeds of an entire contrapuntal universe.
6.9 The Goldberg Variations as Compendium of Canonic Technique
Bach’s Goldberg Variations (BWV 988, 1741) provide the richest single demonstration of canonic technique in the keyboard repertoire. The thirty variations are organized in groups of three: each group of three ends with a canon, and the canons progress systematically from a canon at the unison (Variation 3) through canons at the second, third, fourth, fifth, sixth, seventh, eighth, and ninth (Variations 6, 9, 12, 15, 18, 21, 24, 27), concluding not with a canon but with a quodlibet — a humorous combination of popular songs — before the return of the original aria. The systematic progression through canonical intervals provides a survey of the acoustical and contrapuntal properties of each interval: the canon at the unison feels intimate and self-reflective; the canon at the fifth (Variation 15, in G minor — the only minor-mode variation) has a quality of yearning and searching; the canon at the seventh (Variation 21) creates the most acute harmonic tension of any canon in the set, since the seventh is a dissonance requiring resolution and the canonic relationship constantly creates momentary seventh-suspensions between dux and comes. Together, the nine canons of the Goldberg Variations constitute a practical textbook of canonical technique at least as instructive as any theoretical exposition.
6.10 The Art of Writing a Canon
Composing a successful canon — one that sounds musical rather than mechanical — is one of the most demanding exercises in contrapuntal writing, and mastering it confers an intimate understanding of the relationship between melodic structure and harmonic implication that no other exercise can provide. The difficulty is that every note of the dux simultaneously serves two functions: it is a note in the dux melody (with certain horizontal, melodic obligations) and it is a harmonic preparation for the corresponding note of the comes (with certain vertical, harmonic obligations toward the comes’ continuation). These two functions are simultaneously operative and occasionally in tension, and the art of canon-writing lies in finding melodies that satisfy both without compromise.
This self-compatibility condition is the mathematical heart of canon construction. It explains why canons tend to avoid the perfect fifth as a harmonic interval (it would invert to a fourth in the comes-over-dux arrangement, and fourths over a bass are dissonant), why they prefer stepwise motion (leaps in the dux create potentially awkward intervals against the comes’ continuation), and why the most successful canon melodies have a certain internal redundancy — they tend to move in relatively predictable, scalewise patterns that remain consonant against their own temporal displacement.
Chapter 7: Fugue: Structure and Technique
7.1 Subject and Answer
The fugue is the supreme achievement of the contrapuntal tradition — the genre in which every technique studied in the preceding chapters converges into a single, architecturally unified composition. To approach the fugue after the species exercises, the inventions, the chorale harmonizations, and the study of invertible counterpoint and canon is to experience the satisfaction of synthesis: every technique one has mastered in isolation appears here in combination, serving the unified formal goal of the fugue. The suspension that was isolated in fourth species now appears as the countersubject’s primary harmonic feature; the sequence that was practiced in the invention’s episode now serves as the modulating link between subject entries; the invertible counterpoint that was the subject of Chapter 6 now organizes the formal deployment of subject and countersubject across the exposition and middle entries. The fugue is not a new technique but the integration of all previous techniques into a single compositional form of extraordinary expressive range and formal discipline. A fugue is not merely a contrapuntal exercise of arbitrary complexity; it is a precisely articulated form with defined sections, each serving a distinct formal and expressive function. To understand the fugue is to understand the entire Baroque contrapuntal system in its most fully realized expression.
The fugue begins with its most basic unit: the subject (or dux, leader). The subject is a single melodic line, typically four to eight measures in length, that encapsulates the character and harmonic meaning of the entire fugue. A great fugue subject has a distinctive rhythmic profile, a clear harmonic implication, and a melodic contour that is both singable in isolation and capable of generating interest when combined with itself and with a countersubject. The subject is the fugue’s DNA: every subsequent event in the fugue — the countersubject, the episodes, the stretto — grows from it or is shaped by it.
The distinction between real and tonal answers is one of the most subtle points in fugue construction and reflects a deep principle of tonal harmonic grammar. A real answer transposes the subject literally to the dominant key; a tonal answer modifies certain intervals — particularly the opening interval, if the subject begins with an ascent from \(\hat{1}\) to \(\hat{5}\) or with a prominent \(\hat{5}\) in the first measure. The rationale: if the subject opens with the tonic-to-dominant fifth, the answer that replaces it with a dominant-to-dominant-dominant fifth implies a modulation to the key of the dominant’s dominant — two steps away from the tonic — at the very outset. To preserve the tonic-dominant polarity of the opening exposition, the tonal answer replaces the opening ascending fifth with an ascending fourth (from \(\hat{5}\) to \(\hat{1}\) in the dominant key), effectively neutralizing the modulating tendency while still implying the dominant key.
7.2 Subject Design and Its Harmonic Implications
The design of a fugue subject is the most consequential compositional decision in the entire fugue. Every feature of the subject — its length, its rhythm, its harmonic implication, its melodic contour — predetermines what will be possible in the subsequent development. A subject that begins on the tonic and ends on the dominant is easily answered; a subject that modulates aggressively through multiple keys before settling is problematic, because the answer must begin in the dominant but cannot simply transpose the subject’s modulations up a fifth without immediately overshooting the harmonic range of the exposition. A subject that consists entirely of repeated notes has no possibilities for sequential development, because sequences require melodic motion; a subject that moves entirely by step offers limited possibilities for harmonic sequence at the fifth, because the step motion does not span the harmonic distance that sequences typically traverse.
- Harmonic clarity: it implies a single tonal center (or a clear departure from and return to one), so that the tonal answer is unambiguous and the expositions of subject and answer are harmonically coherent.
- Rhythmic identity: it has a distinctive rhythmic profile that is recognizable even when the subject is inverted, augmented, or heard in stretto against itself.
- Motivic fertility: it contains internal motives — short rhythmic or melodic patterns — that can be extracted and developed independently in the episodes. A subject with no internal motivic cells offers nothing from which to build episodes.
- Stretto potential: its harmonic structure permits it to be combined with a time-displaced version of itself without producing forbidden parallels. Not every subject has this property, and subjects that lack it cannot be used in stretto.
- Melodic completeness: it is a satisfying melody in its own right, independent of the countersubject. If the subject is melodically inert, the fugue will feel lifeless at every moment of bare subject entry.
The relationship between subject design and stretto potential is particularly important and illustrates the degree to which the subject of a great fugue is compositionally engineered rather than merely invented. Bach’s subjects in the Well-Tempered Clavier vary widely in their stretto potential: some admit stretto at only one time interval, while others (particularly in the numerically complex multi-voice fugues like BWV 849 in C# minor) admit stretto at multiple time intervals and pitch intervals. The C# minor fugue’s subject appears to have been composed specifically to maximize stretto potential — its internal structure is such that every possible stretto configuration produces valid counterpoint — making the fugue a tour de force of contrapuntal engineering in which the formal development of the middle section seems to follow inevitably from the properties of the subject itself.
7.3 The Countersubject and Its Invertibility
As the answer enters in the second voice, the first voice does not fall silent; it continues with new material called the countersubject. The countersubject is the fugue’s second melodic personality, and in a well-constructed fugue it is composed to be not merely compatible with the subject but invertibly counterpointed with it. This invertibility — the ability of subject and countersubject to exchange registral positions — is what allows the countersubject to appear in multiple voices throughout the fugue without losing its function.
A well-designed countersubject serves multiple compositional functions simultaneously. Melodically, it must be interesting enough to sustain attention during the answer — it cannot simply sustain a single pitch or repeat a single rhythm. Rhythmically, it should contrast with the subject: if the subject moves in long notes, the countersubject should move in shorter values, and vice versa, so that the two voices provide a rhythmic texture of complementary activity. Harmonically, it must be compatible with the subject at every point of their combination, producing no forbidden parallels in any of the invertible permutations in which it will appear. Motivically, it may derive some of its material from the subject — creating an internal unity that ties the two themes together — or it may be completely contrasting, providing variety.
7.4 The Codetta and Its Motivic Function
Between the subject and the answer in the fugal exposition, a brief connecting passage — the codetta — may appear. The codetta is not merely a harmonic connector; it is a motivic opportunity, a place where the composer can introduce episodic material that will be developed later in the fugue’s middle section. A well-designed codetta thus serves a double function: it manages the harmonic transition from tonic to dominant (or dominant back to tonic) in preparation for the next entry, and it introduces motivic cells that will be recognizable and meaningful when they reappear in the episodes.
The motivic relationship between codetta and episode is one of the most elegant aspects of fugal architecture. When the codetta’s motivic material reappears in the episodes of the middle section — transformed, sequenced, inverted — the listener who has been attentive to the codetta recognizes the material as familiar, as “from” the piece, even though the subject itself is absent. This creates a sense of motivic economy and coherence: the fugue generates all its material from a very small number of melodic ideas, and even the transitional passages contribute thematically to the whole.
7.5 The Exposition
The exposition is the opening section of the fugue, in which each voice enters in turn with either the subject or the answer. In a four-voice fugue, the exposition presents four entries: subject — answer — subject — answer, alternating between tonic and dominant key presentations until all voices have entered. The order of voice entries is typically from highest to lowest (soprano–alto–tenor–bass) or lowest to highest, though many other orderings occur in Bach.
The exposition establishes three things simultaneously: the subject’s melodic and rhythmic identity, the harmonic polarity of tonic and dominant that will structure the entire fugue, and the contrapuntal relationships (subject with countersubject, subject with free counterpoint) that will be developed in the middle section. An exposition that accomplishes all three of these with clarity and economy — that presents its material distinctly yet without over-exposition, that creates the expectation of development without yet fulfilling it — is one of the most satisfying structural achievements in all of music.
7.6 Middle Entries, Episodes, and Sequential Development
After the exposition, the fugue enters its middle section — the longest and structurally most varied part of the form. The middle section alternates between entries of the subject (in various keys and voices) and episodes — developmental passages in which the subject itself is absent but in which its motivic material is developed through sequence, imitation, and invertible counterpoint.
Sequential episodes are the primary vehicle of harmonic development in the middle of a fugue. A two-bar motivic pattern is stated and then repeated immediately a step or third higher or lower, and the process is repeated several times, traversing harmonic territory efficiently and with a sense of logical inevitability. The sequential episode exploits the invertible counterpoint of the episodic motives: as the sequence progresses, the motives exchange registral positions, presenting the same material in successively different configurations. A well-constructed episode sequence can modulate from the relative minor to the subdominant in the space of four or six bars without a single arbitrary harmonic move — every chord follows from the sequence by voice-leading necessity.
The formal plan of a fugue’s middle section is not arbitrary but follows a characteristic tonal itinerary: from the tonic, to the dominant (or, in a minor-mode fugue, to the relative major), then to the subdominant or supertonic minor, and back to the tonic. This itinerary traces a large-scale tonic–dominant–subdominant–tonic progression at the level of the fugue’s key areas, mirroring the tonic–dominant–subdominant–tonic chord progressions that operate at the local harmonic level within each phrase. The fugue’s tonal plan is, in a sense, a macrocosm of its harmonic syntax — the same formal logic operating at a vastly expanded time scale. This self-similarity between harmonic gesture at the note level and harmonic gesture at the sectional level is one of the most intellectually satisfying features of Baroque fugal architecture.
7.7 Stretto, Final Entry, and Coda
Stretto — the overlapping of subject entries in quick succession — is the most powerful device of fugal development. Its effect is one of mounting urgency: the subject seems to pursue itself, entries crowding upon each other in rhythmic and motivic compression until the texture reaches a point of maximum density that demands the release of the final entry.
The final entry of a fugue — the tonic return of the subject in (typically) the bass voice, often in augmentation, sometimes accompanied by a tonic pedal point — is one of the great moments of musical architecture. Its arrival is felt as homecoming: the subject, which has been developed, inverted, subjected to stretto, and heard in remote key areas, returns at last to its original key and in its most emphatic registral location. Often Bach reinforces this arrival with a pedal point — a sustained bass note on the tonic — over which the upper voices continue their contrapuntal activity. The subject sounds above this pedal not as a modulating theme but as a tonic proclamation, its harmonic implications now resolved into a single, unambiguous tonal center.
7.8 Analyzing a Complete Fugue: The D Minor Fugue, WTC Book I, BWV 851
Walking through a complete fugue analysis — not just identifying its sections by name but tracing the logic of every structural decision from the first note to the last — is the most effective way to internalize the fugue’s formal logic. The D minor fugue from Book I of the Well-Tempered Clavier (BWV 851) is ideal for this purpose: it is a two-voice fugue, the simplest possible texture, yet it exhibits every essential feature of fugue construction with textbook clarity.
The answer (tonal) begins on A4 (the dominant of D minor is A minor), answers the ascending step figure, and descends stepwise back toward E. The countersubject — entering in the original voice as the answer sounds — moves primarily in contrary motion to the answer: while the answer descends, the countersubject tends to ascend, creating the contrary-motion independence of genuine two-voice counterpoint.
The exposition occupies measures 1–8 (approximately): subject in the right hand, answer (with countersubject below) in measures 1–4; then the answer continues while the right hand answers with the subject in the dominant — wait: in a two-voice fugue, there are only two voices, so the exposition consists of two entries only: subject in voice 1 (measures 1–4), answer in voice 2 (measures 5–8) with the countersubject in voice 1.
After the exposition, episodes constructed from the subject’s opening leap and stepwise descent motif traverse F major, A minor, and C major before the final entry returns the subject to D minor — in the low register of the left hand, with the right hand providing the countersubject above it in its inverted position. The final entry is preceded by a dominant pedal (two measures of sustained A in the bass), which creates the tension whose release is the formal goal of the entire fugue. The cadence — a full authentic cadence in D minor, the bass moving from A to D while the soprano descends from E to D — arrives with the force of a physical impact, made all the more decisive by the sustained dominant pedal that preceded it.
This analysis reveals a structural principle that holds throughout the Well-Tempered Clavier: the exposition presents the subject in its “natural” harmonic context (tonic and dominant); the middle section develops the subject in related key areas (relative major, subdominant, supertonic minor); and the final entry restores the subject to the tonic with an additional degree of harmonic weight — the pedal point, or the additional voices entering above or below — that makes the return feel like more than a mere repetition. The fugue has not simply returned to its starting point; it has arrived at its destination.
Chapter 8: Counterpoint After Bach
8.1 The Well-Tempered Clavier as Compendium
Before examining counterpoint after Bach, it is worth pausing to appreciate the extraordinary scope of the Well-Tempered Clavier as a compendium of fugal and contrapuntal technique. The forty-eight preludes and fugues of Books I and II (composed and revised between 1722 and approximately 1742) are not merely a collection of pedagogical exercises but an encyclopedic survey of the possibilities of the fugue genre at a single historical moment. Each fugue demonstrates a different combination of formal features — different subject types, different numbers of voices (two, three, and four), different degrees of motivic complexity, different relationships between subject and countersubject, different approaches to the middle-section episodes and the final return — and the variety is so complete that the entire repertoire can be read as a systematic demonstration that the fugue form is not a rigid template but a flexible architecture capable of infinite variety within a shared structural logic.
8.2 Counterpoint in the Classical Era
Bach died in 1750, and the style he brought to its supreme expression — the High Baroque polyphonic manner of interlocking contrapuntal lines governed by species rules and harmonic sequence — was already beginning to seem archaic to his contemporaries. The Classical era, dominated by Haydn and Mozart, adopted a predominantly homophonic texture in which melody and accompaniment replaced the polyphonic equality of voices. Yet counterpoint did not disappear; it was absorbed, transformed, and deployed with calculated expressive effect within the new formal framework of sonata form, the string quartet, and the symphony.
Mozart’s study of counterpoint was intensive and deeply felt. Under his father Leopold’s direction he worked through Fux’s Gradus as a child; later, in Vienna in 1782, he encountered the music of J.S. Bach through Baron van Swieten’s private concerts and was electrified. He immediately began composing his own fugues for string quartet (the Fugue in C minor, K. 426, later arranged for strings as K. 546) and arranging Bach’s keyboard fugues for string quartet as pedagogical exercises. The counterpoint in his late works — the Kyrie of the Requiem, the overture to The Magic Flute with its Bachian three-voice fugue, the finale of the Jupiter Symphony — reflects a mature assimilation of the Baroque contrapuntal tradition into the Classical formal language.
8.3 Romantic Counterpoint: Brahms and Wagner
The Romantic era brought a paradoxical relationship with counterpoint. On the surface, the massive orchestral and operatic textures of Berlioz, Liszt, and early Wagner emphasize harmonic colour, tonal ambiguity, and expressive extremity over contrapuntal logic. Yet the greatest composers of the period were deeply learned contrapuntists whose surface effects rested on rigorous polyphonic foundations. The contradiction resolves when one understands that Romantic counterpoint works not by the strict interval-by-interval logic of the Baroque but by harmonic voice-leading: the voices are guided by the logic of functional harmony operating at the level of the phrase rather than the beat.
Wagner’s relationship with counterpoint is more idiosyncratic and more rhetorical. His enormous orchestral textures are frequently imitative without being strictly fugal; the leitmotif system of his later operas creates a kind of associative counterpoint in which several dramatically charged themes are combined simultaneously, their combination generating new dramatic meaning beyond what any single theme carries alone. This is not species counterpoint; it is not even invertible in the strict sense; but it is profoundly contrapuntal in spirit, governed by the principle that simultaneous voices create meaning through their relationship that neither could create alone.
8.4 Counterpoint and the String Quartet
The classical string quartet — two violins, viola, and cello — is the instrumental medium that most closely approximates the four-voice SATB texture of the chorale, and from Haydn through Bartók it has been the primary laboratory for instrumental counterpoint outside the keyboard. The four instruments of the string quartet each occupy a distinct registral stratum (first violin as soprano, second violin as alto, viola as tenor, cello as bass), and the independence of each instrument — the fact that each can sustain a melodic line of its own, that none is relegated to mere accompaniment by the nature of the instrument itself — makes the quartet the ideal medium for genuine four-voice polyphonic writing.
The development of the string quartet in the early Classical era thus runs parallel to the growing importance of counterpoint as a compositional value in the learned style. As the quartet became increasingly important as a chamber genre, composers invested more contrapuntal sophistication in it: Haydn’s later quartets (Op. 33, Op. 64, Op. 76) contain increasingly rich imitative textures; Mozart’s “Haydn” quartets (K. 387, K. 421, K. 428, K. 458, K. 464, K. 465) include fugal finales and extensively imitative development sections; and Beethoven’s late quartets (Op. 127, Op. 130, Op. 131, Op. 132, Op. 135) push the string quartet to the limits of what four independent voices can achieve, with movements in strict fugal form, sections of invertible counterpoint at the twelfth, and a pervasive contrapuntal treatment of even the slow movement textures that goes far beyond what any earlier quartet had attempted.
8.5 Twentieth-Century Neoclassical Counterpoint
The early twentieth century saw a deliberate reaction against the chromatic excess of late Romanticism in the form of the neoclassical movement, which sought to reclaim the clarity, formal discipline, and contrapuntal rigour of the eighteenth century. The principal figures of this movement — Stravinsky, Hindemith, and Bartók — approached counterpoint from very different aesthetic starting points, yet all three revitalized the polyphonic tradition by subjecting it to the harmonic and rhythmic innovations of their era. The result was a counterpoint that retained the formal architecture of the Baroque while speaking an unmistakably modern harmonic language.
The neoclassical moment raises an interesting question: why should the fugue form, or the canon, or the species rules, survive the transition to a completely different harmonic language? What is it about the formal architecture of the fugue that makes it applicable to Hindemith’s “tonality” based on the overtone series, or to Bartók’s folk-inflected modality, or to Stravinsky’s bitonally inflected diatonicism? The answer, one comes to appreciate through the study of the species system, is that the fugue’s architecture is not dependent on the specific harmonic language that fills it. The fugue form is a temporal organization — subject entries, episodes, stretto, final entry — that organizes motivic and harmonic material in time according to a logic of departure and return, tension and resolution, that is not specific to any particular harmonic system. It is, in this sense, a meta-musical form: an architecture for the organization of whatever musical language one chooses to deploy within it. The neoclassicists understood this intuitively, and their fugues demonstrate it practically: the form remains recognizable and structurally coherent even when the harmonic vocabulary inside it is entirely non-Baroque.
Stravinsky’s neoclassical counterpoint — particularly in the Symphony of Psalms (1930), the Octet for wind instruments (1923), and the Piano Concerto (1924) — is characterized by a deliberate angularity and rhythmic irregularity that reflects both his Russian background and his engagement with jazz and popular music. The Symphony of Psalms contains a double fugue in which two subjects — one angular and chromatic, one flowing and diatonic — are first presented separately and then combined in invertible counterpoint. The combination is a tour de force: the two subjects, apparently heterogeneous in character, fit together with a harmonic precision that reveals how carefully their interval structures were engineered for mutual compatibility.
The neoclassical return to counterpoint in the early twentieth century was not merely nostalgic; it was programmatic. The composers of the neoclassical movement understood — or at any rate believed — that the clarity, economy, and formal precision of the Baroque polyphonic style offered a corrective to what they perceived as the excess, bombast, and structural incoherence of late Romanticism. Counterpoint, in this view, was a form of musical discipline — a set of structural constraints that forced the composer to think carefully about every note, to make every melodic choice consequential, to achieve expressive power through economy rather than excess. The neoclassical movement is, in this sense, a musical manifesto for the ethical virtues of constraint: the belief that rules, properly understood and rigorously applied, liberate rather than restrict the musical imagination.
8.5 Serial Counterpoint and the Second Viennese School
The Second Viennese School — Schoenberg, Webern, and Berg — developed a contrapuntal system in which the organizing principle is not the consonance and dissonance hierarchy of species counterpoint but the twelve-tone row and its transformations. This serial counterpoint preserves the formal architecture of polyphony — multiple independent voices, motivic development, imitative entries — while replacing the tonal harmonic framework with a serial one.
Webern’s serial counterpoint achieves an extraordinary density of structural organization within astonishing brevity. His Symphony Op. 21 — which lasts approximately ten minutes — is constructed from a twelve-tone row of remarkable symmetry (the second hexachord is the retrograde of the first, so that the row sounds like itself backward), and every detail of the two-movement work is derived from this row and its transformations through canonic imitation. The second movement is a set of variations, each of which is a different canonic arrangement of the row — at different time intervals, different pitch intervals, and with different voice assignments. The result is a microscopic fugal universe, every note structurally determined, yet the surface impression is of crystalline sonic beauty without apparent effort.
8.6 Minimalism, Spectralism, and Contemporary Polyphony
The second half of the twentieth century and the early twenty-first century have seen a remarkable diversification of approaches to polyphonic writing. Minimalism, spectralism, and the various post-tonal and post-serial practices of the late twentieth century all engage, in their different ways, with the fundamental contrapuntal questions of voice independence, melodic identity, and the management of tension and release.
The study of counterpoint is never finished. Every major development in Western music from the Renaissance motet through the spectral orchestra has engaged with the fundamental questions that counterpoint asks: How do independent melodic voices coexist? How is tension created and released between simultaneous voices? How can a single melodic idea generate an entire polyphonic universe through repetition, imitation, and transformation? The answers differ radically across style, era, and aesthetic philosophy; the questions remain constant. To study counterpoint is not merely to learn a historical technique but to engage with the deepest questions about the nature of musical time, musical memory, and the relationship between logic and beauty in organized sound.
8.7 Imitative Polyphony in Sacred and Secular Choral Music
One domain of counterpoint that the present course has touched on only briefly deserves more explicit attention: the choral polyphony of the seventeenth and eighteenth centuries, in which the full resources of the polyphonic tradition were deployed for large vocal forces in both sacred and secular contexts. Handel’s oratorio choruses — particularly those of Messiah (1741) — represent an extraordinary synthesis of Baroque counterpoint and dramatic choral writing. The famous “Hallelujah” chorus deploys a fugal subject (“King of Kings and Lord of Lords”) in combination with homophonic choral outbursts (“Hallelujah”), alternating between contrapuntal and chordal textures to create a formal arch of mounting intensity. The fugal passages follow the species rules precisely; the homophonic passages provide rhythmic and harmonic contrast; and the alternation between the two textures is the primary formal device of the movement.
8.8 Counterpoint as Analytical Tool
Beyond its role as a compositional discipline, counterpoint provides an indispensable analytical lens for understanding music of any style. The analyst who can identify the contrapuntal skeleton of a complex orchestral passage — tracing the fundamental two-voice framework beneath layers of orchestration and harmonic elaboration — understands the music at a structural depth that harmonic analysis alone cannot reach. Heinrich Schenker’s analytical method, the most influential analytical theory of the twentieth century, is fundamentally a theory of counterpoint: it reads tonal music as an elaborated two-voice framework (the Ursatz) in which the soprano descends from an initial scale degree (\(\hat{3}\), \(\hat{5}\), or \(\hat{8}\)) to the tonic while the bass moves from tonic to dominant and back.
- The Urlinie (fundamental melodic line): a stepwise descent in the soprano from a structural upper voice note — the Kopfton (head tone), which is \(\hat{3}\), \(\hat{5}\), or \(\hat{8}\) — to \(\hat{1}\). This descent is the soprano's contribution to the Ursatz.
- The Bassbrechung (bass arpeggiation): a bass motion from \(\hat{1}\) (tonic) to \(\hat{5}\) (dominant) and back to \(\hat{1}\) (tonic), outlining the fundamental harmonic progression of the composition.
Schenker’s theory is controversial in its details but profound in its central claim: that tonal music is, at its deepest level, two-voice counterpoint. Every tonal composition from Bach to Brahms, in this view, is an elaboration of the two-voice framework that the species system teaches. To study counterpoint is to study the grammar of which tonal composition is the literature — the rules of the language in which the masterworks of Western music are written.
8.9 Counterpoint in the Twentieth-Century Orchestra
The orchestral writing of the twentieth century — in composers as different as Bartók, Shostakovich, Prokofiev, and Ligeti — frequently deploys contrapuntal textures that trace their lineage directly to the Renaissance and Baroque tradition even while using an entirely different harmonic language. Understanding these textures requires the same analytical tools as understanding a Bach fugue: the ability to isolate individual voices, trace their melodic lines, identify their harmonic relationships, and understand how tension and release are managed across time.
The relationship between Ligeti’s micropolyphony and species counterpoint is not merely historical. Ligeti himself acknowledged his debt to Renaissance polyphony, and the analytical tools of species counterpoint — interval classification, dissonance treatment, voice independence — apply, in modified form, even to micropolyphonic textures. In a dense canonic mass of twenty-four voices, the “interval” between voices must be understood statistically (what is the distribution of intervals between any randomly chosen pair of voices at any moment?) rather than individually, but the principle that the distribution of intervals affects the perceived acoustic roughness or smoothness of the texture is directly traceable to the interval-consonance hierarchy of Definition 1.1.
The student who has worked through the species exercises, composed two-voice inventions, analyzed fugues from the Well-Tempered Clavier, and studied the counterpoint of Bach’s successors possesses something more valuable than a set of historical techniques. They possess a training in musical perception — a cultivated sensitivity to the independence of simultaneous voices, to the direction of harmonic tension, to the logic of melodic gesture — that is applicable to any musical style they will encounter or compose. This sensitivity, once developed, does not fade; it persists as a permanent change in the way one listens to music, hears harmonic motion, and thinks about the relationship between melody and harmony, between individual voice and collective texture, between the note-by-note and the phrase-by-phrase and the movement-by-movement. It is, in the fullest sense, a musical education — not an education in a style, or a period, or a technique, but an education in musical thought itself. The rules of strict counterpoint are eventually internalized and transcended; the sensitivity they develop is permanent. As Fux wrote at the conclusion of the Gradus ad Parnassum, addressing his fictional student Josephus: you have now arrived at the summit; but remember that the rules are not ends in themselves, they are the discipline through which musical imagination acquires the precision and clarity it needs to speak with power. Whether one writes a Renaissance motet, a Baroque fugue, a Romantic symphony, or a minimalist phase piece, this discipline remains the foundation. The punctus contra punctum — note against note, voice against voice, musical idea against musical idea — is the irreducible core of the compositional art, and its study is the study of musical thought itself.
8.9 Writing the Fugue Exposition: A Step-by-Step Method
Composing a fugue exposition — the first major compositional task in the study of fugue — is best approached through a systematic procedure that addresses the interrelated decisions of subject design, answer type, and countersubject construction in a logical order. The following method reflects standard practice at major conservatories and music schools.
First, design the subject with stretto potential and motivic fertility in mind. Write the subject without any harmonic accompaniment and sing it aloud repeatedly until its melodic character is vivid and distinctive. Ask: does this subject have a rhythmic profile that will be recognizable in inversion and augmentation? Does it begin and end in a way that makes the tonal answer straightforward? Does it contain an internal motivic cell — a short rhythmic or melodic figure of two or three notes — that can be sequenced in an episode?
Second, determine whether the answer should be real or tonal. If the subject begins on \(\hat{1}\) and ascends to \(\hat{5}\), or if \(\hat{5}\) appears prominently in the first measure, the answer should be tonal: compress the opening ascending fifth to an ascending fourth, then continue with the literal transposition. If the subject begins on \(\hat{3}\) or \(\hat{5}\) and does not prominently feature the \(\hat{1}–\hat{5}\) ascent, a real answer (literal transposition to the dominant) is likely appropriate.
Third, compose the countersubject: write it simultaneously with the answer, treating the combination of answer and countersubject as a first-species exercise (ensure all strong-beat intervals are consonant) with added passing tones and suspensions as the musical character demands. The countersubject should be rhythmically complementary to the subject — if the subject moves primarily in eighth notes, the countersubject should have some longer notes; if the subject opens with a long note, the countersubject may begin with faster motion.
Fourth, verify the invertibility of subject and countersubject by transposing the countersubject up an octave and the subject down an octave, mentally, and checking that the combination still forms correct counterpoint. Any interval of a fifth between them in the original will become a fourth in the inversion; check that this fourth is not on a strong beat in an unresolved position.
Finally, write the full exposition: subject alone (measure 1–subject length), answer with countersubject (overlapping the subject’s final bars), subject with countersubject in the inversion (if three voices; second countersubject in first voice if four voices), and final answer with all material combined. Verify all parallel fifths and octaves between every pair of voices at every beat.
8.10 Conclusion: Counterpoint as the Grammar of Musical Thought
The eight chapters of this course have traced the development of contrapuntal technique from its most elementary form — the interval classification of Definition 1.1 — through the species system, two-voice tonal counterpoint, four-part chorale writing, invertible counterpoint and canon, and the fugue, into the twentieth and twenty-first century’s transformations and extensions of the polyphonic tradition. At each stage, the same fundamental principles have recurred: voice independence, dissonance as controlled tension requiring resolution, consonance as stability and arrival, the long-range management of harmonic tension from opening to cadence.
The remarkable persistence of these principles across five centuries of Western music is not a historical accident. It reflects the fact that the principles of counterpoint are grounded in the psychology of auditory perception — in the way the human auditory system organizes simultaneous sounds into streams, detects patterns of tension and release, and experiences musical time as a structured unfolding rather than a formless succession of sounds. As long as humans hear music with the same ears and the same auditory cortex that Renaissance singers and Baroque keyboard players used, the principles of counterpoint will remain relevant — not as historical curiosities, but as the grammar of musical thought itself.
8.12 Practical Counterpoint Exercises for the Advanced Student
The following exercises, drawn from the pedagogical traditions of the Yale School of Music and the Eastman School of Music, represent a progressive curriculum for the student who has completed the species exercises and is ready to engage with free tonal counterpoint and fugue composition.
- Analysis of Bach Two-Part Inventions: Analyze all fifteen inventions, identifying the subject, its imitative entries, the episodes and their sequence technique, the modulation plan, and the cadences. Write a one-page formal diagram for each invention showing the formal sections, the key areas, and the structural events.
- Composition of a Two-Part Invention: Compose a complete two-part invention in the style of Bach, approximately 30–50 measures, in a major or minor key, using imitation, sequence, invertible counterpoint, and a clear three-part formal structure (tonic, development, return).
- Analysis of Bach Chorale Harmonizations: Analyze twenty chorale harmonizations from the Riemenschneider collection, identifying all non-harmonic tones, all cadential formulas, and all examples of chromaticism. Pay particular attention to the outer-voice counterpoint and verify that it passes the two-voice consonance test in each case.
- Composition of a Four-Part Chorale: Harmonize a given chorale melody in the style of Bach, using four-part SATB texture, following all the rules of voice-leading and doubling, and employing the full range of non-harmonic tones available in the species system.
- Analysis of WTC Fugues: Analyze five fugues from the Well-Tempered Clavier (at least two from each book), identifying subject, answer (real or tonal, with explanation), countersubject(s), exposition, middle entries, episodes, stretto (if present), and final entry/coda. Explain the harmonic plan of each fugue and trace the motivic development of the episode material.
- Composition of a Two-Voice Fugue: Compose a complete two-voice fugue exposition (subject, answer, and countersubject) and one episode, approximately 20–30 measures total. The subject should be designed for stretto at at least one time interval.
These exercises are not merely technical drills; they are the mechanism by which abstract knowledge becomes compositional intuition. The distinction between knowing the rules of counterpoint and being able to compose in counterpoint is the distinction between reading a grammar book and speaking a language fluently. The exercises — particularly the composition exercises — are the practice that converts knowledge into fluency, theory into instinct, rules into music.
A student who has completed all six exercises in the sequence described above will have composed a two-part invention, a four-voice chorale, and a fugue exposition. These three compositions, however modest in scale, represent genuine exercises of the compositional art. They are not transcriptions or arrangements; they are original compositions in historical styles, and the process of composing them — the struggle to satisfy multiple simultaneous constraints, the discovery of solutions that are both correct and musical, the experience of hearing one’s own counterpoint and judging it as music rather than as a rule-following exercise — is irreplaceable as a mode of musical education. A student who has analyzed twenty Bach chorales and composed one in the style of Bach understands those chorales at a depth that no amount of passive listening or theoretical reading can achieve. A student who has composed a fugue subject — who has struggled with the constraints that stretto potential places on melodic freedom, who has discovered that not every melody one might write will yield an interesting fugue — understands the Well-Tempered Clavier as Bach’s achievement in a way that analysis alone cannot convey. Counterpoint, finally, is learned by doing. The notes on the page are a starting point; the music in the hand is the destination.
The exploration of counterpoint, from the most elementary species exercise to the most complex fugue analysis, is ultimately an exploration of the musical mind — of how it organizes simultaneous sounds, creates expectation, builds and releases tension, and achieves the sense of inevitability in what is in fact a complex, multiply constrained compositional object. Every rule of counterpoint encodes a fact about musical perception; every exercise develops a capacity for musical thought that no other discipline can provide in quite the same way. The punctus contra punctum is not a historical artifact; it is the living grammar of musical intelligence.
8.10 The Fugue in Popular Music and Jazz
The influence of the contrapuntal tradition is not confined to the classical canon. Jazz improvisation, at its highest levels, involves real-time species counterpoint: the improviser must simultaneously manage the horizontal melodic line, the vertical harmonic implication against the chord changes, the rhythmic interaction with the rhythm section, and the motivic development of the improvisation’s own material. Charlie Parker’s bebop lines — fast, harmonically dense, rhythmically intricate — are the product of an internalized contrapuntal training that is informal but no less rigorous than Fux’s species exercises. Parker’s lines move through chord changes the way a fifth-species counterpoint moves through a cantus firmus: always consonant (or appropriately dissonant) against the underlying harmony, always melodically coherent in themselves, always pointing purposefully toward the next structural downbeat.
The influence of the contrapuntal tradition on popular and jazz music also extends to the technique of countermelody — a second melody added to a primary melody, each independent enough to be satisfying alone, yet combined to create a richer whole. The countermelody is essentially first-species counterpoint applied to popular song form: the primary melody serves as the cantus firmus, and the countermelody is the added voice. Great arrangers — Gil Evans’s arrangements for Miles Davis, Oliver Nelson’s orchestrations for jazz ensembles, Brian Wilson’s Beach Boys arrangements — consistently employ countermelodies that respect the interval-independence principles of species counterpoint: moving in contrary motion to the primary melody, avoiding parallel octaves and fifths, and maintaining their own melodic coherence when heard alone. The species principles persist, unnamed and untheorized, in every domain of sophisticated musical practice.
The broader lesson is that contrapuntal thinking — the coordination of independent voices, the management of tension and release, the use of imitation to create coherence from melodic fragments — is not a style but a cognitive mode. It appears wherever music of genuine complexity is created, regardless of cultural context, and its principles, once internalized, are applicable to any musical situation. The student who has completed the exercises of this course is not equipped merely to compose in the style of Palestrina or Bach; they are equipped to think musically in the deepest sense, to hear any music — whether it is a Renaissance motet, a Bach fugue, a Brahms symphony, a Charlie Parker improvisation, or a Steve Reich phase piece — as a play of independent voices in temporal space, each with its own identity, each contributing to a whole that none could achieve alone. This is what counterpoint teaches, and it is, as Fux’s student Josephus learned, both an art and a discipline: the highest discipline, in the service of the most human of all arts.