MUSIC 678: Music Theory Pedagogy

Estimated study time: 2 hr 8 min

Table of contents

These notes draw on Jennifer Snodgrass (ed.) Teaching Music Theory: New Voices and Approaches (2020), Michael Rogers’s Teaching Approaches in Music Theory: An Overview of Pedagogical Philosophies (2nd ed., 2004), Philip Duker, Anna Gawboy, Bryn Hughes, and Kris Shaffer (eds.) The Routledge Companion to Music Theory Pedagogy (2020), Gary Karpinski’s Aural Skills Acquisition: The Development of Listening, Reading, and Performing Skills in College-Level Musicians (2000), and supplementary materials from Eastman School of Music graduate pedagogy certificate, Peabody Conservatory MM in Theory Pedagogy, and University of North Texas graduate theory teaching practicum.


Chapter 1: Why Pedagogy Matters in Music Theory

1.1 The Theory Classroom as a Site of Musical Formation

The undergraduate music theory classroom is not a neutral space. Students who pass through a two-year or four-year theory sequence emerge with transformed ears and transformed minds. They hear phrase structures where they once heard only melody; they perceive voice-leading motion where they once heard chords; they track formal proportions where they once followed only feeling. This transformation is irreversible in the best sense: once a listener has learned to hear a deceptive cadence as a syntactic surprise, she cannot easily un-hear the expectation that preceded it.

The stakes of theory pedagogy are therefore high. A well-designed course does not merely teach students to spell seventh chords or label Roman numerals. It equips them to engage with music at a level of structural awareness that will shape their performing, composing, and listening for the rest of their professional lives. A poorly designed course can do the opposite: it can convince students that music theory is an arbitrary set of rules unconnected to anything they actually care about, producing graduates who regard theory as an obstacle they survived rather than a resource they carry.

Because most professional musicians take at least some theory as undergraduates — regardless of whether they are performers, composers, conductors, music educators, or music therapists — the pedagogy of music theory shapes the musical culture of an entire profession. Decisions made in curriculum committees, textbook writing rooms, and individual instructors’ lesson plans ripple outward through the careers of every student who passes through those gates.

1.2 The Gap Between Research and Teaching

There is a persistent and instructive gap between what music theorists know and what music theory teachers do. The research frontier of professional music theory — the literature published in Music Theory Spectrum, Journal of Music Theory, and Music Theory Online — covers transformational theory (David Lewin’s algebraic models of musical motion), neo-Riemannian analysis (the geometric spaces of Douthett and Steinbach, Richard Cohn’s hexatonic systems), spectralism and inharmonic tuning systems, metric theory, and the formal analysis of popular music.

The undergraduate core curriculum taught at the overwhelming majority of American music programs, however, looks quite different. First-year students learn interval spelling, diatonic scales, and basic rhythm. Second-year students learn functional harmony, species counterpoint, and voice-leading in four-part chorale style. Third-year students encounter chromatic harmony — modal mixture, secondary dominants, the Neapolitan and augmented-sixth chords — and may begin an introduction to post-tonal theory. The overlap between this curriculum and the research frontier is thin.

This gap is not simply a matter of teachers being behind the times. The undergraduate curriculum serves a different population with different goals: it prepares musicians for professional practice, not for music-theoretical research. The more interesting question — and the one this course addresses — is whether the pedagogical approaches and cognitive insights developed in the research community can be brought to bear on the teaching of this very different curriculum.

Remark 1.1. The gap between research and teaching is not unique to music theory — it characterizes many academic disciplines. But it has a particular salience in music theory because the discipline's self-identity is bound up with the idea that theoretical knowledge is directly relevant to musical practice. A music theorist who cannot explain why her research matters for how students perform or compose must either revise that claim or acknowledge a degree of insulation from practical musical concerns that is difficult to sustain when facing a room of frustrated first-year musicians asking "why do I need to know this?"

1.3 Historical Overview: From Zarlino to the American University

The history of music theory pedagogy stretches from antiquity to the present, but several turning points are especially instructive for understanding the modern American curriculum. Gioseffo Zarlino (1517–1590), the great Venetian theorist and chapel master of San Marco, transmitted his knowledge of counterpoint and modal theory primarily through an apprentice system: a gifted student worked closely with a master, absorbing both the rules of the craft and the musical judgment needed to apply them well. Zarlino’s Le istitutioni harmoniche (1558) represents the first systematic attempt to ground contrapuntal practice in a comprehensive theoretical framework.

Johann Joseph Fux’s Gradus ad Parnassum (1725), discussed at length in Chapter 5, marks the founding moment of the species counterpoint tradition that still shapes pedagogy today. The conservatory system of the nineteenth century — Paris (founded 1795), Prague (1811), Vienna (1817) — institutionalized music theory as a required subject for all trained musicians, not just future theorists. Students at these conservatories learned harmony through figured bass, counterpoint through species and imitation, and form through analysis of canonical works. This conservatory model was transplanted to American institutions as European émigré musicians and scholars founded or shaped the great American conservatories and university music departments of the late nineteenth and early twentieth centuries.

The American university music theory requirement took its characteristic shape in the mid-twentieth century. The National Association of Schools of Music (NASM) established accreditation standards that drove toward standardization across programs. By the 1970s and 1980s, a generation of widely adopted textbooks — Paul Aldwell and Carl Schachter’s Harmony and Voice Leading (1978), Stefan Kostka and Dorothy Payne’s Tonal Harmony (1984), Robert Gauldin’s Harmonic Practice in Tonal Music (1997) — had crystallized a relatively stable core curriculum centered on the Bach chorale as the primary pedagogical object.

The influence of the European émigré generation on American theory pedagogy deserves special mention. Scholars such as Felix Salzer (whose Structural Hearing, 1952, brought Schenker’s analytical method to American audiences), Hans Tischler, Paul Pisk, and Karl Geiringer brought with them the traditions of Viennese music theory — including both the conservatory harmony tradition and the Schenkerian analytical approach — that had been developing in German-speaking countries since the nineteenth century. The synthesis they created — a core curriculum grounded in functional harmony and species counterpoint, supplemented at the upper levels by Schenkerian analysis — is recognizably the curriculum still taught in most American theory programs today. Understanding this genealogy helps theory teachers appreciate both the strengths of the tradition they have inherited and its historical contingency: the American undergraduate theory sequence is not a neutral transmission of universal truths about music but a particular historical formation, shaped by specific people, institutions, and intellectual traditions.

1.4 The Music Theory Wars

The 1990s and 2000s saw a significant public debate within the music theory community over the content and purpose of the undergraduate core — a debate sometimes called the music theory wars. On one side stood advocates for the synthetic-tonal approach: a curriculum organized around a unified theory of tonal syntax, using Roman numeral analysis to describe harmonic function across a broad historical period (roughly 1600–1900). This approach, exemplified by Kostka and Payne’s textbook and associated with the functional-harmony tradition flowing from Rameau through Riemann, treats the chorale-style part-writing exercise as the central pedagogical tool for developing harmonic intuition.

On the other side stood advocates for historically specific approaches. Rather than teaching a synthetic theory invented in the nineteenth century and applied retroactively to music of the seventeenth through early twentieth centuries, why not teach the actual theoretical frameworks composers and musicians of each period used? This meant teaching thoroughbass realization for Baroque music, galant schemata for Classical style, and so forth. The historically informed pedagogy movement argued that the synthetic Roman-numeral approach distorts students’ understanding by forcing diverse musics into a single Procrustean framework.

A related debate concerned species counterpoint: is it a sound pedagogical tool for teaching voice-leading because it isolates rhythmic complexity and builds competence progressively, or is it a museum piece teaching rules that were never actually practiced by composers in the forms Fux codified them? This debate has never been fully resolved, and its tensions continue to animate discussions about curriculum design.

A third strand of the music theory wars concerned the relationship between theory and analysis. Some theorists argued that the primary goal of the undergraduate curriculum should be to train analytical readers — students who can read a score intelligently, recognize structural patterns, and articulate musical arguments about the works they perform and study. Others argued that the primary goal should be compositional fluency — the ability to write in the styles studied, to realize a figured bass, to harmonize a melody in the manner of Brahms. Still others argued for the centrality of aural skills: theory is, at its root, a discipline of the ear, and no amount of score analysis or part-writing skill amounts to genuine musical understanding if it is not grounded in acute and flexible listening. These three positions — analytical, compositional, and aural — are not mutually exclusive, but they do imply different priorities in curriculum design, different assessment practices, and different answers to the question of what a student should be able to do at the end of a theory sequence.

Remark 1.2. The music theory wars produced more heat than light in some of their manifestations, but they also forced a productive clarification of values and goals. The most enduring legacy of the debates is the widespread recognition — now nearly universal in the professional community — that the question "what should be in the undergraduate theory core?" cannot be answered without first answering the question "what is the undergraduate theory core for?" And answering that question requires engaging seriously with educational goals, student populations, and the relationship between theory and musical practice — precisely the questions that this course addresses.

1.5 The SMT and Professional Organizations

The SMT (Society for Music Theory), founded in 1977, is the primary professional organization for music theorists in North America. Within the SMT, the Pedagogy Interest Group has served since the 1980s as the professional home for scholars focused on questions of music theory teaching and learning. The journal Music Theory Online (MTO), the SMT’s open-access online journal, has published a substantial body of pedagogically focused research since its founding in 1993. The journal Journal of Music Theory Pedagogy, published at the University of Oklahoma, is devoted entirely to pedagogical scholarship in the field.

These venues matter because they legitimize pedagogical inquiry as a form of scholarship. In many academic settings, “mere teaching” is treated as less prestigious than research publication. The existence of peer-reviewed journals and professional organizations specifically devoted to pedagogy signals that thinking carefully about how to teach music theory is itself a form of serious intellectual work — one that requires empirical investigation, theoretical sophistication, and reflective engagement with educational research.

The Music Theory Pedagogy session at the annual SMT meeting — a regular feature of the conference program since the 1990s — provides a forum for scholars to present pedagogical innovations, share curricular experiments, and exchange practical insights about teaching. Papers presented in this session range from empirical studies of student learning outcomes to philosophical arguments about what theory education should aspire to be, and from reports on specific technological tools to reflections on classroom equity and inclusion. The diversity of approaches in this forum reflects the diversity of the field: there is no single right way to teach music theory, and the community of pedagogy scholars is correspondingly pluralistic.

1.6 Music Theory Pedagogy and Cognitive Science

Perhaps the most productive development in recent music theory pedagogy has been the growing engagement with cognitive science and music psychology. Research on music cognition — how listeners perceive and process musical structures — has direct implications for how theory should be taught. If listeners process tonal expectations implicitly and rapidly, what does that imply about the relationship between explicit theoretical knowledge and the intuitive tonal fluency that theory courses aim to reinforce?

The evidence on transfer — whether theoretical knowledge learned in an academic setting transfers to improved performance or listening outside that setting — remains mixed and contested. Some studies suggest that students who have completed theory courses demonstrate improved performance in sight-reading and harmonic anticipation tasks; others find that students cannot reliably apply theoretical concepts they can correctly define in the abstract to unfamiliar musical examples. This gap between declarative knowledge (knowing that the dominant seventh resolves to the tonic) and procedural competence (hearing a dominant seventh in a complex texture and predicting its resolution) is one of the central pedagogical puzzles of the field.

The most significant recent empirical study in this area is the longitudinal research by Johnson-Laird and colleagues on musical grammar induction, which suggests that listeners develop implicit probabilistic models of tonal structure through exposure alone, and that explicit theoretical instruction may accelerate and refine these implicit models rather than supplanting them. The instructional implication is that theory courses work best when they build on and make explicit what students already know implicitly — when the teacher’s role is articulation and refinement rather than construction from scratch.

The research program of music psychology has been closely relevant here. The work of Carol Krumhansl on the tonal hierarchy — the finding that trained and untrained Western listeners organize pitch space hierarchically around the tonic, with fifth-related scale degrees next most stable, and chromatically altered non-scale degrees least stable — provides empirical grounding for the functional harmony concepts that the theory curriculum takes as its organizing framework. Knowing that the tonal hierarchy is a product of Western musical enculturation rather than an acoustic universal is both humbling (it reminds us that other musical cultures organize pitch space differently) and instructive (it explains why functional tonal harmony is difficult for students who grew up in other tonal traditions to internalize at the level of reflex). The music psychology of Eugene Narmour, Leonard Meyer, and more recently David Huron (Sweet Anticipation, 2006) on musical expectation further enriches the cognitive foundation for tonal pedagogy: tonal music works, in part, by creating and then satisfying or defeating listener expectations, and teaching students to hear those expectations — to feel the pull of the leading tone, the openness of the half cadence, the surprise of the deceptive cadence — is teaching them to engage with music as a temporal, expectation-driven experience.

Example 1.1 (Tonal Expectation and Theoretical Knowledge). Consider a student who grew up listening to Western popular music. She has heard the I–V–vi–IV chord progression thousands of times in songs by Taylor Swift, Adele, and Coldplay, and her tonal ear expects the subdominant chord (IV) to follow the submediant (vi) in that progression. When she encounters the concept of a "deceptive cadence" in a theory course — a V chord resolving to vi rather than I — she is not learning that the vi chord can follow the dominant; her ear already knows this. She is learning the name for the phenomenon, its structural logic within the tonal system, and the analytical framework that allows her to identify it, describe it to others, and compose with it intentionally. This is the typical relationship between prior musical experience and theoretical instruction: theory names and systematizes what musical experience has already partially taught.

Chapter 2: Learning Theory for Music Theory Teachers

2.1 Constructivism

The dominant framework for thinking about learning in contemporary educational research is constructivism, associated primarily with the developmental psychologist Jean Piaget (1896–1980) and the socio-cultural theorist Lev Vygotsky (1896–1934). The central claim of constructivism is epistemological rather than merely pedagogical: students do not learn by passively receiving information transmitted from a teacher. They learn by actively constructing knowledge — building mental models, testing those models against new experiences, and revising them when the models fail to predict or explain what they encounter.

For music theory pedagogy, constructivism has a direct and powerful implication. Students will not develop genuine musical understanding simply by memorizing definitions of intervals, chord qualities, and voice-leading rules. They must engage actively with musical materials: listening to and analyzing actual musical passages, writing harmonic progressions and hearing how they sound, testing their ear-training intuitions against notated scores. The teacher’s role is not to transmit knowledge but to create structured encounters with musical material that give students’ mental models the friction they need to develop.

Piaget’s concept of schema is particularly useful here. A schema is a mental structure that organizes experience — a template for recognizing and responding to familiar situations. Students arrive in theory courses with existing schemas for music: they already have implicit knowledge of how tonal music works, developed through years of listening. The pedagogical task is to make those implicit schemas explicit and to refine and extend them. When a student who has listened to pop music for fifteen years encounters the concept of a dominant seventh chord resolving to a tonic, she is not learning something entirely new — she is articulating a pattern she has already internalized, giving it a name and a theoretical description.

Piaget’s complementary concepts of assimilation and accommodation are also useful here. Assimilation occurs when a new experience is incorporated into an existing schema without modifying the schema: a student who encounters the concept of a secondary dominant and files it under the general category of “dominant chord” is assimilating. Accommodation occurs when a new experience cannot be incorporated into an existing schema without modifying it — when the student must revise the schema to make room for the new concept. For example, a student who has learned that the dominant is always V7 in the home key must accommodate when she encounters a chord that functions as a dominant but is not V7 — perhaps the V/ii, which introduces a chromatically altered note that does not belong to the home key. Her schema for “dominant harmony” must expand to include the idea of a locally applied dominant. This accommodative revision is cognitively more demanding than assimilation, and it is precisely what genuine conceptual learning looks like.

Definition 2.3 (Assimilation and Accommodation). In Piagetian constructivism, assimilation is the cognitive process by which new experiences are incorporated into existing mental schemas without requiring modification of those schemas. Accommodation is the complementary process by which schemas are revised or new schemas are created in response to experiences that cannot be assimilated into existing structures. Genuine conceptual development in music theory — as opposed to rote memorization of labels — requires accommodation: the student's mental model of tonal harmony must be repeatedly revised and enlarged to encompass the full range of harmonic phenomena the curriculum introduces. A pedagogy that enables assimilation without requiring accommodation produces students who can label but not understand.

2.2 Zone of Proximal Development

Vygotsky’s most pedagogically influential concept is the Zone of Proximal Development (ZPD): the range between what a learner can accomplish independently and what she can accomplish with the support of a more knowledgeable other. Learning happens most effectively in this zone. Tasks that fall entirely within a student’s existing competence produce no learning — she already knows how to do them. Tasks that fall entirely beyond the ZPD produce frustration and withdrawal — she cannot make progress because the gap is too large. The pedagogical art lies in calibrating tasks and support so that students are always working at the upper edge of what they can manage — stretching, but not snapping.

Definition 2.1 (Zone of Proximal Development). The Zone of Proximal Development (ZPD), as formulated by Lev Vygotsky, is the distance between a learner's actual developmental level — what she can accomplish independently — and her potential developmental level — what she can accomplish under the guidance of a more knowledgeable peer or instructor. Effective pedagogy targets this zone, providing support (scaffolding) that enables the learner to accomplish tasks beyond her current independent reach, then systematically withdrawing that support as the learner's competence grows.

Scaffolding — the term Vygotsky’s followers, particularly Jerome Bruner, use for the support structures a teacher provides — takes many forms in the music theory classroom. In a counterpoint course, scaffolding might look like this: first, the instructor provides a complete cantus firmus and asks students only to evaluate it melodically; then students compose their own cantus firmus; then students add a first-species counterpoint to a given cantus firmus; then students write the cantus firmus and counterpoint together. Each stage removes one layer of scaffolding, extending the student’s responsibility progressively as competence develops.

2.3 Threshold Concepts

A particularly powerful framework for music theory pedagogy comes from the educational researchers Jan Meyer and Ray Land, who introduced the concept of threshold concepts in 2003. A threshold concept is a concept that, once grasped, fundamentally and irreversibly transforms the learner’s understanding of a domain. Before crossing the threshold, the learner may have functional knowledge of isolated facts; after crossing it, she understands the domain from a new, integrative perspective that reorganizes everything she previously knew.

Threshold concepts are characterized by being: transformative (they change how the learner sees the subject); irreversible (once understood, they cannot be unlearned); integrative (they connect previously separate ideas); bounded (they mark the boundary of a discipline); and often troublesome (counterintuitive, counterexperiential, or conceptually alien to students’ prior frameworks).

In music theory, plausible threshold concepts include: tonal function (the idea that chords are not merely collections of notes but agents in a directed syntactic process — that a dominant is always tending toward a tonic, not merely combining two notes at a fifth); voice leading as independent lines (the idea that music is best understood as the simultaneous motion of independent melodic voices, not as a succession of vertical chords with added ornamentation); and set-class equivalence (the idea, central to post-tonal theory, that transpositions and inversions of a pitch-class set are formally “the same,” even when they sound and look different).

Remark 2.1. The threshold-concept framework has important pedagogical implications. If tonal function is a threshold concept, then students who have not yet crossed the threshold will process harmonic analysis as a labeling task — attaching Roman numerals to chords by pattern-matching — rather than as an analytical task of understanding directed harmonic motion. Instruction aimed at the labeling level will not, by itself, bring students to the analytical understanding on the other side of the threshold. Teachers need to design experiences that push students toward the transformative insight, not merely supply them with more efficient labeling procedures.

2.4 Cognitive Load Theory

Cognitive Load Theory (CLT), developed by psychologist John Sweller in the 1980s and 1990s, is based on the empirical finding that human working memory is severely limited in capacity. Most people can hold approximately four to seven discrete “chunks” of information in active working memory simultaneously. Complex cognitive tasks that exceed this capacity produce a characteristic state of overload — the student feels overwhelmed, makes systematic errors, and retains little of what was presented.

CLT distinguishes three types of cognitive load: Intrinsic load arises from the inherent complexity of the material itself, which cannot be reduced without simplifying the material. Extraneous load arises from poor instructional design — complex explanations, cluttered visual presentations, unnecessary cognitive demands imposed by the format rather than the content. Germane load is the cognitive effort that contributes directly to learning — schema formation and automation. Effective instruction minimizes extraneous load, manages intrinsic load through sequencing and task decomposition, and maximizes germane load.

Example 2.1 (Cognitive Load in Four-Part Writing). Asking students in a chorale-writing exercise to harmonize a soprano line while simultaneously managing voice-leading rules for four independent voices, attending to proper doublings, avoiding forbidden parallels, and ensuring melodic quality in all voices may exceed working memory capacity for novice students. Beginning instead with soprano-and-bass outer-voice frameworks — then adding inner voices — decomposes the task into manageable sub-problems, each of which demands less simultaneous attention. This staged approach is not a concession to weakness; it is sound cognitive architecture.

For music theory pedagogy, CLT provides strong theoretical support for the species counterpoint pedagogical sequence. Teaching note-against-note first species counterpoint before introducing syncopations (fourth species) is not mere historical tradition — it is sound cognitive design. Each species introduces a controlled increase in complexity while holding other variables constant, keeping intrinsic load within manageable bounds at each stage.

2.5 Deliberate Practice

The psychologist Anders Ericsson introduced the concept of deliberate practice to describe the kind of practice that actually produces expert performance, as distinct from mere repetition. Deliberate practice is characterized by: effortful engagement at the upper boundary of current ability; explicit goals focused on specific sub-skills; immediate and informative feedback; and concentrated attention on the practice process rather than on performing for an audience.

Mere repetition — playing through a piece many times, or working through many interval identification exercises in a row — produces modest skill gains that plateau. Deliberate practice — targeting a specific aspect of performance, receiving specific feedback, adjusting specifically in response — produces continued improvement. For aural skills pedagogy, the implications are significant: students who play EarMaster or Auralia for thirty minutes and randomly identify intervals are not engaging in deliberate practice; they receive feedback (right or wrong) but the feedback is not formative in the sense required.

Deliberate practice in interval recognition would require: identifying which specific intervals the student misidentifies most often; focusing practice exclusively on those intervals; receiving feedback that explains the error pattern; and adjusting listening strategy in response. This kind of targeted, diagnostic practice is harder to automate and harder to sustain, but it produces far superior results.

2.6 Transfer

The ultimate goal of undergraduate music theory education is not that students can complete textbook exercises correctly — it is that they can bring theoretical understanding to bear on unfamiliar musical situations. This ability to apply learned skills and concepts in new contexts is called transfer.

Cognitive research on transfer distinguishes near transfer (applying a learned skill to situations highly similar to those in which it was practiced) from far transfer (applying it to novel situations that share deep structural features but look very different on the surface). Textbook voice-leading exercises produce reliable near transfer: students who have completed many chorale-style exercises can complete similar exercises competently. Whether they produce far transfer to analyzing the voice-leading of a Brahms string quartet or a Stevie Wonder song is far less certain.

Designing exercises that promote transfer requires deliberate pedagogical choices: varying the surface features of examples while preserving the underlying structural relationships being learned; asking students to apply concepts across repertoires rather than only in a fixed exercise format; requiring verbal explanations of analytical reasoning, not just labeled scores; and engaging students in self-monitoring — asking them to predict what they will find before analyzing, then comparing prediction to result.

2.7 Growth Mindset in the Theory Classroom

Psychologist Carol Dweck distinguishes two implicit beliefs students hold about the nature of ability. Students with a fixed mindset believe that ability is an innate trait — you either have musical talent or you do not, and practice and instruction can only reveal, not create, capacity. Students with a growth mindset believe that ability is developed through effort, strategy, and persistence — that practicing ear training makes you better at hearing, not just better at recognizing that you cannot hear.

Music theory courses frequently trigger fixed-mindset responses, particularly in students who lack prior training in notation or classical music. A student who struggles to hear intervals in a dictation exercise may attribute her difficulty to a fixed trait (“I’m not a theory person,” “I can’t hear intervals — it’s just not how my brain works”) rather than to a learnable skill not yet learned. These attributions become self-fulfilling: the student stops practicing because she believes practice will not help, falls further behind, and confirms her initial belief.

Instructors can cultivate growth mindsets by: praising process rather than ability (“You worked carefully through that exercise” rather than “You’re so musical”); explicitly teaching the neuroscience of skill acquisition — that musical hearing is a learned capacity, not an innate gift; designing assignments where improvement over time is rewarded, not just final performance level; and sharing their own experiences of struggle and growth as developing musicians and theorists.

Example 2.2 (Growth Mindset Intervention in a Theory Course). At the beginning of a first-year theory course, the instructor distributes a brief reading on neuroplasticity and musical skill development, explaining in accessible terms that the auditory cortex physically changes in response to musical training and that the capacity to hear intervals is demonstrably trainable through consistent practice. She then administers a low-stakes baseline dictation exercise and keeps the results in a sealed envelope. At the end of the semester, she distributes an equivalent exercise and has students compare their results with the baseline. Students who initially struggled with the dictation and persisted in their practice — even if they have not reached the level of stronger students — see concrete evidence of their own growth. The experience reframes the course narrative: the question is not "am I a theory person?" but "how much have I grown this semester?"

2.8 Motivation and Self-Determination Theory

Beyond growth mindset, the broader framework of Self-Determination Theory (SDT), developed by psychologists Edward Deci and Richard Ryan, offers a comprehensive account of human motivation relevant to music theory pedagogy. SDT identifies three basic psychological needs whose satisfaction promotes intrinsic motivation and sustained engagement: the need for autonomy (feeling that one’s actions are self-chosen and aligned with one’s values), the need for competence (feeling effective and capable in one’s activities), and the need for relatedness (feeling connected to others in the learning environment).

When a theory course satisfies these needs — when students feel they have genuine choices in how they engage with the material, when they experience themselves as developing competence through effort, and when they feel a sense of community with classmates and instructor — intrinsic motivation follows naturally. When the course frustrates these needs — when students feel that arbitrary rules are imposed from outside and explain nothing, when they feel chronically overwhelmed and ineffective, and when the classroom culture is competitive rather than collaborative — extrinsic motivation at best results, and disengagement at worst.

Definition 2.2 (Intrinsic vs. Extrinsic Motivation). Intrinsic motivation refers to engagement in an activity for its own sake — because it is interesting, challenging, or inherently satisfying. Extrinsic motivation refers to engagement driven by external rewards or the avoidance of external consequences (grades, praise, penalties). Self-Determination Theory holds that intrinsic motivation is more durable, produces deeper learning, and supports greater creative engagement than extrinsic motivation. In a theory course, a student who wants to understand how voice-leading works because she finds it genuinely fascinating will learn more and retain it longer than a student who is trying only to avoid a failing grade — even if the two students produce similar work in the short term.

Chapter 3: Curriculum Design

3.1 The Standard American Undergraduate Theory Sequence

The standard American undergraduate music theory sequence, as codified by NASM accreditation expectations and reflected in the most widely used textbooks, consists of four semesters of written theory alongside four semesters of ear training and sight-singing (often called aural skills). The written theory sequence proceeds roughly as follows: first semester covers fundamentals (notation, intervals, scales, basic triads and seventh chords); second semester covers diatonic functional harmony in four-part chorale style (harmonic progression, voice-leading rules, cadences, non-chord tones, basic phrase structure); third semester extends harmony into the chromatic domain (secondary dominants, modal mixture, the Neapolitan, augmented-sixth chords) and introduces species counterpoint and basic formal analysis; fourth semester covers advanced chromaticism, an introduction to post-tonal theory (pitch-class sets, twelve-tone rows), and sometimes Schenkerian analysis.

This sequence is historically contingent and subject to local variation, but it is sufficiently standardized that students who transfer between programs can generally navigate the curricular landscape without complete disorientation. The standardization is not entirely beneficial: it constrains curricular experimentation and may privilege a particular historical and cultural framework — German tonal music of the eighteenth and nineteenth centuries — to the exclusion of other musical traditions and practices.

3.2 Backward Design

The most influential framework for curriculum design in contemporary educational research is backward design, developed by curriculum theorists Grant Wiggins and Jay McTighe in their influential book Understanding by Design (1998). The core insight of backward design is that most curricular planning proceeds in the wrong order: instructors decide first what they will teach (content), then how they will teach it (activities), and finally how they will evaluate whether students learned it (assessment). Wiggins and McTighe argue that this sequence produces curricula in which activities feel engaging but do not reliably build toward meaningful understanding.

Backward design reverses the sequence: begin by identifying the desired learning outcomes — what you want students to understand and be able to do by the end of the course; then design the assessments that will reveal whether students have achieved those outcomes; only then design the learning experiences (lectures, discussions, exercises, listening activities) that will prepare students for those assessments. This approach ensures that every instructional activity is purposefully connected to a learning outcome, and that assessments actually measure what the course is supposed to teach.

Example 3.1 (Backward Design Applied to a Chromatic Harmony Unit). Suppose the desired learning outcome is: "Students will be able to identify secondary dominants by ear in a four-voice texture, label them correctly in Roman numeral analysis, and resolve them correctly in part-writing." Working backward: the assessment should include a dictation passage containing several secondary dominants (testing aural identification), an analytical exercise on an unfamiliar score (testing visual identification and labeling), and a harmonization exercise in which secondary dominants are appropriate harmonic choices (testing part-writing). The instructional sequence should therefore include: listening examples featuring secondary dominants prominently; explanation of the voice-leading logic generating secondary dominants; drill in spelling secondary dominant chords in various keys; four-part writing exercises introducing secondary dominants one at a time; and score analysis showing secondary dominants across a range of textures.

3.3 Writing Learning Outcomes

Effective learning outcomes are specific, observable, and assessable. The widely used Bloom’s Taxonomy of educational objectives provides a useful hierarchy of cognitive processes, from lower-order skills (remember, understand, apply) to higher-order skills (analyze, evaluate, create). An outcome phrased as “students will understand secondary dominants” fails all three criteria: it is vague (what does “understand” mean?), unobservable (understanding is a mental state, not a behavior), and unassessable (how will you know when understanding has occurred?).

An outcome phrased as “students will be able to identify a secondary dominant in a four-voice chorale setting and resolve it correctly in four-part writing” satisfies all three criteria: it specifies a behavior (“identify,” “resolve”), an observable product (a completed four-part writing exercise), and measurable criteria (correct identification, correct resolution). Writing precise outcomes forces instructors to be clear about what they actually want students to be able to do, which in turn makes assessment and instruction design far more tractable.

Definition 3.1 (Learning Outcome). A learning outcome is a statement describing what a student will be able to do, know, or value at the end of a course or unit of instruction. A well-formed learning outcome is (1) observable: it describes a behavior or product, not an internal mental state; (2) specific: it designates a particular skill or body of knowledge rather than a general disposition; and (3) assessable: it can be evaluated against defined criteria. The verb chosen should come from an appropriate level of Bloom's Taxonomy — "identify," "analyze," "compose," "evaluate" — rather than vague verbs such as "understand," "appreciate," or "know."

3.4 Sequencing Decisions

Why does counterpoint come before harmonic analysis in the traditional sequence, and diatonic harmony before chromatic? These sequencing decisions have both historical and pedagogical rationales. The historical rationale is that species counterpoint was, historically, the first systematic pedagogical approach to teaching musical composition — it predates the Roman-numeral harmonic theory that came to dominate the curriculum in the nineteenth century. The pedagogical rationale is the cognitive load argument: counterpoint in two voices is simpler than four-voice harmony, because there are fewer voices to manage simultaneously. Learning counterpoint first gives students a solid grasp of voice-leading as independent melodic motion before introducing the additional complexity of functional harmonic progression.

Diatonic before chromatic is similarly motivated by cognitive load: chromatic harmony involves more complex chord spellings, more ambiguous function, and more difficult voice-leading than diatonic harmony. Students who have not fully mastered diatonic part-writing are not well-positioned to handle the additional challenges of, say, the German augmented-sixth chord resolving to a dominant with a chord of preparation.

Remark 3.1. The traditional sequencing rationale is pedagogically defensible, but it is not the only possible rational sequence. Some programs begin with harmonic analysis by ear — students analyze songs they know by pop music conventions — before introducing formal notation and species counterpoint. This inductive approach uses student familiarity with tonal syntax to build theoretical vocabulary, reversing the traditional deductive sequence. Neither approach is universally superior; the choice depends on the student population, program goals, and the instructor's pedagogical commitments.

3.5 Integrated Musicianship Models

One of the most significant curricular innovations in recent decades has been the movement toward integrated musicianship: rather than teaching written theory and aural skills in separate courses with separate instructors and no coordinated curriculum, the integrated approach teaches them together, using the same musical materials in written analysis, dictation, sight-singing, and keyboard harmony simultaneously.

The theoretical advantage of integration is transfer: students who study the augmented-sixth chord in their written theory course and hear it in dictation in the same week are far more likely to connect the two experiences than students who encounter the chord in writing in third semester and in ear training a semester later. The logistical challenges of integration are significant: it requires coordinated curriculum planning across multiple instructors, often including a radical rethinking of the staffing and scheduling structure of the theory department. Several programs have implemented the integrated model with considerable success.

An important distinction in discussing integrated musicianship is between structural integration (theory and aural skills are taught in the same course, by the same instructor, meeting at the same time) and curricular coordination (theory and aural skills are taught in separate courses, but their syllabi are deliberately synchronized so that the same topics appear in both courses in the same week). Structural integration is the more thoroughgoing reform, and it is the one that promises the greatest transfer benefits; curricular coordination is more modest but far easier to implement within existing departmental structures. Even curricular coordination — ensuring that students encounter the secondary dominant in written theory and in dictation in the same two-week period — produces measurable transfer benefits compared to completely unsynchronized curricula.

Example 3.3 (Curricular Coordination in Practice). A theory area at a mid-sized university maintains separate written theory and aural skills courses but coordinates them by meeting monthly to align syllabi. In the week when the written theory course introduces the Neapolitan chord, the aural skills course features dictation passages prominently featuring the characteristic sound of the ♭II harmony. Students in the aural skills class may not yet have the Roman numeral label for what they are hearing; students in the written theory class may not yet have heard the sound of what they are labeling. But the overlapping exposure — a visual introduction in one context, an aural introduction in another, within the same week — accelerates the integration of the two representations in a way that purely sequential presentation does not.

3.6 Elective Offerings and Graduate Theory Curricula

The core undergraduate sequence is only part of the curricular picture. Upper-division electives and graduate courses offer opportunities to develop more specialized competencies: jazz theory and improvisation, post-tonal theory, Schenkerian analysis, form and analysis, tonal counterpoint, music and text, analysis of popular music. Designing an elective sequence requires attending to prerequisites, student demand, faculty expertise, and the degree to which electives complement rather than merely repeat the core.

Graduate theory curricula serve a different population with different goals. Graduate students in music theory are preparing for careers as researchers and teachers, not primarily as performers. They need: facility with advanced analytical methods (Schenkerian analysis, transformational theory, pitch-class set theory, topic theory, corpus methods); knowledge of the scholarly literature in music theory; experience reading and evaluating analytical arguments; and, in programs that include a pedagogy component, supervised teaching experience with structured mentoring. The shift from skills-based training to analytical and theoretical research is the defining characteristic of graduate music theory education.

3.7 The Syllabus as Pedagogical Document

The course syllabus is often treated as a bureaucratic requirement — a list of policies, due dates, and grade weights distributed on the first day of class. Treated more carefully, it is the primary document through which an instructor communicates her pedagogical philosophy, establishes the norms and expectations of the course, and creates the first impression that shapes students’ orientations toward the material.

A well-crafted theory syllabus does several things: it articulates the course’s learning outcomes clearly and specifically; it explains the rationale for major assignments and how they connect to the learning outcomes; it frames the course’s content as connected to students’ musical lives rather than as an autonomous academic exercise; it communicates the instructor’s expectations for student engagement in terms that are encouraging rather than threatening; and it provides practical information about policies and procedures in a tone that treats students as mature, responsible adults.

Remark 3.2. Research on syllabus design suggests that students form strong expectations about a course within the first few minutes of reading the syllabus — expectations that shape their engagement throughout the semester. A syllabus heavy in punitive language ("late work will not be accepted," "cheating will result in automatic failure") signals a low-trust classroom environment and may trigger the stereotype threat and fixed-mindset responses discussed in Chapter 2. A syllabus that opens with a genuine expression of the instructor's enthusiasm for the subject and a description of what students will be able to do by the end of the course signals a high-trust, growth-oriented environment that supports learning.

3.8 Coordinating a Multi-Section Theory Program

Many music departments run the undergraduate theory sequence in multiple sections taught by different instructors — a combination of faculty, visiting lecturers, and graduate teaching assistants. Coordinating such a program so that students across sections cover equivalent material, use consistent terminology, and prepare for shared examinations is one of the most complex administrative and pedagogical challenges a theory area coordinator faces.

The most successful multi-section programs achieve coordination through: a shared detailed syllabus with common assignments and assessments; regular meetings of all instructors to discuss pacing and pedagogy; shared grading rubrics applied consistently across sections; and a deliberate community of practice in which instructors observe each other’s classes and discuss what they observe. The alternative — autonomous instructors teaching to a common examination with no coordination of pedagogy — tends to produce unequal student experiences, grading inconsistencies, and a fragmented curriculum.

Example 3.2 (Graduate Teaching Assistant Mentoring in a Multi-Section Program). A theory coordinator at a large state university runs the first-year theory sequence in eight sections, six of which are taught by advanced doctoral students serving as teaching assistants. The coordinator holds a weekly pedagogy seminar in which TAs discuss upcoming lesson content, share student work samples, workshop planned activities, and report on what did and did not work in the previous week's classes. TAs observe the coordinator's section twice per semester and the coordinator observes each TA's section at least once. TAs write brief reflective journals after each class meeting. This structure provides TAs with the mentoring they need to develop as teachers while ensuring that students across all sections receive comparable instruction.

Chapter 4: Teaching Tonal Harmony

4.1 The Four-Voice Chorale as Pedagogical Object

The four-voice chorale — modeled on the harmonizations of Lutheran chorale melodies by J.S. Bach — has served as the central pedagogical object of tonal harmony instruction in American universities for over a century. There are good reasons for this centrality. The chorale texture is maximally voice-leading-explicit: with only four voices and a predominantly homophonic texture, every harmonic decision and every voice-leading motion is visible and audible. There is relatively little ornamentation to distract from structural voice-leading, and the voice ranges (soprano, alto, tenor, bass) are familiar from choral practice.

The Bach chorales themselves are extraordinarily diverse: over 370 survive, and they collectively demonstrate virtually every harmonic situation in the diatonic and chromatic tonal vocabulary. As analytical models, they are unequaled in richness and accessibility. The four voices move according to strict conventions: each voice should move smoothly (by step or small leap), avoid crossing another voice, avoid parallel fifths and octaves with any other voice, double the root of root-position triads in most circumstances, and resolve tendency tones — the leading tone and the dissonant seventh — according to their syntactic tendencies. These conventions are introduced progressively across the first two years of the standard sequence.

Beyond its historical authority, the Bach chorale has practical pedagogical virtues that are easily overlooked. Its homophonic texture ensures that all voice-leading decisions are simultaneously visible and audible. Its brevity — most chorales are between twenty and forty measures — means that students can analyze a complete piece in a single class period, getting the experience of formal closure and tonal completeness rather than analyzing a fragmentary excerpt. Its harmonic richness and diversity — each chorale presents a different harmonic vocabulary, a different phrase structure, and a different set of voice-leading challenges — means that working through even a dozen chorales provides a survey of the tonal harmonic language that is both systematic and practically illuminating. And its absolute historical authenticity — these are real compositions by a major composer, not textbook fabrications — gives students the experience of engaging with actual music even within the constraints of a pedagogical exercise.

Remark 4.3. The most common practical objection to the Bach chorale as a pedagogical model is that students who are not trained in the Lutheran chorale tradition find it culturally remote. This objection is real but manageable. Playing recordings of Bach chorales — the full choral performance, not just a mechanical piano realization — typically resolves the cultural-distance problem immediately: students who find the harmonic notation abstract and unfamiliar hear it transformed into something recognizably moving and expressive. The musical power of the chorales is the most effective argument for their pedagogical centrality.

4.2 Common Voice-Leading Errors and Their Pedagogy

The most common voice-leading errors in student work fall into a relatively small number of recurring categories, each with its own pedagogical challenge.

Parallel perfect fifths and octaves — consecutive parallel motion by perfect fifth or perfect octave between the same pair of voices — were prohibited in modal polyphony and remain prohibited in tonal part-writing pedagogy because they collapse the independence of the two voices involved. When two voices move in parallel by a perfect fifth, they begin to sound like a single entity rather than two independent lines. Teaching students to hear parallel fifths requires ear-training practice, not just rule memorization: students who can identify parallel fifths visually by counting intervals on a score but who cannot hear them in a texture have learned only half the lesson.

Voice crossing (a lower voice moving above a higher voice, or vice versa) and voice overlapping (a lower voice moving above the previous position of a higher voice) are less audibly obvious than parallel fifths but equally problematic for voice-leading independence. Improper resolution of tendency tones is perhaps the most musically significant error: the leading tone (\(\hat{7}\)) tends powerfully toward the tonic (\(\hat{8}\)) by half step, and the chordal seventh of a dominant seventh chord tends toward \(\hat{3}\) by half step (\(\hat{4}\) → \(\hat{3}\)). Students who allow these tones to move in other directions produce voice leading that violates the syntactic logic of tonal harmony, not merely an arbitrary rule.

Definition 4.1 (Tendency Tones). In tonal music, tendency tones are scale degrees whose melodic motion is strongly directed by the harmonic context. In a major key, the most prominent tendency tones are scale degree \(\hat{7}\) (the leading tone), which tends upward by half step to \(\hat{8}\), and scale degree \(\hat{4}\) (the chordal seventh in a V7 chord), which tends downward by half step to \(\hat{3}\). These tendencies are particularly strong when the tones appear in the soprano or in a structurally prominent voice; they may be less rigorously observed in inner voices when voice-leading constraints make exact resolution impractical.

4.3 Part-Writing Exercises vs. Harmonization Exercises

A crucial pedagogical distinction — one that textbooks do not always make clearly enough — is the difference between part-writing exercises and harmonization exercises. In a part-writing exercise, the instructor provides an outer-voice framework (soprano and bass are given) and asks the student to supply the inner voices (alto and tenor). The harmonic progression is implicitly specified by the outer voices; the student’s task is to fill in the inner voices correctly according to voice-leading rules. This is a relatively constrained exercise that isolates the skill of managing inner-voice motion.

In a harmonization exercise, the instructor provides only a soprano melody and asks the student to choose an appropriate harmonic progression (select Roman numerals), then realize that progression in four-part texture. This is a far more demanding task: it requires not only voice-leading competence but harmonic judgment — the ability to hear a melody and imagine appropriate harmonic supports for it, considering phrase structure, harmonic rhythm, and stylistic conventions. The two skills are related but distinct, and curricula should sequence them appropriately: most part-writing before most harmonization, with clear acknowledgment of the different cognitive demands each exercise type places on the student.

4.4 Teaching Roman Numeral Analysis

The choice between different approaches to Roman numeral analysis has been a persistent debate in music theory pedagogy. The dominant American approach, codified in Kostka and Payne and most widely used textbooks, assigns Roman numerals to chords based on their root (the bass of the root-position chord), with Arabic numerals in figured-bass notation appended to indicate inversions. This approach is pedagogically efficient and widely understood but has been criticized for encouraging “root-hunting” — students identify the root of every chord without necessarily understanding its voice-leading function.

The approach associated with Aldwell and Schachter’s Harmony and Voice Leading begins from figured bass rather than from Roman numerals: students learn to realize a figured bass line before learning to analyze root motion. This approach keeps voice-leading in the foreground and may produce deeper understanding of harmonic syntax, but it requires more instructional time and is less familiar to students who have encountered harmony through popular music or guitar-chord notation.

A third approach — associated with the galant schemata research of Robert Gjerdingen and the historically informed pedagogy of Giorgio Sanguinetti — organizes harmonic instruction around recognizable bass patterns (the Romanesca, the Monte, the Fonte) rather than around individual chord Roman numerals. These patterns are multi-chord formulae that recur across hundreds of Classical and pre-Classical compositions, and learning them by name and by ear gives students a vocabulary for harmonic recognition that is at once more historically authentic and more perceptually direct than Roman-numeral analysis applied chord by chord. The galant approach remains a minority tradition in American pedagogy, but its growing scholarly profile suggests that it may play a larger role in future curricula.

Remark 4.2. The choice among Roman-numeral, figured-bass, and galant-schema approaches to harmonic pedagogy is not purely theoretical — it has practical consequences for what students can do with their analytical knowledge. Roman-numeral analysis is the lingua franca of American music theory pedagogy and produces students who can communicate about harmony with any American colleague. Figured-bass analysis produces students with strong skills in tonal realization and keyboard harmony. Galant-schema analysis produces students with keen perception of multi-chord patterns and a historically richer understanding of Classical style. A curriculum that combines all three — using Roman numerals for systematic analysis, figured bass for keyboard harmony, and galant schemata for style-specific pattern recognition — may produce the most comprehensively equipped graduates, though at the cost of greater curricular complexity.

4.5 Secondary Dominants and Modulation

Teaching the concept of tonicization — the brief, local treatment of a chord other than the tonic as if it were a temporary tonic — is one of the most significant pedagogical moments in the harmonic sequence. A secondary dominant is a dominant-function chord (major triad or dominant seventh chord) applied to a chord other than the tonic. The notation V/V (read “five of five”) indicates a dominant chord applied to the dominant chord of the current key: in C major, V/V is D major (or D–F♯–A–C), which resolves to G major as if G were momentarily a local tonic.

A common student misconception is to interpret any secondary dominant as a modulation: “The F♯ means we’ve modulated to G major.” Teaching students to distinguish tonicization (a brief, local chromatic inflection that does not shift the tonal center) from genuine modulation (a more sustained reorientation of tonal center, typically confirmed by a cadence in the new key) requires both analytical examples showing the difference and precise vocabulary for discussing it.

The concept of the pivot chord — a chord that belongs to both the original key and the new key, used to shift smoothly between them — is a key tool for this distinction. In a modulation from C major to G major, the chord D minor (ii in C major, vi in G major) can function as a pivot: it is heard in its original function in C major and reinterpreted as vi in G major, allowing the harmonic shift to occur without a jarring discontinuity.

4.6 Chromatic Harmonies: Neapolitan and Augmented Sixth Chords

The Neapolitan chord (typically notated ♭II or N in Roman numeral analysis) is a major triad built on the lowered second scale degree: in C major, it is D♭–F–A♭. Its characteristic sound — a major chord with a flat-second root — has a rich history in tonal music from the Baroque through the Romantic period, appearing prominently in Scarlatti, Beethoven, Schubert, and Chopin.

The augmented sixth chords — the Italian, French, and German augmented sixth — are best understood not by their exotic names but by their voice-leading logic: they are chromatically intensified pre-dominant chords in which the interval of an augmented sixth between the bass (\(\hat{\flat}6\)) and an upper voice (\(\hat{\sharp}4\)) expands outward by half step to an octave on the dominant. Teaching these chords from their voice-leading function — rather than from their exotic Roman numeral labels — helps students understand why they work musically and makes them easier to recognize, resolve, and spell.

Example 4.1 (The German Augmented Sixth in C Major). The German augmented sixth chord in C major consists of the pitches A♭–C–E♭–F♯. It is built on the flattened sixth scale degree (A♭), includes the tonic (C) and the flattened third (E♭), and adds a chromatically raised fourth degree (F♯). The interval between A♭ (bass) and F♯ (upper voice) is an augmented sixth, which expands outward by contrary motion to the octave G–G as the chord resolves to the dominant. Enharmonically, A♭–C–E♭–F♯ is equivalent to A♭–C–E♭–G♭, a dominant seventh chord in D♭ major — a relationship composers exploit for enharmonic modulations to distant keys.

4.7 Keyboard Harmony and the Role of Listening

Keyboard harmony — the practice of playing harmonic progressions at the piano — is a vital supplement to written harmony exercises that is sometimes neglected in programs without keyboard harmony requirements. When students play a progression they have written — hearing the bass moving from tonic to dominant, feeling the leading tone resolve under their fingers — they develop a physical and aural intuition for harmonic syntax that written exercises alone cannot provide.

Every written harmony lesson should include audio examples. Students who write four-part harmonizations without hearing them are learning a visual grammar without a spoken language — they can manipulate symbols correctly without having any aural referent for what those symbols represent. Instructors should play model examples at the piano or play recordings, and should ask students to bring their completed assignments to the piano and play them, even badly, as part of the assignment.

4.8 Teaching Non-Chord Tones

The study of non-chord tones — melodic notes that do not belong to the underlying harmony — occupies a significant place in the harmonic curriculum because it bridges the gap between abstract Roman-numeral analysis and the richly ornamented surface of actual tonal music. The traditional taxonomy includes: the passing tone (melodically connecting two chord tones by stepwise motion through a dissonance); the neighbor tone (departing from a chord tone by step and returning); the suspension (a held-over chord tone that becomes dissonant and resolves downward); the appoggiatura (an unresolved approach to a chord tone by leap, creating a dissonance that resolves stepwise); the escape tone (departing from a chord tone by step and resolving by leap); and the anticipation (a note of the following chord arrived at early).

Teaching non-chord tones requires moving carefully between two levels of description: the abstract voice-leading logic (how each non-chord tone type behaves as a linear event) and the musical effect (how each type contributes to melodic expressiveness and rhythmic interest). Students who learn to identify non-chord tones by their abstract definition but cannot hear them in a score have learned the taxonomy without the musical insight it is meant to convey.

Definition 4.2 (Passing Tone). A passing tone is a non-chord tone that occurs between two notes of the underlying harmony, connected to both by stepwise motion in the same direction. A passing tone on a metrically weak beat is called an unaccented passing tone; one on a metrically strong beat is an accented passing tone. Accented passing tones create a more intense dissonance and are characteristic of the late Baroque and Classical styles; they are often mistaken by students for chord tones because of their metrically prominent position.
Example 4.2 (Non-Chord Tone Identification in a Bach Chorale). In the soprano of Bach's chorale harmonization "Jesu, Joy of Man's Desiring" (BWV 147), the flowing triplet melody contains a continuous stream of passing tones connecting chord tones in the underlying harmonic progression. Students analyzing this excerpt should first identify the harmonic rhythm (how frequently the harmony changes), then determine the chord tone at the beginning of each harmony, then classify the intervening notes as passing tones (stepwise motion between chord tones) or neighbor tones (departures from and returns to a repeated chord tone). The exercise trains two related skills simultaneously: harmonic hearing (identifying the underlying progression) and linear hearing (tracing the melodic motion across the surface).

4.9 Teaching Formal Analysis Alongside Harmony

The relationship between harmonic analysis and formal analysis — the analysis of musical form, phrase structure, and large-scale organization — is closer than the typical curriculum structure (harmony in second semester, form in third or fourth) might suggest. Harmonic cadences are the primary markers of formal boundaries: an authentic cadence (V–I with both chords in root position and the tonic note in the soprano at the cadence point) marks a strong formal close; a half cadence (ending on V) marks a point of harmonic openness that the music will subsequently resolve; a deceptive cadence (V–vi) marks a deflection that delays the expected close.

Teaching students to identify cadence types is therefore not merely a harmonic exercise but a gateway to formal analysis. The period — the basic two-phrase formal unit in which an antecedent phrase ends with a half cadence and a consequent phrase ends with an authentic cadence — is introduced most naturally through harmonic analysis: students identify the two cadences, note that the antecedent is harmonically open and the consequent harmonically closed, and recognize the period as a larger-scale question-and-answer structure that mirrors the voice-leading logic of the suspension-and-resolution at a much longer time scale.

Remark 4.1. Integrating formal analysis into the second-semester harmony curriculum — rather than deferring it to a separate third- or fourth-semester form and analysis course — has the significant advantage of making harmonic analysis immediately purposeful. Students who learn to identify cadences not merely as chord progressions but as formal articulations understand why harmonic analysis matters: it reveals the architecture of the music, the proportions and relationships that give tonal compositions their characteristic shapes. Without this architectural dimension, harmonic analysis risks feeling like an end in itself — a labeling exercise rather than an analytical one.

Chapter 5: Teaching Counterpoint

5.1 The Pedagogical Logic of Species Counterpoint

Species counterpoint is the most carefully systematized pedagogical system in the history of Western music theory education. Its logic is fundamentally cognitive-load reduction through controlled task decomposition. The five species isolate rhythmic complexity: in first species (note-against-note), every note of the counterpoint sounds against exactly one note of the cantus firmus, so all notes are consonant and the only challenge is managing harmonic intervals and melodic shape; second species (two notes against one) introduces passing tones on weak beats, requiring attention to the distinction between consonance and dissonance; third species (four notes against one) introduces neighbor tones and other embellishing patterns; fourth species (syncopation) introduces suspensions — the central dissonance type of tonal voice-leading — by requiring the counterpoint to enter on the offbeat and sustain across the barline; fifth species (free or florid counterpoint) combines elements of all four previous species in a free mixture, approaching the rhythmic variety of actual composed music.

5.2 Fux’s Gradus ad Parnassum and Its Legacy

Johann Joseph Fux published his Gradus ad Parnassum in Latin in Vienna in 1725. The work presents counterpoint pedagogy in dialogue form — a student (Josephus, widely understood to be Fux himself) learns from a master (Aloysius, understood to represent Palestrina) — and proceeds through the five species in each of the four modal categories that Fux recognized. The Gradus was immediately recognized as a pedagogical masterwork: Handel owned a copy, Haydn studied it independently as a young composer in Vienna, Mozart’s father used it to teach his son, and Beethoven worked through exercises from it with his teacher Albrechtsberger. Schoenberg assigned it to his composition students; Hindemith’s Traditional Harmony explicitly acknowledges its debt.

The enduring influence of the Gradus is somewhat paradoxical: Fux based his system on the modal counterpoint of Palestrina (c.1525–1594), which is stylistically quite remote from the tonal harmonic language that most theory students are simultaneously learning. The cantus firmi are in modes (Dorian, Phrygian, Lydian, Mixolydian), not in major or minor. Yet students who learn the system carefully develop a voice-leading facility that transfers well to tonal harmony — because the fundamental skills (linear independence, stepwise motion, careful dissonance treatment) are common to both.

5.3 Cantus Firmus Composition

Before adding a counterpoint to a given cantus firmus, students in many pedagogical traditions are asked to compose a cantus firmus themselves. This is more demanding than it appears: a good cantus firmus must have a clearly directed melodic shape (typically a single climax point), move predominantly by step with occasional leaps, avoid outlining dissonant intervals between non-adjacent pitches, recover from large leaps by stepwise motion in the opposite direction, and begin and end on the finalis of the mode. Composing a cantus firmus is thus a test of melodic craft in its purest form, free from any harmonic considerations.

Definition 5.1 (Cantus Firmus). A cantus firmus (Latin: "fixed song") is a pre-existing melody used as the structural foundation of a polyphonic composition or pedagogical exercise. In species counterpoint pedagogy, the cantus firmus is typically a rhythmically uniform melody in whole notes or breves, set in a modal scale and used as the scaffolding against which the student writes the counterpoint. A good cantus firmus is characterized by: predominantly stepwise motion, a single climactic peak, avoidance of repeated notes, recovery from leaps by contrary stepwise motion, and a clear sense of melodic beginning, middle, and end.

5.4 Teaching Suspension (Fourth Species)

Fourth species counterpoint introduces the suspension — the most important dissonance type in tonal voice-leading pedagogy. A suspension is produced when a consonant note is held over (sustained) across a barline while the bass moves to a new note, creating a dissonance; the suspended note then resolves downward by step to a consonance. The suspension has three parts: the preparation (the note is consonant in its first appearance), the suspension itself (the note is held over and becomes dissonant), and the resolution (the note moves down by step to a consonance).

The most common suspensions in two-voice counterpoint above a bass are: the 7–6 suspension (the dissonant seventh resolves to a sixth), the 4–3 suspension (the dissonant fourth resolves to a third), the 9–8 suspension (the ninth resolves to the octave), and the 2–3 suspension (in the lower voice, a bass suspension).

The suspension is the archetype for the tonal concept of non-chord tone resolution: virtually every non-chord tone in tonal music — the passing tone, the neighbor tone, the appoggiatura, the escape tone — can be understood as a more ornamented version of the basic suspension-and-resolution logic. Teaching fourth species carefully, with attention to the preparation-suspension-resolution sequence as a voice-leading principle rather than merely a rhythmic pattern, equips students with a model for understanding tonal dissonance treatment that they will use for the rest of their analytical careers.

Example 5.1 (A 7–6 Suspension Chain in Second-Species Style). In a soprano voice above a held bass note, suppose the soprano sustains a pitch that forms a seventh against the bass: the seventh is dissonant, held over from the previous beat where it was consonant (as a sixth against the previous bass note). As the bass moves to a new pitch below, the soprano resolves downward by step to form a sixth. This preparation–suspension–resolution sequence, repeated across successive beats, produces the characteristic stepwise descending chain of suspensions found throughout Renaissance polyphony and codified by Fux as the paradigmatic fourth-species pattern.

5.5 From Species Counterpoint to Tonal Counterpoint

The transition from Fuxian species counterpoint (modal, abstract, pedagogical) to tonal counterpoint (functional harmonic, applied to actual composed music) requires explicit pedagogical attention. The key texts for this transition are Robert Gauldin’s A Practical Approach to 18th-Century Counterpoint and Benjamin, Horvit, and Nelson’s Counterpoint in the Style of J.S. Bach. These texts recontextualize the species-counterpoint skills in the harmonic framework of the Baroque: cantus firmi are replaced by actual bass lines implying functional harmonic progressions; species conventions are relaxed in favor of stylistic conventions drawn from Bach’s actual practice; and students analyze Bach inventions, sinfonias, and chorale preludes as models.

The most important conceptual step in this transition is understanding that species counterpoint’s rule about dissonance treatment is not an arbitrary prohibition but a description of a deep principle: in tonal music, dissonances are not independent harmonic events but linear events arising from the motion of individual voices. A suspension in fourth species is the same phenomenon as an appoggiatura in a Mozart slow movement — a linear delay of a harmonic tone, creating momentary tension that resolves to consonance. Recognizing this continuity between the species tradition and tonal practice is itself a threshold concept.

5.6 Teaching Invention and Fugue

The teaching of fugue represents the most advanced stage of the contrapuntal curriculum. A fugue is organized around a subject (the principal melodic idea, presented alone in a single voice at the opening), which is then imitated in other voices in turn in the answer — the subject transposed to the dominant — followed by successive entries. The relationship between subject and answer may be tonal (a slight modification of the subject’s intervals to remain within the key) or real (an exact transposition). The countersubject is the melodic material that accompanies each subsequent subject entry.

Teaching fugue as both an analytical and compositional activity develops a range of skills simultaneously: recognizing contrapuntal imitation in score, understanding the interplay between melodic foreground and harmonic background in the episodes (passages between subject entries), tracking tonal structure at the large scale, and appreciating how motivic economy — the development of a whole composition from a compact, distinctive subject — is itself a generative compositional principle.

5.7 Grading Counterpoint

Developing principled rubrics for grading counterpoint is more complex than it might appear. A purely rule-based rubric — one error per parallel fifth, one error per voice crossing — can be applied consistently but fails to capture the most musically important properties of student work: melodic shape and continuity, the sense of directed motion toward a climax and cadence, the musical logic of the dissonance choices. A rubric that includes a holistic “musical quality” criterion addresses these dimensions but introduces subjectivity that students may experience as arbitrary.

The most defensible approach is a layered rubric: a base layer of specific, rule-based criteria (voice-leading correctness, dissonance treatment, melodic propriety) that together account for roughly two-thirds of the grade, plus a holistic criterion of musical continuity and shape that accounts for the remaining third and is defined in the rubric with concrete descriptors. This structure allows consistent application of the rule-based criteria while acknowledging that melodic quality is a real and teachable property that deserves recognition.

5.8 Three-Voice and Four-Voice Counterpoint

The extension of species counterpoint from two voices to three and four voices introduces new challenges that go beyond adding more voices to the same exercise. In three-voice counterpoint, the student must manage three independent lines simultaneously — a challenge that exceeds the capacity of most beginners’ working memory — and must also attend to the harmonic completeness of the three voices together: a three-voice texture in first species should express a complete triad at each point of contact, not merely a consonant dyad.

In four-voice counterpoint, the student enters the terrain of the Bach chorale: four independent voices, each governed by its own melodic logic, all required to produce a coherent harmonic progression at every beat. The transition from two-voice species exercises to four-voice writing is one of the most demanding steps in the entire theory curriculum. Students who successfully complete two-voice fifth species often find four-voice first-semester chorale exercises more difficult, not less — because the number of variables that must be managed simultaneously has more than doubled.

Remark 5.1. Some pedagogues argue that two-voice species counterpoint and four-voice harmonic part-writing should be taught simultaneously from the beginning of the theory curriculum, on the grounds that the skills reinforce each other and that students who learn them together develop a richer and more integrated understanding of voice-leading. Others argue for strict sequencing: master two-voice species before attempting four-voice harmony. The cognitive-load argument favors sequencing; the integration argument favors simultaneous development. The empirical literature does not definitively favor either approach, and the choice may depend more on the specific student population and faculty resources available.

5.9 Counterpoint in Post-Tonal Music

The pedagogical treatment of counterpoint does not end with the tonal tradition. Post-tonal counterpoint — independent linear voices in a non-functional harmonic context — poses its own pedagogical challenges, particularly in explaining what it means for voices to be “independent” when the harmonic criteria for consonance and dissonance that structured tonal counterpoint no longer apply.

Several approaches are available. The intervallic approach — focusing on interval class content and avoiding the saturation of any single interval class — gives students compositional criteria without importing tonal function where it does not belong. The canonic approach — using strict canonic imitation as the primary structural device — applies the logic of tonal imitative counterpoint to post-tonal contexts: the canon produces independence through time displacement rather than through harmonic criteria. Analysis of actual post-tonal contrapuntal works — Bartók’s Music for Strings, Percussion, and Celesta, Webern’s Symphony op. 21, Lutosławski’s string quartets — provides models that make the abstract criteria concrete.

The twelve-tone row as a device for organizing post-tonal counterpoint provides a particularly interesting pedagogical case. In Schoenberg’s and Webern’s twelve-tone counterpoint, the independence of voices is maintained through registral and timbral differentiation, through rhythmic contrast, and through the careful management of interval content — since the row itself determines which pitch classes are available at any moment, the composer’s contrapuntal freedom operates within tight parametric constraints. Pedagogically, having students compose a short two-voice piece in which each voice unfolds a different form of the same row (prime and inversion, or original and retrograde) develops both twelve-tone technique and contrapuntal thinking simultaneously.

Example 5.2 (Two-Voice Twelve-Tone Counterpoint as a Pedagogical Exercise). Students are given a twelve-tone row and asked to: (1) derive its four basic forms (original, retrograde, inversion, retrograde inversion); (2) compose a two-voice, sixteen-measure piece in which the upper voice presents the row in original order and the lower voice presents it in inversion, beginning simultaneously; (3) ensure that neither voice repeats any pitch class before its respective row is complete; and (4) create rhythmic differentiation between the two voices so that they are perceptually independent. Completing this exercise successfully requires command of row mechanics, understanding of how to create linear independence without tonal voice-leading rules, and compositional judgment about rhythm and register. It is a demanding task that integrates skills from post-tonal theory, counterpoint, and compositional practice in a single exercise.

Chapter 6: Aural Skills Pedagogy

6.1 Solfège Systems

The choice of solfège system is one of the most consequential and contested decisions in aural skills pedagogy. The two primary systems in use in American conservatories and universities are fixed-do and movable-do. In the fixed-do system, each solfège syllable corresponds to an absolute pitch class: C is always do, D is always re, E is always mi, and so forth, regardless of key. In the movable-do system, the syllable do always designates the tonic of the current key: in C major, C is do; in G major, G is do; in B♭ major, B♭ is do.

The movable-do system has significant advantages for developing functional tonal hearing: because every scale degree has the same syllable in every key (do is always tonic, sol is always dominant), students develop a vocabulary for hearing and singing the functional relationships between scale degrees — the tendency of ti to resolve to do, the stability of do and sol, the brightness of mi. These syllable-function associations become an internalized tonal map that supports both sight-singing and harmonic hearing.

The fixed-do system has advantages for chromatic and atonal music and for reading in non-transposing instruments: a student trained in fixed-do can immediately name the pitch being sung without calculating a transposition. Most European conservatories use fixed-do; most American music programs use movable-do, with some using a la-based minor variant (in which the natural minor scale is sung la–ti–do–re–mi–fa–sol–la) and others using a do-based minor variant (in which the tonic of a minor key is always do, with me, le, and te for the lowered third, sixth, and seventh).

Remark 6.1. The debate between fixed-do and movable-do is sometimes conducted with more ideological fervor than the empirical evidence warrants. Skilled musicians have been trained under both systems, and neither guarantees superior aural skills. The more important consideration is consistency: whatever system a program adopts should be used uniformly across all courses and instructors, since switching systems mid-sequence forces students to rebuild their solfège vocabulary from scratch. Programs should also acknowledge each system's limitations: movable-do students often struggle with chromatic and post-tonal music, while fixed-do students may have a harder time internalizing functional tonal relationships.

6.2 Karpinski’s Aural Skills Acquisition

Gary Karpinski’s Aural Skills Acquisition: The Development of Listening, Reading, and Performing Skills in College-Level Musicians (2000) is the most comprehensive research-based account of how college musicians develop aural skills. Karpinski draws on cognitive science, music psychology, and extensive pedagogical experience to articulate a model of aural skill development that is both empirically grounded and practically usable.

A central concept in Karpinski’s framework is audiation — his adaptation of Edwin Gordon’s term for the cognitive process of hearing music in the mind without the music actually sounding. When a skilled musician reads a notated melody and “hears” it internally before singing it, or when a composer imagines the sound of a passage before writing it down, she is audiating. Audiation is distinct from mere recognition (identifying that a chord is a dominant seventh) and from passive listening (hearing the chord play); it is active, generative musical hearing — the capacity to produce musical sound in the mind.

Karpinski argues that the development of audiation is the central goal of aural skills education, and that many conventional aural skills activities — isolated interval identification, chord-quality recognition from chords played out of context — fail to develop audiation because they do not require students to produce mental musical sound. They require, at most, pattern matching against memorized templates.

6.3 Gordon’s Music Learning Theory

Edwin Gordon’s Music Learning Theory provides a broader developmental framework within which Karpinski’s work sits. Gordon argues that musical learning proceeds through a sequence of stages, from basic audiation of isolated tonal and rhythm patterns to complex audiation of syntactic musical sequences. His taxonomy of tonal content — beginning with tonal patterns in major and moving through minor and then modal and other scales — provides a sequencing principle for aural skills instruction: students should develop secure audiation of simple patterns before encountering more complex ones, and instruction should always keep audiation (active mental hearing) at the center.

Gordon’s research also supports the importance of rote learning before note learning: students should develop aural command of musical patterns — be able to sing them, recognize them, and audiates them mentally — before they encounter the notation for those patterns. This is analogous to first-language acquisition: children learn to speak and understand their native language years before they learn to read and write it, and their oral competence supports their literacy development. Music literacy (reading and writing notation) develops most soundly on the foundation of prior aural competence.

Gordon’s taxonomy of tonal content — the specific patterns, scales, and harmonic sequences that students should master in sequence — gives aural skills instructors a principled way to organize their curricula beyond the simple diatonic-to-chromatic-to-post-tonal sequence that most programs follow by default. In Gordon’s framework, students should develop secure audiation of resting tone patterns (patterns that begin and end on the tonic) before moving to active tone patterns (patterns that involve scale degrees with strong tendency, particularly \(\hat{7}\) and \(\hat{2}\)). This micro-sequencing within the diatonic major phase — which most curricula treat as a uniform block — reflects the real differences in perceptual difficulty among different scale-degree combinations.

Definition 6.2 (Audiation). Audiation, as defined by Edwin Gordon, is the cognitive process of hearing and comprehending music in one's mind when the physical sound is not present — or, at a higher level, when it has not yet been produced. Audiation differs from mere aural imagery (simple mental replay of a memorized passage) in that it involves active musical comprehension: the audiating musician hears music as syntactically organized, tonally meaningful sound, not as a passive replay of a stored recording. Gordon distinguished audiation from listening (hearing music as it sounds in the present moment), reading (hearing music as it is notated), and writing (notating music that one audiates). All aural skills activities, in Gordon's framework, should develop the capacity for audiation across all these modes.

6.4 Dictation as a Pedagogical Activity

Dictation — transcribing a musical passage from sound to notation — is the activity most closely associated with aural skills courses. Dictation exercises come in three main types: melodic dictation (transcribing a single melodic line), rhythmic dictation (transcribing a rhythm without pitch content), and harmonic dictation (identifying the harmonic progression of a passage, typically by bass line and Roman numerals). Each type develops different skills and poses different challenges.

The research literature on what makes dictation effective — as opposed to merely difficult — suggests several principles. Passages should be long enough to provide harmonic context but short enough to fit in working memory (four to eight measures for melodic dictation is typical). Students should be allowed multiple hearings with specific listening tasks assigned to each hearing: first hearing — determine meter and mode; second hearing — sing along mentally and capture the opening; third hearing — fill in gaps; fourth hearing — check. The instructor should provide the beginning note and key signature before the exercise begins, to eliminate variables that are not the focus of the exercise. Harmonic dictation should use passages from actual music rather than invented harmonic exercises, to develop the associative listening skills that transfer to score study.

Example 6.1 (A Structured Harmonic Dictation Sequence). The instructor plays an eight-measure excerpt from a Haydn string quartet. On the first playing, students only listen and determine the meter and mode (major or minor). On the second playing, students write down the bass line as best they can, without worrying about the upper voices. On the third playing, students identify the chord quality at each beat of harmonic change, placing Roman numerals below their bass line. On the fourth playing, students check their Roman numerals and correct any errors. On the fifth playing, students verify the complete harmonic progression and identify any non-chord tones in the bass. This five-stage protocol distributes attention systematically across the dimensions of the exercise, preventing students from being overwhelmed by the task of simultaneously tracking bass motion, chord quality, and Roman numeral label.

6.5 Sight-Singing

Sight-singing — performing an accurate vocal rendition of a notated melody on first reading — is, along with dictation, the core activity of aural skills training. Effective sight-singing instruction moves through a carefully sequenced curriculum: beginning with stepwise motion in simple meter in major keys, then introducing small leaps and compound meter, then diatonic leaps of sixths and sevenths, then modal scale degrees, then chromatic alteration, then post-tonal melodies with no key signature.

A crucial pedagogical distinction is between functional and technical approaches to sight-singing. A functional approach emphasizes understanding the scale-degree functions of each note before singing — knowing that the note is \(\hat{5}\) before singing it, which gives a pitch target derived from tonal knowledge. A technical approach emphasizes ear-to-mouth coordination — developing the kinesthetic habit of producing the correct pitch by a process closer to athletic training than intellectual calculation. The research suggests that the most effective sight-singing instruction combines both approaches: students should develop both the functional understanding (what scale degree am I singing?) and the kinesthetic habit (how does this interval feel in my voice?) that together produce reliable sight-reading ability.

Pacing is crucial in sight-singing instruction. Students who are asked to maintain a strict tempo while sight-singing unfamiliar material before they have internalized the pitch content tend to produce rhythmically accurate but melodically inaccurate results, or to sacrifice the tempo entirely in order to puzzle out the pitches. A productive strategy is to separate pitch and rhythm: first have students audiate the pitches slowly without conducting a beat; then add the rhythmic dimension once the pitch content is secure. This approximates the process used by skilled sight-readers, who process pitch and rhythm in overlapping but partially sequential stages rather than truly simultaneously.

Remark 6.3. The question of tempo in sight-singing examinations is genuinely contested. Some instructors maintain a strict, unforgiving beat, arguing that real-world sight-reading (in an orchestra or choral rehearsal) does not allow for the luxury of stopping to correct pitches. Others allow students to slow down when needed, arguing that accuracy at a reduced tempo is more educationally valuable than rhythmic steadiness at the cost of pitch accuracy. The research suggests that both constraints matter, and that the most educationally productive approach may be to set different expectations at different stages of training: flexibility in early instruction, stricter tempo maintenance as proficiency develops.

6.6 Interval Recognition

The traditional approach to interval recognition — playing isolated intervals and asking students to identify them by name, typically using melodic association (“ascending perfect fourth = ‘Here Comes the Bride’”) — has been criticized in the research literature as developing a skill that does not transfer well to real musical contexts. In actual musical listening, intervals are not heard as isolated dyads but as relationships within a key and a harmonic progression. A student who can reliably identify a perfect fourth in isolation may still fail to hear the raised fourth degree of a secondary dominant when it appears in a complex texture.

The research supporting contextual interval training — hearing and singing intervals as functional relationships within a tonal context — suggests that this approach produces better transfer to musical listening. In this approach, students learn to hear and sing the interval of a minor sixth not as an abstract dyad but as the relationship from \(\hat{3}\) to \(\hat{\flat}1\) (or another specific scale-degree pair) in a tonal context that makes the interval’s function clear.

6.7 Error Detection and Score Study

A powerful and underused aural skills activity is error detection: the instructor (or a recording) plays a passage while students follow along with a score, then identifies a passage in which what was played differs from what is written. Students must locate the discrepancy and describe it precisely. This activity develops precisely the skill that distinguishes a sophisticated orchestral musician from a novice: the ability to listen to a complex texture while reading a score and notice when what is heard diverges from what is notated.

Error detection is also highly motivating: it frames listening as an active investigative task rather than a passive reception task, and the “aha” moment of finding the discrepancy provides immediate positive reinforcement.

6.8 Technology in Aural Skills

The proliferation of software tools for aural skills training has transformed the practical landscape of the discipline. EarMaster, Auralia, Tenuto, and Tuning Trainer offer extensive libraries of automated exercises covering interval identification, chord recognition, melodic and harmonic dictation, and rhythmic reading. These tools have real advantages: they are available at any hour, they provide immediate feedback, they track student progress, and they free class time for higher-level activities.

Their limitations, however, are significant from a pedagogical perspective. Most software tools implement the discrete-skill, out-of-context approach that the research literature criticizes: they present isolated intervals or chords and require identification, rather than developing audiation of musical syntax. Student practice on these tools tends to improve performance on similar software tasks without necessarily improving musical hearing in context. The most defensible use of technology in aural skills pedagogy is as a supplement to context-rich, musicianship-oriented classroom instruction — not as a replacement for it.

6.9 Rhythm and Meter in Aural Skills

Rhythm and meter are sometimes treated as the “easy” half of aural skills — as if pitch is where the real difficulty lies. This is a mistake. The perception of meter — the hierarchically organized pattern of strong and weak beats that underlies rhythmic experience in tonal music — involves active cognitive construction, not passive reception. Research in music psychology (Longuet-Higgins and Lee, London’s Hearing in Time) shows that metrical interpretation is an active, expectation-driven process: listeners construct a metrical grid from rhythmic cues in the music and project that grid forward in time, and when the music violates their projection (syncopation, hemiola, cross-rhythm), they experience a characteristic perceptual tension.

Rhythmic dictation exercises are most effective when they begin with simple patterns in a single meter before introducing syncopation, ties across barlines, or compound subdivision. The most common failure mode in rhythmic dictation pedagogy is introducing notational complexity (eighth-note triplets in compound time) before students have internalized the metrical grid against which that complexity is defined. Students who do not yet have a secure sense of the underlying beat cannot hear how a rhythm departs from it, and cannot correctly transcribe that departure.

Definition 6.1 (Meter). Meter is the hierarchical organization of musical time into regularly recurring patterns of strong and weak beats. A simple meter (duple, triple, or quadruple) divides the beat into two equal sub-beats; a compound meter divides the beat into three equal sub-beats. The metrical hierarchy extends both upward (beats grouped into measures, measures grouped into hypermeasures) and downward (beats subdivided into eighth notes, sixteenth notes, or their compound equivalents). The perception of meter is not a direct reading of notated time signatures but an active construction by the listening mind, based on rhythmic pattern, accent, harmonic rhythm, and phrase structure.

6.10 Intonation Awareness and Tuning

Intonation — the accuracy with which a performer produces the intended pitch — is a practical concern for all musicians and a theoretical one for those who understand the mathematics of tuning systems. Aural skills courses that include intonation training — developing students’ ability to hear and correct tuning errors in their own singing and playing — build a skill that transfers directly to ensemble performance.

The pedagogical approach to intonation awareness proceeds from coarse to fine: first, students learn to distinguish in-tune from clearly out-of-tune playing (a fifth that is obviously flat or sharp); then to identify the direction of the error (flat or sharp); then to quantify the error in approximate cents; and finally to make the fine adjustments needed for ensemble tuning in a specific tuning system (equal temperament, just intonation for certain intervals, expressive intonation as a rhetorical device).

Remark 6.2. The **Tuning Trainer** software app (developed for string players but useful for all instrumentalists and vocalists) allows students to record their own playing and visualize intonation deviations in real time, comparing their produced pitch to a target. Combined with deliberate practice principles — targeting specifically the intervals or passages where intonation deviates most consistently — this kind of feedback loop can produce measurable intonation improvement over the course of a semester. The app's value is diagnostic: it makes visible a dimension of performance quality that is often discussed impressionistically ("your intonation is a bit off") but rarely analyzed systematically.

Chapter 7: Assessment Design and Inclusive Pedagogy

7.1 Formative vs. Summative Assessment

The distinction between formative and summative assessment is one of the most fundamental in educational research, and it is particularly important in music theory pedagogy because the two functions require different assessment designs.

Formative assessment provides information during the learning process that can be used to adjust instruction and learning: its purpose is not to evaluate and grade but to inform. Weekly homework assignments, in-class worksheets, low-stakes quizzes, and informal listening exercises are formative when their primary purpose is to reveal where students are in their learning and to guide subsequent instruction.

Summative assessment evaluates learning at the end of a unit or course: its purpose is to determine what students have learned, assign grades, and provide information for external accountability. Midterm examinations, final projects, and end-of-semester portfolios are typically summative.

The most effective theory courses use formative and summative assessment in tandem: formative assessment identifies misconceptions and gaps early enough that instruction can address them before the summative assessment, and summative assessment evaluates learning that has been supported throughout the term by ongoing feedback. A common failure mode is treating all assessment as primarily summative — weekly homework graded and returned but not discussed in class, with no mechanism for students to apply what they learned from the feedback before the next graded assessment. The research is clear that feedback produces learning only when students have the opportunity to act on it.

7.2 Designing Rubrics for Theory Assignments

A well-designed rubric for a theory assignment does several things simultaneously: it communicates expectations to students before they complete the assignment; it guides consistent grading by multiple instructors or graders; it provides specific, actionable feedback to students about the quality of their work; and it supports partial credit for partial competence.

Example 7.1 (Rubric for a Four-Part Chorale Exercise). A four-category rubric might assess: (1) Voice-leading correctness — parallel fifths, octaves, voice crossing, overlapping voices (weighted most heavily at 40%); (2) Harmonic accuracy — correct Roman numeral realization, correct doublings, correct treatment of tendency tones (30%); (3) Soprano melody quality — smooth motion, appropriate climax, adherence to melodic guidelines (20%); (4) Completeness — all voices present, all notes within range, all rhythm values correct (10%). Each category is described at four performance levels (exemplary / competent / developing / inadequate) with specific observable descriptors at each level, allowing different graders to assign similar scores to similar work.

7.3 Mastery-Based Grading

Mastery-based grading (sometimes called specifications grading or standards-based grading) represents a significant departure from traditional percentage-based grading systems. In a mastery-based system, each assignment or skill objective has a defined standard of acceptable performance — what competent work looks like — and students receive credit only when their work meets or exceeds that standard. Work that does not meet the standard is returned for revision and resubmission. Grades are assigned based on the number and type of mastery-level demonstrations a student has completed by the end of the semester, not on a weighted average of all work completed.

The pedagogical rationale for mastery-based grading is compelling: it ensures that students who initially struggle — perhaps because of uneven prior preparation — can still achieve mastery by the end of the semester through persistent effort and revision. A student who submits a chorale with many errors on the first attempt but who revises it to mastery level after feedback has learned the material. A grading system that averages the first-attempt score with the revised score penalizes the learning process itself. The research on mastery-based grading in music theory shows improved outcomes, particularly for students whose prior preparation is inconsistent.

7.4 Universal Design for Learning

Universal Design for Learning (UDL) is a framework for designing courses and learning environments that are accessible, from the outset, to the widest possible range of learners — rather than designing for an imagined “typical” student and then adding accommodations for students who deviate from the norm. UDL, developed by CAST (Center for Applied Special Technology), is organized around three core principles:

Multiple means of representation — presenting information in more than one format. In a music theory course, this means not presenting voice-leading rules only in written notation, but also playing audio examples, showing animated voice-leading diagrams, and demonstrating at the keyboard — providing multiple representational modes for students with different perceptual strengths.

Multiple means of action and expression — allowing students to demonstrate learning in more than one way. Rather than accepting only written notation as the output for a harmonization exercise, the instructor also accepts keyboard performance, a recorded singing of the soprano with vocal annotations of harmonic function, or an oral explanation accompanied by lead-sheet notation.

Multiple means of engagement — offering more than one route to motivation and participation. Rather than relying solely on high-stakes examinations as motivators, the instructor builds in choice and student agency in elective assignments, collaborative projects, and exploratory listening activities.

Definition 7.1 (Universal Design for Learning). Universal Design for Learning (UDL) is a research-based framework for curriculum and instructional design that proactively addresses the diversity of learner needs by providing: (1) multiple means of representation (how information is presented); (2) multiple means of action and expression (how students demonstrate learning); and (3) multiple means of engagement (how students are motivated and supported). UDL shifts the design goal from accessibility-as-accommodation — retrofitting standard designs for atypical learners — to accessibility-as-design — building inclusion into the course architecture from the beginning.

7.5 Music Theory’s Equity Problem

The traditional undergraduate music theory curriculum has been subject to significant and productive critical scrutiny in recent years, most prominently in the work of Philip Ewell, whose paper “Music Theory’s White Racial Frame” (2020, Music Theory Spectrum) argued that the discipline’s centering of J.S. Bach chorales and Germanic common-practice music as the primary pedagogical model implicitly encodes a racial and cultural hierarchy. Ewell’s arguments provoked extensive discussion and considerable controversy within the music theory community, but they have also catalyzed genuine reflection on curricular change.

The pedagogical question is practical: how can instructors expand the range of musical examples and traditions represented in the theory curriculum without sacrificing the structural coherence that makes the curriculum teachable? A core curriculum organized around a single repertoire — Bach chorales — has the advantage of consistency: students compare their work against a stable set of models, developing a clear sense of stylistic norms. Expanding to include jazz harmony, popular music, music from non-Western traditions, and non-canonical Western music is desirable but requires careful thought about how to maintain structural coherence.

Remark 7.1. The practical path forward is likely not to abandon the Bach chorale as a pedagogical model — it remains an extraordinarily efficient vehicle for teaching four-voice voice-leading — but to supplement it substantially with examples from other repertoires, and to be explicit with students about the historical and cultural specificity of the corpus from which it is drawn. Teaching the Bach chorale and teaching a jazz standard in the same unit — comparing their harmonic syntax, their voice-leading conventions, their approaches to dissonance — can deepen understanding of both rather than diluting either.

7.6 Stereotype Threat in the Theory Classroom

Stereotype threat — the phenomenon, documented extensively by Claude Steele and colleagues, whereby members of groups subject to negative stereotypes perform below their actual ability when the stereotype is made salient — is a real and consequential factor in music theory classrooms. Students of color, first-generation college students, and students whose primary musical experience is in popular or non-classical genres may arrive in a theory course carrying the implicit burden of stereotypes: that theory is a white, European, classical-music domain in which they do not naturally belong.

When those stereotypes are activated — by the repertoire used in class, by the way theory is framed as the study of “serious” music, or by the demographics of the classroom itself — student performance suffers in ways that are genuinely attributable to the cognitive burden of managing the threat, not to any difference in ability.

Instructors can reduce stereotype threat through several strategies: explicitly framing theory as a human universal that applies to all music, not as a culturally specific achievement of German composers; using diverse examples that reflect the musical backgrounds of the students in the room; avoiding early negative feedback that confirms students’ fears; and creating a classroom culture in which effort and growth are visibly valued over prior preparation.

Example 7.3 (Reducing Stereotype Threat Through Example Diversification). An instructor teaching a unit on secondary dominants uses the following example sequence. She begins with a brief passage from a Bach cantata — the expected canonical example — and analyzes the secondary dominant in Roman numerals. She then plays a short clip from a Stevie Wonder song (*Superstition*) and asks students to identify a moment that "sounds like" the Bach example. Students with backgrounds in popular music recognize the harmonic effect immediately; students whose backgrounds are primarily classical hear the popular music through their newly acquired analytical lens. Both groups discover that secondary dominants are not a feature of "classical music" but of tonal music broadly construed — a reframing that makes the concept feel less culturally exclusive and more musically universal.

7.7 Office Hours and Student Support

Research on student help-seeking consistently shows that students who most need academic support are least likely to attend office hours. The barriers are multiple: stigma (attending office hours signals weakness or inadequacy), unfamiliarity with academic culture (first-generation students are less likely to know that office hours are for routine academic help, not just crises), and logistical obstacles (office hours scheduled at inconvenient times, or in offices that feel intimidating). The terminology itself — “office hours” — signals a formal, hierarchical encounter that many students find uncomfortable.

Strategies that increase help-seeking include: renaming office hours “drop-in hours” or “open studio” to signal their low-barrier, informal character; holding them in a less formal location (a lounge, a practice room, a café); requiring all students to attend at least once during the semester (normalizing attendance so that it does not signal weakness); and building brief, low-stakes one-on-one interactions into the course structure — brief individual check-ins during lab sessions, short oral components of assignments. These structural changes communicate, more convincingly than any verbal reassurance, that seeking help is a normal and expected part of learning.

7.8 Assessment of Aural Skills

The assessment of aural skills — sight-singing and dictation — presents distinctive challenges that differ in important ways from the assessment of written theory skills. In a written chorale exercise, the student’s work is permanently recorded on paper: the instructor can examine it at leisure, apply the rubric carefully, and return detailed written feedback. In a sight-singing examination, the student’s performance exists only in the moment: it cannot be rewound and re-examined, it is affected by nervousness and the social dynamics of the examination setting, and it involves the student’s voice in a way that makes evaluation feel more personal and more exposing than written work.

Several strategies address these challenges. Recorded sight-singing examinations — students record themselves singing a prepared excerpt in a practice room and submit the recording — remove the real-time pressure of the live examination and allow the instructor to evaluate the recording carefully and repeatedly. They do not, however, test genuine sight-reading; a student who has practiced the specific excerpt will perform better than her actual sight-reading ability would predict. Live sight-singing with rotation — students receive a new excerpt at the moment of the examination with no preparation time — tests genuine first-reading ability but requires careful calibration of excerpt difficulty and may introduce significant performance anxiety.

Example 7.2 (Portfolio-Based Aural Skills Assessment). Some programs adopt a portfolio approach to aural skills assessment: rather than a single high-stakes sight-singing or dictation examination, students submit a portfolio of audio recordings and written transcriptions made throughout the semester, selecting their best examples at the end of the term for formal evaluation. The portfolio approach rewards improvement over time, allows students to demonstrate their abilities under less pressured conditions, and produces a rich record of student development that a single examination cannot capture. Its disadvantage is administrative: managing and evaluating a large number of recordings from many students is time-intensive and requires clear, consistently applied rubrics.

7.9 Grading and Academic Integrity

Music theory courses face distinctive academic integrity challenges related to the collaborative culture of music-making. Performers are accustomed to learning from each other by observation and imitation: a student who learns to play a passage by watching a more advanced player execute it is engaging in legitimate musical learning. This culture of sharing can translate, in a theory course, into practices that cross the line into academic dishonesty: copying a classmate’s homework assignment, using a website that provides completed chorale exercises, or using AI tools to generate Roman numeral analyses without doing the analysis oneself.

Designing assessments that are resistant to these forms of dishonesty requires attending to the specifics of the task. A harmonization exercise that gives every student the same soprano melody is easily copied; a harmonization exercise that gives each student a different melody (drawn from a large bank of melodies with similar characteristics) makes copying useless. An analysis exercise on a canonical Bach chorale that is freely available online and on which AI tools perform well is easily delegated; an analysis exercise on an obscure and unpublished chorale from a library special collection provides less opportunity for substitution.

Remark 7.2. The most effective deterrent to academic dishonesty in theory courses is not punitive policy but authentic task design: creating assessments that are inherently personal and difficult to delegate. An oral examination in which the student analyzes an unfamiliar excerpt in real time, explaining her reasoning as she goes, cannot be completed by a stand-in or an AI tool. A composition assignment requiring the student to reflect in writing on the specific harmonic decisions she made and why makes the process of making those decisions — rather than merely their product — the object of assessment. These approaches are more time-intensive to design and evaluate than standard written exercises, but they produce more reliable evidence of genuine learning.

7.10 Connecting Assessment to Student Motivation

The relationship between assessment and student motivation is more complex than the simple equation “grades motivate students” suggests. High-stakes summative assessments — midterms and finals — can suppress intrinsic motivation: when students know that a single examination determines a large fraction of their grade, they are incentivized to study for the test rather than to develop genuine musical understanding. The cramming-and-forgetting cycle that results is well-documented in educational psychology: students memorize patterns for the examination, receive their grade, and then experience rapid forgetting because the knowledge was never meaningfully integrated.

Assessment practices that support intrinsic motivation tend to be characterized by: multiple low-stakes opportunities for demonstration rather than single high-stakes examinations; authentic tasks that have clear musical relevance beyond the classroom; opportunities for student choice within assignments; transparent criteria that students understand before beginning the task; and formative feedback that helps students improve rather than merely evaluating their current performance. In a theory course designed around these principles, assessment becomes a tool for learning rather than a measurement of learning after the fact.

Remark 7.3. The phrase "grading on a curve" is sometimes used in theory courses to manage the distribution of grades in large sections, adjusting scores upward when a class performs below expectation on an examination. This practice, while administratively convenient, sends exactly the wrong motivational message: it tells students that their grade depends not on what they know but on how they perform relative to their peers, turning the theory classroom into a competitive environment rather than a collaborative one. Absolute grading standards — defined criteria that a student either meets or does not, regardless of what her peers do — are more consistent with growth-mindset pedagogy and with mastery-based learning philosophy.

Chapter 8: The Theory Classroom as Creative Space

8.1 Theory as a Tool for Composition

One of the most productive reframings of music theory pedagogy in recent decades has been the insistence that theory is not a set of rules for evaluating music after the fact but a set of generative resources for making music in the first place. The danger of teaching theory as a catalogue of prohibitions — don’t write parallel fifths, don’t double the leading tone, don’t approach a sixth chord from the same chord in root position — is that students experience it as a constraint system designed to limit what they can do. The possibility of teaching theory as a generative grammar — here are the patterns and relationships that create tonal coherence, and you can use them to make music — is more powerful and more accurate to the history of theory itself.

Composition integrated into the theory curriculum can take many forms: writing original melodies in the style of a given composer, harmonizing those melodies according to the conventions studied, writing a short invention following Baroque contrapuntal conventions, or composing a short piece in a specified form (binary, ternary, rounded binary) using the harmonic vocabulary of the current unit. The key pedagogical principle is that the compositional task should require genuine application of theoretical knowledge — not just recall of rules but musical judgment about which of several permissible options sounds best in context.

8.2 Improvisation as a Theory Pedagogy Tool

Improvisation is perhaps the most under-used resource in tonal harmony pedagogy. Realizing a figured bass at the keyboard — taking a bass line annotated with figured-bass numerals and improvising an upper-voice realization — was the standard method by which Baroque musicians learned harmonic syntax, and it remains one of the most effective ways to develop harmonic fluency. The kinesthetic and aural experience of improvising harmonic progressions at the keyboard develops an intuitive command of voice-leading that written exercises, however carefully completed, rarely achieve.

Jazz-derived pedagogical strategies are equally valuable in a tonal harmony context. Playing over a simple harmonic progression — improvising a melody that follows the harmony, hearing which scale tones sound consonant against which chords — develops harmonic hearing from the inside. Voice-leading improvisation — taking a harmonic skeleton (a Roman-numeral sequence) and improvising the inner voices of a four-voice texture at the keyboard — is a demanding but extraordinarily effective technique for developing the internalized command of voice-leading that distinguishes a fluent harmonic thinker from one who merely knows the rules.

Example 8.1 (Figured Bass Improvisation as Classroom Activity). The instructor writes a bass line with figured-bass numerals on the board: an eight-measure progression moving through tonic, subdominant, and dominant harmonies with a V7–I cadence. Students take turns at the piano, each realizing the progression in a different way — some with close-position chords, some with a more spread voicing, some adding simple passing tones in an upper voice. After each student plays, the class identifies the voice-leading choices made, notes any parallel fifths or improper resolutions, and proposes alternatives. The activity combines theory, ear training, and keyboard harmony in a single efficient pedagogical moment.

8.3 Analysis as Performance Preparation

The theory classroom has the potential to serve as a site of preparation for musical performance — not only for performers who will take advanced theory electives, but for all students in the core sequence. Harmonic and formal analysis of a piece prepares a performer to make informed interpretive decisions: understanding the large-scale harmonic plan of a sonata movement helps a performer calibrate dynamic and agogic emphases toward the structural moments; recognizing the phrase structure of a theme helps a performer shape the phrase toward its cadence; identifying a deceptive cadence alerts a performer to the moment of syntactic surprise that the composer has engineered.

The pedagogical approach that links analysis to performance preparation asks students not only to label the Roman numerals of a score but to formulate and defend interpretive decisions based on their analysis. “I will play this phrase with a slight ritardando at the deceptive cadence because the syntactic surprise warrants a moment of reflection.” This kind of guided analytical-interpretive dialogue between theory and performance is one of the most productive educational experiences available in the theory classroom, and it is one of the most reliable ways to convince performing students that theory is relevant to their musical lives.

The connection between analysis and performance is not merely one-directional. Performance experience also informs analysis: a pianist who has physically executed the technical challenges of a passage — who knows from muscular memory where the difficulty lies, where the natural breathing points are, where the phrase wants to rush or linger — brings a dimensionality to the analysis that a purely visual, score-reading approach cannot match. Some of the most penetrating analyses of tonal music have been written by performer-scholars who bring both technical knowledge and intimate performance experience to the analytical task: Charles Rosen’s analyses of Classical style, for example, are inseparable from his career as a concert pianist.

Remark 8.6. The model of the theory course as performance preparation raises a challenge for programs in which theory is taught by specialists with primarily academic rather than performance careers. A theory teacher who does not perform regularly may struggle to make the connection between theoretical analysis and performance interpretation feel authentic — because for her, it may not be authentic. Programs that assign performing faculty to co-teach theory units, or that require theory students to discuss their theoretical analyses with their applied teachers, can bridge this gap structurally rather than depending on each individual theory instructor to be equally fluent in both analytical and performative modes.

8.4 Technology in the Theory Classroom

The landscape of technology for music theory instruction has expanded dramatically in the last decade. MuseScore (free and open-source notation software) allows students to enter and play back four-part chorale exercises, immediately hearing the result of their voice-leading choices. Noteflight offers browser-based notation with collaboration features, enabling students to share and comment on each other’s work. Flat.io provides a similar interface with strong accessibility features. Hook Theory offers a browser-based harmonic analysis tool organized around chord function that is particularly useful for popular music analysis.

For audio production in a theory context, Soundtrap and GarageBand allow students to create harmonic progressions using MIDI instruments, developing aural intuition for harmonic color and texture through the medium of production technology that many students find more accessible than traditional notation. These tools are not replacements for traditional theory skills — they cannot substitute for the ability to read and write music notation fluently — but they are powerful supplements that expand the range of productive engagement with musical material available in the classroom.

Remark 8.1. Technology in the theory classroom is most effective when it serves a clearly defined pedagogical purpose rather than being used because it is available. The question to ask before introducing any tool is: what does this enable students to do that they could not do (or could not do as effectively) without it? MuseScore enables immediate aural feedback on a written exercise — a genuine pedagogical advantage. Using a tablet app to display a static score that the instructor could simply project from a PDF adds no pedagogical value. Technology should expand the range of musical engagement, not merely digitize existing activities.

8.5 Peer Learning and Collaborative Activities

The research on peer learning in skill-based courses is consistently positive: students who work collaboratively on analytical and compositional tasks learn more, and learn more durably, than students who work alone. The explanations for this are several: students who must explain their reasoning to a peer are forced to articulate it more precisely; students who hear a different explanation of the same concept from a peer often understand it more readily than from an instructor; the peer relationship reduces the social threat that can inhibit learning in teacher-student interactions; and collaborative work develops the metacognitive skills (awareness of one’s own understanding and gaps) that are crucial for independent learning.

Collaborative activities for the theory classroom include: paired sight-singing (each student sings a different voice from a two-voice canon, then switches); group harmonization (small groups agree on a harmonic progression for a given melody and defend their choice to the class); peer critique of counterpoint exercises (students exchange exercises and apply the species rules as a rubric, then discuss discrepancies); and analytical group projects in which teams prepare a presentation on a large-scale form from a single score, each team member responsible for one aspect of the analysis.

The research on peer instruction — a specific collaborative technique developed by Harvard physicist Eric Mazur and widely adapted across disciplines — is particularly relevant for the theory classroom. In a peer instruction sequence, the instructor poses a conceptual question (for example: “Which of the following harmonizations of this melody is most stylistically appropriate, and why?”); students respond individually (typically via clicker or show-of-hands); then students who gave different answers discuss their reasoning with their neighbors for two to three minutes; then the class votes again. The discussion phase consistently improves the proportion of correct answers, and the improvement reflects genuine conceptual understanding developed through the articulation and exchange of reasoning — not merely social conformity to a perceived consensus. Adapted to music theory, peer instruction can be used for harmonic analysis (“vote for the Roman numeral that best describes this chord”), voice-leading evaluation (“vote for the counterpoint example with the most severe error”), and even aural skills (“vote for the solfège syllable of this scale degree after hearing the passage once”).

Example 8.3 (Peer Instruction in a Harmony Course). The instructor displays four harmonizations of the same soprano melody on the board — three student-submitted examples from the previous week's homework, anonymized, and one model solution — without indicating which is the model. Students study the four options for two minutes, then vote for which one they consider most stylistically successful. In the ensuing small-group discussion, students must articulate their reasoning: "I voted for option B because the bass motion is smoother and it avoids the voice crossing I see in option A." The instructor then reveals that option C is the model, explains the voice-leading advantages that distinguish it, and asks students to discuss what they would need to change in option B to bring it closer to the model's quality. The exercise makes comparative harmonic evaluation — a high-order analytical skill — the focus of structured peer discourse.

8.6 The Reflective Practitioner

Donald Schön’s influential book The Reflective Practitioner: How Professionals Think in Action (1983) introduced the concept of reflection-in-action — the capacity of skilled practitioners to think systematically about what they are doing while doing it, adjusting their approach in real time in response to what they observe. For music theory teachers, Schön’s framework suggests that effective teaching is not a matter of implementing a fixed set of techniques but of continuously observing what is happening in the classroom, theorizing about why it is happening, and adjusting accordingly.

Practical tools for cultivating reflective practice include: teaching journals (brief written records of what happened in each class session, what worked, what did not, and what adjustments to make); peer observation (inviting a colleague to observe a class and provide structured feedback, then reciprocating); student feedback loops (mid-semester course evaluations that give students a genuine voice in how the course is going, with the instructor’s response shared with the class); and lesson study (collaborative planning, observation, and debriefing of individual lessons with a group of colleagues).

Schön distinguishes reflection-in-action (adjusting in real time) from reflection-on-action (stepping back after the fact to analyze what happened and why). Both are necessary. Reflection-in-action is the moment when a teacher notices that her explanation of the augmented sixth chord is producing confused faces and pivots immediately to a keyboard demonstration; reflection-on-action is the moment at the end of the day when she asks herself why the explanation failed, what alternative framings she might try, and whether the confusion reflects a gap in prior knowledge that needs to be addressed before continuing. Neither form of reflection happens automatically or without effort; both must be deliberately cultivated, and both improve with practice and with the feedback of trusted colleagues.

Remark 8.5. The teaching journal is the most accessible and the most underutilized tool in the reflective practitioner's repertoire. Even a five-minute post-class entry — "what did I plan to do, what actually happened, what surprised me, what I would do differently" — accumulates over a semester into a record of genuine pedagogical development. Journals kept over multiple semesters allow teachers to identify recurring patterns: the same explanation that confused students in one year tends to confuse them in the next; the activity that worked brilliantly last spring can be scheduled again this fall. The journal is not a performance document; it is a private thinking space, and its value is proportional to the teacher's willingness to be honest in it.

8.7 Preparing to Teach: The Portfolio and the Job Interview

Graduate students preparing for academic careers as music theory teachers must develop not only the research skills expected of any doctoral candidate but also the teaching competencies — and the capacity to demonstrate those competencies to a hiring committee — that academic positions require.

The Eastman School of Music’s graduate pedagogy certificate, the Peabody Conservatory’s MM in Theory Pedagogy, and the University of North Texas’s graduate theory teaching practicum are among the most structured programs for developing these competencies systematically. Each involves some combination of coursework in pedagogy (the kind of material covered in this course), supervised teaching experience, peer observation and feedback, and guided reflection. Students who complete these programs are demonstrably better prepared for the pedagogical demands of faculty positions than those who teach without structured mentoring — not because they have received a fixed set of techniques, but because they have developed the reflective habits and the professional vocabulary that allow them to continue growing as teachers throughout their careers.

Definition 8.1 (Teaching Philosophy Statement). A teaching philosophy statement is a one-to-three-page reflective essay, typically required in academic job applications and teaching portfolios, in which an instructor articulates: the core values and beliefs that guide her teaching; the specific pedagogical strategies she employs and their rationale; evidence from her teaching experience that illustrates her approach; and how her teaching has developed and continues to develop. A strong teaching philosophy statement is specific rather than generic (citing actual classroom experiences), grounded in educational principles (showing familiarity with relevant learning theory), and honest about challenges and growth rather than merely celebratory. In music theory, a compelling teaching philosophy statement will speak specifically to the challenges of making abstract harmonic and contrapuntal concepts meaningful for working musicians.

The teaching portfolio typically includes: a teaching philosophy statement; syllabi from courses taught; sample assignments with grading rubrics; sample student work with instructor feedback (anonymized); a record of teaching evaluations; and evidence of professional development in pedagogy — workshops attended, pedagogy courses taken, peer observations conducted.

In the music theory job interview, teaching-related questions have become increasingly central as departments have come to prioritize teaching effectiveness alongside research productivity. Candidates may be asked to teach a sample lesson in front of a live group of students or faculty — a “teaching demonstration” — and to field questions about their pedagogical approach, their experience with diverse student populations, their philosophy of assessment, and their plans for developing new courses. Candidates who have thought carefully about why they teach the way they do, and who can articulate that thinking with specific examples, have a significant advantage over those who treat teaching as a set of default behaviors rather than a reflective professional practice.

The teaching demonstration deserves special attention in graduate preparation. Unlike the research presentation — a genre in which most doctoral students receive extensive mentoring — the teaching demonstration is often attempted without adequate preparation. It is a performance in a double sense: the candidate is both teaching a class of students and performing for an audience of evaluators who are watching not the students but the teacher. The most common failure modes in teaching demonstrations are: trying to cover too much material in the allotted time, producing a lecture that is impressive in content but leaves no time for student engagement; failing to interact genuinely with the student audience, treating the demonstration as a monologue rather than a dialogue; and selecting demonstration content that showcases the candidate’s research interests rather than material appropriate for the course level and student population specified by the hiring department.

Example 8.4 (Preparing a Theory Teaching Demonstration). A doctoral candidate preparing a teaching demonstration on the topic of "voice-leading in the Bach chorale" (assigned by the hiring department for a second-semester undergraduate harmony course) designs her demonstration as follows: she opens with a two-minute listening activity (students hear a chorale excerpt and describe what they notice about the motion of the bass line); she introduces the concept of voice-leading independence with a pair of keyboard examples (one with good independent voices, one with voices moving in parallel); she gives students a three-minute guided activity — identify the parallel motion error in a short chorale excerpt on the handout; she takes student answers and uses them to explain the rule's musical rationale; and she closes by playing a corrected version of the excerpt at the piano so students can hear the difference. In thirty minutes, she has included listening, analysis, keyboard demonstration, a guided activity, class discussion, and synthesis — and she has managed to do all of this without reading from notes.

8.8 The Future of Music Theory Pedagogy

The COVID-19 pandemic of 2020–2021 forced the rapid and largely unplanned migration of music theory instruction from in-person to online formats. This experiment, however painful and disruptive, produced valuable knowledge: some theory instruction — particularly the analytical and written theory components — transferred to online formats more effectively than expected, with asynchronous video explanations allowing students to pause and replay difficult concepts at their own pace. Aural skills instruction transferred more poorly: the latency of internet audio makes real-time sight-singing and dictation activities extremely difficult, and the absence of in-person ensemble listening contexts impoverished the aural experience.

Looking forward, adaptive learning platforms — systems that monitor student performance in real time and adjust the difficulty and sequence of exercises accordingly — offer the possibility of genuinely personalized instruction: each student works at precisely her level, receiving immediate targeted feedback on her specific errors. The challenge is that currently available adaptive systems focus almost exclusively on discrete-skill practice (interval identification, chord spelling) rather than the integrated musicianship that is the true goal of aural skills education.

Artificial intelligence tools — particularly those capable of providing automated feedback on student-composed chorales and counterpoint exercises — are advancing rapidly. Several research groups are developing systems that can identify common voice-leading errors (parallel fifths, improper resolutions of tendency tones) in student work with high accuracy. If these systems become reliable and pedagogically sophisticated, they could free instructors’ time from routine error correction for higher-level engagement — discussion of musical interpretation, compositional strategy, and analytical insight.

The persistent and most important question, however, is not technological but human: what does the music theory teacher do that technology cannot replicate? The answer, this course suggests, lies in the relational, interpretive, and creative dimensions of pedagogy: the teacher who recognizes the specific learning obstacle a specific student is facing and finds exactly the right re-explanation; the teacher who plays an unexpected example at the piano that makes a concept suddenly vivid; the teacher who creates a classroom culture in which intellectual risk-taking and musical curiosity are genuinely valued. Technology can automate the identification of parallel fifths. It cannot, yet, inspire a student to fall in love with the harmonic world of a Bach chorale.

8.9 Diversity and Equity in the Creative Theory Classroom

The creative-space model of the theory classroom has particular relevance for questions of equity and inclusion. If theory is taught primarily as a set of rules for evaluating a fixed canon of works — and if that canon is predominantly German, male, and from the eighteenth and nineteenth centuries — then the creative possibilities the course opens are correspondingly narrow. Students whose musical formation was in jazz, in West African music, in Indian classical music, in hip-hop production, or in any number of other traditions encounter a theory classroom that tacitly tells them their music is not the subject of serious theoretical inquiry.

Expanding the creative possibilities of the theory classroom means, first and most obviously, expanding the repertoire of musical examples and models: analyzing a James Brown groove alongside a Bach sarabande, studying the harmonic language of Thelonious Monk as seriously as that of Brahms, engaging with the rhythmic complexity of West African drumming ensembles as a site of theoretical insight about polyrhythm and metrical ambiguity. But it also means reconsidering the compositional models that frame creative assignments: asking students to compose “in the style of J.S. Bach” implicitly suggests that Bach’s style is the normative creative destination, and that other styles are either preparatory stages on the way there or departures from it. Asking students instead to compose in their own musical language, using the theoretical tools of the course as resources rather than constraints, makes the course’s creative space genuinely broader.

Remark 8.3. The practical challenge of expanding the creative theory classroom is not primarily ideological — most instructors are willing in principle to engage with diverse musics — but pedagogical. How does one teach a unified skill set (voice-leading, counterpoint, harmonic function) while simultaneously honoring the diversity of musical traditions in which those skills manifest very differently? One productive approach is to use the skill as a lens rather than a template: rather than asking "does this music follow tonal voice-leading rules?" ask "what are the voice-leading principles that govern this music, and how do they compare to the tonal principles we have been studying?" This reframing turns diversity from a problem (some music does not fit the rules) into a resource (different musics illuminate different aspects of how musical lines and harmonies relate).

8.10 The Theory Teacher as Musician

One of the most important things a theory teacher can do for her students — and one of the most often neglected — is to remain an active musician herself. Theory teachers who perform regularly, compose actively, or engage seriously with a broad range of musical repertoire bring a quality of musical aliveness to their teaching that cannot be replicated by those whose musical engagement has been reduced to grading chorales and preparing lectures.

The connection is not merely atmospheric. A theory teacher who is currently working on a Beethoven sonata for a faculty recital will have intuitions about the harmonic analysis of Beethoven’s late style that a teacher who has not performed Beethoven in a decade will lack. A theory teacher who composes will bring genuine compositional curiosity to the task of designing composition assignments — asking not only “will this exercise teach the relevant skill?” but “is this actually an interesting compositional problem?” A theory teacher who listens widely and adventurously will be able to bring unexpected connections and surprising examples that enrich the classroom far beyond the textbook examples.

Example 8.2 (Bringing Musical Experience into the Classroom). A theory instructor who is also an active jazz pianist describes, at the beginning of a unit on secondary dominants, a moment from her practice session the previous evening: she was improvising over the changes of a Coltrane ballad and found herself naturally moving through a chain of secondary dominants before arriving at the tonic — a harmonic motion she had executed hundreds of times by ear but had never analyzed explicitly until teaching the unit. She plays the passage on the classroom piano, then analyzes it in Roman numerals on the board, showing the class exactly how the abstract concept of secondary dominant describes something she had been doing intuitively for years. The anecdote models the relationship between musical intuition and theoretical understanding that the course is designed to cultivate: theory is not imposed from outside musical experience but arises from and returns to it.

8.11 Program Review and Curriculum Revision

Music theory programs should engage in systematic program review — a structured process of evaluating the effectiveness of the curriculum and making evidence-based revisions — on a regular cycle, typically every five to seven years. A productive program review asks: What are students learning by the end of the sequence? How do we know? What are students not learning that they should be? What changes in the discipline, in the student population, or in the broader musical culture suggest revisions to the curriculum?

Evidence for program review can be gathered from multiple sources: exit surveys of graduating seniors, follow-up surveys of recent alumni asking what they wish they had learned, faculty observation of what upper-division students are able and unable to do, analysis of student performance on capstone assessments, and comparative curriculum study (what are peer institutions doing?). The most common finding of music theory program reviews is that students leave the core sequence with stronger skills in written harmonic analysis than in aural skills, and that they struggle to apply theoretical concepts to unfamiliar repertoire — a finding that speaks directly to the transfer problem discussed in Chapter 2 and that typically motivates revisions in the direction of more repertoire diversity and more integrated theory-and-aural-skills instruction.

Definition 8.2 (Program Review). A program review in music theory is a structured, evidence-based evaluation of the effectiveness of a music theory curriculum in achieving its stated learning outcomes. A comprehensive program review typically includes: (1) documentation of current learning outcomes and curriculum structure; (2) collection of evidence of student learning at key points in the sequence; (3) comparison of outcomes against peer institutions and accreditation standards; (4) identification of areas of strength and areas for improvement; (5) development and implementation of a revision plan; and (6) documentation of the process for accreditation and internal accountability purposes. Program review is most productive when it involves all instructors in the theory area, engages students as informants about their experience, and results in concrete, implemented changes rather than reports that are filed and forgotten.
Remark 8.4. The most enduring challenge of music theory pedagogy is not curricular or technological but motivational: how to help students experience the theory classroom not as a set of obstacles between them and their music-making but as a resource that deepens and enriches their musical life. When a student who has struggled through a semester of four-part writing sits at the piano, plays a progression she has harmonized, and hears — for the first time with clear ears — the logic of the voice-leading she has written, the leading tone resolving, the inner voices moving smoothly, the bass grounding the harmony — that moment of recognition is the goal of everything this course has described. The techniques and frameworks of these notes are means; that moment of musical awakening is the end.

8.12 Conclusion: The Theorist as Teacher

The graduate student in music theory occupies a distinctive position in the ecology of the academic music profession. She is, simultaneously, a scholar pursuing original theoretical and analytical research; a developing teacher learning to communicate difficult ideas to students at very different levels of preparation; a musician whose theoretical work is — or should be — rooted in genuine musical engagement; and a member of a professional community whose standards, values, and debates she is just beginning to understand. The pedagogy course is the site at which all these roles are explicitly reflected upon together.

This course has surveyed the major intellectual resources available to the developing music theory teacher: the learning theories of Piaget, Vygotsky, Sweller, Ericsson, and Dweck; the pedagogical frameworks of backward design, mastery-based grading, and Universal Design for Learning; the specific challenges of teaching tonal harmony, counterpoint, and aural skills; and the equity and inclusion imperatives that call on theory teachers to examine and expand the cultural assumptions embedded in their curricula.

What this course cannot provide — and what no course can provide — is the practical wisdom that comes from actually teaching: from standing in front of a room of skeptical first-year musicians and finding a way to make the half-diminished seventh chord matter to them; from reading a student’s counterpoint and finding words that will help her understand what has gone wrong; from designing an examination that is fair, meaningful, and aligned with what the course has actually taught. That practical wisdom is developed through teaching, reflecting on teaching, talking about teaching with colleagues, and teaching again. The invitation this course extends is to approach that ongoing practice with the seriousness, curiosity, and intellectual rigor that music itself deserves — and that every student who encounters music theory, however briefly, is owed.

The chapters of these notes have tried to model that seriousness by taking each dimension of the theory teacher’s work — the learning science, the curriculum design, the specific pedagogical challenges of harmony and counterpoint and aural skills, the equity imperatives, the assessment design, and the creative possibilities — as worthy of the same careful, evidence-informed thought that we bring to our analytical and theoretical research. Theory pedagogy is not a soft supplement to the serious work of the discipline. It is serious work: the work of shaping how the next generation of musicians hears and understands and makes music. No other work that a music theorist does touches as many lives as directly, or leaves as lasting a mark on the musical world. To do it well is, finally, its own kind of scholarship — and its own kind of music-making. The theorist who teaches well extends her analytical and theoretical intelligence into a domain that most theory research never reaches: the lived musical development of real students, in real classrooms, at real moments in their musical lives. That is a privilege, and a responsibility, that deserves to be taken as seriously as any publication in Music Theory Spectrum or any paper delivered at the SMT annual meeting. These notes have tried to provide some of the intellectual tools for taking it seriously. The rest is practice.

Remark 8.7. A final practical note for those preparing for the academic job market: the best preparation for the pedagogical demands of a faculty position is not the accumulation of pedagogical techniques but the cultivation of genuine pedagogical curiosity — the habit of wondering, after every class session, whether there might be a better way. Faculty who continue asking that question throughout their careers, who read the pedagogical literature, who welcome peer observation, who remain genuinely interested in their students' learning experiences, consistently produce better educational outcomes than those who settled into a fixed teaching routine early in their careers and have taught the same way ever since. The reflective practitioner that Schön describes is not an ideal type reserved for extraordinary teachers; it is a professional standard within reach of any teacher who chooses to pursue it.
Back to top