TutorialMusic TranscriptionAndrew Carlins8 min read

How to Transcribe Jazz Piano Chords With AI

Jazz piano hides its secrets in the voicings. Working out a pianist's chords by ear can take hours, and you still miss the inner voices. Here's how to use AI transcription to get the notes down, then read the harmony back as chords you can study.

How to Transcribe Jazz Piano Chords With AI

Jazz piano hides in its voicings. A great pianist plays a chord you recognize, then spreads it across both hands with extensions, tensions, and an inner line that moves while the outer notes hold. You hear that it sounds rich. Working out exactly which notes built that sound is the hard part, and it is where most ears give up.

AI transcription closes that gap. You record the performance, run it through a piano model, and read the harmony back as notation with chord symbols on top. This guide covers how to capture jazz piano cleanly, what the transcription gives you, and where the densest voicings still need your eyes.

Why Jazz Voicings Are Hard by Ear

A pop song often gives you a clean triad you can name in a second. Jazz piano rarely does that. Take a symbol like "Dm7". It reads as D minor seventh, the four notes D, F, A, and C, a D minor triad with a flat seventh added on top. On the page it looks simple. At the keyboard a jazz pianist might come out as a rootless voicing, with the ninth and eleventh stacked above the third and seventh and the root left for the bassist. The notes sit close together, and several of them are dissonant on purpose.

That density is what makes ear transcription slow. You can pick out the top note and the bass, but the two or three notes in the middle blur together. This is the same problem covered in our explainer on monophonic versus polyphonic transcription: many notes at once are far harder to separate than one note at a time. Jazz piano is polyphony at its densest.

Step One: Capture a Clean Recording

The recording decides how good the transcription gets. Piano is our most mature model, and it does its best work on a clear, isolated solo piano take. The fewer competing instruments, the more cleanly it reads the inner voices.

  • Prefer a solo piano source. A studio solo track or a close acoustic recording beats a live trio mix where bass and drums bleed into the piano.
  • Isolate the piano if you must. If all you have is a full band recording, separate the piano stem first. Cleaner input gives a cleaner result.
  • Use direct MIDI when you can. If you are playing a digital piano, capturing the MIDI gives perfect note data and the cleanest possible transcription.

Step Two: Read the Voicings Back as Notation

Upload the recording and choose piano. You get back editable notation plus a piano roll showing every note as a block on a grid. The model splits the playing into left and right hands and lays it out on a grand staff. For a clean solo take, the result is close to what a skilled transcriber would produce, including the inner voices you could not pick out by ear.

The piano roll is the best view for studying a dense voicing. Each note in the stack is a separate block, so you can see exactly how the pianist spaced the chord. You can generate this from any recording with a piano transcription, and our step-by-step guide to transcribing piano music with AI covers the upload settings that give the cleanest output.

Step Three: Use the Chord Labels as a Starting Point

The AI detects the chord at each point in the performance and labels it. Those labels are a strong starting point for understanding the harmony. They tell you where the progression turns, where a ii-V lands, and which chord a busy voicing is built on.

The "ii-V" is worth knowing, because it is the backbone of most jazz tunes. The roman numerals count chords up from the key: in C major, ii is D minor (the chord on the second degree), V is G dominant (the fifth), and I is C major (the home chord). String them together and you get the ii-V-I, a Dm7 to G7 to Cmaj7 cadence that pulls the ear back home. Once you can spot it, the chord labels stop being a list of isolated symbols and start reading as a sentence. When you read the harmony back from a transcription, recognizing a ii-V tells you the function of each chord, not just its name, and that is what makes a transcribed chart something you can actually play from.

Be ready to refine a few. The same stack of notes can be read more than one way depending on the bass and the context, and jazz harmony is full of those ambiguities. Use the labels to navigate the chart, then confirm them against the notes on the staff. If your goal is the chords and melody alone, you can keep the top line and the symbols and trim the rest into a lead sheet. Our guide to transcribing a lead sheet from a recording walks through that trim, and what a lead sheet is explains the format.

Step Four: Edit the Densest Spots

A few passages may need a second look. Very dense voicings, where four or five notes crowd a narrow span, and fast comping are the spots where the model can struggle, sometimes dropping or adding a note. Scan those sections in the piano roll, where a stray note is easy to see and drag into place. The rhythm of a swung, rubato passage may also want a nudge to read cleanly on the page. Everything else, the long held voicings and the clear single-line solos, tends to come through accurate. Treat the transcription as a faithful draft you refine, and use it to study how a player you admire builds their harmony. Our guide to analyzing songs you admire covers turning that study into your own writing. The voicing you could never name by ear is now sitting on the staff in front of you.

Frequently Asked Questions

Why does the same voicing sometimes get two different chord labels?

Because a stack of notes can belong to more than one chord depending on the bass and the surrounding harmony. A rootless voicing of D, F, A, and C reads as Dm7 with a D in the bass, but the same notes over an F can read as F6, and over a B-flat they lean toward a B-flat major shape. The AI picks the reading it finds most likely, and on ambiguous spots that may not match the function you hear. When a label looks off, check what the bass is doing and what the chords around it imply, then relabel to the reading that fits the progression.

Does the transcription give me chord symbols?

Yes. The AI detects the chord at each point in the performance and gives you a chord label to go with the notes. Those labels are a strong starting point for studying the harmony. On dense jazz voicings you may relabel a few, since the same stack of notes can be read more than one way depending on the bass and the context. Use the labels to navigate the harmony, then confirm them against the notation.

What recording works best for jazz piano transcription?

A clean, isolated solo piano recording. The fewer competing instruments, the better the model reads the voicings. A studio solo track or a close recording of an acoustic piano works far better than a live trio mix where bass and drums bleed into the piano. If you only have a full band recording, isolate the piano first; cleaner input gives a cleaner result.

How do I turn the transcription into a lead sheet?

Keep the melody and the detected chord symbols, then drop the rest. A full transcription captures every voiced note; a lead sheet keeps only the top-line melody and the chord labels above it. You trim the inner voices in the editor and export the melody-plus-chords version. That gives you a chart a band can read while leaving the dense voicing study in the full transcription.

About the author

Written by

Andrew Carlins

Co-Founder & CEO, Songscription

Andrew co-founded Songscription at Stanford with a few fellow musicians who were tired of not finding the notes to the songs they wanted to play. He grew up playing piano and baritone saxophone and performing in musical theater, and though he hasn't performed in years, he likes to think he's still pretty sharp. He writes about getting a song off the recording and onto the page.

More about the team

Keep exploring more posts on the same topics.