TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Automatic Piano Transcription with Hierarchical Frequency-...

Automatic Piano Transcription with Hierarchical Frequency-Time Transformer

Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji

2023-07-10Music Transcription
PaperPDFCode(official)

Abstract

Taking long-term spectral and temporal dependencies into account is essential for automatic piano transcription. This is especially helpful when determining the precise onset and offset for each note in the polyphonic piano content. In this case, we may rely on the capability of self-attention mechanism in Transformers to capture these long-term dependencies in the frequency and time axes. In this work, we propose hFT-Transformer, which is an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture. The first hierarchy includes a convolutional block in the time axis, a Transformer encoder in the frequency axis, and a Transformer decoder that converts the dimension in the frequency axis. The output is then fed into the second hierarchy which consists of another Transformer encoder in the time axis. We evaluated our method with the widely used MAPS and MAESTRO v3.0.0 datasets, and it demonstrated state-of-the-art performance on all the F1-scores of the metrics among Frame, Note, Note with Offset, and Note with Offset and Velocity estimations.

Results

TaskDatasetMetricValueModel
Music TranscriptionMAESTROOnset F197.44hFT-Transformer
Music TranscriptionMAPSOnset F185.14hFT-Transformer

Related Papers

Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription2025-06-17Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System2025-05-22Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio2025-05-19Automatic Music Transcription using Convolutional Neural Networks and Constant-Q transform2025-05-07Music Tempo Estimation on Solo Instrumental Performance2025-04-25Scalable Approximate Algorithms for Optimal Transport Linear Models2025-04-06Multi-task learning-based temporal pattern matching network for guitar tablature transcription2025-04-03D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription2025-01-09