TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Lightweight Instrument-Agnostic Model for Polyphonic Not...

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert

2022-03-18Music Transcription
PaperPDFCode(official)

Abstract

Automatic Music Transcription (AMT) has been recognized as a key enabling technology with a wide range of applications. Given the task's complexity, best results have typically been reported for systems focusing on specific settings, e.g. instrument-specific systems tend to yield improved results over instrument-agnostic methods. Similarly, higher accuracy can be obtained when only estimating frame-wise $f_0$ values and neglecting the harder note event detection. Despite their high accuracy, such specialized systems often cannot be deployed in the real-world. Storage and network constraints prohibit the use of multiple specialized models, while memory and run-time constraints limit their complexity. In this paper, we propose a lightweight neural network for musical instrument transcription, which supports polyphonic outputs and generalizes to a wide variety of instruments (including vocals). Our model is trained to jointly predict frame-wise onsets, multipitch and note activations, and we experimentally show that this multi-output structure improves the resulting frame-level note accuracy. Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems. With this work we hope to encourage the community to further investigate low-resource, instrument-agnostic AMT systems.

Results

TaskDatasetMetricValueModel
Music TranscriptionSlakh2100note-level F-measure-no-offset (Fno)0.43Basic Pitch

Related Papers

Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription2025-06-17Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System2025-05-22Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio2025-05-19Automatic Music Transcription using Convolutional Neural Networks and Constant-Q transform2025-05-07Music Tempo Estimation on Solo Instrumental Performance2025-04-25Scalable Approximate Algorithms for Optimal Transport Linear Models2025-04-06Multi-task learning-based temporal pattern matching network for guitar tablature transcription2025-04-03D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription2025-01-09