DadaGP

MidiCustom (research-only)Introduced 2021-07-30

DadaGP is a new symbolic music dataset comprising 26,181 song scores in the GuitarPro format covering 739 musical genres, along with an accompanying tokenized format well-suited for generative sequence models such as the Transformer. The tokenized format is inspired by event-based MIDI encodings, often used in symbolic music generation models. The dataset is released with an encoder/decoder which converts GuitarPro files to tokens and back.

Description from: DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models

Image source: https://arxiv.org/pdf/2107.14653v1.pdf