DadaGP
Midi | Custom (research-only) | Introduced 2021-07-30
DadaGP is a symbolic music dataset comprising 26,181 song scores in the GuitarPro format, covering 739 musical genres, along with an accompanying tokenized format well suited to generative sequence models such as the Transformer. The tokenized format is inspired by event-based MIDI encodings, which are often used in symbolic music generation models. The dataset is released with an encoder/decoder that converts GuitarPro files to tokens and back.
Description from: DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models
Image source: https://arxiv.org/pdf/2107.14653v1.pdf
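To make the idea of an event-based token encoding concrete, the following is a minimal sketch of a round-trip encoder/decoder in Python. It is not the DadaGP encoder and does not read GuitarPro files. The token names `note:<pitch>` and `wait:<ticks>`, the `(onset_tick, midi_pitch)` note representation, and the `encode`/`decode` function names are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical note representation: (onset in ticks, MIDI pitch number).
Note = Tuple[int, int]


def encode(notes: List[Note]) -> List[str]:
    """Turn notes into a flat list of event tokens.

    Simultaneous notes are emitted back to back; a "wait" token
    advances the clock to the next onset.
    """
    tokens: List[str] = []
    clock = 0
    for onset, pitch in sorted(notes):
        if onset > clock:
            tokens.append(f"wait:{onset - clock}")
            clock = onset
        tokens.append(f"note:{pitch}")
    return tokens


def decode(tokens: List[str]) -> List[Note]:
    """Rebuild (onset, pitch) pairs by replaying the event tokens."""
    notes: List[Note] = []
    clock = 0
    for token in tokens:
        kind, value = token.split(":")
        if kind == "wait":
            clock += int(value)
        elif kind == "note":
            notes.append((clock, int(value)))
        else:
            raise ValueError(f"unknown token: {token}")
    return notes


# Two notes struck together, then a third one 480 ticks later.
example = [(0, 40), (0, 47), (480, 52)]
example_tokens = encode(example)
print(example_tokens)
print(decode(example_tokens) == sorted(example))
```

Because time is carried by explicit `wait` tokens rather than by absolute timestamps, the output is a single sequence of discrete symbols, which is the shape of input that sequence models such as the Transformer expect.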