KP20k
TextsUnknownIntroduced 2017-01-01
KP20k is a large-scale scholarly articles dataset with 528K articles for training, 20K articles for validation and 20K articles for testing.
Source: Keyphrase Prediction With Pre-trained Language Model Image Source: https://arxiv.org/pdf/1704.06879.pdf