KPBiomed
CC-BY-NC v4.0
A large-scale dataset of scientific records from PubMed for scientific keyphrase generation. The dataset comes in three sizes: 500k, 2 million, and 5.6 million documents.
Each document contains a title, an abstract, and author keyphrases; some documents also have MeSH terms assigned by professional indexers.
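
As a rough illustration of the record layout described above, a single document could be modelled as in the sketch below. The class name, field names, and example values are assumptions made for illustration only; they are not taken from the dataset files, whose actual keys and file format may differ.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class KPBiomedRecord:
    """Hypothetical in-memory view of one document (field names are assumed)."""

    title: str
    abstract: str
    keyphrases: List[str]                    # author-assigned keyphrases
    mesh_terms: Optional[List[str]] = None   # present only for some documents

    def source_text(self) -> str:
        # Join title and abstract into one input string for a generation model.
        return f"{self.title} {self.abstract}"

    def has_mesh(self) -> bool:
        # True only when at least one MeSH term is attached.
        return bool(self.mesh_terms)


# Made-up example values, not an actual record from the dataset.
example = KPBiomedRecord(
    title="A hypothetical title",
    abstract="A hypothetical abstract.",
    keyphrases=["hypothetical keyphrase"],
)
```

Keeping the MeSH terms optional mirrors the fact that only some documents carry them, so downstream code can check `has_mesh()` before relying on that field.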