GIS

Github Issue Similarity

CC BY-SAIntroduced 2023-09-22

This dataset can be used for semantic textual similarity tasks. It consists of duplicate and non-duplicate Github issues. It has 18565, 1547, and 1548 samples for train, validation, and test set, respectively.