GIS
Github Issue Similarity
CC BY-SAIntroduced 2023-09-22
This dataset can be used for semantic textual similarity tasks. It consists of duplicate and non-duplicate Github issues. It has 18565, 1547, and 1548 samples for train, validation, and test set, respectively.