ExampleStack
CC BY 4.0Introduced 2019-05-28
This is a dataset of code snippets in StackOverflow that have been used in Github repositories by extending and adapting them. The dataset links SO posts to GitHub counterparts based on clone detection, time stamp analysis, and explicit URL references.
The authors qualitatively inspected 400 SO examples and their GitHub counterparts and develop a taxonomy of 24 adaptation types. Using this taxonomy, an automated adaptation analysis technique on top of GumTree is built to classify the entire dataset into these types.