Human-ChatGPT texts
TextsCC BY
A dataset including texts by humans (labeled 0) and then rephrased by ChatGPT (labeled 1), created to train models for machine-generated text detection.
It is a robust dataset - includes text of various lengths, and the human texts are taken from multiple sources.