Human-ChatGPT texts

TextsCC BY

A dataset including texts by humans (labeled 0) and then rephrased by ChatGPT (labeled 1), created to train models for machine-generated text detection.

It is a robust dataset - includes text of various lengths, and the human texts are taken from multiple sources.