Binary text classification on TURINGBENCH (Turing Test, GPT-3)

Metric: F1 score (higher is better)
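
For reference, a minimal sketch of how an F1 score for a binary task can be computed from true and predicted labels. The function name `f1_score`, the label encoding (1 for the positive class), and the toy labels are illustrative assumptions. This page does not state which class is treated as positive or whether the reported score is averaged across classes.

```python
def f1_score(y_true, y_pred, positive=1):
    """Binary F1: harmonic mean of precision and recall for the `positive` label.

    Assumption: labels are encoded as 0/1 with 1 as the positive class;
    the leaderboard's own convention is not given in the source.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # With no true positives, precision and recall are zero (or undefined);
    # return 0.0 by convention.
    if tp == 0:
        return 0.0

    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels, not benchmark data.
y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0]
score = f1_score(y_true, y_pred)
```

In this toy case there are 2 true positives, 1 false positive, and 1 false negative, so precision and recall are both 2/3 and the F1 score is 2/3.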
