Binary text classification on TURINGBENCH (Turing Test, GPT-3)
Metric: F1 score (higher is better)
Results
| # | Model | F1 score | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | GigaCheck (Mistral-7B) | 0.9709 | No | GigaCheck: Detecting LLM-generated Content | 2024-10-31 | - |
| 2 | RoBERTa | 0.5209 | No | TURINGBENCH: A Benchmark Environment for Turing ... | 2021-09-27 | Code |
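
For reference, F1 is the harmonic mean of precision and recall. The sketch below shows how a binary F1 score could be computed from predicted labels. It is only an illustration: the label encoding (1 = machine-generated, 0 = human-written), the choice of positive class, and the helper name `f1_score` are assumptions, not details taken from the benchmark or the listed papers, which may use a different averaging scheme.

```python
def f1_score(y_true, y_pred, positive=1):
    """Binary F1 for the class `positive` (assumed here to be 1 = machine-generated)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives
    if tp == 0:
        # No correct positive predictions: precision or recall is zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels, made up for illustration only (not benchmark data).
y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0]
score = f1_score(y_true, y_pred)
print(round(score, 4))
```

With these toy labels there are two true positives, one false positive, and one false negative, so precision and recall are both 2/3 and the F1 score is also 2/3.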