Boundary Detection on RoFT-chatgpt

Metric: MSE (lower is better)
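
As a rough illustration of the metric, the sketch below computes mean squared error, assuming each example has one true boundary position (for instance, the index of the sentence where the text switches from human-written to generated) and one predicted position. The function name `boundary_mse` and the sample values are hypothetical and are not taken from the leaderboard.

```python
from typing import Sequence


def boundary_mse(true_idx: Sequence[float], pred_idx: Sequence[float]) -> float:
    """Mean squared error between true and predicted boundary positions.

    Assumption: one boundary position per example; lower is better.
    """
    if len(true_idx) != len(pred_idx):
        raise ValueError("true_idx and pred_idx must have the same length")
    if len(true_idx) == 0:
        raise ValueError("at least one example is required")
    squared_errors = [(t - p) ** 2 for t, p in zip(true_idx, pred_idx)]
    return sum(squared_errors) / len(squared_errors)


# Hypothetical values, for illustration only.
true_boundaries = [3, 5, 0, 9]
predicted_boundaries = [4, 5, 2, 7]
print(boundary_mse(true_boundaries, predicted_boundaries))  # squared errors 1, 0, 4, 4 -> 2.25
```

Because the errors are squared, a prediction that misses the boundary by several positions is penalized much more heavily than several near misses.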
