Natural Language Inference on CICERO

Metric: ROUGE (higher is better)

LeaderboardDataset
Loading chart...