Natural Language Inference on SICK

Metric: 1:1 Accuracy (higher is better)
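
As a rough illustration of the metric, the sketch below computes plain classification accuracy over the three SICK inference labels (ENTAILMENT, CONTRADICTION, NEUTRAL). It assumes accuracy means the fraction of sentence pairs whose predicted label matches the gold label; the page does not define the "1:1" qualifier, so that reading is an assumption. The function name `accuracy` and the toy labels are hypothetical, not taken from the leaderboard.

```python
from typing import Sequence


def accuracy(gold: Sequence[str], predicted: Sequence[str]) -> float:
    """Fraction of sentence pairs whose predicted label equals the gold label."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("cannot compute accuracy on an empty set")
    # Count exact label matches, pair by pair.
    correct = sum(g == p for g, p in zip(gold, predicted))
    return correct / len(gold)


# Hypothetical toy labels, not real SICK data or model output.
gold = ["ENTAILMENT", "NEUTRAL", "CONTRADICTION", "NEUTRAL"]
predicted = ["ENTAILMENT", "NEUTRAL", "NEUTRAL", "NEUTRAL"]

score = accuracy(gold, predicted)
print(f"{score:.2%}")
```

Three of the four toy predictions match, so a higher score here simply means more pairs labeled correctly.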
