Probing Language Models on KAMEL
Metric: Average F1 (higher is better)
Results

| # | Model | Average F1 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | OPT-13b | 17.62 | No | - | - | Code |