作者给的test文件
Reported on 8 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing8 results
- 81.64best: 88.86 (GIT2, Single Model)
- 63.79best: 76.1 (GIT, Single Model)
- 43.43best: 60.53 (GIT, Single Model)
- 25.15best: 41.65 (GIT, Single Model)
- 85.81best: 149.1 (PaLI)
- METEOR27.25best: 34.22 (PaLI)
- ROUGE-L55.06best: 64.39 (PaLI)
- 12.35best: 16.36 (GIT2, Single Model)