TopDown-AlignedAtt (1NN)
Reported on 3 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Audio: 3 results
- CIDEr: 0.593 (best: 50.2, Audio Flamingo)
- SPICE: 0.144 (best: 15.1, Audio Flamingo)
- SPIDEr: 0.369 (best: 32.6, Audio Flamingo)