Audio captioning on AudioCaps

Metric: #params (M) (higher is better)
