Metric: Top-1 Verb (higher is better)
| # | Model↕ | Top-1 Verb▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Audiovisual Masked Autoencoder (Audiovisual, Single) | 71.4 | Yes | Audiovisual Masked Autoencoders | 2022-12-09 | Code |
| 2 | Audiovisual Masked Autoencoder (Video-only, Single) | 70.8 | Yes | Audiovisual Masked Autoencoders | 2022-12-09 | Code |
| 3 | Audiovisual Masked Autoencoder (Audio-only, Single) | 52.7 | Yes | Audiovisual Masked Autoencoders | 2022-12-09 | Code |
| 4 | PlayItBackX3 | 47 | No | Play It Back: Iterative Attention for Audio Reco... | 2022-10-20 | Code |