Metric: F1-score (Augmented) (higher is better)
| # | Model↕ | F1-score (Augmented)▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | CLIP-It | 69 | No | CLIP-It! Language-Guided Video Summarization | 2021-07-01 | Code |
| 2 | iPTNet | 64.2 | No | - | - | - |
| 3 | DSNet | 63.9 | Yes | - | - | Code |
| 4 | DSNet | 63.9 | No | - | - | Code |
| 5 | RR-STG | 63.6 | Yes | - | - | - |
| 6 | RR-STG | 63.6 | No | - | - | - |
| 7 | VASNet | 62.37 | Yes | Summarizing Videos with Attention | 2018-12-05 | Code |
| 8 | VJMHT | 61.9 | No | Video Joint Modelling Based on Hierarchical Tran... | 2021-12-27 | Code |
| 9 | M-AVS | 61.8 | Yes | Video Summarization with Attention-Based Encoder... | 2017-08-31 | - |
| 10 | SSPVS | 61.8 | No | Progressive Video Summarization via Multimodal S... | 2022-01-07 | Code |
| 11 | HMT | 60.3 | No | Hierarchical Multimodal Transformer to Summarize... | 2021-09-22 | - |
| 12 | DR-DSN | 59.8 | No | Deep Reinforcement Learning for Unsupervised Vid... | 2017-12-29 | Code |
| 13 | CSNet | 57.1 | No | Discriminative Feature Learning for Unsupervised... | 2018-11-24 | Code |