Metric: Spearman's Rho (higher is better)
| # | Model↕ | Spearman's Rho▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | CSTA | 0.274 | No | CSTA: CNN-based Spatiotemporal Attention for Vid... | 2024-05-20 | Code |
| 2 | CSTA | 0.274 | No | CSTA: CNN-based Spatiotemporal Attention for Vid... | 2024-05-20 | Code |
| 3 | SSPVS(+Text) | 0.257 | No | Progressive Video Summarization via Multimodal S... | 2022-01-07 | Code |
| 4 | SSPVS | 0.24 | No | Progressive Video Summarization via Multimodal S... | 2022-01-07 | Code |
| 5 | RR-STG | 0.234 | No | - | - | - |
| 6 | MSVA | 0.23 | No | Supervised Video Summarization via Multiple Feat... | 2021-04-23 | Code |
| 7 | VASNet [DBLP:conf/accv/FajtlSAMR18] | 0.17 | No | Supervised Video Summarization via Multiple Feat... | 2021-04-23 | Code |
| 8 | A2Summ | 0.129 | No | Align and Attend: Multimodal Summarization with ... | 2023-03-13 | Code |
| 9 | iPTNet | 0.119 | Yes | - | - | - |
| 10 | VJMHT | 0.108 | No | Video Joint Modelling Based on Hierarchical Tran... | 2021-12-27 | Code |
| 11 | DMASum | 0.089 | No | Query Twice: Dual Mixture Attention Meta Learnin... | 2020-08-19 | - |
| 12 | HMT | 0.08 | No | Hierarchical Multimodal Transformer to Summarize... | 2021-09-22 | - |