TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/MiT

Video on MiT

Metric: Top 1 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top 1 Accuracy▼Extra DataPaperDate↕Code
1OmniVec253.1Yes---
2InternVideo2-1B50.9YesInternVideo2: Scaling Foundation Models for Mult...2024-03-22Code
3UMT-L (ViT-L/16)48.7YesUnmasked Teacher: Towards Training-Efficient Vid...2023-03-28Code
4UniFormerV2-L47.8Yes--Code
5MTV-H (WTS 60M)47.2YesMultiview Transformers for Video Recognition2022-01-12Code
6CoVeR(JFT-3B)46.1YesCo-training Transformer with Videos and Images I...2021-12-14-
7CoVeR(JFT-300M)45YesCo-training Transformer with Videos and Images I...2021-12-14-
8VATT-Large41.1YesVATT: Transformers for Multimodal Self-Supervise...2021-04-22Code
9MoViNet-A640.2NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code
10MoViNet-A539.1NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code
11MoViNet-A437.9NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code
12VTN37.4YesVideo Transformer Network2021-02-01Code
13MBT (AV)37.3NoAttention Bottlenecks for Multimodal Fusion2021-06-30Code
14MoViNet-A335.6NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code
15MoViNet-A234.3NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code
16SRTG r3d-10133.56NoLearn to cycle: Time-consistent feature discover...2020-06-15Code
17MoViNet-A132NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code
18SRTG r(2+1)d-5031.6NoLearn to cycle: Time-consistent feature discover...2020-06-15Code
19SRTG r3d-5030.72NoLearn to cycle: Time-consistent feature discover...2020-06-15Code
20SRTG r(2+1)d-3428.97NoLearn to cycle: Time-consistent feature discover...2020-06-15Code
21SRTG r3d-3428.55NoLearn to cycle: Time-consistent feature discover...2020-06-15Code
22TRN-Multiscale28.27NoMoments in Time Dataset: one million videos for ...2018-01-09Code
23MoViNet-A027.5NoMoViNets: Mobile Video Networks for Efficient Vi...2021-03-21Code