Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Video
/
MiT
Video on MiT
Metric: Top 1 Accuracy (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
#
Model
↕
Top 1 Accuracy
▼
Extra Data
Paper
Date
↕
Code
1
OmniVec2
53.1
Yes
-
-
-
2
InternVideo2-1B
50.9
Yes
InternVideo2: Scaling Foundation Models for Mult...
2024-03-22
Code
3
UMT-L (ViT-L/16)
48.7
Yes
Unmasked Teacher: Towards Training-Efficient Vid...
2023-03-28
Code
4
UniFormerV2-L
47.8
Yes
-
-
Code
5
MTV-H (WTS 60M)
47.2
Yes
Multiview Transformers for Video Recognition
2022-01-12
Code
6
CoVeR(JFT-3B)
46.1
Yes
Co-training Transformer with Videos and Images I...
2021-12-14
-
7
CoVeR(JFT-300M)
45
Yes
Co-training Transformer with Videos and Images I...
2021-12-14
-
8
VATT-Large
41.1
Yes
VATT: Transformers for Multimodal Self-Supervise...
2021-04-22
Code
9
MoViNet-A6
40.2
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code
10
MoViNet-A5
39.1
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code
11
MoViNet-A4
37.9
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code
12
VTN
37.4
Yes
Video Transformer Network
2021-02-01
Code
13
MBT (AV)
37.3
No
Attention Bottlenecks for Multimodal Fusion
2021-06-30
Code
14
MoViNet-A3
35.6
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code
15
MoViNet-A2
34.3
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code
16
SRTG r3d-101
33.56
No
Learn to cycle: Time-consistent feature discover...
2020-06-15
Code
17
MoViNet-A1
32
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code
18
SRTG r(2+1)d-50
31.6
No
Learn to cycle: Time-consistent feature discover...
2020-06-15
Code
19
SRTG r3d-50
30.72
No
Learn to cycle: Time-consistent feature discover...
2020-06-15
Code
20
SRTG r(2+1)d-34
28.97
No
Learn to cycle: Time-consistent feature discover...
2020-06-15
Code
21
SRTG r3d-34
28.55
No
Learn to cycle: Time-consistent feature discover...
2020-06-15
Code
22
TRN-Multiscale
28.27
No
Moments in Time Dataset: one million videos for ...
2018-01-09
Code
23
MoViNet-A0
27.5
No
MoViNets: Mobile Video Networks for Efficient Vi...
2021-03-21
Code