Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/TBN

TBN

Reported on 9 benchmarks across 5 tasks · 3 papers · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Robots4 results

Activity RecognitiononEPIC-KITCHENS-55
Actions Top-1 (S1)· 2019-08-22
34.8
best: 35.8 (DEEP-HAL with ODF+SDF (AssembleNet++))
SOTA
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition arXiv:1908.08498
Activity RecognitiononEPIC-KITCHENS-100 (test)
recall@5· 2021-07-18
11
best: 23.75 (InAViT)
Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos arXiv:2107.09504
Activity RecognitiononEPIC-KITCHENS-100
Action@1· 2020-06-23
35.55
best: 58.3 (LLaVAction)
Rescaling Egocentric Vision arXiv:2006.13256
Activity RecognitiononEPIC-KITCHENS-55
Actions Top-1 (S2)· 2019-08-22
19.06
best: 27.3 (DEEP-HAL with ODF+SDF (AssembleNet++))
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition arXiv:1908.08498

Time Series2 results

Action RecognitiononEPIC-KITCHENS-100 (test)
recall@5· 2021-07-18
11
best: 23.75 (InAViT)
Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos arXiv:2107.09504
Action RecognitiononEPIC-KITCHENS-100
Action@1· 2020-06-23
35.55
best: 58.3 (LLaVAction)
Rescaling Egocentric Vision arXiv:2006.13256

Computer Vision2 results

Action AnticipationonEPIC-KITCHENS-100 (test)
recall@5· 2021-07-18
11
best: 23.75 (InAViT)
Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos arXiv:2107.09504
Action Recognition In VideosonEPIC-KITCHENS-100 (test)
recall@5· 2021-07-18
11
best: 23.75 (InAViT)
Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos arXiv:2107.09504

Knowledge Base1 result

2D Human Pose EstimationonEPIC-KITCHENS-100 (test)
recall@5· 2021-07-18
11
best: 23.75 (InAViT)
Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos arXiv:2107.09504