TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/TBN

TBN

Reported on 9 benchmarks across 5 tasks · 3 papers · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Robots4 results

  • Activity RecognitiononEPIC-KITCHENS-55
    Actions Top-1 (S1)· 2019-08-22
    34.8
    best: 35.8 (DEEP-HAL with ODF+SDF (AssembleNet++))
    SOTA
    EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action RecognitionarXiv:1908.08498
  • Activity RecognitiononEPIC-KITCHENS-100 (test)
    recall@5· 2021-07-18
    11
    best: 23.75 (InAViT)
    Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric VideosarXiv:2107.09504
  • Activity RecognitiononEPIC-KITCHENS-100
    Action@1· 2020-06-23
    35.55
    best: 58.3 (LLaVAction)
    Rescaling Egocentric VisionarXiv:2006.13256
  • Activity RecognitiononEPIC-KITCHENS-55
    Actions Top-1 (S2)· 2019-08-22
    19.06
    best: 27.3 (DEEP-HAL with ODF+SDF (AssembleNet++))
    EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action RecognitionarXiv:1908.08498

Time Series2 results

  • Action RecognitiononEPIC-KITCHENS-100 (test)
    recall@5· 2021-07-18
    11
    best: 23.75 (InAViT)
    Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric VideosarXiv:2107.09504
  • Action RecognitiononEPIC-KITCHENS-100
    Action@1· 2020-06-23
    35.55
    best: 58.3 (LLaVAction)
    Rescaling Egocentric VisionarXiv:2006.13256

Computer Vision2 results

  • Action AnticipationonEPIC-KITCHENS-100 (test)
    recall@5· 2021-07-18
    11
    best: 23.75 (InAViT)
    Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric VideosarXiv:2107.09504
  • Action Recognition In VideosonEPIC-KITCHENS-100 (test)
    recall@5· 2021-07-18
    11
    best: 23.75 (InAViT)
    Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric VideosarXiv:2107.09504

Knowledge Base1 result

  • 2D Human Pose EstimationonEPIC-KITCHENS-100 (test)
    recall@5· 2021-07-18
    11
    best: 23.75 (InAViT)
    Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric VideosarXiv:2107.09504