TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/TempAgg

TempAgg

Reported on 31 benchmarks across 5 tasks · 2 papers · 15 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision10 results

  • Action AnticipationonAssembly101
    Actions Recall@5· 2020-06-01
    8.53
    best: 12.07 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action AnticipationonAssembly101
    Objects Recall@5· 2020-06-01
    26.27
    best: 28.38 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action AnticipationonAssembly101
    Verbs Recall@5· 2020-06-01
    59.11
    best: 60.04 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action Recognition In VideosonAssembly101
    Actions Recall@5· 2020-06-01
    8.53
    best: 12.07 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action Recognition In VideosonAssembly101
    Objects Recall@5· 2020-06-01
    26.27
    best: 28.38 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action Recognition In VideosonAssembly101
    Verbs Recall@5· 2020-06-01
    59.11
    best: 60.04 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action AnticipationonEPIC-KITCHENS-100 (test)
    recall@5· 2021-06-06
    12.6
    best: 23.75 (InAViT)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action AnticipationonEPIC-KITCHENS-100
    Recall@5· 2021-06-06
    14.73
    best: 27.6 (PlausiVL)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action Recognition In VideosonEPIC-KITCHENS-100 (test)
    recall@5· 2021-06-06
    12.6
    best: 23.75 (InAViT)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action Recognition In VideosonEPIC-KITCHENS-100
    Recall@5· 2021-06-06
    14.73
    best: 27.6 (PlausiVL)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152

Robots8 results

  • Activity RecognitiononAssembly101
    Actions Recall@5· 2020-06-01
    8.53
    best: 12.07 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Activity RecognitiononAssembly101
    Objects Recall@5· 2020-06-01
    26.27
    best: 28.38 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Activity RecognitiononAssembly101
    Verbs Recall@5· 2020-06-01
    59.11
    best: 60.04 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Activity RecognitiononEPIC-KITCHENS-100
    Action@1· 2021-06-06
    45.26
    best: 58.3 (LLaVAction)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Activity RecognitiononEPIC-KITCHENS-100
    Noun@1· 2021-06-06
    53.35
    best: 69 (LLaVAction)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Activity RecognitiononEPIC-KITCHENS-100
    Verb@1· 2021-06-06
    66
    best: 76.2 (TIM)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Activity RecognitiononEPIC-KITCHENS-100 (test)
    recall@5· 2021-06-06
    12.6
    best: 23.75 (InAViT)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Activity RecognitiononEPIC-KITCHENS-100
    Recall@5· 2021-06-06
    14.73
    best: 27.6 (PlausiVL)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152

Time Series8 results

  • Action RecognitiononAssembly101
    Actions Recall@5· 2020-06-01
    8.53
    best: 12.07 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action RecognitiononAssembly101
    Objects Recall@5· 2020-06-01
    26.27
    best: 28.38 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action RecognitiononAssembly101
    Verbs Recall@5· 2020-06-01
    59.11
    best: 60.04 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • Action RecognitiononEPIC-KITCHENS-100
    Action@1· 2021-06-06
    45.26
    best: 58.3 (LLaVAction)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action RecognitiononEPIC-KITCHENS-100
    Noun@1· 2021-06-06
    53.35
    best: 69 (LLaVAction)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action RecognitiononEPIC-KITCHENS-100
    Verb@1· 2021-06-06
    66
    best: 76.2 (TIM)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action RecognitiononEPIC-KITCHENS-100 (test)
    recall@5· 2021-06-06
    12.6
    best: 23.75 (InAViT)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • Action RecognitiononEPIC-KITCHENS-100
    Recall@5· 2021-06-06
    14.73
    best: 27.6 (PlausiVL)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152

Knowledge Base5 results

  • 2D Human Pose EstimationonAssembly101
    Actions Recall@5· 2020-06-01
    8.53
    best: 12.07 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • 2D Human Pose EstimationonAssembly101
    Objects Recall@5· 2020-06-01
    26.27
    best: 28.38 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • 2D Human Pose EstimationonAssembly101
    Verbs Recall@5· 2020-06-01
    59.11
    best: 60.04 (Goal Consistency)
    SOTA
    Temporal Aggregate Representations for Long-Range Video UnderstandingarXiv:2006.00830
  • 2D Human Pose EstimationonEPIC-KITCHENS-100 (test)
    recall@5· 2021-06-06
    12.6
    best: 23.75 (InAViT)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152
  • 2D Human Pose EstimationonEPIC-KITCHENS-100
    Recall@5· 2021-06-06
    14.73
    best: 27.6 (PlausiVL)
    Technical Report: Temporal Aggregate RepresentationsarXiv:2106.03152