TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/VLG-Net

VLG-Net

Reported on 30 benchmarks across 1 task · 2 papers · 24 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision30 results

  • VideoonMAD
    R@1,IoU=0.5· 2021-12-01
    1.61
    best: 7.06 (DeCafNet)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@10,IoU=0.3· 2021-12-01
    15.2
    best: 19.86 (VLG-Net + Guidance Model)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@10,IoU=0.5· 2021-12-01
    10.18
    best: 13.72 (VLG-Net + Guidance Model)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@100,IoU=0.1· 2021-12-01
    49.65
    best: 73.62 (DenoiseLoc)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@100,IoU=0.3· 2021-12-01
    43.95
    best: 49.38 (VLG-Net + Guidance Model)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@100,IoU=0.5· 2021-12-01
    34.18
    best: 39.12 (VLG-Net + Guidance Model)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@5,IoU=0.5· 2021-12-01
    6.23
    best: 16.13 (DeCafNet)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@50,IoU=0.1· 2021-12-01
    38.41
    best: 66.07 (DenoiseLoc)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@50,IoU=0.3· 2021-12-01
    33.68
    best: 39.77 (VLG-Net + Guidance Model)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@50,IoU=0.5· 2021-12-01
    25.33
    best: 30.22 (VLG-Net + Guidance Model)
    SOTA
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonTACoS
    R@1,IoU=0.3· 2020-11-19
    45.46
    best: 58.1 (SG-DETR (w/ PT))
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonTACoS
    R@1,IoU=0.5· 2020-11-19
    34.19
    best: 46.79 (DeCafNet)
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonTACoS
    R@5,IoU=0.1· 2020-11-19
    81.8
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonTACoS
    R@5,IoU=0.3· 2020-11-19
    70.38
    best: 71.13 (DeCafNet)
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonTACoS
    R@5,IoU=0.5· 2020-11-19
    56.56
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonActivityNet Captions
    R@1,IoU=0.5· 2020-11-19
    46.32
    best: 60.67 (GVL (paragraph-level))
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonActivityNet Captions
    R@1,IoU=0.7· 2020-11-19
    29.82
    best: 38.55 (GVL (paragraph-level))
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonActivityNet Captions
    R@5,IoU=0.7· 2020-11-19
    63.33
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonDiDeMo
    R@1,IoU=0.5· 2020-11-19
    33.35
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonDiDeMo
    R@1,IoU=0.7· 2020-11-19
    25.57
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonDiDeMo
    R@1,IoU=1.0· 2020-11-19
    25.57
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonDiDeMo
    R@5,IoU=0.5· 2020-11-19
    88.86
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonDiDeMo
    R@5,IoU=0.7· 2020-11-19
    71.72
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonDiDeMo
    R@5,IoU=1.0· 2020-11-19
    71.65
    SOTA
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132
  • VideoonMAD
    R@1,IoU=0.1· 2021-12-01
    3.5
    best: 17.3 (ReVisionLLM)
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@1,IoU=0.3· 2021-12-01
    2.63
    best: 12.7 (ReVisionLLM)
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@10,IoU=0.1· 2021-12-01
    18.32
    best: 41.44 (DenoiseLoc)
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@5,IoU=0.1· 2021-12-01
    11.74
    best: 30.35 (DenoiseLoc)
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonMAD
    R@5,IoU=0.3· 2021-12-01
    9.49
    best: 23.68 (DeCafNet)
    MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsarXiv:2112.00431
  • VideoonActivityNet Captions
    R@5,IoU=0.5· 2020-11-19
    77.15
    best: 81.5 (UnLoc-B)
    VLG-Net: Video-Language Graph Matching Network for Video GroundingarXiv:2011.10132