Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

GLIP-T

Reported on 37 benchmarks across 6 tasks · 2 papers · 37 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Methodology · 28 results

  • 3D on ODinW Full-shot 35 Tasks
    AP · uses extra data · 2022-04-19
    62.6
    best: 72.4 (Grounding DINO 1.5 Pro)
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 3D on ELEVATER
    AP · 2022-04-19
    62.6
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 2D Classification on ODinW Full-shot 35 Tasks
    AP · uses extra data · 2022-04-19
    62.6
    best: 72.4 (Grounding DINO 1.5 Pro)
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 2D Classification on ELEVATER
    AP · 2022-04-19
    62.6
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 2D Object Detection on ODinW Full-shot 35 Tasks
    AP · uses extra data · 2022-04-19
    62.6
    best: 72.4 (Grounding DINO 1.5 Pro)
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 2D Object Detection on ELEVATER
    AP · 2022-04-19
    62.6
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 16k on ODinW Full-shot 35 Tasks
    AP · uses extra data · 2022-04-19
    62.6
    best: 72.4 (Grounding DINO 1.5 Pro)
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 16k on ELEVATER
    AP · 2022-04-19
    62.6
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • 3D on ODinW-35
    Average Score · 2021-12-07
    38.9
    best: 54.7 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 3D on ODinW-13
    Average Score · 2021-12-07
    50.7
    best: 66.3 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 3D on Description Detection Dataset
    Intra-scenario ABS mAP · 2021-12-07
    21.5
    best: 26 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 3D on Description Detection Dataset
    Intra-scenario FULL mAP · 2021-12-07
    19.1
    best: 22.9 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 3D on Description Detection Dataset
    Intra-scenario PRES mAP · 2021-12-07
    18.3
    best: 23.7 (OFA-DOD-base)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Classification on ODinW-35
    Average Score · 2021-12-07
    38.9
    best: 54.7 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Classification on ODinW-13
    Average Score · 2021-12-07
    50.7
    best: 66.3 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Classification on Description Detection Dataset
    Intra-scenario ABS mAP · 2021-12-07
    21.5
    best: 26 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Classification on Description Detection Dataset
    Intra-scenario FULL mAP · 2021-12-07
    19.1
    best: 22.9 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Classification on Description Detection Dataset
    Intra-scenario PRES mAP · 2021-12-07
    18.3
    best: 23.7 (OFA-DOD-base)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Object Detection on ODinW-35
    Average Score · 2021-12-07
    38.9
    best: 54.7 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Object Detection on ODinW-13
    Average Score · 2021-12-07
    50.7
    best: 66.3 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Object Detection on Description Detection Dataset
    Intra-scenario ABS mAP · 2021-12-07
    21.5
    best: 26 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Object Detection on Description Detection Dataset
    Intra-scenario FULL mAP · 2021-12-07
    19.1
    best: 22.9 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 2D Object Detection on Description Detection Dataset
    Intra-scenario PRES mAP · 2021-12-07
    18.3
    best: 23.7 (OFA-DOD-base)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 16k on ODinW-35
    Average Score · 2021-12-07
    38.9
    best: 54.7 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 16k on ODinW-13
    Average Score · 2021-12-07
    50.7
    best: 66.3 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 16k on Description Detection Dataset
    Intra-scenario ABS mAP · 2021-12-07
    21.5
    best: 26 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 16k on Description Detection Dataset
    Intra-scenario FULL mAP · 2021-12-07
    19.1
    best: 22.9 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • 16k on Description Detection Dataset
    Intra-scenario PRES mAP · 2021-12-07
    18.3
    best: 23.7 (OFA-DOD-base)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857

Computer Vision · 9 results

  • Object Detection on ODinW Full-shot 35 Tasks
    AP · uses extra data · 2022-04-19
    62.6
    best: 72.4 (Grounding DINO 1.5 Pro)
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • Object Detection on ELEVATER
    AP · 2022-04-19
    62.6
    SOTA
    ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models · arXiv:2204.08790
  • Object Detection on ODinW-35
    Average Score · 2021-12-07
    38.9
    best: 54.7 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • Object Detection on ODinW-13
    Average Score · 2021-12-07
    50.7
    best: 66.3 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • Object Detection on Description Detection Dataset
    Intra-scenario ABS mAP · 2021-12-07
    21.5
    best: 26 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • Object Detection on Description Detection Dataset
    Intra-scenario FULL mAP · 2021-12-07
    19.1
    best: 22.9 (MM-Grounding-DINO)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • Object Detection on Description Detection Dataset
    Intra-scenario PRES mAP · 2021-12-07
    18.3
    best: 23.7 (OFA-DOD-base)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • Few-Shot Object Detection on ODinW-35
    Average Score · 2021-12-07
    38.9
    best: 54.7 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857
  • Few-Shot Object Detection on ODinW-13
    Average Score · 2021-12-07
    50.7
    best: 66.3 (Grounding DINO 1.5 Pro)
    SOTA
    Grounded Language-Image Pre-training · arXiv:2112.03857