Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

ViTDet-H

Reported on 7 benchmarks across 6 tasks · 1 paper · 6 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
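The note above can be illustrated with a minimal sketch of exact-name matching. It is an assumption about what "matched by exact model name" means (plain string equality), not the site's actual implementation. The record layout, the field names, and the function `group_by_exact_name` are hypothetical. The lowercase `vitdet-h` entry is a made-up placeholder that shows how a differently spelled name lands in a separate group.

```python
from collections import defaultdict

# Hypothetical result records. The field names are assumptions for illustration.
# The first two values come from this page; the third record is a placeholder.
results = [
    {"model": "ViTDet-H", "task": "Object Detection", "metric": "box AP", "value": 53.4},
    {"model": "ViTDet-H", "task": "Instance Segmentation", "metric": "mask AP", "value": 48.1},
    {"model": "vitdet-h", "task": "Object Detection", "metric": "box AP", "value": 0.0},
]


def group_by_exact_name(records):
    """Group records by model name using plain string equality (case-sensitive)."""
    grouped = defaultdict(list)
    for record in records:
        grouped[record["model"]].append(record)
    return dict(grouped)


grouped = group_by_exact_name(results)
for name, rows in grouped.items():
    print(name, len(rows))
```

Under this assumption, two papers that reuse the name "ViTDet-H" for different variants would be merged into one group, while a spelling or casing difference would split one model across two groups.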

Methodology: 4 results

3D on LVIS v1.0 val
box AP · 2022-03-30
53.4
best: 68 (Co-DETR (single-scale))
SOTA
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)

2D Classification on LVIS v1.0 val
box AP · 2022-03-30
53.4
best: 68 (Co-DETR (single-scale))
SOTA
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)

2D Object Detection on LVIS v1.0 val
box AP · 2022-03-30
53.4
best: 68 (Co-DETR (single-scale))
SOTA
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)

16k on LVIS v1.0 val
box AP · 2022-03-30
53.4
best: 68 (Co-DETR (single-scale))
SOTA
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)

Computer Vision: 3 results

Object Detection on LVIS v1.0 val
box AP · 2022-03-30
53.4
best: 68 (Co-DETR (single-scale))
SOTA
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)

Instance Segmentation on LVIS v1.0 val
mask APr · 2022-03-30
36.9
best: 45.8 (DiverGen (Swin-L))
SOTA
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)

Instance Segmentation on LVIS v1.0 val
mask AP · 2022-03-30
48.1
best: 60.7 (Co-DETR (single-scale))
Exploring Plain Vision Transformer Backbones for Object Detection (arXiv:2203.16527)