TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/ViDT Swin-base

ViDT Swin-base

Reported on 30 benchmarks across 5 tasks · 1 paper · 25 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Methodology24 results

  • 3DonCOCO 2017 val
    AP· 2021-10-08
    49.2
    best: 72.2 (LOGO-CAP (Ours) HRNet-W48)
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 3DonCOCO 2017 val
    AP50· 2021-10-08
    69.4
    best: 79 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 3DonCOCO 2017 val
    APL· 2021-10-08
    66.9
    best: 75.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 3DonCOCO 2017 val
    APM· 2021-10-08
    52.6
    best: 65.6 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 3DonCOCO 2017 val
    APS· 2021-10-08
    30.6
    best: 47.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D ClassificationonCOCO 2017 val
    AP· 2021-10-08
    49.2
    best: 61.8 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D ClassificationonCOCO 2017 val
    AP50· 2021-10-08
    69.4
    best: 79 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D ClassificationonCOCO 2017 val
    APL· 2021-10-08
    66.9
    best: 75.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D ClassificationonCOCO 2017 val
    APM· 2021-10-08
    52.6
    best: 65.6 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D ClassificationonCOCO 2017 val
    APS· 2021-10-08
    30.6
    best: 47.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D Object DetectiononCOCO 2017 val
    AP· 2021-10-08
    49.2
    best: 61.8 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D Object DetectiononCOCO 2017 val
    AP50· 2021-10-08
    69.4
    best: 79 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D Object DetectiononCOCO 2017 val
    APL· 2021-10-08
    66.9
    best: 75.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D Object DetectiononCOCO 2017 val
    APM· 2021-10-08
    52.6
    best: 65.6 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D Object DetectiononCOCO 2017 val
    APS· 2021-10-08
    30.6
    best: 47.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 16konCOCO 2017 val
    AP· 2021-10-08
    49.2
    best: 61.8 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 16konCOCO 2017 val
    AP50· 2021-10-08
    69.4
    best: 79 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 16konCOCO 2017 val
    APL· 2021-10-08
    66.9
    best: 75.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 16konCOCO 2017 val
    APM· 2021-10-08
    52.6
    best: 65.6 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 16konCOCO 2017 val
    APS· 2021-10-08
    30.6
    best: 47.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 3DonCOCO 2017 val
    AP75· 2021-10-08
    53.1
    best: 67.6 (Mr. DETR (Swin-L, 1x, 5cale))
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D ClassificationonCOCO 2017 val
    AP75· 2021-10-08
    53.1
    best: 67.6 (Mr. DETR (Swin-L, 1x, 5cale))
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 2D Object DetectiononCOCO 2017 val
    AP75· 2021-10-08
    53.1
    best: 67.6 (Mr. DETR (Swin-L, 1x, 5cale))
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • 16konCOCO 2017 val
    AP75· 2021-10-08
    53.1
    best: 67.6 (Mr. DETR (Swin-L, 1x, 5cale))
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921

Computer Vision6 results

  • Object DetectiononCOCO 2017 val
    AP· 2021-10-08
    49.2
    best: 61.8 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • Object DetectiononCOCO 2017 val
    AP50· 2021-10-08
    69.4
    best: 79 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • Object DetectiononCOCO 2017 val
    APL· 2021-10-08
    66.9
    best: 75.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • Object DetectiononCOCO 2017 val
    APM· 2021-10-08
    52.6
    best: 65.6 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • Object DetectiononCOCO 2017 val
    APS· 2021-10-08
    30.6
    best: 47.7 (Mr. DETR (Swin-L, 1x, 5cale))
    SOTA
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921
  • Object DetectiononCOCO 2017 val
    AP75· 2021-10-08
    53.1
    best: 67.6 (Mr. DETR (Swin-L, 1x, 5cale))
    ViDT: An Efficient and Effective Fully Transformer-based Object DetectorarXiv:2110.03921