TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/CVT

CVT

Reported on 13 benchmarks across 4 tasks · 2 papers · 1 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Medical4 results

  • Semantic SegmentationonnuScenes
    IoU veh - 224x480 - No vis filter - 100x100 at 0.5· 2022-05-05
    31.4
    best: 39.9 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • Semantic SegmentationonnuScenes
    IoU veh - 224x480 - Vis filter. - 100x100 at 0.5· 2022-05-05
    36
    best: 44.7 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • Semantic SegmentationonnuScenes
    IoU veh - 448x800 - No vis filter - 100x100 at 0.5· 2022-05-05
    32.5
    best: 43.2 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • Semantic SegmentationonnuScenes
    IoU veh - 448x800 - Vis filter. - 100x100 at 0.5· 2022-05-05
    37.7
    best: 48.7 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833

Audio4 results

  • 10-shot image generationonnuScenes
    IoU veh - 224x480 - No vis filter - 100x100 at 0.5· 2022-05-05
    31.4
    best: 39.9 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • 10-shot image generationonnuScenes
    IoU veh - 224x480 - Vis filter. - 100x100 at 0.5· 2022-05-05
    36
    best: 44.7 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • 10-shot image generationonnuScenes
    IoU veh - 448x800 - No vis filter - 100x100 at 0.5· 2022-05-05
    32.5
    best: 43.2 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • 10-shot image generationonnuScenes
    IoU veh - 448x800 - Vis filter. - 100x100 at 0.5· 2022-05-05
    37.7
    best: 48.7 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833

Computer Vision4 results

  • Bird's-Eye View Semantic SegmentationonnuScenes
    IoU veh - 224x480 - No vis filter - 100x100 at 0.5· 2022-05-05
    31.4
    best: 39.9 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • Bird's-Eye View Semantic SegmentationonnuScenes
    IoU veh - 224x480 - Vis filter. - 100x100 at 0.5· 2022-05-05
    36
    best: 44.7 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • Bird's-Eye View Semantic SegmentationonnuScenes
    IoU veh - 448x800 - No vis filter - 100x100 at 0.5· 2022-05-05
    32.5
    best: 43.2 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833
  • Bird's-Eye View Semantic SegmentationonnuScenes
    IoU veh - 448x800 - Vis filter. - 100x100 at 0.5· 2022-05-05
    37.7
    best: 48.7 (PointBeV)
    Cross-view Transformers for real-time Map-view Semantic SegmentationarXiv:2205.02833

Natural Language Processing1 result

  • Machine TranslationonIWSLT2015 English-Vietnamese
    BLEU· uses extra data· 2018-09-22
    29.6
    best: 40.2 (EnViT5 + MTet)
    SOTA
    Semi-Supervised Sequence Modeling with Cross-View TrainingarXiv:1809.08370