ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280, COCO_pretrain)

Reported on 4 benchmarks across 4 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision2 results

Medical1 result

Audio1 result