TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/OmniVec2

OmniVec2

Reported on 34 benchmarks across 14 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision16 results

  • VideoonMiT
    Top 1 Accuracy· uses extra data
    53.1
  • VideoonMoments in Time
    Top 1 Accuracy· uses extra data
    53.1
  • VideoonKinetics-400
    Acc@1
    93.6
  • Image ClassificationoniNaturalist 2018
    Top-1 Accuracy· uses extra data
    94.6
  • Image ClassificationonPlaces365
    Top 1 Accuracy· uses extra data
    65.1
  • Image ClassificationonOxford-IIIT Pet Dataset
    Accuracy· uses extra data
    99.6
  • Shape Representation Of 3D Point CloudsonScanObjectNN
    Overall Accuracy· uses extra data
    97.2
  • Shape Representation Of 3D Point CloudsonModelNet40-C
    Error Rate· uses extra data
    0.142
  • Fine-Grained Image ClassificationonOxford-IIIT Pet Dataset
    Accuracy· uses extra data
    99.6
  • 3D Point Cloud ClassificationonScanObjectNN
    Overall Accuracy· uses extra data
    97.2
  • 3D Point Cloud ClassificationonModelNet40-C
    Error Rate· uses extra data
    0.142
  • 3D Point Cloud ReconstructiononScanObjectNN
    Overall Accuracy· uses extra data
    97.2
  • 3D Point Cloud ReconstructiononModelNet40-C
    Error Rate· uses extra data
    0.142
  • Zero-Shot Video RetrievalonYouCook2
    text-to-video R@1
    26.1
  • Zero-Shot Video RetrievalonYouCook2
    text-to-video R@10
    70.8
  • Zero-Shot Video RetrievalonYouCook2
    text-to-video R@5
    54.1

Knowledge Base8 results

  • Text SummarizationonSAMSum
    BertScoreF1· uses extra data
    65.1
    best: 71.92 (SICK)
  • Text SummarizationonSAMSum
    ROUGE-1· uses extra data
    59.1
  • Text SummarizationonSAMSum
    ROUGE-2· uses extra data
    34.1
  • Text SummarizationonSAMSum
    ROUGE-L· uses extra data
    63.7
  • Text SummarizationonDialogSum
    BertScore· uses extra data
    72.8
  • Text SummarizationonDialogSum
    Rouge1· uses extra data
    47.6
    best: 47.8 (InstructDS)
  • Text SummarizationonDialogSum
    Rouge2· uses extra data
    22.1
    best: 22.2 (InstructDS)
  • Text SummarizationonDialogSum
    RougeL· uses extra data
    41.4

Audio4 results

  • Audio ClassificationonESC-50
    Accuracy (5-fold)· uses extra data
    99.1
  • Audio ClassificationonESC-50
    Top-1 Accuracy· uses extra data
    99.1
  • Audio ClassificationonAudioSet
    Test mAP· uses extra data
    0.558
  • 10-shot image generationonNYU Depth v2
    Mean IoU· uses extra data
    63.6

Methodology3 results

  • ClassificationonESC-50
    Accuracy (5-fold)· uses extra data
    99.1
  • ClassificationonESC-50
    Top-1 Accuracy· uses extra data
    99.1
  • ClassificationonAudioSet
    Test mAP· uses extra data
    0.558

Robots1 result

  • Activity RecognitiononUCF101
    3-fold Accuracy· uses extra data
    99.6
    best: 99.7 (FTP-UniFormerV2-L/14)

Medical1 result

  • Semantic SegmentationonNYU Depth v2
    Mean IoU· uses extra data
    63.6

Time Series1 result

  • Action RecognitiononUCF101
    3-fold Accuracy· uses extra data
    99.6
    best: 99.7 (FTP-UniFormerV2-L/14)