
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

DFN-5B H/14-378 + PrefixedIter Decoder (FT0)

Reported on 8 benchmarks across 1 task · 1 paper · 4 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision: 8 results

Zero-Shot Image Classification on OVIC Datasets (World-H)
Overall Score · 2024-07-15
87.9
SOTA
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (World-H)
Prediction Score · 2024-07-15
88.27
SOTA
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (World-H)
Top 1 Accuracy · 2024-07-15
86.95
SOTA
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (Wiki-H)
Top 1 Accuracy · 2024-07-15
77.1
SOTA
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (Wiki-L)
Prediction Score (mean of 3) · 2024-07-15
74.48
best: 74.88 (DFN-5B H/14-378 + PrefixedIter Decoder (FT2))
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (World-H)
Prediction Score (mean of 3) · 2024-07-15
86.41
best: 87.49 (SigLIP SO/14 + PrefixedIter Decoder (FT2))
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (Wiki-H)
Overall Score · 2024-07-15
78.21
best: 79.02 (DFN-5B H/14-378 + PrefixedIter Decoder (FT2))
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211
Zero-Shot Image Classification on OVIC Datasets (Wiki-H)
Prediction Score · 2024-07-15
79.18
best: 80.13 (DFN-5B H/14-378 + PrefixedIter Decoder (FT2))
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion arXiv:2407.11211