FETA's CLIP-MIL (Many-Shot Image-to-text)

Reported on 6 benchmarks across 2 tasks · 1 paper · 6 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision · 3 results

Image Retrieval on FETA Car-Manuals · R@1: 29 · 2022-09-08 · SOTA · FETA: Towards Specializing Foundation Models for Expert Task Applications, arXiv:2209.03648
Image Retrieval on FETA Car-Manuals · R@5: 59.9 · 2022-09-08 · SOTA · FETA: Towards Specializing Foundation Models for Expert Task Applications, arXiv:2209.03648
Image Retrieval on FETA Car-Manuals · R@10: 72.6 · 2022-09-08 · SOTA · FETA: Towards Specializing Foundation Models for Expert Task Applications, arXiv:2209.03648

Natural Language Processing · 3 results

Image-to-Text Retrieval on FETA Car-Manuals · R@1: 35.5 · 2022-09-08 · SOTA · FETA: Towards Specializing Foundation Models for Expert Task Applications, arXiv:2209.03648
Image-to-Text Retrieval on FETA Car-Manuals · R@5: 58.3 · 2022-09-08 · SOTA · FETA: Towards Specializing Foundation Models for Expert Task Applications, arXiv:2209.03648
Image-to-Text Retrieval on FETA Car-Manuals · R@10: 67 · 2022-09-08 · SOTA · FETA: Towards Specializing Foundation Models for Expert Task Applications, arXiv:2209.03648
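
For readers unfamiliar with the metric, the R@1, R@5, and R@10 values listed above are Recall@K scores. The sketch below shows how Recall@K is commonly computed for retrieval, assuming each query has exactly one correct candidate and that scores are reported as percentages. The function name `recall_at_k`, the toy similarity matrix, and the one-match-per-query setup are illustrative assumptions, not the FETA paper's evaluation code; its exact protocol may differ.

```python
def recall_at_k(similarity, ground_truth, k):
    """Percentage of queries whose correct candidate is ranked in the top k.

    similarity[i][j]: score of candidate j for query i (higher is better).
    ground_truth[i]:  index of the single correct candidate for query i
                      (an assumption of this sketch).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    hits = 0
    for scores, target in zip(similarity, ground_truth):
        # Rank candidate indices from highest to lowest score.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        if target in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(similarity)


# Toy data (hypothetical): 4 queries, 4 candidates each.
similarity = [
    [0.9, 0.1, 0.3, 0.2],  # correct candidate 0 is ranked 1st
    [0.2, 0.4, 0.8, 0.1],  # correct candidate 1 is ranked 2nd
    [0.7, 0.6, 0.1, 0.5],  # correct candidate 2 is ranked 4th
    [0.1, 0.2, 0.3, 0.9],  # correct candidate 3 is ranked 1st
]
ground_truth = [0, 1, 2, 3]

for k in (1, 2, 4):
    print(f"R@{k} = {recall_at_k(similarity, ground_truth, k):.1f}")
```

Because the top-K set only grows as K increases, Recall@K can never decrease with K, which matches the ordering R@1 ≤ R@5 ≤ R@10 in both result groups above.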