Metric: R@50 (higher is better)
| Rank | Model | R@50 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | CoVR-BLIP-2 | 71.17 | No | CoVR-2: Automatic Data Construction for Composed... | 2023-08-28 | Code |
| 2 | CoLLM (finetuned - BLIP-L/16) | 60.7 | No | CoLLM: A Large Language Model for Composed Image... | 2025-03-25 | Code |
| 3 | SCOT (WACV 2025) | 60.03 | No | SCOT: Self-Supervised Contrastive Pretraining Fo... | 2025-01-12 | - |
| 4 | CoVR-BLIP-2 | 58.44 | No | CoVR-2: Automatic Data Construction for Composed... | 2023-08-28 | Code |
| 5 | MagicLens (CoCa L) | 58.2 | No | MagicLens: Self-Supervised Image Retrieval with ... | 2024-03-28 | Code |
| 6 | CoLLM (Pretrained - BLIP-L/16) | 56 | No | CoLLM: A Large Language Model for Composed Image... | 2025-03-25 | Code |
| 7 | MagicLens (CLIP L) | 52.5 | No | MagicLens: Self-Supervised Image Retrieval with ... | 2024-03-28 | Code |
| 8 | ImageScope (CLIP-ViT-L/14) | 50.78 | No | ImageScope: Unifying Language-Guided Image Retri... | 2025-03-13 | Code |
| 9 | CoLLM (Pretrained - CLIP-L/14) | 49.5 | No | CoLLM: A Large Language Model for Composed Image... | 2025-03-25 | Code |
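
The R@50 column reports recall at rank 50: the percentage of queries whose correct item appears among the 50 highest-ranked candidates. The leaderboard does not state its evaluation protocol, so the following is only a minimal sketch of how such a score is commonly computed, assuming one ground-truth target per query and a precomputed query-by-candidate similarity matrix. The function name `recall_at_k` and its parameters are hypothetical and not taken from any of the listed papers.

```python
import numpy as np


def recall_at_k(similarity, target_indices, k=50):
    """Percentage of queries whose target is among the k best-scoring candidates.

    similarity: (num_queries, num_candidates) array, higher means more similar.
    target_indices: (num_queries,) array with each query's ground-truth candidate index.
    Assumption: exactly one correct candidate per query.
    """
    similarity = np.asarray(similarity, dtype=float)
    target_indices = np.asarray(target_indices)
    # Cap k so it never exceeds the number of candidates.
    k = min(k, similarity.shape[1])
    # Indices of the k highest-scoring candidates per query.
    # Order inside the top-k set does not matter for recall.
    top_k = np.argpartition(-similarity, k - 1, axis=1)[:, :k]
    # A query counts as a hit if its target index is in its top-k set.
    hits = (top_k == target_indices[:, None]).any(axis=1)
    return 100.0 * hits.mean()


# Toy example: 2 queries, 3 candidates (values are illustrative only).
toy_similarity = [[0.9, 0.1, 0.5],
                  [0.2, 0.8, 0.3]]
toy_targets = [2, 0]
print(recall_at_k(toy_similarity, toy_targets, k=2))  # 50.0
```

With k=2 in the toy example, the first query retrieves its target (candidate 2) but the second does not (candidate 0 ranks last), giving 50.0. Benchmarks with several valid targets per query, or with the query image excluded from the candidate pool, would need a modified hit test.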