Metric: R@1 (higher is better)
| # | Model↕ | R@1▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | APTM | 68.51 | Yes | Towards Unified Text-based Person Retrieval: A L... | 2023-06-05 | Code |
| 2 | Filtering-WoRA(Small) | 68.35 | No | From Data Deluge to Data Curation: A Filtering-W... | 2024-04-16 | Code |
| 3 | RDE | 67.68 | Yes | Noisy-Correspondence Learning for Text-to-Image ... | 2023-08-19 | Code |
| 4 | MARS | 67.6 | No | MARS: Paying more attention to visual attributes... | 2024-07-05 | Code |
| 5 | RaSa | 65.28 | No | RaSa: Relation and Sensitivity Aware Representat... | 2023-05-23 | Code |
| 6 | TBPS-CLIP (ViT-B/16) | 65.05 | No | An Empirical Study of CLIP for Text-based Person... | 2023-08-19 | Code |
| 7 | PLIP-RN50 | 64.25 | No | PLIP: Language-Image Pre-training for Person Rep... | 2023-05-15 | Code |
| 8 | IRRA | 63.46 | Yes | Cross-Modal Implicit Relation Reasoning and Alig... | 2023-03-22 | Code |
| 9 | VGSG (ViT-Base) | 63.05 | No | VGSG: Vision-Guided Semantic-Group Network for T... | 2023-11-13 | - |
| 10 | SRCF | 57.18 | No | - | - | Code |
| 11 | SSAN | 54.23 | No | Semantically Self-Aligned Network for Text-to-Im... | 2021-07-27 | Code |