Cross-Modal Information Retrieval on MSCOCO

Metric: Image-to-text R@1 (higher is better)

LeaderboardDataset
Loading chart...
#ModelImage-to-text R@1Extra DataPaperDateCode
13SHNet85.8No3SHNet: Boosting Image-Sentence Retrieval via Vi...2024-04-26Code