CC3M-TagMask

Images, Texts
Introduced 2024-03-30

The dataset provides tag and mask annotations for image-text pairs from the CC3M validation set. Tag annotations mark the words in each text that describe the relationship between the image and that text, capturing the semantic connection between the pair's visual and textual elements.

Benchmarks

10-shot image generation: mIoU
Classification: Precision, Recall, F1, Accuracy, mAP
Multi-Label Text Classification: Precision, Recall, F1, Accuracy, mAP
Semantic Segmentation: mIoU
Text Classification: Precision, Recall, F1, Accuracy, mAP
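
As a rough illustration of the metrics listed above, the following sketch computes set-based precision, recall, and F1 for predicted tags, and a simplified mIoU for binary masks. The data layout (tags as sets of words, masks as nested lists of 0/1) and all function names are assumptions made for illustration, not the dataset's actual format or any benchmark's official evaluation code. The mIoU here is a plain average of per-pair binary IoU, whereas segmentation benchmarks often average per class instead.

```python
from typing import Iterable, Sequence, Set, Tuple

Mask = Sequence[Sequence[int]]  # hypothetical layout: rows of 0/1 values


def tag_precision_recall_f1(predicted: Set[str], reference: Set[str]) -> Tuple[float, float, float]:
    """Set-based precision, recall, and F1 for the tags of one image-text pair."""
    true_positives = len(predicted & reference)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def binary_mask_iou(predicted: Mask, reference: Mask) -> float:
    """Intersection over union of two equally sized binary masks."""
    intersection = 0
    union = 0
    for pred_row, ref_row in zip(predicted, reference):
        for p, r in zip(pred_row, ref_row):
            if p and r:
                intersection += 1
            if p or r:
                union += 1
    # Two empty masks are treated as a perfect match (a convention chosen here).
    return intersection / union if union else 1.0


def mean_iou(pairs: Iterable[Tuple[Mask, Mask]]) -> float:
    """Average binary IoU over (predicted, reference) mask pairs."""
    ious = [binary_mask_iou(p, r) for p, r in pairs]
    return sum(ious) / len(ious) if ious else 0.0
```

For example, predicting the tags `{"dog", "grass", "ball"}` against the reference `{"dog", "ball"}` gives two true positives out of three predictions and two references, so precision is 2/3 and recall is 1.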