CC3M-TagMask
ImagesTextsIntroduced 2024-03-30
The dataset offers tag and mask annotations for image-text pairs from the CC3M validation set. Tag annotations denote words that aptly describe the relationship between the image and the corresponding text. These annotations provide valuable insights into the semantic connection between each pair's visual and textual elements.
Benchmarks
10-shot image generation/mIoUClassification/PrecisionClassification/RecallClassification/F1Classification/AccuracyClassification/mAPMulti-Label Text Classification/PrecisionMulti-Label Text Classification/RecallMulti-Label Text Classification/F1Multi-Label Text Classification/AccuracyMulti-Label Text Classification/mAPSemantic Segmentation/mIoUText Classification/PrecisionText Classification/RecallText Classification/F1Text Classification/AccuracyText Classification/mAP