TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/CUB-200-2011

CUB-200-2011

Caltech-UCSD Birds-200-2011

ImagesUnknown

The Caltech-UCSD Birds-200-2011 (CUB-200-2011) dataset is the most widely-used dataset for fine-grained visual categorization task. It contains 11,788 images of 200 subcategories belonging to birds, 5,994 for training and 5,794 for testing. Each image has detailed annotations: 1 subcategory label, 15 part locations, 312 binary attributes and 1 bounding box. The textual information comes from Reed et al.. They expand the CUB-200-2011 dataset by collecting fine-grained natural language descriptions. Ten single-sentence descriptions are collected for each image. The natural language descriptions are collected through the Amazon Mechanical Turk (AMT) platform, and are required at least 10 words, without any information of subcategories and actions.

Source: Fine-grained Visual-textual Representation Learning Image Source: http://www.vision.caltech.edu/visipedia/CUB-200-2011.html

Benchmarks

3D/FIDColorization/PSNR@10Colorization/PSNR@1Colorization/PSNR@100Concept-based Classification/Task Accuracy (%)Concept-based Classification/Concept Accuracy (%)Document Text Classification/AccuracyError Understanding/Average highest confidence (ResNet-101)Error Understanding/Insertion AUC score (ResNet-101)Error Understanding/Average highest confidence (MobileNetV2)Error Understanding/Insertion AUC score (MobileNetV2)Error Understanding/Average highest confidence (EfficientNetV2-M)Error Understanding/Insertion AUC score (EfficientNetV2-M)Fine-Grained Image Classification/AccuracyImage Attribution/Insertion AUC score (ResNet-101)Image Attribution/Deletion AUC score (ResNet-101)Image Classification/AccuracyImage Classification/Task Accuracy (%)Image Classification/Concept Accuracy (%)Image Clustering/NMIImage Matching/Mean PCK@0.05Image Matching/Mean PCK@0.1Image Recognition/AccuracyImage Retrieval/R@1Image Retrieval/R@2Image Retrieval/R@4Image Retrieval/R@8Interpretable Machine Learning/Top 1 AccuracyMetric Learning/R@1Multimodal Deep Learning/AccuracyMultimodal Text and Image Classification/AccuracyObject Localization/GT-known localization accuracyObject Localization/Top-1 Localization AccuracyObject Localization/average top-1 classification accuracyReconstruction/FIDSemantic correspondence/Mean PCK@0.05Semantic correspondence/Mean PCK@0.1Single-View 3D Reconstruction/FIDVisual Recognition/Accuracy (%)Zero-Shot Learning/average top-1 classification accuracyZero-Shot Learning/Accuracy SeenZero-Shot Learning/Accuracy UnseenZero-Shot Learning/HZero-Shot Learning/AccuracyZero-Shot Learning/Harmonic mean

Related Benchmarks

CUB-200-2011 (20 tasks) - 1 epoch/Continual Learning/AccuracyCUB-200-2011 (ResNet-101)/Error Understanding/Average highest confidenceCUB-200-2011 (ResNet-101)/Error Understanding/Insertion AUC scoreCUB-200-2011 - 0-Shot/Few-Shot Image Classification/AP50CUB-200-2011 - 0-Shot/Few-Shot Image Classification/Top-1 AccuracyCUB-200-2011 - 0-Shot/Image Classification/AP50CUB-200-2011 - 0-Shot/Image Classification/Top-1 AccuracyCUB-200-2011 5-way (1-shot)/Few-Shot Image Classification/AccuracyCUB-200-2011 5-way (1-shot)/Image Classification/AccuracyCUB-200-2011 5-way (5-shot)/Few-Shot Image Classification/AccuracyCUB-200-2011 5-way (5-shot)/Image Classification/AccuracyCUB-200-2011, 30 samples per class/Image Classification/AccuracyCUB-200-2011, 5 samples per class/Image Classification/Accuracy

Statistics

Papers
2,235
Benchmarks
45

Links

Homepage

Tasks

3DBird Species Classification With Audio-Visual DataColorizationConcept-based ClassificationCross-Domain Few-ShotDataset Distillation - 1IPCDocument Text ClassificationError UnderstandingFew-Shot Class-Incremental LearningFew-Shot Image ClassificationFew-Shot LearningFine-Grained Image ClassificationFine-Grained Image RecognitionFine-Grained Visual RecognitionGeneralized Few-Shot LearningGeneralized Zero-Shot LearningGraph MatchingImage AttributionImage ClassificationImage ClusteringImage GenerationImage MatchingImage RecognitionImage RetrievalInterpretable Machine LearningLong-tail learning with class descriptorsMetric LearningMulti-Modal Document ClassificationMultimodal Deep LearningMultimodal Text and Image ClassificationObject LocalizationPoint-interactive Image ColorizationReconstructionSemantic correspondenceSingle-View 3D ReconstructionSmall Data Image ClassificationText-to-Image GenerationTransductive Zero-Shot ClassificationUnsupervised Keypoint EstimationVisual RecognitionWeakly-Supervised Object LocalizationZero-Shot Learning