Wilds
Builds on top of recent data collection efforts by domain experts in these applications and provides a unified collection of datasets with evaluation metrics and train/test splits that are representative of real-world distribution shifts.
The v2.0 update adds unlabeled data to 8 datasets. The labeled data and evaluation metrics are exactly the same, so all previous results are directly comparable.
Source: WILDS: A Benchmark of in-the-Wild Distribution Shifts
Related Benchmarks
WildScenes/10-shot image generation/mIoUWildScenes/10-shot image generation/mIoU (Env DA)WildScenes/10-shot image generation/mIoU (Temporal DA)WildScenes/2D Semantic Segmentation/mIoUWildScenes/2D Semantic Segmentation/mIoU (Env DA)WildScenes/2D Semantic Segmentation/mIoU (Temporal DA) WildScenes/3D Semantic Segmentation/mIoUWildScenes/3D Semantic Segmentation/mIoU (Env DA)WildScenes/3D Semantic Segmentation/mIoU (Temporal DA)WildScenes/Semantic Segmentation/mIoUWildScenes/Semantic Segmentation/mIoU (Env DA)WildScenes/Semantic Segmentation/mIoU (Temporal DA)