Wilds

Builds on top of recent data collection efforts by domain experts in these applications and provides a unified collection of datasets with evaluation metrics and train/test splits that are representative of real-world distribution shifts.

The v2.0 update adds unlabeled data to 8 datasets. The labeled data and evaluation metrics are exactly the same, so all previous results are directly comparable.

Source: WILDS: A Benchmark of in-the-Wild Distribution Shifts

Related Benchmarks

WildScenes/10-shot image generation/mIoU WildScenes/10-shot image generation/mIoU (Env DA)WildScenes/10-shot image generation/mIoU (Temporal DA)WildScenes/2D Semantic Segmentation/mIoU WildScenes/2D Semantic Segmentation/mIoU (Env DA)WildScenes/2D Semantic Segmentation/mIoU (Temporal DA) WildScenes/3D Semantic Segmentation/mIoU WildScenes/3D Semantic Segmentation/mIoU (Env DA)WildScenes/3D Semantic Segmentation/mIoU (Temporal DA)WildScenes/Semantic Segmentation/mIoU WildScenes/Semantic Segmentation/mIoU (Env DA)WildScenes/Semantic Segmentation/mIoU (Temporal DA)