3,275 machine learning datasets
MacaquePose is an animal pose estimation dataset containing pictures of macaque monkeys and manually labeled annotations on them.
Vinegar Fly is a pose estimation dataset for fruit flies.
Green family of datasets for emergent communications on relations.
MAOMaps is a dataset for evaluation of Visual SLAM, RGB-D SLAM and Map Merging algorithms. It contains 40 samples with RGB and depth images, and ground truth trajectories and maps. These 40 samples are joined into 20 pairs of overlapping maps for map merging methods evaluation. The samples were collected using Matterport3D dataset and Habitat simulator.
Mouse Brain MRI atlas (both in-vivo and ex-vivo) (repository relocated from the original webpage)
The D3DFACS dataset is a dynamic 3D facial expression data set based on the Facial Action Coding System. It contains Action Unit (AU) sequences from 10 people, with 519 sequences in total. The peak image of each expression sequence has been manually FACS coded by a certified expert.
Clickable heat-map visualizations of the experiments run to quantify the Classic ECN AQM problem and to evaluate the success of the Classic AQM Detection and Fall-back algorithm.
This data contains about 2500 trajectories (with images and actions) of a Sawyer robot interacting with various objects.
ARC Ukiyo-e Faces is a large-scale (>10k paintings, >20k faces) Ukiyo-e dataset with coherent semantic labels and geometric annotations, built by augmenting and organizing existing datasets with automatic detection.
Unsplash2K is a high-resolution image dataset with 2K resolution, crawled from Unsplash. It contains 498 high-resolution images and corresponding low-resolution images produced by bicubic downsampling at x2, x4, and x8 scales. Its content is diverse, including animals, architecture, and flowers.
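The low-resolution counterparts described above can be approximated with a few lines of Python. This is only a minimal sketch, assuming the Pillow library is installed; the function name `make_low_res_versions` and the synthetic placeholder image are hypothetical, and the exact resampling settings used by the Unsplash2K authors are not stated in the description.

```python
from PIL import Image


def make_low_res_versions(hr, scales=(2, 4, 8)):
    """Return a dict mapping each scale factor to a bicubically downsampled copy.

    Hypothetical helper for illustration; not the dataset authors' pipeline.
    """
    low_res = {}
    width, height = hr.size
    for scale in scales:
        # Integer division drops any remainder when the size is not divisible.
        target_size = (width // scale, height // scale)
        low_res[scale] = hr.resize(target_size, resample=Image.BICUBIC)
    return low_res


# Placeholder standing in for a real 2K photograph (assumption for the demo).
hr_image = Image.new("RGB", (2048, 1024), color=(120, 60, 200))
low_res = make_low_res_versions(hr_image)
```

With a real image, `Image.open(path)` would replace the synthetic placeholder.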
A collection of photographic and synthetic images intended for analysis of image processing techniques and quality assessment of displays.
Number of images: 1,657, taken during or after the fire.
About 1M Flickr images from the 20th century, dated from the 1910s to the 1990s. The dataset was introduced by Müller et al. and can be found at https://www.radar-service.eu/radar/en/dataset/tJzxrsYUkvPklBOw
This is a pose estimation dataset, consisting of symmetric 3D shapes where multiple orientations are visually indistinguishable. The challenge is to predict all equivalent orientations when only one orientation is paired with each image during training (as is the scenario for most pose estimation datasets). In contrast to most pose estimation datasets, the full set of equivalent orientations is available for evaluation.
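One common way to score a prediction against a full set of equivalent orientations is to take the smallest geodesic rotation error over that set. The sketch below illustrates the idea in plain Python. It assumes, purely for illustration, a shape with n-fold rotational symmetry about its own z axis; the dataset's actual shapes, symmetry groups, and evaluation protocol are not given in the description, and all function names here are hypothetical.

```python
import math


def rot_z(theta):
    """3x3 rotation matrix for an angle theta (radians) about the z axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [list(row) for row in zip(*a)]


def geodesic_angle(r1, r2):
    """Angle (radians) of the relative rotation r1^T r2."""
    rel = matmul(transpose(r1), r2)
    trace = rel[0][0] + rel[1][1] + rel[2][2]
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.acos(cos_angle)


def equivalent_orientations(r, order):
    """All orientations indistinguishable from r under an assumed
    order-fold symmetry about the object's own z axis."""
    return [matmul(r, rot_z(2.0 * math.pi * k / order)) for k in range(order)]


def symmetric_error(pred, gt_set):
    """Smallest geodesic error between pred and any equivalent ground truth."""
    return min(geodesic_angle(pred, g) for g in gt_set)


# Example: a 4-fold symmetric object; the prediction is a quarter turn
# plus 0.05 rad away from the single annotated orientation.
gt = rot_z(0.3)
gt_set = equivalent_orientations(gt, order=4)
pred = rot_z(0.3 + math.pi / 2 + 0.05)

naive_error = geodesic_angle(pred, gt)
sym_error = symmetric_error(pred, gt_set)
```

In this example the naive error against the single annotated orientation is large, while the symmetry-aware error is only the 0.05 rad offset, which is why having the full set of equivalent orientations matters for evaluation.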
MARS dataset processed with our re-Detect and Link (DL) module.
Since robust foreground/background separation and segmentation of cellular objects (i.e., identification of which pixels belong to which objects) strongly depends on image quality, focus artifacts are detrimental to data quality. This image set provides examples of in-focus and out-of-focus synthetic images, which can be used to validate focus metrics.
A large-scale training dataset suffering from the defocus spread effect (DSE) is synthesized by applying an $\alpha$-matte boundary defocus model to the VOC 2012 dataset.
The Oxford Road Boundaries is a dataset designed for training and testing machine-learning-based road-boundary detection and inference approaches.
Probing cross-modal capabilities of Vision & Language models with a counting task.
Calliar is a dataset for Arabic calligraphy. It consists of 2,500 JSON files containing manually annotated strokes.