Datasets

19,997 machine learning datasets

19,997 dataset results

PerKey

A corpus of 553k news articles from six Persian news websites and agencies with relatively high quality author extracted keyphrases, which is then filtered and cleaned to achieve higher quality keyphrases.

3 papers0 benchmarksTexts

Photoswitch

A benchmark for molecular machine learning where improvements in model performance can be immediately observed in the throughput of promising molecules synthesized in the lab. Photoswitches are a versatile class of molecule for medical and renewable energy applications where a molecule's efficacy is governed by its electronic transition wavelengths.

3 papers0 benchmarks

PhotoSynth

The PhotoSynth (PS) dataset for patch matching consists of a total of 30 scenes with 25 scenes for training and 5 scenes for validation. The different image pairs are captured in different illumination conditions, at different scales and with different viewpoints.

3 papers0 benchmarksImages

PRED18 (PRED18: Predator/Prey DAVIS Dataset)

Twenty DAVIS recordings with a total duration of about 1.25 hour were obtained by driving the two robots in the robot arena of the University of Ulster in Londonderry.

3 papers0 benchmarks

Prostate MRI Segmentation Dataset

This prostate MRI segmentation dataset is collected from six different data sources.

3 papers0 benchmarksMedical

PTL (Pedestrian-Traffic-Lights)

A dataset of pedestrian traffic lights containing over 5000 photos taken at hundreds of intersections in Shanghai.

3 papers0 benchmarks

REFreSD (Rationalized English-French Semantic Divergences)

Consists of English-French sentence-pairs annotated with semantic divergence classes and token-level rationales.

3 papers0 benchmarks

RP2K

A new large-scale retail product dataset for fine-grained image classification. Unlike previous datasets focusing on relatively few products, more than 500,000 images of retail products on shelves were collected, belonging to 2000 different products. The dataset aims to advance the research in retail object recognition, which has massive applications such as automatic shelf auditing and image-based product information retrieval.

3 papers0 benchmarksImages

Ruralscapes

A dataset with high resolution (4K) images and manually-annotated dense labels every 50 frames.

3 papers0 benchmarks

San Francisco Landmark Dataset

The San Francisco Landmark Dataset contains a database of 1.7 million images of buildings in San Francisco with ground truth labels, geotags, and calibration data, as well as a difficult query set of 803 cell phone images taken with a variety of different camera phones. The data is originally acquired by vehicle-mounted cameras with wide-angle lenses capturing spherical panoramic images. For all visible buildings in each panorama, a set of overlapping perspective images is generated.

3 papers3 benchmarksImages

SatStereo

Provides a set of stereo-rectified images and the associated groundtruthed disparities for 10 AOIs (Area of Interest) drawn from two sources: 8 AOIs from IARPA's MVS Challenge dataset and 2 AOIs from the CORE3D-Public dataset.

3 papers0 benchmarksImages

SeasonDepth

Aa new cross-season scaleless monocular depth prediction dataset from CMU Visual Localization dataset through structure from motion.

3 papers0 benchmarks

SEMCAT

Contains more than 6500 words semantically grouped under 110 categories.

3 papers0 benchmarks

Serial Speakers

An annotated dataset of 161 episodes from three popular American TV serials: Breaking Bad, Game of Thrones and House of Cards. Serial Speakers is suitable both for investigating multimedia retrieval in realistic use case scenarios, and for addressing lower level speech related tasks in especially challenging conditions.

3 papers0 benchmarks

SEWA DB

A database of more than 2000 minutes of audio-visual data of 398 people coming from six cultures, 50% female, and uniformly spanning the age range of 18 to 65 years old. Subjects were recorded in two different contexts: while watching adverts and while discussing adverts in a video chat. The database includes rich annotations of the recordings in terms of facial landmarks, facial action units (FAU), various vocalisations, mirroring, and continuously valued valence, arousal, liking, agreement, and prototypic examples of (dis)liking. This database aims to be an extremely valuable resource for researchers in affective computing and automatic human sensing and is expected to push forward the research in human behaviour analysis, including cultural studies.

3 papers0 benchmarks

SpaceNet MVOI (SpaceNet Multi-View Overhead Imagery Dataset)

An open source Multi-View Overhead Imagery dataset with 27 unique looks from a broad range of viewing angles (-32.5 degrees to 54.0 degrees). Each of these images cover the same 665 square km geographic extent and are annotated with 126,747 building footprint labels, enabling direct assessment of the impact of viewpoint perturbation on model performance.

3 papers0 benchmarks

SPIRS

A first-of-its-kind large dataset of sarcastic/non-sarcastic tweets with high-quality labels and extra features: (1) sarcasm perspective labels (2) new contextual features. The dataset is expected to advance sarcasm detection research.

3 papers0 benchmarksTexts

Stream-51

A new dataset for streaming classification consisting of temporally correlated images from 51 distinct object categories and additional evaluation classes outside of the training distribution to test novelty recognition.

3 papers0 benchmarks

ThirdToFirst

Two datasets (synthetic and natural/real) containing simultaneously recorded egocentric and exocentric videos.

3 papers0 benchmarks

TIM-Tremor (Technology in Motion Tremor)

Contains static tasks as well as a multitude of more dynamic tasks, involving larger motion of the hands. The dataset has 55 tremor patient recordings together with: associated ground truth accelerometer data from the most affected hand, RGB video data, and aligned depth data.

3 papers0 benchmarks

PreviousPage 263 of 1000Next