HatemojiCheck is a test suite of 3,930 test cases for detecting emoji-based hate, covering seven functionalities of emoji-based hate and six targeted identities.
The WikiScenes dataset consists of paired images and language descriptions capturing world landmarks and cultural sites, with associated 3D models and camera poses. WikiScenes is derived from the massive public catalog of freely-licensed crowdsourced data in the Wikimedia Commons project, which contains a large variety of images with captions and other metadata.
WDC-Dialogue is a dataset built from Chinese social media to train EVA. Specifically, conversations from various sources are gathered, and a rigorous data-cleaning pipeline is designed to ensure the quality of WDC-Dialogue.
Marine Debris Turntable is a dataset for sonar perception.
The RareDis corpus annotates more than 5,000 rare diseases and almost 6,000 clinical manifestations. Moreover, the inter-annotator agreement evaluation shows relatively high agreement (an F1-measure of 83.5% under exact-match criteria for entities and 81.3% for relations). These results indicate that the corpus is of high quality, marking a significant step for the field given the scarcity of available corpora annotated with rare diseases.
DeepFake MNIST+ is a deepfake facial animation dataset generated by a state-of-the-art image animation generator. It includes 10,000 facial animation videos spanning ten different actions, which can spoof recent liveness detectors.
This is the dataset to support the paper:
The ASR-GLUE benchmark is a collection of 6 different NLU (Natural Language Understanding) tasks for evaluating the performance of models under automatic speech recognition (ASR) errors, across 3 different levels of background noise and 6 speakers with various voice characteristics.
SHIFT15M is a dataset that can be used to properly evaluate models in situations where the distribution of data changes between training and testing. The SHIFT15M dataset has several good properties: (i) Multiobjective: each instance in the dataset has several numerical values that can be used as target variables. (ii) Large-scale: the SHIFT15M dataset consists of 15 million fashion images. (iii) Coverage of types of dataset shifts: SHIFT15M contains multiple dataset-shift problem settings (e.g., covariate shift or target shift). SHIFT15M also enables evaluation of model performance under various magnitudes of dataset shift by varying the shift magnitude.
HeadlineCause is a dataset for detecting implicit causal relations between pairs of news headlines. The dataset includes over 5,000 headline pairs from English news and over 9,000 headline pairs from Russian news, labeled through crowdsourcing. The pairs range from totally unrelated, or belonging to the same general topic, to pairs exhibiting causation and refutation relations.
TIMo (Time-of-Flight Indoor Monitoring) is a dataset of infrared and depth videos intended for use in anomaly detection and person detection/people counting. It features more than 1,500 sequences for anomaly detection, which sum to more than 500,000 individual frames. For person detection, the dataset contains more than 240 sequences. The data was captured using a Microsoft Azure Kinect RGB-D camera. In addition, we provide annotations of anomalous frame ranges for use with anomaly detection, and bounding boxes and segmentation masks for use with person detection. The data was captured partly from a tilted view and partly from a top-down perspective.
We consider the task of identifying human actions visible in online videos. We focus on the widespread genre of lifestyle vlogs, which consist of videos of people performing actions while verbally describing them. Our goal is to identify whether actions mentioned in the speech description of a video are visually present.
BnB is a large-scale and diverse in-domain VLN (Vision and Language Navigation) dataset.
A dataset for evaluating a system's understanding of given passages.
Commonsense-Dialogues is a crowdsourced dataset of ~11K dialogues grounded in social contexts involving utilization of commonsense. The social contexts used were sourced from the train split of the SocialIQA dataset, a multiple-choice question-answering based social commonsense reasoning benchmark.
BenchIE is a benchmark and evaluation framework for comprehensive evaluation of OIE systems in English, Chinese, and German. In contrast to existing OIE benchmarks, BenchIE takes into account the informational equivalence of extractions: its gold standard consists of fact synsets, clusters in which all surface forms of the same fact are exhaustively listed.
The Konzil dataset was created by specialists at the University of Greifswald. It contains manuscripts written in modern German. The training sample consists of 353 lines, the validation sample of 29 lines, and the test sample of 87 lines.
Schiller contains handwritten texts in modern German. The training sample consists of 244 lines, the validation sample of 21 lines, and the test sample of 63 lines.
Ricordi contains handwritten texts in Italian. The training sample consists of 295 lines, the validation sample of 19 lines, and the test sample of 69 lines.
Patzig contains handwritten texts in modern German. The training sample consists of 485 lines, the validation sample of 38 lines, and the test sample of 118 lines.