Datasets

3,148 machine learning datasets

3,148 dataset results

MuMiN-small

This is the small version of the MuMiN dataset.

1 papers2 benchmarksGraphs, Images, Texts

MuMiN-medium

This is the medium version of the MuMiN dataset.

1 papers2 benchmarksGraphs, Images, Texts

MuMiN-large

This is the large version of the MuMiN dataset.

1 papers2 benchmarksGraphs, Images, Texts

TraVLR is a synthetic dataset comprising four visio-linguistic reasoning tasks. Each example encodes the scene bimodally such that either modality can be dropped during training/testing with no loss of relevant information. TraVLR's training and testing distributions are also constrained along task-relevant dimensions, enabling the evaluation of out-of-distribution generalisation.

1 papers0 benchmarksTexts

LSFB Datasets (French Belgian Sign Language Datasets)

Sign Language Datasets for French Belgian Sign Language This dataset is built upon the work of Belgian linguists from the University of Namur. During eight years, they've collected and annotated 50 hours of videos depicting sign language conversation. 100 signers were recorded, making it one of the most representative sign language corpus. The annotation has been sanitized and enriched with metadata to construct two, easy to use, datasets for sign language recognition. One for continuous sign language recognition and the other for isolated sign recognition.

1 papers0 benchmarksTexts, Videos

CANDOR Corpus (CANDOR = Conversation: A Naturalistic Dataset of Online Recordings)

The CANDOR corpus is a large, novel, multimodal corpus of 1,656 recorded conversations in spoken English. This 7+ million word, 850 hour corpus totals over 1TB of audio, video, and transcripts, with moment-to-moment measures of vocal, facial, and semantic expression, along with an extensive survey of speaker post conversation reflections.

1 papers0 benchmarksImages, Tabular, Texts, Time series, Videos

NELA-GT-2021

NELA-GT-2021 is the fourth installment of the NELA-GT datasets, NELA-GT-2021. The dataset contains 1.8M articles from 367 outlets between January 1st, 2021 and December 31st, 2021. Just as in past releases of the dataset, NELA-GT-2021 includes outlet-level veracity labels from Media Bias/Fact Check and tweets embedded in collected news articles.

1 papers0 benchmarksTexts

BBAI Dataset (Black-box Agent Integration)

This dataset is for evaluating the task of Black-box Multi-agent Integration which focuses on combining the capabilities of multiple black-box conversational agents at scale. It provides data to explore two main frameworks of exploration: question agent pairing and question response pairing.

1 papers1 benchmarksTexts

GD-NLI (Generated Debiased NLI Datasets)

This is a set of debiased Natural Language Inference (NLI) datasets produced by the paper Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets. The datasets are constructed by augmenting SNLI or MNLI with data samples that are generated to mitigate the spurious correlations in the original datasets. Please visit this repository for more details.

1 papers0 benchmarksTexts

V3C1 (the Vimeo Creative Commons Collection 1)

The dataset has been designed to represent true web videos in the wild, with good visual quality and diverse content characteristics, and will serve as evaluation basis for the Video Browser Showdown 2019-2021 and TREC Video Retrieval (TRECVID) Ad-Hoc Video Search tasks 2019-2021. The dataset comes with a shot segmentation (around 1 million shots) for which we analyze content specifics and statistics. Our research shows that the content of V3C1 is very diverse, has no predominant characteristics and provides a low self-similarity. Thus it is very well suited for video retrieval evaluations as well as for participants of TRECVID AVS or the VBS.

1 papers0 benchmarksTexts, Videos

TRECVID-AVS20 (V3C1)

The dataset has been designed to represent true web videos in the wild, with good visual quality and diverse content characteristics, The test video collection for TRECVID-AVS2019-TRECVID-AVS2021, which contains 1,082,649 web video clips, with even more diverse content, no predominant characteristics and low self-similarity.

1 papers1 benchmarksTexts, Videos

RoomEnv-v0 (The Room environment - v0)

The Room environment - v0

1 papers1 benchmarksGraphs, Texts

Korean UnSmile Dataset (SmilegateAI Korean UnSmile Dataset)

1.9K Korean Online Hate Speech Comments for Multilabel Classification (Annotated by Three Independent Labelers per Data)

1 papers0 benchmarksTexts

SSD_ID (Sub-Slot Dialogue dataset id number domain)

SSD (Sub-slot Dialog) dataset: This is the dataset for the ACL 2022 paper "A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots".

1 papers0 benchmarksTexts

SSD_NAME (Sub-Slot Dialogue dataset name domain)

SSD (Sub-slot Dialog) dataset: This is the dataset for the ACL 2022 paper "A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots".

1 papers6 benchmarksTexts

SSD_PLATE (Sub-Slot Dialogue dataset license plate number domain)

SSD (Sub-slot Dialog) dataset: This is the dataset for the ACL 2022 paper "A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots".

1 papers0 benchmarksTexts

ChAII - Hindi and Tamil Question Answering

The dataset covers Hindi and Tamil, collected without the use of translation. It provides a realistic information-seeking task with questions written by native-speaking expert data annotators.

1 papers1 benchmarksTexts

Korean Hate Speech Evaluation Datasets

APEACH is the first crowd-generated Korean evaluation dataset for hate speech detection. Sentences of the dataset are created by anonymous participants using an online crowdsourcing platform DeepNatural AI.

1 papers0 benchmarksTexts

Casino Reviews (Online reviews of North American Casinos from Google Reviews)

This dataset contain online reviews gathered from google reviews written by north american casino users. explain motivations and summary of its content. Can be used to study user experience and relative research directions such as cultural impacts on latency of aspects, domain importance, sentiment analysis, opinion mining, aspect-based sentiment analysis, etc.

1 papers0 benchmarksTexts

60k Stack Overflow Questions (60k Stack Overflow Questions from 2016-2020 classified into three categories based on their quality)

The dataset contains 60,000 Stack Overflow questions from 2016-2020, classified into three categories:

1 papers1 benchmarksTexts

PreviousPage 116 of 158Next