TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets

19,997 machine learning datasets

Filter by Modality

  • Images3,275
  • Texts3,148
  • Videos1,019
  • Audio486
  • Medical395
  • 3D383
  • Time series298
  • Graphs285
  • Tabular271
  • Speech199
  • RGB-D192
  • Environment148
  • Point cloud135
  • Biomedical123
  • LiDAR95
  • RGB Video87
  • Tracking78
  • Biology71
  • Actions68
  • 3d meshes65
  • Tables52
  • Music48
  • EEG45
  • Hyperspectral images45
  • Stereo44
  • MRI39
  • Physics32
  • Interactive29
  • Dialog25
  • Midi22
  • 6D17
  • Replay data11
  • Financial10
  • Ranking10
  • Cad9
  • fMRI7
  • Parallel6
  • Lyrics2
  • PSG2

19,997 dataset results

KnowIT VQA

KnowIT VQA is a video dataset with 24,282 human-generated question-answer pairs about The Big Bang Theory. The dataset combines visual, textual and temporal coherence reasoning together with knowledge-based questions, which need of the experience obtained from the viewing of the series to be answered.

9 papers0 benchmarksTexts, Videos

LoDoPaB-CT

LoDoPaB-CT is a dataset of computed tomography images and simulated low-dose measurements. It contains over 40,000 scan slices from around 800 patients selected from the LIDC/IDRI Database.

9 papers1 benchmarksImages, Medical

LogoDet-3K

A logo detection dataset with full annotation, which has 3,000 logo categories, about 200,000 manually annotated logo objects and 158,652 images. LogoDet-3K creates a more challenging benchmark for logo detection, for its higher comprehensive coverage and wider variety in both logo categories and annotated objects compared with existing datasets.

9 papers0 benchmarks

MEIR (Multimodal Entity Image Repurposing)

MEIR is a substantially challenging dataset over that which has been previously available to support research into image repurposing detection. The new dataset includes location, person, and organization manipulations on real-world data sourced from Flickr.

9 papers0 benchmarksImages

MSASL-1000

MSASL is a real-life large-scale sign language data set comprising over 25,000 annotated videos.

9 papers2 benchmarksVideos

NCLS (Neural Cross-Lingual Summarization Corpora)

Presents two high-quality large-scale CLS datasets based on existing monolingual summarization datasets.

9 papers0 benchmarks

OmniArt

Presents half a million samples and structured meta-data to encourage further research and societal engagement.

9 papers1 benchmarks

PathTrack

PathTrack is a dataset for person tracking which contains more than 15,000 person trajectories in 720 sequences.

9 papers0 benchmarksTracking, Videos

PEC (Persona-Based Empathetic Conversational)

A novel large-scale multi-domain dataset for persona-based empathetic conversations.

9 papers0 benchmarks

PTB-TIR

PTB-TIR is a Thermal InfraRed (TIR) pedestrian tracking benchmark, which provides 60 TIR sequences with mannuly annoations. The benchmark is used to fair evaluate TIR trackers.

9 papers0 benchmarksVideos

QuickDraw-Extended

Consists of 330,000 sketches and 204,000 photos spanning across 110 categories.

9 papers0 benchmarksImages

RAVEN-FAIR

RAVEN-FAIR is a modified version of the RAVEN dataset.

9 papers0 benchmarksTexts

ReCO

A human-curated ChineseReading Comprehension dataset on Opinion. The questions in ReCO are opinion based queries issued to the commercial search engine. The passages are provided by the crowdworkers who extract the support snippet from the retrieved documents.

9 papers0 benchmarks

ReDWeb-S

ReDWeb-S is a large-scale challenging dataset for Salient Object Detection. It has totally 3179 images with various real-world scenes and high-quality depth maps. The dataset is split into a training set with 2179 RGB-D image pairs and a testing set with the remaining 1000 image pairs.

9 papers0 benchmarksImages

SelQA

SelQA is a dataset that consists of questions generated through crowdsourcing and sentence length answers that are drawn from the ten most prevalent topics in the English Wikipedia.

9 papers0 benchmarksTexts

SOBA (Shadow-OBject Association)

A new dataset called SOBA, named after Shadow-OBject Association, with 3,623 pairs of shadow and object instances in 1,000 photos, each with individual labeled masks.

9 papers6 benchmarksImages

SPEECH-COCO

SPEECH-COCO contains speech captions that are generated using text-to-speech (TTS) synthesis resulting in 616,767 spoken captions (more than 600h) paired with images.

9 papers0 benchmarksSpeech

SQuADShifts

Provides four new test sets for the Stanford Question Answering Dataset (SQuAD) and evaluate the ability of question-answering systems to generalize to new data.

9 papers0 benchmarks

Standardized Project Gutenberg Corpus

The Standardized Project Gutenberg Corpus (SPGC) is an open science approach to a curated version of the complete PG data containing more than 50,000 books and more than 3×109 word-tokens.

9 papers0 benchmarksTexts

TSAC (Tunisian Sentiment Analysis Corpus)

Tunisian Sentiment Analysis Corpus (TSAC) is a Tunisian Dialect corpus of 17.000 comments from Facebook.

9 papers0 benchmarks
PreviousPage 161 of 1000Next