TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets

383 machine learning datasets

Filter by Modality

  • Images3,275
  • Texts3,148
  • Videos1,019
  • Audio486
  • Medical395
  • 3D383
  • Time series298
  • Graphs285
  • Tabular271
  • Speech199
  • RGB-D192
  • Environment148
  • Point cloud135
  • Biomedical123
  • LiDAR95
  • RGB Video87
  • Tracking78
  • Biology71
  • Actions68
  • 3d meshes65
  • Tables52
  • Music48
  • EEG45
  • Hyperspectral images45
  • Stereo44
  • MRI39
  • Physics32
  • Interactive29
  • Dialog25
  • Midi22
  • 6D17
  • Replay data11
  • Financial10
  • Ranking10
  • Cad9
  • fMRI7
  • Parallel6
  • Lyrics2
  • PSG2
Clear filter

383 dataset results

Aria Digital Twin Dataset

A real-world dataset, with hyper-accurate digital counterpart & comprehensive ground-truth annotation.

2 papers6 benchmarks3D, 3d meshes, Point cloud, RGB Video, Videos

NRHints-Synthetic (NRHints Synthetic Relighting Scenes)

A high-quality synthetic dataset for object relighting. Covering a wide range of geometry and material.

2 papers0 benchmarks3D, Images

NRHints-RealCapture (NRHints Real Captured Objects)

A high-quality captured dataset for object relighting. Covering a wide range of geometry and material.

2 papers0 benchmarks3D, Images

BASEPROD (The Bardenas Semi-Desert Planetary Rover Dataset)

BASEPROD provides comprehensive rover sensor data collected over a 1.7 km traverse, accompanied by high-resolution 2D and 3D drone maps of the terrain. The dataset also includes laser-induced breakdown spectroscopy (LIBS) measurements from key sampling sites along the rover's path, as well as weather station data to contextualize environmental conditions.

2 papers0 benchmarks3D, Environment, Images, Point cloud, RGB-D, Stereo, Tabular, Time series

RePAIR Dataset

Our dataset consists of over 1000 fractured frescoes. The RePAIR stands as a realistic computational challenge for methods for 2D and 3D puzzle solving, and serves as a benchmark that enables the study of fractured object reassembly and presents new challenges for geometric shape understanding. Please visit our website for more dataset information, access to source code scripts and for an interactive gallery viewing of the dataset samples.

2 papers0 benchmarks3D, Images

ThermoHands

ThermoHands is the first benchmark dataset specifically designed for egocentric 3D hand pose estimation from thermal images. It addresses the challenges of hand pose estimation in low-light conditions and when the hand is occluded by gloves or other wearables—scenarios where traditional RGB or NIR-based systems struggle.

2 papers0 benchmarks3D, Images, Videos

Beacon3D

Dataset of the Beacon3D benchmark: Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis.

2 papers0 benchmarks3D, Texts

Combinatorial 3D Shape Dataset

The combinatorial 3D shape dataset is composed of 406 instances of 14 classes. Specifically, each object in the dataset is considered equivalent to a sequence of primitive placement.

1 papers0 benchmarks3D, Images

Fine-grained 3D Pose

A new large-scale dataset that consists of 409 fine-grained categories and 31,881 images with accurate 3D pose annotation.

1 papers0 benchmarks3D

JHU CoSTAR Block Stacking Dataset

Involves data where a robot interacts with 5.1 cm colored blocks to complete an order-fulfillment style block stacking task. It contains dynamic scenes and real time-series data in a less constrained environment than comparable datasets. There are nearly 12,000 stacking attempts and over 2 million frames of real data.

1 papers0 benchmarks3D, Images, Point cloud, RGB Video, RGB-D

PointDenoisingBenchmark

The PointDenoisingBenchmark dataset features 28 different shapes, split into 18 training shapes and 10 test shapes.

1 papers0 benchmarks3D

ShapeNet-Skeleton

The ShapeNet-Skeleton dataset has ground-truth skeleton point sets and skeletal volumes for object instances in the ShapeNet dataset.

1 papers0 benchmarks3D

SIDOD

SIDOD is a new, publicly-available image dataset generated by the NVIDIA Deep Learning Data Synthesizer intended for use in object detection, pose estimation, and tracking applications. This dataset contains 144k stereo image pairs that synthetically combine 18 camera viewpoints of three photorealistic virtual environments with up to 10 objects (chosen randomly from the 21 object models of the YCB dataset) and flying distractors.

1 papers0 benchmarks3D, Images

Store dataset

The Store Dataset is a dataset for estimating 3D poses of multiple humans in real-time. It is captured inside two kinds of simulated stores with 12 and 28 cameras, respectively.

1 papers0 benchmarks3D

Minecraft Segmentation

Minecraft Segmentation is a segmentation dataset for the Minecraft House that adds semantic segmentation labels for sub-components of the house. There are 2050 houses in total and 1038 distinct labels of subcomponents.

1 papers0 benchmarks3D

3ThreeDWorld

TDW is a 3D virtual world simulation platform, utilizing state-of-the-art video game engine technology. A TDW simulation consists of two components: a) the Build, a compiled executable running on the Unity3D Engine, which is responsible for image rendering, audio synthesis and physics simulations; and b) the Controller, an external Python interface to communicate with the build.

1 papers0 benchmarks3D, Environment

SARA motion (Synthetic Actors and Real Actions)

Sara motion is a 3D motion dataset, named Synthetic Actors and Real Actions (SARA), for training a model to produce motion embeddings suitable for reasoning about motion similarity.

1 papers0 benchmarks3D, Videos

AMT Objects

AMT Objects is a large dataset of object centric videos suitable for training and benchmarking models for generating 3D models of objects from a small number of photos of the objects. The dataset consists of multiple views of a large collection of object instances.

1 papers0 benchmarks3D, Videos

Boombox

Boombox is a multi-modal dataset for visual reconstruction from acoustic vibrations. Involves dropping objects into a box and capturing resulting images and vibrations. Used for training ML systems that predict images from vibration.

1 papers0 benchmarks3D, Audio, Images, RGB-D, Time series

Multi-template MRI mouse brain atlas (Multi-template MRI mouse brain atlas for both in vivo and ex vivo analysis)

Mouse Brain MRI atlas (both in-vivo and ex-vivo) (repository relocated from the original webpage)

1 papers0 benchmarks3D, Biomedical, Images, MRI, Medical
PreviousPage 14 of 20Next