TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets

383 machine learning datasets

Filter by Modality

  • Images3,275
  • Texts3,148
  • Videos1,019
  • Audio486
  • Medical395
  • 3D383
  • Time series298
  • Graphs285
  • Tabular271
  • Speech199
  • RGB-D192
  • Environment148
  • Point cloud135
  • Biomedical123
  • LiDAR95
  • RGB Video87
  • Tracking78
  • Biology71
  • Actions68
  • 3d meshes65
  • Tables52
  • Music48
  • EEG45
  • Hyperspectral images45
  • Stereo44
  • MRI39
  • Physics32
  • Interactive29
  • Dialog25
  • Midi22
  • 6D17
  • Replay data11
  • Financial10
  • Ranking10
  • Cad9
  • fMRI7
  • Parallel6
  • Lyrics2
  • PSG2
Clear filter

383 dataset results

Vastextures (Vast Dataset for textures and PBR materials)

VasTexture is a free giant repository of textures and PBR materials extracted from real-world images. The repository contains 500,000 highly diverse textures and PBR materials. All assets are free to download and use. The PBR materials and textures were extracted from natural images using an unsupervised approach (no human intervention). As a result, the textures and PBR materials are significantly more diverse but also significantly less refined compared to assets made using manual and AI approaches.

1 papers0 benchmarks3D, Images

MuSoHu (Toward human-like social robot navigation: A large-scale, multi-modal, social human navigation dataset)

A large-scale, egocentric, multimodal, and context-aware dataset of human demonstrations of social navigation.

1 papers0 benchmarks3D, Actions, LiDAR, Point cloud, RGB-D, Stereo, Videos

HePIC 🏛️

Heritage Pointcloud Instance Collection dataset, acquired from two large buildings and annotated at a point-wise semantic level based on existent BIM models. Devid Campagnolo, Elena Camuffo, Umberto Michieli, Paolo Borin, Simone Milani and Andrea Giordano, "Fully Automated Scan-to-BIM via Point Cloud Instance Segmentation", In Proceedings of the International Conference on Image Processing (ICIP) 2023.

1 papers2 benchmarks3D, Point cloud

ConSLAM (Construction Dataset for SLAM)

ConSLAM is a real-world dataset collected periodically on a construction site to measure the accuracy of mobile scanners' SLAM algorithms.

1 papers0 benchmarks3D, LiDAR, Point cloud, RGB Video, Tracking, Videos

SBA (Sequentail Brick Assembly Dataset)

The RAD (Randomly Assembled Object Construction) dataset is a synthetic 3D LEGO dataset designed for the task of Sequential Brick Assembly (SBA). Here are the key characteristics and details:

1 papers0 benchmarks3D, 3d meshes, Actions, Images

aiMotive 3D Traffic Light and Traffic Sign Dataset

A large-scale traffic sign and traffic light dataset with accurate 3D positioning and temporally consistent 3D bounding boxes of traffic management objects from up to 200 meters away. The dataset contains additional attributes such as traffic light state, traffic light mask type, traffic sign type, and occlusion. The application areas are 3D traffic lights and sign detection for autonomous driving.

1 papers0 benchmarks3D, Images

RealArt-6

This is the official dataset collected for to test the sim-to-real transfer. It contains 6 articulated object instances, each captured from 20 camera views under 5 states in scenarios with and without background, as well as presence or absence of distractors.

1 papers0 benchmarks3D, Point cloud

Aria Everyday Objects

A small-scale, real-world Project Aria dataset with high quality static 3D oriented bounding boxs annotations.

1 papers6 benchmarks3D, Point cloud, Videos

NuiSI Dataset (Nuitrack Skeleton Interaction Dataset)

The NuiSI dataset contains skeleton tracking trajectories of Human Interaction Partners performing a variety of physically interactive behaviors (waving, handshaking, rocket fistbump, parachute fistbump) with each other. This is inspired by the dataset in Bütepage et al. "Imitating by generating: Deep generative models for imitation of interactive tasks." Frontiers in Robotics and AI (2020) wherein they capture a dataset with rokoko motion capture suits. Instead we track the skeletons of the interaction partner with Intel Realsense cameras using Nuitrack, for a more realistic scenario, with noise coming from the depth sensor, the skeleton tracking and some partial occlusions. This makes it more representative of real world interactions with a Robot equipped with an RGBD camera. T This dataset is used in our papers for training Interaction models for Human-Robot Interaction with a humanoid social robot. If you find the dataset useful in your work, please cite our paper:

1 papers0 benchmarks3D, Tracking

U-10: United-10 COVID19 CT Dataset

This dataset supports the research detailed in the pre-print "Virtual Imaging Trials Improved the Transparency and Reliability of AI Systems in COVID-19 Imaging." The study employs both clinical and simulated CT data to evaluate AI models for COVID-19 diagnosis. By leveraging the Virtual Imaging Trials (VIT) framework, the research addresses reproducibility and generalizability issues prevalent in medical imaging AI models.

1 papers1 benchmarks3D, Images, Medical

BraTS PEDs 2023 (The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs))

Click to add a brief description of the dataset (Markdown and LaTeX enabled).

1 papers0 benchmarks3D, MRI, Medical

BlendNet

📚 BlendNet The dataset contains $12k$ samples. To balance cost savings with data quality and scale, we manually annotated $2k$ samples and used GPT-4o to annotate the remaining $10k$ samples.

1 papers0 benchmarks3D, 3d meshes, Cad, Texts

CADBench

📚 CADBench CADBench is a comprehensive benchmark to evaluate the ability of LLMs to generate CAD scripts. It contains 500 simulated data samples and 200 data samples collected from online forums.

1 papers0 benchmarks3D, 3d meshes, Cad, Texts

RAOS (Rethinking Abdominal Organ Segmentation)

Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases.

1 papers0 benchmarks3D, Biomedical, Images, Medical

S-BIAD843 (Individual 3D cell shapes of Drosophila Wing Disc)

Late third instar wing imaginal discs were cultured in Shields and Sang M3 media (Sigma) supplemented with 2% FBS (Sigma), 1% pen/strep (Gibco), 3ng/ml ecdysone (Sigma) and 2ng/ml insulin (Sigma). Wing discs were cultured in 35mm fluorodishes (WPI) under 12mm filters (Millicell), as described in https://doi.org/10.1038%2Fs41567-019-0618-1

1 papers0 benchmarks3D, Images

MPM-Verse (MPMVerse Physics Simulation Dataset)

This dataset contains Material-Point-Method (MPM) simulations for various materials, including water, sand, plasticine, elasticity, jelly, rigid collisions, and melting. Each material is represented as point-clouds that evolve over time. The dataset is designed for learning and predicting MPM-based physical simulations.

1 papers0 benchmarks3D, Point cloud

Mpm-Verse-Large (MPMVerse Physics Simulation Dataset)

This dataset contains Material-Point-Method (MPM) simulations for various materials, including water, sand, plasticine, jelly, and rigid collisions. Each material is represented as point-clouds that evolve over time. The dataset is designed for learning and predicting MPM-based physical simulations. Each material contains 50 trajectories with different initial velocity field.

1 papers0 benchmarks3D, Point cloud

MeshFLeet (eshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative Modeling Resources)

MeshFleet is a filtered and annotated dataset of High Quality vehicles derived from Objaverse XL. It contains the sha256 of the objects together with consitent object captions and vehicle parameters.

1 papers0 benchmarks3D, Images

Spiideo SoccerNet SynLoc

Synthetic soccer players rendered on top of real world stadium images in 4K covering half a pitch each. Ground truth annotations in form of precise location of players on the pitch as well as 3D location of player pelvis and image bounding boxes.

1 papers18 benchmarks3D, Images

MODIS AOD (imputed) (Pre-processed MODIS AOD and ERA5 data (2003-2022) for North Africa)

Structured atmospheric data for AI/ML Long-term, pre-processed, atmospheric datasets for use in Machine Learning/AI based forecasting. Initially intended to predict AOD, however can be adapted for prediction of other atmospheric particles.

1 papers0 benchmarks3D, Environment, Physics, Time series
PreviousPage 18 of 20Next