TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets

383 machine learning datasets

Filter by Modality

  • Images3,275
  • Texts3,148
  • Videos1,019
  • Audio486
  • Medical395
  • 3D383
  • Time series298
  • Graphs285
  • Tabular271
  • Speech199
  • RGB-D192
  • Environment148
  • Point cloud135
  • Biomedical123
  • LiDAR95
  • RGB Video87
  • Tracking78
  • Biology71
  • Actions68
  • 3d meshes65
  • Tables52
  • Music48
  • EEG45
  • Hyperspectral images45
  • Stereo44
  • MRI39
  • Physics32
  • Interactive29
  • Dialog25
  • Midi22
  • 6D17
  • Replay data11
  • Financial10
  • Ranking10
  • Cad9
  • fMRI7
  • Parallel6
  • Lyrics2
  • PSG2
Clear filter

383 dataset results

StableText2Lego

This dataset contains over 47,000 LEGO structures of over 28,000 unique 3D objects accompanied by detailed captions. It was used to train LegoGPT, the first approach for generating physically stable LEGO brick models from text prompts.

1 papers0 benchmarks3D, Physics, Texts

Remote Flash LiDAR Vehicles Dataset

This dataset includes 3D point-cloud and 2D imagery from a flash LiDAR...

1 papers6 benchmarks3D, Images, LiDAR, Point cloud, Videos

BlurRF-Synth

The first large-scale dataset for training and evaluating novel-view synthesis from blurred images.

1 papers0 benchmarks3D, Images

BlurRF-Real

A real-world low-light camera motion blur dataset for evaluating deblurring radiance fields methods.

1 papers0 benchmarks3D, Images

Why is Allegiant phone number busy? (3D object detection model)

Why is Allegiant's phone number always busy? The Allegiant phone number + 1-(801)-855-(5905).. or ++ 1-(801)-855-(5905).. (No Hold) may be consistently busy due to a high volume of customer inquiries, particularly during peak travel seasons or times of significant flight disruptions.

1 papers0 benchmarks3D

[Travel@Guide® ]What is the 24 hour rule for KLM?

KLM's 24-hour rule allows passengers to cancel or modify their flight booking within 24 hours of purchase without penalty,☎+1||804||853||9001 or +1||801||855||5905✅ as long as the flight was booked at least 7 days before departure.

1 papers0 benchmarks3D

OpenSurfaces

OpenSurfaces is a large database of annotated surfaces created from real-world consumer photographs. The framework used for the annotation process draws on crowdsourcing to segment surfaces from photos, and then annotate them with rich surface properties, including material, texture and contextual information.

0 papers0 benchmarks3D, Images

Princeton Shape

The Princeton Shape dataset provides a repository of 3D models and software tools for evaluating shape-based retrieval and analysis algorithms. The motivation is to promote the use of standardized data sets and evaluation methods for research in matching, classification, clustering, and recognition of 3D models. Researchers are encouraged to use these resources to produce comparisons of competing algorithms in future publications. There are 1,814 models in total.

0 papers0 benchmarks3D

IKEA 3D

IKEA 3D is a dataset of IKEA 3D models and aligned images, which is suitable for pose estimation. There are 759 images and 219 models including Sketchup (skp) and Wavefront (obj) files.

0 papers0 benchmarks3D

Humans in 3D

H3D (Humans in 3D) is a dataset of annotated people. The annotations include:

0 papers0 benchmarks3D, Images

BTFDBB (BTF Database Bonn)

Reflectance measurements of Bidirectional Texture Functions (BTFs)

0 papers0 benchmarks3D, 3d meshes, Hyperspectral images, Images

Florence 3D actions dataset

The dataset collected at the University of Florence during 2012, has been captured using a Kinect camera. It includes 9 activities: wave, drink from a bottle, answer phone,clap, tight lace, sit down, stand up, read watch, bow. During acquisition, 10 subjects were asked to perform the above actions for 2/3 times. This resulted in a total of 215 activity samples.

0 papers0 benchmarks3D

ASL-Skeleton3D

The ASL-Skeleton3D introduces a representation based on mapping into the three-dimensional space the coordinates of the signers in the ASLLVD dataset. This enables a more accurate observation of the body parts and the signs articulation, allowing researchers to better understand the language and explore other approaches to the SLR field.

0 papers0 benchmarks3D

InfiniteRep

InfiniteRep is a synthetic, open-source dataset for fitness and physical therapy (PT) applications. It includes 1k videos of diverse avatars performing multiple repetitions of common exercises. It includes significant variation in the environment, lighting conditions, avatar demographics, and movement trajectories. From cadence to kinematic trajectory, each rep is done slightly differently -- just like real humans. InfiniteRep videos are accompanied by a rich set of pixel-perfect labels and annotations, including frame-specific repetition counts.

0 papers0 benchmarks3D, 3d meshes, Actions, Biomedical, Images, RGB Video, RGB-D, Tracking, Videos

CBCT Walnut (Cone-Beam X-Ray CT Data Collection Designed for Machine Learning)

The scans are performed using a custom-built, highly flexible X-ray CT scanner, the FleX-ray scanner, developed by XRE nvand located in the FleX-ray Lab at the Centrum Wiskunde & Informatica (CWI) in Amsterdam, Netherlands. The general purpose of the FleX-ray Lab is to conduct proof of concept experiments directly accessible to researchers in the field of mathematics and computer science. The scanner consists of a cone-beam microfocus X-ray point source that projects polychromatic X-rays onto a 1536-by-1944 pixels, 14-bit flat panel detector (Dexella 1512NDT) and a rotation stage in-between, upon which a sample is mounted. All three components are mounted on translation stages which allow them to move independently from one another.

0 papers0 benchmarks3D, Medical

4,458 People - 3D Facial Expressions Recognition Data

Description: 4,458 People - 3D Facial Expressions Recognition Data. The collection scenes include indoor scenes and outdoor scenes. The dataset includes males and females. The age distribution ranges from juvenile to the elderly, the young people and the middle aged are the majorities. The device includes iPhone X, iPhone XR. The data diversity includes different expressions, different ages, different races, different collecting scenes. This data can be used for tasks such as 3D facial expression recognition.

0 papers0 benchmarks3D

VIDIMU: Multimodal video and IMU kinematic dataset on daily life activities using affordable devices (https://zenodo.org/record/8210563)

Human activity recognition and clinical biomechanics are challenging problems in physical telerehabilitation medicine. However, most publicly available datasets on human body movements cannot be used to study both problems in an out-of-the-lab movement acquisition setting. The objective of the VIDIMU dataset is to pave the way towards affordable patient tracking solutions for remote daily life activities recognition and kinematic analysis.

0 papers0 benchmarks3D, Biomedical, RGB Video, Time series, Videos

HEADSET (HEADSET: Human Emotion Awareness under Partial Occlusions Multimodal DataSET)

The volumetric representation of human interactions is one of the fundamental domains in the development of immersive media productions and telecommunication applications. Particularly in the context of the rapid advancement of Extended Reality (XR) applications, this volumetric data has proven to be an essential technology for future XR elaboration. In this work, we present a new multimodal database to help advance the development of immersive technologies. Our proposed database provides ethically compliant and diverse volumetric data, in particular 27 participants displaying posed facial expressions and subtle body movements while speaking, plus 11 participants wearing head-mounted displays (HMDs). The recording system consists of a volumetric capture (VoCap) studio, including 31 synchronized modules with 62 RGB cameras and 31 depth cameras. In addition to textured meshes, point clouds, and multi-view RGB-D data, we use one Lytro Illum camera for providing light field (LF) data simul

0 papers0 benchmarks3D, 3d meshes, Audio, Images, Point cloud, RGB Video, RGB-D, Videos

InLUT3D (Indoor Lodz University of Technology Point Cloud Dataset)

This dataset called Indoor Lodz University of Technology Point Cloud Dataset (InLUT3D) is a point cloud set tailored for real object classification and both semantic and instance segmentation tasks. Comprising of 321 scans, some areas in the dataset are covered by multiple scans. All of them are captured using the Leica BLK360 scanner.

0 papers0 benchmarks3D, Graphs, LiDAR, Point cloud

THVD (Talking Head Video Dataset)

About

0 papers0 benchmarks3D, Actions, Audio, Environment, Speech, Videos
PreviousPage 19 of 20Next