TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets

11 machine learning datasets

Filter by Modality

  • Images3,275
  • Texts3,148
  • Videos1,019
  • Audio486
  • Medical395
  • 3D383
  • Time series298
  • Graphs285
  • Tabular271
  • Speech199
  • RGB-D192
  • Environment148
  • Point cloud135
  • Biomedical123
  • LiDAR95
  • RGB Video87
  • Tracking78
  • Biology71
  • Actions68
  • 3d meshes65
  • Tables52
  • Music48
  • EEG45
  • Hyperspectral images45
  • Stereo44
  • MRI39
  • Physics32
  • Interactive29
  • Dialog25
  • Midi22
  • 6D17
  • Replay data11
  • Financial10
  • Ranking10
  • Cad9
  • fMRI7
  • Parallel6
  • Lyrics2
  • PSG2
Clear filter

11 dataset results

V-D4RL

V-D4RL provides pixel-based analogues of the popular D4RL benchmarking tasks, derived from the dm_control suite, along with natural extensions of two state-of-the-art online pixel-based continuous control algorithms, DrQ-v2 and DreamerV2, to the offline setting.

16 papers0 benchmarksActions, Images, Replay data

map2seq

7,672 human written natural language navigation instructions for routes in OpenStreetMap with a focus on visual landmarks. Validated in Street View.

5 papers2 benchmarksImages, Interactive, Replay data, Texts

eSports Sensors Dataset

The eSports Sensors dataset contains sensor data collected from 10 players in 22 matches in League of Legends. The sensor data collected includes:

4 papers6 benchmarks6D, Actions, Biomedical, EEG, Environment, Replay data, Tabular, Time series, Tracking

Navigation Turing Test

Replay data from human players and AI agents navigating in a 3D game environment.

3 papers0 benchmarksImages, Replay data, Videos

StarData

StarData is a StarCraft: Brood War replay dataset, with 65,646 games. The full dataset after compression is 365 GB, 1535 million frames, and 496 million player actions. The entire frame data was dumped out at 8 frames per second.

2 papers0 benchmarksActions, Replay data

RLU (RL Unplugged)

RL Unplugged is suite of benchmarks for offline reinforcement learning. The RL Unplugged is designed around the following considerations: to facilitate ease of use, we provide the datasets with a unified API which makes it easy for the practitioner to work with all data in the suite once a general pipeline has been established. This is a dataset accompanying the paper RL Unplugged: Benchmarks for Offline Reinforcement Learning.

2 papers0 benchmarksActions, Environment, Images, Physics, RGB Video, Replay data

SC2ReSet: StarCraft II Esport Replaypack Set

Raw StarCraft II data is subject to processing under the Blizzard end user license agreement (EULA), and in special cases Blizzard AI and Machine Learning License may be applied. Please refer to the materials listed below.

1 papers0 benchmarksReplay data

SC2EGSet: StarCraft II Esport Game State Dataset

SC2EGSet: StarCraft II Esport Game State Dataset

1 papers0 benchmarksReplay data

TSN-FlexTest Traffic Streams for Spot Robot, Tactile Internet, and Generic Data

In this dataset, we provide detailed traffic stream data for the Spot robot, including both the Spot robot control traffic stream and the Spot video stream. The Spot robot traffic streams provide realistic traffic data for communication network evaluations, e.g., for measurements with the TSN FlexText testbed. Furthermore, we share data for the tactile internet including audio, video, and robotic communication. Finally, the dataset includes generic data streams for three different intervals (0.2ms, 0.3ms, and 0.5ms) with two different Ethernet frame sizes. The data is provided as .*pcap which can be replayed with various tools or be analyzed, e.g., with Wireshark. The Spot data streams are split into two directions and are based on Spot API calls.

1 papers0 benchmarksReplay data

MS-HAB-Demonstrations (ManiSkill-HAB Demonstration Datasets)

Whole-body, low-level control/manipulation demonstration dataset for ManiSkill-HAB. Demonstrations are organized by task-subtask-object. All demos use RGBD (128x128) and state. JSON files store metadata (tincluding even labels and success/failure mode), while HDF5 files store demonstration data.

1 papers0 benchmarksActions, Images, RGB-D, Replay data

MIKASA-Robo Dataset

Click to add a brief description of the dataset (Markdown and LaTeX enabled).

1 papers0 benchmarksActions, Images, Replay data