MHRI dataset

Multimodal Human-Robot Interaction dataset

AudioImagesRGB-D

The dataset includes recordings from 10 different users teaching the robot different common kitchen objects, that consists of synchronized recordings from three cameras and a microphone mounted on the robot:

An RGB-d camera covers the user manipulation and interaction with the robot
An RGB-d camera mounted on the top of the robot provides a top view of the whole scenario
A HD-RGB camera points to the user head to capture face and expressions