Sound of Water 50

AudioVideosMITIntroduced 2024-11-18

We collect a dataset of 805 clean videos that show the action of pouring water in a container. Our dataset spans over 50 unique containers made of 5 different materials, 4 different shapes and with hot and cold water.

The dataset can be used for the following tasks:

  • Physical property understanding (e.g., predict size of container just from the sound of pouring)
  • Audio generation (given video of pouring, generate a synchronised sound of pouring)
  • Video generation