CATER
VideosApache License 2.0
Rendered synthetically using a library of standard 3D objects, and tests the ability to recognize compositions of object movements that require long-term reasoning.
Source: CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Image Source: CATER