MOMA-LRG
Multi-Object Multi-Actor activity parsing with Language-Refined Graphs
Modalities: Texts, Videos | License: CC BY-SA 4.0 | Introduced: 2022-11-28
A dataset dedicated to multi-object, multi-actor activity parsing.
The dataset contains annotations at three levels:
- Video-level labels (activities)
- Segment-level labels (sub-activities)
- Atomic actions (spatio-temporal scene graph)
The scene graph annotations contain object/actor classes and bounding boxes, relationship annotations, and object/actor attributes.
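
The three annotation levels and the scene graph contents described above can be pictured as a nested data structure. The following is a minimal sketch only: every class name, field name, and label in it (`Activity`, `SubActivity`, `SceneGraph`, `Entity`, `Relationship`, `count_entities`, the "basketball game" example) is hypothetical and chosen for illustration. It is not the dataset's actual file format or official API, and the bounding box layout is an assumption.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Entity:
    """An actor or object in one frame (hypothetical layout)."""
    entity_id: str
    kind: str                                  # "actor" or "object"
    class_name: str                            # e.g. "player", "ball"
    bbox: Tuple[float, float, float, float]    # assumed (x, y, width, height)
    attributes: List[str] = field(default_factory=list)


@dataclass
class Relationship:
    """A directed edge between two entities in the same frame."""
    source_id: str
    predicate: str
    target_id: str


@dataclass
class SceneGraph:
    """Atomic-action level: entities and relationships at one timestamp."""
    timestamp: float
    entities: List[Entity] = field(default_factory=list)
    relationships: List[Relationship] = field(default_factory=list)


@dataclass
class SubActivity:
    """Segment level: a labeled time span holding scene graphs."""
    label: str
    start: float
    end: float
    graphs: List[SceneGraph] = field(default_factory=list)


@dataclass
class Activity:
    """Video level: one activity label for the whole video."""
    label: str
    sub_activities: List[SubActivity] = field(default_factory=list)


def count_entities(activity: Activity) -> Dict[str, int]:
    """Count entity annotations by kind across all scene graphs."""
    counts: Counter = Counter()
    for segment in activity.sub_activities:
        for graph in segment.graphs:
            for entity in graph.entities:
                counts[entity.kind] += 1
    return dict(counts)


# Illustrative, made-up annotation; not taken from the dataset.
example = Activity(
    label="basketball game",
    sub_activities=[
        SubActivity(
            label="free throw",
            start=12.0,
            end=18.5,
            graphs=[
                SceneGraph(
                    timestamp=12.0,
                    entities=[
                        Entity("A1", "actor", "player", (100.0, 80.0, 60.0, 180.0), ["standing"]),
                        Entity("O1", "object", "ball", (130.0, 120.0, 20.0, 20.0)),
                    ],
                    relationships=[Relationship("A1", "holding", "O1")],
                )
            ],
        )
    ],
)

print(count_entities(example))
```

Nesting the levels this way keeps each scene graph tied to the segment and video it belongs to, so a query at one level (for example, all relationships within a given sub-activity) is a simple traversal.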