Grounded Situation Recognition

6 benchmarks15 papers

Grounded Situation Recognition aims to produce the structured image summary which describes the primary activity (verb), its relevant entities (nouns), and their bounding-box groundings.

Benchmarks