TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Situation Recognition with Graph Neural Networks

Situation Recognition with Graph Neural Networks

Ruiyu Li, Makarand Tapaswi, Renjie Liao, Jiaya Jia, Raquel Urtasun, Sanja Fidler

2017-08-14ICCV 2017 10Grounded Situation Recognition
PaperPDFCode

Abstract

We address the problem of recognizing situations in images. Given an image, the task is to predict the most salient verb (action), and fill its semantic roles such as who is performing the action, what is the source and target of the action, etc. Different verbs have different roles (e.g. attacking has weapon), and each role can take on many possible values (nouns). We propose a model based on Graph Neural Networks that allows us to efficiently capture joint dependencies between roles using neural networks defined on a graph. Experiments with different graph connectivities show that our approach that propagates information between roles significantly outperforms existing work, as well as multiple baselines. We obtain roughly 3-5% improvement over previous work in predicting the full situation. We also provide a thorough qualitative analysis of our model and influence of different roles in the verbs.

Results

TaskDatasetMetricValueModel
Situation RecognitionimSituTop-1 Verb36.72GraphNet
Situation RecognitionimSituTop-1 Verb & Value27.52GraphNet
Situation RecognitionimSituTop-5 Verbs61.9GraphNet
Situation RecognitionimSituTop-5 Verbs & Value45.39GraphNet
Situation RecognitionSWiGTop-1 Verb36.72GraphNet
Situation RecognitionSWiGTop-1 Verb & Value27.52GraphNet
Situation RecognitionSWiGTop-5 Verbs61.9GraphNet
Situation RecognitionSWiGTop-5 Verbs & Value45.39GraphNet
Grounded Situation RecognitionSWiGTop-1 Verb36.72GraphNet
Grounded Situation RecognitionSWiGTop-1 Verb & Value27.52GraphNet
Grounded Situation RecognitionSWiGTop-5 Verbs61.9GraphNet
Grounded Situation RecognitionSWiGTop-5 Verbs & Value45.39GraphNet

Related Papers

Dynamic Scene Understanding from Vision-Language Representations2025-01-20Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer2024-04-24Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments2023-07-15ClipSitu: Effectively Leveraging CLIP for Conditional Predictions in Situation Recognition2023-07-02GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement2022-08-18Collaborative Transformers for Grounded Situation Recognition2022-03-30Rethinking the Two-Stage Framework for Grounded Situation Recognition2021-12-10Grounded Situation Recognition with Transformers2021-11-19