Unbiased Scene Graph Generation from Biased Training

Kaihua Tang, Yulei Niu, Jianqiang Huang, Jiaxin Shi, Hanwang Zhang

2020-02-27 | CVPR 2020 6 | Scene Graph Generation | Causal Inference | Graph Generation | Unbiased Scene Graph Generation

Abstract

Today's scene graph generation (SGG) task is still far from practical, mainly due to severe training bias, e.g., collapsing diverse "human walk on / sit on / lay on beach" into "human on beach". Given such SGG, downstream tasks such as VQA can hardly infer better scene structures than merely a bag of objects. However, debiasing in SGG is not trivial because traditional debiasing methods cannot distinguish between the good and bad bias, e.g., good context prior (e.g., "person read book" rather than "eat") and bad long-tailed bias (e.g., "near" dominating "behind / in front of"). In this paper, we present a novel SGG framework based on causal inference but not the conventional likelihood. We first build a causal graph for SGG, and perform traditional biased training with the graph. Then, we propose to draw the counterfactual causality from the trained graph to infer the effect from the bad bias, which should be removed. In particular, we use Total Direct Effect (TDE) as the proposed final predicate score for unbiased SGG. Note that our framework is agnostic to any SGG model and thus can be widely applied by anyone in the community seeking unbiased predictions. By using the proposed Scene Graph Diagnosis toolkit on the SGG benchmark Visual Genome and several prevailing models, we observed significant improvements over the previous state-of-the-art methods.
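
The abstract names Total Direct Effect as the final predicate score but does not spell out the computation. The following is a minimal sketch, assuming TDE is taken as the factual predicate logits minus the logits of a counterfactual pass in which the visual evidence is blanked out while the object context is kept. The scorer `toy_score`, its weights, and the context prior are hypothetical stand-ins for a trained SGG model, not the authors' implementation.

```python
from typing import Callable, List, Sequence, Tuple

Context = Tuple[str, str]  # (subject class, object class)
ScoreFn = Callable[[Sequence[float], Context], List[float]]

PREDICATES = ["on", "sitting on", "walking on"]

# Hypothetical parameters of a toy biased scorer (assumptions for illustration).
VISUAL_WEIGHTS = [[0.2, 0.1], [1.5, -0.5], [-0.5, 1.5]]
CONTEXT_PRIOR = {("person", "beach"): [3.0, 0.5, 0.5]}  # long-tailed prior favours "on"


def toy_score(visual: Sequence[float], context: Context) -> List[float]:
    """Biased predicate logits: visual evidence plus a context-dependent prior."""
    prior = CONTEXT_PRIOR[context]
    return [
        sum(w * v for w, v in zip(row, visual)) + b
        for row, b in zip(VISUAL_WEIGHTS, prior)
    ]


def total_direct_effect(
    score_fn: ScoreFn,
    visual: Sequence[float],
    context: Context,
    blank_visual: Sequence[float],
) -> List[float]:
    """Factual logits minus counterfactual logits (visual input blanked, context kept)."""
    factual = score_fn(visual, context)
    counterfactual = score_fn(blank_visual, context)
    return [f - c for f, c in zip(factual, counterfactual)]


def argmax(values: Sequence[float]) -> int:
    return max(range(len(values)), key=lambda i: values[i])


visual = [1.0, 0.0]
context = ("person", "beach")

biased_logits = toy_score(visual, context)
tde_logits = total_direct_effect(toy_score, visual, context, [0.0, 0.0])

biased_prediction = PREDICATES[argmax(biased_logits)]
tde_prediction = PREDICATES[argmax(tde_logits)]
print(biased_prediction, "->", tde_prediction)
```

In this toy case the biased logits pick "on" because the context prior dominates, while the TDE scores pick "sitting on", the predicate the visual evidence supports. The toy scorer is additive, so the subtraction removes the prior exactly; with a real model the fusion is generally nonlinear and the effect is only approximate.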

Results

The results below are listed identically under three task labels: Scene Parsing, 2D Semantic Segmentation, and Scene Graph Generation.

Dataset | Metric | Value | Model
Visual Genome | Recall@50 | 31.93 | Causal-TDE
Visual Genome | mean Recall @20 | 6.9 | Causal-TDE
Visual Genome | F@100 | 36.9 | TDE (VCTree-ResNeXt-101-FPN backbone; PredCls mode)
Visual Genome | mR@20 | 19.2 | TDE (VCTree-ResNeXt-101-FPN backbone; PredCls mode)
Visual Genome | ng-mR@20 | 20.9 | TDE (VCTree-ResNeXt-101-FPN backbone; PredCls mode)
Visual Genome | F@100 | 37.2 | TDE (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Visual Genome | mR@20 | 17.4 | TDE (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Visual Genome | ng-mR@20 | 18.7 | TDE (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Visual Genome | F@100 | 18.6 | TDE (VCTree-ResNeXt-101-FPN backbone; SGCls mode)
Visual Genome | mR@20 | 11.2 | TDE (VCTree-ResNeXt-101-FPN backbone; SGCls mode)
Visual Genome | ng-mR@20 | 12.4 | TDE (VCTree-ResNeXt-101-FPN backbone; SGCls mode)
Visual Genome | F@100 | 19.9 | TDE (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Visual Genome | mR@20 | 9.9 | TDE (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Visual Genome | ng-mR@20 | 10.7 | TDE (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Visual Genome | F@100 | 15.1 | TDE (VCTree-ResNeXt-101-FPN backbone; SGDet mode)
Visual Genome | mR@20 | 6.8 | TDE (VCTree-ResNeXt-101-FPN backbone; SGDet mode)
Visual Genome | ng-mR@20 | 7.8 | TDE (VCTree-ResNeXt-101-FPN backbone; SGDet mode)
Visual Genome | F@100 | 13.2 | TDE (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Visual Genome | mR@20 | 9.7 | TDE (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Visual Genome | ng-mR@20 | 7.4 | TDE (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
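
The results report both plain recall and mean recall (mR). As a rough illustration of why the two can diverge under the long-tailed bias the abstract describes, here is a minimal sketch that assumes mean recall is the average of per-predicate recalls. It is simplified to a single image and to exact triple matching, with no bounding-box overlap check; the function name and the sample triples are hypothetical and this is not the benchmark's evaluation code.

```python
from collections import defaultdict
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def recall_and_mean_recall(
    gt_triples: List[Triple], ranked_predictions: List[Triple], k: int
) -> Tuple[float, float]:
    """Return (Recall@k, mean Recall@k) for one image under exact triple matching."""
    top_k = set(ranked_predictions[:k])
    hits = defaultdict(int)
    totals = defaultdict(int)
    for triple in gt_triples:
        predicate = triple[1]
        totals[predicate] += 1
        if triple in top_k:
            hits[predicate] += 1
    # Plain recall pools all ground-truth triples together.
    recall = sum(hits.values()) / sum(totals.values())
    # Mean recall averages per-predicate recall, so rare predicates count equally.
    mean_recall = sum(hits[p] / totals[p] for p in totals) / len(totals)
    return recall, mean_recall


ground_truth = [
    ("person", "on", "beach"),
    ("dog", "on", "beach"),
    ("kid", "on", "sand"),
    ("person", "walking on", "beach"),
]
predictions = [
    ("person", "on", "beach"),
    ("dog", "on", "beach"),
    ("kid", "on", "sand"),
    ("person", "near", "beach"),
]

recall, mean_recall = recall_and_mean_recall(ground_truth, predictions, k=3)
print(recall, mean_recall)
```

Here a model that only ever predicts the frequent predicate "on" reaches a recall of 0.75 but a mean recall of 0.5, because the missed "walking on" triple weighs as much as the whole "on" class.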

Related Papers

NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation (2025-07-17)
GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation (2025-07-10)
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning (2025-07-08)
Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning (2025-07-07)
GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning (2025-07-04)
CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations (2025-06-26)
CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery (2025-06-26)
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions (2025-06-24)