TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Fine-Grained Scene Graph Generation with Data Transfer

Fine-Grained Scene Graph Generation with Data Transfer

Ao Zhang, Yuan YAO, Qianyu Chen, Wei Ji, Zhiyuan Liu, Maosong Sun, Tat-Seng Chua

2022-03-22Scene Graph GenerationPredicate ClassificationScene Graph DetectionScene Graph ClassificationGraph GenerationUnbiased Scene Graph Generation
PaperPDFCodeCode(official)

Abstract

Scene graph generation (SGG) is designed to extract (subject, predicate, object) triplets in images. Recent works have made a steady progress on SGG, and provide useful tools for high-level vision and language understanding. However, due to the data distribution problems including long-tail distribution and semantic ambiguity, the predictions of current SGG models tend to collapse to several frequent but uninformative predicates (e.g., on, at), which limits practical application of these models in downstream tasks. To deal with the problems above, we propose a novel Internal and External Data Transfer (IETrans) method, which can be applied in a plug-and-play fashion and expanded to large SGG with 1,807 predicate classes. Our IETrans tries to relieve the data distribution problem by automatically creating an enhanced dataset that provides more sufficient and coherent annotations for all predicates. By training on the enhanced dataset, a Neural Motif model doubles the macro performance while maintaining competitive micro performance. The code and data are publicly available at https://github.com/waxnkw/IETrans-SGG.pytorch.

Results

TaskDatasetMetricValueModel
Scene ParsingVisual GenomeRecall@10027.2IETrans
Scene ParsingVisual GenomeRecall@5023.5IETrans
Scene ParsingVisual Genomemean Recall @10018IETrans
Scene ParsingVisual GenomeF@10044.1IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene ParsingVisual GenomemR@2028.9IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene ParsingVisual Genomeng-mR@2036IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene ParsingVisual GenomeF@10026IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene ParsingVisual GenomemR@2017.5IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene ParsingVisual Genomeng-mR@2021.8IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene ParsingVisual GenomeF@10021.7IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene ParsingVisual GenomemR@2010.9IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene ParsingVisual Genomeng-mR@2013.4IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
2D Semantic SegmentationVisual GenomeRecall@10027.2IETrans
2D Semantic SegmentationVisual GenomeRecall@5023.5IETrans
2D Semantic SegmentationVisual Genomemean Recall @10018IETrans
2D Semantic SegmentationVisual GenomeF@10044.1IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
2D Semantic SegmentationVisual GenomemR@2028.9IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
2D Semantic SegmentationVisual Genomeng-mR@2036IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
2D Semantic SegmentationVisual GenomeF@10026IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
2D Semantic SegmentationVisual GenomemR@2017.5IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
2D Semantic SegmentationVisual Genomeng-mR@2021.8IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
2D Semantic SegmentationVisual GenomeF@10021.7IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
2D Semantic SegmentationVisual GenomemR@2010.9IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
2D Semantic SegmentationVisual Genomeng-mR@2013.4IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene Graph GenerationVisual GenomeRecall@10027.2IETrans
Scene Graph GenerationVisual GenomeRecall@5023.5IETrans
Scene Graph GenerationVisual Genomemean Recall @10018IETrans
Scene Graph GenerationVisual GenomeF@10044.1IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene Graph GenerationVisual GenomemR@2028.9IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene Graph GenerationVisual Genomeng-mR@2036IETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene Graph GenerationVisual GenomeF@10026IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene Graph GenerationVisual GenomemR@2017.5IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene Graph GenerationVisual Genomeng-mR@2021.8IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene Graph GenerationVisual GenomeF@10021.7IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene Graph GenerationVisual GenomemR@2010.9IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene Graph GenerationVisual Genomeng-mR@2013.4IETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)

Related Papers

NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation2025-07-17GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning2025-07-08GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning2025-07-04CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations2025-06-26CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery2025-06-26HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions2025-06-24DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement2025-06-18