TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Semantic Diversity-aware Prototype-based Learning for Unbi...

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

Jaehyeong Jeon, Kibum Kim, Kanghoon Yoon, Chanyoung Park

2024-07-22Scene Graph GenerationGraph GenerationUnbiased Scene Graph Generation
PaperPDFCode(official)

Abstract

The scene graph generation (SGG) task involves detecting objects within an image and predicting predicates that represent the relationships between the objects. However, in SGG benchmark datasets, each subject-object pair is annotated with a single predicate even though a single predicate may exhibit diverse semantics (i.e., semantic diversity), existing SGG models are trained to predict the one and only predicate for each pair. This in turn results in the SGG models to overlook the semantic diversity that may exist in a predicate, thus leading to biased predictions. In this paper, we propose a novel model-agnostic Semantic Diversity-aware Prototype-based Learning (DPL) framework that enables unbiased predictions based on the understanding of the semantic diversity of predicates. Specifically, DPL learns the regions in the semantic space covered by each predicate to distinguish among the various different semantics that a single predicate can represent. Extensive experiments demonstrate that our proposed model-agnostic DPL framework brings significant performance improvement on existing SGG models, and also effectively understands the semantic diversity of predicates.

Results

TaskDatasetMetricValueModel
Scene ParsingVisual GenomeF@10044.9DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene ParsingVisual GenomemR@2026.2DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene ParsingVisual Genomeng-mR@2031.3DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene ParsingVisual GenomeF@10025.2DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene ParsingVisual GenomemR@2014.1DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene ParsingVisual Genomeng-mR@2018.5DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene ParsingVisual GenomeF@10020.2DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene ParsingVisual GenomemR@209.4DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene ParsingVisual Genomeng-mR@2010DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
2D Semantic SegmentationVisual GenomeF@10044.9DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
2D Semantic SegmentationVisual GenomemR@2026.2DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
2D Semantic SegmentationVisual Genomeng-mR@2031.3DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
2D Semantic SegmentationVisual GenomeF@10025.2DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
2D Semantic SegmentationVisual GenomemR@2014.1DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
2D Semantic SegmentationVisual Genomeng-mR@2018.5DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
2D Semantic SegmentationVisual GenomeF@10020.2DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
2D Semantic SegmentationVisual GenomemR@209.4DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
2D Semantic SegmentationVisual Genomeng-mR@2010DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene Graph GenerationVisual GenomeF@10044.9DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene Graph GenerationVisual GenomemR@2026.2DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene Graph GenerationVisual Genomeng-mR@2031.3DPL (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
Scene Graph GenerationVisual GenomeF@10025.2DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene Graph GenerationVisual GenomemR@2014.1DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene Graph GenerationVisual Genomeng-mR@2018.5DPL (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
Scene Graph GenerationVisual GenomeF@10020.2DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene Graph GenerationVisual GenomemR@209.4DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
Scene Graph GenerationVisual Genomeng-mR@2010DPL (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)

Related Papers

NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation2025-07-17GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning2025-07-08GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning2025-07-04CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations2025-06-26CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery2025-06-26HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions2025-06-24DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement2025-06-18