TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SynthRef: Generation of Synthetic Referring Expressions fo...

SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation

Ioannis Kazakos, Carles Ventura, Miriam Bellver, Carina Silberer, Xavier Giro-i-Nieto

2021-06-08Referring Expression SegmentationSegmentationVideo Object Segmentationobject-detection
PaperPDFCode(official)Code

Abstract

Recent advances in deep learning have brought significant progress in visual grounding tasks such as language-guided video object segmentation. However, collecting large datasets for these tasks is expensive in terms of annotation time, which represents a bottleneck. To this end, we propose a novel method, namely SynthRef, for generating synthetic referring expressions for target objects in an image (or video frame), and we also present and disseminate the first large-scale dataset with synthetic referring expressions for video object segmentation. Our experiments demonstrate that by training with our synthetic referring expressions one can improve the ability of a model to generalize across different datasets, without any additional annotation cost. Moreover, our formulation allows its application to any object detection or segmentation dataset.

Results

TaskDatasetMetricValueModel
Instance SegmentationDAVIS 2017 (val)J&F 1st frame45.3RefVOS + SynthRef-YouTube-VIS
Instance SegmentationDAVIS 2017 (val)J&F Full video44.8RefVOS + SynthRef-YouTube-VIS
Instance SegmentationRefer-YouTube-VOSMean IoU39.5RefVOS-Human REs
Instance SegmentationRefer-YouTube-VOSPrecision@0.538.6RefVOS-Human REs
Instance SegmentationRefer-YouTube-VOSPrecision@0.96.9RefVOS-Human REs
Instance SegmentationRefer-YouTube-VOSMean IoU35RefVOS-Synthetic REs
Instance SegmentationRefer-YouTube-VOSPrecision@0.532.3RefVOS-Synthetic REs
Instance SegmentationRefer-YouTube-VOSPrecision@0.91.8RefVOS-Synthetic REs
Referring Expression SegmentationDAVIS 2017 (val)J&F 1st frame45.3RefVOS + SynthRef-YouTube-VIS
Referring Expression SegmentationDAVIS 2017 (val)J&F Full video44.8RefVOS + SynthRef-YouTube-VIS
Referring Expression SegmentationRefer-YouTube-VOSMean IoU39.5RefVOS-Human REs
Referring Expression SegmentationRefer-YouTube-VOSPrecision@0.538.6RefVOS-Human REs
Referring Expression SegmentationRefer-YouTube-VOSPrecision@0.96.9RefVOS-Human REs
Referring Expression SegmentationRefer-YouTube-VOSMean IoU35RefVOS-Synthetic REs
Referring Expression SegmentationRefer-YouTube-VOSPrecision@0.532.3RefVOS-Synthetic REs
Referring Expression SegmentationRefer-YouTube-VOSPrecision@0.91.8RefVOS-Synthetic REs

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17