TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Unified Transformer Framework for Group-based Segmentati...

A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection

Yukun Su, Jingliang Deng, Ruizhou Sun, Guosheng Lin, Qingyao Wu

2022-03-09Video Salient Object DetectionSemantic SegmentationCo-Salient Object DetectionSalient Object Detectionobject-detectionObject DetectionSaliency Detection
PaperPDFCode(official)

Abstract

Humans tend to mine objects by learning from a group of images or several frames of video since we live in a dynamic world. In the computer vision area, many researches focus on co-segmentation (CoS), co-saliency detection (CoSD) and video salient object detection (VSOD) to discover the co-occurrent objects. However, previous approaches design different networks on these similar tasks separately, and they are difficult to apply to each other, which lowers the upper bound of the transferability of deep learning frameworks. Besides, they fail to take full advantage of the cues among inter- and intra-feature within a group of images. In this paper, we introduce a unified framework to tackle these issues, term as UFO (Unified Framework for Co-Object Segmentation). Specifically, we first introduce a transformer block, which views the image feature as a patch token and then captures their long-range dependencies through the self-attention mechanism. This can help the network to excavate the patch structured similarities among the relevant objects. Furthermore, we propose an intra-MLP learning module to produce self-mask to enhance the network to avoid partial activation. Extensive experiments on four CoS benchmarks (PASCAL, iCoseg, Internet and MSRC), three CoSD benchmarks (Cosal2015, CoSOD3k, and CocA) and four VSOD benchmarks (DAVIS16, FBMS, ViSal and SegV2) show that our method outperforms other state-of-the-arts on three different tasks in both accuracy and speed by using the same network architecture , which can reach 140 FPS in real-time.

Results

TaskDatasetMetricValueModel
VideoFBMS-59AVERAGE MAE0.028UFO
VideoFBMS-59MAX F-MEASURE0.89UFO
VideoFBMS-59S-Measure0.894UFO
VideoSegTrack v2AVERAGE MAE0.022UFO
VideoSegTrack v2MAX F-MEASURE0.863UFO
VideoSegTrack v2S-Measure0.892UFO
VideoDAVIS-2016AVERAGE MAE0.015UFO
VideoDAVIS-2016MAX F-MEASURE0.906UFO
VideoDAVIS-2016S-Measure0.918UFO
VideoViSalAverage MAE0.011UFO
VideoViSalS-Measure0.953UFO
VideoViSalmax E-measure0.987UFO
Saliency DetectionCoSOD3kMAE0.073UFO
Saliency DetectionCoSOD3kS-measure0.819UFO
Saliency DetectionCoSOD3kmax E-measure0.874UFO
Saliency DetectionCoSOD3kmax F-measure0.797UFO
Saliency DetectionCoSOD3kmean E-measure0.855UFO
Saliency DetectionCoSOD3kmean F-measure0.783UFO
Saliency DetectioniCoSegMAE0.029UFO
Saliency DetectioniCoSegS-measure0.924UFO
Saliency DetectioniCoSegmax E-measure0.969UFO
Saliency DetectioniCoSegmax F-measure0.953UFO
Saliency DetectionCoCAMAE0.095UFO
Saliency DetectionCoCAMean F-measure0.555UFO
Saliency DetectionCoCAS-measure0.697UFO
Saliency DetectionCoCAmax E-measure0.782UFO
Saliency DetectionCoCAmax F-measure0.571UFO
Saliency DetectionCoCAmean E-measure0.762UFO
Saliency DetectionCoSal2015MAE0.064UFO
Saliency DetectionCoSal2015S-measure0.86UFO
Saliency DetectionCoSal2015max E-measure0.906UFO
Saliency DetectionCoSal2015max F-measure0.865UFO
Saliency DetectionCoSal2015mean E-measure0.889UFO
Saliency DetectionCoSal2015mean F-measure0.848UFO
Object DetectionCoSOD3kMAE0.073UFO
Object DetectionCoSOD3kS-measure0.819UFO
Object DetectionCoSOD3kmax E-measure0.874UFO
Object DetectionCoSOD3kmax F-measure0.797UFO
Object DetectionCoSOD3kmean E-measure0.855UFO
Object DetectionCoSOD3kmean F-measure0.783UFO
Object DetectioniCoSegMAE0.029UFO
Object DetectioniCoSegS-measure0.924UFO
Object DetectioniCoSegmax E-measure0.969UFO
Object DetectioniCoSegmax F-measure0.953UFO
Object DetectionCoCAMAE0.095UFO
Object DetectionCoCAMean F-measure0.555UFO
Object DetectionCoCAS-measure0.697UFO
Object DetectionCoCAmax E-measure0.782UFO
Object DetectionCoCAmax F-measure0.571UFO
Object DetectionCoCAmean E-measure0.762UFO
Object DetectionCoSal2015MAE0.064UFO
Object DetectionCoSal2015S-measure0.86UFO
Object DetectionCoSal2015max E-measure0.906UFO
Object DetectionCoSal2015max F-measure0.865UFO
Object DetectionCoSal2015mean E-measure0.889UFO
Object DetectionCoSal2015mean F-measure0.848UFO
Object DetectionFBMS-59AVERAGE MAE0.028UFO
Object DetectionFBMS-59MAX F-MEASURE0.89UFO
Object DetectionFBMS-59S-Measure0.894UFO
Object DetectionSegTrack v2AVERAGE MAE0.022UFO
Object DetectionSegTrack v2MAX F-MEASURE0.863UFO
Object DetectionSegTrack v2S-Measure0.892UFO
Object DetectionDAVIS-2016AVERAGE MAE0.015UFO
Object DetectionDAVIS-2016MAX F-MEASURE0.906UFO
Object DetectionDAVIS-2016S-Measure0.918UFO
Object DetectionViSalAverage MAE0.011UFO
Object DetectionViSalS-Measure0.953UFO
Object DetectionViSalmax E-measure0.987UFO
3DCoSOD3kMAE0.073UFO
3DCoSOD3kS-measure0.819UFO
3DCoSOD3kmax E-measure0.874UFO
3DCoSOD3kmax F-measure0.797UFO
3DCoSOD3kmean E-measure0.855UFO
3DCoSOD3kmean F-measure0.783UFO
3DiCoSegMAE0.029UFO
3DiCoSegS-measure0.924UFO
3DiCoSegmax E-measure0.969UFO
3DiCoSegmax F-measure0.953UFO
3DCoCAMAE0.095UFO
3DCoCAMean F-measure0.555UFO
3DCoCAS-measure0.697UFO
3DCoCAmax E-measure0.782UFO
3DCoCAmax F-measure0.571UFO
3DCoCAmean E-measure0.762UFO
3DCoSal2015MAE0.064UFO
3DCoSal2015S-measure0.86UFO
3DCoSal2015max E-measure0.906UFO
3DCoSal2015max F-measure0.865UFO
3DCoSal2015mean E-measure0.889UFO
3DCoSal2015mean F-measure0.848UFO
3DFBMS-59AVERAGE MAE0.028UFO
3DFBMS-59MAX F-MEASURE0.89UFO
3DFBMS-59S-Measure0.894UFO
3DSegTrack v2AVERAGE MAE0.022UFO
3DSegTrack v2MAX F-MEASURE0.863UFO
3DSegTrack v2S-Measure0.892UFO
3DDAVIS-2016AVERAGE MAE0.015UFO
3DDAVIS-2016MAX F-MEASURE0.906UFO
3DDAVIS-2016S-Measure0.918UFO
3DViSalAverage MAE0.011UFO
3DViSalS-Measure0.953UFO
3DViSalmax E-measure0.987UFO
Video Object SegmentationFBMS-59AVERAGE MAE0.028UFO
Video Object SegmentationFBMS-59MAX F-MEASURE0.89UFO
Video Object SegmentationFBMS-59S-Measure0.894UFO
Video Object SegmentationSegTrack v2AVERAGE MAE0.022UFO
Video Object SegmentationSegTrack v2MAX F-MEASURE0.863UFO
Video Object SegmentationSegTrack v2S-Measure0.892UFO
Video Object SegmentationDAVIS-2016AVERAGE MAE0.015UFO
Video Object SegmentationDAVIS-2016MAX F-MEASURE0.906UFO
Video Object SegmentationDAVIS-2016S-Measure0.918UFO
Video Object SegmentationViSalAverage MAE0.011UFO
Video Object SegmentationViSalS-Measure0.953UFO
Video Object SegmentationViSalmax E-measure0.987UFO
RGB Salient Object DetectionCoSOD3kMAE0.073UFO
RGB Salient Object DetectionCoSOD3kS-measure0.819UFO
RGB Salient Object DetectionCoSOD3kmax E-measure0.874UFO
RGB Salient Object DetectionCoSOD3kmax F-measure0.797UFO
RGB Salient Object DetectionCoSOD3kmean E-measure0.855UFO
RGB Salient Object DetectionCoSOD3kmean F-measure0.783UFO
RGB Salient Object DetectioniCoSegMAE0.029UFO
RGB Salient Object DetectioniCoSegS-measure0.924UFO
RGB Salient Object DetectioniCoSegmax E-measure0.969UFO
RGB Salient Object DetectioniCoSegmax F-measure0.953UFO
RGB Salient Object DetectionCoCAMAE0.095UFO
RGB Salient Object DetectionCoCAMean F-measure0.555UFO
RGB Salient Object DetectionCoCAS-measure0.697UFO
RGB Salient Object DetectionCoCAmax E-measure0.782UFO
RGB Salient Object DetectionCoCAmax F-measure0.571UFO
RGB Salient Object DetectionCoCAmean E-measure0.762UFO
RGB Salient Object DetectionCoSal2015MAE0.064UFO
RGB Salient Object DetectionCoSal2015S-measure0.86UFO
RGB Salient Object DetectionCoSal2015max E-measure0.906UFO
RGB Salient Object DetectionCoSal2015max F-measure0.865UFO
RGB Salient Object DetectionCoSal2015mean E-measure0.889UFO
RGB Salient Object DetectionCoSal2015mean F-measure0.848UFO
RGB Salient Object DetectionFBMS-59AVERAGE MAE0.028UFO
RGB Salient Object DetectionFBMS-59MAX F-MEASURE0.89UFO
RGB Salient Object DetectionFBMS-59S-Measure0.894UFO
RGB Salient Object DetectionSegTrack v2AVERAGE MAE0.022UFO
RGB Salient Object DetectionSegTrack v2MAX F-MEASURE0.863UFO
RGB Salient Object DetectionSegTrack v2S-Measure0.892UFO
RGB Salient Object DetectionDAVIS-2016AVERAGE MAE0.015UFO
RGB Salient Object DetectionDAVIS-2016MAX F-MEASURE0.906UFO
RGB Salient Object DetectionDAVIS-2016S-Measure0.918UFO
RGB Salient Object DetectionViSalAverage MAE0.011UFO
RGB Salient Object DetectionViSalS-Measure0.953UFO
RGB Salient Object DetectionViSalmax E-measure0.987UFO
2D ClassificationCoSOD3kMAE0.073UFO
2D ClassificationCoSOD3kS-measure0.819UFO
2D ClassificationCoSOD3kmax E-measure0.874UFO
2D ClassificationCoSOD3kmax F-measure0.797UFO
2D ClassificationCoSOD3kmean E-measure0.855UFO
2D ClassificationCoSOD3kmean F-measure0.783UFO
2D ClassificationiCoSegMAE0.029UFO
2D ClassificationiCoSegS-measure0.924UFO
2D ClassificationiCoSegmax E-measure0.969UFO
2D ClassificationiCoSegmax F-measure0.953UFO
2D ClassificationCoCAMAE0.095UFO
2D ClassificationCoCAMean F-measure0.555UFO
2D ClassificationCoCAS-measure0.697UFO
2D ClassificationCoCAmax E-measure0.782UFO
2D ClassificationCoCAmax F-measure0.571UFO
2D ClassificationCoCAmean E-measure0.762UFO
2D ClassificationCoSal2015MAE0.064UFO
2D ClassificationCoSal2015S-measure0.86UFO
2D ClassificationCoSal2015max E-measure0.906UFO
2D ClassificationCoSal2015max F-measure0.865UFO
2D ClassificationCoSal2015mean E-measure0.889UFO
2D ClassificationCoSal2015mean F-measure0.848UFO
2D ClassificationFBMS-59AVERAGE MAE0.028UFO
2D ClassificationFBMS-59MAX F-MEASURE0.89UFO
2D ClassificationFBMS-59S-Measure0.894UFO
2D ClassificationSegTrack v2AVERAGE MAE0.022UFO
2D ClassificationSegTrack v2MAX F-MEASURE0.863UFO
2D ClassificationSegTrack v2S-Measure0.892UFO
2D ClassificationDAVIS-2016AVERAGE MAE0.015UFO
2D ClassificationDAVIS-2016MAX F-MEASURE0.906UFO
2D ClassificationDAVIS-2016S-Measure0.918UFO
2D ClassificationViSalAverage MAE0.011UFO
2D ClassificationViSalS-Measure0.953UFO
2D ClassificationViSalmax E-measure0.987UFO
2D Object DetectionCoSOD3kMAE0.073UFO
2D Object DetectionCoSOD3kS-measure0.819UFO
2D Object DetectionCoSOD3kmax E-measure0.874UFO
2D Object DetectionCoSOD3kmax F-measure0.797UFO
2D Object DetectionCoSOD3kmean E-measure0.855UFO
2D Object DetectionCoSOD3kmean F-measure0.783UFO
2D Object DetectioniCoSegMAE0.029UFO
2D Object DetectioniCoSegS-measure0.924UFO
2D Object DetectioniCoSegmax E-measure0.969UFO
2D Object DetectioniCoSegmax F-measure0.953UFO
2D Object DetectionCoCAMAE0.095UFO
2D Object DetectionCoCAMean F-measure0.555UFO
2D Object DetectionCoCAS-measure0.697UFO
2D Object DetectionCoCAmax E-measure0.782UFO
2D Object DetectionCoCAmax F-measure0.571UFO
2D Object DetectionCoCAmean E-measure0.762UFO
2D Object DetectionCoSal2015MAE0.064UFO
2D Object DetectionCoSal2015S-measure0.86UFO
2D Object DetectionCoSal2015max E-measure0.906UFO
2D Object DetectionCoSal2015max F-measure0.865UFO
2D Object DetectionCoSal2015mean E-measure0.889UFO
2D Object DetectionCoSal2015mean F-measure0.848UFO
2D Object DetectionFBMS-59AVERAGE MAE0.028UFO
2D Object DetectionFBMS-59MAX F-MEASURE0.89UFO
2D Object DetectionFBMS-59S-Measure0.894UFO
2D Object DetectionSegTrack v2AVERAGE MAE0.022UFO
2D Object DetectionSegTrack v2MAX F-MEASURE0.863UFO
2D Object DetectionSegTrack v2S-Measure0.892UFO
2D Object DetectionDAVIS-2016AVERAGE MAE0.015UFO
2D Object DetectionDAVIS-2016MAX F-MEASURE0.906UFO
2D Object DetectionDAVIS-2016S-Measure0.918UFO
2D Object DetectionViSalAverage MAE0.011UFO
2D Object DetectionViSalS-Measure0.953UFO
2D Object DetectionViSalmax E-measure0.987UFO
16kCoSOD3kMAE0.073UFO
16kCoSOD3kS-measure0.819UFO
16kCoSOD3kmax E-measure0.874UFO
16kCoSOD3kmax F-measure0.797UFO
16kCoSOD3kmean E-measure0.855UFO
16kCoSOD3kmean F-measure0.783UFO
16kiCoSegMAE0.029UFO
16kiCoSegS-measure0.924UFO
16kiCoSegmax E-measure0.969UFO
16kiCoSegmax F-measure0.953UFO
16kCoCAMAE0.095UFO
16kCoCAMean F-measure0.555UFO
16kCoCAS-measure0.697UFO
16kCoCAmax E-measure0.782UFO
16kCoCAmax F-measure0.571UFO
16kCoCAmean E-measure0.762UFO
16kCoSal2015MAE0.064UFO
16kCoSal2015S-measure0.86UFO
16kCoSal2015max E-measure0.906UFO
16kCoSal2015max F-measure0.865UFO
16kCoSal2015mean E-measure0.889UFO
16kCoSal2015mean F-measure0.848UFO
16kFBMS-59AVERAGE MAE0.028UFO
16kFBMS-59MAX F-MEASURE0.89UFO
16kFBMS-59S-Measure0.894UFO
16kSegTrack v2AVERAGE MAE0.022UFO
16kSegTrack v2MAX F-MEASURE0.863UFO
16kSegTrack v2S-Measure0.892UFO
16kDAVIS-2016AVERAGE MAE0.015UFO
16kDAVIS-2016MAX F-MEASURE0.906UFO
16kDAVIS-2016S-Measure0.918UFO
16kViSalAverage MAE0.011UFO
16kViSalS-Measure0.953UFO
16kViSalmax E-measure0.987UFO

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17