TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-view Aggregation Network for Dichotomous Image Segme...

Multi-view Aggregation Network for Dichotomous Image Segmentation

Qian Yu, Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu

2024-04-11CVPR 2024 1Dichotomous Image SegmentationSemantic SegmentationImage Segmentation
PaperPDFCode(official)

Abstract

Dichotomous Image Segmentation (DIS) has recently emerged towards high-precision object segmentation from high-resolution natural images. When designing an effective DIS model, the main challenge is how to balance the semantic dispersion of high-resolution targets in the small receptive field and the loss of high-precision details in the large receptive field. Existing methods rely on tedious multiple encoder-decoder streams and stages to gradually complete the global localization and local refinement. Human visual system captures regions of interest by observing them from multiple views. Inspired by it, we model DIS as a multi-view object perception problem and provide a parsimonious multi-view aggregation network (MVANet), which unifies the feature fusion of the distant view and close-up view into a single stream with one encoder-decoder structure. With the help of the proposed multi-view complementary localization and refinement modules, our approach established long-range, profound visual interactions across multiple views, allowing the features of the detailed close-up view to focus on highly slender structures.Experiments on the popular DIS-5K dataset show that our MVANet significantly outperforms state-of-the-art methods in both accuracy and speed. The source code and datasets will be publicly available at \href{https://github.com/qianyu-dlut/MVANet}{MVANet}.

Results

TaskDatasetMetricValueModel
Object DetectionDIS-TE4E-measure0.944MVANet
Object DetectionDIS-TE4HCE2331MVANet
Object DetectionDIS-TE4MAE0.041MVANet
Object DetectionDIS-TE4S-Measure0.903MVANet
Object DetectionDIS-TE4max F-Measure0.912MVANet
Object DetectionDIS-TE4weighted F-measure0.857MVANet
Object DetectionDIS-VDE-measure0.941MVANet
Object DetectionDIS-VDHCE893MVANet
Object DetectionDIS-VDMAE0.034MVANet
Object DetectionDIS-VDS-Measure0.905MVANet
Object DetectionDIS-VDmax F-Measure0.904MVANet
Object DetectionDIS-VDweighted F-measure0.863MVANet
Object DetectionDIS-TE2E-measure0.944MVANet
Object DetectionDIS-TE2HCE251MVANet
Object DetectionDIS-TE2MAE0.03MVANet
Object DetectionDIS-TE2S-Measure0.915MVANet
Object DetectionDIS-TE2max F-Measure0.916MVANet
Object DetectionDIS-TE2weighted F-measure0.874MVANet
Object DetectionDIS-TE1E-measure0.911MVANet
Object DetectionDIS-TE1HCE104MVANet
Object DetectionDIS-TE1MAE0.037MVANet
Object DetectionDIS-TE1S-Measure0.879MVANet
Object DetectionDIS-TE1max F-Measure0.873MVANet
Object DetectionDIS-TE1weighted F-measure0.823MVANet
Object DetectionDIS-TE3E-measure0.954MVANet
Object DetectionDIS-TE3HCE525MVANet
Object DetectionDIS-TE3MAE0.031MVANet
Object DetectionDIS-TE3S-Measure0.92MVANet
Object DetectionDIS-TE3max F-Measure0.929MVANet
Object DetectionDIS-TE3weighted F-measure0.89MVANet
3DDIS-TE4E-measure0.944MVANet
3DDIS-TE4HCE2331MVANet
3DDIS-TE4MAE0.041MVANet
3DDIS-TE4S-Measure0.903MVANet
3DDIS-TE4max F-Measure0.912MVANet
3DDIS-TE4weighted F-measure0.857MVANet
3DDIS-VDE-measure0.941MVANet
3DDIS-VDHCE893MVANet
3DDIS-VDMAE0.034MVANet
3DDIS-VDS-Measure0.905MVANet
3DDIS-VDmax F-Measure0.904MVANet
3DDIS-VDweighted F-measure0.863MVANet
3DDIS-TE2E-measure0.944MVANet
3DDIS-TE2HCE251MVANet
3DDIS-TE2MAE0.03MVANet
3DDIS-TE2S-Measure0.915MVANet
3DDIS-TE2max F-Measure0.916MVANet
3DDIS-TE2weighted F-measure0.874MVANet
3DDIS-TE1E-measure0.911MVANet
3DDIS-TE1HCE104MVANet
3DDIS-TE1MAE0.037MVANet
3DDIS-TE1S-Measure0.879MVANet
3DDIS-TE1max F-Measure0.873MVANet
3DDIS-TE1weighted F-measure0.823MVANet
3DDIS-TE3E-measure0.954MVANet
3DDIS-TE3HCE525MVANet
3DDIS-TE3MAE0.031MVANet
3DDIS-TE3S-Measure0.92MVANet
3DDIS-TE3max F-Measure0.929MVANet
3DDIS-TE3weighted F-measure0.89MVANet
RGB Salient Object DetectionDIS-TE4E-measure0.944MVANet
RGB Salient Object DetectionDIS-TE4HCE2331MVANet
RGB Salient Object DetectionDIS-TE4MAE0.041MVANet
RGB Salient Object DetectionDIS-TE4S-Measure0.903MVANet
RGB Salient Object DetectionDIS-TE4max F-Measure0.912MVANet
RGB Salient Object DetectionDIS-TE4weighted F-measure0.857MVANet
RGB Salient Object DetectionDIS-VDE-measure0.941MVANet
RGB Salient Object DetectionDIS-VDHCE893MVANet
RGB Salient Object DetectionDIS-VDMAE0.034MVANet
RGB Salient Object DetectionDIS-VDS-Measure0.905MVANet
RGB Salient Object DetectionDIS-VDmax F-Measure0.904MVANet
RGB Salient Object DetectionDIS-VDweighted F-measure0.863MVANet
RGB Salient Object DetectionDIS-TE2E-measure0.944MVANet
RGB Salient Object DetectionDIS-TE2HCE251MVANet
RGB Salient Object DetectionDIS-TE2MAE0.03MVANet
RGB Salient Object DetectionDIS-TE2S-Measure0.915MVANet
RGB Salient Object DetectionDIS-TE2max F-Measure0.916MVANet
RGB Salient Object DetectionDIS-TE2weighted F-measure0.874MVANet
RGB Salient Object DetectionDIS-TE1E-measure0.911MVANet
RGB Salient Object DetectionDIS-TE1HCE104MVANet
RGB Salient Object DetectionDIS-TE1MAE0.037MVANet
RGB Salient Object DetectionDIS-TE1S-Measure0.879MVANet
RGB Salient Object DetectionDIS-TE1max F-Measure0.873MVANet
RGB Salient Object DetectionDIS-TE1weighted F-measure0.823MVANet
RGB Salient Object DetectionDIS-TE3E-measure0.954MVANet
RGB Salient Object DetectionDIS-TE3HCE525MVANet
RGB Salient Object DetectionDIS-TE3MAE0.031MVANet
RGB Salient Object DetectionDIS-TE3S-Measure0.92MVANet
RGB Salient Object DetectionDIS-TE3max F-Measure0.929MVANet
RGB Salient Object DetectionDIS-TE3weighted F-measure0.89MVANet
2D ClassificationDIS-TE4E-measure0.944MVANet
2D ClassificationDIS-TE4HCE2331MVANet
2D ClassificationDIS-TE4MAE0.041MVANet
2D ClassificationDIS-TE4S-Measure0.903MVANet
2D ClassificationDIS-TE4max F-Measure0.912MVANet
2D ClassificationDIS-TE4weighted F-measure0.857MVANet
2D ClassificationDIS-VDE-measure0.941MVANet
2D ClassificationDIS-VDHCE893MVANet
2D ClassificationDIS-VDMAE0.034MVANet
2D ClassificationDIS-VDS-Measure0.905MVANet
2D ClassificationDIS-VDmax F-Measure0.904MVANet
2D ClassificationDIS-VDweighted F-measure0.863MVANet
2D ClassificationDIS-TE2E-measure0.944MVANet
2D ClassificationDIS-TE2HCE251MVANet
2D ClassificationDIS-TE2MAE0.03MVANet
2D ClassificationDIS-TE2S-Measure0.915MVANet
2D ClassificationDIS-TE2max F-Measure0.916MVANet
2D ClassificationDIS-TE2weighted F-measure0.874MVANet
2D ClassificationDIS-TE1E-measure0.911MVANet
2D ClassificationDIS-TE1HCE104MVANet
2D ClassificationDIS-TE1MAE0.037MVANet
2D ClassificationDIS-TE1S-Measure0.879MVANet
2D ClassificationDIS-TE1max F-Measure0.873MVANet
2D ClassificationDIS-TE1weighted F-measure0.823MVANet
2D ClassificationDIS-TE3E-measure0.954MVANet
2D ClassificationDIS-TE3HCE525MVANet
2D ClassificationDIS-TE3MAE0.031MVANet
2D ClassificationDIS-TE3S-Measure0.92MVANet
2D ClassificationDIS-TE3max F-Measure0.929MVANet
2D ClassificationDIS-TE3weighted F-measure0.89MVANet
2D Object DetectionDIS-TE4E-measure0.944MVANet
2D Object DetectionDIS-TE4HCE2331MVANet
2D Object DetectionDIS-TE4MAE0.041MVANet
2D Object DetectionDIS-TE4S-Measure0.903MVANet
2D Object DetectionDIS-TE4max F-Measure0.912MVANet
2D Object DetectionDIS-TE4weighted F-measure0.857MVANet
2D Object DetectionDIS-VDE-measure0.941MVANet
2D Object DetectionDIS-VDHCE893MVANet
2D Object DetectionDIS-VDMAE0.034MVANet
2D Object DetectionDIS-VDS-Measure0.905MVANet
2D Object DetectionDIS-VDmax F-Measure0.904MVANet
2D Object DetectionDIS-VDweighted F-measure0.863MVANet
2D Object DetectionDIS-TE2E-measure0.944MVANet
2D Object DetectionDIS-TE2HCE251MVANet
2D Object DetectionDIS-TE2MAE0.03MVANet
2D Object DetectionDIS-TE2S-Measure0.915MVANet
2D Object DetectionDIS-TE2max F-Measure0.916MVANet
2D Object DetectionDIS-TE2weighted F-measure0.874MVANet
2D Object DetectionDIS-TE1E-measure0.911MVANet
2D Object DetectionDIS-TE1HCE104MVANet
2D Object DetectionDIS-TE1MAE0.037MVANet
2D Object DetectionDIS-TE1S-Measure0.879MVANet
2D Object DetectionDIS-TE1max F-Measure0.873MVANet
2D Object DetectionDIS-TE1weighted F-measure0.823MVANet
2D Object DetectionDIS-TE3E-measure0.954MVANet
2D Object DetectionDIS-TE3HCE525MVANet
2D Object DetectionDIS-TE3MAE0.031MVANet
2D Object DetectionDIS-TE3S-Measure0.92MVANet
2D Object DetectionDIS-TE3max F-Measure0.929MVANet
2D Object DetectionDIS-TE3weighted F-measure0.89MVANet
16kDIS-TE4E-measure0.944MVANet
16kDIS-TE4HCE2331MVANet
16kDIS-TE4MAE0.041MVANet
16kDIS-TE4S-Measure0.903MVANet
16kDIS-TE4max F-Measure0.912MVANet
16kDIS-TE4weighted F-measure0.857MVANet
16kDIS-VDE-measure0.941MVANet
16kDIS-VDHCE893MVANet
16kDIS-VDMAE0.034MVANet
16kDIS-VDS-Measure0.905MVANet
16kDIS-VDmax F-Measure0.904MVANet
16kDIS-VDweighted F-measure0.863MVANet
16kDIS-TE2E-measure0.944MVANet
16kDIS-TE2HCE251MVANet
16kDIS-TE2MAE0.03MVANet
16kDIS-TE2S-Measure0.915MVANet
16kDIS-TE2max F-Measure0.916MVANet
16kDIS-TE2weighted F-measure0.874MVANet
16kDIS-TE1E-measure0.911MVANet
16kDIS-TE1HCE104MVANet
16kDIS-TE1MAE0.037MVANet
16kDIS-TE1S-Measure0.879MVANet
16kDIS-TE1max F-Measure0.873MVANet
16kDIS-TE1weighted F-measure0.823MVANet
16kDIS-TE3E-measure0.954MVANet
16kDIS-TE3HCE525MVANet
16kDIS-TE3MAE0.031MVANet
16kDIS-TE3S-Measure0.92MVANet
16kDIS-TE3max F-Measure0.929MVANet
16kDIS-TE3weighted F-measure0.89MVANet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15