TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/M$^3$Net: Multilevel, Mixed and Multistage Attention Netwo...

M$^3$Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection

Yao Yuan, Pan Gao, Xiaoyang Tan

2023-09-15Salient Object Detectionobject-detectionObject DetectionRGB Salient Object Detection
PaperPDFCode(official)

Abstract

Most existing salient object detection methods mostly use U-Net or feature pyramid structure, which simply aggregates feature maps of different scales, ignoring the uniqueness and interdependence of them and their respective contributions to the final prediction. To overcome these, we propose the M$^3$Net, i.e., the Multilevel, Mixed and Multistage attention network for Salient Object Detection (SOD). Firstly, we propose Multiscale Interaction Block which innovatively introduces the cross-attention approach to achieve the interaction between multilevel features, allowing high-level features to guide low-level feature learning and thus enhancing salient regions. Secondly, considering the fact that previous Transformer based SOD methods locate salient regions only using global self-attention while inevitably overlooking the details of complex objects, we propose the Mixed Attention Block. This block combines global self-attention and window self-attention, aiming at modeling context at both global and local levels to further improve the accuracy of the prediction map. Finally, we proposed a multilevel supervision strategy to optimize the aggregated feature stage-by-stage. Experiments on six challenging datasets demonstrate that the proposed M$^3$Net surpasses recent CNN and Transformer-based SOD arts in terms of four metrics. Codes are available at https://github.com/I2-Multimedia-Lab/M3Net.

Results

TaskDatasetMetricValueModel
Object DetectionECSSDMAE0.021M3Net-S
Object DetectionECSSDS-Measure0.948M3Net-S
Object DetectionECSSDWeighted F-Measure0.947M3Net-S
Object DetectionECSSDMAE0.029M3Net-R
Object DetectionECSSDS-Measure0.931M3Net-R
Object DetectionECSSDWeighted F-Measure0.919M3Net-R
Object DetectionPASCAL-SMAE0.047M3Net-S
Object DetectionPASCAL-SS-Measure0.889M3Net-S
Object DetectionPASCAL-SWeighted F-Measure0.864M3Net-S
Object DetectionPASCAL-SMAE0.06M3Net-R
Object DetectionPASCAL-SS-Measure0.868M3Net-R
Object DetectionPASCAL-SWeighted F-Measure0.827M3Net-R
Object DetectionHKU-ISMAE0.019M3Net-S
Object DetectionHKU-ISS-Measure0.943M3Net-S
Object DetectionHKU-ISWeighted F-Measure0.937M3Net-S
Object DetectionHKU-ISMAE0.026M3Net-R
Object DetectionHKU-ISS-Measure0.929M3Net-R
Object DetectionHKU-ISWeighted F-Measure0.913M3Net-R
Object DetectionDUTS-TEMAE0.024M3Net-S
Object DetectionDUTS-TES-Measure0.927M3Net-S
Object DetectionDUTS-TEWeighted F-Measure0.902M3Net-S
Object DetectionDUTS-TEMAE0.036M3Net-R
Object DetectionDUTS-TES-Measure0.897M3Net-R
Object DetectionDUTS-TEWeighted F-Measure0.849M3Net-R
Object DetectionDUT-OMRONMAE0.045M3Net-S
Object DetectionDUT-OMRONS-Measure0.872M3Net-S
Object DetectionDUT-OMRONWeighted F-Measure0.811M3Net-S
Object DetectionDUT-OMRONMAE0.061M3Net-R
Object DetectionDUT-OMRONS-Measure0.848M3Net-R
Object DetectionDUT-OMRONWeighted F-Measure0.769M3Net-R
3DECSSDMAE0.021M3Net-S
3DECSSDS-Measure0.948M3Net-S
3DECSSDWeighted F-Measure0.947M3Net-S
3DECSSDMAE0.029M3Net-R
3DECSSDS-Measure0.931M3Net-R
3DECSSDWeighted F-Measure0.919M3Net-R
3DPASCAL-SMAE0.047M3Net-S
3DPASCAL-SS-Measure0.889M3Net-S
3DPASCAL-SWeighted F-Measure0.864M3Net-S
3DPASCAL-SMAE0.06M3Net-R
3DPASCAL-SS-Measure0.868M3Net-R
3DPASCAL-SWeighted F-Measure0.827M3Net-R
3DHKU-ISMAE0.019M3Net-S
3DHKU-ISS-Measure0.943M3Net-S
3DHKU-ISWeighted F-Measure0.937M3Net-S
3DHKU-ISMAE0.026M3Net-R
3DHKU-ISS-Measure0.929M3Net-R
3DHKU-ISWeighted F-Measure0.913M3Net-R
3DDUTS-TEMAE0.024M3Net-S
3DDUTS-TES-Measure0.927M3Net-S
3DDUTS-TEWeighted F-Measure0.902M3Net-S
3DDUTS-TEMAE0.036M3Net-R
3DDUTS-TES-Measure0.897M3Net-R
3DDUTS-TEWeighted F-Measure0.849M3Net-R
3DDUT-OMRONMAE0.045M3Net-S
3DDUT-OMRONS-Measure0.872M3Net-S
3DDUT-OMRONWeighted F-Measure0.811M3Net-S
3DDUT-OMRONMAE0.061M3Net-R
3DDUT-OMRONS-Measure0.848M3Net-R
3DDUT-OMRONWeighted F-Measure0.769M3Net-R
RGB Salient Object DetectionECSSDMAE0.021M3Net-S
RGB Salient Object DetectionECSSDS-Measure0.948M3Net-S
RGB Salient Object DetectionECSSDWeighted F-Measure0.947M3Net-S
RGB Salient Object DetectionECSSDMAE0.029M3Net-R
RGB Salient Object DetectionECSSDS-Measure0.931M3Net-R
RGB Salient Object DetectionECSSDWeighted F-Measure0.919M3Net-R
RGB Salient Object DetectionPASCAL-SMAE0.047M3Net-S
RGB Salient Object DetectionPASCAL-SS-Measure0.889M3Net-S
RGB Salient Object DetectionPASCAL-SWeighted F-Measure0.864M3Net-S
RGB Salient Object DetectionPASCAL-SMAE0.06M3Net-R
RGB Salient Object DetectionPASCAL-SS-Measure0.868M3Net-R
RGB Salient Object DetectionPASCAL-SWeighted F-Measure0.827M3Net-R
RGB Salient Object DetectionHKU-ISMAE0.019M3Net-S
RGB Salient Object DetectionHKU-ISS-Measure0.943M3Net-S
RGB Salient Object DetectionHKU-ISWeighted F-Measure0.937M3Net-S
RGB Salient Object DetectionHKU-ISMAE0.026M3Net-R
RGB Salient Object DetectionHKU-ISS-Measure0.929M3Net-R
RGB Salient Object DetectionHKU-ISWeighted F-Measure0.913M3Net-R
RGB Salient Object DetectionDUTS-TEMAE0.024M3Net-S
RGB Salient Object DetectionDUTS-TES-Measure0.927M3Net-S
RGB Salient Object DetectionDUTS-TEWeighted F-Measure0.902M3Net-S
RGB Salient Object DetectionDUTS-TEMAE0.036M3Net-R
RGB Salient Object DetectionDUTS-TES-Measure0.897M3Net-R
RGB Salient Object DetectionDUTS-TEWeighted F-Measure0.849M3Net-R
RGB Salient Object DetectionDUT-OMRONMAE0.045M3Net-S
RGB Salient Object DetectionDUT-OMRONS-Measure0.872M3Net-S
RGB Salient Object DetectionDUT-OMRONWeighted F-Measure0.811M3Net-S
RGB Salient Object DetectionDUT-OMRONMAE0.061M3Net-R
RGB Salient Object DetectionDUT-OMRONS-Measure0.848M3Net-R
RGB Salient Object DetectionDUT-OMRONWeighted F-Measure0.769M3Net-R
2D ClassificationECSSDMAE0.021M3Net-S
2D ClassificationECSSDS-Measure0.948M3Net-S
2D ClassificationECSSDWeighted F-Measure0.947M3Net-S
2D ClassificationECSSDMAE0.029M3Net-R
2D ClassificationECSSDS-Measure0.931M3Net-R
2D ClassificationECSSDWeighted F-Measure0.919M3Net-R
2D ClassificationPASCAL-SMAE0.047M3Net-S
2D ClassificationPASCAL-SS-Measure0.889M3Net-S
2D ClassificationPASCAL-SWeighted F-Measure0.864M3Net-S
2D ClassificationPASCAL-SMAE0.06M3Net-R
2D ClassificationPASCAL-SS-Measure0.868M3Net-R
2D ClassificationPASCAL-SWeighted F-Measure0.827M3Net-R
2D ClassificationHKU-ISMAE0.019M3Net-S
2D ClassificationHKU-ISS-Measure0.943M3Net-S
2D ClassificationHKU-ISWeighted F-Measure0.937M3Net-S
2D ClassificationHKU-ISMAE0.026M3Net-R
2D ClassificationHKU-ISS-Measure0.929M3Net-R
2D ClassificationHKU-ISWeighted F-Measure0.913M3Net-R
2D ClassificationDUTS-TEMAE0.024M3Net-S
2D ClassificationDUTS-TES-Measure0.927M3Net-S
2D ClassificationDUTS-TEWeighted F-Measure0.902M3Net-S
2D ClassificationDUTS-TEMAE0.036M3Net-R
2D ClassificationDUTS-TES-Measure0.897M3Net-R
2D ClassificationDUTS-TEWeighted F-Measure0.849M3Net-R
2D ClassificationDUT-OMRONMAE0.045M3Net-S
2D ClassificationDUT-OMRONS-Measure0.872M3Net-S
2D ClassificationDUT-OMRONWeighted F-Measure0.811M3Net-S
2D ClassificationDUT-OMRONMAE0.061M3Net-R
2D ClassificationDUT-OMRONS-Measure0.848M3Net-R
2D ClassificationDUT-OMRONWeighted F-Measure0.769M3Net-R
2D Object DetectionECSSDMAE0.021M3Net-S
2D Object DetectionECSSDS-Measure0.948M3Net-S
2D Object DetectionECSSDWeighted F-Measure0.947M3Net-S
2D Object DetectionECSSDMAE0.029M3Net-R
2D Object DetectionECSSDS-Measure0.931M3Net-R
2D Object DetectionECSSDWeighted F-Measure0.919M3Net-R
2D Object DetectionPASCAL-SMAE0.047M3Net-S
2D Object DetectionPASCAL-SS-Measure0.889M3Net-S
2D Object DetectionPASCAL-SWeighted F-Measure0.864M3Net-S
2D Object DetectionPASCAL-SMAE0.06M3Net-R
2D Object DetectionPASCAL-SS-Measure0.868M3Net-R
2D Object DetectionPASCAL-SWeighted F-Measure0.827M3Net-R
2D Object DetectionHKU-ISMAE0.019M3Net-S
2D Object DetectionHKU-ISS-Measure0.943M3Net-S
2D Object DetectionHKU-ISWeighted F-Measure0.937M3Net-S
2D Object DetectionHKU-ISMAE0.026M3Net-R
2D Object DetectionHKU-ISS-Measure0.929M3Net-R
2D Object DetectionHKU-ISWeighted F-Measure0.913M3Net-R
2D Object DetectionDUTS-TEMAE0.024M3Net-S
2D Object DetectionDUTS-TES-Measure0.927M3Net-S
2D Object DetectionDUTS-TEWeighted F-Measure0.902M3Net-S
2D Object DetectionDUTS-TEMAE0.036M3Net-R
2D Object DetectionDUTS-TES-Measure0.897M3Net-R
2D Object DetectionDUTS-TEWeighted F-Measure0.849M3Net-R
2D Object DetectionDUT-OMRONMAE0.045M3Net-S
2D Object DetectionDUT-OMRONS-Measure0.872M3Net-S
2D Object DetectionDUT-OMRONWeighted F-Measure0.811M3Net-S
2D Object DetectionDUT-OMRONMAE0.061M3Net-R
2D Object DetectionDUT-OMRONS-Measure0.848M3Net-R
2D Object DetectionDUT-OMRONWeighted F-Measure0.769M3Net-R
16kECSSDMAE0.021M3Net-S
16kECSSDS-Measure0.948M3Net-S
16kECSSDWeighted F-Measure0.947M3Net-S
16kECSSDMAE0.029M3Net-R
16kECSSDS-Measure0.931M3Net-R
16kECSSDWeighted F-Measure0.919M3Net-R
16kPASCAL-SMAE0.047M3Net-S
16kPASCAL-SS-Measure0.889M3Net-S
16kPASCAL-SWeighted F-Measure0.864M3Net-S
16kPASCAL-SMAE0.06M3Net-R
16kPASCAL-SS-Measure0.868M3Net-R
16kPASCAL-SWeighted F-Measure0.827M3Net-R
16kHKU-ISMAE0.019M3Net-S
16kHKU-ISS-Measure0.943M3Net-S
16kHKU-ISWeighted F-Measure0.937M3Net-S
16kHKU-ISMAE0.026M3Net-R
16kHKU-ISS-Measure0.929M3Net-R
16kHKU-ISWeighted F-Measure0.913M3Net-R
16kDUTS-TEMAE0.024M3Net-S
16kDUTS-TES-Measure0.927M3Net-S
16kDUTS-TEWeighted F-Measure0.902M3Net-S
16kDUTS-TEMAE0.036M3Net-R
16kDUTS-TES-Measure0.897M3Net-R
16kDUTS-TEWeighted F-Measure0.849M3Net-R
16kDUT-OMRONMAE0.045M3Net-S
16kDUT-OMRONS-Measure0.872M3Net-S
16kDUT-OMRONWeighted F-Measure0.811M3Net-S
16kDUT-OMRONMAE0.061M3Net-R
16kDUT-OMRONS-Measure0.848M3Net-R
16kDUT-OMRONWeighted F-Measure0.769M3Net-R

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07