TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Salience DETR: Enhancing Detection Transformer with Hierar...

Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen

2024-03-24CVPR 2024 1Dense Object Detection2D Object DetectionObject Detection
PaperPDFCode(official)CodeCode

Abstract

DETR-like methods have significantly increased detection performance in an end-to-end manner. The mainstream two-stage frameworks of them perform dense self-attention and select a fraction of queries for sparse cross-attention, which is proven effective for improving performance but also introduces a heavy computational burden and high dependence on stable query selection. This paper demonstrates that suboptimal two-stage selection strategies result in scale bias and redundancy due to the mismatch between selected queries and objects in two-stage initialization. To address these issues, we propose hierarchical salience filtering refinement, which performs transformer encoding only on filtered discriminative queries, for a better trade-off between computational efficiency and precision. The filtering process overcomes scale bias through a novel scale-independent salience supervision. To compensate for the semantic misalignment among queries, we introduce elaborate query refinement modules for stable two-stage initialization. Based on above improvements, the proposed Salience DETR achieves significant improvements of +4.0% AP, +0.2% AP, +4.4% AP on three challenging task-specific detection datasets, as well as 49.2% AP on COCO 2017 with less FLOPs. The code is available at https://github.com/xiuqhou/Salience-DETR.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO 2017 valAP57.3Salience-DETR (Focal-L 1x)
Object DetectionCOCO 2017 valAP5075.5Salience-DETR (Focal-L 1x)
Object DetectionCOCO 2017 valAP7562.3Salience-DETR (Focal-L 1x)
Object DetectionCOCO 2017 valAPL74.5Salience-DETR (Focal-L 1x)
Object DetectionCOCO 2017 valAPM61.8Salience-DETR (Focal-L 1x)
Object DetectionCOCO 2017 valAPS40.9Salience-DETR (Focal-L 1x)
Object DetectionCOCO 2017 valAP56.5Salience-DETR (Swin-L 1x)
Object DetectionCOCO 2017 valAP5075Salience-DETR (Swin-L 1x)
Object DetectionCOCO 2017 valAP7561.5Salience-DETR (Swin-L 1x)
Object DetectionCOCO 2017 valAPL72.8Salience-DETR (Swin-L 1x)
Object DetectionCOCO 2017 valAPM61.2Salience-DETR (Swin-L 1x)
Object DetectionCOCO 2017 valAPS40.2Salience-DETR (Swin-L 1x)
Object DetectionCOCO 2017 valAP50Salience-DETR (ResNet50 1x)
Object DetectionCOCO 2017 valAP5067.7Salience-DETR (ResNet50 1x)
Object DetectionCOCO 2017 valAP7554.2Salience-DETR (ResNet50 1x)
Object DetectionCOCO 2017 valAPL64.4Salience-DETR (ResNet50 1x)
Object DetectionCOCO 2017 valAPM54.4Salience-DETR (ResNet50 1x)
Object DetectionCOCO 2017 valAPS33.3Salience-DETR (ResNet50 1x)
3DCOCO 2017 valAP57.3Salience-DETR (Focal-L 1x)
3DCOCO 2017 valAP5075.5Salience-DETR (Focal-L 1x)
3DCOCO 2017 valAP7562.3Salience-DETR (Focal-L 1x)
3DCOCO 2017 valAPL74.5Salience-DETR (Focal-L 1x)
3DCOCO 2017 valAPM61.8Salience-DETR (Focal-L 1x)
3DCOCO 2017 valAPS40.9Salience-DETR (Focal-L 1x)
3DCOCO 2017 valAP56.5Salience-DETR (Swin-L 1x)
3DCOCO 2017 valAP5075Salience-DETR (Swin-L 1x)
3DCOCO 2017 valAP7561.5Salience-DETR (Swin-L 1x)
3DCOCO 2017 valAPL72.8Salience-DETR (Swin-L 1x)
3DCOCO 2017 valAPM61.2Salience-DETR (Swin-L 1x)
3DCOCO 2017 valAPS40.2Salience-DETR (Swin-L 1x)
3DCOCO 2017 valAP50Salience-DETR (ResNet50 1x)
3DCOCO 2017 valAP5067.7Salience-DETR (ResNet50 1x)
3DCOCO 2017 valAP7554.2Salience-DETR (ResNet50 1x)
3DCOCO 2017 valAPL64.4Salience-DETR (ResNet50 1x)
3DCOCO 2017 valAPM54.4Salience-DETR (ResNet50 1x)
3DCOCO 2017 valAPS33.3Salience-DETR (ResNet50 1x)
2D ClassificationCOCO 2017 valAP57.3Salience-DETR (Focal-L 1x)
2D ClassificationCOCO 2017 valAP5075.5Salience-DETR (Focal-L 1x)
2D ClassificationCOCO 2017 valAP7562.3Salience-DETR (Focal-L 1x)
2D ClassificationCOCO 2017 valAPL74.5Salience-DETR (Focal-L 1x)
2D ClassificationCOCO 2017 valAPM61.8Salience-DETR (Focal-L 1x)
2D ClassificationCOCO 2017 valAPS40.9Salience-DETR (Focal-L 1x)
2D ClassificationCOCO 2017 valAP56.5Salience-DETR (Swin-L 1x)
2D ClassificationCOCO 2017 valAP5075Salience-DETR (Swin-L 1x)
2D ClassificationCOCO 2017 valAP7561.5Salience-DETR (Swin-L 1x)
2D ClassificationCOCO 2017 valAPL72.8Salience-DETR (Swin-L 1x)
2D ClassificationCOCO 2017 valAPM61.2Salience-DETR (Swin-L 1x)
2D ClassificationCOCO 2017 valAPS40.2Salience-DETR (Swin-L 1x)
2D ClassificationCOCO 2017 valAP50Salience-DETR (ResNet50 1x)
2D ClassificationCOCO 2017 valAP5067.7Salience-DETR (ResNet50 1x)
2D ClassificationCOCO 2017 valAP7554.2Salience-DETR (ResNet50 1x)
2D ClassificationCOCO 2017 valAPL64.4Salience-DETR (ResNet50 1x)
2D ClassificationCOCO 2017 valAPM54.4Salience-DETR (ResNet50 1x)
2D ClassificationCOCO 2017 valAPS33.3Salience-DETR (ResNet50 1x)
2D Object DetectionCOCO 2017 valAP57.3Salience-DETR (Focal-L 1x)
2D Object DetectionCOCO 2017 valAP5075.5Salience-DETR (Focal-L 1x)
2D Object DetectionCOCO 2017 valAP7562.3Salience-DETR (Focal-L 1x)
2D Object DetectionCOCO 2017 valAPL74.5Salience-DETR (Focal-L 1x)
2D Object DetectionCOCO 2017 valAPM61.8Salience-DETR (Focal-L 1x)
2D Object DetectionCOCO 2017 valAPS40.9Salience-DETR (Focal-L 1x)
2D Object DetectionCOCO 2017 valAP56.5Salience-DETR (Swin-L 1x)
2D Object DetectionCOCO 2017 valAP5075Salience-DETR (Swin-L 1x)
2D Object DetectionCOCO 2017 valAP7561.5Salience-DETR (Swin-L 1x)
2D Object DetectionCOCO 2017 valAPL72.8Salience-DETR (Swin-L 1x)
2D Object DetectionCOCO 2017 valAPM61.2Salience-DETR (Swin-L 1x)
2D Object DetectionCOCO 2017 valAPS40.2Salience-DETR (Swin-L 1x)
2D Object DetectionCOCO 2017 valAP50Salience-DETR (ResNet50 1x)
2D Object DetectionCOCO 2017 valAP5067.7Salience-DETR (ResNet50 1x)
2D Object DetectionCOCO 2017 valAP7554.2Salience-DETR (ResNet50 1x)
2D Object DetectionCOCO 2017 valAPL64.4Salience-DETR (ResNet50 1x)
2D Object DetectionCOCO 2017 valAPM54.4Salience-DETR (ResNet50 1x)
2D Object DetectionCOCO 2017 valAPS33.3Salience-DETR (ResNet50 1x)
16kCOCO 2017 valAP57.3Salience-DETR (Focal-L 1x)
16kCOCO 2017 valAP5075.5Salience-DETR (Focal-L 1x)
16kCOCO 2017 valAP7562.3Salience-DETR (Focal-L 1x)
16kCOCO 2017 valAPL74.5Salience-DETR (Focal-L 1x)
16kCOCO 2017 valAPM61.8Salience-DETR (Focal-L 1x)
16kCOCO 2017 valAPS40.9Salience-DETR (Focal-L 1x)
16kCOCO 2017 valAP56.5Salience-DETR (Swin-L 1x)
16kCOCO 2017 valAP5075Salience-DETR (Swin-L 1x)
16kCOCO 2017 valAP7561.5Salience-DETR (Swin-L 1x)
16kCOCO 2017 valAPL72.8Salience-DETR (Swin-L 1x)
16kCOCO 2017 valAPM61.2Salience-DETR (Swin-L 1x)
16kCOCO 2017 valAPS40.2Salience-DETR (Swin-L 1x)
16kCOCO 2017 valAP50Salience-DETR (ResNet50 1x)
16kCOCO 2017 valAP5067.7Salience-DETR (ResNet50 1x)
16kCOCO 2017 valAP7554.2Salience-DETR (ResNet50 1x)
16kCOCO 2017 valAPL64.4Salience-DETR (ResNet50 1x)
16kCOCO 2017 valAPM54.4Salience-DETR (ResNet50 1x)
16kCOCO 2017 valAPS33.3Salience-DETR (ResNet50 1x)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07