TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CDFormer: Cross-Domain Few-Shot Object Detection Transform...

CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion

Boyuan Meng, Xiaohan Zhang, Peilin Li, Zhe Wu, Yiming Li, Wenkai Zhao, Beinan Yu, Hui-Liang Shen

2025-05-02Few-Shot Object DetectionCross-Domain Few-Shotobject-detectionCross-Domain Few-Shot Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Cross-domain few-shot object detection (CD-FSOD) aims to detect novel objects across different domains with limited class instances. Feature confusion, including object-background confusion and object-object confusion, presents significant challenges in both cross-domain and few-shot settings. In this work, we introduce CDFormer, a cross-domain few-shot object detection transformer against feature confusion, to address these challenges. The method specifically tackles feature confusion through two key modules: object-background distinguishing (OBD) and object-object distinguishing (OOD). The OBD module leverages a learnable background token to differentiate between objects and background, while the OOD module enhances the distinction between objects of different classes. Experimental results demonstrate that CDFormer outperforms previous state-of-the-art approaches, achieving 12.9% mAP, 11.0% mAP, and 10.4% mAP improvements under the 1/5/10 shot settings, respectively, when fine-tuned.

Results

TaskDatasetMetricValueModel
Object DetectionArtaxor mAP68.7CDFormer(w/FT)
Object DetectionArtaxor mAP37.3CDFormer(w/o FT)
Object DetectionNEU-DETmAP18.1CDFormer(w/FT)
Object DetectionNEU-DETmAP4CDFormer(w/o FT)
Object DetectionDIORmAP32.5CDFormer(w/FT)
Object DetectionDIORmAP7.9CDFormer(w/o FT)
Object DetectionClipark1k mAP59CDFormer(w/FT)
Object DetectionClipark1k mAP53.5CDFormer(w/o FT)
Object DetectionDeepFishmAP35.5CDFormer(w/FT)
Object DetectionDeepFishmAP25.7CDFormer(w/o FT)
Object DetectionUODDmAP26.4CDFormer(w/FT)
Object DetectionUODDmAP16.7CDFormer(w/o FT)
3DArtaxor mAP68.7CDFormer(w/FT)
3DArtaxor mAP37.3CDFormer(w/o FT)
3DNEU-DETmAP18.1CDFormer(w/FT)
3DNEU-DETmAP4CDFormer(w/o FT)
3DDIORmAP32.5CDFormer(w/FT)
3DDIORmAP7.9CDFormer(w/o FT)
3DClipark1k mAP59CDFormer(w/FT)
3DClipark1k mAP53.5CDFormer(w/o FT)
3DDeepFishmAP35.5CDFormer(w/FT)
3DDeepFishmAP25.7CDFormer(w/o FT)
3DUODDmAP26.4CDFormer(w/FT)
3DUODDmAP16.7CDFormer(w/o FT)
Few-Shot Object DetectionArtaxor mAP68.7CDFormer(w/FT)
Few-Shot Object DetectionArtaxor mAP37.3CDFormer(w/o FT)
Few-Shot Object DetectionNEU-DETmAP18.1CDFormer(w/FT)
Few-Shot Object DetectionNEU-DETmAP4CDFormer(w/o FT)
Few-Shot Object DetectionDIORmAP32.5CDFormer(w/FT)
Few-Shot Object DetectionDIORmAP7.9CDFormer(w/o FT)
Few-Shot Object DetectionClipark1k mAP59CDFormer(w/FT)
Few-Shot Object DetectionClipark1k mAP53.5CDFormer(w/o FT)
Few-Shot Object DetectionDeepFishmAP35.5CDFormer(w/FT)
Few-Shot Object DetectionDeepFishmAP25.7CDFormer(w/o FT)
Few-Shot Object DetectionUODDmAP26.4CDFormer(w/FT)
Few-Shot Object DetectionUODDmAP16.7CDFormer(w/o FT)
2D ClassificationArtaxor mAP68.7CDFormer(w/FT)
2D ClassificationArtaxor mAP37.3CDFormer(w/o FT)
2D ClassificationNEU-DETmAP18.1CDFormer(w/FT)
2D ClassificationNEU-DETmAP4CDFormer(w/o FT)
2D ClassificationDIORmAP32.5CDFormer(w/FT)
2D ClassificationDIORmAP7.9CDFormer(w/o FT)
2D ClassificationClipark1k mAP59CDFormer(w/FT)
2D ClassificationClipark1k mAP53.5CDFormer(w/o FT)
2D ClassificationDeepFishmAP35.5CDFormer(w/FT)
2D ClassificationDeepFishmAP25.7CDFormer(w/o FT)
2D ClassificationUODDmAP26.4CDFormer(w/FT)
2D ClassificationUODDmAP16.7CDFormer(w/o FT)
2D Object DetectionArtaxor mAP68.7CDFormer(w/FT)
2D Object DetectionArtaxor mAP37.3CDFormer(w/o FT)
2D Object DetectionNEU-DETmAP18.1CDFormer(w/FT)
2D Object DetectionNEU-DETmAP4CDFormer(w/o FT)
2D Object DetectionDIORmAP32.5CDFormer(w/FT)
2D Object DetectionDIORmAP7.9CDFormer(w/o FT)
2D Object DetectionClipark1k mAP59CDFormer(w/FT)
2D Object DetectionClipark1k mAP53.5CDFormer(w/o FT)
2D Object DetectionDeepFishmAP35.5CDFormer(w/FT)
2D Object DetectionDeepFishmAP25.7CDFormer(w/o FT)
2D Object DetectionUODDmAP26.4CDFormer(w/FT)
2D Object DetectionUODDmAP16.7CDFormer(w/o FT)
16kArtaxor mAP68.7CDFormer(w/FT)
16kArtaxor mAP37.3CDFormer(w/o FT)
16kNEU-DETmAP18.1CDFormer(w/FT)
16kNEU-DETmAP4CDFormer(w/o FT)
16kDIORmAP32.5CDFormer(w/FT)
16kDIORmAP7.9CDFormer(w/o FT)
16kClipark1k mAP59CDFormer(w/FT)
16kClipark1k mAP53.5CDFormer(w/o FT)
16kDeepFishmAP35.5CDFormer(w/FT)
16kDeepFishmAP25.7CDFormer(w/o FT)
16kUODDmAP26.4CDFormer(w/FT)
16kUODDmAP16.7CDFormer(w/o FT)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07