Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection

Jiancheng Pan, Yanxing Liu, Xiao He, Long Peng, Jiahao Li, Yuze Sun, Xiaomeng Huang

2025-04-06Few-Shot Object Detection Image Augmentation Data Augmentation Navigate Domain Generalization Cross-Domain Few-Shot object-detection Cross-Domain Few-Shot Object Detection Object Detection

Paper PDF Code(official)

Abstract

Foundation models pretrained on extensive datasets, such as GroundingDINO and LAE-DINO, have performed remarkably in the cross-domain few-shot object detection (CD-FSOD) task. Through rigorous few-shot training, we found that the integration of image-based data augmentation techniques and grid-based sub-domain search strategy significantly enhances the performance of these foundation models. Building upon GroundingDINO, we employed several widely used image augmentation methods and established optimization objectives to effectively navigate the expansive domain space in search of optimal sub-domains. This approach facilitates efficient few-shot object detection and introduces an approach to solving the CD-FSOD problem by efficiently searching for the optimal parameter configuration from the foundation model. Our findings substantially advance the practical deployment of vision-language models in data-scarce environments, offering critical insights into optimizing their cross-domain generalization capabilities without labor-intensive retraining. Code is available at https://github.com/jaychempan/ETS.

Results

Task	Dataset	Metric	Value	Model
Object Detection	Artaxor	mAP	71.2	ETS
Object Detection	NEU-DET	mAP	26.1	ETS
Object Detection	DIOR	mAP	37.5	ETS
Object Detection	Clipark1k	mAP	61.5	ETS
Object Detection	DeepFish	mAP	44.1	ETS
Object Detection	UODD	mAP	29.8	ETS
3D	Artaxor	mAP	71.2	ETS
3D	NEU-DET	mAP	26.1	ETS
3D	DIOR	mAP	37.5	ETS
3D	Clipark1k	mAP	61.5	ETS
3D	DeepFish	mAP	44.1	ETS
3D	UODD	mAP	29.8	ETS
Few-Shot Object Detection	Artaxor	mAP	71.2	ETS
Few-Shot Object Detection	NEU-DET	mAP	26.1	ETS
Few-Shot Object Detection	DIOR	mAP	37.5	ETS
Few-Shot Object Detection	Clipark1k	mAP	61.5	ETS
Few-Shot Object Detection	DeepFish	mAP	44.1	ETS
Few-Shot Object Detection	UODD	mAP	29.8	ETS
2D Classification	Artaxor	mAP	71.2	ETS
2D Classification	NEU-DET	mAP	26.1	ETS
2D Classification	DIOR	mAP	37.5	ETS
2D Classification	Clipark1k	mAP	61.5	ETS
2D Classification	DeepFish	mAP	44.1	ETS
2D Classification	UODD	mAP	29.8	ETS
2D Object Detection	Artaxor	mAP	71.2	ETS
2D Object Detection	NEU-DET	mAP	26.1	ETS
2D Object Detection	DIOR	mAP	37.5	ETS
2D Object Detection	Clipark1k	mAP	61.5	ETS
2D Object Detection	DeepFish	mAP	44.1	ETS
2D Object Detection	UODD	mAP	29.8	ETS
16k	Artaxor	mAP	71.2	ETS
16k	NEU-DET	mAP	26.1	ETS
16k	DIOR	mAP	37.5	ETS
16k	Clipark1k	mAP	61.5	ETS
16k	DeepFish	mAP	44.1	ETS
16k	UODD	mAP	29.8	ETS

Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection

Abstract

Results

Related Papers

Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection

Abstract

Results

Related Papers