TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Few-Shot Object Detection and Viewpoint Estimation for Obj...

Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild

Yang Xiao, Vincent Lepetit, Renaud Marlet

2020-07-23ECCV 2020 8Meta-LearningFew-Shot Object DetectionScene UnderstandingViewpoint Estimationobject-detectionObject Detection
PaperPDFCodeCode

Abstract

Detecting objects and estimating their viewpoints in images are key tasks of 3D scene understanding. Recent approaches have achieved excellent results on very large benchmarks for object detection and viewpoint estimation. However, performances are still lagging behind for novel object categories with few samples. In this paper, we tackle the problems of few-shot object detection and few-shot viewpoint estimation. We demonstrate on both tasks the benefits of guiding the network prediction with class-representative features extracted from data in different modalities: image patches for object detection, and aligned 3D models for viewpoint estimation. Despite its simplicity, our method outperforms state-of-the-art methods by a large margin on a range of datasets, including PASCAL and COCO for few-shot object detection, and Pascal3D+ and ObjectNet3D for few-shot viewpoint estimation. Furthermore, when the 3D model is not available, we introduce a simple category-agnostic viewpoint estimation method by exploiting geometrical similarities and consistent pose labelling across different classes. While it moderately reduces performance, this approach still obtains better results than previous methods in this setting. Last, for the first time, we tackle the combination of both few-shot tasks, on three challenging benchmarks for viewpoint estimation in the wild, ObjectNet3D, Pascal3D+ and Pix3D, showing very promising results.

Results

TaskDatasetMetricValueModel
Object DetectionMS-COCO (30-shot)AP14.7FsDetView
Object DetectionMS-COCO (10-shot)AP12.5FSDetView
3DMS-COCO (30-shot)AP14.7FsDetView
3DMS-COCO (10-shot)AP12.5FSDetView
Few-Shot Object DetectionMS-COCO (30-shot)AP14.7FsDetView
Few-Shot Object DetectionMS-COCO (10-shot)AP12.5FSDetView
2D ClassificationMS-COCO (30-shot)AP14.7FsDetView
2D ClassificationMS-COCO (10-shot)AP12.5FSDetView
2D Object DetectionMS-COCO (30-shot)AP14.7FsDetView
2D Object DetectionMS-COCO (10-shot)AP12.5FSDetView
16kMS-COCO (30-shot)AP14.7FsDetView
16kMS-COCO (10-shot)AP12.5FSDetView

Related Papers

Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16