TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Implicit 3D Orientation Learning for 6D Object Detection f...

Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel

2019-02-04ECCV 2018 9DenoisingPose Estimation6D Pose Estimation using RGBobject-detection6D Pose EstimationObject Detection
PaperPDFCode(official)

Abstract

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results.

Results

TaskDatasetMetricValueModel
Pose EstimationT-LESSMean Recall36.8Augmented Autoencoder
Pose EstimationLineMODMean ADD28.7Augmented Autoencoder
Pose EstimationT-LESSMean Recall72.76Augmented Autoencoder
Pose EstimationLineMODMean ADD64.67Augmented Autoencoder
3DT-LESSMean Recall36.8Augmented Autoencoder
3DLineMODMean ADD28.7Augmented Autoencoder
3DT-LESSMean Recall72.76Augmented Autoencoder
3DLineMODMean ADD64.67Augmented Autoencoder
1 Image, 2*2 StitchiT-LESSMean Recall36.8Augmented Autoencoder
1 Image, 2*2 StitchiLineMODMean ADD28.7Augmented Autoencoder
1 Image, 2*2 StitchiT-LESSMean Recall72.76Augmented Autoencoder
1 Image, 2*2 StitchiLineMODMean ADD64.67Augmented Autoencoder

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17