Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel

Published: 2019-02-04
Venue: ECCV 2018
Tasks: Denoising, Pose Estimation, 6D Pose Estimation using RGB, object-detection, 6D Pose Estimation, Object Detection

Abstract

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results.
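The "implicit representation of object orientations defined by samples in a latent space" can be pictured as a codebook lookup: latent codes of rendered reference views are stored together with their known rotations, and a test crop is assigned the rotation of its most similar code. The following is a minimal sketch under stated assumptions, not the authors' implementation. The use of cosine similarity, the function names `build_codebook` and `lookup_orientation`, the latent size of 128, and the random vectors standing in for encoder outputs are all illustrative choices that do not come from the abstract.

```python
import numpy as np


def build_codebook(latent_codes: np.ndarray) -> np.ndarray:
    """L2-normalise one latent code per reference view (one code per row)."""
    norms = np.linalg.norm(latent_codes, axis=1, keepdims=True)
    return latent_codes / np.clip(norms, 1e-12, None)


def lookup_orientation(query_code, codebook, rotations, k=1):
    """Return the k reference rotations whose codes best match the query."""
    q = query_code / max(np.linalg.norm(query_code), 1e-12)
    similarity = codebook @ q            # cosine similarity per reference view
    best = np.argsort(-similarity)[:k]   # indices of the k highest scores
    return rotations[best], similarity[best]


# Toy stand-ins (assumptions): random vectors replace real encoder outputs,
# and the reference rotations are simple rotations about the z-axis.
rng = np.random.default_rng(0)
num_views, latent_dim = 500, 128
codes = rng.normal(size=(num_views, latent_dim))

angles = np.linspace(0.0, 2.0 * np.pi, num_views, endpoint=False)
c, s = np.cos(angles), np.sin(angles)
zeros, ones = np.zeros_like(angles), np.ones_like(angles)
rotations = np.stack(
    [
        np.stack([c, -s, zeros], axis=1),
        np.stack([s, c, zeros], axis=1),
        np.stack([zeros, zeros, ones], axis=1),
    ],
    axis=1,
)  # shape (num_views, 3, 3)

codebook = build_codebook(codes)

# Query: a slightly perturbed copy of reference code 42.
query = codes[42] + 0.01 * rng.normal(size=latent_dim)
R_best, score = lookup_orientation(query, codebook, rotations, k=1)
```

Because the orientation is read off from the nearest stored sample rather than regressed directly, views that look identical under a symmetry simply map to nearby codes, which is one way to read the abstract's claim that symmetries are handled inherently.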

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	T-LESS	Mean Recall	36.8	Augmented Autoencoder
Pose Estimation	LineMOD	Mean ADD	28.7	Augmented Autoencoder
Pose Estimation	T-LESS	Mean Recall	72.76	Augmented Autoencoder
Pose Estimation	LineMOD	Mean ADD	64.67	Augmented Autoencoder
