TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ES6D: A Computation Efficient and Symmetry-Aware 6D Pose R...

ES6D: A Computation Efficient and Symmetry-Aware 6D Pose Regression Framework

Ningkai Mo, Wanshui Gan, Naoto Yokoya, Shifeng Chen

2022-04-03CVPR 2022 1regression3D Object Detection6D Pose Estimation
PaperPDFCode(official)

Abstract

In this paper, a computation efficient regression framework is presented for estimating the 6D pose of rigid objects from a single RGB-D image, which is applicable to handling symmetric objects. This framework is designed in a simple architecture that efficiently extracts point-wise features from RGB-D data using a fully convolutional network, called XYZNet, and directly regresses the 6D pose without any post refinement. In the case of symmetric object, one object has multiple ground-truth poses, and this one-to-many relationship may lead to estimation ambiguity. In order to solve this ambiguity problem, we design a symmetry-invariant pose distance metric, called average (maximum) grouped primitives distance or A(M)GPD. The proposed A(M)GPD loss can make the regression network converge to the correct state, i.e., all minima in the A(M)GPD loss surface are mapped to the correct poses. Extensive experiments on YCB-Video and T-LESS datasets demonstrate the proposed framework's substantially superior performance in top accuracy and low computational cost.

Results

TaskDatasetMetricValueModel
Pose EstimationDTTD-MobileADD AUC13.25ES6D
Pose EstimationDTTD-MobileADD-S AUC37.38ES6D
Object DetectionDTTD-MobileADD AUC13.25ES6D
Object DetectionDTTD-MobileADD-S AUC37.38ES6D
3DDTTD-MobileADD AUC13.25ES6D
3DDTTD-MobileADD-S AUC37.38ES6D
3DDTTD-MobileADD AUC13.25ES6D
3DDTTD-MobileADD-S AUC37.38ES6D
3D Object DetectionDTTD-MobileADD AUC13.25ES6D
3D Object DetectionDTTD-MobileADD-S AUC37.38ES6D
6D Pose EstimationDTTD-MobileADD AUC13.25ES6D
6D Pose EstimationDTTD-MobileADD-S AUC37.38ES6D
2D ClassificationDTTD-MobileADD AUC13.25ES6D
2D ClassificationDTTD-MobileADD-S AUC37.38ES6D
2D Object DetectionDTTD-MobileADD AUC13.25ES6D
2D Object DetectionDTTD-MobileADD-S AUC37.38ES6D
1 Image, 2*2 StitchiDTTD-MobileADD AUC13.25ES6D
1 Image, 2*2 StitchiDTTD-MobileADD-S AUC37.38ES6D
16kDTTD-MobileADD AUC13.25ES6D
16kDTTD-MobileADD-S AUC37.38ES6D

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15Bradley-Terry and Multi-Objective Reward Modeling Are Complementary2025-07-10SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations2025-07-08