TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MaskedFusion: Mask-based 6D Object Pose Estimation

MaskedFusion: Mask-based 6D Object Pose Estimation

Nuno Pereira, Luís A. Alexandre

2019-11-18Pose Estimation6D Pose Estimation using RGBD6D Pose Estimation
PaperPDFCode(official)

Abstract

MaskedFusion is a framework to estimate the 6D pose of objects using RGB-D data, with an architecture that leverages multiple sub-tasks in a pipeline to achieve accurate 6D poses. 6D pose estimation is an open challenge due to complex world objects and many possible problems when capturing data from the real world, e.g., occlusions, truncations, and noise in the data. Achieving accurate 6D poses will improve results in other open problems like robot grasping or positioning objects in augmented reality. MaskedFusion improves the state-of-the-art by using object masks to eliminate non-relevant data. With the inclusion of the masks on the neural network that estimates the 6D pose of an object we also have features that represent the object shape. MaskedFusion is a modular pipeline where each sub-task can have different methods that achieve the objective. MaskedFusion achieved 97.3% on average using the ADD metric on the LineMOD dataset and 93.3% using the ADD-S AUC metric on YCB-Video Dataset, which is an improvement, compared to the state-of-the-art methods. The code is available on GitHub (https://github.com/kroglice/MaskedFusion).

Results

TaskDatasetMetricValueModel
Pose EstimationYCB-VideoADDS AUC93.3MaskedFusion
Pose EstimationLineMODAccuracy (ADD)97.8MaskedFusion
Pose EstimationYCB-VideoMean ADD93.3MaskedFusion
Pose EstimationYCB-VideoMean ADD-S93.3MaskedFusion
Pose EstimationLineMODMean ADD97.8MaskedFusion
3DYCB-VideoADDS AUC93.3MaskedFusion
3DLineMODAccuracy (ADD)97.8MaskedFusion
3DYCB-VideoMean ADD93.3MaskedFusion
3DYCB-VideoMean ADD-S93.3MaskedFusion
3DLineMODMean ADD97.8MaskedFusion
6D Pose EstimationYCB-VideoADDS AUC93.3MaskedFusion
6D Pose EstimationLineMODAccuracy (ADD)97.8MaskedFusion
1 Image, 2*2 StitchiYCB-VideoADDS AUC93.3MaskedFusion
1 Image, 2*2 StitchiLineMODAccuracy (ADD)97.8MaskedFusion
1 Image, 2*2 StitchiYCB-VideoMean ADD93.3MaskedFusion
1 Image, 2*2 StitchiYCB-VideoMean ADD-S93.3MaskedFusion
1 Image, 2*2 StitchiLineMODMean ADD97.8MaskedFusion

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16