TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Closest Point: Learning Representations for Point Clo...

Deep Closest Point: Learning Representations for Point Cloud Registration

Yue Wang, Justin M. Solomon

2019-05-08ICCV 2019 10Visual LocalizationPoint Cloud Registration
PaperPDFCodeCode(official)CodeCode

Abstract

Point cloud registration is a key problem for computer vision applied to robotics, medical imaging, and other applications. This problem involves finding a rigid transformation from one point cloud into another so that they align. Iterative Closest Point (ICP) and its variants provide simple and easily-implemented iterative methods for this task, but these algorithms can converge to spurious local optima. To address local optima and other difficulties in the ICP pipeline, we propose a learning-based method, titled Deep Closest Point (DCP), inspired by recent techniques in computer vision and natural language processing. Our model consists of three parts: a point cloud embedding network, an attention-based module combined with a pointer generation layer, to approximate combinatorial matching, and a differentiable singular value decomposition (SVD) layer to extract the final rigid transformation. We train our model end-to-end on the ModelNet40 dataset and show in several settings that it performs better than ICP, its variants (e.g., Go-ICP, FGR), and the recently-proposed learning-based method PointNetLK. Beyond providing a state-of-the-art registration technique, we evaluate the suitability of our learned features transferred to unseen objects. We also provide preliminary analysis of our learned model to help understand whether domain-specific and/or global features facilitate rigid registration.

Results

TaskDatasetMetricValueModel
Visual LocalizationOxford Radar RobotCar (Full-6)Mean Translation Error18.45DCP
Point Cloud Registration3DMatch (at least 30% overlapped - FCGF setting)Recall (0.3m, 15 degrees)3.22DCP
3D Point Cloud Interpolation3DMatch (at least 30% overlapped - FCGF setting)Recall (0.3m, 15 degrees)3.22DCP

Related Papers

A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning2025-07-16Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR2025-07-07MatChA: Cross-Algorithm Matching with Feature Augmentation2025-06-27OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography2025-06-26CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection2025-06-26Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles2025-06-18