TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CAD-Recode: Reverse Engineering CAD Code from Point Clouds

CAD-Recode: Reverse Engineering CAD Code from Point Clouds

Danila Rukhovich, Elona Dupont, Dimitrios Mallis, Kseniya Cherenkova, Anis Kacem, Djamila Aouada

2024-12-18CAD ReconstructionQuestion Answering
PaperPDFCode(official)

Abstract

Computer-Aided Design (CAD) models are typically constructed by sequentially drawing parametric sketches and applying CAD operations to obtain a 3D model. The problem of 3D CAD reverse engineering consists of reconstructing the sketch and CAD operation sequences from 3D representations such as point clouds. In this paper, we address this challenge through novel contributions across three levels: CAD sequence representation, network design, and dataset. In particular, we represent CAD sketch-extrude sequences as Python code. The proposed CAD-Recode translates a point cloud into Python code that, when executed, reconstructs the CAD model. Taking advantage of the exposure of pre-trained Large Language Models (LLMs) to Python code, we leverage a relatively small LLM as a decoder for CAD-Recode and combine it with a lightweight point cloud projector. CAD-Recode is trained solely on a proposed synthetic dataset of one million diverse CAD sequences. CAD-Recode significantly outperforms existing methods across three datasets while requiring fewer input points. Notably, it achieves 10 times lower mean Chamfer distance than state-of-the-art methods on DeepCAD and Fusion360 datasets. Furthermore, we show that our CAD Python code output is interpretable by off-the-shelf LLMs, enabling CAD editing and CAD-specific question answering from point clouds.

Results

TaskDatasetMetricValueModel
Object ReconstructionFusion 360 GalleryChamfer Distance1.21CAD-Recode
Object ReconstructionFusion 360 GalleryChamfer Distance (median)0.19CAD-Recode
Object ReconstructionFusion 360 GalleryInvalid Ratio5CAD-Recode
Object ReconstructionFusion 360 GalleryIoU79.1CAD-Recode
Object ReconstructionDeepCADCamfer Distance (median)0.18CAD-Recode
Object ReconstructionDeepCADChamfer Distance0.83CAD-Recode
Object ReconstructionDeepCADInvalidi Ratio3.1CAD-Recode
Object ReconstructionDeepCADIoU87.1CAD-Recode
Object ReconstructionCC3DChamfer Distance3.21CAD-Recode
Object ReconstructionCC3DChamfer Distance (median)0.54CAD-Recode
Object ReconstructionCC3DInvalid Ratio9.8CAD-Recode
Object ReconstructionCC3DIoU60.5CAD-Recode
3D Object ReconstructionFusion 360 GalleryChamfer Distance1.21CAD-Recode
3D Object ReconstructionFusion 360 GalleryChamfer Distance (median)0.19CAD-Recode
3D Object ReconstructionFusion 360 GalleryInvalid Ratio5CAD-Recode
3D Object ReconstructionFusion 360 GalleryIoU79.1CAD-Recode
3D Object ReconstructionDeepCADCamfer Distance (median)0.18CAD-Recode
3D Object ReconstructionDeepCADChamfer Distance0.83CAD-Recode
3D Object ReconstructionDeepCADInvalidi Ratio3.1CAD-Recode
3D Object ReconstructionDeepCADIoU87.1CAD-Recode
3D Object ReconstructionCC3DChamfer Distance3.21CAD-Recode
3D Object ReconstructionCC3DChamfer Distance (median)0.54CAD-Recode
3D Object ReconstructionCC3DInvalid Ratio9.8CAD-Recode
3D Object ReconstructionCC3DIoU60.5CAD-Recode

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09