TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Center Direction Network for Grasping Point Localization o...

Center Direction Network for Grasping Point Localization on Cloths

Domen Tabernik, Jon Muhovič, Matej Urbas, Danijel Skočaj

2024-08-26Keypoint Detection
PaperPDFCode(official)

Abstract

Object grasping is a fundamental challenge in robotics and computer vision, critical for advancing robotic manipulation capabilities. Deformable objects, like fabrics and cloths, pose additional challenges due to their non-rigid nature. In this work, we introduce CeDiRNet-3DoF, a deep-learning model for grasp point detection, with a particular focus on cloth objects. CeDiRNet-3DoF employs center direction regression alongside a localization network, attaining first place in the perception task of ICRA 2023's Cloth Manipulation Challenge. Recognizing the lack of standardized benchmarks in the literature that hinder effective method comparison, we present the ViCoS Towel Dataset. This extensive benchmark dataset comprises 8,000 real and 12,000 synthetic images, serving as a robust resource for training and evaluating contemporary data-driven deep-learning approaches. Extensive evaluation revealed CeDiRNet-3DoF's robustness in real-world performance, outperforming state-of-the-art methods, including the latest transformer-based models. Our work bridges a crucial gap, offering a robust solution and benchmark for cloth grasping in computer vision and robotics. Code and dataset are available at: https://github.com/vicoslab/CeDiRNet-3DoF

Results

TaskDatasetMetricValueModel
Pose EstimationViCoS Towel DatasetBest F181.4CeDiRNet-3DoF - RGB-D (ConvNext-B)
Pose EstimationViCoS Towel DatasetBest F180.8CeDiRNet-3DoF - RGB-D (ConvNext-L)
Pose EstimationViCoS Towel DatasetBest F178.4CeDiRNet-3DoF - RGB (ConvNext-L)
Pose EstimationViCoS Towel DatasetBest F178CeDiRNet-3DoF - RGB (ConvNext-B)
Pose EstimationViCoS Towel DatasetBest F172.7DINO - RGB (ConvNetx-B)
Pose EstimationViCoS Towel DatasetBest F168.3MaskRCNN - RGB (ResNext101)
Pose EstimationViCoS Towel DatasetBest F165.7Lisp et al. - RGB (ConvNetx-B)
Pose EstimationViCoS Towel DatasetBest F161.2DeformDETR - RGB (ConvNetx-B)
Pose EstimationViCoS Towel DatasetBest F148.3YOLOv7 - RGB
3DViCoS Towel DatasetBest F181.4CeDiRNet-3DoF - RGB-D (ConvNext-B)
3DViCoS Towel DatasetBest F180.8CeDiRNet-3DoF - RGB-D (ConvNext-L)
3DViCoS Towel DatasetBest F178.4CeDiRNet-3DoF - RGB (ConvNext-L)
3DViCoS Towel DatasetBest F178CeDiRNet-3DoF - RGB (ConvNext-B)
3DViCoS Towel DatasetBest F172.7DINO - RGB (ConvNetx-B)
3DViCoS Towel DatasetBest F168.3MaskRCNN - RGB (ResNext101)
3DViCoS Towel DatasetBest F165.7Lisp et al. - RGB (ConvNetx-B)
3DViCoS Towel DatasetBest F161.2DeformDETR - RGB (ConvNetx-B)
3DViCoS Towel DatasetBest F148.3YOLOv7 - RGB
1 Image, 2*2 StitchiViCoS Towel DatasetBest F181.4CeDiRNet-3DoF - RGB-D (ConvNext-B)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F180.8CeDiRNet-3DoF - RGB-D (ConvNext-L)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F178.4CeDiRNet-3DoF - RGB (ConvNext-L)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F178CeDiRNet-3DoF - RGB (ConvNext-B)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F172.7DINO - RGB (ConvNetx-B)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F168.3MaskRCNN - RGB (ResNext101)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F165.7Lisp et al. - RGB (ConvNetx-B)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F161.2DeformDETR - RGB (ConvNetx-B)
1 Image, 2*2 StitchiViCoS Towel DatasetBest F148.3YOLOv7 - RGB

Related Papers

KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model2025-07-15GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft2025-07-15FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching2025-07-14Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10Reading a Ruler in the Wild2025-07-09MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning2025-07-09TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning2025-05-29Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU2025-05-24