Center Direction Network for Grasping Point Localization on Cloths

Domen Tabernik, Jon Muhovič, Matej Urbas, Danijel Skočaj

2024-08-26Keypoint Detection

Abstract

Object grasping is a fundamental challenge in robotics and computer vision, critical for advancing robotic manipulation capabilities. Deformable objects, like fabrics and cloths, pose additional challenges due to their non-rigid nature. In this work, we introduce CeDiRNet-3DoF, a deep-learning model for grasp point detection, with a particular focus on cloth objects. CeDiRNet-3DoF employs center direction regression alongside a localization network, attaining first place in the perception task of ICRA 2023's Cloth Manipulation Challenge. Recognizing the lack of standardized benchmarks in the literature that hinder effective method comparison, we present the ViCoS Towel Dataset. This extensive benchmark dataset comprises 8,000 real and 12,000 synthetic images, serving as a robust resource for training and evaluating contemporary data-driven deep-learning approaches. Extensive evaluation revealed CeDiRNet-3DoF's robustness in real-world performance, outperforming state-of-the-art methods, including the latest transformer-based models. Our work bridges a crucial gap, offering a robust solution and benchmark for cloth grasping in computer vision and robotics. Code and dataset are available at: https://github.com/vicoslab/CeDiRNet-3DoF

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	ViCoS Towel Dataset	Best F1	81.4	CeDiRNet-3DoF - RGB-D (ConvNext-B)
Pose Estimation	ViCoS Towel Dataset	Best F1	80.8	CeDiRNet-3DoF - RGB-D (ConvNext-L)
Pose Estimation	ViCoS Towel Dataset	Best F1	78.4	CeDiRNet-3DoF - RGB (ConvNext-L)
Pose Estimation	ViCoS Towel Dataset	Best F1	78	CeDiRNet-3DoF - RGB (ConvNext-B)
Pose Estimation	ViCoS Towel Dataset	Best F1	72.7	DINO - RGB (ConvNetx-B)
Pose Estimation	ViCoS Towel Dataset	Best F1	68.3	MaskRCNN - RGB (ResNext101)
Pose Estimation	ViCoS Towel Dataset	Best F1	65.7	Lisp et al. - RGB (ConvNetx-B)
Pose Estimation	ViCoS Towel Dataset	Best F1	61.2	DeformDETR - RGB (ConvNetx-B)
Pose Estimation	ViCoS Towel Dataset	Best F1	48.3	YOLOv7 - RGB
3D	ViCoS Towel Dataset	Best F1	81.4	CeDiRNet-3DoF - RGB-D (ConvNext-B)
3D	ViCoS Towel Dataset	Best F1	80.8	CeDiRNet-3DoF - RGB-D (ConvNext-L)
3D	ViCoS Towel Dataset	Best F1	78.4	CeDiRNet-3DoF - RGB (ConvNext-L)
3D	ViCoS Towel Dataset	Best F1	78	CeDiRNet-3DoF - RGB (ConvNext-B)
3D	ViCoS Towel Dataset	Best F1	72.7	DINO - RGB (ConvNetx-B)
3D	ViCoS Towel Dataset	Best F1	68.3	MaskRCNN - RGB (ResNext101)
3D	ViCoS Towel Dataset	Best F1	65.7	Lisp et al. - RGB (ConvNetx-B)
3D	ViCoS Towel Dataset	Best F1	61.2	DeformDETR - RGB (ConvNetx-B)
3D	ViCoS Towel Dataset	Best F1	48.3	YOLOv7 - RGB
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	81.4	CeDiRNet-3DoF - RGB-D (ConvNext-B)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	80.8	CeDiRNet-3DoF - RGB-D (ConvNext-L)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	78.4	CeDiRNet-3DoF - RGB (ConvNext-L)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	78	CeDiRNet-3DoF - RGB (ConvNext-B)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	72.7	DINO - RGB (ConvNetx-B)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	68.3	MaskRCNN - RGB (ResNext101)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	65.7	Lisp et al. - RGB (ConvNetx-B)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	61.2	DeformDETR - RGB (ConvNetx-B)
1 Image, 2*2 Stitchi	ViCoS Towel Dataset	Best F1	48.3	YOLOv7 - RGB

Center Direction Network for Grasping Point Localization on Cloths

Abstract

Results

Related Papers

Center Direction Network for Grasping Point Localization on Cloths

Abstract

Results

Related Papers