Papers With Code 2 | ML Benchmarks, SotA Results & Code

The ViCoS Towel Dataset is a state-of-the-art benchmark for grasp point localization on cloth objects, specifically towels. Designed to advance research in robotic grasping and perception for textile objects, this dataset includes a collection of 8,000 high-resolution RGB-D images (1920×1080) captured with a Kinect V2 under a variety of conditions. Each image provides detailed depth information, making it ideal for training deep learning models and conducting thorough benchmarking.

Object Diversity
The dataset features 10 types of towels from the Household Cloth Objects. including various sizes and patterns such as big towels, checkered rags, and waffle rags. Each towel's corners are treated as potential grasp points for detection and localization.

Object Positions
Towels were positioned in 10 different configurations on a tabletop to simulate a range of visibility scenarios, from fully spread-out to crumpled and folded, providing diverse grasping conditions.

Background Variability
To enhance background variability, we used five different tabletop cloths or objects, including festive tablecloths and patterned tablecloths. This variation helps test algorithms under different background conditions.

Lighting Conditions
Images were captured under eight distinct lighting setups, ranging from no light to fully illuminated scenes with various shadow intensities. This variety helps in evaluating the robustness of grasping models under different lighting scenarios.

Clutter
Both cluttered and uncluttered scenes were included to create challenging conditions with occlusions. Various clutter items were added on the desk or directly on the towels to simulate realistic scenarios.

Annotations

Each visible corner of the towels in the images has been manually annotated as a potential grasping point. Annotations include a point label and the angle-of-approach, providing a total of 20,784 annotated points with corresponding angles.

Training and Testing Splits

The dataset is divided into training and testing subsets. Two towels (checkered rag small and cotton napkin) and one background (festive tablecloth) were reserved for testing, ensuring that these elements are not present in the training set. The dataset includes 5,120 training images and 2,880 testing images, covering all object configurations, lighting conditions, and clutter scenarios.

Synthetic Training Data

In addition to the real images, the dataset includes 12,000 synthetic images generated using the MuJoCo simulation environment. These synthetic images depict towels in varied positions and conditions, with different textures and lighting settings, providing further diversity for training purposes.

For more details and access to the dataset, visit GitHub repository.

CeDiRNet-3DoF: Center Direction Network for Grasping Point Localization on Cloths!