ITOP

Invariant-Top View Dataset

ImagesCC BY 4.0Introduced 2016-01-01

The ITOP dataset consists of 40K training and 10K testing depth images for each of the front-view and top-view tracks. This dataset contains depth images with 20 actors who perform 15 sequences each and is recorded by two Asus Xtion Pro cameras. The ground-truth of this dataset is the 3D coordinates of 15 body joints.

Source: V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map Image Source: https://www.youtube.com/watch?v=4gPI-GOf9wg

Related Benchmarks

ITOP front-view/1 Image, 2*2 Stitchi/Mean mAP ITOP front-view/3D/Mean mAP ITOP front-view/3D Human Pose Estimation/Mean mAP ITOP front-view/Pose Estimation/Mean mAP ITOP top-view/1 Image, 2*2 Stitchi/Mean mAP ITOP top-view/3D/Mean mAP ITOP top-view/Pose Estimation/Mean mAP