TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HSPACE: Synthetic Parametric Humans Animated in Complex En...

HSPACE: Synthetic Parametric Humans Animated in Complex Environments

Eduard Gabriel Bazavan, Andrei Zanfir, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

2021-12-233D Human Pose EstimationScene Understanding
PaperPDF

Abstract

Advances in the state of the art for 3d human sensing are currently limited by the lack of visual datasets with 3d ground truth, including multiple people, in motion, operating in real-world environments, with complex illumination or occlusion, and potentially observed by a moving camera. Sophisticated scene understanding would require estimating human pose and shape as well as gestures, towards representations that ultimately combine useful metric and behavioral signals with free-viewpoint photo-realistic visualisation capabilities. To sustain progress, we build a large-scale photo-realistic dataset, Human-SPACE (HSPACE), of animated humans placed in complex synthetic indoor and outdoor environments. We combine a hundred diverse individuals of varying ages, gender, proportions, and ethnicity, with hundreds of motions and scenes, as well as parametric variations in body shape (for a total of 1,600 different humans), in order to generate an initial dataset of over 1 million frames. Human animations are obtained by fitting an expressive human body model, GHUM, to single scans of people, followed by novel re-targeting and positioning procedures that support the realistic animation of dressed humans, statistical variation of body proportions, and jointly consistent scene placement of multiple moving people. Assets are generated automatically, at scale, and are compatible with existing real time rendering and game engines. The dataset with evaluation server will be made available for research. Our large-scale analysis of the impact of synthetic data, in connection with real data and weak supervision, underlines the considerable potential for continuing quality improvements and limiting the sim-to-real gap, in this practical setting, in connection with increased model capacity.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHSPACEMPJPE71T-THUNDR (HITI + HSPACE)
3D Human Pose EstimationHSPACEMPVPE81T-THUNDR (HITI + HSPACE)
3D Human Pose EstimationHSPACEPA-MPJPE47T-THUNDR (HITI + HSPACE)
3D Human Pose EstimationHSPACEPA-MPVPE58T-THUNDR (HITI + HSPACE)
Pose EstimationHSPACEMPJPE71T-THUNDR (HITI + HSPACE)
Pose EstimationHSPACEMPVPE81T-THUNDR (HITI + HSPACE)
Pose EstimationHSPACEPA-MPJPE47T-THUNDR (HITI + HSPACE)
Pose EstimationHSPACEPA-MPVPE58T-THUNDR (HITI + HSPACE)
3DHSPACEMPJPE71T-THUNDR (HITI + HSPACE)
3DHSPACEMPVPE81T-THUNDR (HITI + HSPACE)
3DHSPACEPA-MPJPE47T-THUNDR (HITI + HSPACE)
3DHSPACEPA-MPVPE58T-THUNDR (HITI + HSPACE)
1 Image, 2*2 StitchiHSPACEMPJPE71T-THUNDR (HITI + HSPACE)
1 Image, 2*2 StitchiHSPACEMPVPE81T-THUNDR (HITI + HSPACE)
1 Image, 2*2 StitchiHSPACEPA-MPJPE47T-THUNDR (HITI + HSPACE)
1 Image, 2*2 StitchiHSPACEPA-MPVPE58T-THUNDR (HITI + HSPACE)

Related Papers

Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation2025-07-15Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander2025-07-15Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis2025-07-15EmbRACE-3K: Embodied Reasoning and Action in Complex Environments2025-07-14OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding2025-07-10