TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Generative Adversarial Graph Convolutional Networks for Hu...

Generative Adversarial Graph Convolutional Networks for Human Action Synthesis

Bruno Degardin, João Neves, Vasco Lopes, João Brito, Ehsan Yaghoubi, Hugo Proença

2021-10-21Action GenerationDisentanglementHuman action generationMotion SynthesisVideo Generation
PaperPDFCode(official)

Abstract

Synthesising the spatial and temporal dynamics of the human body skeleton remains a challenging task, not only in terms of the quality of the generated shapes, but also of their diversity, particularly to synthesise realistic body movements of a specific action (action conditioning). In this paper, we propose Kinetic-GAN, a novel architecture that leverages the benefits of Generative Adversarial Networks and Graph Convolutional Networks to synthesise the kinetics of the human body. The proposed adversarial architecture can condition up to 120 different actions over local and global body movements while improving sample quality and diversity through latent space disentanglement and stochastic variations. Our experiments were carried out in three well-known datasets, where Kinetic-GAN notably surpasses the state-of-the-art methods in terms of distribution quality metrics while having the ability to synthesise more than one order of magnitude regarding the number of different actions. Our code and models are publicly available at https://github.com/DegardinBruno/Kinetic-GAN.

Results

TaskDatasetMetricValueModel
Activity RecognitionNTU RGB+DFID (CS)3.618Kinetic-GAN
Activity RecognitionNTU RGB+DFID (CV)4.235Kinetic-GAN
Activity RecognitionNTU RGB+D 120FID (CS)5.967Kinetic-GAN
Activity RecognitionNTU RGB+D 120FID (CV)6.751Kinetic-GAN
Activity RecognitionNTU RGB+D 2DMMDa (CS)0.256Kinetic-GAN
Activity RecognitionNTU RGB+D 2DMMDa (CV)0.295Kinetic-GAN
Activity RecognitionNTU RGB+D 2DMMDs (CS)0.273Kinetic-GAN
Activity RecognitionNTU RGB+D 2DMMDs (CV)0.31Kinetic-GAN
Activity RecognitionHuman3.6MMMDa0.071Kinetic-GAN
Activity RecognitionHuman3.6MMMDs0.082Kinetic-GAN
Human action generationNTU RGB+DFID (CS)3.618Kinetic-GAN
Human action generationNTU RGB+DFID (CV)4.235Kinetic-GAN
Human action generationNTU RGB+D 120FID (CS)5.967Kinetic-GAN
Human action generationNTU RGB+D 120FID (CV)6.751Kinetic-GAN
Human action generationNTU RGB+D 2DMMDa (CS)0.256Kinetic-GAN
Human action generationNTU RGB+D 2DMMDa (CV)0.295Kinetic-GAN
Human action generationNTU RGB+D 2DMMDs (CS)0.273Kinetic-GAN
Human action generationNTU RGB+D 2DMMDs (CV)0.31Kinetic-GAN
Human action generationHuman3.6MMMDa0.071Kinetic-GAN
Human action generationHuman3.6MMMDs0.082Kinetic-GAN

Related Papers

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models2025-07-18VITA: Vision-to-Action Flow Matching Policy2025-07-17World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Leveraging Pre-Trained Visual Models for AI-Generated Video Detection2025-07-17Taming Diffusion Transformer for Real-Time Mobile Video Generation2025-07-17LoViC: Efficient Long Video Generation with Context Compression2025-07-17$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting2025-07-12Towards Imperceptible JPEG Image Hiding: Multi-range Representations-driven Adversarial Stego Generation2025-07-11