AI Choreographer: Music Conditioned 3D Dance Generation with AIST++

RuiLong Li, Shan Yang, David A. Ross, Angjoo Kanazawa

2021-01-21ICCV 2021 10Pose Estimation Motion Generation Motion Synthesis

Abstract

We present AIST++, a new multi-modal dataset of 3D dance motion and music, along with FACT, a Full-Attention Cross-modal Transformer network for generating 3D dance motion conditioned on music. The proposed AIST++ dataset contains 5.2 hours of 3D dance motion in 1408 sequences, covering 10 dance genres with multi-view videos with known camera poses -- the largest dataset of this kind to our knowledge. We show that naively applying sequence models such as transformers to this dataset for the task of music conditioned 3D motion generation does not produce satisfactory 3D motion that is well correlated with the input music. We overcome these shortcomings by introducing key changes in its architecture design and supervision: FACT model involves a deep cross-modal transformer block with full-attention that is trained to predict $N$ future motions. We empirically show that these changes are key factors in generating long sequences of realistic dance motion that are well-attuned to the input music. We conduct extensive experiments on AIST++ with user studies, where our method outperforms recent state-of-the-art methods both qualitatively and quantitatively.

Results

Task	Dataset	Metric	Value	Model
Pose Tracking	BRACE	Beat DTW cost	12.92	AIST++
Pose Tracking	BRACE	Beat alignment score	0.136	AIST++
Pose Tracking	BRACE	Footwork average	40.73	AIST++
Pose Tracking	BRACE	Frechet Inception Distance	0.5743	AIST++
Pose Tracking	BRACE	Powermove average	52.89	AIST++
Pose Tracking	BRACE	Toprock average	6.39	AIST++
Pose Tracking	FineDance	BAS	0.1831	FACT
Pose Tracking	FineDance	fid_k	113.38	FACT
Pose Tracking	AIST++	Beat alignment score	0.221	AI Choreographer
Pose Tracking	AIST++	FID	35.35	AI Choreographer
Motion Synthesis	BRACE	Beat DTW cost	12.92	AIST++
Motion Synthesis	BRACE	Beat alignment score	0.136	AIST++
Motion Synthesis	BRACE	Footwork average	40.73	AIST++
Motion Synthesis	BRACE	Frechet Inception Distance	0.5743	AIST++
Motion Synthesis	BRACE	Powermove average	52.89	AIST++
Motion Synthesis	BRACE	Toprock average	6.39	AIST++
Motion Synthesis	FineDance	BAS	0.1831	FACT
Motion Synthesis	FineDance	fid_k	113.38	FACT
Motion Synthesis	AIST++	Beat alignment score	0.221	AI Choreographer
Motion Synthesis	AIST++	FID	35.35	AI Choreographer
10-shot image generation	BRACE	Beat DTW cost	12.92	AIST++
10-shot image generation	BRACE	Beat alignment score	0.136	AIST++
10-shot image generation	BRACE	Footwork average	40.73	AIST++
10-shot image generation	BRACE	Frechet Inception Distance	0.5743	AIST++
10-shot image generation	BRACE	Powermove average	52.89	AIST++
10-shot image generation	BRACE	Toprock average	6.39	AIST++
10-shot image generation	FineDance	BAS	0.1831	FACT
10-shot image generation	FineDance	fid_k	113.38	FACT
10-shot image generation	AIST++	Beat alignment score	0.221	AI Choreographer
10-shot image generation	AIST++	FID	35.35	AI Choreographer
3D Human Pose Tracking	BRACE	Beat DTW cost	12.92	AIST++
3D Human Pose Tracking	BRACE	Beat alignment score	0.136	AIST++
3D Human Pose Tracking	BRACE	Footwork average	40.73	AIST++
3D Human Pose Tracking	BRACE	Frechet Inception Distance	0.5743	AIST++
3D Human Pose Tracking	BRACE	Powermove average	52.89	AIST++
3D Human Pose Tracking	BRACE	Toprock average	6.39	AIST++
3D Human Pose Tracking	FineDance	BAS	0.1831	FACT
3D Human Pose Tracking	FineDance	fid_k	113.38	FACT
3D Human Pose Tracking	AIST++	Beat alignment score	0.221	AI Choreographer
3D Human Pose Tracking	AIST++	FID	35.35	AI Choreographer

Abstract

Results

Task	Dataset	Metric	Value	Model
Pose Tracking	BRACE	Beat DTW cost	12.92	AIST++
Pose Tracking	BRACE	Beat alignment score	0.136	AIST++
Pose Tracking	BRACE	Footwork average	40.73	AIST++
Pose Tracking	BRACE	Frechet Inception Distance	0.5743	AIST++
Pose Tracking	BRACE	Powermove average	52.89	AIST++
Pose Tracking	BRACE	Toprock average	6.39	AIST++
Pose Tracking	FineDance	BAS	0.1831	FACT
Pose Tracking	FineDance	fid_k	113.38	FACT
Pose Tracking	AIST++	Beat alignment score	0.221	AI Choreographer
Pose Tracking	AIST++	FID	35.35	AI Choreographer
Motion Synthesis	BRACE	Beat DTW cost	12.92	AIST++
Motion Synthesis	BRACE	Beat alignment score	0.136	AIST++
Motion Synthesis	BRACE	Footwork average	40.73	AIST++
Motion Synthesis	BRACE	Frechet Inception Distance	0.5743	AIST++
Motion Synthesis	BRACE	Powermove average	52.89	AIST++
Motion Synthesis	BRACE	Toprock average	6.39	AIST++
Motion Synthesis	FineDance	BAS	0.1831	FACT
Motion Synthesis	FineDance	fid_k	113.38	FACT
Motion Synthesis	AIST++	Beat alignment score	0.221	AI Choreographer
Motion Synthesis	AIST++	FID	35.35	AI Choreographer
10-shot image generation	BRACE	Beat DTW cost	12.92	AIST++
10-shot image generation	BRACE	Beat alignment score	0.136	AIST++
10-shot image generation	BRACE	Footwork average	40.73	AIST++
10-shot image generation	BRACE	Frechet Inception Distance	0.5743	AIST++
10-shot image generation	BRACE	Powermove average	52.89	AIST++
10-shot image generation	BRACE	Toprock average	6.39	AIST++
10-shot image generation	FineDance	BAS	0.1831	FACT
10-shot image generation	FineDance	fid_k	113.38	FACT
10-shot image generation	AIST++	Beat alignment score	0.221	AI Choreographer
10-shot image generation	AIST++	FID	35.35	AI Choreographer
3D Human Pose Tracking	BRACE	Beat DTW cost	12.92	AIST++
3D Human Pose Tracking	BRACE	Beat alignment score	0.136	AIST++
3D Human Pose Tracking	BRACE	Footwork average	40.73	AIST++
3D Human Pose Tracking	BRACE	Frechet Inception Distance	0.5743	AIST++
3D Human Pose Tracking	BRACE	Powermove average	52.89	AIST++
3D Human Pose Tracking	BRACE	Toprock average	6.39	AIST++
3D Human Pose Tracking	FineDance	BAS	0.1831	FACT
3D Human Pose Tracking	FineDance	fid_k	113.38	FACT
3D Human Pose Tracking	AIST++	Beat alignment score	0.221	AI Choreographer
3D Human Pose Tracking	AIST++	FID	35.35	AI Choreographer

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++

Abstract

Results

Related Papers

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++

Abstract

Results

Related Papers