Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Digital Life Project: Autonomous 3D Characters with Social Intelligence

Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

Published: 2023-12-07 (CVPR 2024)
Tasks: Motion Captioning, Motion Generation, Motion Synthesis
Links: Paper, PDF

Abstract

In this work, we present Digital Life Project, a framework utilizing language as the universal medium to build autonomous 3D characters, who are capable of engaging in social interactions and expressing with articulated body motions, thereby simulating life in a digital environment. Our framework comprises two primary components: 1) SocioMind: a meticulously crafted digital brain that models personalities with systematic few-shot exemplars, incorporates a reflection process based on psychology principles, and emulates autonomy by initiating dialogue topics; 2) MoMat-MoGen: a text-driven motion synthesis paradigm for controlling the character's digital body. It integrates motion matching, a proven industry technique to ensure motion quality, with cutting-edge advancements in motion generation for diversity. Extensive experiments demonstrate that each module achieves state-of-the-art performance in its respective domain. Collectively, they enable virtual characters to initiate and sustain dialogues autonomously, while evolving their socio-psychological states. Concurrently, these characters can perform contextually relevant bodily movements. Additionally, a motion captioning module further allows the virtual character to recognize and appropriately respond to human players' actions. Homepage: https://digital-life-project.com/
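The abstract describes a two-module pipeline: a dialogue "digital brain" (SocioMind) produces language, and a text-driven motion module (MoMat-MoGen) turns that language into body motion by combining motion matching (for quality) with generative synthesis (for diversity). The sketch below is a minimal, hypothetical illustration of that control flow only; all class and function names are invented for this example and are not from the paper's codebase.

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class SocioMind:
    """Toy stand-in for the dialogue/psychology module described in the abstract."""
    personality: str
    history: list = field(default_factory=list)

    def next_utterance(self, incoming: Optional[str]) -> str:
        # In the real system this would involve few-shot personality exemplars
        # and a psychology-based reflection step; here we just echo a topic.
        # When there is no incoming message, the character initiates a topic
        # itself (the "autonomy" behavior the abstract mentions).
        topic = incoming if incoming is not None else "initiate: greeting"
        self.history.append(topic)
        return f"[{self.personality}] responds to '{topic}'"

class MoMatMoGen:
    """Toy stand-in: motion matching against a clip database, with a
    generative fallback when no clip matches."""
    def __init__(self, motion_db: dict):
        self.motion_db = motion_db  # keyword -> canned motion clip

    def synthesize(self, text: str) -> str:
        for keyword, clip in self.motion_db.items():
            if keyword in text:
                return clip                       # matched clip (quality)
        return f"generated_motion({text!r})"      # generative fallback (diversity)

# One turn of the loop: the brain speaks, the body moves.
brain = SocioMind(personality="cheerful")
body = MoMatMoGen({"greeting": "wave_hands.bvh"})
utterance = brain.next_utterance(None)   # character initiates the dialogue
motion = body.synthesize(utterance)
print(motion)
```

The point of the split is that language acts as the interface between the two modules: anything the brain says (or captions from an observed human player) can be routed through the same text-to-motion path.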

Results

Task                      Dataset     Metric            Value  Model
Pose Tracking             InterHuman  FID               5.674  MoMat-MoGen
Pose Tracking             InterHuman  MMDist            3.79   MoMat-MoGen
Pose Tracking             InterHuman  MModality         1.295  MoMat-MoGen
Pose Tracking             InterHuman  R-Precision Top3  0.666  MoMat-MoGen
Motion Synthesis          InterHuman  FID               5.674  MoMat-MoGen
Motion Synthesis          InterHuman  MMDist            3.79   MoMat-MoGen
Motion Synthesis          InterHuman  MModality         1.295  MoMat-MoGen
Motion Synthesis          InterHuman  R-Precision Top3  0.666  MoMat-MoGen
10-shot image generation  InterHuman  FID               5.674  MoMat-MoGen
10-shot image generation  InterHuman  MMDist            3.79   MoMat-MoGen
10-shot image generation  InterHuman  MModality         1.295  MoMat-MoGen
10-shot image generation  InterHuman  R-Precision Top3  0.666  MoMat-MoGen
3D Human Pose Tracking    InterHuman  FID               5.674  MoMat-MoGen
3D Human Pose Tracking    InterHuman  MMDist            3.79   MoMat-MoGen
3D Human Pose Tracking    InterHuman  MModality         1.295  MoMat-MoGen
3D Human Pose Tracking    InterHuman  R-Precision Top3  0.666  MoMat-MoGen

Related Papers

- SnapMoGen: Human Motion Generation from Expressive Texts (2025-07-12)
- Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data (2025-07-09)
- Motion Generation: A Survey of Generative Approaches and Benchmarks (2025-07-07)
- DeepGesture: A conversational gesture synthesis system based on emotions and semantics (2025-07-03)
- A Unified Transformer-Based Framework with Pretraining For Whole Body Grasping Motion Generation (2025-07-01)
- VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions (2025-06-29)
- DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling (2025-06-23)
- PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis (2025-06-22)