A Large-scale Audio-visual Dataset for Emotional Talking-face Generation
Multi-view Emotional Audio-visual Dataset