VEMOCLAP: A video emotion classification web application

Serkan Sulun, Paula Viana, Matthew E. P. Davies

2024-10-22

Emotion Classification, Classification, Video Emotion Recognition

Abstract

We introduce VEMOCLAP: Video EMOtion Classifier using Pretrained features, the first readily available and open-source web application that analyzes the emotional content of any user-provided video. We improve on our previous work, which extracts features from video frames and audio using open-source pretrained models and then efficiently fuses these pretrained features using multi-head cross-attention. Our approach increases the state-of-the-art classification accuracy on the Ekman-6 video emotion dataset by 4.3% and offers an online application for users to run our model on their own videos or YouTube videos. We invite readers to try our application at serkansulun.com/app.
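
To make the fusion step concrete, the following is a minimal NumPy sketch of multi-head cross-attention between two feature sequences, followed by mean pooling and a linear classifier over six emotion classes. It is an illustration under stated assumptions, not the VEMOCLAP implementation: the feature dimension, sequence lengths, number of heads, random weights, the choice of video features as queries and audio features as keys and values, and all function names are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def init_params(d_model, rng):
    """Random projection matrices (stand-ins for trained weights)."""
    scale = 1.0 / np.sqrt(d_model)
    names = ("Wq", "Wk", "Wv", "Wo")
    return {n: rng.normal(0.0, scale, (d_model, d_model)) for n in names}


def multi_head_cross_attention(query_feats, context_feats, params, num_heads):
    """Attend from query_feats (Tq, d) to context_feats (Tk, d).

    Returns the fused sequence (Tq, d) and the attention
    weights (num_heads, Tq, Tk).
    """
    t_q, d_model = query_feats.shape
    t_k = context_feats.shape[0]
    head_dim = d_model // num_heads

    # Linear projections: queries from one modality, keys/values from the other.
    q = query_feats @ params["Wq"]
    k = context_feats @ params["Wk"]
    v = context_feats @ params["Wv"]

    # Split the feature dimension into heads: (num_heads, T, head_dim).
    q = q.reshape(t_q, num_heads, head_dim).transpose(1, 0, 2)
    k = k.reshape(t_k, num_heads, head_dim).transpose(1, 0, 2)
    v = v.reshape(t_k, num_heads, head_dim).transpose(1, 0, 2)

    # Scaled dot-product attention, computed independently per head.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(head_dim)
    weights = softmax(scores, axis=-1)
    heads = weights @ v

    # Concatenate the heads and apply the output projection.
    merged = heads.transpose(1, 0, 2).reshape(t_q, d_model)
    return merged @ params["Wo"], weights


rng = np.random.default_rng(0)
d_model, num_heads, num_classes = 32, 4, 6  # assumed sizes; six emotion classes

# Hypothetical pretrained features: 10 video-frame vectors, 7 audio vectors.
video_feats = rng.normal(size=(10, d_model))
audio_feats = rng.normal(size=(7, d_model))

params = init_params(d_model, rng)
fused, attn = multi_head_cross_attention(video_feats, audio_feats, params, num_heads)

# Mean-pool over time, then apply a linear classifier (random weights here).
w_cls = rng.normal(0.0, 1.0 / np.sqrt(d_model), (d_model, num_classes))
probs = softmax(fused.mean(axis=0) @ w_cls)
```

With trained weights, `probs` would hold the predicted distribution over the emotion classes; here it only demonstrates the shapes and data flow.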

Results

Task	Dataset	Metric	Value	Model
Emotion Recognition	Ekman6	Accuracy	65.28	VEMOCLAP
