TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Piano Skills Assessment

Piano Skills Assessment

Paritosh Parmar, Jaiden Reddy, Brendan Morris

2021-01-13Skills EvaluationAudio ClassificationVideo RecognitionSkills AssessmentMultimodal Deep LearningAction Quality AssessmentVideo Classification
PaperPDFCode(official)

Abstract

Can a computer determine a piano player's skill level? Is it preferable to base this assessment on visual analysis of the player's performance or should we trust our ears over our eyes? Since current CNNs have difficulty processing long video videos, how can shorter clips be sampled to best reflect the players skill level? In this work, we collect and release a first-of-its-kind dataset for multimodal skill assessment focusing on assessing piano player's skill level, answer the asked questions, initiate work in automated evaluation of piano playing skills and provide baselines for future work. Dataset is available from: https://github.com/ParitoshParmar/Piano-Skills-Assessment.

Results

TaskDatasetMetricValueModel
VideoMultimodal PISAAccuracy (%)73.95Video
Audio ClassificationMultimodal PISAAccuracy (%)64.5Audio
Skills AssessmentMultimodal PISAAccuracy (%)74.6MMDL
ClassificationMultimodal PISAAccuracy (%)64.5Audio
Video ClassificationMultimodal PISAAccuracy (%)73.95Video

Related Papers

Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment2025-06-28Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons2025-06-24Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression Classifier2025-06-23Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation2025-06-14Ontology-based knowledge representation for bone disease diagnosis: a foundation for safe and sustainable medical artificial intelligence systems2025-06-05