TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

The Impact of Software Testing with Quantum Optimization Meets Machine Learning

Gopichand Bandarupalli

2025-06-02Defect Detectionsoftware testing
Paper
Greening AI-enabled Systems with Software Engineering: A Research Agenda for Environmentally Sustainable AI Practices

Luís Cruz, João Paulo Fernandes, Maja H. Kirkeby, Silverio Martínez-Fernández, June Sallou et al.

2025-06-02Benchmarking
Paper
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing

Yu Nakagome, Michael Hentschel

2025-06-02Speech RecognitionKeyword Spottingspeech-recognition+2
Paper
Online Audio-Visual Autoregressive Speaker Extraction

Zexu Pan, Wupeng Wang, Shengkui Zhao, Chong Zhang, Kun Zhou et al.

2025-06-02
Paper
Zero-Shot Text-to-Speech for Vietnamese

Thi Vu, Linh The Nguyen, Dat Quoc Nguyen

2025-06-02Text to Speechtext-to-speech
Paper
Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion

Kumud Tripathi, Chowdam Venkata Kumar, Pankaj Wasnik

2025-06-02Action DetectionActivity Detection
Paper
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation

Benedikt Hilmes, Nick Rossenbach, Ralf Schlüter

2025-06-02Speech RecognitionAutomatic Speech Recognitionspeech-recognition+1
Paper
SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction

Saurabh Agrawal, Raj Gohil, Gopal Kumar Agrawal, Vikram C M, Kushal Verma et al.

2025-06-02Text to SpeechSpeech SynthesisText-To-Speech Synthesis+2
Paper
Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion

Ajinkya Kulkarni, Sandipana Dowerah, Tanel Alumae, Mathew Magimai. -Doss

2025-06-02Metric LearningFace Swapping
Paper
Lessons Learned from the URGENT 2024 Speech Enhancement Challenge

Wangyou Zhang, Kohei Saijo, Samuele Cornell, Robin Scheibler, Chenda Li et al.

2025-06-02Speech Enhancement
PaperCode
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech

Karl El Hajal, Enno Hermann, Sevada Hovsepyan, Mathew Magimai. -Doss

2025-06-02Speech RecognitionAutomatic Speech RecognitionAutomatic Speech Recognition (ASR)+3
PaperCode
Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric

Mattson Ogg, Caitlyn Bishop, Han Yi, Sarah Robinson

2025-06-02Speech RecognitionAutomatic Speech Recognitionspeech-recognition
Paper
Comparison of spectrogram scaling in multi-label Music Genre Recognition

Bartosz Karpiński, Cyryl Leszczyński

2025-06-02Music Genre Recognition
Paper
On-device Streaming Discrete Speech Units

Kwanghee Choi, Masao Someki, Emma Strubell, Shinji Watanabe

2025-06-02
PaperCode
Cocktail-Party Audio-Visual Speech Recognition

Thai-Binh Nguyen, Ngoc-Quan Pham, Alexander Waibel

2025-06-02Speech Recognitionspeech-recognitionAudio-Visual Speech Recognition+1
Paper
Towards Machine Unlearning for Paralinguistic Speech Processing

Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Shubham Singh, Swarup Ranjan Behera et al.

2025-06-02Depression DetectionSpeech Emotion RecognitionEmotion Recognition
Paper
Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction

Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Pailla Balakrishna Reddy et al.

2025-06-02Speaker Recognition
Paper
Are Mamba-based Audio Foundation Models the Best Fit for Non-Verbal Emotion Recognition?

Mohd Mujtaba Akhtar, Orchid Chetia Phukan, Girish, Swarup Ranjan Behera, Ananda Chandra Nayak et al.

2025-06-02Synthetic Speech DetectionSpeech Emotion RecognitionEmotion Recognition
Paper
Continual Speech Learning with Fused Speech Features

Guitao Wang, Jinming Zhao, Hao Yang, Guilin Qi, Tongtong Wu et al.

2025-06-02
Paper
Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction

Wang Dai, Archontis Politis, Tuomas Virtanen

2025-06-02AttributeSpeech Extraction
Paper
PreviousPage 382 of 28782Next