Papers

575,626 papers

The Impact of Software Testing with Quantum Optimization Meets Machine Learning

Gopichand Bandarupalli

2025-06-02Defect Detectionsoftware testing

Paper

Greening AI-enabled Systems with Software Engineering: A Research Agenda for Environmentally Sustainable AI Practices

Luís Cruz, João Paulo Fernandes, Maja H. Kirkeby, Silverio Martínez-Fernández, June Sallou et al.

2025-06-02Benchmarking

Paper

WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing

Yu Nakagome, Michael Hentschel

2025-06-02Speech RecognitionKeyword Spottingspeech-recognition+2

Paper

Online Audio-Visual Autoregressive Speaker Extraction

Zexu Pan, Wupeng Wang, Shengkui Zhao, Chong Zhang, Kun Zhou et al.

2025-06-02

Paper

Zero-Shot Text-to-Speech for Vietnamese

Thi Vu, Linh The Nguyen, Dat Quoc Nguyen

2025-06-02Text to Speechtext-to-speech

Paper

Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion

Kumud Tripathi, Chowdam Venkata Kumar, Pankaj Wasnik

2025-06-02Action DetectionActivity Detection

Paper

Analyzing the Importance of Blank for CTC-Based Knowledge Distillation

Benedikt Hilmes, Nick Rossenbach, Ralf Schlüter

2025-06-02Speech RecognitionAutomatic Speech Recognitionspeech-recognition+1

Paper

SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction

Saurabh Agrawal, Raj Gohil, Gopal Kumar Agrawal, Vikram C M, Kushal Verma et al.

2025-06-02Text to SpeechSpeech SynthesisText-To-Speech Synthesis+2

Paper

Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion

Ajinkya Kulkarni, Sandipana Dowerah, Tanel Alumae, Mathew Magimai. -Doss

2025-06-02Metric LearningFace Swapping

Paper

Lessons Learned from the URGENT 2024 Speech Enhancement Challenge

Wangyou Zhang, Kohei Saijo, Samuele Cornell, Robin Scheibler, Chenda Li et al.

2025-06-02Speech Enhancement

Paper Code

Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech

Karl El Hajal, Enno Hermann, Sevada Hovsepyan, Mathew Magimai. -Doss

2025-06-02Speech RecognitionAutomatic Speech RecognitionAutomatic Speech Recognition (ASR)+3

Paper Code

Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric

Mattson Ogg, Caitlyn Bishop, Han Yi, Sarah Robinson

2025-06-02Speech RecognitionAutomatic Speech Recognitionspeech-recognition

Paper

Comparison of spectrogram scaling in multi-label Music Genre Recognition

Bartosz Karpiński, Cyryl Leszczyński

2025-06-02Music Genre Recognition

Paper

On-device Streaming Discrete Speech Units

Kwanghee Choi, Masao Someki, Emma Strubell, Shinji Watanabe

2025-06-02

Paper Code

Cocktail-Party Audio-Visual Speech Recognition

Thai-Binh Nguyen, Ngoc-Quan Pham, Alexander Waibel

2025-06-02Speech Recognitionspeech-recognitionAudio-Visual Speech Recognition+1

Paper

Towards Machine Unlearning for Paralinguistic Speech Processing

Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Shubham Singh, Swarup Ranjan Behera et al.

2025-06-02Depression DetectionSpeech Emotion RecognitionEmotion Recognition

Paper

Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction

Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Pailla Balakrishna Reddy et al.

2025-06-02Speaker Recognition

Paper

Are Mamba-based Audio Foundation Models the Best Fit for Non-Verbal Emotion Recognition?

Mohd Mujtaba Akhtar, Orchid Chetia Phukan, Girish, Swarup Ranjan Behera, Ananda Chandra Nayak et al.

2025-06-02Synthetic Speech DetectionSpeech Emotion RecognitionEmotion Recognition

Paper

Continual Speech Learning with Fused Speech Features

Guitao Wang, Jinming Zhao, Hao Yang, Guilin Qi, Tongtong Wu et al.

2025-06-02

Paper

Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction

Wang Dai, Archontis Politis, Tuomas Virtanen

2025-06-02AttributeSpeech Extraction

Paper

PreviousPage 382 of 28782Next