Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

CODEMENV: Benchmarking Large Language Models on Code Migration

Keyuan Cheng, Xudong Shen, Yihao Yang, Tengyue Wang, Yang Cao et al.

2025-06-01 | Benchmarking
Paper | Code
Legal Compliance Evaluation of Smart Contracts Generated By Large Language Models

Chanuka Wijayakoon, Hai Dong, H. M. N. Dilum Bandara, Zahir Tari, Anurag Soin et al.

2025-06-01 | Code Generation
Paper
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents

Xiang Fei, Xiawu Zheng, Hao Feng

2025-06-01 | Semantic Similarity, Semantic Textual Similarity, Retrieval, +1
Paper
CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer

Daiki Takeuchi, Binh Thien Nguyen, Masahiro Yasuda, Yasunori Ohishi, Daisuke Niizumi et al.

2025-06-01 | Quantization, Audio captioning, Language Modelling
Paper
Counterfactual Activation Editing for Post-hoc Prosody and Mispronunciation Correction in TTS Models

Kyowoon Lee, Artyom Stitsyuk, Gunu Jho, Inchul Hwang, Jaesik Choi et al.

2025-06-01 | Text to Speech, Speech Synthesis, text-to-speech
Paper
HASRD: Hierarchical Acoustic and Semantic Representation Disentanglement

Amir Hussein, Sameer Khurana, Gordon Wichern, Francois G. Germain, Jonathan Le Roux et al.

2025-06-01 | Self-Supervised Learning, Disentanglement
Paper
Speech Unlearning

Jiali Cheng, Hadi Amiri

2025-06-01 | Keyword Spotting, Speaker Identification, Adversarial Robustness
Paper
Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment

Parismita Gogoi, Vishwanath Pratap Singh, Seema Khadirnaikar, Soma Siddhartha, Sishir Kalita et al.

2025-06-01 | regression, Rhythm, Classification
Paper | Code
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching

Leying Zhang, Yao Qian, Xiaofei Wang, Manthan Thakker, Dongmei Wang et al.

2025-06-01 | Disentanglement, Dialogue Generation
Paper
In-the-wild Audio Spatialization with Flexible Text-guided Localization

Tianrui Pan, Jie Liu, Zewen Huang, Jie Tang, Gangshan Wu et al.

2025-06-01 | Spatial Reasoning
Paper | Code
General-purpose audio representation learning for real-world sound scenes

Goksenin Yuksel, Marcel van Gerven, Kiki van der Heijden

2025-06-01 | Sound Classification, Representation Learning
Paper
Crowdsourcing MUSHRA Tests in the Age of Generative Speech Technologies: A Comparative Analysis of Subjective and Objective Testing Methods

Laura Lechler, Chamran Moradi, Ivana Balic

2025-06-01
Paper | Code
Leveraging Large Language Models for Sarcastic Speech Annotation in Sarcasm Detection

Zhu Li, Yuqing Zhang, Xiyuan Gao, Shekhar Nayak, Matt Coler et al.

2025-06-01 | Sarcasm Detection
Paper
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-training

Marianne de Heer Kloots, Hosein Mohebbi, Charlotte Pouw, Gaofei Shen, Willem Zuidema et al.

2025-06-01 | Speech Recognition, Automatic Speech Recognition, speech-recognition
Paper | Code
Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching

Jialong Zuo, Shengpeng Ji, Minghui Fang, Mingze Li, Ziyue Jiang et al.

2025-06-01 | Style Transfer, Rhythm, Voice Conversion
Paper
A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement

Shenghui Lu, Hukai Huang, Jinanglong Yao, Kaidi Wang, Qingyang Hong et al.

2025-06-01 | Speech Enhancement
Paper
PseudoVC: Improving One-shot Voice Conversion with Pseudo Paired Data

Songjun Cao, Qinghua Wu, Jie Chen, Jin Li, Long Ma et al.

2025-06-01 | Voice Conversion
Paper
Learning More with Less: Self-Supervised Approaches for Low-Resource Speech Emotion Recognition

Ziwei Gong, Pengyuan Shi, Kaan Donbekci, Lin Ai, Run Chen et al.

2025-06-01 | Contrastive Learning, Speech Emotion Recognition, Emotion Recognition
Paper
FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Shunian Chen, Xinyuan Xie, Zheshu Chen, Liyan Zhao, Owen Lee et al.

2025-06-01 | Instruction Following, Caption Generation, Audio captioning, +1
Paper | Code
From Words to Waves: Analyzing Concept Formation in Speech and Text-Based Foundation Models

Asım Ersoy, Basel Mousi, Shammur Chowdhury, Firoj Alam, Fahim Dalvi et al.

2025-06-01 | World Knowledge
Paper