Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Lin Sun, WeiHong Lin, Jinzhu Wu, Yongfu Zhu, Xiaoqi Jian et al.

2025-06-05 | All
Paper
EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition

Yi-Cheng Lin, Huang-Cheng Chou, Yu-Hsuan Li Liang, Hung-Yi Lee

2025-06-05 | Fairness, Benchmarking, Speech Emotion Recognition, +1
Paper
Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models

Anirudh Bharadwaj, Chaitanya Malaviya, Nitish Joshi, Mark Yatskar

2025-06-05 | Data Augmentation
Paper | Code
Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models

Taha Entesari, Arman Hatami, Rinat Khaziev, Anil Ramakrishna, Mahyar Fazlyab et al.

2025-06-05
Paper
ProRefine: Inference-time Prompt Refinement with Textual Feedback

Deepak Pandita, Tharindu Cyril Weerasooriya, Ankit Parag Shah, Christopher M. Homan, Wei Wei et al.

2025-06-05 | Mathematical Reasoning
Paper
Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning

Nan Huo, Jinyang Li, Bowen Qin, Ge Qu, Xiaolong Li et al.

2025-06-05 | Question Answering, RAG
Paper | Code
CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection

Ron Eliav, Arie Cattan, Eran Hirsch, Shahaf Bassan, Elias Stengel-Eskin et al.

2025-06-05 | Natural Language Inference, Hallucination
Paper
Towards a Unified System of Representation for Continuity and Discontinuity in Natural Language

Ratna Kandala, Prakash Mondal

2025-06-05
Paper
Improving Low-Resource Morphological Inflection via Self-Supervised Objectives

Adam Wiemerslage, Katharina von der Wense

2025-06-05 | Masked Language Modeling, Morphological Inflection, Language Modelling
Paper
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Nikhil Kandpal, Brian Lester, Colin Raffel, Sebastian Majstorovic, Stella Biderman et al.

2025-06-05
Paper
RELIC: Evaluating Compositional Instruction Following via Language Recognition

Jackson Petty, Michael Y. Hu, Wentao Wang, Shauli Ravfogel, William Merrill et al.

2025-06-05 | Instruction Following
Paper
Counterfactual reasoning: an analysis of in-context emergence

Moritz Miller, Bernhard Schölkopf, Siyuan Guo

2025-06-05 | regression, Counterfactual Reasoning, Story Generation
Paper | Code
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Yanzhao Zhang, Mingxin Li, Dingkun Long, Xin Zhang, Huan Lin et al.

2025-06-05 | Unsupervised Pre-training, Reranking, Retrieval
Paper | Code
Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective

Bhavik Chandna, Zubair Bashir, Procheta Sen

2025-06-05 | named-entity-recognition, Named Entity Recognition, Linguistic Acceptability
Paper
Do Large Language Models Judge Error Severity Like Humans?

Diege Sun, Guanyi Chen, Zhao Fan, Xiaorong Cheng, Tingting He et al.

2025-06-05 | Text Generation
Paper
Information Locality as an Inductive Bias for Neural Language Models

Taiga Someya, Anej Svete, Brian DuSell, Timothy J. O'Donnell, Mario Giulianelli et al.

2025-06-05
Paper | Code
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning

Tanmay Parekh, Kartik Mehta, Ninareh Mehrabi, Kai-Wei Chang, Nanyun Peng et al.

2025-06-05 | document understanding, Event Detection, Transfer Learning
Paper
CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media

Tianyi Huang, Zikun Cui, Cuiqianhe Du, Chia-En Chiang

2025-06-05 | Contrastive Learning, Text Detection
Paper
Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation

Soumitra Ghosh, Gopendra Vikram Singh, Shambhavi, Sabarna Choudhury, Asif Ekbal et al.

2025-06-05 | Multi-Task Learning
Paper
RIVAL: Reinforcement Learning with Iterative and Adversarial Optimization for Machine Translation

Tianjiao Li, Mengran Yu, Chenyu Shi, Yanjun Zhao, Xiaojing Liu et al.

2025-06-05 | Machine Translation, Translation
Paper