TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Identifying Reliable Evaluation Metrics for Scientific Text Revision

Léane Jourdan, Florian Boudin, Richard Dufour, Nicolas Hernandez

2025-06-05Instruction Following
PaperCode
Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection

Ziyi Zhou, XiaoMing Zhang, Litian Zhang, Yibo Zhang, Zhenyu Guan et al.

2025-06-05knowledge editingFake News Detection
Paper
SPARTA ALIGNMENT: Collectively Aligning Multiple Language Models through Combat

Yuru Jiang, Wenxuan Ding, Shangbin Feng, Greg Durrett, Yulia Tsvetkov et al.

2025-06-05
Paper
IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation

Bhavana Akkiraju, Aishwarya Pothula, Santosh Kesiraju, Anil Kumar Vuppala

2025-06-05Data AugmentationTranslation
Paper
Accelerated Test-Time Scaling with Model-Free Speculative Sampling

Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi, Bhavana Ganesh, Jinwoo Shin et al.

2025-06-05Language Modelling
Paper
Cracking the Code: Enhancing Implicit Hate Speech Detection through Coding Classification

Lu Wei, Liangzhi Li, Tong Xiang, Xiao Liu, Noa Garcia et al.

2025-06-05Hate Speech Detection
Paper
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Thao Nguyen, Yang Li, Olga Golovneva, Luke Zettlemoyer, Sewoong Oh et al.

2025-06-05
Paper
Normative Conflicts and Shallow AI Alignment

Raphaël Millière

2025-06-05
Paper
Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents

Juhyun Oh, Eunsu Kim, Alice Oh

2025-06-05
PaperCode
TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering

Vinay Joshi, Pratik Prabhanjan Brahma, Zicheng Liu, Emad Barsoum

2025-06-05Quantization
Paper
Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning

Zhiyuan Ma, Jiayu Liu, Xianzhen Luo, Zhenya Huang, Qingfu Zhu et al.

2025-06-05Imitation Learning
PaperCode
Static Word Embeddings for Sentence Semantic Representation

Takashi Wada, Yuki Hirakawa, Ryotaro Shimizu, Takahiro Kawashima, Yuki Saito et al.

2025-06-05Word EmbeddingsContrastive LearningKnowledge Distillation
Paper
Revisiting Test-Time Scaling: A Survey and a Diversity-Aware Method for Efficient Reasoning

Ho-Lam Chung, Teng-Yun Hsiao, Hsiao-Ying Huang, Chunerh Cho, Jian-Ren Lin et al.

2025-06-05Mathematical Reasoning
Paper
A MISMATCHED Benchmark for Scientific Natural Language Inference

Firoz Shaik, Mobashir Sadat, Nikita Gautam, Doina Caragea, Cornelia Caragea et al.

2025-06-05Natural Language Inference
PaperCode
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification

Chengwu Liu, Ye Yuan, Yichun Yin, Yan Xu, Xin Xu et al.

2025-06-05Mathematical ReasoningAutomated Theorem ProvingHallucination
PaperCode
MuSciClaims: Multimodal Scientific Claim Verification

Yash Kumar Lal, Manikanta Bandham, Mohammad Saqib Hasan, Apoorva Kashi, Mahnaz Koupaee et al.

2025-06-05Multimodal ReasoningClaim VerificationDiagnostic
Paper
SUCEA: Reasoning-Intensive Retrieval for Adversarial Fact-checking through Claim Decomposition and Editing

Hongjun Liu, Yilun Zhao, Arman Cohan, Chen Zhao

2025-06-05MisinformationFact CheckingRetrieval
PaperCode
Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching

Jianfei Zhang, Bei Li, Jun Bai, Rumei Li, Yanmeng Wang et al.

2025-06-05
PaperCode
Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts?

Qingchuan Li, Jiatong Li, Zirui Liu, Mingyue Cheng, Yuting Zeng et al.

2025-06-05Formal LogicLogical Reasoning
PaperCode
Reasoning or Overthinking: Evaluating Large Language Models on Financial Sentiment Analysis

Dimitris Vamvourellis, Dhagash Mehta

2025-06-05Sentiment AnalysisSentiment Classification
Paper
PreviousPage 328 of 28782Next