TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

FormGym: Doing Paperwork with Agents

Matthew Toles, Rattandeep Singh, Isaac Song Zhou Yu

2025-06-17FormInformation RetrievalOptical Character Recognition (OCR)
Paper
Into the Unknown: Applying Inductive Spatial-Semantic Location Embeddings for Predicting Individuals' Mobility Beyond Visited Places

Xinglei Wang, Tao Cheng, Stephen Law, Zichao Zeng, Ilya Ilyankou et al.

2025-06-17Representation LearningPredictionContrastive Learning
PaperCode
Optimizing Length Compression in Large Reasoning Models

Zhengxiang Cheng, Dongping Chen, Mingyang Fu, Tianyi Zhou

2025-06-17
PaperCode
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Mingkang Zhu, Xi Chen, Zhongdao Wang, Bei Yu, Hengshuang Zhao et al.

2025-06-17
PaperCode
Improving LoRA with Variational Learning

Bai Cong, Nico Daheim, Yuesong Shen, Rio Yokota, Mohammad Emtiyaz Khan et al.

2025-06-17
Paper
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Xumeng Wen, Zihan Liu, Shun Zheng, Zhijian Xu, Shengyu Ye et al.

2025-06-17
Paper
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

Mathurin Videau, Badr Youbi Idrissi, Alessandro Leite, Marc Schoenauer, Olivier Teytaud et al.

2025-06-17Language Modelling
PaperCode
Reasoning with Exploration: An Entropy Perspective

Daixuan Cheng, Shaohan Huang, Xuekai Zhu, Bo Dai, Wayne Xin Zhao et al.

2025-06-17Reinforcement Learning
Paper
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Ling Team, Bin Hu, Cai Chen, Deng Zhao, Ding Liu et al.

2025-06-17Reinforcement LearningData IntegrationLarge Language Model
Paper
Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data

Anton Changalidis, Aki Härmä

2025-06-17Memorization
PaperCode
Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Daniel D'souza, Julia Kreutzer, Adrien Morisot, Ahmet Üstün, Sara Hooker et al.

2025-06-17Instruction FollowingPrompt Engineering
Paper
Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality

Yuto Harada, Yusuke Yamauchi, Yusuke Oda, Yohei Oseki, Yusuke Miyao et al.

2025-06-17Mathematical ReasoningCode Generation
Paper
GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors

Hengyuan Zhang, Xinrong Chen, Yingmin Qiu, Xiao Liang, Ziyue Li et al.

2025-06-17parameter-efficient fine-tuning
PaperCode
Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot

Xiang Cheng, Chengyan Pan, Minjun Zhao, Deyang Li, Fangchao Liu et al.

2025-06-17Mathematical Reasoning
Paper
When Does Meaning Backfire? Investigating the Role of AMRs in NLI

Junghyun Min, Xiulin Yang, Shira Wein

2025-06-17Natural Language Inference
Paper
GenerationPrograms: Fine-grained Attribution with Executable Programs

David Wan, Eran Hirsch, Elias Stengel-Eskin, Ido Dagan, Mohit Bansal et al.

2025-06-17Question AnsweringText GenerationLong Form Question Answering+2
PaperCode
AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs

Di He, Ajay Jaiswal, Songjun Tu, Li Shen, Ganzhao Yuan et al.

2025-06-17
PaperCode
M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models

Can Zheng, Jiguang He, Chung G. Kang, Guofa Cai, Zitong Yu et al.

2025-06-17PredictionBeam Prediction
Paper
How Far Can LLMs Improve from Experience? Measuring Test-Time Learning Ability in LLMs with Human Comparison

Jiayin Wang, Zhiquang Guo, Weizhi Ma, Min Zhang

2025-06-17
PaperCode
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Xiaoran Liu, Zhigeng Liu, Zengfeng Huang, Qipeng Guo, Ziwei He et al.

2025-06-17
PaperCode
PreviousPage 163 of 28782Next