TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Xiaoran Liu, Siyang He, Qiqi Wang, Ruixiao Li, Yuerong Song et al.

2025-06-13
Paper
Lag-Relative Sparse Attention In Long Context Training

Manlai Liang, Wanyi Huang, Mandi Liu, Huaijun Li, Jinlong Li et al.

2025-06-13
Paper
Fast Bayesian Optimization of Function Networks with Partial Evaluations

Poompol Buathong, Peter I. Frazier

2025-06-13Bayesian OptimizationDrug Discovery
PaperCode
Dual-View Disentangled Multi-Intent Learning for Enhanced Collaborative Filtering

Shanfan Zhang, Yongyi Lin, Yuan Rao, Chenlong Zhang

2025-06-13DisentanglementCollaborative Filtering
PaperCode
A Watermark for Auto-Regressive Image Generation Models

Yihan Wu, Xuehao Cui, Ruibo Chen, Georgios Milis, Heng Huang et al.

2025-06-13Face SwappingImage Generation
Paper
VGR: Visual Grounded Reasoning

Jiacong Wang, Zijian Kang, Haochen Wang, Haiyong Jiang, Jiawen Li et al.

2025-06-13MathMultimodal Large Language ModelVisual Reasoning+1
Paper
DART: Distilling Autoregressive Reasoning to Silent Thought

Nan Jiang, Ziming Wu, De-Chuan Zhan, Fuming Lai, Shaobing Lian et al.

2025-06-13
Paper
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

Jaehoon Yun, Jiwoong Sohn, Jungwoo Park, Hyunjae Kim, Xiangru Tang et al.

2025-06-13Diagnostic
Paper
ReVeal: Self-Evolving Code Agents via Iterative Generation-Verification

Yiyang Jin, Kunzhao Xu, Hang Li, Xueting Han, Yanmin Zhou et al.

2025-06-13Reinforcement LearningCode Generationreinforcement-learning
Paper
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards

Jeff Da, Clinton Wang, Xiang Deng, Yuntao Ma, Nikhil Barhate et al.

2025-06-13MathNavigate
Paper
Visual Pre-Training on Unlabeled Images using Reinforcement Learning

Dibya Ghosh, Sergey Levine

2025-06-13Reinforcement Learningreinforcement-learning
PaperCode
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search

Zhenyu Hou, Ziniu Hu, Yujiang Li, Rui Lu, Jie Tang et al.

2025-06-13MathReinforcement Learningreinforcement-learning
PaperCode
LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment

Shikun Li, Shipeng Li, Zhiqin Yang, Xinghua Zhang, Gaode Chen et al.

2025-06-13Mathematical ReasoningReinforcement LearningGSM8K
Paper
Optimization of bi-directional gated loop cell based on multi-head attention mechanism for SSD health state classification model

Zhizhao Wen, Ruoxin Zhang, Chao Wang

2025-06-13Binary Classification
Paper
Analysis and Optimization of Probabilities of Beneficial Mutation and Crossover Recombination in a Hamming Space

Roman V. Belavkin

2025-06-13
Paper
Instruction and Solution Probabilities as Heuristics for Inductive Programming

Edward McDaid, Sarah McDaid

2025-06-13
Paper
Enhancing Clinical Decision Support and EHR Insights through LLMs and the Model Context Protocol: An Open-Source MCP-FHIR Framework

Abul Ehtesham, Aditi Singh, Saket Kumar

2025-06-13
Paper
Causality in the human niche: lessons for machine learning

Richard D. Lange, Konrad P. Kording

2025-06-13
Paper
Feedforward Ordering in Neural Connectomes via Feedback Arc Minimization

Soroush Vahidi

2025-06-13
Paper
A Hybrid Multi-Agent Prompting Approach for Simplifying Complex Sentences

Pratibha Zunjare, Michael Hsiao

2025-06-13
Paper
PreviousPage 209 of 28782Next