TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Differentiable Logic Cellular Automata: From Game of Life to Pattern Generation

Pietro Miotti, Eyvind Niklasson, Ettore Randazzo, Alexander Mordvintsev

2025-06-05
Paper
Towards a Multi-Agent Simulation of Cyber-attackers and Cyber-defenders Battles

Julien Soulé, Jean-Paul Jamont, Michel Occello, Paul Théron, Louis-Marie Traonouez et al.

2025-06-05
Paper
Safe Planning and Policy Optimization via World Model Learning

Artem Latyshev, Gregory Gorbov, Aleksandr I. Panov

2025-06-05Reinforcement LearningContinuous Control
Paper
Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Jiayu Wang, Yifei Ming, Zixuan Ke, Caiming Xiong, Shafiq Joty et al.

2025-06-05Mathematical ReasoningReinforcement Learningreinforcement-learning
Paper
Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling

Bihan Xu, Shiwei Zhao, Runze Wu, Zhenya Huang, Jiawei Wang et al.

2025-06-05Decision Making
Paper
E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction

Zhichao Yang, Jiashu He, Mohammad B. Al-Khasawneh, Darshan Pandit, Cirillo Cinzia et al.

2025-06-05severity predictionLarge Language ModelLanguage Modelling
Paper
Agents of Change: Self-Evolving LLM Agents for Strategic Planning

Nikolas Belle, Dakota Barnes, Alfonso Amayuelas, Ivan Bercovich, Xin Eric Wang et al.

2025-06-05
Paper
CHANCERY: Evaluating Corporate Governance Reasoning Capabilities in Language Models

Lucas Irwin, Arda Kaz, Peiyao Sheng, Sewoong Oh, Pramod Viswanath et al.

2025-06-05Binary Classification
Paper
Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation

Yuyang Wanyan, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang et al.

2025-06-05Multimodal ReasoningDecision Making
Paper
DeePoly: A High-Order Accuracy Scientific Machine Learning Framework for Function Approximation and Solving PDEs

Li Liu, Heng Yong

2025-06-05
PaperCode
OpenAg: Democratizing Agricultural Intelligence

Srikanth Thudumu, Jason Fisher

2025-06-05Knowledge GraphsTransfer Learning
Paper
Inference-Time Hyper-Scaling with KV Cache Compression

Adrian Łańcucki, Konrad Staniszewski, Piotr Nawrot, Edoardo M. Ponti

2025-06-05
Paper
Kinetics: Rethinking Test-Time Scaling Laws

Ranajoy Sadhukhan, Zhuoming Chen, Haizhong Zheng, Yang Zhou, Emma Strubell et al.

2025-06-05
PaperCode
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Yifan Sun, Jingyan Shen, Yibin Wang, Tianyu Chen, Zhendong Wang et al.

2025-06-05Reinforcement Learning
PaperCode
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Johannes von Oswald, Nino Scherrer, Seijin Kobayashi, Luca Versari, Songlin Yang et al.

2025-06-05Long-Context UnderstandingLanguage Modelling
PaperCode
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts

Danil Sivtsov, Ivan Rodkin, Gleb Kuzmin, Yuri Kuratov, Ivan Oseledets et al.

2025-06-05Scheduling
PaperCode
Mitigating Degree Bias Adaptively with Hard-to-Learn Nodes in Graph Contrastive Learning

Jingyu Hu, Hongbo Bo, Jun Hong, Xiaowei Liu, Weiru Liu et al.

2025-06-05Node ClassificationContrastive Learning
Paper
LLM-First Search: Self-Guided Exploration of the Solution Space

Nathan Herr, Tim Rocktäschel, Roberta Raileanu

2025-06-05
PaperCode
Dissecting Long Reasoning Models: An Empirical Study

Yongyu Mu, Jiali Zeng, Bei Li, Xinyan Guan, Fandong Meng et al.

2025-06-05Reinforcement Learning
PaperCode
From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs

Chantal Pellegrini, Ege Özsoy, David Bani-Harouni, Matthias Keicher, Nassir Navab et al.

2025-06-05
Paper
PreviousPage 325 of 28782Next