TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains

Juncheng Wu, Sheng Liu, Haoqin Tu, Hang Yu, Xiaoke Huang et al.

2025-06-02MathReinforcement Learning
Paper
EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation

Bingqian Lin, Yunshuang Nie, Khun Loun Zai, Ziming Wei, Mingfei Han et al.

2025-06-02Vision-Language NavigationNavigate
Paper
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

Yiyang Zhou, Yangfan He, Yaofeng Su, Siwei Han, Joel Jang et al.

2025-06-02Video UnderstandingAction RecognitionVision-Language-Action
Paper
Small Language Models are the Future of Agentic AI

Peter Belcak, Greg Heinrich, Shizhe Diao, Yonggan Fu, Xin Dong et al.

2025-06-02AI Agent
Paper
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

Yijun Yang, Zhao-Yang Wang, Qiuping Liu, Shuwen Sun, Kang Wang et al.

2025-06-02Survival AnalysisDecision MakingSpecificity
Paper
Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models

Yiwen Jiang, Deval Mehta, Wei Feng, ZongYuan Ge

2025-06-02Image Classification
Paper
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents

Bobo Li, Yuheng Wang, Hao Fei, Juncheng Li, Wei Ji et al.

2025-06-02BenchmarkingForm
Paper
PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization

Zouying Cao, Runze Wang, Yifei Yang, Xinbei Ma, Xiaoyong Zhu et al.

2025-06-02Large Language ModelLanguage Modelling
Paper
Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents

Manan Suri, Puneet Mathur, Nedim Lipka, Franck Dernoncourt, Ryan A. Rossi et al.

2025-06-02
Paper
LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback

Thai Hoang, Kung-Hsiang Huang, Shirley Kokane, JianGuo Zhang, Zuxin Liu et al.

2025-06-02Large Language Model
Paper
Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning

Yihong Tang, Kehai Chen, Muyun Yang, ZhengYu Niu, Jing Li et al.

2025-06-02
Paper
CVC: A Large-Scale Chinese Value Rule Corpus for Value Alignment of Large Language Models

Ping Wu, Guobin Shen, Dongcheng Zhao, Yuwei Wang, Yiting Dong et al.

2025-06-02Benchmarking
PaperCode
Introducing the PIT-plot -- a new tool in the portfolio manager's toolkit

Stig-Johan Wiklund, Magnus Ytterstad

2025-06-02Management
Paper
Embedded Acoustic Intelligence for Automotive Systems

Renjith Rajagopal, Peter Winzell, Sladjana Strbac, Konstantin Lindström, Petter Hörling et al.

2025-06-02Autonomous Driving
Paper
Can We Trust Machine Learning? The Reliability of Features from Open-Source Speech Analysis Tools for Speech Modeling

Tahiya Chowdhury, Veronica Romero

2025-06-02Fairness
Paper
MODS: Multi-source Observations Conditional Diffusion Model for Meteorological State Downscaling

Siwei Tu, Jingyi Xu, Weidong Yang, Lei Bai, Ben Fei et al.

2025-06-02
Paper
Alternates, Assemble! Selecting Optimal Alternates for Citizens' Assemblies

Angelos Assos, Carmel Baharav, Bailey Flanigan, Ariel Procaccia

2025-06-02
Paper
Cross-Lingual Transfer of Cultural Knowledge: An Asymmetric Phenomenon

Chen Zhang, Zhiyuan Liao, Yansong Feng

2025-06-02Cross-Lingual Transfer
Paper
ReFoCUS: Reinforcement-guided Frame Optimization for Contextual Understanding

Hosu Lee, Junho Kim, Hyunjun Kim, Yong Man Ro

2025-06-02
Paper
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Zhongwei Wan, Zhihao Dou, Che Liu, Yu Zhang, Dongfei Cui et al.

2025-06-02Reinforcement LearningMultimodal Reasoningreinforcement-learning
Paper
PreviousPage 373 of 28782Next