TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

EAR: Erasing Concepts from Unified Autoregressive Models

Haipeng Fan, Shiyuan Zhang, Baohunesitu, Zihang Guo, Huaiwen Zhang et al.

2025-06-25Image Generation
PaperCode
CCRS: A Zero-Shot LLM-as-a-Judge Framework for Comprehensive RAG Evaluation

Aashiq Muhamed

2025-06-25RAG
Paper
BrokenVideos: A Benchmark Dataset for Fine-Grained Artifact Localization in AI-Generated Videos

Jiahao Lin, Weixuan Peng, Bojia Zi, Yifeng Gao, Xianbiao Qi et al.

2025-06-25Artifact DetectionBenchmarkingVideo Generation
Paper
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations

Vardhan Dongre, Chi Gui, Shubham Garg, Hooshang Nayyeri, Gokhan Tur et al.

2025-06-25World Knowledge
PaperCode
SACL: Understanding and Combating Textual Bias in Code Retrieval with Semantic-Augmented Reranking and Localization

Dhruv Gupta, Gayathri Ganesh Lakshmy, Yiqing Xie

2025-06-25RerankingCode GenerationRetrieval+1
Paper
A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs

Kethmi Hirushini Hettige, Jiahao Ji, Cheng Long, Shili Xiang, Gao Cong et al.

2025-06-25Natural Language Queries
Paper
The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind

Andrei Lupu, Timon Willi, Jakob Foerster

2025-06-25NavigateMulti-agent Reinforcement Learning
PaperCode
Towards Community-Driven Agents for Machine Learning Engineering

Sijie Li, Weiwei Sun, Shanda Li, Ameet Talwalkar, Yiming Yang et al.

2025-06-25Large Language ModelLanguage Modelling
PaperCode
AI Assistants to Enhance and Exploit the PETSc Knowledge Base

Barry Smith, Junchao Zhang, Hong Zhang, Lois Curfman McInnes, Murat Keceli et al.

2025-06-25RerankingRAG
Paper
CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video

Wengxi Li, Roy Pea, Nick Haber, Hari Subramonyam

2025-06-25Video SegmentationVideo Semantic SegmentationKnowledge Tracing
Paper
Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges

Alexander D. Kalian, Jaewook Lee, Stefan P. Johannesson, Lennart Otte, Christer Hogstrand et al.

2025-06-25Prompt EngineeringRAG
PaperCode
Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios

Wenbin Gan, Minh-Son Dao, Koji Zettsu

2025-06-25Scene UnderstandingDecision MakingAutonomous Driving+4
Paper
Engineering Sentience

Konstantin Demin, Taylor Webb, Eric Elmoznino, Hakwan Lau

2025-06-25
Paper
Mixtures of Neural Cellular Automata: A Stochastic Framework for Growth Modelling and Self-Organization

Salvatore Milite, Giulio Caravagna, Andrea Sottoriva

2025-06-25Semantic SegmentationImage Segmentation
Paper
GymPN: A Library for Decision-Making in Process Management Systems

Riccardo Lo Bianco, Willem van Jaarsveld, Remco Dijkman

2025-06-25Decision MakingManagement
Paper
Smart Ride and Delivery Services with Electric Vehicles: Leveraging Bidirectional Charging for Profit Optimisation

Jinchun Du, Bojie Shen, Muhammad Aamir Cheema, Adel N. Toosi

2025-06-25
Paper
Paladin-mini: A Compact and Efficient Grounding Model Excelling in Real-World Scenarios

Dror Ivry, Oran Nahum

2025-06-25
Paper
Tabular Feature Discovery With Reasoning Type Exploration

Sungwon Han, Sungkyu Park, Seungeon Lee

2025-06-25Feature Engineering
Paper
Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards

Jihao Gu, Qihang Ai, Yingyao Wang, Pi Bu, Jingxuan Xing et al.

2025-06-25Reinforcement Learningreinforcement-learning
Paper
Enterprise Large Language Model Evaluation Benchmark

Liya Wang, David Yi, Damien Jose, John Passarelli, James Gao et al.

2025-06-25Large Language ModelMMLULanguage Modelling
Paper
PreviousPage 97 of 28782Next