Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium

Xie Yi, Zhanke Zhou, Chentao Cao, Qiyu Niu, Tongliang Liu et al.

2025-06-09 | Hierarchical Reinforcement Learning
Paper | Code
RADAR: Benchmarking Language Models on Imperfect Tabular Data

Ken Gu, Zhihan Zhang, Kate Lin, Yuwei Zhang, Akshay Paruchuri et al.

2025-06-09 | Benchmarking
Paper | Code
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors

Wenlong Meng, Shuguo Fan, Chengkun Wei, Min Chen, Yuwei Li et al.

2025-06-09 | Benchmarking, Model extraction
Paper
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

Yifei Li, Hanane Nour Moussa, Ziru Chen, Shijie Chen, Botao Yu et al.

2025-06-09
Paper
Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning

Hanbing Liu, Lang Cao, Yuanyi Ren, Mengyu Zhou, Haoyu Dong et al.

2025-06-09 | Reinforcement Learning
Paper
"I Wrote, I Paused, I Rewrote" Teaching LLMs to Read Between the Lines of Student Writing

Samra Zafar, Shaheer Minhas, Syed Ali Hassan Zaidi, Arfa Naeem, Zahra Ali et al.

2025-06-09
Paper
LLM-BT-Terms: Back-Translation as a Framework for Terminology Standardization and Dynamic Semantic Embedding

Li Weigang, Pedro Carvalho Brom

2025-06-09 | Translation
Paper
Can Artificial Intelligence Write Like Borges? An Evaluation Protocol for Spanish Microfiction

Gerardo Aleman Manzanarez, Nora de la Cruz Arana, Jorge Garcia Flores, Yobany Garcia Medina, Raul Monroy et al.

2025-06-09
Paper
ETT-CKGE: Efficient Task-driven Tokens for Continual Knowledge Graph Embedding

Lijing Zhu, Qizhen Lan, Qing Tian, Wenbo Sun, Li Yang et al.

2025-06-09 | Knowledge Graph Embedding, Transfer Learning, Graph Embedding
Paper | Code
EconWebArena: Benchmarking Autonomous Agents on Economic Tasks in Realistic Web Environments

Zefang Liu, Yinzhu Quan

2025-06-09 | Benchmarking, Visual Grounding, Navigate
Paper
QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA

Jacob Dineen, Aswin RRV, Qin Liu, Zhikun Xu, Xiao Ye et al.

2025-06-09 | Large Language Model
Paper
Conservative Bias in Large Language Models: Measuring Relation Predictions

Toyin Aguda, Erik Wilson, Allan Anzagira, Simerjot Kaur, Charese Smiley et al.

2025-06-09 | Relation Extraction, Hallucination, Semantic Similarity, +1
Paper
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Junhong Shen, Hao Bai, Lunjun Zhang, Yifei Zhou, Amrith Setlur et al.

2025-06-09 | Reinforcement Learning
Paper | Code
ArchiLense: A Framework for Quantitative Analysis of Architectural Styles Based on Vision Large Language Models

Jing Zhong, Jun Yin, Peilin Li, Pengyu Zeng, Miao Zang et al.

2025-06-09 | Descriptive
Paper
MoE-MLoRA for Multi-Domain CTR Prediction: Efficient Adaptation with Expert Specialization

Ken Yaggel, Eyal German, Aviel Ben Siman Tov

2025-06-09 | Click-Through Rate Prediction, Recommendation Systems
Paper | Code
Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation

Jiaxiang Chen, Zhuo Wang, Mingxi Zou, Qifan Wang, Zenglin Xu et al.

2025-06-09 | Math, GSM8K, HumanEval
Paper
GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

JingChao Wang, Haote Yang, Jiang Wu, Yifan He, Xingjian Wei et al.

2025-06-09 | Image Captioning
Paper
Seeing Voices: Generating A-Roll Video from Audio with Mirage

Aditi Sundararaman, Amogh Adishesha, Andrew Jaegle, Dan Bigioi, Hyoung-Kyu Song et al.

2025-06-09 | Text to Speech, Speech Synthesis, text-to-speech, +1
Paper
Instruction-Tuned Video-Audio Models Elucidate Functional Specialization in the Brain

Subba Reddy Oota, Khushbu Pahwa, Prachi Jindal, Satya Sai Srinath Namburi, Maneesh Singh et al.

2025-06-09 | Disentanglement
Paper | Code
Sparse Interpretable Deep Learning with LIES Networks for Symbolic Regression

Mansooreh Montazerin, Majd Al Aawar, Antonio Ortega, Ajitesh Srivastava

2025-06-09 | regression, Symbolic Regression, Deep Learning
Paper | Code