TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration

Kyohei Atarashi, Satoshi Oyama, Hiromi Arai, Hisashi Kashima

2025-06-12Decision Making
PaperCode
Measuring Semantic Information Production in Generative Diffusion Models

Florian Handke, Félix Koulischer, Gabriel Raya, Luca Ambrogioni

2025-06-12
Paper
Distributionally-Constrained Adversaries in Online Learning

Moïse Blanchard, Samory Kpotufe

2025-06-12
Paper
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning

Yuxuan Luo, Yuhui Yuan, Junwen Chen, Haonan Cai, Ziyi Yue et al.

2025-06-12Text-to-Image GenerationMultimodal ReasoningText to Image Generation+2
Paper
Build the web for agents, not agents for the web

Xing Han Lù, Gaurav Kamath, Marius Mosbach, Siva Reddy

2025-06-12Navigate
Paper
GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models

Evelyn Ma, Duo Zhou, Peizhi Niu, Huiting Zhou, huan zhang et al.

2025-06-12
Paper
VINCIE: Unlocking In-context Image Editing from Video

Leigang Qu, Feng Cheng, Ziyan Yang, Qi Zhao, Shanchuan Lin et al.

2025-06-12Story GenerationSegmentationPrediction
Paper
Robustly Improving LLM Fairness in Realistic Settings via Interpretability

Adam Karvonen, Samuel Marks

2025-06-12FairnessAttribute
PaperCode
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?

Fei Lin, Ziyang Gong, Cong Wang, Yonglin Tian, Tengchao Zhang et al.

2025-06-12
Paper
The Diffusion Duality

Subham Sekhar Sahoo, Justin Deschenaux, Aaron Gokaslan, Guanghan Wang, Justin Chiu et al.

2025-06-12Text Generation
PaperCode
VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Huaying Yuan, Zheng Liu, Junjie Zhou, Ji-Rong Wen, Zhicheng Dou et al.

2025-06-12Video Understanding
PaperCode
FASCIST-O-METER: Classifier for Neo-fascist Discourse Online

Rudy Alexandro Garrido Veliz, Martin Semmann, Chris Biemann, Seid Muhie Yimam

2025-06-12
Paper
Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering

Sai Prasanna Teja Reddy Bogireddy, Abrar Majeedi, Viswanatha Reddy Gajjala, Zhuoyan Xu, Siddhant Rai et al.

2025-06-12Question AnsweringAnswer Generation
Paper
TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving

Vincenzo Colle, Mohamed Sana, Nicola Piovesan, Antonio De Domenico, Fadhel Ayed et al.

2025-06-12Mathematical ReasoningLogical ReasoningMathematical Problem-Solving
Paper
Robust Unsupervised Adaptation of a Speech Recogniser Using Entropy Minimisation and Speaker Codes

Rogier C. van Dalen, Shucong Zhang, Titouan Parcollet, Sourav Bhattacharya

2025-06-12
Paper
Conversational Search: From Fundamentals to Frontiers in the LLM Era

Fengran Mo, Chuan Meng, Mohammad Aliannejadi, Jian-Yun Nie

2025-06-12Instruction FollowingConversational Search
Paper
Deep Learning-Based Digitization of Overlapping ECG Images with Open-Source Python Code

Reza Karbasi, Masoud Rahimi, Abdol-Hossein Vahabie, Hadi Moradi

2025-06-12
Paper
Encoding call-by-push-value in the pi-calculus

Benjamin Bennetzen, Nikolaj Rossander Kristensen, Peter Buus Steffensen

2025-06-12
Paper
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Yuhao Zhou, Yiheng Wang, Xuming He, Ruoyao Xiao, Zhiwei Li et al.

2025-06-12AttributeMultimodal ReasoningVisual Question Answering (VQA)
Paper
PAL: Probing Audio Encoders via LLMs -- A Study of Information Transfer from Audio Encoders to LLMs

Tony Alex, Wish Suharitdamrong, Sara Atito, Armin Mustafa, Philip J. B. Jackson et al.

2025-06-12
Paper
PreviousPage 230 of 28782Next