TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution

Meysam Alizadeh, Zeynab Samei, Daria Stetsenko, Fabrizio Gilardi

2025-06-01
Paper
Nearly-Linear Time Private Hypothesis Selection with the Optimal Approximation Factor

Maryam Aliakbarpour, Zhan Shi, Ria Stevens, Vincent X. Wang

2025-06-01
Paper
Unfolding Boxes with Local Constraints

Long Qian, Eric Wang, Bernardo Subercaseaux, Marijn J. H. Heule

2025-06-01
PaperCode
Regulatory Graphs and GenAI for Real-Time Transaction Monitoring and Compliance Explanation in Banking

Kunal Khanvilkar, Kranthi Kommuru

2025-06-01
Paper
Higher-Order Responsibility

Junli Jiang, Pavel Naumov

2025-06-01Decision MakingEthics
Paper
VUSA: Virtually Upscaled Systolic Array Architecture to Exploit Unstructured Sparsity in AI Acceleration

Shereef Helal, Alberto Garcia-Ortiz, Lennart Bamberg

2025-06-01
Paper
How Neural Networks Organize Concepts: Introducing Concept Trajectory Analysis for Deep Learning Interpretability

Andrew Smigaj

2025-06-01Independent Research 2025 6Bias Detection
PaperCode
Mamba Drafters for Speculative Decoding

Daewon Choi, Seunghyuk Oh, Saket Dingliwal, Jihoon Tack, KyuYoung Kim et al.

2025-06-01Large Language Model
Paper
Compress, Gather, and Recompute: REFORMing Long-Context Processing in Transformers

Woomin Song, Sai Muralidhar Jayanthi, Srikanth Ronanki, Kanthashree Mysore Sathyendra, Jinwoo Shin et al.

2025-06-01
Paper
ACCESS DENIED INC: The First Benchmark Environment for Sensitivity Awareness

Dren Fazlija, Arkadij Orlov, Sandipan Sikdar

2025-06-01BenchmarkingManagementNatural Language Queries
PaperCode
CoBRA: Quantifying Strategic Language Use and LLM Pragmatics

Anshun Asher Zheng, Junyi Jessy Li, David I. Beaver

2025-06-01
PaperCode
Generalizable LLM Learning of Graph Synthetic Data with Reinforcement Learning

Yizhuo Zhang, Heng Wang, Shangbin Feng, Zhaoxuan Tan, Xinyun Liu et al.

2025-06-01
Paper
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks

Yuntai Bao, Xuhong Zhang, Tianyu Du, Xinkui Zhao, Zhengwen Feng et al.

2025-06-01Question AnsweringNegationWorld Knowledge
PaperCode
LEMONADE: A Large Multilingual Expert-Annotated Abstractive Event Dataset for the Real World

Sina J. Semnani, Pingyue Zhang, Wanyue Zhai, Haozhuo Li, Ryan Beauchamp et al.

2025-06-01document understandingEntity LinkingEvent Extraction
PaperCode
RARE: Retrieval-Aware Robustness Evaluation for Retrieval-Augmented Generation Systems

Yixiao Zeng, Tianyu Cao, Danqing Wang, Xinran Zhao, Zimeng Qiu et al.

2025-06-01RetrievalRAG
PaperCode
Quantization-based Bounds on the Wasserstein Metric

Jonathan Bobrutsky, Amit Moscovich

2025-06-01QuantizationDomain AdaptationImage Retrieval
Paper
Mispronunciation Detection Without L2 Pronunciation Dataset in Low-Resource Setting: A Case Study in Finland Swedish

Nhan Phan, Mikko Kuronen, Maria Kautonen, Riikka Ullakonoja, Anna von Zansen et al.

2025-06-01
PaperCode
No Soundness in the Real World: On the Challenges of the Verification of Deployed Neural Networks

Attila Szász, Balázs Bánhelyi, Márk Jelasity

2025-06-01
PaperCode
MedBookVQA: A Systematic and Comprehensive Medical Benchmark Derived from Open-Access Book

Sau Lai Yip, Sunan He, Yuxiang Nie, Shu Pui Chan, Yilin Ye et al.

2025-06-01Benchmarking
PaperCode
IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory

Wei Song, Zhenya Huang, Cheng Cheng, Weibo Gao, Bihan Xu et al.

2025-06-01Semantic SimilaritySemantic Textual Similarity
PaperCode
PreviousPage 395 of 28782Next