Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks

Xiaoxing Mo, Yuxuan Cheng, Nan Sun, Leo Yu Zhang, Wei Luo et al.

2025-06-12 | backdoor defense
Paper
SoK: Evaluating Jailbreak Guardrails for Large Language Models

Xunguang Wang, Zhenlan Ji, Wenxuan Wang, Zongjie Li, Daoyuan Wu et al.

2025-06-12
Paper | Code
StepProof: Step-by-step verification of natural language mathematical proofs

Xiaolin Hu, Qinghua Zhou, Bogdan Grechuk, Ivan Y. Tyukin

2025-06-12 | Mathematical Proofs
Paper | Code
Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications

Felix Härer

2025-06-12 | Question Answering, Text Generation, Code Generation
Paper | Code
Equitable Mechanism Design for Facility Location

Toby Walsh

2025-06-12
Paper
SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

Kaiyuan Zhang, Siyuan Cheng, Hanxi Guo, Yuetian Chen, Zian Su et al.

2025-06-12
Paper | Code
HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

Jiaqi Lv, Xufeng He, Yanchen Liu, Xu Dai, Yang Hu et al.

2025-06-12 | Data Augmentation
Paper
Augmenting Large Language Models with Static Code Analysis for Automated Code Quality Improvements

Seyed Moein Abtahi, Akramul Azim

2025-06-12 | Prompt Engineering, RAG
Paper
Using Language and Road Manuals to Inform Map Reconstruction for Autonomous Driving

Akshar Tumu, Henrik I. Christensen, Marcell Vazquez-Chanlatte, Chikao Tsuchiya, Dhaval Bhanderi et al.

2025-06-12 | Autonomous Driving, Autonomous Navigation
Paper
Towards Understanding Bias in Synthetic Data for Evaluation

Hossein A. Rahmani, Varsha Ramineni, Nick Craswell, Bhaskar Mitra, Emine Yilmaz et al.

2025-06-12 | Information Retrieval
Paper | Code
RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding

Yisi Liu, Chenyang Wang, Hanjo Kim, Raniya Khan, Gopala Anumanchipalli et al.

2025-06-12 | Voice Conversion
Paper
Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations

Andrea Gaggioli, Sabrina Bartolotta, Andrea Ubaldi, Katusha Gerardini, Eleonora Diletta Sarcinella et al.

2025-06-12 | Ethics
Paper
GenPlanX. Generation of Plans and Execution

Daniel Borrajo, Giuseppe Canonaco, Tomás de la Rosa, Alfredo Garrachón, Sriram Gopalakrishnan et al.

2025-06-12
Paper
A Study on Individual Spatiotemporal Activity Generation Method Using MCP-Enhanced Chain-of-Thought Large Language Models

Yu Zhang, Yang Hu, De Wang

2025-06-12
Paper | Code
Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering

Adam Ishay, Zhun Yang, Joohyung Lee, Ilgu Kang, Dongjae Lim et al.

2025-06-12 | IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024 1 | Question Answering, Counterfactual Reasoning
Paper | Code
System ASPMT2SMT: Computing ASPMT Theories by SMT Solvers

Michael Bartholomew, Joohyung Lee

2025-06-12
Paper
Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL

Tom Westermann, Aljosha Köcher, Felix Gehlhoff

2025-06-12 | Large Language Model, Language Modelling
Paper | Code
Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning

Mohd Anwar Jamal Faiz

2025-06-12 | Benchmarking
Paper
LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs

Yanan Cai, Ahmed Salem, Besmira Nushi, Mark Russinovich

2025-06-12 | Relational Reasoning
Paper
OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics

Yaoming Zhu, junxin Wang, Yiyang Li, Lin Qiu, ZongYu Wang et al.

2025-06-12 | Benchmarking
Paper
Page 220 of 28782