Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

A Trustworthiness-based Metaphysics of Artificial Intelligence Systems

Andrea Ferrario

2025-06-03
Paper
TestAgent: An Adaptive and Intelligent Expert for Human Assessment

Junhao Yu, Yan Zhuang, Yuxuan Sun, Weibo Gao, Qi Liu et al.

2025-06-03 | Question Selection, Large Language Model, Sociology
Paper
MAEBE: Multi-Agent Emergent Behavior Framework

Sinem Erisken, Timothy Gothard, Martin Leitgab, Ram Potham

2025-06-03
Paper
Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models

Ram Potham, Max Harms

2025-06-03 | Synthetic Data Generation
Paper
Causal Explainability of Machine Learning in Heart Failure Prediction from Electronic Health Records

Yina Hou, Shourav B. Rabbani, Liang Hong, Norou Diawara, Manar D. Samad et al.

2025-06-03 | Causal Discovery, Feature Importance
Paper
Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff

Sophie Greenwood, Karen Levy, Solon Barocas, Hoda Heidari, Jon Kleinberg et al.

2025-06-03 | AI Agent, Decision Making
Paper
Spatial Association Between Near-Misses and Accident Blackspots in Sydney, Australia: A Getis-Ord $G_i^*$ Analysis

Artur Grigorev, David Lillo-Trynes, Adriana-Simona Mihaita

2025-06-03 | Feature Importance
Paper
A Multimodal, Multilingual, and Multidimensional Pipeline for Fine-grained Crowdsourcing Earthquake Damage Evaluation

Zihui Ma, Lingyao Li, Juan Li, Wenyue Hua, Jingxiao Liu et al.

2025-06-03
Paper | Code
Impact of Rankings and Personalized Recommendations in Marketplaces

Omar Besbes, Yash Kanoria, Akshit Kumar

2025-06-03 | Navigate
Paper
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models

Xueqi Cheng, Minxing Zheng, Shixiang Zhu, Yushun Dong

2025-06-03 | Data Augmentation, Model Extraction
Paper | Code
A Review of Various Datasets for Machine Learning Algorithm-Based Intrusion Detection System: Advances and Challenges

Sudhanshu Sekhar Tripathy, Bichitrananda Behera

2025-06-03 | Intrusion Detection
Paper
VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents

Tri Cao, Bennett Lim, Yue Liu, Yuan Sui, Yuexin Li et al.

2025-06-03
Paper | Code
BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage

Kalyan Nakka, Nitesh Saxena

2025-06-03 | Safety Alignment, Prompt Engineering, Red Teaming
Paper | Code
CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale

Zhun Wang, Tianneng Shi, Jingxuan He, Matthew Cai, Jialin Zhang et al.

2025-06-03 | Large Language Model
Paper | Code
Rethinking Machine Unlearning in Image Generation Models

Renyang Liu, Wenjie Feng, Tianwei Zhang, Wei Zhou, Xueqi Cheng et al.

2025-06-03 | Benchmarking, Image Generation
Paper | Code
ATAG: AI-Agent Application Threat Assessment with Attack Graphs

Parth Atulbhai Gandhi, Akansha Shukla, David Tayouri, Beni Ifland, Yuval Elovici et al.

2025-06-03 | AI Agent
Paper
BadReward: Clean-Label Poisoning of Reward Models in Text-to-Image RLHF

Kaiwen Duan, Hongwei Yao, Yufei Chen, Ziyun Li, Tong Qiao et al.

2025-06-03
Paper
Generative AI for Predicting 2D and 3D Wildfire Spread: Beyond Physics-Based Models and Traditional Deep Learning

Haowen Xu, Sisi Zlatanova, Ruiyu Liang, Ismet Canbulat

2025-06-03
Paper
TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression

Zhong-Zhi Li, Xiao Liang, Zihao Tang, Lei Ji, Peijie Wang et al.

2025-06-03
Paper | Code
Enriching Location Representation with Detailed Semantic Information

Junyuan Liu, Xinglei Wang, Tao Cheng

2025-06-03 | Contrastive Learning
Paper