Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models

Feng Luo, Yu-Neng Chuang, Guanchu Wang, Hoang Anh Duy Le, Shaochen Zhong et al.

2025-05-28
Paper
THINK-Bench: Evaluating Thinking Efficiency and Chain-of-Thought Quality of Large Reasoning Models

Zhiyuan Li, Yi Chang, Yuan Wu

2025-05-28
Paper
RAG-Zeval: Towards Robust and Interpretable Evaluation on RAG Responses through End-to-End Rule-Guided Reasoning

Kun Li, Yunxiang Li, Tianhua Zhang, Hongyin Luo, Xixin Wu et al.

2025-05-28 | RAG
Paper
Judging LLMs on a Simplex

Patrick Vossler, Fan Xia, Yifan Mai, Jean Feng

2025-05-28 | Uncertainty Quantification, Bayesian Inference
Paper
Individualised Counterfactual Examples Using Conformal Prediction Intervals

James M. Adams, Gesine Reinert, Lukasz Szpruch, Carsten Maple, Andrew Elliott et al.

2025-05-28 | Prediction Intervals, Binary Classification, Data Augmentation, +1
Paper
Neuromorphic Sequential Arena: A Benchmark for Neuromorphic Temporal Processing

Xinyi Chen, Chenxiang Ma, Yujie Wu, Kay Chen Tan, Jibin Wu et al.

2025-05-28
Paper | Code
Learning World Models for Interactive Video Generation

Taiye Chen, Xun Hu, Zihan Ding, Chi Jin

2025-05-28 | Video Retrieval, Retrieval, Video Generation
Paper
StateSpaceDiffuser: Bringing Long Context to Diffusion World Models

Nedko Savov, Naser Kazemi, Deheng Zhang, Danda Pani Paudel, Xi Wang et al.

2025-05-28
Paper
Universal Visuo-Tactile Video Understanding for Embodied Interaction

Yifan Xie, Mingyang Li, Shoujie Li, Xingting Li, Guangyu Chen et al.

2025-05-28 | Text Generation, Large Language Model, Video Understanding
Paper
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Ganqu Cui, Yuchen Zhang, Jiacheng Chen, Lifan Yuan, Zhi Wang et al.

2025-05-28
Paper
Improving Out-of-Distribution Detection with Markov Logic Networks

Konstantin Kirchheim, Frank Ortmeier

2025-05-28 | Out of Distribution (OOD) Detection, Out-of-Distribution Detection
Paper
Triple Attention Transformer Architecture for Time-Dependent Concrete Creep Prediction

Warayut Dokduea, Weerachart Tangchirapat, Sompote Youwai

2025-05-28
Paper
EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles

Aakriti Agrawal, Mucong Ding, Zora Che, ChengHao Deng, Anirudh Satheesh et al.

2025-05-28 | Large Language Model, Language Modelling
Paper
Curse of High Dimensionality Issue in Transformer for Long-context Modeling

Shuhai Zhang, Zeng You, Yaofo Chen, Zhiquan Wen, Qianyue Wang et al.

2025-05-28
Paper | Code
From Motion to Behavior: Hierarchical Modeling of Humanoid Generative Behavior Control

Jusheng Zhang, Jinzhou Tang, Sidi Liu, Mingyan Li, Sheng Zhang et al.

2025-05-28 | Motion Planning, Motion Generation
Paper
Using LLMs to Advance the Cognitive Science of Collectives

Ilia Sucholutsky, Katherine M. Collins, Nori Jacoby, Bill D. Thompson, Robert D. Hawkins et al.

2025-05-28
Paper
Distributionally Robust Wireless Semantic Communication with Large AI Models

Long Tan Le, Senura Hansaja Wanasekara, Zerun Niu, Yansong Shi, Nguyen H. Tran et al.

2025-05-28 | Semantic Communication
Paper
DistMLIP: A Distributed Inference Platform for Machine Learning Interatomic Potentials

Kevin Han, Bowen Deng, Amir Barati Farimani, Gerbrand Ceder

2025-05-28 | Drug Discovery, graph partitioning
Paper | Code
Risks of AI-driven product development and strategies for their mitigation

Jan Göpfert, Jann M. Weinand, Patrick Kuckertz, Noah Pflugradt, Jochen Linßen et al.

2025-05-28
Paper
Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting

Wei Lin, Chenyang Zhao, Antoni B. Chan

2025-05-28 | CVPR 2025 1 | Crowd Counting, Unsupervised Domain Adaptation, Domain Adaptation
Paper | Code