Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

A Novel Discrete Memristor-Coupled Heterogeneous Dual-Neuron Model and Its Application in Multi-Scenario Image Encryption

Yi Zou, Mengjiao Wang, Xinan Zhang, Herbert Ho-Ching Iu

2025-05-30
Paper
On the Scaling of Robustness and Effectiveness in Dense Retrieval

Yu-An Liu, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan et al.

2025-05-30 | Adversarial Robustness, Retrieval
Paper
Heterogeneous Graph Masked Contrastive Learning for Robust Recommendation

Lei Sang, Yu Wang, Yiwen Zhang

2025-05-30 | Contrastive Learning
Paper
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

Tajamul Ashraf, Amal Saqib, Hanan Ghani, Muhra AlMahri, Yuhao Li et al.

2025-05-30 | Math, Multimodal Reasoning, Autonomous Driving, +1
Paper | Code
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL

Yu Zhang, Yunqi Li, Yifan Yang, Rui Wang, Yuqing Yang et al.

2025-05-30 | Reinforcement Learning, Image Generation, Language Modelling
Paper | Code
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation

Yucheng Zhou, Jiahao Yuan, Qianning Wang

2025-05-30 | Benchmarking, All, Image Generation
Paper | Code
"Dyadosyncrasy", Idiosyncrasy and Demographic Factors in Turn-Taking

Julio Cesar Cavalcanti, Gabriel Skantze

2025-05-30
Paper
AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders

Yuqi Zhang, Yuchun Miao, Zuchao Li, Liang Ding

2025-05-30 | Response Generation
Paper
KEVER^2: Knowledge-Enhanced Visual Emotion Reasoning and Retrieval

Fanhang Man, Xiaoyue Chen, Huandong Wang, Baining Zhao, Han Li et al.

2025-05-30 | Retrieval, Emotion Recognition
Paper
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Junyu Zhang, Runpei Dong, Han Wang, Xuying Ning, Haoran Geng et al.

2025-05-30 | Answer Generation
Paper
Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks

Roksana Goworek, Haim Dubossarsky

2025-05-30 | Multilingual NLP, Cross-Lingual Transfer
Paper
How much do language models memorize?

John X. Morris, Chawin Sitawarin, Chuan Guo, Narine Kokhlikyan, G. Edward Suh et al.

2025-05-30 | Memorization, Language Modelling
Paper
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text

Li Yunhan, Wu Gengshen

2025-05-30 | Quantization
Paper | Code
Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?

Jiayu Liu, Qing Zong, Weiqi Wang, Yangqiu Song

2025-05-30 | Question Answering
Paper | Code
From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning

Haoyu Li, Xuhong LI, Yiming Dong, Kun Liu

2025-05-30 | Large Language Model, Language Modelling
Paper
LGAR: Zero-Shot LLM-Guided Neural Ranking for Abstract Screening in Systematic Literature Reviews

Christian Jaumann, Andreas Wiedholz, Annemarie Friedrich

2025-05-30 | Question Answering, Binary Classification
Paper | Code
Circuit Stability Characterizes Language Model Generalization

Alan Sun

2025-05-30 | Language Modelling
Paper | Code
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Shelly Bensal, Umar Jamil, Christopher Bryant, Melisa Russak, Kiran Kamble et al.

2025-05-30 | Math, Reinforcement Learning, reinforcement-learning
Paper
FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation

Junyu Luo, Zhizhuo Kou, Liming Yang, Xiao Luo, Jinsheng Huang et al.

2025-05-30 | Hallucination
Paper | Code
BPE Stays on SCRIPT: Structured Encoding for Robust Multilingual Pretokenization

Sander Land, Catherine Arnett

2025-05-30
Paper | Code