Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series

Ching Chang, Jeehyun Hwang, Yidan Shi, Haixin Wang, Wen-Chih Peng et al.

2025-06-12 | Irregular Time Series, Time Series, Time Series Analysis
Paper
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning

Jikai Jin, Vasilis Syrgkanis, Sham Kakade, Hanlin Zhang

2025-06-12 | Mathematical Reasoning, Instruction Following, Representation Learning
Paper | Code
An Analysis of Datasets, Metrics and Models in Keyphrase Generation

Florian Boudin, Akiko Aizawa

2025-06-12 | Keyphrase Generation
Paper | Code
Provably Learning from Language Feedback

Wanqiao Xu, Allen Nie, Ruijie Zheng, Aditya Modi, Adith Swaminathan et al.

2025-06-12 | Large Language Model
Paper
Detecting Sockpuppetry on Wikipedia Using Meta-Learning

Luc Raszewski, Christine de Kock

2025-06-12 | Meta-Learning
Paper | Code
AC/DC: LLM-based Audio Comprehension via Dialogue Continuation

Yusuke Fujita, Tomoya Mizumoto, Atsushi Kojima, Lianbo Liu, Yui Sudo et al.

2025-06-12 | Question Answering, Instruction Following, Audio captioning
Paper
Discrete Audio Tokens: More Than a Survey!

Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi et al.

2025-06-12 | Quantization, Language Modelling
Paper
How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?

Sohee Yang, Sang-Woo Lee, Nora Kassner, Daniela Gottesman, Sebastian Riedel et al.

2025-06-12
Paper
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Yixin Ou, Yujie Luo, Jingsheng Zheng, Lanning Wei, Shuofei Qiao et al.

2025-06-12 | Large Language Model, Code Generation
Paper | Code
ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Kangwei Liu, Siyuan Cheng, Bozhong Tian, Xiaozhuan Liang, Yuyang Yin et al.

2025-06-12
Paper | Code
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training

Mozhi Zhang, Howe Tissue, Lu Wang, Xipeng Qiu

2025-06-12
Paper
Dynamic Epistemic Friction in Dialogue

Timothy Obiso, Kenneth Lai, Abhijnan Nath, Nikhil Krishnaswamy, James Pustejovsky et al.

2025-06-12
Paper
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Or Shafran, Atticus Geiger, Mor Geva

2025-06-12 | Dictionary Learning
Paper | Code
Magistral

Mistral-AI, Abhinav Rastogi, Albert Q. Jiang, Andy Lo et al.

2025-06-12 | Instruction Following, Reinforcement Learning
Paper
Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning

Lan Zhang, Marco Valentino, Andre Freitas

2025-06-12 | Mathematical Reasoning
Paper
BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP

Thomas Sounack, Joshua Davis, Brigitte Durieux, Antoine Chaffin, Tom J. Pollard et al.

2025-06-12 | Domain Adaptation
Paper | Code
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Yixiao Huang, Hanlin Zhu, Tianyu Guo, Jiantao Jiao, Somayeh Sojoudi et al.

2025-06-12 | Hallucination, Optical Character Recognition (OCR)
Paper
Slimming Down LLMs Without Losing Their Minds

Qingda, Mai

2025-06-12 | Mathematical Reasoning, GSM8K, Large Language Model, +2
Paper
Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment

Hongda Sun, Jiaren Peng, Wenzhong Yang, Liang He, Bo Du et al.

2025-06-12 | Dialogue Generation
Paper | Code
Analyzing the relationships between pretraining language, phonetic, tonal, and speaker information in self-supervised speech models

Michele Gubian, Ioana Krehan, Oli Liu, James Kirby, Sharon Goldwater et al.

2025-06-12
Paper