TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge

Zeinab Sadat Taghavi, Ali Modarressi, Yunpu Ma, Hinrich Schütze

2025-06-17BenchmarkingSemantic SimilaritySemantic Textual Similarity+2
PaperCode
Digital Gatekeepers: Google's Role in Curating Hashtags and Subreddits

Amrit Poudel, Yifan Ding, Jurgen Pfeffer, Tim Weninger

2025-06-17
Paper
Evaluation Should Not Ignore Variation: On the Impact of Reference Set Choice on Summarization Metrics

Silvia Casola, Yang Janet Liu, Siyao Peng, Oliver Kraus, Albert Gatt et al.

2025-06-17
Paper
Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent

Xueyang Feng, Jingsen Zhang, Jiakai Tang, Wei Li, Guohao Cai et al.

2025-06-17Conversational Recommendation
Paper
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents

Seongbo Jang, Minjin Jeon, Jaehoon Lee, Seonghyeon Lee, Dongha Lee et al.

2025-06-17Large Language ModelLanguage ModellingResponse Generation
PaperCode
Re-Initialization Token Learning for Tool-Augmented Large Language Models

Chenghao Li, Liu Liu, Baosheng Yu, Jiayan Qiu, Yibing Zhan et al.

2025-06-17Question AnsweringGSM8K
PaperCode
A Multi-Expert Structural-Semantic Hybrid Framework for Unveiling Historical Patterns in Temporal Knowledge Graphs

Yimin Deng, Yuxia Wu, Yejing Wang, Guoshuai Zhao, Li Zhu et al.

2025-06-17Knowledge GraphsGraph structure learning
PaperCode
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team

Md Tanzib Hosain, Salman Rahman, Md Kishor Morol, Md Rizwan Parvez

2025-06-17Mathematical ReasoningMathGSM8K+1
PaperCode
Chaining Event Spans for Temporal Relation Grounding

Jongho Kim, Dohyeon Lee, Minsoo Kim, Seung-won Hwang

2025-06-17Reading ComprehensionRelation Extraction
PaperCode
Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation

Sina Abdidizaji, Md Kowsher, Niloofar Yousefi, Ivan Garibay

2025-06-17Data AugmentationMulti-Label Classification
Paper
CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation

Jia-Chen Zhang, Zheng Zhou, Yu-jie Xiong, Chun-Ming Xia, Fei Dai et al.

2025-06-17Tabular Data Generation
Paper
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

Jingxu Xie, Dylan Xu, Xuandong Zhao, Dawn Song

2025-06-17
PaperCode
Intended Target Identification for Anomia Patients with Gradient-based Selective Augmentation

Jongho Kim, Romain Storaï, Seung-won Hwang

2025-06-17
Paper
MAS-LitEval : Multi-Agent System for Literary Translation Quality Assessment

Junghwan Kim, Kieun Park, Sohee Park, Hyunggug Kim, Bongwon Suh et al.

2025-06-17Translation
Paper
GRAM: A Generative Foundation Reward Model for Reward Generalization

Chenglong Wang, Yang Gan, Yifu Huo, Yongyu Mu, Qiaozhi He et al.

2025-06-17
PaperCode
MIST: Towards Multi-dimensional Implicit Bias and Stereotype Evaluation of LLMs via Theory of Mind

Yanlin Li, Hao liu, Huimin Liu, Yinwei Wei, Yupeng Hu et al.

2025-06-17
Paper
S$^4$C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models

Tao He, Guang Huang, Yu Yang, Tianshi Xu, Sicheng Zhao et al.

2025-06-17Text Generation
Paper
DCRM: A Heuristic to Measure Response Pair Quality in Preference Optimization

Chengyu Huang, Tanya Goyal

2025-06-17
PaperCode
Essential-Web v1.0: 24T tokens of organized web data

Essential AI, :, Andrew Hojel, Michael Pust, Tim Romanski et al.

2025-06-17Math
PaperCode
Abstract Meaning Representation for Hospital Discharge Summarization

Paul Landes, Sitara Rao, Aaron Jeremy Chaise, Barbara Di Eugenio

2025-06-17Hallucination
PaperCode
PreviousPage 164 of 28782Next