TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Generalized Category Discovery in Event-Centric Contexts: Latent Pattern Mining with LLMs

Yi Luo, Qiwen Wang, Junqi Yang, Luyao Tang, Zhenghao Lin et al.

2025-05-29
Paper
Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs

Julia Belikova, Konstantin Polev, Rauf Parchiev, Dmitry Simakov

2025-05-29Question AnsweringDimensionality ReductionHallucination+1
Paper
EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian

Daryna Dementieva, Nikolay Babakov, Alexander Fraser

2025-05-29Emotion Classification
Paper
ScEdit: Script-based Assessment of Knowledge Editing

Xinye Li, Zunwen Zheng, Qian Zhang, Dekai Zhuang, Jiabao Kang et al.

2025-05-29knowledge editing
PaperCode
Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective

Yong Zhang, Yanwen Huang, Ning Cheng, Yang Guo, Yun Zhu et al.

2025-05-29RAG
PaperCode
The Arabic AI Fingerprint: Stylometric Analysis and Detection of Large Language Models Text

Maged S. Al-shaibani, Moataz Ahmed

2025-05-29Misinformation
PaperCode
Automatic Construction of Multiple Classification Dimensions for Managing Approaches in Scientific Papers

Bing Ma, Hai Zhuge

2025-05-29
Paper
ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering

Jingxuan Wei, Nan Xu, Junnan Zhu, Yanni Hao, Gaowei Wu et al.

2025-05-29Question AnsweringInstruction FollowingChart Question Answering+2
Paper
MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration

Zhitao He, Sandeep Polisetty, Zhiyuan Fan, Yuchen Huang, Shujin Wu et al.

2025-05-29Multimodal ReasoningHallucination
PaperCode
ExpeTrans: LLMs Are Experiential Transfer Learners

Jinglong Gao, Xiao Ding, Lingxiao Zou, Bibo Cai, Bing Qin et al.

2025-05-29
Paper
Infinite-Instruct: Synthesizing Scaling Code instruction Data with Bidirectional Synthesis and Static Verification

Wenjing Xing, Wenke Lu, Yeheng Duan, Bing Zhao, Zhenghui kang et al.

2025-05-29Code Generation
Paper
Map&Make: Schema Guided Text to Table Generation

Naman Ahuja, Fenil Bardoliya, Chitta Baral, Vivek Gupta

2025-05-29HallucinationInformation Retrieval
Paper
Tell, Don't Show: Leveraging Language Models' Abstractive Retellings to Model Literary Themes

Li Lucy, Camilla Griffiths, Sarah Levine, Jennifer L. Eberhardt, Dorottya Demszky et al.

2025-05-29
PaperCode
Cross-Domain Bilingual Lexicon Induction via Pretrained Language Models

Qiuyu Ding, Zhiqiang Cao, Hailong Cao, Tiejun Zhao

2025-05-29Word EmbeddingsWord Translation
Paper
Enhancing Large Language Models'Machine Translation via Dynamic Focus Anchoring

Qiuyu Ding, Zhiqiang Cao, Hailong Cao, Tiejun Zhao

2025-05-29Machine TranslationTranslation
Paper
PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics

Atharva Naik, Darsh Agrawal, Manav Kapadnis, Yuwei An, Yash Mathur et al.

2025-05-29Math
Paper
ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations

Yiming Lei, Zhizheng Yang, Zeming Liu, Haitao Leng, Shaoguo Liu et al.

2025-05-29
Paper
Elicit and Enhance: Advancing Multimodal Reasoning in Medical Scenarios

Linjie Mu, Zhongzhen Huang, Yakun Zhu, Xiangyu Zhao, Shaoting Zhang et al.

2025-05-29Multimodal Reasoning
Paper
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data

Seohyeong Lee, Eunwon Kim, Hwaran Lee, Buru Chang

2025-05-29Large Language ModelLanguage Modelling
Paper
Generating Diverse Training Samples for Relation Extraction with Large Language Models

Zexuan Li, Hongliang Dai, Piji Li

2025-05-29Relation Extraction
Paper
PreviousPage 440 of 28782Next