TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

Qinglin Zhu, Runcong Zhao, Hanqi Yan, Yulan He, Yudong Chen et al.

2025-05-30
Paper
A Simple Linear Patch Revives Layer-Pruned Large Language Models

Xinrui Chen, Haoli Bai, Tao Yuan, Ruikang Liu, Kang Zhao et al.

2025-05-30Question AnsweringKnowledge Distillation
Paper
TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis

Xiaorui Wu, Xiaofeng Mao, Fei Li, Xin Zhang, Xuanhong Li et al.

2025-05-30Safety AlignmentLarge Language ModelRed Teaming+1
PaperCode
Are Optimal Algorithms Still Optimal? Rethinking Sorting in LLM-Based Pairwise Ranking with Batching and Caching

Juan Wisznia, Cecilia Bolaños, Juan Tollo, Giovanni Marraffini, Agustín Gianolini et al.

2025-05-30
Paper
Disentangling Language and Culture for Evaluating Multilingual Large Language Models

Jiahao Ying, Wei Tang, Yiran Zhao, Yixin Cao, Yu Rong et al.

2025-05-30
Paper
Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization

Utsav Maskey, Chencheng Zhu, Usman Naseem

2025-05-30BenchmarkingNatural Language Understanding
Paper
When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation

Daniela Occhipinti, Marco Guerini, Malvina Nissim

2025-05-30Dialogue Generation
Paper
Explainable Depression Detection using Masked Hard Instance Mining

Patawee Prakrankamanant, Shinji Watanabe, Ekapol Chuangsuwanich

2025-05-30Depression Detection
Paper
GATE: General Arabic Text Embedding for Enhanced Semantic Textual Similarity with Matryoshka Representation Learning and Hybrid Loss Training

Omer Nacar, Anis Koubaa, Serry Sibaee, Yasser Al-Habashi, Adel Ammar et al.

2025-05-30Representation LearningNatural Language InferenceMTEB Benchmark+3
Paper
Improving Language and Modality Transfer in Translation by Character-level Modeling

Ioannis Tsiamas, David Dale, Marta R. Costa-jussà

2025-05-30Speech-to-Text TranslationSpeech-to-TextTransfer Learning+1
Paper
A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Xiaoang Xu, Shuo Wang, Xu Han, Zhenghao Liu, Huijia Wu et al.

2025-05-30Math
PaperCode
Don't Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections

Orfeas Menis Mastromichalakis, Jason Liartis, Kristina Rose, Antoine Isaac, Giorgos Stamou et al.

2025-05-30
PaperCode
DEEPQUESTION: Systematic Generation of Real-World Challenges for Evaluating LLMs Performance

Ali Khoramfar, Ali Ramezani, Mohammad Mahdi Mohajeri, Mohammad Javad Dousti, Majid Nili Ahmadabadi et al.

2025-05-30
Paper
Limited-Resource Adapters Are Regularizers, Not Linguists

Marcell Fekete, Nathaniel R. Robinson, Ernests Lavrinovics, E. Djeride Jean-Baptiste, Raj Dabre et al.

2025-05-30Machine TranslationCross-Lingual Transfer
Paper
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

Emilio Villa-Cueva, Sholpan Bolatzhanova, Diana Turmakhan, Kareem Elzeky, Henok Biadglign Ademtew et al.

2025-05-30Machine TranslationBenchmarkingMultimodal Machine Translation+1
Paper
Domain Pre-training Impact on Representations

Cesar Gonzalez-Gutierrez, Ariadna Quattoni

2025-05-30
Paper
When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways

Kailin Jiang, Yuntao Du, Yukai Ding, Yuchen Ren, Ning Jiang et al.

2025-05-30Continual LearningInstruction FollowingImage Augmentation
PaperCode
Exploring the Impact of Occupational Personas on Domain-Specific QA

Eojin Kang, Jaehyuk Yu, Juae Kim

2025-05-30Question Answering
Paper
Donate or Create? Comparing Data Collection Strategies for Emotion-labeled Multimodal Social Media Posts

Christopher Bagdon, Aidan Combs, Carina Silberer, Roman Klinger

2025-05-30
Paper
MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs

Zhiwei Liu, Lingfei Qian, Qianqian Xie, Jimin Huang, Kailai Yang et al.

2025-05-30Emotion ClassificationSentiment Analysis
PaperCode
PreviousPage 415 of 28782Next