Not All Jokes Land: Evaluating Large Language Models Understanding of Workplace Humor

Mohammadamin Shafiei, Hamidreza Saffari

2025-06-02All

Abstract

With the recent advances in Artificial Intelligence (AI) and Large Language Models (LLMs), the automation of daily tasks, like automatic writing, is getting more and more attention. Hence, efforts have focused on aligning LLMs with human values, yet humor, particularly professional industrial humor used in workplaces, has been largely neglected. To address this, we develop a dataset of professional humor statements along with features that determine the appropriateness of each statement. Our evaluation of five LLMs shows that LLMs often struggle to judge the appropriateness of humor accurately.

Related Papers

Modeling Code: Is Text All You Need?2025-07-15 All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15 Is Diversity All You Need for Scalable Robotic Manipulation?2025-07-08 DESIGN AND IMPLEMENTATION OF ONLINE CLEARANCE REPORT.2025-07-07 Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models2025-07-03 Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays2025-07-01 State and Memory is All You Need for Robust and Reliable AI Agents2025-06-30 EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27