ColBERT: Using BERT Sentence Embedding in Parallel Neural Networks for Computational Humor

Issa Annamoradnejad, Gohar Zoghi

2020-04-27Humor Detection Sentence Embedding Sentence Embeddings Sentence-Embedding

Abstract

Automation of humor detection and rating has interesting use cases in modern technologies, such as humanoid robots, chatbots, and virtual assistants. In this paper, we propose a novel approach for detecting and rating humor in short texts based on a popular linguistic theory of humor. The proposed technical method initiates by separating sentences of the given text and utilizing the BERT model to generate embeddings for each one. The embeddings are fed to separate lines of hidden layers in a neural network (one line for each sentence) to extract latent features. At last, the parallel lines are concatenated to determine the congruity and other relationships between the sentences and predict the target value. We accompany the paper with a novel dataset for humor detection consisting of 200,000 formal short texts. In addition to evaluating our work on the novel dataset, we participated in a live machine learning competition focused on rating humor in Spanish tweets. The proposed model obtained F1 scores of 0.982 and 0.869 in the humor detection experiments which outperform general and state-of-the-art models. The evaluation performed on two contrasting settings confirm the strength and robustness of the model and suggests two important factors in achieving high accuracy in the current task: 1) usage of sentence embeddings and 2) utilizing the linguistic structure of humor in designing the proposed model.

Results

Task	Dataset	Metric	Value	Model
Humor Detection	200k Short Texts for Humor Detection	F1-score	0.982	ColBERT model
Humor Detection	200k Short Texts for Humor Detection	F1-score	0.882	Multinomial NB
Humor Detection	200k Short Texts for Humor Detection	F1-score	0.874	SVM
Humor Detection	200k Short Texts for Humor Detection	F1-score	0.794	Decision Tree

Related Papers

From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment2025-07-20 SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17 Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation2025-06-25 Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support2025-06-19 Factors affecting the in-context learning abilities of LLMs for dialogue state tracking2025-06-10 Quality-Diversity Red-Teaming: Automated Generation of High-Quality and Diverse Attackers for Large Language Models2025-06-08 Mechanistic Decomposition of Sentence Representations2025-06-04 Rethinking the Understanding Ability across LLMs through Mutual Information2025-05-25