ColBERT: Using BERT Sentence Embedding in Parallel Neural Networks for Computational Humor

Issa Annamoradnejad, Gohar Zoghi

2020-04-27 | Humor Detection | Sentence Embeddings

Abstract

Automation of humor detection and rating has interesting use cases in modern technologies, such as humanoid robots, chatbots, and virtual assistants. In this paper, we propose a novel approach for detecting and rating humor in short texts based on a popular linguistic theory of humor. The proposed method begins by separating the sentences of the given text and using the BERT model to generate an embedding for each one. The embeddings are fed to separate lines of hidden layers in a neural network (one line for each sentence) to extract latent features. Finally, the parallel lines are concatenated to determine the congruity and other relationships between the sentences and to predict the target value. We accompany the paper with a novel dataset for humor detection consisting of 200,000 formal short texts. In addition to evaluating our work on the novel dataset, we participated in a live machine learning competition focused on rating humor in Spanish tweets. The proposed model obtained F1 scores of 0.982 and 0.869 in the humor detection experiments, outperforming general and state-of-the-art models. The evaluations performed in two contrasting settings confirm the strength and robustness of the model and suggest two important factors in achieving high accuracy in the current task: 1) the use of sentence embeddings and 2) the use of the linguistic structure of humor in designing the proposed model.
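The pipeline described in the abstract (split into sentences, embed each sentence, pass each embedding through its own line of hidden layers, concatenate, predict) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: `embed` is a hypothetical stand-in that returns a deterministic pseudo-random vector instead of a real BERT embedding, the layer sizes and the limit of three parallel lines are made up, and the weights are random and untrained, so the output score carries no meaning.

```python
import hashlib
import math
import random
import re

EMBED_DIM = 8      # assumption: toy size; real BERT embeddings are far larger
HIDDEN_DIM = 4     # assumption: width of each per-sentence hidden layer
MAX_SENTENCES = 3  # assumption: number of parallel lines in this sketch


def split_sentences(text):
    """Naive split after '.', '!' or '?' (placeholder for a real sentence tokenizer)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def embed(sentence):
    """Hypothetical stand-in for a BERT sentence embedding (deterministic pseudo-random vector)."""
    digest = hashlib.sha256(sentence.encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    return [rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)]


def relu(v):
    return max(0.0, v)


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def dense(weights, bias, x, activation):
    """Fully connected layer: activation(W x + b)."""
    return [activation(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, bias)]


def init_layer(rng, n_out, n_in):
    """Random (untrained) weights and zero biases."""
    weights = [[rng.gauss(0.0, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class ParallelSentenceModel:
    def __init__(self, seed=0):
        rng = random.Random(seed)
        # One separate hidden layer ("line") per sentence position.
        self.lines = [init_layer(rng, HIDDEN_DIM, EMBED_DIM)
                      for _ in range(MAX_SENTENCES)]
        # Output head that reads the concatenation of all lines.
        self.head = init_layer(rng, 1, HIDDEN_DIM * MAX_SENTENCES)

    def predict(self, text):
        sentences = split_sentences(text)[:MAX_SENTENCES]
        embeddings = [embed(s) for s in sentences]
        # Pad with zero vectors when the text has fewer sentences than lines.
        while len(embeddings) < MAX_SENTENCES:
            embeddings.append([0.0] * EMBED_DIM)
        features = []
        for (weights, bias), emb in zip(self.lines, embeddings):
            features.extend(dense(weights, bias, emb, relu))  # concatenate the lines
        return dense(*self.head, features, sigmoid)[0]


model = ParallelSentenceModel(seed=0)
joke = "Why did the chicken cross the road? To get to the other side."
score = model.predict(joke)
print(split_sentences(joke))
print(score)
```

Giving each sentence its own line before concatenation means the final layer sees per-sentence features side by side, which is what would let a trained model pick up relationships between sentences such as a setup followed by a punchline.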

Results

Task            | Dataset                              | Metric   | Value | Model
Humor Detection | 200k Short Texts for Humor Detection | F1-score | 0.982 | ColBERT model
Humor Detection | 200k Short Texts for Humor Detection | F1-score | 0.882 | Multinomial NB
Humor Detection | 200k Short Texts for Humor Detection | F1-score | 0.874 | SVM
Humor Detection | 200k Short Texts for Humor Detection | F1-score | 0.794 | Decision Tree
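
For readers unfamiliar with the metric reported above, the F1-score is the harmonic mean of precision and recall on the positive class. The sketch below computes it for a binary humor / not-humor labeling. The label lists are invented toy values for illustration only and are not drawn from the dataset or from any model listed in the table.

```python
def f1_score(y_true, y_pred, positive=1):
    """Binary F1: harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels (1 = humorous, 0 = not humorous); made up for this example.
y_true = [1, 1, 1, 0, 0, 1, 0, 0]
y_pred = [1, 1, 0, 0, 1, 1, 0, 0]
print(f1_score(y_true, y_pred))
```

In this toy case there are 3 true positives, 1 false positive, and 1 false negative, so precision and recall are both 0.75 and the F1-score is 0.75.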
