TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hate speech detection using static BERT embeddings

Hate speech detection using static BERT embeddings

Gaurav Rajput, Narinder Singh Punn, Sanjay Kumar Sonbhadra, Sonali Agarwal

2021-06-29Hate Speech DetectionWord EmbeddingsSpecificity
PaperPDF

Abstract

With increasing popularity of social media platforms hate speech is emerging as a major concern, where it expresses abusive speech that targets specific group characteristics, such as gender, religion or ethnicity to spread violence. Earlier people use to verbally deliver hate speeches but now with the expansion of technology, some people are deliberately using social media platforms to spread hate by posting, sharing, commenting, etc. Whether it is Christchurch mosque shootings or hate crimes against Asians in west, it has been observed that the convicts are very much influenced from hate text present online. Even though AI systems are in place to flag such text but one of the key challenges is to reduce the false positive rate (marking non hate as hate), so that these systems can detect hate speech without undermining the freedom of expression. In this paper, we use ETHOS hate speech detection dataset and analyze the performance of hate speech detection classifier by replacing or integrating the word embeddings (fastText (FT), GloVe (GV) or FT + GV) with static BERT embeddings (BE). With the extensive experimental trails it is observed that the neural network performed better with static BE compared to using FT, GV or FT + GV as word embeddings. In comparison to fine-tuned BERT, one metric that significantly improved is specificity.

Results

TaskDatasetMetricValueModel
Abuse DetectionEthos BinaryClassification Accuracy0.8015BiLSTM + static BE
Abuse DetectionEthos BinaryF1-score0.7971BiLSTM + static BE
Abuse DetectionEthos BinaryPrecision0.8037BiLSTM + static BE
Hate Speech DetectionEthos BinaryClassification Accuracy0.8015BiLSTM + static BE
Hate Speech DetectionEthos BinaryF1-score0.7971BiLSTM + static BE
Hate Speech DetectionEthos BinaryPrecision0.8037BiLSTM + static BE

Related Papers

Fine-Grained Chinese Hate Speech Understanding: Span-Level Resources, Coded Term Lexicon, and Enhanced Detection Frameworks2025-07-15RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features2025-07-11Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Computational Detection of Intertextual Parallels in Biblical Hebrew: A Benchmark Study Using Transformer-Based Language Models2025-06-30PoseMaster: Generating 3D Characters in Arbitrary Poses from a Single Image2025-06-26IMC-PINN-FE: A Physics-Informed Neural Network for Patient-Specific Left Ventricular Finite Element Modeling with Image Motion Consistency and Biomechanical Parameter Estimation2025-06-25Predicting Readiness to Engage in Psychotherapy of People with Chronic Pain Based on their Pain-Related Narratives Saar2025-06-25On Context-Content Uncertainty Principle2025-06-25