TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ETHOS: an Online Hate Speech Detection Dataset

ETHOS: an Online Hate Speech Detection Dataset

Ioannis Mollas, Zoe Chrysopoulou, Stamatis Karlos, Grigorios Tsoumakas

2020-06-11Hate Speech Detection
PaperPDFCode(official)

Abstract

Online hate speech is a recent problem in our society that is rising at a steady pace by leveraging the vulnerabilities of the corresponding regimes that characterise most social media platforms. This phenomenon is primarily fostered by offensive comments, either during user interaction or in the form of a posted multimedia context. Nowadays, giant corporations own platforms where millions of users log in every day, and protection from exposure to similar phenomena appears to be necessary in order to comply with the corresponding legislation and maintain a high level of service quality. A robust and reliable system for detecting and preventing the uploading of relevant content will have a significant impact on our digitally interconnected society. Several aspects of our daily lives are undeniably linked to our social profiles, making us vulnerable to abusive behaviours. As a result, the lack of accurate hate speech detection mechanisms would severely degrade the overall user experience, although its erroneous operation would pose many ethical concerns. In this paper, we present 'ETHOS', a textual dataset with two variants: binary and multi-label, based on YouTube and Reddit comments validated using the Figure-Eight crowdsourcing platform. Furthermore, we present the annotation protocol used to create this dataset: an active sampling procedure for balancing our data in relation to the various aspects defined. Our key assumption is that, even gaining a small amount of labelled data from such a time-consuming process, we can guarantee hate speech occurrences in the examined material.

Results

TaskDatasetMetricValueModel
Abuse DetectionEthos MultiLabelHamming Loss0.2948MLARAM
Abuse DetectionEthos MultiLabelHamming Loss0.1606MLkNN
Abuse DetectionEthos MultiLabelHamming Loss0.1395Binary Relevance
Abuse DetectionEthos MultiLabelHamming Loss0.132Neural Classifier Chains
Abuse DetectionEthos MultiLabelHamming Loss0.1097Neural Binary Relevance
Abuse DetectionEthos BinaryClassification Accuracy0.7664BERT
Abuse DetectionEthos BinaryF1-score0.7883BERT
Abuse DetectionEthos BinaryPrecision79.17BERT
Abuse DetectionEthos BinaryClassification Accuracy0.7734BiLSTM+Attention+FT
Abuse DetectionEthos BinaryF1-score0.768BiLSTM+Attention+FT
Abuse DetectionEthos BinaryPrecision77.76BiLSTM+Attention+FT
Abuse DetectionEthos BinaryClassification Accuracy0.7515CNN+Attention+FT+GV
Abuse DetectionEthos BinaryF1-score0.7441CNN+Attention+FT+GV
Abuse DetectionEthos BinaryPrecision74.92CNN+Attention+FT+GV
Abuse DetectionEthos BinaryClassification Accuracy0.6643SVM
Abuse DetectionEthos BinaryF1-score0.6607SVM
Abuse DetectionEthos BinaryPrecision66.47SVM
Abuse DetectionEthos BinaryClassification Accuracy0.6504Random Forests
Abuse DetectionEthos BinaryF1-score0.6441Random Forests
Abuse DetectionEthos BinaryPrecision64.69Random Forests
Hate Speech DetectionEthos MultiLabelHamming Loss0.2948MLARAM
Hate Speech DetectionEthos MultiLabelHamming Loss0.1606MLkNN
Hate Speech DetectionEthos MultiLabelHamming Loss0.1395Binary Relevance
Hate Speech DetectionEthos MultiLabelHamming Loss0.132Neural Classifier Chains
Hate Speech DetectionEthos MultiLabelHamming Loss0.1097Neural Binary Relevance
Hate Speech DetectionEthos BinaryClassification Accuracy0.7664BERT
Hate Speech DetectionEthos BinaryF1-score0.7883BERT
Hate Speech DetectionEthos BinaryPrecision79.17BERT
Hate Speech DetectionEthos BinaryClassification Accuracy0.7734BiLSTM+Attention+FT
Hate Speech DetectionEthos BinaryF1-score0.768BiLSTM+Attention+FT
Hate Speech DetectionEthos BinaryPrecision77.76BiLSTM+Attention+FT
Hate Speech DetectionEthos BinaryClassification Accuracy0.7515CNN+Attention+FT+GV
Hate Speech DetectionEthos BinaryF1-score0.7441CNN+Attention+FT+GV
Hate Speech DetectionEthos BinaryPrecision74.92CNN+Attention+FT+GV
Hate Speech DetectionEthos BinaryClassification Accuracy0.6643SVM
Hate Speech DetectionEthos BinaryF1-score0.6607SVM
Hate Speech DetectionEthos BinaryPrecision66.47SVM
Hate Speech DetectionEthos BinaryClassification Accuracy0.6504Random Forests
Hate Speech DetectionEthos BinaryF1-score0.6441Random Forests
Hate Speech DetectionEthos BinaryPrecision64.69Random Forests

Related Papers

Fine-Grained Chinese Hate Speech Understanding: Span-Level Resources, Coded Term Lexicon, and Enhanced Detection Frameworks2025-07-15Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?2025-06-15Towards Fairness Assessment of Dutch Hate Speech Detection2025-06-14ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese2025-06-11Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models2025-06-10Multilingual Hate Speech Detection in Social Media Using Translation-Based Approaches with Large Language Models2025-06-09Cracking the Code: Enhancing Implicit Hate Speech Detection through Coding Classification2025-06-05On Fairness of Task Arithmetic: The Role of Task Vectors2025-05-30