NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training

Priyanshu Kumar, Aadarsh Singh

2020-10-09 | EMNLP (WNUT) 2020 11 | Text Classification | Task 2

Abstract

We experiment with COVID-Twitter-BERT and RoBERTa models to identify informative COVID-19 tweets. We further experiment with adversarial training to make our models robust. The ensemble of COVID-Twitter-BERT and RoBERTa obtains an F1-score of 0.9096 (on the positive class) on the test data of WNUT-2020 Task 2 and ranks 1st on the leaderboard. The ensemble of the models trained using adversarial training produces a similar result.
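As a rough illustration of the evaluation described above, the sketch below combines two models' outputs and scores them with F1 on the positive class. The abstract does not say how the ensemble is formed, so averaging the positive-class probabilities is an assumption here, as is the 0.5 decision threshold. The function names (`ensemble_probs`, `positive_f1`) and all probabilities and labels are made up for the example and are not taken from the paper.

```python
from typing import List, Sequence


def ensemble_probs(prob_sets: Sequence[Sequence[float]]) -> List[float]:
    """Average per-example positive-class probabilities across models.

    Assumption: simple unweighted averaging; the paper's actual
    combination rule is not stated in the abstract.
    """
    n_models = len(prob_sets)
    return [sum(column) / n_models for column in zip(*prob_sets)]


def positive_f1(y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1) -> float:
    """F1-score computed for the positive class only."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both 0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical positive-class probabilities for four tweets.
ct_bert_probs = [0.9, 0.4, 0.2, 0.7]   # stand-in for COVID-Twitter-BERT outputs
roberta_probs = [0.8, 0.7, 0.1, 0.2]   # stand-in for RoBERTa outputs
labels = [1, 1, 0, 0]                  # 1 = informative, 0 = uninformative

avg_probs = ensemble_probs([ct_bert_probs, roberta_probs])
preds = [1 if p >= 0.5 else 0 for p in avg_probs]  # assumed threshold of 0.5
score = positive_f1(labels, preds)
print(preds, score)
```

In this toy data each model is wrong or unsure on a different tweet, and averaging pulls the combined probability to the correct side of the threshold, which is the usual motivation for ensembling models with different pretraining.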

Results

Task	Dataset	Metric	Value	Model
Text Classification	WNUT-2020 Task 2	F1	0.9096	NutCracker
Classification	WNUT-2020 Task 2	F1	0.9096	NutCracker
