TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Transformers are Short Text Classifiers: A Study of Induct...

Transformers are Short Text Classifiers: A Study of Inductive Short Text Classifiers on Benchmarks and Real-world Datasets

Fabian Karl, Ansgar Scherp

2022-11-30Text Classificationtext-classificationClassification
PaperPDFCode(official)

Abstract

Short text classification is a crucial and challenging aspect of Natural Language Processing. For this reason, there are numerous highly specialized short text classifiers. However, in recent short text research, State of the Art (SOTA) methods for traditional text classification, particularly the pure use of Transformers, have been unexploited. In this work, we examine the performance of a variety of short text classifiers as well as the top performing traditional text classifier. We further investigate the effects on two new real-world short text datasets in an effort to address the issue of becoming overly dependent on benchmark datasets with a limited number of characteristics. Our experiments unambiguously demonstrate that Transformers achieve SOTA accuracy on short text classification tasks, raising the question of whether specialized short text techniques are necessary.

Results

TaskDatasetMetricValueModel
Text ClassificationR8Accuracy98.451DeBERTa
Text ClassificationR8Accuracy98.28C-BERT (ESGNN + BERT)
Text ClassificationR8Accuracy98.23ESGNN
Text ClassificationR8Accuracy98.171BERT
Text ClassificationR8Accuracy98.09SGNN
Text ClassificationR8Accuracy98.041ERNIE 2.0
Text ClassificationR8Accuracy97.981DistilBERT
Text ClassificationR8Accuracy97.62ALBERTv2
Text ClassificationR8Accuracy96.98WideMLP
Text ClassificationR8Accuracy96.13fastText
Text ClassificationSearchsnippetsAccuracy89.69DistilBERT
Text ClassificationSearchsnippetsAccuracy88.2BERT
Text ClassificationSST-2Accuracy94.78DeBERTa
Text ClassificationSST-2Accuracy91.37BERT
Text ClassificationNICE-2Accuracy99.76RoBERTa
Text ClassificationMRAccuracy90.21DeBERTa
Text ClassificationMRAccuracy89.53ERNIE 2.0 (optimized)
Text ClassificationMRAccuracy89.42RoBERTa
Text ClassificationMRAccuracy88.97ERNIE 2.0
Text ClassificationMRAccuracy86.94BERT
Text ClassificationMRAccuracy86.02ALBERTv2
Text ClassificationMRAccuracy85.31DistilBERT
Text ClassificationTREC-10Accuracy99.4BERT
Text ClassificationSTOPS-41Accuracy89.73DeBERTa
Text ClassificationNICE-45Accuracy72.79BERT
Text ClassificationTwitterAccuracy99.97ERNIE 2.0
Text ClassificationTwitterAccuracy99.96BERT
Text ClassificationTwitterAccuracy99.96DistilBERT
Text ClassificationSTOPS-2STOPS-299.88ERNIE 2.0
ClassificationR8Accuracy98.451DeBERTa
ClassificationR8Accuracy98.28C-BERT (ESGNN + BERT)
ClassificationR8Accuracy98.23ESGNN
ClassificationR8Accuracy98.171BERT
ClassificationR8Accuracy98.09SGNN
ClassificationR8Accuracy98.041ERNIE 2.0
ClassificationR8Accuracy97.981DistilBERT
ClassificationR8Accuracy97.62ALBERTv2
ClassificationR8Accuracy96.98WideMLP
ClassificationR8Accuracy96.13fastText
ClassificationSearchsnippetsAccuracy89.69DistilBERT
ClassificationSearchsnippetsAccuracy88.2BERT
ClassificationSST-2Accuracy94.78DeBERTa
ClassificationSST-2Accuracy91.37BERT
ClassificationNICE-2Accuracy99.76RoBERTa
ClassificationMRAccuracy90.21DeBERTa
ClassificationMRAccuracy89.53ERNIE 2.0 (optimized)
ClassificationMRAccuracy89.42RoBERTa
ClassificationMRAccuracy88.97ERNIE 2.0
ClassificationMRAccuracy86.94BERT
ClassificationMRAccuracy86.02ALBERTv2
ClassificationMRAccuracy85.31DistilBERT
ClassificationTREC-10Accuracy99.4BERT
ClassificationSTOPS-41Accuracy89.73DeBERTa
ClassificationNICE-45Accuracy72.79BERT
ClassificationTwitterAccuracy99.97ERNIE 2.0
ClassificationTwitterAccuracy99.96BERT
ClassificationTwitterAccuracy99.96DistilBERT
ClassificationSTOPS-2STOPS-299.88ERNIE 2.0

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Safeguarding Federated Learning-based Road Condition Classification2025-07-16AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)2025-07-13GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10Fuzzy Classification Aggregation for a Continuum of Agents2025-07-06Hybrid-View Attention for csPCa Classification in TRUS2025-07-04