TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Comparison of SVM against Pre-trained Language Models (P...

A Comparison of SVM against Pre-trained Language Models (PLMs) for Text Classification Tasks

Yasmen Wahba, Nazim Madhavji, John Steinbacher

2022-11-04Text ClassificationFeature Engineeringtext-classification
PaperPDF

Abstract

The emergence of pre-trained language models (PLMs) has shown great success in many Natural Language Processing (NLP) tasks including text classification. Due to the minimal to no feature engineering required when using these models, PLMs are becoming the de facto choice for any NLP task. However, for domain-specific corpora (e.g., financial, legal, and industrial), fine-tuning a pre-trained model for a specific task has shown to provide a performance improvement. In this paper, we compare the performance of four different PLMs on three public domain-free datasets and a real-world dataset containing domain-specific words, against a simple SVM linear classifier with TFIDF vectorized text. The experimental results on the four datasets show that using PLMs, even fine-tuned, do not provide significant gain over the linear SVM classifier. Hence, we recommend that for text classification tasks, traditional SVM along with careful feature engineering can pro-vide a cheaper and superior performance than PLMs.

Results

TaskDatasetMetricValueModel
Text Classification20NEWSAccuracy93LinearSVM+TFIDF
Text Classification20NEWSF-measure93LinearSVM+TFIDF
Classification20NEWSAccuracy93LinearSVM+TFIDF
Classification20NEWSF-measure93LinearSVM+TFIDF

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation2025-07-10Advancing Magnetic Materials Discovery -- A structure-based machine learning approach for magnetic ordering and magnetic moment prediction2025-07-02The Trilemma of Truth in Large Language Models2025-06-30Robustness of Misinformation Classification Systems to Adversarial Examples Through BeamAttack2025-06-30Prompt Mechanisms in Medical Imaging: A Comprehensive Survey2025-06-28Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market2025-06-26Temporal-Aware Graph Attention Network for Cryptocurrency Transaction Fraud Detection2025-06-26