Universal Language Model Fine-tuning for Text Classification

Jeremy Howard, Sebastian Ruder

2018-01-18ACL 2018 7Text Classification Sentiment Analysis Transfer Learning General Classification Language Modelling

Abstract

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We open-source our pretrained models and code.

Results

Task	Dataset	Metric	Value	Model
Sentiment Analysis	Yelp Fine-grained classification	Error	29.98	ULMFiT
Sentiment Analysis	Yelp Binary classification	Error	2.16	ULMFiT
Sentiment Analysis	IMDb	Accuracy	95.4	ULMFiT
Text Classification	TREC-6	Error	3.6	ULMFiT
Text Classification	DBpedia	Error	0.8	ULMFiT
Text Classification	AG News	Error	5.01	ULMFiT
Classification	TREC-6	Error	3.6	ULMFiT
Classification	DBpedia	Error	0.8	ULMFiT
Classification	AG News	Error	5.01	ULMFiT

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21 RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18 Making Language Model a Hierarchical Classifier and Generator2025-07-17 AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17 Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17 VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17 The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17 Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17