TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DocBERT: BERT for Document Classification

DocBERT: BERT for Document Classification

Ashutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin

2019-04-17Text ClassificationSentiment AnalysisDocument ClassificationGeneral ClassificationClassification
PaperPDFCodeCode(official)Code

Abstract

We present, to our knowledge, the first application of BERT to document classification. A few characteristics of the task might lead one to think that BERT is not the most appropriate model: syntactic structures matter less for content categories, documents can often be longer than typical BERT input, and documents often have multiple labels. Nevertheless, we show that a straightforward classification model using BERT is able to achieve the state of the art across four popular datasets. To address the computational expense associated with BERT inference, we distill knowledge from BERT-large to small bidirectional LSTMs, reaching BERT-base parity on multiple datasets using 30x fewer parameters. The primary contribution of our paper is improved baselines that can provide the foundation for future work.

Results

TaskDatasetMetricValueModel
Text ClassificationarXiv-10Accuracy0.764DocBERT
Text ClassificationReuters-21578F188.9KD-LSTMreg
Text ClassificationAAPDF172.9KD-LSTMreg
Text ClassificationYelp-14Accuracy69.4KD-LSTMreg
Document ClassificationReuters-21578F188.9KD-LSTMreg
Document ClassificationAAPDF172.9KD-LSTMreg
Document ClassificationYelp-14Accuracy69.4KD-LSTMreg
ClassificationarXiv-10Accuracy0.764DocBERT
ClassificationReuters-21578F188.9KD-LSTMreg
ClassificationAAPDF172.9KD-LSTMreg
ClassificationYelp-14Accuracy69.4KD-LSTMreg

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Safeguarding Federated Learning-based Road Condition Classification2025-07-16AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles2025-07-15DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15SentiDrop: A Multi Modal Machine Learning model for Predicting Dropout in Distance Learning2025-07-14