Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Sentiment Analysis
/
IMDb
Sentiment Analysis on IMDb
Metric: Accuracy (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
Accuracy (best first)
Accuracy (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Accuracy
▼
Extra Data
Paper
Date
↕
Code
1
RoBERTa-large with LlamBERT
96.68
No
LlamBERT: Large-scale low-cost data annotation i...
2024-03-23
Code
2
RoBERTa-large
96.54
No
LlamBERT: Large-scale low-cost data annotation i...
2024-03-23
Code
3
XLNet
96.21
Yes
XLNet: Generalized Autoregressive Pretraining fo...
2019-06-19
Code
4
Heinsen Routing + RoBERTa Large
96.2
No
An Algorithm for Routing Vectors in Sequences
2022-11-20
Code
5
RoBERTa-large 355M + Entailment as Few-shot Learner
96.1
No
Entailment as Few-Shot Learner
2021-04-29
Code
6
GraphStar
96
No
Graph Star Net for Generalized Multi-Task Learning
2019-06-21
Code
7
DV-ngrams-cosine with NB sub-sampling + RoBERTa.base
95.94
No
The Document Vectors Using Cosine Similarity Rev...
2022-05-26
Code
8
DV-ngrams-cosine + RoBERTa.base
95.92
No
The Document Vectors Using Cosine Similarity Rev...
2022-05-26
Code
9
Roberta_Large ST + Cosine Similarity Loss
95.9
No
-
-
-
10
BERT large finetune UDA
95.8
Yes
Unsupervised Data Augmentation for Consistency T...
2019-04-29
Code
11
BERT_large+ITPT
95.79
No
How to Fine-Tune BERT for Text Classification?
2019-05-14
Code
12
RoBERTa.base
95.79
Yes
The Document Vectors Using Cosine Similarity Rev...
2022-05-26
Code
13
L MIXED
95.68
No
Revisiting LSTM Networks for Semi-Supervised Tex...
2020-09-08
Code
14
BERT_base+ITPT
95.63
No
How to Fine-Tune BERT for Text Classification?
2019-05-14
Code
15
BERT large
95.49
No
Unsupervised Data Augmentation for Consistency T...
2019-04-29
Code
16
ULMFiT
95.4
No
Universal Language Model Fine-tuning for Text Cl...
2018-01-18
Code
17
Llama-2-70b-chat (0-shot)
95.39
Yes
LlamBERT: Large-scale low-cost data annotation i...
2024-03-23
Code
18
FLAN 137B (few-shot, k=2)
95
Yes
Finetuned Language Models Are Zero-Shot Learners
2021-09-03
Code
19
Block-sparse LSTM
94.99
No
-
-
Code
20
Space-XLNet
94.88
No
Breaking Free Transformer Models: Task-specific ...
2024-01-30
Code
21
CEN-tpc
94.52
No
Contextual Explanation Networks
2017-05-29
Code
22
FLAN 137B (zero-shot)
94.3
Yes
Finetuned Language Models Are Zero-Shot Learners
2021-09-03
Code
23
oh-LSTM
94.1
Yes
Supervised and Semi-Supervised Text Categorizati...
2016-02-07
-
24
Virtual adversarial training
94.1
No
Adversarial Training Methods for Semi-Supervised...
2016-05-25
Code
25
DV-ngrams-cosine + NB-weighted BON (re-evaluated)
93.68
No
The Document Vectors Using Cosine Similarity Rev...
2022-05-26
Code
26
Nyströmformer
93.2
No
Nyströmformer: A Nyström-Based Algorithm for App...
2021-02-07
Code
27
Modified LMU (34M)
93.2
No
Parallelizing Legendre Memory Unit Training
2021-02-22
Code
28
DV-ngrams-cosine
93.13
No
-
-
Code
29
OCaTS (kNN & GPT-3.5-turbo
93.06
No
Cache me if you Can: an Online Cost-aware Teache...
2023-10-20
Code
30
DistilBERT 66M
92.82
No
DistilBERT, a distilled version of BERT: smaller...
2019-10-02
Code
31
GPT-2 Finetuned
92.36
No
-
-
Code
32
seq2-bown-CNN
92.33
No
Effective Use of Word Order for Text Categorizat...
2014-12-01
Code
33
BP-Transformer + GloVe
92.12
No
BP-Transformer: Modelling Long-Range Context via...
2019-11-11
Code
34
BCN+Char+CoVe
91.8
No
Learned in Translation: Contextualized Word Vect...
2017-08-01
Code
35
ToWE-SG
90.8
No
-
-
Code
36
COSINE
90.54
No
Fine-Tuning Pre-trained Language Model with Weak...
2020-10-15
Code
37
LSTM with dynamic skip
90.1
No
Long Short-Term Memory with Dynamic Skip Connect...
2018-11-09
Code
38
CNN+LSTM
88.9
Yes
On the Role of Text Preprocessing in Neural Netw...
2017-07-06
Code
39
UnICORNN
88.4
No
UnICORNN: A recurrent model for learning very lo...
2021-03-09
Code
40
CfC
88.4
No
Closed-form Continuous-time Neural Models
2021-06-25
Code
41
Doc2VecC
88.3
No
Efficient Vector Representation for Documents th...
2017-07-08
Code
42
Bert+ Wilson-Cowan model RNN
87.46
No
Learning in Wilson-Cowan model for metapopulation
2024-06-24
Code
43
S-LSTM
87.15
Yes
Sentence-State LSTM for Text Representation
2018-05-07
Code
44
AlexNet [alexnet]
87
No
Classifying Textual Data with Pre-trained Vision...
2021-06-23
Code
45
VGG16 [vgg16]
86
No
Classifying Textual Data with Pre-trained Vision...
2021-06-23
Code
46
ResNext[resnext]
85
No
Classifying Textual Data with Pre-trained Vision...
2021-06-23
Code
47
Standard DR-AGG
45.1
Yes
Information Aggregation via Dynamic Routing for ...
2018-06-05
Code
48
Reverse DR-AGG
44.5
No
Information Aggregation via Dynamic Routing for ...
2018-06-05
Code
#1
RoBERTa-large with LlamBERT
SOTA
96.68
Accuracy
· 2024-03-23
LlamBERT: Large-scale low-cost data annotation in NLP
Code
#2
RoBERTa-large
96.54
Accuracy
· 2024-03-23
LlamBERT: Large-scale low-cost data annotation in NLP
Code
#3
XLNet
SOTA
96.21
Accuracy
· Extra Data
· 2019-06-19
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Code
#4
Heinsen Routing + RoBERTa Large
96.2
Accuracy
· 2022-11-20
An Algorithm for Routing Vectors in Sequences
Code
#5
RoBERTa-large 355M + Entailment as Few-shot Learner
96.1
Accuracy
· 2021-04-29
Entailment as Few-Shot Learner
Code
#6
GraphStar
96
Accuracy
· 2019-06-21
Graph Star Net for Generalized Multi-Task Learning
Code
#7
DV-ngrams-cosine with NB sub-sampling + RoBERTa.base
95.94
Accuracy
· 2022-05-26
The Document Vectors Using Cosine Similarity Revisited
Code
#8
DV-ngrams-cosine + RoBERTa.base
95.92
Accuracy
· 2022-05-26
The Document Vectors Using Cosine Similarity Revisited
Code
#9
Roberta_Large ST + Cosine Similarity Loss
95.9
Accuracy
No paper
#10
BERT large finetune UDA
SOTA
95.8
Accuracy
· Extra Data
· 2019-04-29
Unsupervised Data Augmentation for Consistency Training
Code
#11
BERT_large+ITPT
95.79
Accuracy
· 2019-05-14
How to Fine-Tune BERT for Text Classification?
Code
#12
RoBERTa.base
95.79
Accuracy
· Extra Data
· 2022-05-26
The Document Vectors Using Cosine Similarity Revisited
Code
#13
L MIXED
95.68
Accuracy
· 2020-09-08
Revisiting LSTM Networks for Semi-Supervised Text Classification via Mixed Objective Function
Code
#14
BERT_base+ITPT
95.63
Accuracy
· 2019-05-14
How to Fine-Tune BERT for Text Classification?
Code
#15
BERT large
95.49
Accuracy
· 2019-04-29
Unsupervised Data Augmentation for Consistency Training
Code
#16
ULMFiT
SOTA
95.4
Accuracy
· 2018-01-18
Universal Language Model Fine-tuning for Text Classification
Code
#17
Llama-2-70b-chat (0-shot)
95.39
Accuracy
· Extra Data
· 2024-03-23
LlamBERT: Large-scale low-cost data annotation in NLP
Code
#18
FLAN 137B (few-shot, k=2)
95
Accuracy
· Extra Data
· 2021-09-03
Finetuned Language Models Are Zero-Shot Learners
Code
#19
Block-sparse LSTM
94.99
Accuracy
No paper
Code
#20
Space-XLNet
94.88
Accuracy
· 2024-01-30
Breaking Free Transformer Models: Task-specific Context Attribution Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs
Code
#21
CEN-tpc
SOTA
94.52
Accuracy
· 2017-05-29
Contextual Explanation Networks
Code
#22
FLAN 137B (zero-shot)
94.3
Accuracy
· Extra Data
· 2021-09-03
Finetuned Language Models Are Zero-Shot Learners
Code
#23
oh-LSTM
SOTA
94.1
Accuracy
· Extra Data
· 2016-02-07
Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings
#24
Virtual adversarial training
94.1
Accuracy
· 2016-05-25
Adversarial Training Methods for Semi-Supervised Text Classification
Code
#25
DV-ngrams-cosine + NB-weighted BON (re-evaluated)
93.68
Accuracy
· 2022-05-26
The Document Vectors Using Cosine Similarity Revisited
Code
#26
Nyströmformer
93.2
Accuracy
· 2021-02-07
Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention
Code
#27
Modified LMU (34M)
93.2
Accuracy
· 2021-02-22
Parallelizing Legendre Memory Unit Training
Code
#28
DV-ngrams-cosine
93.13
Accuracy
No paper
Code
#29
OCaTS (kNN & GPT-3.5-turbo
93.06
Accuracy
· 2023-10-20
Cache me if you Can: an Online Cost-aware Teacher-Student framework to Reduce the Calls to Large Language Models
Code
#30
DistilBERT 66M
92.82
Accuracy
· 2019-10-02
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Code
#31
GPT-2 Finetuned
92.36
Accuracy
No paper
Code
#32
seq2-bown-CNN
SOTA
92.33
Accuracy
· 2014-12-01
Effective Use of Word Order for Text Categorization with Convolutional Neural Networks
Code
#33
BP-Transformer + GloVe
92.12
Accuracy
· 2019-11-11
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
Code
#34
BCN+Char+CoVe
91.8
Accuracy
· 2017-08-01
Learned in Translation: Contextualized Word Vectors
Code
#35
ToWE-SG
90.8
Accuracy
No paper
Code
#36
COSINE
90.54
Accuracy
· 2020-10-15
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
Code
#37
LSTM with dynamic skip
90.1
Accuracy
· 2018-11-09
Long Short-Term Memory with Dynamic Skip Connections
Code
#38
CNN+LSTM
88.9
Accuracy
· Extra Data
· 2017-07-06
On the Role of Text Preprocessing in Neural Network Architectures: An Evaluation Study on Text Categorization and Sentiment Analysis
Code
#39
UnICORNN
88.4
Accuracy
· 2021-03-09
UnICORNN: A recurrent model for learning very long time dependencies
Code
#40
CfC
88.4
Accuracy
· 2021-06-25
Closed-form Continuous-time Neural Models
Code
#41
Doc2VecC
88.3
Accuracy
· 2017-07-08
Efficient Vector Representation for Documents through Corruption
Code
#42
Bert+ Wilson-Cowan model RNN
87.46
Accuracy
· 2024-06-24
Learning in Wilson-Cowan model for metapopulation
Code
#43
S-LSTM
87.15
Accuracy
· Extra Data
· 2018-05-07
Sentence-State LSTM for Text Representation
Code
#44
AlexNet [alexnet]
87
Accuracy
· 2021-06-23
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations
Code
#45
VGG16 [vgg16]
86
Accuracy
· 2021-06-23
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations
Code
#46
ResNext[resnext]
85
Accuracy
· 2021-06-23
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations
Code
#47
Standard DR-AGG
45.1
Accuracy
· Extra Data
· 2018-06-05
Information Aggregation via Dynamic Routing for Sequence Encoding
Code
#48
Reverse DR-AGG
44.5
Accuracy
· 2018-06-05
Information Aggregation via Dynamic Routing for Sequence Encoding
Code