IMDb Movie Reviews
TabularTextsUnknownIntroduced 2011-01-01
The IMDb Movie Reviews dataset is a binary sentiment analysis dataset consisting of 50,000 reviews from the Internet Movie Database (IMDb) labeled as positive or negative. The dataset contains an even number of positive and negative reviews. Only highly polarizing reviews are considered. A negative review has a score ≤ 4 out of 10, and a positive review has a score ≥ 7 out of 10. No more than 30 reviews are included per movie. The dataset contains additional unlabeled data.
Source: http://nlpprogress.com/english/sentiment_analysis.html Image Source: Maas et al
Benchmarks
Classification/AUCClassification/Accuracy (2 classes)Classification/F1 MacroData Mining/AccuracyData Mining/F1Interpretable Machine Learning/AccuracyInterpretable Machine Learning/F1Sentiment Analysis/Accuracy (2 classes)Sentiment Analysis/F1 MacroText Classification/AUCText Classification/Accuracy (2 classes)Text Classification/F1 Macro