Reuters-21578
Graphs
Custom (research-only, attribution)
The Reuters-21578 dataset is a collection of news articles. The original corpus has 10,369 documents and a vocabulary of 29,930 words.
Source: Topic Model Based Multi-Label Classification from the Crowd
Benchmarks
Anomaly Detection/AUC (outlier ratio = 0.5)
Classification/Accuracy
Classification/F1
Classification/Micro-F1
Document Classification/Accuracy
Document Classification/F1
Multi-Label Text Classification/Micro-F1
Retrieval/Precision@100
Text Classification/Accuracy
Text Classification/F1
Text Classification/Micro-F1
Unsupervised Anomaly Detection/AUC (outlier ratio = 0.5)