Alexander Ratner, Braden Hancock, Jared Dunnmon, Frederic Sala, Shreyash Pandey, Christopher Ré
Snorkel MeTaL: A framework for training models with multi-task weak supervision
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Natural Language Inference | MultiNLI | Matched | 87.6 | Snorkel MeTaL (ensemble) |
| Natural Language Inference | MultiNLI | Mismatched | 87.2 | Snorkel MeTaL (ensemble) |
| Semantic Textual Similarity | Quora Question Pairs | Accuracy | 89.9 | Snorkel MeTaL (ensemble) |
| Semantic Textual Similarity | Quora Question Pairs | F1 | 73.1 | Snorkel MeTaL (ensemble) |
| Sentiment Analysis | SST-2 Binary classification | Accuracy | 96.2 | Snorkel MeTaL (ensemble) |
| Paraphrase Identification | Quora Question Pairs | Accuracy | 89.9 | Snorkel MeTaL (ensemble) |
| Paraphrase Identification | Quora Question Pairs | F1 | 73.1 | Snorkel MeTaL (ensemble) |