TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/MED

MED

Monotonicity Entailment Dataset

Texts

MED is a new evaluation dataset that covers a wide range of monotonicity reasoning that was created by crowdsourcing and collected from linguistics publications. The dataset was constructed by collecting naturally-occurring examples by crowdsourcing and well-designed ones from linguistics publications. It consists of 5,382 examples.

Source: https://github.com/verypluming/MED Image Source: https://www.aclweb.org/anthology/W19-4804v2.pdf

Benchmarks

Natural Language Inference/1:1 Accuracy

Related Benchmarks

MedConceptsQA/Few-Shot Learning/AccuracyMedConceptsQA/Meta-Learning/AccuracyMedConceptsQA/Zero-Shot Learning/AccuracyMedMCQA/Question Answering/Dev Set (Acc-%)MedMCQA/Question Answering/Test Set (Acc-%)MedMCQA Dev/Question Answering/AccuarcyMedMentions/Entity Linking/AccuracyMedMentions/Entity Linking/Recall@64MedNLI/Few-Shot Learning/AccuracyMedNLI/Meta-Learning/AccuracyMedNLI/Natural Language Inference/AccuracyMedNLI/Natural Language Inference/Params (M)MedQA/Question Answering/AccuracyMedSTS/Language Modelling/Pearson CorrelationMedSTS/Representation Learning/Pearson CorrelationMedSTS/Semantic Similarity/Pearson CorrelationMedSTS/Sentence Embeddings/Pearson CorrelationMedSTS/Sentence Pair Modeling/Pearson CorrelationMedSecId/Classification/1 shot Micro-F1MedTurkQuAD: Medical Turkish Question-Answering Dataset/Question Answering/Exact MatchMedTurkQuAD: Medical Turkish Question-Answering Dataset/Question Answering/F1 ScoreMediBeng/Speech-to-Text Translation/BleuMediaEval2016/Fake News Detection/AccuracyMediaSpeech/Speech Recognition/WER for ArabicMediaSpeech/Speech Recognition/WER for FrenchMediaSpeech/Speech Recognition/WER for SpanishMediaSpeech/Speech Recognition/WER for TurkishMediaSum/Text Summarization/ROUGE-1Mediapi-RGB/Sign Language Translation/BLEU-4Medical Abstracts/Classification/F1-scoreMedical Abstracts/Text Classification/F1-scoreMedical Cost Personal Dataset/regression/R2 ScoreMedical Cost Personal Dataset/regression/lambdaMedical Segmentation Decathlon/Medical Image Segmentation/Dice (Average)Medical Segmentation Decathlon/Medical Image Segmentation/NSDMedical domain/Hypernym Discovery/MAPMedical domain/Hypernym Discovery/MRRMedical domain/Hypernym Discovery/P@5Medical domain/Taxonomy Learning/MAPMedical domain/Taxonomy Learning/MRRMedical domain/Taxonomy Learning/P@5Medico automatic polyp segmentation challenge (dataset)/Medical Image Segmentation/DSCMedico automatic polyp segmentation challenge (dataset)/Medical Image Segmentation/FPSMedico automatic polyp segmentation challenge (dataset)/Medical Image Segmentation/PrecisionMedico automatic polyp segmentation challenge (dataset)/Medical Image Segmentation/RecallMedico automatic polyp segmentation challenge (dataset)/Medical Image Segmentation/mIoU

Statistics

Papers
18
Benchmarks
1

Links

Homepage

Tasks

Automated Theorem ProvingData AugmentationNatural Language Inference