MultiNLI
Multi-Genre Natural Language Inference
TextsCustom (multiple, see the paper)Introduced 2018-01-01
The Multi-Genre Natural Language Inference (MultiNLI) dataset has 433K sentence pairs. Its size and mode of collection are modeled closely like SNLI. MultiNLI offers ten distinct genres (Face-to-face, Telephone, 9/11, Travel, Letters, Oxford University Press, Slate, Verbatim, Goverment and Fiction) of written and spoken English data. There are matched dev/test sets which are derived from the same sources as those in the training set, and mismatched sets which do not closely resemble any seen at training time.
Source: Semantic Sentence Matching with Densely-connectedRecurrent and Co-attentive Information