Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Natural Language Inference on SNLI

Metric: % Train Accuracy (higher is better)
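
As a minimal sketch of what this metric measures, assuming it is plain label accuracy over training pairs expressed as a percentage: the function name `percent_accuracy` and the example labels below are hypothetical and do not come from any listed paper or from the leaderboard itself.

```python
from typing import Sequence


def percent_accuracy(predicted: Sequence[str], gold: Sequence[str]) -> float:
    """Return the percentage of examples whose predicted label equals the gold label."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == g for p, g in zip(predicted, gold))
    return 100.0 * correct / len(gold)


# Hypothetical labels for four premise/hypothesis pairs (illustrative only).
gold = ["entailment", "neutral", "contradiction", "entailment"]
predicted = ["entailment", "neutral", "neutral", "entailment"]
print(percent_accuracy(predicted, gold))  # 3 of 4 correct -> 75.0
```

Because the score is measured on training data, a high value shows how well a model fits the examples it was trained on, not how well it generalizes to unseen pairs.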

Results

| # | Model | % Train Accuracy | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | + Unigram and bigram features | 99.7 | No | A large annotated corpus for learning natural la... | 2015-08-21 | Code |
| 2 | Ntumpha | 99.1 | No | Multi-Task Deep Neural Networks for Natural Lang... | 2019-01-31 | Code |
| 3 | 1024D GRU encoders w/ unsupervised 'skip-thoughts' pre-training | 98.8 | No | Order-Embeddings of Images and Language | 2015-11-19 | Code |
| 4 | MT-DNN | 97.2 | No | Multi-Task Deep Neural Networks for Natural Lang... | 2019-01-31 | Code |
| 5 | Fine-Tuned LM-Pretrained Transformer | 96.6 | No | - | - | Code |
| 6 | 300D DMAN Ensemble | 96.1 | No | Discourse Marker Augmented Network with Reinforc... | 2019-07-23 | Code |
| 7 | 300D DMAN Ensemble | 96.1 | No | Discourse Marker Augmented Network with Reinforc... | 2019-07-23 | Code |
| 8 | SJRC (BERT-Large +SRL) | 95.7 | No | Explicit Contextual Semantics for Text Comprehen... | 2018-09-08 | - |
| 9 | 150D Multiway Attention Network Ensemble | 95.5 | No | - | - | Code |
| 10 | 300D DMAN | 95.4 | No | Discourse Marker Augmented Network with Reinforc... | 2019-07-23 | Code |
| 11 | 300D DMAN | 95.4 | No | Discourse Marker Augmented Network with Reinforc... | 2019-07-23 | Code |
| 12 | Densely-Connected Recurrent and Co-Attentive Network Ensemble | 95 | No | Semantic Sentence Matching with Densely-connecte... | 2018-05-29 | - |
| 13 | 600D BiLSTM with generalized pooling | 94.9 | No | Enhancing Sentence Embedding with Generalized Po... | 2018-06-26 | Code |
| 14 | 450D DR-BiLSTM Ensemble | 94.8 | No | DR-BiLSTM: Dependent Reading Bidirectional LSTM ... | 2018-02-15 | - |
| 15 | 150D Multiway Attention Network | 94.5 | No | - | - | Code |
| 16 | SemBERT | 94.4 | No | Semantics-aware BERT for Language Understanding | 2019-09-05 | Code |
| 17 | KIM | 94.1 | No | Neural Natural Language Inference Models Enhance... | 2017-11-12 | Code |
| 18 | 450D DR-BiLSTM | 94.1 | No | DR-BiLSTM: Dependent Reading Bidirectional LSTM ... | 2018-02-15 | - |
| 19 | RE2 | 94 | No | Simple and Effective Text Matching with Richer A... | 2019-08-01 | Code |
| 20 | KIM Ensemble | 93.6 | No | Neural Natural Language Inference Models Enhance... | 2017-11-12 | Code |
| 21 | 600D ESIM + 300D Syntactic TreeLSTM | 93.5 | No | Enhanced LSTM for Natural Language Inference | 2016-09-20 | Code |
| 22 | Stochastic Answer Network | 93.3 | No | Stochastic Answer Networks for Natural Language ... | 2018-04-21 | Code |
| 23 | BiMPM Ensemble | 93.2 | No | Bilateral Multi-Perspective Matching for Natural... | 2017-02-13 | Code |
| 24 | MFAE | 93.18 | No | - | - | Code |
| 25 | Densely-Connected Recurrent and Co-Attentive Network | 93.1 | No | Semantic Sentence Matching with Densely-connecte... | 2018-05-29 | - |
| 26 | 600D Gumbel TreeLSTM encoders | 93.1 | No | Learning to Compose Task-Specific Tree Structures | 2017-07-10 | Code |
| 27 | CA-MTL | 92.6 | No | Conditionally Adaptive Multi-Task Learning: Impr... | 2020-09-19 | Code |
| 28 | DEIM | 92.6 | No | DEIM: An effective deep encoding and interaction... | 2022-03-20 | - |
| 29 | 300D Reinforced Self-Attention Network | 92.6 | No | Reinforced Self-Attention Network: a Hybrid of H... | 2018-01-31 | Code |
| 30 | 300D CAFE Ensemble | 92.5 | No | Compare, Compress and Propagate: Enhancing Neura... | 2017-12-30 | - |
| 31 | 448D Densely Interactive Inference Network (DIIN, code) Ensemble | 92.3 | No | Natural Language Inference over Interaction Space | 2017-09-13 | Code |
| 32 | ESIM + ELMo Ensemble | 92.1 | No | Deep contextualized word representations | 2018-02-15 | Code |
| 33 | 300D mLSTM word-by-word attention model | 92 | No | Learning Natural Language Inference with LSTM | 2015-12-30 | Code |
| 34 | ESIM + ELMo | 91.6 | No | Deep contextualized word representations | 2018-02-15 | Code |
| 35 | 512D Dynamic Meta-Embeddings | 91.6 | No | Dynamic Meta-Embeddings for Improved Sentence Re... | 2018-04-21 | Code |
| 36 | Densely-Connected Recurrent and Co-Attentive Network (encoder) | 91.4 | No | Semantic Sentence Matching with Densely-connecte... | 2018-05-29 | - |
| 37 | 448D Densely Interactive Inference Network (DIIN, code) | 91.2 | No | Natural Language Inference over Interaction Space | 2017-09-13 | Code |
| 38 | 300D Gumbel TreeLSTM encoders | 91.2 | No | Learning to Compose Task-Specific Tree Structures | 2017-07-10 | Code |
| 39 | 300D Directional self-attention network encoders | 91.1 | No | DiSAN: Directional Self-Attention Network for RN... | 2017-09-14 | Code |
| 40 | 600D Residual stacked encoders | 91 | No | Shortcut-Stacked Sentence Encoders for Multi-Dom... | 2017-08-07 | Code |
| 41 | BiMPM | 90.9 | No | Bilateral Multi-Perspective Matching for Natural... | 2017-02-13 | Code |
| 42 | 300D re-read LSTM | 90.7 | No | - | - | - |
| 43 | 300D re-read LSTM | 90.7 | No | - | - | - |
| 44 | 200D decomposable attention feed-forward model with intra-sentence attention | 90.5 | No | A Decomposable Attention Model for Natural Langu... | 2016-06-06 | Code |
| 45 | 200D decomposable attention model with intra-sentence attention | 90.5 | No | A Decomposable Attention Model for Natural Langu... | 2016-06-06 | Code |
| 46 | 600D (300+300) Deep Gated Attn. BiLSTM encoders | 90.5 | No | Recurrent Neural Network-Based Sentence Encoder ... | 2017-08-04 | Code |
| 47 | 600D Hierarchical BiLSTM with Max Pooling (HBMP, code) | 89.9 | No | Sentence Embeddings in NLI with Iterative Refine... | 2018-08-27 | Code |
| 48 | 300D CAFE | 89.8 | No | Compare, Compress and Propagate: Enhancing Neura... | 2017-12-30 | - |
| 49 | 300D Residual stacked encoders | 89.8 | No | Shortcut-Stacked Sentence Encoders for Multi-Dom... | 2017-08-07 | Code |
| 50 | Distance-based Self-Attention Network | 89.6 | No | Distance-based Self-Attention Network for Natura... | 2017-12-06 | - |
| 51 | 200D decomposable attention feed-forward model | 89.5 | No | A Decomposable Attention Model for Natural Langu... | 2016-06-06 | Code |
| 52 | 200D decomposable attention model | 89.5 | No | A Decomposable Attention Model for Natural Langu... | 2016-06-06 | Code |
| 53 | 300D SPINN-PI encoders | 89.2 | No | A Fast Unified Model for Parsing and Sentence Un... | 2016-03-19 | Code |
| 54 | SLRC | 89.1 | No | Explicit Contextual Semantics for Text Comprehen... | 2018-09-08 | - |
| 55 | 2400D Multiple-Dynamic Self-Attention Model | 89 | No | Dynamic Self-Attention : Computing Attention ove... | 2018-08-22 | Code |
| 56 | Biattentive Classification Network + CoVe + Char | 88.5 | No | Learned in Translation: Contextualized Word Vect... | 2017-08-01 | Code |
| 57 | 300D Full tree matching NTI-SLSTM-LSTM w/ global attention | 88.5 | No | Neural Tree Indexers for Text Understanding | 2016-07-15 | Code |
| 58 | 450D LSTMN with deep attention fusion | 88.5 | No | Long Short-Term Memory-Networks for Machine Read... | 2016-01-25 | Code |
| 59 | 600D Dynamic Self-Attention Model | 87.3 | No | Dynamic Self-Attention : Computing Attention ove... | 2018-08-22 | Code |
| 60 | 300D CAFE (no cross-sentence attention) | 87.3 | No | Compare, Compress and Propagate: Enhancing Neura... | 2017-12-30 | - |
| 61 | 300D LSTMN with deep attention fusion | 87.3 | No | Long Short-Term Memory-Networks for Machine Read... | 2016-01-25 | Code |
| 62 | 300D MMA-NSE encoders with attention | 86.9 | No | Neural Semantic Encoders | 2016-07-14 | Code |
| 63 | 50D stacked TC-LSTMs | 86.7 | No | Modelling Interaction of Sentence Pair with coup... | 2016-05-18 | - |
| 64 | 600D (300+300) BiLSTM encoders | 86.4 | No | Learning Natural Language Inference using Bidire... | 2016-05-30 | Code |
| 65 | 300D NSE encoders | 86.2 | No | Neural Semantic Encoders | 2016-07-14 | Code |
| 66 | 600D (300+300) BiLSTM encoders with intra-attention and symbolic preproc. | 85.9 | No | Learning Natural Language Inference using Bidire... | 2016-05-30 | Code |
| 67 | 4096D BiLSTM with max-pooling | 85.6 | No | Supervised Learning of Universal Sentence Repres... | 2017-05-05 | Code |
| 68 | 100D LSTMs w/ word-by-word attention | 85.3 | No | Reasoning about Entailment with Neural Attention | 2015-09-22 | Code |
| 69 | 100D DF-LSTM | 85.2 | No | - | - | - |
| 70 | 100D LSTM encoders | 84.8 | No | A large annotated corpus for learning natural la... | 2015-08-21 | Code |
| 71 | 600D (300+300) BiLSTM encoders with intra-attention | 84.5 | No | Learning Natural Language Inference using Bidire... | 2016-05-30 | Code |
| 72 | 300D LSTM encoders | 83.9 | No | A Fast Unified Model for Parsing and Sentence Un... | 2016-03-19 | Code |
| 73 | 300D Tree-based CNN encoders | 83.3 | No | Natural Language Inference by Tree-Based Convolu... | 2015-12-28 | - |
| 74 | 300D NTI-SLSTM-LSTM encoders | 82.5 | No | Neural Tree Indexers for Text Understanding | 2016-07-15 | Code |
| 75 | Unlexicalized features | 49.4 | No | A large annotated corpus for learning natural la... | 2015-08-21 | Code |