Shortcut-Stacked Sentence Encoders for Multi-Domain Inference

Yixin Nie, Mohit Bansal

2017-08-07 | WS 2017 9 | Natural Language Inference, Word Embeddings

Abstract

We present a simple sequential sentence encoder for multi-domain natural language inference. Our encoder is based on stacked bidirectional LSTM-RNNs with shortcut connections and fine-tuning of word embeddings. The overall supervised model uses the above encoder to encode two input sentences into two vectors, and then uses a classifier over the vector combination to label the relationship between these two sentences as that of entailment, contradiction, or neutral. Our Shortcut-Stacked sentence encoders achieve strong improvements over existing encoders on matched and mismatched multi-domain natural language inference (top non-ensemble single-model result in the EMNLP RepEval 2017 Shared Task (Nangia et al., 2017)). Moreover, they achieve the new state-of-the-art encoding result on the original SNLI dataset (Bowman et al., 2015).
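The data flow described in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation. It assumes that a shortcut connection means each stacked BiLSTM layer receives the word embeddings concatenated with the outputs of all earlier layers, and that the "vector combination" is the concatenation [u; v; |u - v|; u * v]; the abstract does not spell out either detail. The function names and the layer sizes in the usage note are hypothetical.

```python
from typing import List, Sequence


def shortcut_input_dims(embed_dim: int, hidden_dims: Sequence[int]) -> List[int]:
    """Input width of each stacked BiLSTM layer under concatenating shortcuts.

    Assumption: layer i sees the word embedding plus the outputs of every
    earlier layer, and a bidirectional layer emits 2 * hidden features.
    """
    dims = []
    width = embed_dim
    for hidden in hidden_dims:
        dims.append(width)
        width += 2 * hidden  # forward + backward states join the shortcut path
    return dims


def combine(u: Sequence[float], v: Sequence[float]) -> List[float]:
    """Assumed vector combination fed to the classifier: [u; v; |u - v|; u * v]."""
    if len(u) != len(v):
        raise ValueError("sentence vectors must have the same length")
    diff = [abs(a - b) for a, b in zip(u, v)]  # element-wise absolute difference
    prod = [a * b for a, b in zip(u, v)]       # element-wise product
    return list(u) + list(v) + diff + prod
```

For example, with 300-dimensional embeddings and illustrative hidden sizes of 512, 1024, and 2048, `shortcut_input_dims(300, [512, 1024, 2048])` gives the three layer input widths 300, 1324, and 3372, and `combine` turns two d-dimensional sentence vectors into a single 4d-dimensional feature vector for the three-way classifier.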

Results

Task	Dataset	Metric	Value	Model
Natural Language Inference	SNLI	% Test Accuracy	86	600D Residual stacked encoders
Natural Language Inference	SNLI	% Train Accuracy	91	600D Residual stacked encoders
Natural Language Inference	SNLI	% Test Accuracy	85.7	300D Residual stacked encoders
Natural Language Inference	SNLI	% Train Accuracy	89.8	300D Residual stacked encoders
