Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


An Unsupervised Sentence Embedding Method by Mutual Information Maximization

Yan Zhang, Ruidan He, Zuozhu Liu, Kwan Hui Lim, Lidong Bing

2020-09-25 · EMNLP 2020

Tasks: Self-Supervised Learning · Sentence Embeddings · Semantic Textual Similarity · Clustering · STS

Paper · PDF · Code (official)

Abstract

BERT is inefficient for sentence-pair tasks such as clustering or semantic search because it must evaluate combinatorially many sentence pairs, which is very time-consuming. Sentence-BERT (SBERT) addressed this challenge by learning semantically meaningful representations of single sentences, so that similarity comparisons can be computed easily. However, SBERT is trained on corpora with high-quality labeled sentence pairs, which limits its application to tasks where labeled data is extremely scarce. In this paper, we propose a lightweight extension on top of BERT and a novel self-supervised learning objective based on mutual information maximization strategies to derive meaningful sentence embeddings in an unsupervised manner. Unlike SBERT, our method is not restricted by the availability of labeled data, so it can be applied to different domain-specific corpora. Experimental results show that the proposed method significantly outperforms other unsupervised sentence embedding baselines on common semantic textual similarity (STS) tasks and downstream supervised tasks. It also outperforms SBERT in a setting where in-domain labeled data is not available, and achieves performance competitive with supervised methods on various tasks.

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Textual Similarity | STS12 | Spearman Correlation | 0.5677 | IS-BERT-NLI |
| Semantic Textual Similarity | STS13 | Spearman Correlation | 0.6924 | IS-BERT-NLI |
| Semantic Textual Similarity | STS14 | Spearman Correlation | 0.6121 | IS-BERT-NLI |
| Semantic Textual Similarity | STS15 | Spearman Correlation | 0.7523 | IS-BERT-NLI |
| Semantic Textual Similarity | STS16 | Spearman Correlation | 0.7016 | IS-BERT-NLI |
| Semantic Textual Similarity | STS Benchmark | Spearman Correlation | 0.6921 | IS-BERT-NLI |
| Semantic Textual Similarity | SICK | Spearman Correlation | 0.6425 | IS-BERT-NLI |

Related Papers

- From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment (2025-07-20)
- Tri-Learn Graph Fusion Network for Attributed Graph Clustering (2025-07-18)
- A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys (2025-07-17)
- SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts (2025-07-17)
- Ranking Vectors Clustering: Theory and Applications (2025-07-16)
- Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder (2025-07-14)
- Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework (2025-07-11)
- GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning (2025-07-09)