TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DiffCSE: Difference-based Contrastive Learning for Sentenc...

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings

Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljačić, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James Glass

2022-04-21NAACL 2022 7Representation LearningSentence EmbeddingsSemantic Textual SimilarityContrastive LearningLanguage Modelling
PaperPDFCode(official)

Abstract

We propose DiffCSE, an unsupervised contrastive learning framework for learning sentence embeddings. DiffCSE learns sentence embeddings that are sensitive to the difference between the original sentence and an edited sentence, where the edited sentence is obtained by stochastically masking out the original sentence and then sampling from a masked language model. We show that DiffSCE is an instance of equivariant contrastive learning (Dangovski et al., 2021), which generalizes contrastive learning and learns representations that are insensitive to certain types of augmentations and sensitive to other "harmful" types of augmentations. Our experiments show that DiffCSE achieves state-of-the-art results among unsupervised sentence representation learning methods, outperforming unsupervised SimCSE by 2.3 absolute points on semantic textual similarity tasks.

Results

TaskDatasetMetricValueModel
Semantic Textual SimilaritySTS14Spearman Correlation0.7647DiffCSE-BERT-base
Semantic Textual SimilaritySTS14Spearman Correlation0.7549DiffCSE-RoBERTa-base
Semantic Textual SimilaritySTS15Spearman Correlation0.839DiffCSE-BERT-base
Semantic Textual SimilaritySTS15Spearman Correlation0.8281DiffCSE-RoBERTa-base
Semantic Textual SimilaritySTS13Spearman Correlation0.8443DiffCSE-BERT-base
Semantic Textual SimilaritySTS13Spearman Correlation0.8343DiffCSE-RoBERTa-base
Semantic Textual SimilaritySTS12Spearman Correlation0.7228DiffCSE-BERT-base
Semantic Textual SimilaritySTS12Spearman Correlation0.7005DiffCSE-RoBERTa-base
Semantic Textual SimilaritySTS16Spearman Correlation0.8212DiffCSE-RoBERTa-base
Semantic Textual SimilaritySTS16Spearman Correlation0.8054DiffCSE-BERT-base

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17