CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass

BoWen Zhang, Zixin Song, Chunping Li

2025-05-01Question Answering Representation Learning Text Clustering Information Retrieval Contrastive Learning

Abstract

As a fundamental task in Information Retrieval and Computational Linguistics, sentence representation has profound implications for a wide range of practical applications such as text clustering, content analysis, question-answering systems, and web search. Recent advances in pre-trained language models (PLMs) have driven remarkable progress in this field, particularly through unsupervised embedding derivation methods centered on discriminative PLMs like BERT. However, due to time and computational constraints, few efforts have attempted to integrate unsupervised sentence representation with generative PLMs, which typically possess much larger parameter sizes. Given that state-of-the-art models in both academia and industry are predominantly based on generative architectures, there is a pressing need for an efficient unsupervised text representation framework tailored to decoder-only PLMs. To address this concern, we propose CSE-SFP, an innovative method that exploits the structural characteristics of generative models. Compared to existing strategies, CSE-SFP requires only a single forward pass to perform effective unsupervised contrastive learning. Rigorous experimentation demonstrates that CSE-SFP not only produces higher-quality embeddings but also significantly reduces both training time and memory consumption. Furthermore, we introduce two ratio metrics that jointly assess alignment and uniformity, thereby providing a more robust means for evaluating the semantic spatial properties of encoding models.

CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass

Abstract

Related Papers

CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass

Abstract

Related Papers