Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SGPT: GPT Sentence Embeddings for Semantic Search

Niklas Muennighoff

2022-02-17
Tasks: Question Answering, News Retrieval, Duplicate-Question Retrieval, Argument Retrieval, Fact Checking, Entity Retrieval, Passage Retrieval, Sentence Embeddings, Tweet Retrieval, Information Retrieval, Biomedical Information Retrieval, Citation Prediction, Zero-shot Text Search

Abstract

Decoder transformers have continued to increase in scale, reaching hundreds of billions of parameters. Due to their scale, the same decoder sets state-of-the-art results on various language tasks via prompting or fine-tuning. Yet these large foundation models remain unusable for the related fields of semantic search and sentence embeddings. This prevents possibly new state-of-the-art results and forces organizations to train and maintain separate models. To this end, we propose SGPT to use decoders for sentence embeddings and semantic search via prompting or fine-tuning. At 5.8 billion parameters, SGPT improves on the previously best sentence embeddings by a margin of 7% and outperforms a concurrent method with 175 billion parameters as measured on the BEIR search benchmark. Code, models, and result files are freely available at https://github.com/Muennighoff/sgpt.
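The abstract frames sentence embeddings as a basis for semantic search. One common way to use embeddings for search is to embed the query and each document separately and rank documents by cosine similarity. The sketch below illustrates only that ranking step; it is an assumption for illustration, not a description of SGPT's actual pipeline. The vectors, the document IDs, and the function names `cosine_similarity` and `rank_documents` are hypothetical placeholders and were not produced by any SGPT model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def rank_documents(query_embedding, doc_embeddings):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, cosine_similarity(query_embedding, emb))
        for doc_id, emb in doc_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings; a real system would obtain these from a model.
toy_docs = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.7, 0.7, 0.0],
    "doc_c": [0.0, 0.0, 1.0],
}
toy_query = [1.0, 0.1, 0.0]

ranking = rank_documents(toy_query, toy_docs)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

Because documents are embedded independently of the query, their vectors can be computed once and reused across queries, which is the usual reason for choosing this setup for large collections.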

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Question Answering | HotpotQA (BEIR) | nDCG@10 | 0.699 | SGPT-CE-6.1B |
| Question Answering | HotpotQA (BEIR) | nDCG@10 | 0.593 | SGPT-BE-5.8B |
| Question Answering | NQ (BEIR) | nDCG@10 | 0.524 | SGPT-BE-5.8B |
| Question Answering | NQ (BEIR) | nDCG@10 | 0.401 | SGPT-CE-6.1B |
| Question Answering | FiQA-2018 (BEIR) | nDCG@10 | 0.401 | SGPT-CE-6.1B |
| Question Answering | FiQA-2018 (BEIR) | nDCG@10 | 0.372 | SGPT-BE-5.8B |
| Information Retrieval | CQADupStack | mAP@100 | 0.16 | SGPT-BE-5.8B |
| Information Retrieval | MSMARCO (BEIR) | nDCG@10 | 0.399 | SGPT-BE-5.8B |
| Information Retrieval | MSMARCO (BEIR) | nDCG@10 | 0.29 | SGPT-CE-6.1B |
| Information Retrieval | MSMARCO (BEIR) | nDCG@10 | 0.278 | SGPT-CE-2.7B |
| Biomedical Information Retrieval | NFCorpus (BEIR) | nDCG@10 | 0.362 | SGPT-BE-5.8B |
| Biomedical Information Retrieval | NFCorpus (BEIR) | nDCG@10 | 0.358 | OpenAI Search-Davinci |
| Biomedical Information Retrieval | NFCorpus (BEIR) | nDCG@10 | 0.347 | SGPT-CE-6.1B |
| Biomedical Information Retrieval | NFCorpus (BEIR) | nDCG@10 | 0.333 | SGPT-CE-2.7B |
| Biomedical Information Retrieval | BioASQ (BEIR) | nDCG@10 | 0.547 | SGPT-CE-6.1B |
| Biomedical Information Retrieval | BioASQ (BEIR) | nDCG@10 | 0.546 | SGPT-CE-2.7B |
| Biomedical Information Retrieval | BioASQ (BEIR) | nDCG@10 | 0.413 | SGPT-BE-5.8B |
| Biomedical Information Retrieval | TREC-COVID (BEIR) | nDCG@10 | 0.873 | SGPT-BE-5.8B |
| Biomedical Information Retrieval | TREC-COVID (BEIR) | nDCG@10 | 0.791 | SGPT-CE-6.1B |
| Biomedical Information Retrieval | TREC-COVID (BEIR) | nDCG@10 | 0.762 | SGPT-CE-2.7B |
| Fact Checking | CLIMATE-FEVER (BEIR) | nDCG@10 | 0.305 | SGPT-BE-5.8B |
| Fact Checking | CLIMATE-FEVER (BEIR) | nDCG@10 | 0.161 | SGPT-CE-6.1B |
| Fact Checking | FEVER (BEIR) | nDCG@10 | 0.783 | SGPT-BE-5.8B |
| Fact Checking | FEVER (BEIR) | nDCG@10 | 0.725 | SGPT-CE-6.1B |
| Fact Checking | SciFact (BEIR) | nDCG@10 | 0.747 | SGPT-BE-5.8B |
| Fact Checking | SciFact (BEIR) | nDCG@10 | 0.682 | SGPT-CE-6.1B |
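
Most rows above report nDCG@10. As a reading aid, here is a minimal sketch of one common formulation of that metric, using linear gain and a log2 rank discount. The exact evaluation tooling behind the listed numbers is not stated on this page, so treat this as an assumption rather than the benchmark's implementation; the function names `dcg_at_k` and `ndcg_at_k` and the toy relevance labels are hypothetical.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k results.

    `relevances` lists the graded relevance of each returned document,
    in ranked order (best-ranked first).
    """
    return sum(
        rel / math.log2(rank + 2)  # rank is 0-based, so the top hit divides by log2(2) = 1
        for rank, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(ranked_relevances, all_relevances=None, k=10):
    """DCG@k normalised by the DCG@k of an ideal ordering.

    `all_relevances` should hold the labels of every judged document for
    the query; if omitted, the ideal ordering is built from the ranked
    list alone (an assumption that suits only small toy examples).
    """
    pool = ranked_relevances if all_relevances is None else all_relevances
    ideal_dcg = dcg_at_k(sorted(pool, reverse=True), k)
    if ideal_dcg == 0:
        return 0.0  # no relevant documents for this query
    return dcg_at_k(ranked_relevances, k) / ideal_dcg


# Toy query: relevant documents returned at ranks 1 and 3.
score = ndcg_at_k([1, 0, 1])
print(f"nDCG@10 = {score:.3f}")
```

A benchmark score such as those in the table is typically the mean of this per-query value over all test queries.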

Related Papers

PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants (2025-07-21)
From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment (2025-07-20)
From Roots to Rewards: Dynamic Tree Reasoning with RL (2025-07-17)
Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering (2025-07-17)
Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It (2025-07-17)
City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning (2025-07-17)
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts (2025-07-17)
Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management (2025-07-17)