Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Semantic Textual Similarity on STS Benchmark

Metric: Pearson Correlation (higher is better)
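For readers unfamiliar with the metric, the following is a minimal sketch of how a Pearson correlation between predicted and gold similarity scores can be computed. It is not the evaluation script used for any entry on this leaderboard. The function name `pearson_correlation` and the sample scores are hypothetical, chosen only for illustration, and the sample assumes gold scores on a 0 to 5 scale.

```python
import math


def pearson_correlation(predicted, gold):
    """Return Pearson's r between two equal-length sequences of scores."""
    if len(predicted) != len(gold) or len(predicted) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_g = sum(gold) / n
    # Sum of products of deviations (unnormalized covariance).
    cov = sum((p - mean_p) * (g - mean_g) for p, g in zip(predicted, gold))
    # Sums of squared deviations (unnormalized variances).
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_g = sum((g - mean_g) ** 2 for g in gold)
    if var_p == 0 or var_g == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_p * var_g)


# Hypothetical scores, illustrative only (not taken from any model listed here).
gold = [0.0, 1.5, 3.0, 4.2, 5.0]
predicted = [0.3, 1.2, 2.8, 4.5, 4.9]
r = pearson_correlation(predicted, gold)
print(f"Pearson r = {r:.3f}")
```

A value close to 1 means the predicted scores rise and fall linearly with the gold scores. Because the measure is invariant to positive rescaling and shifting, a model that predicts on a 0 to 1 scale can score as well as one that predicts on the gold scale.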


Results

| # | Model | Pearson Correlation | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | MT-DNN-SMART | 0.929 | No | SMART: Robust and Efficient Fine-Tuning for Pre-... | 2019-11-08 | Code |
| 2 | StructBERTRoBERTa ensemble | 0.928 | No | StructBERT: Incorporating Language Structures in... | 2019-08-13 | - |
| 3 | Mnet-Sim | 0.927 | No | MNet-Sim: A Multi-layered Semantic Similarity Ne... | 2021-11-09 | - |
| 4 | T5-11B | 0.925 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 5 | ALBERT | 0.925 | Yes | ALBERT: A Lite BERT for Self-supervised Learning... | 2019-09-26 | Code |
| 6 | XLNet (single model) | 0.925 | No | XLNet: Generalized Autoregressive Pretraining fo... | 2019-06-19 | Code |
| 7 | RoBERTa | 0.922 | No | RoBERTa: A Robustly Optimized BERT Pretraining A... | 2019-07-26 | Code |
| 8 | ELECTRA | 0.921 | No | - | - | - |
| 9 | RoBERTa-large 355M (MLP quantized vector-wise, fine-tuned) | 0.919 | No | LLM.int8(): 8-bit Matrix Multiplication for Tran... | 2022-08-15 | Code |
| 10 | PSQ (Chen et al., 2020) | 0.919 | No | A Statistical Framework for Low-bitwidth Trainin... | 2020-10-27 | Code |
| 11 | RoBERTa-large 355M + Entailment as Few-shot Learner | 0.918 | No | Entailment as Few-Shot Learner | 2021-04-29 | Code |
| 12 | ERNIE 2.0 Large | 0.912 | No | ERNIE 2.0: A Continual Pre-training Framework fo... | 2019-07-29 | Code |
| 13 | Q-BERT (Shen et al., 2020) | 0.911 | No | Q-BERT: Hessian Based Ultra Low Precision Quanti... | 2019-09-12 | - |
| 14 | Q8BERT (Zafrir et al., 2019) | 0.911 | No | Q8BERT: Quantized 8Bit BERT | 2019-10-14 | Code |
| 15 | ELECTRA (no tricks) | 0.91 | No | - | - | - |
| 16 | DistilBERT 66M | 0.907 | No | DistilBERT, a distilled version of BERT: smaller... | 2019-10-02 | Code |
| 17 | T5-3B | 0.906 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 18 | MLM+ del-word | 0.905 | No | CLEAR: Contrastive Learning for Sentence Represe... | 2020-12-31 | - |
| 19 | RealFormer | 0.9011 | No | RealFormer: Transformer Likes Residual Attention | 2020-12-21 | Code |
| 20 | T5-Large | 0.899 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 21 | SpanBERT | 0.899 | No | SpanBERT: Improving Pre-training by Representing... | 2019-07-24 | Code |
| 22 | T5-Base | 0.894 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 23 | ERNIE 2.0 Base | 0.876 | No | ERNIE 2.0: A Continual Pre-training Framework fo... | 2019-07-29 | Code |
| 24 | Charformer-Tall | 0.873 | No | Charformer: Fast Character Transformers via Grad... | 2021-06-23 | Code |
| 25 | T5-Small | 0.856 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 26 | ERNIE | 0.832 | No | ERNIE: Enhanced Language Representation with Inf... | 2019-05-17 | Code |
| 27 | 24hBERT | 0.82 | No | How to Train BERT with an Academic Budget | 2021-04-15 | Code |
| 28 | TinyBERT-4 14.5M | 0.799 | No | TinyBERT: Distilling BERT for Natural Language U... | 2019-09-23 | Code |
| 29 | USE_T | 0.782 | No | Universal Sentence Encoder | 2018-03-29 | Code |