Text Clustering on MTEB
Metric: V-Measure (higher is better)
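For readers unfamiliar with the metric, here is a minimal sketch of how V-Measure can be computed, assuming the standard definition as the harmonic mean of homogeneity and completeness. The function names and the `beta` parameter are illustrative choices, not taken from the benchmark's code. The leaderboard scores appear to be reported on a 0-100 scale, whereas this sketch returns a value in [0, 1].

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy H(labels), in nats."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def conditional_entropy(targets, given):
    """Conditional entropy H(targets | given), in nats."""
    n = len(targets)
    joint = Counter(zip(given, targets))
    given_counts = Counter(given)
    return -sum(
        (c / n) * math.log(c / given_counts[g]) for (g, _), c in joint.items()
    )


def v_measure(classes, clusters, beta=1.0):
    """V-Measure of a clustering against gold classes (assumed standard form).

    homogeneity  = 1 - H(classes | clusters) / H(classes)
    completeness = 1 - H(clusters | classes) / H(clusters)
    """
    h_classes = entropy(classes)
    h_clusters = entropy(clusters)
    homogeneity = (
        1.0 if h_classes == 0
        else 1.0 - conditional_entropy(classes, clusters) / h_classes
    )
    completeness = (
        1.0 if h_clusters == 0
        else 1.0 - conditional_entropy(clusters, classes) / h_clusters
    )
    if homogeneity + completeness == 0:
        return 0.0
    # Weighted harmonic mean; beta = 1 weights both terms equally.
    return (1 + beta) * homogeneity * completeness / (beta * homogeneity + completeness)


if __name__ == "__main__":
    gold = [0, 0, 1, 1]
    predicted = [0, 0, 1, 2]  # second class split across two clusters
    print(round(v_measure(gold, predicted), 4))
```

In the toy example the clusters are pure (homogeneity 1) but one class is split in two, so completeness drops and the score falls below 1. Cluster label values themselves do not matter, only the grouping they induce.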
Results
| # | Model | V-Measure | Extra Data | Paper | Date |
|---|---|---|---|---|---|
| 1 | ST5-XXL | 43.71 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 2 | MPNet | 43.69 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 3 | GTR-XXL | 42.42 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 4 | MiniLM-L6 | 42.35 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 5 | ST5-XL | 42.34 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 6 | MiniLM-L12 | 41.81 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 7 | ST5-Large | 41.65 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 8 | GTR-Large | 41.6 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 9 | GTR-XL | 41.51 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 10 | Contriever | 41.1 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 11 | SGPT-5.8B-msmarco | 40.35 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 12 | ST5-Base | 40.21 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 13 | SGPT-1.3B-msmarco | 39.92 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 14 | SGPT-2.7B-msmarco | 39.83 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 15 | SGPT-BLOOM-7.1B-msmarco | 38.93 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 16 | GTR-Base | 38.63 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 17 | MPNet-multilingual | 38.4 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 18 | coCondenser-msmarco | 37.64 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 19 | Ada Similarity | 37.52 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 20 | MiniLM-L12-multilingual | 37.14 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 21 | SGPT-5.8B-nli | 36.98 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 22 | SGPT-125M-msmarco | 35.79 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 23 | SPECTER | 34.06 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 24 | SimCSE-BERT-sup | 33.43 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 25 | SGPT-125M-nli | 30.95 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 26 | BERT | 30.12 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 27 | LaBSE | 29.55 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 28 | SimCSE-BERT-unsup | 29.04 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 29 | Glove | 27.73 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 30 | Komninos | 26.57 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |
| 31 | LASER2 | 15.28 | No | MTEB: Massive Text Embedding Benchmark | 2022-10-13 |