TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Text Clustering/MTEB

Text Clustering on MTEB

Metric: V-Measure (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕V-Measure▼Extra DataPaperDate↕Code
1ST5-XXL43.71NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
2MPNet43.69NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
3GTR-XXL42.42NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
4MiniLM-L642.35NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
5ST5-XL42.34NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
6MiniLM-L1241.81NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
7ST5-Large41.65NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
8GTR-Large41.6NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
9GTR-XL41.51NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
10Contriever41.1NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
11SGPT-5.8B-msmarco40.35NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
12ST5-Base40.21NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
13SGPT-1.3B-msmarco39.92NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
14SGPT-2.7B-msmarco39.83NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
15SGPT-BLOOM-7.1B-msmarco38.93NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
16GTR-Base38.63NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
17MPNet-multilingual38.4NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
18coCondenser-msmarco37.64NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
19Ada Similarity37.52NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
20MiniLM-L12-multilingual37.14NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
21SGPT-5.8B-nli36.98NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
22SGPT-125M-msmarco35.79NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
23SPECTER34.06NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
24SimCSE-BERT-sup33.43NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
25SGPT-125M-nli30.95NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
26BERT30.12NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
27LaBSE29.55NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
28SimCSE-BERT-unsup29.04NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
29Glove27.73NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
30Komninos26.57NoMTEB: Massive Text Embedding Benchmark2022-10-13Code
31LASER215.28NoMTEB: Massive Text Embedding Benchmark2022-10-13Code