Question Similarity on Q2Q Arabic Benchmark

Metric: F1 score (higher is better)

LeaderboardDataset
Loading chart...
#ModelF1 scoreExtra DataPaperDateCode
1Ensemble multilingual BERT model0.95924NoThe Inception Team at NSURL-2019 Task 8: Semanti...2020-04-24-
2Tha3aroon0.94848NoTha3aroon at NSURL-2019 Task 8: Semantic Questio...2019-12-28Code
3mBert0.8365NoDeep Learning Models for Multilingual Hate Speec...2020-04-14Code