Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Question Answering on HotpotQA

Metric: SUP-F1 (higher is better)
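For readers unfamiliar with the metric, the following is a minimal sketch of how a supporting-fact F1 score can be computed for a single question. It assumes the HotpotQA convention in which each supporting fact is a (paragraph title, sentence index) pair and the score is a set-based F1 over predicted and gold pairs; the function name `sup_f1` and the type alias are illustrative, not taken from this page. A leaderboard score would typically be the average of this value over all evaluated questions.

```python
from typing import Iterable, Tuple

# Assumption: a supporting fact is identified by (paragraph title, sentence index).
SupportingFact = Tuple[str, int]


def sup_f1(predicted: Iterable[SupportingFact],
           gold: Iterable[SupportingFact]) -> float:
    """Set-based F1 between predicted and gold supporting facts for one question."""
    pred_set = set(predicted)
    gold_set = set(gold)
    true_positives = len(pred_set & gold_set)
    if true_positives == 0:
        # No overlap (including empty predictions): F1 is defined as 0 here.
        return 0.0
    precision = true_positives / len(pred_set)
    recall = true_positives / len(gold_set)
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: one of three predictions matches one of two gold facts.
gold_facts = [("Article A", 0), ("Article B", 2)]
pred_facts = [("Article A", 0), ("Article B", 1), ("Article C", 0)]
score = sup_f1(pred_facts, gold_facts)  # precision 1/3, recall 1/2
```

The scores in the table below are reported on a 0 to 1 scale, so a value such as 0.901 corresponds to roughly 90 F1 points.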

Results

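| # | Model | SUP-F1 | Extra Data | Paper | Date | Code |
|---|-------|--------|------------|-------|------|------|
| 1 | Beam Retrieval | 0.901 | No | End-to-End Beam Retrieval for Multi-Hop Question... | 2023-08-17 | Code |
| 2 | BigBird-etc | 0.891 | No | Big Bird: Transformers for Longer Sequences | 2020-07-28 | Code |
| 3 | AISO | 0.86 | No | Adaptive Information Seeking for Open-Domain Que... | 2021-09-14 | Code |
| 4 | Chain-of-Skills | 0.853 | No | Chain-of-Skills: A Configurable Model for Open-d... | 2023-05-04 | Code |
| 5 | TPRR | 0.843 | No | - | - | - |
| 6 | EBS-Large | 0.84 | No | - | - | - |
| 7 | HopRetriever + Sp-search | 0.835 | No | HopRetriever: Retrieve Hops over Wikipedia to An... | 2020-12-31 | - |
| 8 | IRRR+ | 0.832 | No | Answering Open-Domain Questions of Varying Reaso... | 2020-10-23 | Code |
| 9 | EBS-SH | 0.831 | No | - | - | - |
| 10 | HopRetriever | 0.826 | No | - | - | - |
| 11 | IRRR | 0.821 | No | Answering Open-Domain Questions of Varying Reaso... | 2020-10-23 | Code |
| 12 | HopRetriever-V2 | 0.818 | No | - | - | - |
| 13 | AFSGraph-retriever | 0.812 | No | - | - | - |
| 14 | Recursive Dense Retriever | 0.809 | No | Answering Complex Open-Domain Questions with Mul... | 2020-09-27 | Code |
| 15 | Step-by-Step Retriever | 0.8 | No | - | - | - |
| 16 | HopRetriever-V1 | 0.793 | No | - | - | - |
| 17 | DDRQA | 0.789 | No | Answering Any-hop Open-domain Questions with Ite... | 2020-09-16 | - |
| 18 | DR model large | 0.778 | No | - | - | - |
| 19 | HGN-albert + SemanticRetrievalMRS IR | 0.774 | No | - | - | - |
| 20 | Model name | 0.772 | No | - | - | - |
| 21 | HopAns | 0.772 | No | - | - | - |
| 22 | Multi-dimensional-AFSGraph | 0.772 | No | - | - | - |
| 23 | Anonymous | 0.771 | No | - | - | - |
| 24 | AFSgraph | 0.769 | No | - | - | - |
| 25 | Tree-shaped-cluster | 0.768 | No | - | - | - |
| 26 | HGN-large + SemanticRetrievalMRS IR | 0.768 | No | - | - | - |
| 27 | Robustly Fine-tuned Graph-based Recurrent Retriever | 0.764 | No | Learning to Retrieve Reasoning Paths over Wikipe... | 2019-11-24 | Code |
| 28 | HGN + SemanticRetrievalMRS IR | 0.764 | No | Hierarchical Graph Network for Multi-hop Questio... | 2019-11-09 | Code |
| 29 | AFSgraph model | 0.759 | No | - | - | - |
| 30 | SAFSR model | 0.757 | No | HotpotQA: A Dataset for Diverse, Explainable Mul... | 2018-09-25 | Code |
| 31 | RoBERTa-DenseRetriever-Fast | 0.749 | No | - | - | - |
| 32 | DPR-recurrent | 0.749 | No | - | - | - |
| 33 | RoBERTa-DenseRetriever | 0.748 | No | - | - | - |
| 34 | GAR-BERT | 0.747 | No | - | - | - |
| 35 | GAR | 0.739 | No | - | - | - |
| 36 | Quark + SemanticRetrievalMRS IR | 0.73 | No | A Simple Yet Strong Pipeline for HotpotQA | 2020-04-14 | - |
| 37 | Graph-based Recurrent Retriever | 0.73 | No | - | - | - |
| 38 | DR model | 0.725 | No | - | - | - |
| 39 | MIR+EPS+BERT | 0.72 | No | - | - | - |
| 40 | Transformer-XH-final | 0.714 | No | - | - | Code |
| 41 | GraphRR-Fast | 0.713 | No | - | - | - |
| 42 | SemanticRetrievalMRS | 0.708 | No | Revealing the Importance of Semantic Retrieval f... | 2019-09-17 | Code |
| 43 | Transformer-XH | 0.7 | No | - | - | - |
| 44 | GoldEn Retriever | 0.642 | No | Answering Complex Open-domain Questions Through ... | 2019-10-15 | Code |
| 45 | DrKIT | 0.598 | No | - | - | - |
| 46 | PR-Bert | 0.596 | No | - | - | - |
| 47 | SAFSr-Bert | 0.585 | No | - | - | - |
| 48 | Cognitive Graph QA | 0.577 | No | Cognitive Graph for Multi-Hop Reading Comprehens... | 2019-05-14 | Code |
| 49 | Entity-centric BERT Pipeline | 0.573 | No | - | - | - |
| 50 | IKFGraph | 0.512 | No | - | - | - |
| 51 | GRN + BERT | 0.497 | No | - | - | - |
| 52 | HGN Model-reproduce | 0.493 | No | - | - | - |
| 53 | GRN | 0.488 | No | - | - | - |
| 54 | MUPPET | 0.473 | No | Multi-Hop Paragraph Retrieval for Open-Domain Qu... | 2019-06-15 | Code |
| 55 | KGNN | 0.472 | No | Multi-Paragraph Reasoning with Knowledge-enhance... | 2019-11-06 | - |
| 56 | SAQA | 0.472 | No | - | - | - |
| 57 | AnonymousQ | 0.468 | No | - | - | - |
| 58 | GAR-NOSF | 0.448 | No | - | - | - |
| 59 | QFE | 0.444 | No | Answering while Summarizing: Multi-task Learning... | 2019-05-21 | - |
| 60 | Entity-centric IR | 0.432 | No | - | - | - |
| 61 | SAFSr_model | 0.406 | No | - | - | - |
| 62 | SuppBERT | 0.4 | No | - | - | - |
| 63 | Baseline Model | 0.377 | No | HotpotQA: A Dataset for Diverse, Explainable Mul... | 2018-09-25 | Code |
| 64 | tes | 0.078 | No | - | - | - |
| 65 | PromptRank-fewshot-2-demo | 0 | No | - | - | - |
| 66 | graph-recurrent-retriever+roberta-base w. S/R-pretraining | 0 | No | - | - | - |
| 67 | TPReasoner w/o BERT | 0 | No | - | - | - |
| 68 | MultiQA | 0 | No | - | - | - |
| 69 | DecompRC | 0 | No | - | - | Code |
| 70 |  | 0 | No | - | - | - |
| 71 | Mistral multi hop with very large sources | 0 | No | - | - | - |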