Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Question Answering on HotpotQA

Metric: ANS-F1 (higher is better)
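
As a rough illustration of what an answer-level F1 score measures, the sketch below computes a token-overlap F1 between a predicted answer and a gold answer, then averages it over a set of questions. It assumes ANS-F1 follows the SQuAD-style convention (lowercasing, stripping punctuation and English articles, then comparing token multisets) and that yes/no answers must match exactly; both are assumptions about the benchmark's scorer, and the function names `normalize_answer`, `answer_f1`, and `mean_answer_f1` are hypothetical. It is not the official evaluation script.

```python
import re
import string
from collections import Counter

# Assumption: these special answers only score when they match exactly.
SPECIAL_ANSWERS = {"yes", "no", "noanswer"}


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def answer_f1(prediction: str, gold: str) -> float:
    """Token-level F1 between one predicted answer and one gold answer."""
    pred_norm = normalize_answer(prediction)
    gold_norm = normalize_answer(gold)

    # A yes/no style answer gets no partial credit against a different answer.
    if pred_norm != gold_norm and (
        pred_norm in SPECIAL_ANSWERS or gold_norm in SPECIAL_ANSWERS
    ):
        return 0.0

    pred_tokens = pred_norm.split()
    gold_tokens = gold_norm.split()

    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def mean_answer_f1(pairs: list[tuple[str, str]]) -> float:
    """Average per-question F1 over (prediction, gold) pairs."""
    if not pairs:
        return 0.0
    return sum(answer_f1(pred, gold) for pred, gold in pairs) / len(pairs)
```

Under these assumptions a score of 0.85 would mean that, averaged over all questions, predicted answers share most of their tokens with the gold answers; partial matches such as "Barack Obama" against "Obama" earn partial credit rather than zero.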


Results

| # | Model | ANS-F1 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Beam Retrieval | 0.85 | No | End-to-End Beam Retrieval for Multi-Hop Question... | 2023-08-17 | Code |
| 2 | AISO | 0.805 | No | Adaptive Information Seeking for Open-Domain Que... | 2021-09-14 | Code |
| 3 | Chain-of-Skills | 0.801 | No | Chain-of-Skills: A Configurable Model for Open-d... | 2023-05-04 | Code |
| 4 | HopRetriever + Sp-search | 0.799 | No | HopRetriever: Retrieve Hops over Wikipedia to An... | 2020-12-31 | - |
| 5 | HopRetriever | 0.799 | No | - | - | - |
| 6 | TPRR | 0.795 | No | - | - | - |
| 7 | EBS-Large | 0.793 | No | - | - | - |
| 8 | IRRR+ | 0.791 | No | Answering Open-Domain Questions of Varying Reaso... | 2020-10-23 | Code |
| 9 | EBS-SH | 0.786 | No | - | - | - |
| 10 | IRRR | 0.782 | No | Answering Open-Domain Questions of Varying Reaso... | 2020-10-23 | Code |
| 11 | HopRetriever-V2 | 0.778 | No | - | - | - |
| 12 | AFSGraph-retriever | 0.778 | No | - | - | - |
| 13 | DDRQA | 0.759 | No | Answering Any-hop Open-domain Questions with Ite... | 2020-09-16 | - |
| 14 | BigBird-etc | 0.755 | No | Big Bird: Transformers for Longer Sequences | 2020-07-28 | Code |
| 15 | Step-by-Step Retriever | 0.754 | No | - | - | - |
| 16 | Recursive Dense Retriever | 0.753 | No | Answering Complex Open-Domain Questions with Mul... | 2020-09-27 | Code |
| 17 | DR model large | 0.753 | No | - | - | - |
| 18 | Model name | 0.746 | No | - | - | - |
| 19 | HopAns | 0.746 | No | - | - | - |
| 20 | Multi-dimensional-AFSGraph | 0.746 | No | - | - | - |
| 21 | HopRetriever-V1 | 0.739 | No | - | - | - |
| 22 | Anonymous | 0.732 | No | - | - | - |
| 23 | Tree-shaped-cluster | 0.731 | No | - | - | - |
| 24 | AFSgraph | 0.73 | No | - | - | - |
| 25 | Robustly Fine-tuned Graph-based Recurrent Retriever | 0.73 | No | Learning to Retrieve Reasoning Paths over Wikipe... | 2019-11-24 | Code |
| 26 | AFSgraph model | 0.73 | No | - | - | - |
| 27 | RoBERTa-DenseRetriever-Fast | 0.727 | No | - | - | - |
| 28 | DPR-recurrent | 0.727 | No | - | - | - |
| 29 | RoBERTa-DenseRetriever | 0.724 | No | - | - | - |
| 30 | DR model | 0.717 | No | - | - | - |
| 31 | SAFSR model | 0.716 | No | HotpotQA: A Dataset for Diverse, Explainable Mul... | 2018-09-25 | Code |
| 32 | HGN-albert + SemanticRetrievalMRS IR | 0.714 | No | - | - | - |
| 33 | PromptRank-fewshot-2-demo | 0.711 | No | - | - | - |
| 34 | graph-recurrent-retriever+roberta-base w. S/R-pretraining | 0.71 | No | - | - | - |
| 35 | GraphRR-Fast | 0.709 | No | - | - | - |
| 36 | HGN-large + SemanticRetrievalMRS IR | 0.699 | No | - | - | - |
| 37 | HGN + SemanticRetrievalMRS IR | 0.692 | No | Hierarchical Graph Network for Multi-hop Questio... | 2019-11-09 | Code |
| 38 | Graph-based Recurrent Retriever | 0.689 | No | - | - | - |
| 39 | Quark + SemanticRetrievalMRS IR | 0.675 | No | A Simple Yet Strong Pipeline for HotpotQA | 2020-04-14 | - |
| 40 | GAR-BERT | 0.648 | No | - | - | - |
| 41 | MIR+EPS+BERT | 0.648 | No | - | - | - |
| 42 | Transformer-XH-final | 0.641 | No | - | - | Code |
| 43 | GAR | 0.613 | No | - | - | - |
| 44 | Transformer-XH | 0.608 | No | - | - | - |
| 45 | GAR-NOSF | 0.606 | No | - | - | - |
| 46 | SemanticRetrievalMRS | 0.573 | No | Revealing the Importance of Semantic Retrieval f... | 2019-09-17 | Code |
| 47 | PR-Bert | 0.538 | No | - | - | - |
| 48 | Entity-centric BERT Pipeline | 0.531 | No | - | - | - |
| 49 | DrKIT | 0.517 | No | - | - | - |
| 50 | SAFSr-Bert | 0.514 | No | - | - | - |
| 51 | Cognitive Graph QA | 0.489 | No | Cognitive Graph for Multi-Hop Reading Comprehens... | 2019-05-14 | Code |
| 52 | GoldEn Retriever | 0.486 | No | Answering Complex Open-domain Questions Through ... | 2019-10-15 | Code |
| 53 | TPReasoner w/o BERT | 0.474 | No | - | - | - |
| 54 | Entity-centric IR | 0.463 | No | - | - | - |
| 55 | AnonymousQ | 0.46 | No | - | - | - |
| 56 | IKFGraph | 0.453 | No | - | - | - |
| 57 | HGN Model-reproduce | 0.427 | No | - | - | - |
| 58 | DecompRC | 0.407 | No | Multi-hop Reading Comprehension through Question... | 2019-06-07 | Code |
| 59 |  | 0.407 | No | - | - | - |
| 60 | MUPPET | 0.403 | No | Multi-Hop Paragraph Retrieval for Open-Domain Qu... | 2019-06-15 | Code |
| 61 | MultiQA | 0.402 | No | - | - | - |
| 62 | GRN + BERT | 0.391 | No | - | - | - |
| 63 | SAFSr_model | 0.391 | No | - | - | - |
| 64 | SAQA | 0.386 | No | - | - | - |
| 65 | QFE | 0.381 | No | Answering while Summarizing: Multi-task Learning... | 2019-05-21 | - |
| 66 | KGNN | 0.372 | No | Multi-Paragraph Reasoning with Knowledge-enhance... | 2019-11-06 | - |
| 67 | GRN | 0.365 | No | - | - | - |
| 68 | Baseline Model | 0.329 | No | HotpotQA: A Dataset for Diverse, Explainable Mul... | 2018-09-25 | Code |
| 69 | SuppBERT | 0.32 | No | - | - | - |
| 70 | Mistral multi hop with very large sources | 0.221 | No | - | - | - |
| 71 | tes | 0.121 | No | - | - | - |