Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Question Answering on Bamboogle

Metric: Accuracy (higher is better)


Results

| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|-------|----------|------------|-------|------|------|
| 1 | ReST meets ReAct (PaLM 2-L + Google Search) | 76.1 | No | ReST meets ReAct: Self-Improvement for Multi-Ste... | 2023-12-15 | - |
| 2 | MCR (code-davinci-002) + Google Search | 66.5 | No | Answering Questions by Meta-Reasoning over Multi... | 2023-04-25 | Code |
| 3 | RALM (LLaMA2-13B + Google Search) | 62.7 | No | Making Retrieval-Augmented Language Models Robus... | 2023-10-02 | Code |
| 4 | Self-ask (GPT-3; davinci-002) + Google Search | 60 | No | Measuring and Narrowing the Compositionality Gap... | 2022-10-07 | Code |
| 5 | Self-ask (GPT-3; davinci-002) | 57.6 | No | Measuring and Narrowing the Compositionality Gap... | 2022-10-07 | Code |
| 6 | Chain-of-Thought (GPT-3; davinci-002) | 46.4 | No | Measuring and Narrowing the Compositionality Gap... | 2022-10-07 | Code |
| 7 | FireAct | 44 | No | FireAct: Toward Language Agent Fine-tuning | 2023-10-09 | - |
| 8 | Direct Prompting (GPT-3; davinci-002) | 17.6 | No | Measuring and Narrowing the Compositionality Gap... | 2022-10-07 | Code |
| 9 | Google Search | 0 | No | - | - | Code |