Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Question Answering on NewsQA

Metric: F1 (higher is better)
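The page does not say how F1 is computed. Extractive question answering benchmarks commonly use a token-overlap F1 between the predicted and reference answer spans, so the sketch below assumes that definition. The function name `token_f1` and the whitespace tokenization are illustrative choices, not taken from this leaderboard, and official evaluation scripts usually add further normalization (punctuation and article stripping).

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer.

    Assumption: lowercase + whitespace tokenization only. Real evaluation
    scripts typically normalize more aggressively.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Prediction has one extra token: precision 2/3, recall 1.
    print(token_f1("the cat sat", "the cat"))
```

Under this assumption, a leaderboard-style score would be the mean of `token_f1` over all questions, multiplied by 100 to match the 0 to 100 scale used in the results.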

Results

| # | Model | F1 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | Riple/Saanvi-v0.5-DeepAnalysis | 94.01 | Yes | DeepSense: A Unified Deep Learning Framework for... | 2016-11-07 | Code |
| 2 | OpenAI/o3-2025-01-31-high | 93.13 | Yes | o3-mini vs DeepSeek-R1: Which One is Safer? | 2025-01-30 | Code |
| 3 | OpenAI/o4-mini-2025-05-01-high | 91.31 | Yes | Thinking Like Transformers | 2021-06-13 | Code |
| 4 | OpenAI/o1-2024-12-17-high | 88.72 | Yes | 0/1 Deep Neural Networks via Block Coordinate De... | 2022-06-19 | - |
| 5 | xAI/grok-3-1212 | 88.24 | Yes | XAI for Transformers: Better Explanations throug... | 2022-02-15 | Code |
| 6 | deepseek-r1 | 86.13 | Yes | DeepSeek-R1: Incentivizing Reasoning Capability ... | 2025-01-22 | Code |
| 7 | Riple/Saanvi-v0.1 | 85.44 | No | Time-series Transformer Generative Adversarial N... | 2022-05-23 | Code |
| 8 | Anthropic/claude-3-7-sonnet | 82.3 | No | - | - | - |
| 9 | OpenAI/GPT-4o | 81.74 | Yes | GPT-4o as the Gold Standard: A Scalable and Gene... | 2024-10-03 | - |
| 10 | Google/Gemini 2.5 Pro | 79.91 | Yes | Gemini 1.5: Unlocking multimodal understanding a... | 2024-03-08 | Code |
| 11 | SpanBERT | 73.6 | No | SpanBERT: Improving Pre-training by Representing... | 2019-07-24 | Code |
| 12 | LinkBERT (large) | 72.6 | Yes | LinkBERT: Pretraining Language Models with Docum... | 2022-03-29 | Code |
| 13 | DyREX | 68.53 | Yes | DyREx: Dynamic Query Representation for Extracti... | 2022-10-26 | Code |
| 14 | DecaProp | 66.3 | No | Densely Connected Attention Propagation for Read... | 2018-11-10 | Code |
| 15 | BERT+ASGen | 64.5 | No | - | - | - |
| 16 | AMANDA | 63.7 | No | A Question-Focused Multi-Factor Attention Networ... | 2018-01-25 | Code |
| 17 | MINIMAL(Dyn) | 63.2 | Yes | Efficient and Robust Question Answering from Min... | 2018-05-21 | Code |
| 18 | FastQAExt | 56.1 | Yes | Making Neural QA as Simple as Possible but not S... | 2017-03-14 | Code |