TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Browse State-of-the-Art

40,176 benchmarks across 2,101 tasks

AllMethodologyComputer VisionNatural Language ProcessingMedicalMiscellaneousTime SeriesGraphsRobotsKnowledge BaseAdversarialAudioSpeechPlaying GamesReasoningComputer CodeMusic

Reasoning

Visual Reasoning

31 benchmarks

698 papers

Video Question Answering

51 benchmarks

460 papers

Multimodal Reasoning

3 benchmarks

302 papers

Arithmetic Reasoning

6 benchmarks

175 papers

Systematic Generalization

0 benchmarks

126 papers

Math Word Problem Solving

17 benchmarks

107 papers

Causal Identification

0 benchmarks

48 papers

Natural Language Visual Grounding

1 benchmarks

32 papers

Odd One Out

1 benchmarks

21 papers

Video-based Generative Performance Benchmarking

7 benchmarks

20 papers

Generative Visual Question Answering

8 benchmarks

9 papers

Error Understanding

8 benchmarks

9 papers

Theory of Mind Modeling

0 benchmarks

8 papers

Analogical Similarity

1 benchmarks

7 papers

Emotion Interpretation

2 benchmarks

6 papers

Human Judgment Correlation

2 benchmarks

5 papers

Identify Odd Metapor

1 benchmarks

2 papers

Pre-election ratings estimation

0 benchmarks

0 papers

Anachronisms

0 benchmarks

0 papers

Human Judgment Classification

1 benchmarks

0 papers

Geometry Problem Solving

0 benchmarks

0 papers

Discrete Choice Models

0 benchmarks

0 papers

Assortment Optimization

0 benchmarks

0 papers

ARC

0 benchmarks

0 papers

Lightweight Deployment

0 benchmarks

0 papers