TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/PEN

PEN

Problems with Explanations for Numbers

Introduced 2022-05-01
  • Provided explanations on the existing three benchmark datasets on solving algebraic word problems: ALG514, DRAW-1K, MAWPS

Benchmarks

Math Word Problem Solving/Accuracy (%)Mathematical Question Answering/Accuracy (%)Mathematical Reasoning/Accuracy (%)Question Answering/Accuracy (%)

Related Benchmarks

PenDigits/Time Series Classification/AccuracyPenDigits/Time Series Classification/NLLPendulum-v1/OpenAI Gym/Action RepetitionPendulum-v1/OpenAI Gym/Average DecisionsPendulum-v1/OpenAI Gym/Mean RewardPenn Action/Action Recognition/AccuracyPenn Action/Activity Recognition/AccuracyPenn Treebank/Chunking/F1 scorePenn Treebank/Constituency Parsing/F1 scorePenn Treebank/Dependency Parsing/LASPenn Treebank/Dependency Parsing/POSPenn Treebank/Dependency Parsing/UASPenn Treebank/Open Information Extraction/AUCPenn Treebank/Open Information Extraction/F1Penn Treebank/Part-Of-Speech Tagging/AccuracyPenn Treebank/Shallow Syntax/F1 scorePenn Treebank (Character Level)/Language Modelling/Bit per Character (BPC)Penn Treebank (Character Level)/Language Modelling/Number of paramsPenn Treebank (Character Level) 3x1000 LSTM - 500 Epochs/Stochastic Optimization/Bit per Character (BPC)Penn Treebank (Word Level)/Language Modelling/ParamsPenn Treebank (Word Level)/Language Modelling/Test perplexityPenn Treebank (Word Level)/Language Modelling/Validation perplexityPenn94/Node Classification/1:1 AccuracyPenn94/Node Classification/Accuracypendigits/Image Clustering/Accuracypendigits/Image Clustering/NMIpendigits/Image/Document Clustering/Accuracy (%)pendigits/Image/Document Clustering/NMIpendigits/Image/Document Clustering/runtime (s)pendigits/Time Series Classification/Accuracypendulum.swingup/3D/Returnpendulum.swingup/3D Face Modelling/Returnpendulum.swingup/Continuous Control/Return

Statistics

Papers
5
Benchmarks
4

Links

Homepage

Tasks

Math Word Problem SolvingMathematical Question AnsweringMathematical ReasoningQuestion Answering