TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Identification of Tasks, Datasets, Evaluation Metrics, and...

Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction

Yufang Hou, Charles Jochim, Martin Gleize, Francesca Bonin, Debasis Ganguly

2019-06-21ACL 2019 7Scientific Results Extraction
PaperPDFCode(official)

Abstract

While the fast-paced inception of novel tasks and new datasets helps foster active research in a community towards interesting directions, keeping track of the abundance of research activity in different areas on different datasets is likely to become increasingly difficult. The community could greatly benefit from an automatic system able to summarize scientific results, e.g., in the form of a leaderboard. In this paper we build two datasets and develop a framework (TDMS-IE) aimed at automatically extracting task, dataset, metric and score from NLP papers, towards the automatic construction of leaderboards. Experiments show that our model outperforms several baselines by a large margin. Our model is a first step towards automatic leaderboard construction, e.g., in the NLP domain.

Results

TaskDatasetMetricValueModel
Information RetrievalNLP-TDMS (Exp, arXiv only)Macro F18.8TDMS-IE
Information RetrievalNLP-TDMS (Exp, arXiv only)Macro Precision9.5TDMS-IE
Information RetrievalNLP-TDMS (Exp, arXiv only)Macro Recall8.6TDMS-IE
Information RetrievalNLP-TDMS (Exp, arXiv only)Micro F17.5TDMS-IE
Information RetrievalNLP-TDMS (Exp, arXiv only)Micro Precision6.8TDMS-IE
Information RetrievalNLP-TDMS (Exp, arXiv only)Micro Recall8.4TDMS-IE

Related Papers

Predicting Real-time Scientific Experiments Using Transformer models and Reinforcement Learning2022-04-25Automated Mining of Leaderboards for Empirical AI Research2021-08-31AxCell: Automatic Extraction of Results from Machine Learning Papers2020-04-29unarXive: A Large Scholarly Data Set with Publications' Full-Text, Annotated In-Text Citations, and Links to Metadata2020-03-02