TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/IAM: A Comprehensive and Large-Scale Dataset for Integrate...

IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument Mining Tasks

Liying Cheng, Lidong Bing, Ruidan He, Qian Yu, Yan Zhang, Luo Si

2022-03-23ACL 2022 5Stance ClassificationClaim-Evidence Pair Extraction (CEPE)Claim Extraction with Stance Classification (CESC)Argument Mining
PaperPDFCode(official)

Abstract

Traditionally, a debate usually requires a manual preparation process, including reading plenty of articles, selecting the claims, identifying the stances of the claims, seeking the evidence for the claims, etc. As the AI debate attracts more attention these years, it is worth exploring the methods to automate the tedious process involved in the debating system. In this work, we introduce a comprehensive and large dataset named IAM, which can be applied to a series of argument mining tasks, including claim extraction, stance classification, evidence extraction, etc. Our dataset is collected from over 1k articles related to 123 topics. Near 70k sentences in the dataset are fully annotated based on their argument properties (e.g., claims, stances, evidence, etc.). We further propose two new integrated argument mining tasks associated with the debate preparation process: (1) claim extraction with stance classification (CESC) and (2) claim-evidence pair extraction (CEPE). We adopt a pipeline approach and an end-to-end method for each integrated task separately. Promising experimental results are reported to show the values and challenges of our proposed tasks, and motivate future research on argument mining.

Results

TaskDatasetMetricValueModel
Data MiningIAM DatasetMacro F160.25Multi-label
Data MiningIAM DatasetF135.92Multi-task
Interpretable Machine LearningIAM DatasetMacro F160.25Multi-label
Interpretable Machine LearningIAM DatasetF135.92Multi-task
Argument MiningIAM DatasetMacro F160.25Multi-label
Argument MiningIAM DatasetF135.92Multi-task

Related Papers

Leveraging Context for Multimodal Fallacy Classification in Political Debates2025-07-21ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection2025-06-17LLMs for Argument Mining: Detection, Extraction, and Relationship Classification of pre-defined Arguments in Online Comments2025-05-29SCRum-9: Multilingual Stance Classification over Rumours on Social Media2025-05-25Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method2025-05-17Exploring new Approaches for Information Retrieval through Natural Language Processing2025-05-04Argument Summarization and its Evaluation in the Era of Large Language Models2025-03-02Leveraging Small LLMs for Argument Mining in Education: Argument Component Identification, Classification, and Assessment2025-02-20