IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument Mining Tasks

Liying Cheng, Lidong Bing, Ruidan He, Qian Yu, Yan Zhang, Luo Si

2022-03-23ACL 2022 5Stance Classification Claim-Evidence Pair Extraction (CEPE)Claim Extraction with Stance Classification (CESC)Argument Mining

Paper PDF Code(official)

Abstract

Traditionally, a debate usually requires a manual preparation process, including reading plenty of articles, selecting the claims, identifying the stances of the claims, seeking the evidence for the claims, etc. As the AI debate attracts more attention these years, it is worth exploring the methods to automate the tedious process involved in the debating system. In this work, we introduce a comprehensive and large dataset named IAM, which can be applied to a series of argument mining tasks, including claim extraction, stance classification, evidence extraction, etc. Our dataset is collected from over 1k articles related to 123 topics. Near 70k sentences in the dataset are fully annotated based on their argument properties (e.g., claims, stances, evidence, etc.). We further propose two new integrated argument mining tasks associated with the debate preparation process: (1) claim extraction with stance classification (CESC) and (2) claim-evidence pair extraction (CEPE). We adopt a pipeline approach and an end-to-end method for each integrated task separately. Promising experimental results are reported to show the values and challenges of our proposed tasks, and motivate future research on argument mining.

Results

Task	Dataset	Metric	Value	Model
Data Mining	IAM Dataset	Macro F1	60.25	Multi-label
Data Mining	IAM Dataset	F1	35.92	Multi-task
Interpretable Machine Learning	IAM Dataset	Macro F1	60.25	Multi-label
Interpretable Machine Learning	IAM Dataset	F1	35.92	Multi-task
Argument Mining	IAM Dataset	Macro F1	60.25	Multi-label
Argument Mining	IAM Dataset	F1	35.92	Multi-task

Related Papers

Leveraging Context for Multimodal Fallacy Classification in Political Debates2025-07-21 ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection2025-06-17 LLMs for Argument Mining: Detection, Extraction, and Relationship Classification of pre-defined Arguments in Online Comments2025-05-29 SCRum-9: Multilingual Stance Classification over Rumours on Social Media2025-05-25 Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method2025-05-17 Exploring new Approaches for Information Retrieval through Natural Language Processing2025-05-04 Argument Summarization and its Evaluation in the Era of Large Language Models2025-03-02 Leveraging Small LLMs for Argument Mining in Education: Argument Component Identification, Classification, and Assessment2025-02-20