Argument Mining on ValNov Subtask B

Metric: NOV-F1 (higher is better)

LeaderboardDataset