Reading Comprehension on RadQA

Metric: Answer F1 (higher is better)
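
As a rough illustration of the metric, the sketch below computes a token-overlap F1 between a predicted answer span and a reference answer. It assumes a SQuAD-style definition (lowercasing and whitespace tokenization); the official RadQA scorer may normalize text differently, and the function name `answer_f1` is hypothetical.

```python
from collections import Counter


def answer_f1(prediction: str, reference: str) -> float:
    """Token-level F1 between two answer strings (assumed SQuAD-style)."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty
    # (i.e. both agree the question has no answer).
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it appears in both strings.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

For example, `answer_f1("left lower lobe pneumonia", "lower lobe pneumonia")` has precision 3/4 and recall 1, giving an F1 of 6/7. A leaderboard score would typically be this value averaged over all questions, taking the best match when a question has several reference answers.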
