Large Language Model on PubMedQA corpus with metadata

Metric: ANS-EM (higher is better)

LeaderboardDataset
Loading chart...
#ModelANS-EMAugmentationsPaperDateCode
1MetaGen Blended RAG77.9NoMetaGen Blended RAG: Higher Accuracy for Domain-...2025-05-23Code