Large Language Model on PubMedQA corpus with metadata
Metric: ANS-EM (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | ANS-EM▼ | Augmentations | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | MetaGen Blended RAG | 77.9 | No | MetaGen Blended RAG: Higher Accuracy for Domain-... | 2025-05-23 | Code |