Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


MetaGen Blended RAG: Higher Accuracy for Domain-Specific Q&A Without Fine-Tuning

Kunal Sawarkar, Shivam R. Solanki, Abhilasha Mangal

2025-05-23 · Question Answering · Few-Shot Learning · Retrieval · RAG
Paper · PDF · Code

Abstract

Despite the widespread exploration of Retrieval-Augmented Generation (RAG), its enterprise deployment on domain-specific datasets remains limited by poor answer accuracy. These corpora, often shielded behind firewalls in private enterprise knowledge bases, contain complex, domain-specific terminology rarely seen by LLMs during pre-training, and they exhibit significant semantic variability across domains (such as networking, military, or legal) or even within a single domain such as medicine, resulting in poor context precision for RAG systems. The usual remedies, fine-tuning or RAG combined with fine-tuning, are slow, expensive, and generalize poorly as new domain-specific data emerges. We propose an approach for Enterprise Search that enhances the retriever for a domain-specific corpus through hybrid query indexes and metadata enrichment. This 'MetaGen Blended RAG' method constructs a metadata generation pipeline using key concepts, topics, and acronyms, and then creates a metadata-enriched hybrid index with boosted search queries. The approach avoids overfitting and generalizes effectively across domains. On the PubMedQA benchmark for the biomedical domain, the proposed method achieves 82% retrieval accuracy and 77% RAG accuracy, surpassing all previous RAG results obtained without fine-tuning, setting a new benchmark for zero-shot results, and outperforming much larger models such as GPT-3.5. The results are even comparable to the best fine-tuned models on this dataset, and we further demonstrate the robustness and scalability of the approach by evaluating it on other Q&A datasets such as SQuAD and NQ.
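The abstract's core retrieval idea, a metadata-enriched hybrid index queried with boosted search clauses, can be sketched as a query builder. This is an illustrative assumption, not the authors' implementation: the field names (`text`, `key_concepts`, `acronyms`) and boost values are hypothetical, and the output follows the Elasticsearch-style bool/should query shape commonly used for such hybrid lexical retrieval.

```python
def build_boosted_hybrid_query(question, concept_boost=2.0, acronym_boost=1.5):
    """Sketch of a 'boosted search query' over a metadata-enriched index.

    Combines a plain full-text match with matches against generated
    metadata fields (key concepts, acronyms), each weighted by a boost,
    in an Elasticsearch-style bool query. Field names and boost values
    are illustrative, not taken from the paper.
    """
    return {
        "query": {
            "bool": {
                "should": [
                    # Base lexical match on the raw document body.
                    {"match": {"text": {"query": question}}},
                    # Boosted match on LLM-generated key concepts.
                    {"match": {"key_concepts": {"query": question,
                                                "boost": concept_boost}}},
                    # Boosted match on extracted acronyms/expansions.
                    {"match": {"acronyms": {"query": question,
                                            "boost": acronym_boost}}},
                ]
            }
        }
    }

query = build_boosted_hybrid_query("Does metformin lower HbA1c?")
```

A document that matches both the body text and the enriched metadata fields accumulates a higher relevance score than one matching the body alone, which is one plausible reading of how the metadata enrichment improves context precision.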

Results

Task | Dataset | Metric | Value | Model
Few-Shot Learning | PubMedQA | Accuracy | 77.9 | MetaGen Blended RAG (zero-shot)
Question Answering | PubMedQA | Accuracy | 77.9 | MetaGen Blended RAG (zero-shot)
Knowledge Graphs | PubMedQA corpus with metadata | ANS-EM | 77.9 | MetaGen Blended RAG
Meta-Learning | PubMedQA | Accuracy | 77.9 | MetaGen Blended RAG (zero-shot)
Knowledge Graph Completion | PubMedQA corpus with metadata | ANS-EM | 77.9 | MetaGen Blended RAG
Retrieval | PubMedQA | Accuracy (Top-1) | 82.1 | MetaGen Blended RAG
Retrieval | PubMedQA corpus with metadata | Accuracy (Top-1) | 82.1 | MetaGen Blended RAG
Large Language Model | PubMedQA corpus with metadata | ANS-EM | 77.9 | MetaGen Blended RAG
Inductive Knowledge Graph Completion | PubMedQA corpus with metadata | ANS-EM | 77.9 | MetaGen Blended RAG

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL (2025-07-17)
Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering (2025-07-17)
Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It (2025-07-17)
City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning (2025-07-17)
GLAD: Generalizable Tuning for Vision-Language Models (2025-07-17)
HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals (2025-07-17)
A Survey of Context Engineering for Large Language Models (2025-07-17)
MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval (2025-07-17)