TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Document Ranking with a Pretrained Sequence-to-Sequence Mo...

Document Ranking with a Pretrained Sequence-to-Sequence Model

Rodrigo Nogueira, Zhiying Jiang, Jimmy Lin

2020-03-14Findings of the Association for Computational Linguistics 2020Passage RankingAd-Hoc Information RetrievalGeneral ClassificationDocument Ranking
PaperPDFCodeCode

Abstract

This work proposes a novel adaptation of a pretrained sequence-to-sequence model to the task of document ranking. Our approach is fundamentally different from a commonly-adopted classification-based formulation of ranking, based on encoder-only pretrained transformer architectures such as BERT. We show how a sequence-to-sequence model can be trained to generate relevance labels as "target words", and how the underlying logits of these target words can be interpreted as relevance probabilities for ranking. On the popular MS MARCO passage ranking task, experimental results show that our approach is at least on par with previous classification-based models and can surpass them with larger, more-recent models. On the test collection from the TREC 2004 Robust Track, we demonstrate a zero-shot transfer-based approach that outperforms previous state-of-the-art models requiring in-dataset cross-validation. Furthermore, we find that our approach significantly outperforms an encoder-only model in a data-poor regime (i.e., with few training examples). We investigate this observation further by varying target words to probe the model's use of latent knowledge.

Results

TaskDatasetMetricValueModel
Ad-Hoc Information RetrievalTREC Robust04MAP0.3876monoT5-3B (zero-shot)
Ad-Hoc Information RetrievalTREC Robust04P@200.5165monoT5-3B (zero-shot)
Ad-Hoc Information RetrievalTREC Robust04nDCG@200.6091monoT5-3B (zero-shot)

Related Papers

IRanker: Towards Ranking Foundation Model2025-06-25Precise Zero-Shot Pointwise Ranking with LLMs through Post-Aggregated Global Context Information2025-06-12Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems2025-05-22Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking2025-05-20Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering2025-05-17A Unified Retrieval Framework with Document Ranking and EDU Filtering for Multi-document Summarization2025-04-23Specialized text classification: an approach to classifying Open Banking transactions2025-04-10How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective2025-04-10