Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection

Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti

2022-05-20Question Answering Answer Selection

Abstract

An important task for designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question from a set of retrieved relevant documents. In this paper, we propose three novel sentence-level transformer pre-training objectives that incorporate paragraph-level semantics within and across documents, to improve the performance of transformers for AS2, and mitigate the requirement of large labeled datasets. Specifically, the model is tasked to predict whether: (i) two sentences are extracted from the same paragraph, (ii) a given sentence is extracted from a given paragraph, and (iii) two paragraphs are extracted from the same document. Our experiments on three public and one industrial AS2 datasets demonstrate the empirical superiority of our pre-trained transformers over baseline models such as RoBERTa and ELECTRA for AS2.

Results

Task	Dataset	Metric	Value	Model
Question Answering	TrecQA	MAP	0.923	DeBERTa-V3-Large + SSP
Question Answering	TrecQA	MRR	0.946	DeBERTa-V3-Large + SSP
Question Answering	TrecQA	MAP	0.903	RoBERTa-Base + PSD
Question Answering	TrecQA	MRR	0.951	RoBERTa-Base + PSD
Question Answering	WikiQA	MAP	0.909	DeBERTa-V3-Large + ALL
Question Answering	WikiQA	MRR	0.92	DeBERTa-V3-Large + ALL
Question Answering	WikiQA	MAP	0.901	DeBERTa-Large + SSP
Question Answering	WikiQA	MRR	0.914	DeBERTa-Large + SSP
Question Answering	WikiQA	MAP	0.887	RoBERTa-Base + SSP
Question Answering	WikiQA	MRR	0.899	RoBERTa-Base + SSP
Question Answering	ASNQ	MAP	0.743	DeBERTa-V3-Large + SSP
Question Answering	ASNQ	MRR	0.8	DeBERTa-V3-Large + SSP
Question Answering	ASNQ	MAP	0.697	ELECTRA-Base + SSP
Question Answering	ASNQ	MRR	0.757	ELECTRA-Base + SSP

Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection

Abstract

Results

Related Papers

Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection

Abstract

Results

Related Papers