CoQA: A Conversational Question Answering Challenge

Siva Reddy, Danqi Chen, Christopher D. Manning

2018-08-21TACL 2019 3Reading Comprehension Question Answering Conversational Question Answering Generative Question Answering

Paper PDF Code Code Code Code

Abstract

Humans gather information by engaging in conversations involving a series of interconnected questions and answers. For machines to assist in information gathering, it is therefore essential to enable them to answer conversational questions. We introduce CoQA, a novel dataset for building Conversational Question Answering systems. Our dataset contains 127k questions with answers, obtained from 8k conversations about text passages from seven diverse domains. The questions are conversational, and the answers are free-form text with their corresponding evidence highlighted in the passage. We analyze CoQA in depth and show that conversational questions have challenging phenomena not present in existing reading comprehension datasets, e.g., coreference and pragmatic reasoning. We evaluate strong conversational and reading comprehension models on CoQA. The best system obtains an F1 score of 65.4%, which is 23.4 points behind human performance (88.8%), indicating there is ample room for improvement. We launch CoQA as a challenge to the community at http://stanfordnlp.github.io/coqa/

Results

Task	Dataset	Metric	Value	Model
Question Answering	CoQA	In-domain	67	DrQA + seq2seq with copy attention (single model)
Question Answering	CoQA	Out-of-domain	60.4	DrQA + seq2seq with copy attention (single model)
Question Answering	CoQA	Overall	65.1	DrQA + seq2seq with copy attention (single model)
Question Answering	CoQA	In-domain	54.5	Vanilla DrQA (single model)
Question Answering	CoQA	Out-of-domain	47.9	Vanilla DrQA (single model)
Question Answering	CoQA	Overall	52.6	Vanilla DrQA (single model)
Question Answering	CoQA	F1-Score	45.4	PGNet

CoQA: A Conversational Question Answering Challenge

Abstract

Results

Related Papers

CoQA: A Conversational Question Answering Challenge

Abstract

Results

Related Papers