CodeQA

Introduced 2021-09-17

CodeQA is a free-form question answering dataset for the purpose of source code comprehension: given a code snippet and a question, a textual answer is required to be generated. CodeQA contains a Java dataset with 119,778 question-answer pairs and a Python dataset with 70,085 question-answer pairs.

Description from: CodeQA: A Question Answering Dataset for Source Code Comprehension