CodeQA: A Question Answering Dataset for Source Code Comprehension

09/17/2021
by   Chenxiao Liu, et al.
0

We propose CodeQA, a free-form question answering dataset for the purpose of source code comprehension: given a code snippet and a question, a textual answer is required to be generated. CodeQA contains a Java dataset with 119,778 question-answer pairs and a Python dataset with 70,085 question-answer pairs. To obtain natural and faithful questions and answers, we implement syntactic rules and semantic analysis to transform code comments into question-answer pairs. We present the construction process and conduct systematic analysis of our dataset. Experiment results achieved by several neural baselines on our dataset are shown and discussed. While research on question-answering and machine reading comprehension develops rapidly, few prior work has drawn attention to code question answering. This new dataset can serve as a useful research benchmark for source code comprehension.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2022

CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course

We introduce CS1QA, a dataset for code-based question answering in the p...
research
04/18/2017

SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine

We publicly release a new large-scale dataset, called SearchQA, for mach...
research
09/16/2022

ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

We present a new task and dataset, ScreenQA, for screen content understa...
research
10/29/2018

ReviewQA: a relational aspect-based opinion reading dataset

Deep reading models for question-answering have demonstrated promising p...
research
04/20/2021

Towards Solving Multimodal Comprehension

This paper targets the problem of procedural multimodal machine comprehe...
research
09/17/2022

Learning to Answer Semantic Queries over Code

During software development, developers need answers to queries about se...
research
11/01/2021

Discourse Comprehension: A Question Answering Framework to Represent Sentence Connections

While there has been substantial progress in text comprehension through ...

Please sign up or login with your details

Forgot password? Click here to reset