A Convolutional Neural Network for Language-Agnostic Source Code Summarization

03/29/2019
by Jessica Moore, et al.

Descriptive comments play a crucial role in the software engineering process. They decrease development time, enable better bug detection, and facilitate the reuse of previously written code. However, comments are commonly the last of a software developer's priorities and are thus either insufficient or missing entirely. Automatic source code summarization could therefore significantly improve the software development process. We introduce a novel encoder-decoder model that summarizes source code, effectively writing a comment that describes the code's functionality. We make two primary innovations beyond current source code summarization models. First, our encoder is fully language-agnostic and requires no complex input preprocessing. Second, our decoder has an open vocabulary, enabling it to predict any word, including words not seen in training. We demonstrate results comparable to state-of-the-art methods on a single-language data set and provide the first results on a data set consisting of multiple programming languages.
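
To make the two properties above concrete, here is a minimal sketch of one way an input and output representation could be language-agnostic and open-vocabulary: operating on raw UTF-8 bytes. This is an illustrative assumption, not a description of the paper's actual architecture; the function names `encode_source` and `decode_text`, the padding ID 0, and the `max_len` parameter are all hypothetical.

```python
from typing import List

PAD_ID = 0  # hypothetical padding ID; byte values are shifted up by 1 to keep 0 free


def encode_source(code: str, max_len: int = 64) -> List[int]:
    """Map raw source text to a fixed-length list of byte IDs.

    No lexer, parser, or language-specific tokenizer is involved, so the
    same function accepts code written in any programming language.
    """
    ids = [b + 1 for b in code.encode("utf-8")][:max_len]  # truncate long inputs
    return ids + [PAD_ID] * (max_len - len(ids))           # pad short inputs


def decode_text(ids: List[int]) -> str:
    """Invert encode_source: drop padding and turn byte IDs back into text.

    Because every string is a sequence of bytes, a model that emits byte IDs
    can in principle spell any word, including ones never seen in training.
    """
    data = bytes(i - 1 for i in ids if i != PAD_ID)
    return data.decode("utf-8", errors="replace")


# The same encoder handles snippets from different languages unchanged.
python_ids = encode_source("def add(a, b): return a + b")
go_ids = encode_source("func add(a, b int) int { return a + b }")
```

In this sketch the vocabulary is fixed at 257 symbols (256 byte values plus padding) regardless of the programming language or the words in the summary, which is the sense in which it is open: nothing is mapped to an unknown-word token.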


