SOTitle: A Transformer-based Post Title Generation Approach for Stack Overflow

02/20/2022
by   Ke Liu, et al.
0

On Stack Overflow, developers can not only browse question posts to solve their programming problems but also gain expertise from the question posts to help improve their programming skills. Therefore, improving the quality of question posts in Stack Overflow has attracted the wide attention of researchers. A concise and precise title can play an important role in helping developers understand the key information of the question post, which can improve the post quality. However, the quality of the generated title is not high due to the lack of professional knowledge related to their questions or the poor presentation ability of developers. A previous study aimed to automatically generate the title by analyzing the code snippets in the question post. However, this study ignored the useful information in the corresponding problem description. Therefore, we propose an approach SOTitle for automatic post title generation by leveraging the code snippets and the problem description in the question post (i.e., the multi-modal input). SOTitle follows the Transformer structure, which can effectively capture long-term dependencies through a multi-head attention mechanism. To verify the effectiveness of SOTitle, we construct a large-scale high-quality corpus from Stack Overflow, which includes 1,168,257 high-quality question posts for four popular programming languages. Experimental results show that SOTitle can significantly outperform six state-of-the-art baselines in both automatic evaluation and human evaluation. To encourage follow-up studies, we make our corpus and approach publicly available

READ FULL TEXT

page 1

page 2

page 10

research
05/20/2020

Generating Question Titles for Stack Overflow from Mined Code Snippets

Stack Overflow has been heavily used by software developers as a popular...
research
08/24/2022

Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer

Stack Overflow is one of the most popular programming communities where ...
research
09/27/2021

Improving Stack Overflow question title generation with copying enhanced CodeBERT model and bi-modal information

Context: Stack Overflow is very helpful for software developers who are ...
research
02/01/2021

Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow

As a popular Q A site for programming, Stack Overflow is a treasure fo...
research
07/14/2023

Are Large Language Models a Threat to Digital Public Goods? Evidence from Activity on Stack Overflow

Large language models like ChatGPT efficiently provide users with inform...
research
04/22/2023

Deconstructing Sentimental Stack Overflow Posts Through Interviews: Exploring the Case of Software Testing

The analysis of sentimental posts about software testing on Stack Overfl...
research
11/02/2018

The Evolution of Stack Overflow Posts: Reconstruction and Analysis

Stack Overflow (SO) is the most popular question-and-answer website for ...

Please sign up or login with your details

Forgot password? Click here to reset