On the validity of pre-trained transformers for natural language processing in the software engineering domain

09/10/2021
by   Julian von der Mosel, et al.
0

Transformers are the current state-of-the-art of natural language processing in many domains and are using traction within software engineering research as well. Such models are pre-trained on large amounts of data, usually from the general domain. However, we only have a limited understanding regarding the validity of transformers within the software engineering domain, i.e., how good such models are at understanding words and sentences within a software engineering context and how this improves the state-of-the-art. Within this article, we shed light on this complex, but crucial issue. We compare BERT transformer models trained with software engineering data with transformers based on general domain data in multiple dimensions: their vocabulary, their ability to understand which words are missing, and their performance in classification tasks. Our results show that for tasks that require understanding of the software engineering context, pre-training with software engineering data is valuable, while general domain models are sufficient for general language understanding, also within the software engineering domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/14/2022

The Dark Side of the Language: Pre-trained Transformers in the DarkNet

Pre-trained Transformers are challenging human performances in many natu...
research
04/06/2021

CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing

Currently, a growing number of mature natural language processing applic...
research
03/18/2023

Stop Words for Processing Software Engineering Documents: Do they Matter?

Stop words, which are considered non-predictive, are often eliminated in...
research
08/06/2021

Distilling Transformers for Neural Cross-Domain Search

Pre-trained transformers have recently clinched top spots in the gamut o...
research
12/04/2020

A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering

Chatbots are envisioned to dramatically change the future of Software En...
research
02/05/2021

Understanding Emails and Drafting Responses – An Approach Using GPT-3

Providing computer systems with the ability to understand and generate n...
research
03/27/2017

Bootstrapping a Lexicon for Emotional Arousal in Software Engineering

Emotional arousal increases activation and performance but may also lead...

Please sign up or login with your details

Forgot password? Click here to reset