Stochastic Code Generation

04/14/2023
by   Swapnil Sharma, et al.
0

Large language models pre-trained for code generation can generate high-quality short code but often struggle with generating coherent long code and understanding higher-level or system-level specifications. This issue is also observed in language modeling for long text generation, and one proposed solution is the use of a latent stochastic process. This approach involves generating a document plan and then producing text that is consistent with it. In this study, we investigate whether this technique can be applied to code generation to improve coherence. We base our proposed encoder and decoder on the pre-trained GPT-2 based CodeParrot model and utilize the APPS dataset for training. We evaluate our results using the HumanEval benchmark and observe that the modified Time Control model performs similarly to CodeParrot on this evaluation.

READ FULL TEXT

page 3

page 5

research
03/21/2022

Language modeling via stochastic processes

Modern language models can generate high-quality short texts. However, t...
research
02/01/2019

Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models

Variational autoencoders (VAEs) have received much attention recently as...
research
05/08/2023

Coherent Wave Dynamics and Language Generation of a Generative Pre-trained Transformer

Large Language Models (LLMs), such as the Generative Pretrained Transfor...
research
12/23/2022

Learning to Generate Questions by Enhancing Text Generation with Sentence Selection

We introduce an approach for the answer-aware question generation proble...
research
05/05/2022

Assistive Recipe Editing through Critiquing

There has recently been growing interest in the automatic generation of ...
research
12/02/2019

Neural Academic Paper Generation

In this work, we tackle the problem of structured text generation, speci...
research
09/12/2023

Unveiling the potential of large language models in generating semantic and cross-language clones

Semantic and Cross-language code clone generation may be useful for code...

Please sign up or login with your details

Forgot password? Click here to reset