A Preliminary Analysis on the Code Generation Capabilities of GPT-3.5 and Bard AI Models for Java Functions

05/16/2023
by   Giuseppe Destefanis, et al.
0

This paper evaluates the capability of two state-of-the-art artificial intelligence (AI) models, GPT-3.5 and Bard, in generating Java code given a function description. We sourced the descriptions from CodingBat.com, a popular online platform that provides practice problems to learn programming. We compared the Java code generated by both models based on correctness, verified through the platform's own test cases. The results indicate clear differences in the capabilities of the two models. GPT-3.5 demonstrated superior performance, generating correct code for approximately 90.6 descriptions, whereas Bard produced correct code for 53.1 While both models exhibited strengths and weaknesses, these findings suggest potential avenues for the development and refinement of more advanced AI-assisted code generation tools. The study underlines the potential of AI in automating and supporting aspects of software development, although further research is required to fully realize this potential.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2023

Towards Generating Functionally Correct Code Edits from Natural Language Issue Descriptions

Large language models (LLMs), such as OpenAI's Codex, have demonstrated ...
research
07/17/2023

Extending the Frontier of ChatGPT: Code Generation and Debugging

Large-scale language models (LLMs) have emerged as a groundbreaking inno...
research
08/08/2023

A Comparative Study of Code Generation using ChatGPT 3.5 across 10 Programming Languages

Large Language Models (LLMs) are advanced Artificial Intelligence (AI) s...
research
02/08/2023

Will ChatGPT get you caught? Rethinking of Plagiarism Detection

The rise of Artificial Intelligence (AI) technology and its impact on ed...
research
09/12/2023

Unveiling the potential of large language models in generating semantic and cross-language clones

Semantic and Cross-language code clone generation may be useful for code...
research
03/21/2022

A Policy Driven AI-Assisted PoW Framework

Proof of Work (PoW) based cyberdefense systems require incoming network ...

Please sign up or login with your details

Forgot password? Click here to reset