Neural Code Search Evaluation Dataset

08/26/2019
by   Hongyu Li, et al.
0

There has been an increase of interest in code search using natural language. Assessing the performance of such code search models can be difficult without a readily available evaluation suite. In this paper, we present an evaluation dataset consisting of natural language query and code snippet pairs, with the hope that future work in this area can use this dataset as a common benchmark. We also provide the results of two code search models ([1] and [6]) from recent work.

READ FULL TEXT

page 1

page 2

page 3

research
04/16/2021

BERT2Code: Can Pretrained Language Models be Leveraged for Code Search?

Millions of repetitive code snippets are submitted to code repositories ...
research
05/09/2019

When Deep Learning Met Code Search

There have been multiple recent proposals on using deep neural networks ...
research
05/28/2023

ConvGenVisMo: Evaluation of Conversational Generative Vision Models

Conversational generative vision models (CGVMs) like Visual ChatGPT (Wu ...
research
02/14/2022

On the Importance of Building High-quality Training Datasets for Neural Code Search

The performance of neural code search is significantly influenced by the...
research
10/01/2022

CodeDSI: Differentiable Code Search

Reimplementing solutions to previously solved software engineering probl...
research
10/12/2020

Evaluation of Siamese Networks for Semantic Code Search

With the increase in the number of open repositories and discussion foru...
research
05/19/2023

Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets

Code search is an important task that has seen many developments in rece...

Please sign up or login with your details

Forgot password? Click here to reset