Semantic Source Code Search: A Study of the Past and a Glimpse at the Future

08/15/2019
by   Muhammad Khalifa, et al.
0

With the recent explosion in the size and complexity of source codebases and software projects, the need for efficient source code search engines has increased dramatically. Unfortunately, existing information retrieval-based methods fail to capture the query semantics and perform well only when the query contains syntax-based keywords. Consequently, such methods will perform poorly when given high-level natural language queries. In this paper, we review existing methods for building code search engines. We also outline the open research directions and the various obstacles that stand in the way of having a universal source code search engine.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2018

RACK: Code Search in the IDE using Crowdsourced Knowledge

Traditional code search engines often do not perform well with natural l...
research
06/08/2017

Source Forager: A Search Engine for Similar Source Code

Developers spend a significant amount of time searching for code: e.g., ...
research
06/01/2017

Function Assistant: A Tool for NL Querying of APIs

In this paper, we describe Function Assistant, a lightweight Python-base...
research
11/05/2021

DeSkew-LSH based Code-to-Code Recommendation Engine

Machine learning on source code (MLOnCode) is a popular research field t...
research
04/06/2022

Code Search: A Survey of Techniques for Finding Code

The immense amounts of source code provide ample challenges and opportun...
research
09/05/2022

Compressing integer lists with Contextual Arithmetic Trits

Inverted indexes allow to query large databases without needing to searc...
research
01/24/2022

Generating Clarifying Questions for Query Refinement in Source Code Search

In source code search, a common information-seeking strategy involves pr...

Please sign up or login with your details

Forgot password? Click here to reset