Evaluating How Developers Use General-Purpose Web-Search for Code Retrieval

03/22/2018
by   Md Masudur Rahman, et al.
0

Search is an integral part of a software development process. Developers often use search engines to look for information during development, including reusable code snippets, API understanding, and reference examples. Developers tend to prefer general-purpose search engines like Google, which are often not optimized for code related documents and use search strategies and ranking techniques that are more optimized for generic, non-code related information. In this paper, we explore whether a general purpose search engine like Google is an optimal choice for code-related searches. In particular, we investigate whether the performance of searching with Google varies for code vs. non-code related searches. To analyze this, we collect search logs from 310 developers that contains nearly 150,000 search queries from Google and the associated result clicks. To differentiate between code-related searches and non-code related searches, we build a model which identifies the code intent of queries. Leveraging this model, we build an automatic classifier that detects a code and non-code related query. We confirm the effectiveness of the classifier on manually annotated queries where the classifier achieves a precision of 87 recall of 86 automatically annotate all the queries in the dataset. Analyzing this dataset, we observe that code related searching often requires more effort (e.g., time, result clicks, and query modifications) than general non-code search, which indicates code search performance with a general search engine is less effective.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2021

Readability and Understandability of Snippets Recommended by General-purpose Web Search Engines: a Comparative Study

Developers often search for reusable code snippets on general-purpose we...
research
05/30/2020

An Empirical Study of Software Exceptions in the Field using Search Logs

Software engineers spend a substantial amount of time using Web search t...
research
09/28/2022

StacerBot: A Stacktrace Search Engine for Stack Overflow

We as software developers or researchers very often get stacktrace error...
research
09/03/2020

CoNCRA: A Convolutional Neural Network Code Retrieval Approach

Software developers routinely search for code using general-purpose sear...
research
01/24/2022

Generating Clarifying Questions for Query Refinement in Source Code Search

In source code search, a common information-seeking strategy involves pr...
research
04/05/2022

PSDoodle: Fast App Screen Search via Partial Screen Doodle

Searching through existing repositories for a specific mobile app screen...
research
02/19/2018

Seasonal Web Search Query Selection for Influenza-Like Illness (ILI) Estimation

Influenza-like illness (ILI) estimation from web search data is an impor...

Please sign up or login with your details

Forgot password? Click here to reset