Towards Semantic Query Segmentation

07/25/2017
by   Ajinkya Kale, et al.
0

Query Segmentation is one of the critical components for understanding users' search intent in Information Retrieval tasks. It involves grouping tokens in the search query into meaningful phrases which help downstream tasks like search relevance and query understanding. In this paper, we propose a novel approach to segment user queries using distributed query embeddings. Our key contribution is a supervised approach to the segmentation task using low-dimensional feature vectors for queries, getting rid of traditional hand tuned and heuristic NLP features which are quite expensive. We benchmark on a 50,000 human-annotated web search engine query corpus achieving comparable accuracy to state-of-the-art techniques. The advantage of our technique is its fast and does not use external knowledge-base like Wikipedia for score boosting. This helps us generalize our approach to other domains like eCommerce without any fine-tuning. We demonstrate the effectiveness of this method on another 50,000 human-annotated eCommerce query corpus from eBay search logs. Our approach is easy to implement and generalizes well across different search domains proving the power of low-dimensional embeddings in query segmentation task, opening up a new direction of research for this problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2020

Query Understanding via Intent Description Generation

Query understanding is a fundamental problem in information retrieval (I...
research
04/12/2012

Learning to Rank Query Recommendations by Semantic Similarities

Logs of the interactions with a search engine show that users often refo...
research
07/04/2022

Still Haven't Found What You're Looking For – Detecting the Intent of Web Search Missions from User Interaction Features

Web search is among the most frequent online activities. Whereas traditi...
research
03/12/2018

Gradient Augmented Information Retrieval with Autoencoders and Semantic Hashing

This paper will explore the use of autoencoders for semantic hashing in ...
research
06/04/2020

Syntactic Search by Example

We present a system that allows a user to search a large linguistically ...
research
08/03/2016

Query Clustering using Segment Specific Context Embeddings

This paper presents a novel query clustering approach to capture the bro...
research
05/21/2019

A User-Centered Concept Mining System for Query and Document Understanding at Tencent

Concepts embody the knowledge of the world and facilitate the cognitive ...

Please sign up or login with your details

Forgot password? Click here to reset