Separate and Attend in Personal Email Search

11/21/2019
by Yu Meng, et al.

In personal email search, user queries often impose different requirements on different aspects of the retrieved emails. For example, the query "my recent flight to the US" requires emails to be ranked by both their textual content and their recency, while queries such as "medical history" impose no constraint on recency. Recent deep learning-to-rank models for personal email search often directly concatenate dense numerical features (e.g., document age) with embedded sparse features (e.g., n-gram embeddings). In this paper, we first show, through a set of experiments on synthetic datasets, that directly concatenating dense and sparse features leads to suboptimal search performance in deep neural ranking models. To incorporate both sparse and dense email features effectively into personal email search ranking, we propose a novel neural model, SepAttn. SepAttn first builds two separate neural models that learn from sparse and dense features respectively, and then applies an attention mechanism at the prediction level to derive the final prediction from these two models. We conduct a comprehensive set of experiments on a large-scale email search dataset and demonstrate that SepAttn consistently improves search quality over the baseline models.
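
The separate-then-attend idea can be illustrated with a minimal Python sketch. This is not the paper's implementation: the abstract does not specify how the attention weights are computed, so the sketch assumes that each sub-model emits a relevance score together with an attention logit, and that a softmax over the two logits weights the two scores. All function names, layer sizes, and feature values below are hypothetical.

```python
import math
import random


def init_mlp(sizes, rng):
    """Randomly initialise a small feed-forward net as a list of (W, b) layers."""
    return [
        ([[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)],
         [0.0] * n_out)
        for n_in, n_out in zip(sizes[:-1], sizes[1:])
    ]


def mlp_forward(x, layers):
    """Apply ReLU hidden layers followed by a linear output layer."""
    h = x
    for i, (W, b) in enumerate(layers):
        h = [sum(w * v for w, v in zip(row, h)) + bias
             for row, bias in zip(W, b)]
        if i < len(layers) - 1:
            h = [max(0.0, v) for v in h]
    return h


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sep_attn_score(sparse_emb, dense_feats, sparse_net, dense_net):
    """Combine two separately computed predictions with attention.

    Assumption (not from the paper): each net outputs [score, attention_logit].
    """
    s_score, s_logit = mlp_forward(sparse_emb, sparse_net)
    d_score, d_logit = mlp_forward(dense_feats, dense_net)
    a_sparse, a_dense = softmax([s_logit, d_logit])
    final = a_sparse * s_score + a_dense * d_score
    return final, (a_sparse, a_dense)


rng = random.Random(0)
sparse_net = init_mlp([8, 4, 2], rng)   # input: e.g. an 8-dim n-gram embedding
dense_net = init_mlp([3, 4, 2], rng)    # input: e.g. document age and 2 other dense features
score, attn = sep_attn_score([0.1] * 8, [0.5, 0.2, 0.9], sparse_net, dense_net)
print(score, attn)
```

Because the two feature types never share a hidden layer, the combination happens only at the prediction level, in contrast to the direct-concatenation baseline in which dense and sparse inputs are fed into a single network.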
