DeepAI AI Chat
Log In Sign Up

Deploying Deep Ranking Models for Search Verticals

by   Rohan Ramanath, et al.

In this paper, we present an architecture executing a complex machine learning model such as a neural network capturing semantic similarity between a query and a document; and deploy to a real-world production system serving 500M+users. We present the challenges that arise in a real-world system and how we solve them. We demonstrate that our architecture provides competitive modeling capability without any significant performance impact to the system in terms of latency. Our modular solution and insights can be used by other real-world search systems to realize and productionize recent gains in neural networks.


page 1

page 2

page 3

page 4


Optimizing Prediction Serving on Low-Latency Serverless Dataflow

Prediction serving systems are designed to provide large volumes of low-...

Efficient Incorporation of Multiple Latency Targets in the Once-For-All Network

Neural Architecture Search has proven an effective method of automating ...

Challenges in Deploying Machine Learning: a Survey of Case Studies

In recent years, machine learning has received increased interest both a...

DeText: A Deep Text Ranking Framework with BERT

Ranking is the most important component in a search system. Mostsearch s...

Applying Deep Learning To Airbnb Search

The application to search ranking is one of the biggest machine learning...

Is a Modular Architecture Enough?

Inspired from human cognition, machine learning systems are gradually re...

Extracting Hierarchies of Search Tasks & Subtasks via a Bayesian Nonparametric Approach

A significant amount of search queries originate from some real world in...