Climbing the WOL: Training for Cheaper Inference

07/02/2020
by   Zichang Liu, et al.
0

Efficient inference for wide output layers (WOLs) is an essential yet challenging task in large scale machine learning. Most approaches reduce this problem to approximate maximum inner product search (MIPS), which relies heavily on the observation that for a given model, ground truth labels correspond to logits of highest value during full model inference. However, such an assumption is restrictive in practice. In this paper, we argue that approximate MIPS subroutines, despite having sub-linear computation time, are sub-optimal because they are tailored for retrieving large inner products with high recall instead of retrieving the correct labels. With WOL, the labels often have moderate inner products, which makes approximate MIPS more challenging. We propose an alternative problem formulation, called Label Superior Sampling (LSS), where the objective is to tailor the system to ensure retrieval of the correct label. Accordingly, we propose a novel learned hash approach, which is significantly more efficient and sufficient for high inference accuracy than MIPS baselines. Our extensive evaluation indicates that LSS can match or even outperform full inference accuracy with around 5x speed up and 87

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2020

Sublinear Maximum Inner Product Search using Concomitants of Extreme Order Statistics

We propose a novel dimensionality reduction method for maximum inner pro...
research
01/23/2022

Reinforcement Routing on Proximity Graph for Efficient Recommendation

We focus on Maximum Inner Product Search (MIPS), which is an essential p...
research
08/23/2019

Revisiting Wedge Sampling for Budgeted Maximum Inner Product Search

Top-k maximum inner product search (MIPS) is a central task in many mach...
research
09/24/2018

Norm-Ranging LSH for Maximum Inner Product Search

Neyshabur and Srebro proposed Simple-LSH, which is the state-of-the-art ...
research
03/08/2020

Pseudo Labeling and Negative Feedback Learning for Large-scale Multi-label Domain Classification

In large-scale domain classification, an utterance can be handled by mul...
research
07/12/2020

Deep Retrieval: An End-to-End Learnable Structure Model for Large-Scale Recommendations

One of the core problems in large-scale recommendations is to retrieve t...
research
01/02/2019

Plugin Networks for Inference under Partial Evidence

In this paper, we propose a novel method to incorporate partial evidence...

Please sign up or login with your details

Forgot password? Click here to reset