Differentiable Retrieval Augmentation via Generative Language Modeling for E-commerce Query Intent Classification

08/18/2023
by   Chenyu Zhao, et al.
0

Retrieval augmentation, which enhances downstream models by a knowledge retriever and an external corpus instead of by merely increasing the number of model parameters, has been successfully applied to many natural language processing (NLP) tasks such as text classification, question answering and so on. However, existing methods that separately or asynchronously train the retriever and downstream model mainly due to the non-differentiability between the two parts, usually lead to degraded performance compared to end-to-end joint training. In this paper, we propose Differentiable Retrieval Augmentation via Generative lANguage modeling(Dragan), to address this problem by a novel differentiable reformulation. We demonstrate the effectiveness of our proposed method on a challenging NLP task in e-commerce search, namely query intent classification. Both the experimental results and ablation study show that the proposed method significantly and reasonably improves the state-of-the-art baselines on both offline evaluation and online A/B test.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2020

Text Data Augmentation: Towards better detection of spear-phishing emails

Text data augmentation, i.e. the creation of synthetic textual data from...
research
03/25/2021

Visual Grounding Strategies for Text-Only Natural Language Processing

Visual grounding is a promising path toward more robust and accurate Nat...
research
05/26/2021

Joint Optimization of Tokenization and Downstream Model

Since traditional tokenizers are isolated from a downstream task and mod...
research
08/12/2022

Pre-training Tasks for User Intent Detection and Embedding Retrieval in E-commerce Search

BERT-style models pre-trained on the general corpus (e.g., Wikipedia) an...
research
10/31/2019

A neural document language modeling framework for spoken document retrieval

Recent developments in deep learning have led to a significant innovatio...
research
12/19/2022

Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling

The quality of knowledge retrieval is crucial in knowledge-intensive con...
research
04/21/2023

Downstream Task-Oriented Neural Tokenizer Optimization with Vocabulary Restriction as Post Processing

This paper proposes a method to optimize tokenization for the performanc...

Please sign up or login with your details

Forgot password? Click here to reset