XFBoost: Improving Text Generation with Controllable Decoders

02/16/2022
by   Xiangyu Peng, et al.
5

Multimodal conditionality in transformer-based natural language models has demonstrated state-of-the-art performance in the task of product description generation. Recent approaches condition a language model on one or more images and other textual metadata to achieve near-human performance for describing products from e-commerce stores. However, generated descriptions may exhibit degrees of inaccuracy or even contradictory claims relative to the inputs of a given product. In this paper, we propose a controllable language generation framework called Extract-Finetune-Boost (XFBoost), which addresses the problem of inaccurate low-quality inference. By using visual semantic attributes as constraints at the decoding stage of the generation process and finetuning the language model with policy gradient techniques, the XFBoost framework is found to produce significantly more descriptive text with higher image relevancy, outperforming baselines and lowering the frequency of factually inaccurate descriptions. We further demonstrate the application of XFBoost to online learning wherein human-in-the-loop critics improve language models with active feedback.

READ FULL TEXT

page 6

page 7

research
09/02/2021

Multimodal Conditionality for Natural Language Generation

Large scale pretrained language models have demonstrated state-of-the-ar...
research
06/21/2022

Automatic Controllable Product Copywriting for E-Commerce

Automatic product description generation for e-commerce has witnessed si...
research
12/20/2022

Controllable Text Generation with Language Constraints

We consider the task of text generation in language models with constrai...
research
05/21/2022

Few-Shot Natural Language Inference Generation with PDD: Prompt and Dynamic Demonstration

Natural Language Inference Generation task is to generate a text hypothe...
research
09/24/2022

Controllable Text Generation for Open-Domain Creativity and Fairness

Recent advances in large pre-trained language models have demonstrated s...
research
08/05/2020

6VecLM: Language Modeling in Vector Space for IPv6 Target Generation

Fast IPv6 scanning is challenging in the field of network measurement as...
research
09/19/2023

Toward Unified Controllable Text Generation via Regular Expression Instruction

Controllable text generation is a fundamental aspect of natural language...

Please sign up or login with your details

Forgot password? Click here to reset