Evaluating Style Transfer for Text

04/04/2019
by   Remi Mir, et al.
0

Research in the area of style transfer for text is currently bottlenecked by a lack of standard evaluation practices. This paper aims to alleviate this issue by experimentally identifying best practices with a Yelp sentiment dataset. We specify three aspects of interest (style transfer intensity, content preservation, and naturalness) and show how to obtain more reliable measures of them from human evaluation than in previous work. We propose a set of metrics for automated evaluation and demonstrate that they are more strongly correlated and in agreement with human judgment: direction-corrected Earth Mover's Distance, Word Mover's Distance on style-masked texts, and adversarial classification for the respective aspects. We also show that the three examined models exhibit tradeoffs between aspects of interest, demonstrating the importance of evaluating style transfer models at specific points of their tradeoff plots. We release software with our evaluation metrics to facilitate research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2021

A Review of Human Evaluation for Style Transfer

This paper reviews and summarizes human evaluation practices described i...
research
10/20/2021

Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

While the field of style transfer (ST) has been growing rapidly, it has ...
research
08/13/2018

What is wrong with style transfer for texts?

A number of recent machine learning papers work with an automated style ...
research
06/01/2023

A Call for Standardization and Validation of Text Style Transfer Evaluation

Text Style Transfer (TST) evaluation is, in practice, inconsistent. Ther...
research
08/25/2023

Text Style Transfer Evaluation Using Large Language Models

Text Style Transfer (TST) is challenging to evaluate because the quality...
research
04/10/2018

Sentiment Transfer using Seq2Seq Adversarial Autoencoders

Expressing in language is subjective. Everyone has a different style of ...
research
08/19/2019

Style Transfer for Texts: to Err is Human, but Error Margins Matter

This paper shows that standard assessment methodology for style transfer...

Please sign up or login with your details

Forgot password? Click here to reset