Diverse Multimedia Layout Generation with Multi Choice Learning

01/16/2023
by   David D. Nguyen, et al.
0

Designing visually appealing layouts for multimedia documents containing text, graphs and images requires a form of creative intelligence. Modelling the generation of layouts has recently gained attention due to its importance in aesthetics and communication style. In contrast to standard prediction tasks, there are a range of acceptable layouts which depend on user preferences. For example, a poster designer may prefer logos on the top-left while another prefers logos on the bottom-right. Both are correct choices yet existing machine learning models treat layouts as a single choice prediction problem. In such situations, these models would simply average over all possible choices given the same input forming a degenerate sample. In the above example, this would form an unacceptable layout with a logo in the centre. In this paper, we present an auto-regressive neural network architecture, called LayoutMCL, that uses multi-choice prediction and winner-takes-all loss to effectively stabilise layout generation. LayoutMCL avoids the averaging problem by using multiple predictors to learn a range of possible options for each layout object. This enables LayoutMCL to generate multiple and diverse layouts from a single input which is in contrast with existing approaches which yield similar layouts with minor variations. Through quantitative benchmarks on real data (magazine, document and mobile app layouts), we demonstrate that LayoutMCL reduces Fréchet Inception Distance (FID) by 83-98 diversity in comparison to existing approaches.

READ FULL TEXT

page 1

page 8

research
07/06/2021

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

Despite significant progress on current state-of-the-art image generatio...
research
08/15/2023

Enhancing Visually-Rich Document Understanding via Layout Structure Modeling

In recent years, the use of multi-modal pre-trained Transformers has led...
research
03/16/2021

RackLay: Multi-Layer Layout Estimation for Warehouse Racks

Given a monocular colour image of a warehouse rack, we aim to predict th...
research
08/02/2021

Constrained Graphic Layout Generation via Latent Optimization

It is common in graphic design humans visually arrange various elements ...
research
01/11/2021

Learning to Automate Chart Layout Configurations Using Crowdsourced Paired Comparison

We contribute a method to automate parameter configurations for chart la...
research
08/24/2020

DiverseNet: When One Right Answer is not Enough

Many structured prediction tasks in machine vision have a collection of ...
research
04/30/2022

LayoutBERT: Masked Language Layout Model for Object Insertion

Image compositing is one of the most fundamental steps in creative workf...

Please sign up or login with your details

Forgot password? Click here to reset