StyleDiff: Attribute Comparison Between Unlabeled Datasets in Latent Disentangled Space

03/09/2023
by   Keisuke Kawano, et al.
0

One major challenge in machine learning applications is coping with mismatches between the datasets used in the development and those obtained in real-world applications. These mismatches may lead to inaccurate predictions and errors, resulting in poor product quality and unreliable systems. In this study, we propose StyleDiff to inform developers of the differences between the two datasets for the steady development of machine learning systems. Using disentangled image spaces obtained from recently proposed generative models, StyleDiff compares the two datasets by focusing on attributes in the images and provides an easy-to-understand analysis of the differences between the datasets. The proposed StyleDiff performs in O (d Nlog N), where N is the size of the datasets and d is the number of attributes, enabling the application to large datasets. We demonstrate that StyleDiff accurately detects differences between datasets and presents them in an understandable format using, for example, driving scenes datasets.

READ FULL TEXT

page 9

page 11

page 15

page 16

page 17

page 18

page 19

page 20

research
03/24/2023

An investigation of licensing of datasets for machine learning based on the GQM model

Dataset licensing is currently an issue in the development of machine le...
research
11/25/2020

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

We explore and analyze the latent style space of StyleGAN2, a state-of-t...
research
05/13/2020

Understanding the Nature of System-Related Issues in Machine Learning Frameworks: An Exploratory Study

Modern systems are built using development frameworks. These frameworks ...
research
08/14/2021

Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions

Disentangled visual representations have largely been studied with gener...
research
04/11/2020

Attribute-based Regularization of VAE Latent Spaces

Selective manipulation of data attributes using deep generative models i...
research
05/30/2023

DualVAE: Controlling Colours of Generated and Real Images

Colour controlled image generation and manipulation are of interest to a...

Please sign up or login with your details

Forgot password? Click here to reset