Diversity in Machine Learning

07/04/2018
by   Zhiqiang Gong, et al.
0

Machine learning methods have achieved good performance and been widely applied in various real-world applications. It can learn the model adaptively and be better fit for special requirements of different tasks. Many factors can affect the performance of the machine learning process, among which diversity of the machine learning is an important one. Generally, a good machine learning system is composed of plentiful training data, a good model training process, and an accurate inference. The diversity could help each procedure to guarantee a total good machine learning: diversity of the training data ensures the data contain enough discriminative information, diversity of the learned model (diversity in parameters of each model or diversity in models) makes each parameter/model capture unique or complement information and the diversity in inference can provide multiple choices each of which corresponds to a plausible result. However, there is no systematical analysis of the diversification in machine learning system. In this paper, we systematically summarize the methods to make data diversification, model diversification, and inference diversification in machine learning process, respectively. In addition, the typical applications where the diversity technology improved the machine learning performances have been surveyed, including the remote sensing imaging tasks, machine translation, camera relocalization, image segmentation, object detection, topic modeling, and others. Finally, we discuss some challenges of diversity technology in machine learning and point out some directions in future work. Our analysis provides a deeper understanding of the diversity technology in machine learning tasks, and hence can help design and learn more effective models for specific tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2021

Representation Matters: Assessing the Importance of Subgroup Allocations in Training Data

Collecting more diverse and representative training data is often touted...
research
10/26/2019

Deep Learning for Hyperspectral Image Classification: An Overview

Hyperspectral image (HSI) classification has become a hot topic in the f...
research
08/14/2019

Trustable and Automated Machine Learning Running with Blockchain and Its Applications

Machine learning algorithms learn from data and use data from databases ...
research
10/14/2014

Detection of cheating by decimation algorithm

We expand the item response theory to study the case of "cheating studen...
research
04/27/2020

The Dark Side of Unikernels for Machine Learning

This paper analyzes the shortcomings of unikernels as a method of deploy...
research
03/24/2023

Optimizing the Procedure of CT Segmentation Labeling

In Computed Tomography, machine learning is often used for automated dat...
research
05/12/2023

Comparison of machine learning models applied on anonymized data with different techniques

Anonymization techniques based on obfuscating the quasi-identifiers by m...

Please sign up or login with your details

Forgot password? Click here to reset