Expert-guided Regularization via Distance Metric Learning

12/09/2019
by   Shouvik Mani, et al.
0

High-dimensional prediction is a challenging problem setting for traditional statistical models. Although regularization improves model performance in high dimensions, it does not sufficiently leverage knowledge on feature importances held by domain experts. As an alternative to standard regularization techniques, we propose Distance Metric Learning Regularization (DMLreg), an approach for eliciting prior knowledge from domain experts and integrating that knowledge into a regularized linear model. First, we learn a Mahalanobis distance metric between observations from pairwise similarity comparisons provided by an expert. Then, we use the learned distance metric to place prior distributions on coefficients in a linear model. Through experimental results on a simulated high-dimensional prediction problem, we show that DMLreg leads to improvements in model performance when the domain expert is knowledgeable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2021

Exploring dual information in distance metric learning for clustering

Distance metric learning algorithms aim to appropriately measure similar...
research
12/14/2018

A Tutorial on Distance Metric Learning: Mathematical Foundations, Algorithms and Software

This paper describes the discipline of distance metric learning, a branc...
research
03/11/2016

Nonstationary Distance Metric Learning

Recent work in distance metric learning has focused on learning transfor...
research
12/07/2016

Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Providing accurate predictions is challenging for machine learning algor...
research
12/10/2016

Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Prediction in a small-sized sample with a large number of covariates, th...
research
02/26/2019

Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets

Learning predictive models from small high-dimensional data sets is a ke...
research
04/14/2020

Knowledge Elicitation using Deep Metric Learning and Psychometric Testing

Knowledge present in a domain is well expressed as relationships between...

Please sign up or login with your details

Forgot password? Click here to reset