Post-Estimation Smoothing: A Simple Baseline for Learning with Side Information

03/12/2020
by   Esther Rolf, et al.
0

Observational data are often accompanied by natural structural indices, such as time stamps or geographic locations, which are meaningful to prediction tasks but are often discarded. We leverage semantically meaningful indexing data while ensuring robustness to potentially uninformative or misleading indices. We propose a post-estimation smoothing operator as a fast and effective method for incorporating structural index data into prediction. Because the smoothing step is separate from the original predictor, it applies to a broad class of machine learning tasks, with no need to retrain models. Our theoretical analysis details simple conditions under which post-estimation smoothing will improve accuracy over that of the original predictor. Our experiments on large scale spatial and temporal datasets highlight the speed and accuracy of post-estimation smoothing in practice. Together, these results illuminate a novel way to consider and incorporate the natural structure of index variables in machine learning.

READ FULL TEXT

page 7

page 10

page 11

page 20

page 24

research
05/26/2021

Blurs Make Results Clearer: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness

Bayesian neural networks (BNNs) have shown success in the areas of uncer...
research
07/14/2022

Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness

Neural network ensembles, such as Bayesian neural networks (BNNs), have ...
research
07/10/2018

Multi-D Kneser-Ney Smoothing Preserving the Original Marginal Distributions

Smoothing is an essential tool in many NLP tasks, therefore numerous tec...
research
02/19/2021

Center Smoothing for Certifiably Robust Vector-Valued Functions

Randomized smoothing has been successfully applied in high-dimensional i...
research
06/12/2020

Indexing Data on the Web: A Comparison of Schema-level Indices for Data Search – Extended Technical Report

Indexing the Web of Data offers many opportunities, in particular, to fi...
research
01/13/2015

Random Bits Regression: a Strong General Predictor for Big Data

To improve accuracy and speed of regressions and classifications, we pre...
research
05/28/2020

Composition Estimation via Shrinkage

In this note, we explore a simple approach to composition estimation, us...

Please sign up or login with your details

Forgot password? Click here to reset