Knowledge-Based Learning through Feature Generation

06/06/2020
by   Michal Badian, et al.
0

Machine learning algorithms have difficulties to generalize over a small set of examples. Humans can perform such a task by exploiting vast amount of background knowledge they possess. One method for enhancing learning algorithms with external knowledge is through feature generation. In this paper, we introduce a new algorithm for generating features based on a collection of auxiliary datasets. We assume that, in addition to the training set, we have access to additional datasets. Unlike the transfer learning setup, we do not assume that the auxiliary datasets represent learning tasks that are similar to our original one. The algorithm finds features that are common to the training set and the auxiliary datasets. Based on these features and examples from the auxiliary datasets, it induces predictors for new features from the auxiliary datasets. The induced predictors are then added to the original training set as generated features. Our method was tested on a variety of learning tasks, including text classification and medical prediction, and showed a significant improvement over using just the given features.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/31/2018

Recursive Feature Generation for Knowledge-based Learning

When humans perform inductive learning, they often enhance the process w...
research
06/05/2017

ToPs: Ensemble Learning with Trees of Predictors

We present a new approach to ensemble learning. Our approach constructs ...
research
10/25/2022

Auxiliary task discovery through generate-and-test

In this paper, we explore an approach to auxiliary task discovery in rei...
research
03/07/2022

Trajectory Test-Train Overlap in Next-Location Prediction Datasets

Next-location prediction, consisting of forecasting a user's location gi...
research
05/27/2019

Dataset2Vec: Learning Dataset Meta-Features

Machine learning tasks such as optimizing the hyper-parameters of a mode...
research
04/14/2018

ClassiNet -- Predicting Missing Features for Short-Text Classification

The fundamental problem in short-text classification is feature sparsene...
research
06/07/2023

Enabling tabular deep learning when d ≫ n with an auxiliary knowledge graph

Machine learning models exhibit strong performance on datasets with abun...

Please sign up or login with your details

Forgot password? Click here to reset