Conjecturing-Based Computational Discovery of Patterns in Data

11/23/2020
by   J. P. Brooks, et al.
0

Modern machine learning methods are designed to exploit complex patterns in data regardless of their form, while not necessarily revealing them to the investigator. Here we demonstrate situations where modern machine learning methods are ill-equipped to reveal feature interaction effects and other nonlinear relationships. We propose the use of a conjecturing machine that generates feature relationships in the form of bounds for numerical features and boolean expressions for nominal features that are ignored by machine learning algorithms. The proposed framework is demonstrated for a classification problem with an interaction effect and a nonlinear regression problem. In both settings, true underlying relationships are revealed and generalization performance improves. The framework is then applied to patient-level data regarding COVID-19 outcomes to suggest possible risk factors.

READ FULL TEXT

page 8

page 10

research
11/23/2018

Nonlinear Regression without i.i.d. Assumption

In this paper, we consider a class of nonlinear regression problems with...
research
02/15/2022

REPID: Regional Effect Plots with implicit Interaction Detection

Machine learning models can automatically learn complex relationships, s...
research
08/17/2017

Extensions of Morse-Smale Regression with Application to Actuarial Science

The problem of subgroups is ubiquitous in scientific research (ex. disea...
research
12/15/2021

Solving the Data Sparsity Problem in Predicting the Success of the Startups with Machine Learning Methods

Predicting the success of startup companies is of great importance for b...
research
05/25/2020

Bayesian Stress Testing of Models in a Classification Hierarchy

Building a machine learning solution in real-life applications often inv...
research
05/25/2022

Forecasting Patient Demand at Urgent Care Clinics using Machine Learning

Urgent care clinics and emergency departments around the world periodica...
research
01/25/2022

A Machine Learning-based Characterization Framework for Parametric Representation of Nonlinear Sloshing

The growing interest in creating a parametric representation of liquid s...

Please sign up or login with your details

Forgot password? Click here to reset