Gradient-Boosted Based Structured and Unstructured Learning

02/28/2023
by Andrea Treviño Gavito, et al.

We propose two frameworks for problem settings in which both structured and unstructured data are available. Structured data problems are best solved by traditional machine learning models such as boosting and tree-based algorithms, whereas deep learning has been widely applied to problems involving images, text, audio, and other unstructured data sources. When both structured and unstructured data are accessible, however, it is not obvious which modeling approach best exploits both data sources simultaneously. Our proposed frameworks allow joint learning on both kinds of data by integrating the paradigms of boosting models and deep neural networks. The first framework, the boosted-feature-vector deep learning network, learns features from the structured data using gradient boosting and combines them with embeddings from unstructured data via a two-branch deep neural network. The second framework, two-weak-learner boosting, extends the boosting paradigm to the setting with two input data sources. We present and compare first- and second-order methods of this framework. Our experimental results on both public and real-world datasets show performance gains achieved by the frameworks over selected baselines by magnitudes of 0.1
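
As a rough illustration of what boosting over two input sources could look like, the following Python sketch fits two weak learners per round, one on the structured features and one on the unstructured embeddings, each to the current residuals (the negative gradient of squared loss, i.e. a first-order update). Everything here is an assumption made for illustration and is not taken from the paper: the squared loss, the decision stump and ridge regressor as weak learners, the sequential update order, the function names (`fit_stump`, `fit_ridge`, `boost_two_sources`), and the synthetic data, where `Z` merely stands in for embeddings produced by some neural encoder.

```python
import numpy as np

def fit_stump(X, r):
    """Hypothetical weak learner 1: best single-split regression stump on residuals r."""
    best = None
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j])[:-1]:  # drop the max so both sides are non-empty
            left = X[:, j] <= t
            pl, pr = r[left].mean(), r[~left].mean()
            sse = ((r[left] - pl) ** 2).sum() + ((r[~left] - pr) ** 2).sum()
            if best is None or sse < best[0]:
                best = (sse, j, t, pl, pr)
    if best is None:  # every feature is constant: fall back to the mean residual
        m = r.mean()
        return lambda Xn: np.full(len(Xn), m)
    _, j, t, pl, pr = best
    return lambda Xn: np.where(Xn[:, j] <= t, pl, pr)

def fit_ridge(Z, r, lam=1.0):
    """Hypothetical weak learner 2: ridge regression on embeddings, fit to residuals r."""
    Zb = np.hstack([Z, np.ones((len(Z), 1))])  # append a bias column
    w = np.linalg.solve(Zb.T @ Zb + lam * np.eye(Zb.shape[1]), Zb.T @ r)
    return lambda Zn: np.hstack([Zn, np.ones((len(Zn), 1))]) @ w

def boost_two_sources(X, Z, y, rounds=20, lr=0.3):
    """First-order boosting with one weak learner per data source in each round."""
    f0 = y.mean()
    pred = np.full(len(y), f0)
    learners = []
    for _ in range(rounds):
        h = fit_stump(X, y - pred)   # residual = negative gradient of 0.5 * (y - pred)^2
        pred = pred + lr * h(X)
        g = fit_ridge(Z, y - pred)   # second learner sees the updated residual
        pred = pred + lr * g(Z)
        learners.append((h, g))

    def predict(Xn, Zn):
        out = np.full(len(Xn), f0)
        for h, g in learners:
            out = out + lr * (h(Xn) + g(Zn))
        return out

    return predict

# Synthetic data: X plays the structured table, Z plays precomputed embeddings.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
Z = rng.normal(size=(200, 8))
y = np.where(X[:, 0] > 0, 2.0, -1.0) + Z @ rng.normal(size=8) + 0.1 * rng.normal(size=200)

predict = boost_two_sources(X, Z, y)
mse_boost = float(np.mean((y - predict(X, Z)) ** 2))
mse_base = float(np.mean((y - y.mean()) ** 2))
print(f"training MSE: boosted={mse_boost:.4f}, constant baseline={mse_base:.4f}")
```

A second-order variant would, under the same assumptions, fit each weak learner to gradients scaled by the loss curvature rather than to raw residuals; for squared loss the two coincide, so the difference only shows up with losses such as logistic loss.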
