In-Machine-Learning Database: Reimagining Deep Learning with Old-School SQL

04/11/2020
by   Len Du, et al.
0

In-database machine learning has been very popular, almost being a cliche. However, can we do it the other way around? In this work, we say "yes" by applying plain old SQL to deep learning, in a sense implementing deep learning algorithms with SQL. Most deep learning frameworks, as well as generic machine learning ones, share a de facto standard of multidimensional array operations, underneath fancier infrastructure such as automatic differentiation. As SQL tables can be regarded as generalisations of (multi-dimensional) arrays, we have found a way to express common deep learning operations in SQL, encouraging a different way of thinking and thus potentially novel models. In particular, one of the latest trend in deep learning was the introduction of sparsity in the name of graph convolutional networks, whereas we take sparsity almost for granted in the database world. As both databases and machine learning involve transformation of datasets, we hope this work can inspire further works utilizing the large body of existing wisdom, algorithms and technologies in the database field to advance the state of the art in machine learning, rather than merely integerating machine learning into databases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2021

Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python

Following an analysis of the advantages of SQL-based Machine Learning (M...
research
07/29/2019

sql4ml A declarative end-to-end workflow for machine learning

We present sql4ml, a system for expressing supervised machine learning (...
research
07/08/2022

Combining Deep Learning with Good Old-Fashioned Machine Learning

We present a comprehensive, stacking-based framework for combining deep ...
research
07/01/2019

Using Database Rule for Weak Supervised Text-to-SQL Generation

We present a simple and novel way to do the task of text-to-SQL problem ...
research
12/03/2017

Polystore Mathematics of Relational Algebra

Financial transactions, internet search, and data analysis are all placi...
research
01/19/2020

SQLFlow: A Bridge between SQL and Machine Learning

Industrial AI systems are mostly end-to-end machine learning (ML) workfl...
research
08/16/2020

Benchmarking database performance for genomic data

Genomic regions represent features such as gene annotations, transcripti...

Please sign up or login with your details

Forgot password? Click here to reset