Cognitive Database: A Step towards Endowing Relational Databases with Artificial Intelligence Capabilities

by   Rajesh Bordawekar, et al.

We propose Cognitive Databases, an approach for transparently enabling Artificial Intelligence (AI) capabilities in relational databases. A novel aspect of our design is to first view the structured data source as meaningful unstructured text, and then use the text to build an unsupervised neural network model using a Natural Language Processing (NLP) technique called word embedding. This model captures the hidden inter-/intra-column relationships between database tokens of different types. For each database token, the model includes a vector that encodes contextual semantic relationships. We seamlessly integrate the word embedding model into existing SQL query infrastructure and use it to enable a new class of SQL-based analytics queries called cognitive intelligence (CI) queries. CI queries use the model vectors to enable complex queries such as semantic matching, inductive reasoning queries such as analogies, predictive queries using entities not present in a database, and, more generally, using knowledge from external sources. We demonstrate unique capabilities of Cognitive Databases using an Apache Spark based prototype to execute inductive reasoning CI queries over a multi-modal database containing text and images. We believe our first-of-a-kind system exemplifies using AI functionality to endow relational databases with capabilities that were previously very hard to realize in practice.


Enabling Cognitive Intelligence Queries in Relational Databases using Low-dimensional Word Embeddings

We apply distributed language embedding methods from Natural Language Pr...

A Scalable Space-efficient In-database Interpretability Framework for Embedding-based Semantic SQL Queries

AI-Powered database (AI-DB) is a novel relational database system that u...

The Tensor Data Platform: Towards an AI-centric Database System

Database engines have historically absorbed many of the innovations in d...

DrugDBEmbed : Semantic Queries on Relational Database using Supervised Column Encodings

Traditional relational databases contain a lot of latent semantic inform...

Query Processing on Tensor Computation Runtimes

The huge demand for computation in artificial intelligence (AI) is drivi...

Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables

In this paper, we propose Multi-Modal Databases (MMDBs), which is a new ...

COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics

Data analysis often involves comparing subsets of data across many dimen...

Please sign up or login with your details

Forgot password? Click here to reset