An Algebraic Approach for High-level Text Analytics

05/03/2020
by   Xiuwen Zheng, et al.
0

Text analytical tasks like word embedding, phrase mining, and topic modeling, are placing increasing demands as well as challenges to existing database management systems. In this paper, we provide a novel algebraic approach based on associative arrays. Our data model and algebra can bring together relational operators and text operators, which enables interesting optimization opportunities for hybrid data sources that have both relational and textual data. We demonstrate its expressive power in text analytics using several real-world tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/28/2018

Answering Analytical Queries on Text Data with Temporal Term Histograms

Temporal text, i.e., time-stamped text data are found abundantly in a va...
research
05/18/2023

Modal Algebra of Multirelations

We formalise the modal operators from the concurrent dynamic logics of P...
research
03/23/2021

HADAD: A Lightweight Approach for Optimizing Hybrid Complex Analytics Queries (Extended Version)

Hybrid complex analytics workloads typically include (i) data management...
research
05/22/2023

An Optimized Tri-store System for Multi-model Data Analytics

Data science applications increasingly rely on heterogeneous data source...
research
09/20/2020

TADOC: Text Analytics Directly on Compression

This article provides a comprehensive description of Text Analytics Dire...
research
03/13/2018

IDEL: In-Database Entity Linking with Neural Embeddings

We present a novel architecture, In-Database Entity Linking (IDEL), in w...
research
11/28/2019

RETRO: Relation Retrofitting For In-Database Machine Learning on Textual Data

There are massive amounts of textual data residing in databases, valuabl...

Please sign up or login with your details

Forgot password? Click here to reset