Answering Analytical Queries on Text Data with Temporal Term Histograms

09/28/2018
by   Kai Lin, et al.
0

Temporal text, i.e., time-stamped text data are found abundantly in a variety of data sources like newspapers, blogs and social media posts. While today's data management systems provide facilities for searching full-text data, they do not provide any simple primitives for performing analytical operations with text. This paper proposes the temporal term histograms (TTH) as an intermediate primitive that can be used for analytical tasks. We propose an algebra, with operators and equivalence rules for TTH and present a reference implementation on a relational database system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2020

An Algebraic Approach for High-level Text Analytics

Text analytical tasks like word embedding, phrase mining, and topic mode...
research
09/03/2018

Typed Linear Algebra for Efficient Analytical Querying

This paper uses typed linear algebra (LA) to represent data and perform ...
research
04/20/2020

Taming the Expressiveness and Programmability of Graph Analytical Queries

Graph database has enjoyed a boom in the last decade, and graph queries ...
research
12/14/2022

Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics

As modern data pipelines continue to collect, produce, and store a varie...
research
07/25/2017

Integrating Lexical and Temporal Signals in Neural Ranking Models for Searching Social Media Streams

Time is an important relevance signal when searching streams of social m...
research
05/22/2018

MonetDBLite: An Embedded Analytical Database

While traditional RDBMSes offer a lot of advantages, they require signif...

Please sign up or login with your details

Forgot password? Click here to reset