From Unstructured to Structured: Transforming Chatbot Dialogues into Data Mart Schema for Visualization

05/07/2023
by Mark Edward M. Gonzales, et al.

Schools are among the primary avenues for public healthcare interventions. Because resource limitations make it difficult to conduct routine health and wellness checks in Philippine public schools, a chatbot-assisted health monitoring system may provide an alternative. However, deriving insights from raw conversations is not straightforward: natural language is expressive, so the same information can arrive in many different forms. In this paper, we present a process for transforming unstructured dialogues into a structured schema. The process comprises four stages: (i) processing the dialogues through entity extraction and data aggregation, (ii) storing them as NoSQL documents on the cloud, (iii) transforming them into a star schema for online analytical processing and building an extract-transform-load workflow, and (iv) creating a web-based dashboard for visualizing summarized data and reports. Performance evaluation of this dashboard showed that increasing the number of stored dialogues by a factor of 100,000 increased the loading time for displaying roll-up, drill-down, and filter results by only about one second.
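To make stage (iii) more concrete, the following is a minimal sketch of how document-style dialogue records might be flattened into a star schema and then rolled up along one dimension. It is not the authors' implementation: the field names (`school`, `date`, `symptoms`), the table names, and the helper functions are all hypothetical, and in-memory dictionaries stand in for both the NoSQL store and the data mart.

```python
from collections import Counter

# Hypothetical dialogue documents after entity extraction (assumed shape,
# not taken from the paper).
dialogues = [
    {"student_id": "s1", "school": "School A", "date": "2023-01-10",
     "symptoms": ["cough", "fever"]},
    {"student_id": "s2", "school": "School A", "date": "2023-01-10",
     "symptoms": ["cough"]},
    {"student_id": "s3", "school": "School B", "date": "2023-01-11",
     "symptoms": ["headache"]},
]


def surrogate_key(dimension, value):
    """Return the surrogate key for value, adding it to the dimension if new."""
    if value not in dimension:
        dimension[value] = len(dimension) + 1
    return dimension[value]


def to_star_schema(docs):
    """Build three dimension tables and one fact table from the documents."""
    dim_school, dim_date, dim_symptom = {}, {}, {}
    fact_reports = []
    for doc in docs:
        school_key = surrogate_key(dim_school, doc["school"])
        date_key = surrogate_key(dim_date, doc["date"])
        # One fact row per reported symptom, so counts can be summed.
        for symptom in doc["symptoms"]:
            fact_reports.append({
                "school_key": school_key,
                "date_key": date_key,
                "symptom_key": surrogate_key(dim_symptom, symptom),
                "report_count": 1,
            })
    return {
        "dim_school": dim_school,
        "dim_date": dim_date,
        "dim_symptom": dim_symptom,
        "fact_reports": fact_reports,
    }


def roll_up(star, dimension):
    """Sum report_count grouped by one dimension (a simple roll-up)."""
    # Invert the dimension table: surrogate key -> readable name.
    names = {key: value for value, key in star[f"dim_{dimension}"].items()}
    totals = Counter()
    for row in star["fact_reports"]:
        totals[names[row[f"{dimension}_key"]]] += row["report_count"]
    return dict(totals)


star = to_star_schema(dialogues)
by_school = roll_up(star, "school")
by_symptom = roll_up(star, "symptom")
```

Keeping one fact row per reported symptom is a design choice made for this sketch: it lets the same fact table answer roll-ups by school, date, or symptom with a single grouped sum.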

