Towards Personalized and Human-in-the-Loop Document Summarization

08/21/2021
by Samira Ghodratnama, et al.

The ubiquitous availability of computing devices and the widespread use of the internet continuously generate large amounts of data. As a result, the information available on any given topic far exceeds what humans can process, a problem known as information overload. To cope efficiently with large amounts of information and generate content of significant value to users, we need to identify, merge and summarise information. Data summaries gather related information into a shorter format that enables answering complicated questions, gaining new insight and discovering conceptual boundaries. This thesis focuses on three main challenges to alleviate information overload using novel summarisation techniques. It further intends to facilitate the analysis of documents to support personalised information extraction. This thesis separates the research issues into four areas, covering (i) feature engineering in document summarisation, (ii) traditional static and inflexible summaries, (iii) traditional generic summarisation approaches, and (iv) the need for reference summaries. We propose novel approaches to tackle these challenges by (i) enabling automatic intelligent feature engineering, (ii) enabling flexible and interactive summarisation, and (iii) utilising intelligent and personalised summarisation approaches. The experimental results demonstrate the efficiency of the proposed approaches compared to other state-of-the-art models. We further propose solutions to the information overload problem in different domains through summarisation, covering network traffic data, health data and business process data.

research
05/31/2021

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Faceted summarization provides briefings of a document from different pe...
research
12/24/2020

Adaptive Summaries: A Personalized Concept-based Summarization Approach by Learning from Users' Feedback

Exploring the tremendous amount of data efficiently to make a decision, ...
research
05/18/2020

Question-Driven Summarization of Answers to Consumer Health Questions

Automatic summarization of natural language is a widely studied area in ...
research
03/31/2023

ConceptEVA: Concept-Based Interactive Exploration and Customization of Document Summaries

With the most advanced natural language processing and artificial intell...
research
07/20/2018

Abstractive and Extractive Text Summarization using Document Context Vector and Recurrent Neural Networks

Sequence to sequence (Seq2Seq) learning has recently been used for abstr...
research
07/09/2023

A Personalized Reinforcement Learning Summarization Service for Learning Structure from Unstructured Data

The exponential growth of textual data has created a crucial need for to...
research
04/28/2020

Human-Like Summaries from Heterogeneous and Time-Windowed Software Development Artefacts

Automatic text summarisation has drawn considerable interest in the area...
