Extraction of Salient Sentences from Labelled Documents

12/21/2014
by Misha Denil, et al.

We present a hierarchical convolutional document model with an architecture designed to support introspection of the document structure. Using this model, we show how to use visualisation techniques from the computer vision literature to identify and extract topic-relevant sentences. We also introduce a new scalable evaluation technique for automatic sentence extraction systems that avoids the need for time-consuming human annotation of validation data.

research
12/21/2021

Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents

Human-performed annotation of sentences in legal documents is an importa...
research
04/25/2018

Hierarchical RNN for Information Extraction from Lawsuit Documents

Every lawsuit document contains the information about the party's claim,...
research
03/14/2023

Automatic summarisation of Instagram social network posts Combining semantic and statistical approaches

The proliferation of data and text documents such as articles, web pages...
research
03/07/2015

An Improved Image Mosaicing Algorithm for Damaged Documents

It is a common phenomenon in day to day life; where in some of the docum...
research
06/12/2021

A Sentence-level Hierarchical BERT Model for Document Classification with Limited Labelled Data

Training deep learning models with limited labelled data is an attractiv...
research
02/13/2018

Attention based Sentence Extraction from Scientific Articles using Pseudo-Labeled data

In this work, we present a weakly supervised sentence extraction techniq...
research
02/13/2019

SECTOR: A Neural Model for Coherent Topic Segmentation and Classification

When searching for information, a human reader first glances over a docu...
