Inducing Distant Supervision in Suggestion Mining through Part-of-Speech Embeddings

09/21/2017
by   Sapna Negi, et al.
0

Mining suggestion expressing sentences from a given text is a less investigated sentence classification task, and therefore lacks hand labeled benchmark datasets. In this work, we propose and evaluate two approaches for distant supervision in suggestion mining. The distant supervision is obtained through a large silver standard dataset, constructed using the text from wikiHow and Wikipedia. Both the approaches use a LSTM based neural network architecture to learn a classification model for suggestion mining, but vary in their method to use the silver standard dataset. The first approach directly trains the classifier using this dataset, while the second approach only learns word embeddings from this dataset. In the second approach, we also learn POS embeddings, which interestingly gives the best classification accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2018

Improving Distant Supervision with Maxpooled Attention and Sentence-Level Supervision

We propose an effective multitask learning setup for reducing distant su...
research
03/03/2021

Lex2vec: making Explainable Word Embedding via Distant Supervision

In this technical report we propose an algorithm, called Lex2vec, that e...
research
03/03/2023

Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant Supervision

Ancient Chinese word segmentation (WSG) and part-of-speech tagging (POS)...
research
08/01/2017

Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm

NLP tasks are often limited by scarcity of manually annotated data. In s...
research
04/14/2019

Text segmentation on multilabel documents: A distant-supervised approach

Segmenting text into semantically coherent segments is an important task...
research
03/17/2020

BrazilDAM: A Benchmark dataset for Tailings Dam Detection

In this work we present BrazilDAM, a novel public dataset based on Senti...

Please sign up or login with your details

Forgot password? Click here to reset