Augmenting Semantic Representation of Depressive Language: from Forums to Microblogs

08/23/2020
by nawshad, et al.

We discuss and analyze the process of creating word embedding feature representations designed for a learning task in which annotated data is scarce, such as detecting depressive language in Tweets. We start from a rich word embedding pre-trained on a general dataset, then enhance it with an embedding learned from a domain-specific but much smaller dataset. The strengthened representation better portrays the domain of depression we are interested in, because it combines the semantics learned from the specific domain with the word coverage of general language. We present a comparative analysis of our word embedding representations against a simple bag-of-words model, a well-known sentiment lexicon, a psycholinguistic lexicon, and a general pre-trained word embedding, based on their efficacy in accurately identifying depressive Tweets. We show that our representations achieve a significantly better F1 score than the others when applied to a high-quality dataset.
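The abstract does not say how the general and domain-specific embeddings are combined, so the following is only a minimal sketch of one simple possibility, not the authors' method. It assumes both embeddings already live in a shared vector space of the same dimensionality, and it interpolates the vectors of words found in both vocabularies while keeping the rest unchanged. The names `augment_embeddings`, `tweet_vector`, and the weight `alpha` are hypothetical, and the toy vectors are made up for illustration.

```python
def augment_embeddings(general, domain, alpha=0.5):
    """Blend two word -> vector dicts (hypothetical helper).

    Assumes both dicts use vectors of equal length in a shared space.
    Words in both vocabularies are interpolated with weight `alpha` on the
    general vector; words in only one vocabulary keep that vector.
    """
    combined = dict(general)  # start from the broad-coverage vocabulary
    for word, vec in domain.items():
        if word in general:
            combined[word] = [
                alpha * g + (1.0 - alpha) * d
                for g, d in zip(general[word], vec)
            ]
        else:
            combined[word] = list(vec)
    return combined


def tweet_vector(tokens, embeddings, dim):
    """Average the vectors of known tokens; return zeros if none are known."""
    vecs = [embeddings[t] for t in tokens if t in embeddings]
    if not vecs:
        return [0.0] * dim
    return [sum(column) / len(vecs) for column in zip(*vecs)]


# Toy two-dimensional vectors, invented purely for illustration.
general = {"sad": [1.0, 0.0], "day": [0.0, 1.0]}
domain = {"sad": [0.0, 1.0], "numb": [1.0, 1.0]}

combined = augment_embeddings(general, domain)
features = tweet_vector(["sad", "numb", "unknownword"], combined, dim=2)
```

The resulting `features` vector could then be passed to any standard classifier. In this sketch the combined vocabulary is the union of the two sources, which mirrors the abstract's point about pairing domain semantics with general word coverage.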


research
06/24/2021

A comprehensive empirical analysis on cross-domain semantic enrichment for detection of depressive language

We analyze the process of creating word embedding feature representation...
research
08/28/2020

QutNocturnal@HASOC'19: CNN for Hate Speech and Offensive Content Identification in Hindi Language

We describe our top-team solution to Task 1 for Hindi in the HASOC conte...
research
12/05/2017

EmTaggeR: A Word Embedding Based Novel Method for Hashtag Recommendation on Twitter

The hashtag recommendation problem addresses recommending (suggesting) o...
research
03/06/2020

Automatic Machine Learning Derived from Scholarly Big Data

One of the challenging aspects of applying machine learning is the need ...
research
12/24/2021

Analyzing Scientific Publications using Domain-Specific Word Embedding and Topic Modelling

The scientific world is changing at a rapid pace, with new technology be...
research
06/14/2023

Towards Automatic Identification of Violation Symptoms of Architecture Erosion

Architecture erosion has a detrimental effect on maintenance and evoluti...
research
12/04/2018

Twitter-based traffic information system based on vector representations for words

Recently, researchers have shown an increased interest in harnessing Twi...
