Deep Learning based Topic Analysis on Financial Emerging Event Tweets

08/03/2020
by   Shaan Aryaman, et al.
0

Financial analyses of stock markets rely heavily on quantitative approaches in an attempt to predict subsequent or market movements based on historical prices and other measurable metrics. These quantitative analyses might have missed out on un-quantifiable aspects like sentiment and speculation that also impact the market. Analyzing vast amounts of qualitative text data to understand public opinion on social media platform is one approach to address this gap. This work carried out topic analysis on 28264 financial tweets [1] via clustering to discover emerging events in the stock market. Three main topics were discovered to be discussed frequently within the period. First, the financial ratio EPS is a measure that has been discussed frequently by investors. Secondly, short selling of shares were discussed heavily, it was often mentioned together with Morgan Stanley. Thirdly, oil and energy sectors were often discussed together with policy. These tweets were semantically clustered by a method consisting of word2vec algorithm to obtain word embeddings that map words to vectors. Semantic word clusters were then formed. Each tweet was then vectorized using the Term Frequency-Inverse Document Frequency (TF-IDF) values of the words it consisted of and based on which clusters its words were in. Tweet vectors were then converted to compressed representations by training a deep-autoencoder. K-means clusters were then formed. This method reduces dimensionality and produces dense vectors, in contrast to the usual Vector Space Model. Topic modelling with Latent Dirichlet Allocation (LDA) and top frequent words were used to analyze clusters and reveal emerging events.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2016

Sentiment Analysis of Twitter Data for Predicting Stock Market Movements

Predicting stock market movements is a well-known problem of interest. N...
research
11/19/2019

Event detection in Colombian security Twitter news using fine-grained latent topic analysis

Cultural and social dynamics are important concepts that must be underst...
research
06/15/2021

Author Clustering and Topic Estimation for Short Texts

Analysis of short text, such as social media posts, is extremely difficu...
research
12/10/2020

A Sentiment Analysis Approach to the Prediction of Market Volatility

Prediction and quantification of future volatility and returns play an i...
research
05/08/2018

Investor Reaction to Financial Disclosures Across Topics: An Application of Latent Dirichlet Allocation

This paper provides a holistic study of how stock prices vary in their r...
research
04/12/2018

Cashtag piggybacking: uncovering spam and bot activity in stock microblogs on Twitter

Microblogs are increasingly exploited for predicting prices and traded v...
research
03/07/2020

Synthetic Error Dataset Generation Mimicking Bengali Writing Pattern

While writing Bengali using English keyboard, users often make spelling ...

Please sign up or login with your details

Forgot password? Click here to reset