StonkBERT: Can Language Models Predict Medium-Run Stock Price Movements?

by   Stefan Pasch, et al.

To answer this question, we fine-tune transformer-based language models, including BERT, on different sources of company-related text data for a classification task to predict the one-year stock price performance. We use three different types of text data: News articles, blogs, and annual reports. This allows us to analyze to what extent the performance of language models is dependent on the type of the underlying document. StonkBERT, our transformer-based stock performance classifier, shows substantial improvement in predictive accuracy compared to traditional language models. The highest performance was achieved with news articles as text source. Performance simulations indicate that these improvements in classification accuracy also translate into above-average stock market returns.


page 1

page 2

page 3

page 4


Towards Earnings Call and Stock Price Movement

Earnings calls are hosted by management of public companies to discuss t...

S P 500 Stock Price Prediction Using Technical, Fundamental and Text Data

We summarized both common and novel predictive models used for stock pri...

Stock trend prediction using news sentiment analysis

Efficient Market Hypothesis is the popular theory about stock prediction...

Networks of News and Cross-Sectional Returns

We uncover networks from news articles to study cross-sectional stock re...

Stock Market Analysis with Text Data: A Review

Stock market movements are influenced by public and private information ...

Choosing News Topics to Explain Stock Market Returns

We analyze methods for selecting topics in news articles to explain stoc...

"TL;DR:" Out-of-Context Adversarial Text Summarization and Hashtag Recommendation

This paper presents Out-of-Context Summarizer, a tool that takes arbitra...