Form 10-K Itemization

02/18/2023
by   Yanci Zhang, et al.
0

Form 10-K report is a financial report disclosing the annual financial state of a public company. It is an important evidence to conduct financial analysis, i.e., asset pricing, corporate finance. Practitioners and researchers are constantly designing algorithms to better conduct analysis on information in the Form 10-K report. The vast majority of previous works focus on quantitative data. With recent advancement on natural language processing (NLP), textual data in financial filing attracts more attention. However, to incorporate textual data for analyzing, Form 10-K Itemization is a necessary pre-process step. It aims to segment the whole document into several Item sections, where each Item section focuses on a specific financial aspect of the company. With the segmented Item sections, NLP techniques can directly apply on those Item sections related to downstream tasks. In this paper, we develop a Form 10-K Itemization system which can automatically segment all the Item sections in 10-K documents. The system is both effective and efficient. It reaches a retrieval rate of 93

READ FULL TEXT
research
04/23/2021

Form 10-Q Itemization

Form 10-Q, the quarterly financial statement, is one of the most crucial...
research
06/14/2022

FETILDA: An Effective Framework For Fin-tuned Embeddings For Long Financial Text Documents

Unstructured data, especially text, continues to grow rapidly in various...
research
01/06/2021

Text analysis in financial disclosures

Financial disclosure analysis and Knowledge extraction is an important f...
research
05/24/2023

Leveraging LLMs for KPIs Retrieval from Hybrid Long-Document: A Comprehensive Framework and Dataset

Large Language Models (LLMs) demonstrate exceptional performance in text...
research
08/05/2023

Textual Data Mining for Financial Fraud Detection: A Deep Learning Approach

In this report, I present a deep learning approach to conduct a natural ...
research
01/24/2020

The Enron Corpus: Where the Email Bodies are Buried?

To probe the largest public-domain email database for indicators of frau...
research
07/11/2022

Learning Mutual Fund Categorization using Natural Language Processing

Categorization of mutual funds or Exchange-Traded-funds (ETFs) have long...

Please sign up or login with your details

Forgot password? Click here to reset