Learning Mutual Fund Categorization using Natural Language Processing

07/11/2022
by   Dimitrios Vamvourellis, et al.
0

Categorization of mutual funds or Exchange-Traded-funds (ETFs) have long served the financial analysts to perform peer analysis for various purposes starting from competitor analysis, to quantifying portfolio diversification. The categorization methodology usually relies on fund composition data in the structured format extracted from the Form N-1A. Here, we initiate a study to learn the categorization system directly from the unstructured data as depicted in the forms using natural language processing (NLP). Positing as a multi-class classification problem with the input data being only the investment strategy description as reported in the form and the target variable being the Lipper Global categories, and using various NLP models, we show that the categorization system can indeed be learned with high accuracy. We discuss implications and applications of our findings as well as limitations of existing pre-trained architectures in applying them to learn fund categorization.

READ FULL TEXT
research
05/29/2020

Machine Learning Fund Categorizations

Given the surge in popularity of mutual funds (including exchange-traded...
research
09/13/2023

Beyond original Research Articles Categorization via NLP

This work proposes a novel approach to text categorization – for unknown...
research
09/12/2018

Using the Tsetlin Machine to Learn Human-Interpretable Rules for High-Accuracy Text Categorization with Medical Applications

Medical applications challenge today's text categorization techniques by...
research
10/20/2019

A Semi-Automated Approach for Information Extraction, Classification and Analysis of Unstructured Data

In this paper, we show how Quantitative Narrative Analysis and simple Na...
research
06/23/2016

Explaining Predictions of Non-Linear Classifiers in NLP

Layer-wise relevance propagation (LRP) is a recently proposed technique ...
research
04/17/2019

Casting Light on Invisible Cities: Computationally Engaging with Literary Criticism

Literary critics often attempt to uncover meaning in a single work of li...
research
02/18/2023

Form 10-K Itemization

Form 10-K report is a financial report disclosing the annual financial s...

Please sign up or login with your details

Forgot password? Click here to reset