OpenEDGAR: Open Source Software for SEC EDGAR Analysis

06/13/2018
by   Michael J Bommarito II, et al.
1

OpenEDGAR is an open source Python framework designed to rapidly construct research databases based on the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system operated by the US Securities and Exchange Commission (SEC). OpenEDGAR is built on the Django application framework, supports distributed compute across one or more servers, and includes functionality to (i) retrieve and parse index and filing data from EDGAR, (ii) build tables for key metadata like form type and filer, (iii) retrieve, parse, and update CIK to ticker and industry mappings, (iv) extract content and metadata from filing documents, and (v) search filing document contents. OpenEDGAR is designed for use in both academic research and industrial applications, and is distributed under MIT License at https://github.com/LexPredict/openedgar.

READ FULL TEXT
research
06/10/2018

LexNLP: Natural language processing and information extraction for legal and regulatory texts

LexNLP is an open source Python package focused on natural language proc...
research
02/06/2023

FastCat Catalogues: Interactive Entity-based Exploratory Analysis of Archival Documents

We describe FastCat Catalogues, a Web application that supports research...
research
02/23/2022

TARexp: A Python Framework for Technology-Assisted Review Experiments

Technology-assisted review (TAR) is an important industrial application ...
research
12/06/2022

ACRO: A multi-language toolkit for supporting Automated Checking of Research Outputs

This paper discusses the development of an open source tool ACRO, (Autom...
research
03/17/2023

A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents

Extracting information from academic PDF documents is crucial for numero...
research
07/01/2021

Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Electronic Theses and Dissertations (ETDs) contain domain knowledge that...
research
07/02/2021

You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source

Academic trade requires juggling multiple variants of the same content p...

Please sign up or login with your details

Forgot password? Click here to reset