LexNLP: Natural language processing and information extraction for legal and regulatory texts

06/10/2018
by Michael J Bommarito II, et al.

LexNLP is an open source Python package focused on natural language processing and machine learning for legal and regulatory text. The package includes functionality to (i) segment documents, (ii) identify key text such as titles and section headings, (iii) extract over eighteen types of structured information like distances and dates, (iv) extract named entities such as companies and geopolitical entities, (v) transform text into features for model training, and (vi) build unsupervised and supervised models such as word embedding or tagging models. LexNLP includes pre-trained models based on thousands of unit tests drawn from real documents available from the SEC EDGAR database as well as various judicial and regulatory proceedings. LexNLP is designed for use in both academic research and industrial applications, and is distributed at https://github.com/LexPredict/lexpredict-lexnlp.
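
To make items (i) and (iii) above concrete, here is a minimal sketch of sentence segmentation and the extraction of dates and distances from a contract-like sentence. It uses only the Python standard library and is not LexNLP's API or implementation; the function names (split_sentences, extract_dates, extract_distances), the regular expressions, and the sample text are all assumptions made for illustration, and they cover far fewer formats than a real extractor would.

```python
import re
from datetime import date

# Hypothetical helpers for illustration only; these are NOT LexNLP functions.
MONTHS = {
    name: number
    for number, name in enumerate(
        ["January", "February", "March", "April", "May", "June", "July",
         "August", "September", "October", "November", "December"],
        start=1,
    )
}

# Matches dates written as "June 10, 2018" (one format only, by assumption).
DATE_RE = re.compile(r"\b(" + "|".join(MONTHS) + r")\s+(\d{1,2}),\s+(\d{4})\b")

# Matches a number followed by a small, assumed set of distance units.
DISTANCE_RE = re.compile(
    r"\b(\d+(?:\.\d+)?)\s+(miles?|kilometers?|feet|meters?)\b", re.IGNORECASE
)


def split_sentences(text):
    """Naive segmentation: split after ., ! or ? when a capital letter follows."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text)
    return [part.strip() for part in parts if part.strip()]


def extract_dates(text):
    """Return datetime.date objects for every 'Month D, YYYY' match."""
    return [date(int(y), MONTHS[m], int(d)) for m, d, y in DATE_RE.findall(text)]


def extract_distances(text):
    """Return (value, unit) pairs, with the unit lower-cased."""
    return [(float(v), u.lower()) for v, u in DISTANCE_RE.findall(text)]


sample = (
    "This Agreement is dated June 10, 2018. "
    "The Premises lie within 5 miles of the Facility."
)
sentences = split_sentences(sample)
dates = extract_dates(sample)
distances = extract_distances(sample)
```

A rule-based sketch like this breaks quickly on real legal text (abbreviations such as "Inc." or "No.", or dates written as "10th day of June"), which is the kind of variation a package tested against real documents is meant to handle.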

research
01/31/2021

BNLP: Natural language processing toolkit for Bengali language

BNLP is an open source language processing toolkit for Bengali language ...
research
06/13/2018

OpenEDGAR: Open Source Software for SEC EDGAR Analysis

OpenEDGAR is an open source Python framework designed to rapidly constru...
research
03/29/2022

An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex

Word embedding is a modern distributed word representations approach wid...
research
03/08/2021

AfriVEC: Word Embedding Models for African Languages. Case Study of Fon and Nobiin

From Word2Vec to GloVe, word embedding models have played key roles in t...
research
06/28/2023

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases

Large Language Models (LLMs) have shown the potential to revolutionize n...
research
03/13/2020

Predicting Legal Proceedings Status: an Approach Based on Sequential Text Data

Machine learning applications in the legal field are numerous and divers...
research
11/03/2021

Automatic Embedding of Stories Into Collections of Independent Media

We look at how machine learning techniques that derive properties of ite...