L3Cube-MahaNLP: Marathi Natural Language Processing Datasets, Models, and Library

05/29/2022
by Raviraj Joshi, et al.

Despite being the third most widely spoken language in India, Marathi lacks useful NLP resources, and popular NLP libraries do not support it. With L3Cube-MahaNLP, we aim to build resources and a library for Marathi natural language processing. We present datasets and transformer models for supervised tasks such as sentiment analysis, named entity recognition, and hate speech detection. We have also published a monolingual Marathi corpus for unsupervised language modeling. Overall, we present the MahaCorpus, MahaSent, MahaNER, and MahaHate datasets, along with the corresponding MahaBERT models fine-tuned on them. We aim to move beyond benchmark datasets and prepare useful resources for Marathi. The resources are available at https://github.com/l3cube-pune/MarathiNLP.
