MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources

05/25/2020
by   Farhad Akhbardeh, et al.
0

Maintenance record logbooks are an emerging text type in NLP. They typically consist of free text documents with many domain specific technical terms, abbreviations, as well as non-standard spelling and grammar, which poses difficulties to NLP pipelines trained on standard corpora. Analyzing and annotating such documents is of particular importance in the development of predictive maintenance systems, which aim to provide operational efficiencies, prevent accidents and save lives. In order to facilitate and encourage research in this area, we have developed MaintNet, a collaborative open-source library of technical and domain-specific language datasets. MaintNet provides novel logbook data from the aviation, automotive, and facilities domains along with tools to aid in their (pre-)processing and clustering. Furthermore, it provides a way to encourage discussion on and sharing of new datasets and tools for logbook data analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/30/2021

Collaborative construction of lexicographic and parallel datasets for African languages: first assessment

Faced with a considerable lack of resources in African languages to carr...
research
06/15/2023

SCALE: Scaling up the Complexity for Advanced Language Model Evaluation

Recent strides in Large Language Models (LLMs) have saturated many NLP b...
research
12/31/2022

Logic Mill – A Knowledge Navigation System

Logic Mill is a scalable and openly accessible software system that iden...
research
10/28/2022

System Demo: Tool and Infrastructure for Offensive Language Error Analysis (OLEA) in English

The automatic detection of offensive language is a pressing societal nee...
research
10/23/2020

When the Open Source Community Meets COVID-19: Characterizing COVID-19 themed GitHub Repositories

Ever since the beginning of the outbreak of the COVID-19 pandemic, resea...
research
06/11/2020

A Proposal for a Revision of ISO Modula-2

The Modula-2 language was first specified in [Wir78] by N. Wirth at ETH ...
research
06/21/2017

JaTeCS an open-source JAva TExt Categorization System

JaTeCS is an open source Java library that supports research on automati...

Please sign up or login with your details

Forgot password? Click here to reset