Logic Mill – A Knowledge Navigation System

12/31/2022
by   Sebastian Erhardt, et al.
0

Logic Mill is a scalable and openly accessible software system that identifies semantically similar documents within either one domain-specific corpus or multi-domain corpora. It uses advanced Natural Language Processing (NLP) techniques to generate numerical representations of documents. Currently it leverages a large pre-trained language model to generate these document representations. The system focuses on scientific publications and patent documents and contains more than 200 million documents. It is easily accessible via a simple Application Programming Interface (API) or via a web interface. Moreover, it is continuously being updated and can be extended to text corpora from other domains. We see this system as a general-purpose tool for future research applications in the social sciences and other domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/05/2018

Not just about size - A Study on the Role of Distributed Word Representations in the Analysis of Scientific Publications

The emergence of knowledge graphs in the scholarly communication domain ...
research
04/09/2021

Automatic Knowledge Extraction with Human Interface

OrbWeaver, an automatic knowledge extraction system paired with a human ...
research
08/26/2021

SAUCE: Truncated Sparse Document Signature Bit-Vectors for Fast Web-Scale Corpus Expansion

Recent advances in text representation have shown that training on large...
research
05/25/2020

MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources

Maintenance record logbooks are an emerging text type in NLP. They typic...
research
06/04/2018

History Playground: A Tool for Discovering Temporal Trends in Massive Textual Corpora

Recent studies have shown that macroscopic patterns of continuity and ch...
research
06/30/2021

Machine Reading of Hypotheses for Organizational Research Reviews and Pre-trained Models via R Shiny App for Non-Programmers

The volume of scientific publications in organizational research becomes...
research
08/01/2017

An Investigation into the Pedagogical Features of Documents

Characterizing the content of a technical document in terms of its learn...

Please sign up or login with your details

Forgot password? Click here to reset