This article introduces the interactive Leipzig Corpus Miner (iLCM) -...
Fine-tuning of pre-trained transformer networks such as BERT yields state...
Contextualized word embeddings (CWE) such as those provided by ELMo (Peters et...
De-identification is the task of detecting protected health information ...
We investigate different strategies for automatic offensive language cla...
For named entity recognition (NER), bidirectional recurrent neural netwo...
We introduce an advanced information extraction pipeline to automaticall...
Investigative journalism in recent years is confronted with two major ch...
The iLCM project pursues the development of an integrated research envir...
For digitization of paper files via OCR, preservation of document contex...
In terminology work, natural language processing, and digital humanities...
This paper presents the "Leipzig Corpus Miner", a technical infrastructu...