DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains

04/03/2023
by   Yanis Labrak, et al.
In recent years, pre-trained language models (PLMs) have achieved the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general-domain data, specialized ones have emerged to handle specific domains more effectively. In this paper, we propose an original study of PLMs for the medical domain in French. We compare, for the first time, the performance of PLMs trained on public data from the web and on private data from healthcare establishments. We also evaluate different learning strategies on a set of biomedical tasks. In particular, we show that we can take advantage of existing biomedical PLMs in a foreign language by further pre-training them on our targeted data. Finally, we release the first specialized PLMs for the biomedical field in French, called DrBERT, as well as the largest corpus of medical data under a free license, on which these models are trained.
