Federated Word2Vec: Leveraging Federated Learning to Encourage Collaborative Representation Learning

04/19/2021
by Daniel Garcia Bernal, et al.

Large-scale contextual representation models have significantly advanced NLP in recent years, capturing the semantics of text to a degree never seen before. However, they need to process large amounts of data to achieve high-quality results. Combining and accessing all of this data from multiple sources can be extremely challenging for privacy and regulatory reasons. Federated Learning can address these limitations by training models in a distributed fashion, taking advantage of the hardware of the devices that generate the data. We show the viability of training NLP models, specifically Word2Vec, with the Federated Learning protocol. In particular, we focus on a scenario in which a small number of organizations each hold a relatively large corpus. The results show that neither the quality of the results nor the convergence time in Federated Word2Vec deteriorates compared with centralised Word2Vec.
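To make the distributed training idea concrete, here is a minimal sketch of one server-side aggregation step. The abstract does not state which aggregation rule the authors use, so this assumes a FedAvg-style average weighted by corpus size. The function name `federated_average` and its parameters are hypothetical and chosen only for illustration.

```python
import numpy as np


def federated_average(client_weights, client_sizes):
    """Combine per-client embedding matrices into one global matrix.

    Assumption: a FedAvg-style weighted mean, where each client's
    contribution is proportional to the size of its local corpus.
    client_weights: list of arrays, each of shape (vocab_size, dim)
    client_sizes:   list of local corpus sizes, one per client
    """
    sizes = np.asarray(client_sizes, dtype=float)
    stacked = np.stack(client_weights)   # shape: (n_clients, vocab_size, dim)
    coeffs = sizes / sizes.sum()         # each client's share of the total data
    # Contract the client axis: sum_k coeffs[k] * stacked[k]
    return np.tensordot(coeffs, stacked, axes=1)


# Toy example: two organizations sharing a 2-word vocabulary with 3-d embeddings.
org_a = np.zeros((2, 3))
org_b = np.ones((2, 3))
global_embeddings = federated_average([org_a, org_b], client_sizes=[1, 3])
```

In this sketch only the embedding matrices leave each organization, never the raw text. Each organization would train locally on its own corpus between aggregation steps and then continue from the returned global matrix.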

research
09/08/2020

A Real-time Contribution Measurement Method for Participants in Federated Learning

In recent years, individuals, business organizations or the country have...
research
02/06/2023

Adaptive Parameterization of Deep Learning Models for Federated Learning

Federated Learning offers a way to train deep neural networks in a distr...
research
04/29/2021

From Distributed Machine Learning to Federated Learning: A Survey

In recent years, data and computing resources are typically distributed ...
research
05/19/2021

A Privacy-Preserving Approach to Extraction of Personal Information through Automatic Annotation and Federated Learning

We curated WikiPII, an automatically labeled dataset composed of Wikiped...
research
10/08/2022

Collaborative Domain Blocking: Using federated NLP To Detect Malicious Domains

Current content filtering and blocking methods are susceptible to variou...
research
11/14/2022

Federated Learning for Appearance-based Gaze Estimation in the Wild

Gaze estimation methods have significantly matured in recent years, but ...
research
08/07/2022

Low-Latency Cooperative Spectrum Sensing via Truncated Vertical Federated Learning

In recent years, the exponential increase in the demand of wireless data...
