Privacy-Preserving Federated Discovery of DNA Motifs with Differential Privacy

04/04/2023
by   Yao Chen, et al.
0

DNA motif discovery is an important issue in gene research, which aims to identify transcription factor binding sites (i.e., motifs) in DNA sequences to reveal the mechanisms that regulate gene expression. However, the phenomenon of data silos and the problem of privacy leakage have seriously hindered the development of DNA motif discovery. On the one hand, the phenomenon of data silos makes data collection difficult. On the other hand, the collection and use of DNA data become complicated and difficult because DNA is sensitive private information. In this context, how discovering DNA motifs under the premise of ensuring privacy and security and alleviating data silos has become a very important issue. Therefore, this paper proposes a novel method, namely DP-FLMD, to address this problem. Note that this is the first application of federated learning to the field of genetics research. The federated learning technique is used to solve the problem of data silos. It has the advantage of enabling multiple participants to train models together and providing privacy protection services. To address the challenges of federated learning in terms of communication costs, this paper applies a sampling method and a strategy for reducing communication costs to DP-FLMD. In addition, differential privacy, a privacy protection technique with rigorous mathematical proof, is also applied to DP-FLMD. Experiments on the DNA datasets show that DP-FLMD has high mining accuracy and runtime efficiency, and the performance of the algorithm is affected by some parameters.

READ FULL TEXT
research
11/09/2021

DP-REC: Private Communication-Efficient Federated Learning

Privacy and communication efficiency are important challenges in federat...
research
12/03/2021

Differential Privacy in Privacy-Preserving Big Data and Learning: Challenge and Opportunity

Differential privacy (DP) has become the de facto standard of privacy pr...
research
02/08/2023

Exploratory Analysis of Federated Learning Methods with Differential Privacy on MIMIC-III

Background: Federated learning methods offer the possibility of training...
research
03/28/2022

FLDP: Flexible strategy for local differential privacy

Local differential privacy (LDP), a technique applying unbiased statisti...
research
11/16/2017

Privacy-preserving Edit Distance on Genomic Data

Suppose Alice holds a DNA sequence and Bob owns a database of DNA sequen...
research
09/30/2022

RL-MD: A Novel Reinforcement Learning Approach for DNA Motif Discovery

The extraction of sequence patterns from a collection of functionally li...
research
03/10/2022

Facilitating Federated Genomic Data Analysis by Identifying Record Correlations while Ensuring Privacy

With the reduction of sequencing costs and the pervasiveness of computin...

Please sign up or login with your details

Forgot password? Click here to reset