Privacy-Preserving Phishing Email Detection Based on Federated Learning and LSTM

10/12/2021
by Yuwei Sun, et al.

Phishing emails appear legitimate and lure people into clicking on malicious links or opening malicious attached documents. Increasingly sophisticated phishing campaigns in recent years call for detection systems that are more adaptive than traditional signature-based methods. In this regard, natural language processing (NLP) with deep neural networks (DNNs) has been adopted to acquire knowledge from large numbers of emails. However, such daily communications contain sensitive personal information and are difficult to collect on a server for centralized learning, given escalating privacy concerns. To this end, we propose a decentralized phishing email detection method called the Federated Phish Bowl (FPB), which leverages federated learning and long short-term memory (LSTM). FPB allows different clients to build and share a common knowledge representation through the aggregation of trained models, safeguarding both email security and privacy. A recent phishing email dataset collected from an intergovernmental organization was used to train the model. We evaluated model performance under various assumptions about the total number of clients and the level of data heterogeneity. The experimental results suggest that FPB is robust to a continually increasing number of clients and to various levels of data heterogeneity, retaining a detection accuracy of 0.83 while protecting the privacy of sensitive email communications.
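The abstract does not spell out how FPB aggregates the trained models, so the following is only a minimal sketch of one common choice: federated averaging, where the server averages client parameters weighted by each client's local sample count. The function name `federated_average`, the flat dictionary representation of weights, and the parameter names `lstm.kernel` and `dense.bias` are all hypothetical and chosen for illustration; they are not taken from the paper.

```python
from typing import Dict, List

# Hypothetical flat representation: parameter name -> list of values.
Weights = Dict[str, List[float]]


def federated_average(client_weights: List[Weights], client_sizes: List[int]) -> Weights:
    """Average client parameters, weighting each client by its local sample count."""
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client model")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    averaged: Weights = {}
    for name in client_weights[0]:
        length = len(client_weights[0][name])
        # Weighted mean of each parameter entry across all clients.
        averaged[name] = [
            sum(w[name][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(length)
        ]
    return averaged


# Two hypothetical clients holding 100 and 300 emails respectively.
# Only parameters leave the clients; the raw emails never do.
clients = [
    {"lstm.kernel": [0.0, 1.0], "dense.bias": [2.0]},
    {"lstm.kernel": [4.0, 1.0], "dense.bias": [6.0]},
]
global_weights = federated_average(clients, [100, 300])
```

In this sketch the client with more emails pulls the global parameters closer to its own values. In a full training loop the server would send `global_weights` back to the clients for the next round of local training.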


research
07/12/2022

Efficient and Privacy Preserving Group Signature for Federated Learning

Federated Learning (FL) is a Machine Learning (ML) technique that aims t...
research
12/20/2017

Differentially Private Federated Learning: A Client Level Perspective

Federated learning is a recent advance in privacy protection. In this co...
research
10/26/2021

DPCOVID: Privacy-Preserving Federated Covid-19 Detection

Coronavirus (COVID-19) has shown an unprecedented global crisis by the d...
research
12/22/2020

Turn Signal Prediction: A Federated Learning Case Study

Driving etiquette takes a different flavor for each locality as drivers ...
research
01/04/2022

Semantics-Preserved Distortion for Personal Privacy Protection

Privacy protection is an important and concerning topic in Federated Lea...
research
10/14/2022

Close the Gate: Detecting Backdoored Models in Federated Learning based on Client-Side Deep Layer Output Analysis

Federated Learning (FL) is a scheme for collaboratively training Deep Ne...
research
01/18/2023

Robust Knowledge Adaptation for Federated Unsupervised Person ReID

Person Re-identification (ReID) has been extensively studied in recent y...
