On Multi-Session Website Fingerprinting over TLS Handshake

by Aida Ramezani, et al.

Analyzing users' Internet traffic and activities affects their experience in several ways, from maintaining quality of service and powering high-quality recommendation systems to anomaly detection and secure connections. Because the Internet is a complex network, we cannot separate the packets that belong to each individual activity. We therefore need a model that can identify all the activities an Internet user performs in a given period of time. In this paper, we propose a deep learning approach that produces a multi-label classifier able to predict the websites a user visits in a certain period. The model works by extracting the server names that appear, in chronological order, in the TLSv1.2 and TLSv1.3 Client Hello packets. We compare its results on the test data with those of a simple fully-connected neural network developed for the same purpose, to show that using the time-sequential information improves performance. For further evaluation, we test the model on a human-made dataset and on a modified dataset to check its accuracy under different circumstances. Finally, our proposed model achieves an accuracy of 95% on both the modified dataset and the human-made dataset.
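The input representation described in the abstract can be illustrated with a minimal sketch. It assumes the server names have already been extracted from the Client Hello packets as `(timestamp, server_name)` pairs. The function names, the padding and unknown-token ids, and the example hostnames are all hypothetical, and this is not the authors' actual preprocessing pipeline. It only shows how a chronologically ordered name sequence and a multi-label (multi-hot) target might be prepared.

```python
from typing import Dict, List, Sequence, Tuple

Observation = Tuple[float, str]  # (capture timestamp, server name) -- assumed format

PAD, UNK = 0, 1  # reserved ids; an assumed convention, not taken from the paper


def build_vocab(sessions: Sequence[Sequence[Observation]]) -> Dict[str, int]:
    """Assign an integer id to every server name seen in the training sessions."""
    vocab: Dict[str, int] = {}
    for session in sessions:
        for _, name in session:
            if name not in vocab:
                vocab[name] = len(vocab) + 2  # ids 0 and 1 are reserved
    return vocab


def encode_session(session: Sequence[Observation],
                   vocab: Dict[str, int],
                   max_len: int) -> List[int]:
    """Sort observations by time, map names to ids, then truncate or pad."""
    ordered = sorted(session, key=lambda obs: obs[0])
    ids = [vocab.get(name, UNK) for _, name in ordered][:max_len]
    return ids + [PAD] * (max_len - len(ids))


def multi_hot(visited: Sequence[str], sites: Sequence[str]) -> List[int]:
    """Multi-label target: 1 for every candidate site visited in the period."""
    visited_set = set(visited)
    return [1 if site in visited_set else 0 for site in sites]


# Hypothetical capture: packets arrive out of order and are sorted by timestamp.
session = [(2.0, "cdn.example.net"),
           (0.5, "www.example.com"),
           (1.1, "static.example.org")]
vocab = build_vocab([session])
encoded = encode_session(session, vocab, max_len=5)
labels = multi_hot(["example.com"], ["example.com", "example.org"])
```

A sequence model could consume `encoded`, while a fully-connected baseline like the one mentioned in the abstract would typically discard the ordering, for example by using a bag-of-names vector instead.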




