Identifying DNS-tunneled traffic with predictive models

06/26/2019
by   Andreas Berg, et al.
0

DNS is a distributed, fault tolerant system that avoids a single point of failure. As such it is an integral part of the internet as we use it today and hence deemed a safe protocol which is let through firewalls and proxies with no or little checks. This can be exploited by malicious agents. Network forensics is effective but struggles due to size of data and manual labour. This paper explores to what extent predictive models can be used to predict network traffic, what protocols are tunneled in the DNS protocol and more specifically whether the predictive performance is enhanced when analyzing DNS-queries and responses together and which feature set that can be used for DNS-tunneled network prediction. The tested protocols are SSH, SFTP and Telnet and the machine learning models used are Multi Layered Perceptron and Random Forests. To train the models we extract the IP Packet length, Name length and Name entropy of both the queries and responses in the DNS traffic. With an experimental research strategy it is empirically shown that the performance of the models increases when training the models on the query and respose pairs rather than using only queries or responses. The accuracy of the models is >83 and reduction in data size when features are extracted is roughly 95 results provides evidence that machine learning is a valuable tool in detecting network protocols in a DNS tunnel and that only an small subset of network traffic is needed to detect this anomaly.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2018

PIDS - A Behavioral Framework for Analysis and Detection of Network Printer Attacks

Nowadays, every organization might be attacked through its network print...
research
11/24/2022

Network Security Modelling with Distributional Data

We investigate the detection of botnet command and control (C2) hosts in...
research
07/18/2019

Analyzing the Costs (and Benefits) of DNS, DoT, and DoH for the Modern Web

Essentially all Internet communication relies on the Domain Name System ...
research
07/20/2023

Prediction of Handball Matches with Statistically Enhanced Learning via Estimated Team Strengths

We propose a Statistically Enhanced Learning (aka. SEL) model to predict...
research
09/03/2021

Predicting Process Name from Network Data

The ability to identify applications based on the network data they gene...
research
05/06/2020

An Overview of Self-Similar Traffic: Its Implications in the Network Design

The knowledge about the true nature of the traffic in computer networkin...
research
08/03/2023

Experiments on Computer Networks: Quickly Knowing the Protocols in the TCP/IP Suite

Manual of practical experiments on protocols used in the Internet or TCP...

Please sign up or login with your details

Forgot password? Click here to reset