Ensemble-based Feature Selection and Classification Model for DNS Typo-squatting Detection

06/08/2020
by   Abdallah Moubayed, et al.
0

Domain Name System (DNS) plays in important role in the current IP-based Internet architecture. This is because it performs the domain name to IP resolution. However, the DNS protocol has several security vulnerabilities due to the lack of data integrity and origin authentication within it. This paper focuses on one particular security vulnerability, namely typo-squatting. Typo-squatting refers to the registration of a domain name that is extremely similar to that of an existing popular brand with the goal of redirecting users to malicious/suspicious websites. The danger of typo-squatting is that it can lead to information threat, corporate secret leakage, and can facilitate fraud. This paper builds on our previous work in [1], which only proposed majority-voting based classifier, by proposing an ensemble-based feature selection and bagging classification model to detect DNS typo-squatting attack. Experimental results show that the proposed framework achieves high accuracy and precision in identifying the malicious/suspicious typo-squatting domains (a loss of at most 1.5 that used the complete feature set) while having a lower computational complexity due to the smaller feature set (a reduction of more than 50 feature set size).

READ FULL TEXT
research
12/25/2020

DNS Typo-squatting Domain Detection: A Data Analytics Machine Learning Based Approach

Domain Name System (DNS) is a crucial component of current IP-based netw...
research
12/16/2020

Optimized Random Forest Model for Botnet Detection Based on DNS Queries

The Domain Name System (DNS) protocol plays a major role in today's Inte...
research
12/15/2022

A new weighted ensemble model for phishing detection based on feature selection

A phishing attack is a sort of cyber assault in which the attacker sends...
research
10/06/2022

Effective Metaheuristic Based Classifiers for Multiclass Intrusion Detection

Network security has become the biggest concern in the area of cyber sec...
research
02/19/2020

Detection and Analysis of Drive-by Downloads and Malicious Websites

A drive by download is a download that occurs without users action or kn...
research
06/02/2020

Less is More: Robust and Novel Features for Malicious Domain Detection

Malicious domains are increasingly common and pose a severe cybersecurit...
research
04/30/2021

A User-Guided Bayesian Framework for Ensemble Feature Selection in Life Science Applications (UBayFS)

Training machine learning models on high-dimensional datasets is a chall...

Please sign up or login with your details

Forgot password? Click here to reset