TypoSwype: An Imaging Approach to Detect Typo-Squatting

09/02/2022
by   Joon Sern Lee, et al.
0

Typo-squatting domains are a common cyber-attack technique. It involves utilising domain names, that exploit possible typographical errors of commonly visited domains, to carry out malicious activities such as phishing, malware installation, etc. Current approaches typically revolve around string comparison algorithms like the Demaru-Levenschtein Distance (DLD) algorithm. Such techniques do not take into account keyboard distance, which researchers find to have a strong correlation with typical typographical errors and are trying to take account of. In this paper, we present the TypoSwype framework which converts strings to images that take into account keyboard location innately. We also show how modern state of the art image recognition techniques involving Convolutional Neural Networks, trained via either Triplet Loss or NT-Xent Loss, can be applied to learn a mapping to a lower dimensional space where distances correspond to image, and equivalently, textual similarity. Finally, we also demonstrate our method's ability to improve typo-squatting detection over the widely used DLD algorithm, while maintaining the classification accuracy as to which domain the input domain was attempting to typo-squat.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2023

GlyphNet: Homoglyph domains dataset and detection using attention-based Convolutional Neural Networks

Cyber attacks deceive machines into believing something that does not ex...
research
05/24/2018

Detecting Homoglyph Attacks with a Siamese Neural Network

A homoglyph (name spoofing) attack is a common technique used by adversa...
research
03/12/2020

Inline Detection of DGA Domains Using Side Information

Malware applications typically use a command and control (C C) server ...
research
10/07/2021

Scale Invariant Domain Generalization Image Recapture Detection

Recapturing and rebroadcasting of images are common attack methods in in...
research
06/02/2020

Less is More: Robust and Novel Features for Malicious Domain Detection

Malicious domains are increasingly common and pose a severe cybersecurit...
research
10/04/2018

Detecting DGA domains with recurrent neural networks and side information

Modern malware typically makes use of a domain generation algorithm (DGA...
research
06/24/2020

PhishGAN: Data Augmentation and Identification of Homoglpyh Attacks

Homoglyph attacks are a common technique used by hackers to conduct phis...

Please sign up or login with your details

Forgot password? Click here to reset