Context-Sensitive Malicious Spelling Error Correction

01/23/2019
by   Hongyu Gong, et al.
0

Misspelled words of the malicious kind work by changing specific keywords and are intended to thwart existing automated applications for cyber-environment control such as harassing content detection on the Internet and email spam detection. In this paper, we focus on malicious spelling correction, which requires an approach that relies on the context and the surface forms of targeted keywords. In the context of two applications--profanity detection and email spam detection--we show that malicious misspellings seriously degrade their performance. We then propose a context-sensitive approach for malicious spelling correction using word embeddings and demonstrate its superior performance compared to state-of-the-art spell checkers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2020

Vartani Spellcheck – Automatic Context-Sensitive Spelling Correction of OCR-generated Hindi Text Using BERT and Levenshtein Distance

Traditional Optical Character Recognition (OCR) systems that generate te...
research
04/30/2018

Inherent Biases in Reference-based Evaluation for Grammatical Error Correction and Text Simplification

The prevalent use of too few references for evaluating text-to-text gene...
research
10/07/2020

CATBERT: Context-Aware Tiny BERT for Detecting Social Engineering Emails

Targeted phishing emails are on the rise and facilitate the theft of bil...
research
09/14/2023

Malicious Cyber Activity Detection Using Zigzag Persistence

In this study we synthesize zigzag persistence from topological data ana...
research
09/05/2021

A Transformer-based Model to Detect Phishing URLs

Phishing attacks are among emerging security issues that recently draws ...
research
03/24/2022

Email Summarization to Assist Users in Phishing Identification

Cyber-phishing attacks recently became more precise, targeted, and tailo...
research
10/21/2016

Exploitation of Semantic Keywords for Malicious Event Classification

Learning an event classifier is challenging when the scenes are semantic...

Please sign up or login with your details

Forgot password? Click here to reset