Classifier Suites for Insider Threat Detection

01/30/2019
by   David Noever, et al.
6

Better methods to detect insider threats need new anticipatory analytics to capture risky behavior prior to losing data. In search of the best overall classifier, this work empirically scores 88 machine learning algorithms in 16 major families. We extract risk features from the large CERT dataset, which blends real network behavior with individual threat narratives. We discover the predictive importance of measuring employee sentiment. Among major classifier families tested on CERT, the random forest algorithms offer the best choice, with different implementations scoring over 98 obscure or black-box alternatives, random forests are ensembles of many decision trees and thus offer a deep but human-readable set of detection rules (>2000 rules). We address performance rankings by penalizing long execution times against higher median accuracies using cross-fold validation. We address the relative rarity of threats as a case of low signal-to-noise (< 0.02 malicious to benign activities), and then train on both under-sampled and over-sampled data which is statistically balanced to identify nefarious actors.

READ FULL TEXT

page 1

page 2

page 3

page 7

page 8

research
04/05/2020

XtracTree for Regulator Validation of Bagging Methods Used in Retail Banking

Bootstrap aggregation, known as bagging, is one of the most popular ense...
research
02/09/2018

Predicting University Students' Academic Success and Choice of Major using Random Forests

In this paper, a large data set containing every course taken by every u...
research
09/02/2019

Guided Random Forest and its application to data approximation

We present a new way of constructing an ensemble classifier, named the G...
research
09/27/2020

Machine Learning for Searching the Dark Energy Survey for Trans-Neptunian Objects

In this paper we investigate how implementing machine learning could imp...
research
07/24/2019

Predicting Malicious Insider Threat Scenarios Using Organizational Data and a Heterogeneous Stack-Classifier

Insider threats continue to present a major challenge for the informatio...
research
02/02/2018

Detecting Zones and Threat on 3D Body for Security in Airports using Deep Machine Learning

In this research, it was used a segmentation and classification method t...
research
10/03/2018

Machine Learning Suites for Online Toxicity Detection

To identify and classify toxic online commentary, the modern tools of da...

Please sign up or login with your details

Forgot password? Click here to reset