Large Scale Enrichment and Statistical Cyber Characterization of Network Traffic

09/07/2022
by   Ivan Kawaminami, et al.
0

Modern network sensors continuously produce enormous quantities of raw data that are beyond the capacity of human analysts. Cross-correlation of network sensors increases this challenge by enriching every network event with additional metadata. These large volumes of enriched network data present opportunities to statistically characterize network traffic and quickly answer a key question: "What are the primary cyber characteristics of my network data?" The Python GraphBLAS and PyD4M analysis frameworks enable anonymized statistical analysis to be performed quickly and efficiently on very large network data sets. This approach is tested using billions of anonymized network data samples from the largest Internet observatory (CAIDA Telescope) and tens of millions of anonymized records from the largest commercially available background enrichment capability (GreyNoise). The analysis confirms that most of the enriched variables follow expected heavy-tail distributions and that a large fraction of the network traffic is due to a small number of cyber activities. This information can simplify the cyber analysts' task by enabling prioritization of cyber activities based on statistical prevalence.

READ FULL TEXT

page 6

page 7

research
04/24/2018

Automated Big Traffic Analytics for Cyber Security

Network traffic analytics technology is a cornerstone for cyber security...
research
12/15/2017

Network Intell: Enabling the Non-Expert Analysis of Large Volumes of Intercepted Network Traffic

In criminal investigations, telecommunication wiretaps have become a com...
research
09/04/2023

Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices

Defending community-owned cyber space requires community-based efforts. ...
research
12/08/2021

ESAFE: Enterprise Security and Forensics at Scale

Securing enterprise networks presents challenges in terms of both their ...
research
06/14/2023

LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting

Traffic forecasting plays a critical role in smart city initiatives and ...
research
12/23/2018

Exploratory Data Analysis of a Network Telescope Traffic and Prediction of Port Probing Rates

Understanding the properties exhibited by large scale network probing tr...
research
01/14/2019

Statistical Models for the Number of Successful Cyber Intrusions

We propose several generalized linear models (GLMs) to predict the numbe...

Please sign up or login with your details

Forgot password? Click here to reset