GPT-2C: A GPT-2 parser for Cowrie honeypot logs

09/14/2021
by Febrian Setianto, et al.

Deception technologies like honeypots produce comprehensive log reports, but often lack interoperability with EDR and SIEM technologies. A key bottleneck is that existing information transformation plugins perform well on static log fields (e.g., geolocation), but face limitations when parsing dynamic log topics (e.g., user-generated content). In this paper, we present a run-time system (GPT-2C) that leverages a large pre-trained model (GPT-2) to parse dynamic logs generated by a Cowrie SSH honeypot. Our fine-tuned model achieves 89% inference accuracy in the new domain and demonstrates acceptable execution latency.

research
08/14/2023

Hue: A User-Adaptive Parser for Hybrid Logs

Log parsing, which extracts log templates from semi-structured logs and ...
research
08/10/2022

LogStamp: Automatic Online Log Parsing Based on Sequence Labelling

Logs are one of the most critical data for service management. It contai...
research
02/13/2019

Delog: A Privacy Preserving Log Filtering Framework for Online Compute Platforms

In many software applications, logs serve as the only interface between ...
research
10/29/2021

AWSOM-LP: An Effective Log Parsing Technique Using Pattern Recognition and Frequency Analysis

Logs provide users with useful insights to help with a variety of develo...
research
03/21/2023

LogQA: Question Answering in Unstructured Logs

Modern systems produce a large volume of logs to record run-time status ...
research
12/23/2021

SemParser: A Semantic Parser for Log Analysis

Logs, being run-time information automatically generated by software, re...
research
12/29/2022

System Log Parsing: A Survey

Modern information and communication systems have become increasingly ch...
