File Fragment Classification using Light-Weight Convolutional Neural Networks

05/01/2023
by Mustafa Ghaleb, et al.

In digital forensics, file fragment classification is an important step toward completing the file carving process. Several techniques exist to identify the type of a file fragment without relying on metadata, for example by using header/footer and N-gram features. Recently, convolutional neural network (CNN) models have been used to build classifiers for this task. However, the number of parameters in CNNs tends to grow exponentially as the number of layers increases, which results in a dramatic increase in training and inference time. In this paper, we propose light-weight file fragment classification models based on depthwise separable CNNs. The evaluation results show that the proposed models provide faster inference with accuracy comparable to state-of-the-art CNN-based models. In particular, our models achieve an accuracy of 79% on the FFT-75 dataset with nearly 100K parameters and 164M FLOPs, which is 4x smaller and 6x faster than the state-of-the-art classifier in the literature.
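
To illustrate why depthwise separable convolutions reduce model size, the following minimal sketch compares the parameter count of a standard 1D convolution with that of a depthwise separable one (a per-channel depthwise convolution followed by a 1x1 pointwise convolution). The function names and the layer sizes used here (64 input channels, 128 output channels, kernel size 9) are hypothetical and chosen only for illustration; they are not taken from the paper's architecture.

```python
def standard_conv1d_params(c_in, c_out, kernel_size, bias=True):
    """Parameter count of a standard 1D convolution.

    Every output channel has one filter of width `kernel_size`
    for every input channel, plus an optional bias per output channel.
    """
    return kernel_size * c_in * c_out + (c_out if bias else 0)


def depthwise_separable_conv1d_params(c_in, c_out, kernel_size, bias=True):
    """Parameter count of a depthwise separable 1D convolution.

    Depthwise step: one filter of width `kernel_size` per input channel.
    Pointwise step: a 1x1 convolution mixing c_in channels into c_out.
    """
    depthwise = kernel_size * c_in + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


# Hypothetical layer sizes, for illustration only (not from the paper).
C_IN, C_OUT, KERNEL = 64, 128, 9

standard = standard_conv1d_params(C_IN, C_OUT, KERNEL)
separable = depthwise_separable_conv1d_params(C_IN, C_OUT, KERNEL)

print(f"standard conv:  {standard} parameters")
print(f"separable conv: {separable} parameters")
print(f"reduction:      {standard / separable:.1f}x")
```

For a single layer, the saving grows with the kernel size and the number of output channels, since the separable form replaces the product `kernel_size * c_in * c_out` with the sum `kernel_size * c_in + c_in * c_out`. The overall reduction for a full model depends on its other layers, so this per-layer ratio should not be read as the paper's reported 4x figure.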

research
04/14/2023

A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings

File fragment classification (FFC) on small chunks of memory is essentia...
research
11/17/2021

Using Convolutional Neural Networks to Detect Compression Algorithms

Machine learning is penetrating various domains virtually, thereby proli...
research
01/27/2023

Adversarial Networks and Machine Learning for File Classification

Correctly identifying the type of file under examination is a critical p...
research
10/28/2017

A Study of All-Convolutional Encoders for Connectionist Temporal Classification

Connectionist temporal classification (CTC) is a popular sequence predic...
research
08/16/2019

FiFTy: Large-scale File Fragment Type Identification using Neural Networks

We present FiFTy, a modern file type identification tool for memory fore...
research
09/16/2019

A few filters are enough: Convolutional Neural Network for P300 Detection

In this paper, we aim to provide elements to contribute to the discussio...
research
03/03/2016

Decision Forests, Convolutional Networks and the Models in-Between

This paper investigates the connections between two state of the art cla...