A new approach to content-based file type detection

02/17/2010
by   M. C. Amirani, et al.
0

File type identification and file type clustering may be difficult tasks that have an increasingly importance in the field of computer and network security. Classical methods of file type detection including considering file extensions and magic bytes can be easily spoofed. Content-based file type detection is a newer way that is taken into account recently. In this paper, a new content-based method for the purpose of file type detection and file type clustering is proposed that is based on the PCA and neural networks. The proposed method has a good accuracy and is fast enough.

READ FULL TEXT
research
09/12/2017

A HelloWord Bib stile file .bst

A HelloWord Bib stile file .bst is described...
research
01/21/2021

Content-Based Textual File Type Detection at Scale

Programming language detection is a common need in the analysis of large...
research
02/17/2015

Randomized LU decomposition: An Algorithm for Dictionaries Construction

In recent years, distinctive-dictionary construction has gained importan...
research
08/16/2019

FiFTy: Large-scale File Fragment Type Identification using Neural Networks

We present FiFTy, a modern file type identification tool for memory fore...
research
01/27/2023

Adversarial Networks and Machine Learning for File Classification

Correctly identifying the type of file under examination is a critical p...
research
02/25/2021

File fragment recognition based on content and statistical features

Nowadays, the speed up development and use of digital devices such as sm...
research
10/05/2020

Metadata-Based Detection of Child Sexual Abuse Material

In the last decade, the scale of creation and distribution of child sexu...

Please sign up or login with your details

Forgot password? Click here to reset