Metadata-Based Detection of Child Sexual Abuse Material

10/05/2020
by   Mayana Pereira, et al.
0

In the last decade, the scale of creation and distribution of child sexual abuse medias (CSAM) has exponentially increased. Technologies that aid law enforcement agencies worldwide to identify such crimes rapidly can potentially result in the mitigation of child victimization, and the apprehending of offenders. Machine learning presents the potential to help law enforcement rapidly identify such material, and even block such content from being distributed digitally. However, collecting and storing CSAM files to train machine learning models has many ethical and legal constraints, creating a barrier to the development of accurate computer vision-based models. With such restrictions in place, the development of accurate machine learning classifiers for CSAM identification based on file metadata becomes crucial. In this work, we propose a system for CSAM identification on file storage systems based solely on metadata - file paths. Our aim is to provide a tool that is material type agnostic (image, video, PDF), and can potentially scans thousands of file storage systems in a short time. Our approach uses convolutional neural networks, and achieves an accuracy of 97 94 evade detection by this model by evaluating the robustness of our model against adversarial modifications in the file paths.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 6

page 7

page 8

page 9

research
10/29/2020

Short Text Classification Approach to Identify Child Sexual Exploitation Material

Producing or sharing Child Sexual Exploitation Material (CSEM) is a seri...
research
02/17/2010

A new approach to content-based file type detection

File type identification and file type clustering may be difficult tasks...
research
01/27/2023

Adversarial Networks and Machine Learning for File Classification

Correctly identifying the type of file under examination is a critical p...
research
11/08/2019

CFS: A Distributed File System for Large Scale Container Platforms

We propose CFS, a distributed file system for large scale container plat...
research
01/09/2022

Camera-Model Identification Using Encoding and Container Characteristics of Video Files

We introduce a new method for camera-model identification. Our approach ...
research
12/02/2020

Automated Artefact Relevancy Determination from Artefact Metadata and Associated Timeline Events

Case-hindering, multi-year digital forensic evidence backlogs have becom...
research
11/10/2019

A Multimodal CNN-based Tool to Censure Inappropriate Video Scenes

Due to the extensive use of video-sharing platforms and services for the...

Please sign up or login with your details

Forgot password? Click here to reset