TS-CNN: Text Steganalysis from Semantic Space Based on Convolutional Neural Network

10/18/2018
by   Zhongliang Yang, et al.
0

Steganalysis has been an important research topic in cybersecurity that helps to identify covert attacks in public network. With the rapid development of natural language processing technology in the past two years, coverless steganography has been greatly developed. Previous text steganalysis methods have shown unsatisfactory results on this new steganography technique and remain an unsolved challenge. Different from all previous text steganalysis methods, in this paper, we propose a text steganalysis method(TS-CNN) based on semantic analysis, which uses convolutional neural network(CNN) to extract high-level semantic features of texts, and finds the subtle distribution differences in the semantic space before and after embedding the secret information. To train and test the proposed model, we collected and released a large text steganalysis(CT-Steg) dataset, which contains a total number of 216,000 texts with various lengths and various embedding rates. Experimental results show that the proposed model can achieve nearly 100% precision and recall, outperforms all the previous methods. Furthermore, the proposed model can even estimate the capacity of the hidden information inside. These results strongly support that using the subtle changes in the semantic space before and after embedding the secret information to conduct text steganalysis is feasible and effective.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2019

TS-RNN: Text Steganalysis Based on Recurrent Neural Networks

With the rapid development of natural language processing technologies, ...
research
04/23/2018

Clinical Assistant Diagnosis for Electronic Medical Record Based on Convolutional Neural Network

Automatically extracting useful information from electronic medical reco...
research
08/28/2020

An Intelligent CNN-VAE Text Representation Technology Based on Text Semantics for Comprehensive Big Data

In the era of big data, a large number of text data generated by the Int...
research
04/02/2019

Short Text Classification Improved by Feature Space Extension

With the explosive development of mobile Internet, short text has been a...
research
05/16/2019

TraceWalk: Semantic-based Process Graph Embedding for Consistency Checking

Process consistency checking (PCC), an interdiscipline of natural langua...
research
06/21/2022

General Framework for Reversible Data Hiding in Texts Based on Masked Language Modeling

With the fast development of natural language processing, recent advance...
research
11/12/2018

Automatically Generate Steganographic Text Based on Markov Model and Huffman Coding

Steganography, as one of the three basic information security systems, h...

Please sign up or login with your details

Forgot password? Click here to reset