Semantic-Preserving Linguistic Steganography by Pivot Translation and Semantic-Aware Bins Coding

03/08/2022
by Tianyu Yang, et al.

Linguistic steganography (LS) aims to embed secret information into a highly encoded text for covert communication. It can be roughly divided into two main categories: modification-based LS (MLS) and generation-based LS (GLS). MLS hides secret data by slightly modifying a given text without impairing its meaning, whereas GLS uses a trained language model to directly generate a text carrying the secret data. A common disadvantage of MLS methods is that the embedding payload is very low, which is the price of preserving the semantic quality of the text well. In contrast, GLS allows the data hider to embed a high payload, but at the cost of uncontrollable semantics. In this paper, we propose a novel LS method that modifies a given text by pivoting it between two different languages and embeds secret data by applying a GLS-like information encoding strategy. Our purpose is to alter the expression of the given text so that a high payload can be embedded while the semantic information remains unchanged. Experimental results show that the proposed method not only achieves a high embedding payload but also shows superior performance in maintaining semantic consistency and resisting linguistic steganalysis.
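
To make the idea of bins coding concrete, the following is a minimal Python sketch of generic bins-based embedding and extraction. It is not the authors' algorithm: the abstract does not describe how the bins are built or how semantics enter the partition. The names `embed_bits`, `extract_bits`, `TOY_BINS`, and `TOY_CANDIDATES` are hypothetical, the word-to-bin table stands in for a partition shared by sender and receiver, and the ranked candidate lists stand in for the top-ranked outputs a translation model might offer at each decoding step.

```python
from typing import Dict, List, Sequence

# Hypothetical shared secret: every word is assigned to a bin (here 1 bit per word).
TOY_BINS: Dict[str, int] = {
    "the": 0, "a": 1,
    "movie": 0, "film": 1,
    "was": 0, "is": 1,
    "great": 0, "excellent": 1,
}

# Hypothetical ranked candidates per decoding step (best candidate first).
TOY_CANDIDATES: List[List[str]] = [
    ["the", "a"],
    ["movie", "film"],
    ["was", "is"],
    ["great", "excellent"],
]


def embed_bits(candidates_per_step: Sequence[Sequence[str]],
               bins: Dict[str, int],
               secret_bits: str,
               bits_per_token: int = 1) -> List[str]:
    """At each step, pick the best-ranked candidate whose bin matches the next secret chunk."""
    if len(secret_bits) % bits_per_token != 0:
        raise ValueError("secret length must be a multiple of bits_per_token")
    chunks = [secret_bits[i:i + bits_per_token]
              for i in range(0, len(secret_bits), bits_per_token)]
    if len(chunks) > len(candidates_per_step):
        raise ValueError("payload is too long for this text")

    stego: List[str] = []
    for step, candidates in enumerate(candidates_per_step):
        if step >= len(chunks):
            # Payload exhausted: fall back to the top-ranked candidate.
            stego.append(candidates[0])
            continue
        target_bin = int(chunks[step], 2)
        for token in candidates:
            if bins.get(token) == target_bin:
                stego.append(token)
                break
        else:
            raise ValueError(f"no candidate in bin {target_bin} at step {step}")
    return stego


def extract_bits(stego_tokens: Sequence[str],
                 bins: Dict[str, int],
                 n_bits: int,
                 bits_per_token: int = 1) -> str:
    """Recover the payload by reading off the bin index of each token."""
    bits = "".join(format(bins[token], f"0{bits_per_token}b") for token in stego_tokens)
    return bits[:n_bits]  # the receiver is assumed to know the payload length
```

In this toy setting, calling `embed_bits(TOY_CANDIDATES, TOY_BINS, "0110")` selects one word per step according to the secret bits, and `extract_bits` recovers the payload from the word-to-bin table alone, without access to the candidate lists. The sketch assumes every step offers a candidate in each bin and that the receiver knows the payload length; a practical scheme would need to handle both.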

research
06/21/2022

General Framework for Reversible Data Hiding in Texts Based on Masked Language Modeling

With the fast development of natural language processing, recent advance...
research
03/26/2022

Autoregressive Linguistic Steganography Based on BERT and Consistency Coding

Linguistic steganography (LS) conceals the presence of communication by ...
research
03/10/2023

ICStega: Image Captioning-based Semantically Controllable Linguistic Steganography

Nowadays, social media has become the preferred communication platform f...
research
09/03/2019

Neural Linguistic Steganography

Whereas traditional cryptography encrypts a secret message into an unint...
research
11/12/2018

Automatically Generate Steganographic Text Based on Markov Model and Huffman Coding

Steganography, as one of the three basic information security systems, h...
research
06/03/2021

Provably Secure Generative Linguistic Steganography

Generative linguistic steganography mainly utilized language models and ...
research
05/29/2019

SECRET: Semantically Enhanced Classification of Real-world Tasks

Supervised machine learning (ML) algorithms are aimed at maximizing clas...
