Segmenting Numerical Substitution Ciphers

05/25/2022
by Nada Aldarrab, et al.

Deciphering historical substitution ciphers is a challenging problem. Previously studied problems include detecting the cipher type, detecting the plaintext language, and acquiring the substitution key for segmented ciphers. However, attacking unsegmented, space-free ciphers remains difficult. Segmentation (i.e., finding substitution units) is the first step towards cracking those ciphers. In this work, we propose the first automatic methods to segment those ciphers using Byte Pair Encoding (BPE) and unigram language models. Our methods achieve an average segmentation error of 2% on 100 randomly generated monoalphabetic ciphers and 27% on 3 real homophonic ciphers. We also propose a method for solving non-deterministic ciphers with existing keys using a lattice and a pretrained language model. Our method leads to the full solution of the IA cipher, a real historical cipher that had not been fully solved until this work.
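The abstract does not describe how BPE is applied to a digit stream, so the following is only a rough illustration of the general idea and not the authors' implementation. It runs a generic greedy BPE over an unsegmented string of digits, repeatedly merging the most frequent adjacent pair so that frequent digit groups emerge as candidate substitution units. The function name `learn_bpe`, the stopping rule (stop when no pair occurs at least twice), and the toy ciphertext are all assumptions made for this sketch.

```python
from collections import Counter


def learn_bpe(symbols, num_merges):
    """Generic greedy BPE (illustrative sketch, not the paper's method).

    symbols: an unsegmented string, e.g. a space-free stream of digits.
    num_merges: maximum number of merge operations to perform.
    Returns the segmented sequence and the list of merges applied.
    """
    seq = list(symbols)  # start from single characters
    merges = []
    for _ in range(num_merges):
        # Count every adjacent pair in the current segmentation.
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:
            break  # assumed stopping rule: no pair repeats
        merges.append((a, b))
        # Rewrite the sequence, replacing each occurrence of (a, b) by a+b.
        merged, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and seq[i] == a and seq[i + 1] == b:
                merged.append(a + b)
                i += 2
            else:
                merged.append(seq[i])
                i += 1
        seq = merged
    return seq, merges


# Toy example (hypothetical ciphertext): one two-digit unit repeated.
units, applied = learn_bpe("121212", num_merges=1)
print(units)    # ['12', '12', '12']
print(applied)  # [('1', '2')]
```

On a real cipher the number of merges would have to be chosen or tuned, and the resulting units are only candidates; the error rates quoted above refer to the authors' methods, not to this sketch.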

