Can Sequence-to-Sequence Models Crack Substitution Ciphers?

12/30/2020
by   Nada Aldarrab, et al.
13

Decipherment of historical ciphers is a challenging problem. The language of the target plaintext might be unknown, and ciphertext can have a lot of noise. State-of-the-art decipherment methods use beam search and a neural language model to score candidate plaintext hypotheses for a given cipher, assuming plaintext language is known. We propose an end-to-end multilingual model for solving simple substitution ciphers. We test our model on synthetic and real historical ciphers and show that our proposed method can decipher text without explicit language identification and can still be robust to noise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2019

Candidate Fusion: Integrating Language Modelling into a Sequence-to-Sequence Handwritten Word Recognition Architecture

Sequence-to-sequence models have recently become very popular for tackli...
research
05/25/2022

Segmenting Numerical Substitution Ciphers

Deciphering historical substitution ciphers is a challenging problem. Ex...
research
10/18/2019

End-to-End Speech Recognition: A review for the French Language

Recently, end-to-end ASR based either on sequence-to-sequence networks o...
research
04/04/2019

Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions

We propose a fully convolutional sequence-to-sequence encoder architectu...
research
05/02/2020

A language score based output selection method for multilingual speech recognition

The quality of a multilingual speech recognition system can be improved ...
research
12/06/2018

Neural Word Search in Historical Manuscript Collections

We address the problem of segmenting and retrieving word images in colle...
research
06/16/2019

Neural Decipherment via Minimum-Cost Flow: from Ugaritic to Linear B

In this paper we propose a novel neural approach for automatic decipherm...

Please sign up or login with your details

Forgot password? Click here to reset