Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition

08/04/2023
by Jiacheng Deng, et al.

Optical Character Recognition (OCR) enables automatic text extraction from scanned or digitized text images, but it also makes it easy to pirate valuable or sensitive text from these images. Previous methods that prevent OCR piracy by distorting the characters in text images are impractical in real-world scenarios, because pirates can capture arbitrary portions of a text image, which renders the defenses ineffective. In this work, we propose a novel and effective defense mechanism, the Universal Defensive Underpainting Patch (UDUP), which modifies the underpainting of text images instead of the characters. UDUP is created through an iterative optimization process that crafts a small, fixed-size defensive patch, which can then generate non-overlapping underpainting for text images of any size. Experimental results show that UDUP effectively defends against unauthorized OCR for any screenshot range and for complex image backgrounds. It is agnostic to the content, size, color, and language of the characters, and is robust to typical image operations such as scaling and compression. In addition, the transferability of UDUP is demonstrated by evading several off-the-shelf OCRs. The code is available at https://github.com/QRICKDD/UDUP.
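To illustrate how a small, fixed-size patch could cover a text image of any size, here is a minimal sketch in plain Python. It is not the authors' implementation: it reads "non-overlapping underpainting" as edge-to-edge tiling, and the names `tile_patch`, `apply_underpainting`, and `ink_threshold`, together with the compositing rule (character pixels are kept, background pixels are replaced), are assumptions made for illustration. The iterative optimization that produces the actual patch is not shown, so the patch values below are stand-ins.

```python
from typing import List

Image = List[List[int]]  # grayscale rows, pixel values in 0..255


def tile_patch(patch: Image, height: int, width: int) -> Image:
    """Repeat `patch` edge to edge (no overlap) and crop to height x width."""
    ph, pw = len(patch), len(patch[0])
    return [[patch[y % ph][x % pw] for x in range(width)] for y in range(height)]


def apply_underpainting(text_img: Image, patch: Image, ink_threshold: int = 128) -> Image:
    """Leave character pixels untouched and fill the background with the tiled patch.

    Assumption: pixels darker than `ink_threshold` belong to characters.
    """
    h, w = len(text_img), len(text_img[0])
    under = tile_patch(patch, h, w)
    return [
        [text_img[y][x] if text_img[y][x] < ink_threshold else under[y][x]
         for x in range(w)]
        for y in range(h)
    ]


if __name__ == "__main__":
    # Stand-in values only; a real patch would come from the optimization step.
    patch = [[250, 240], [230, 220]]
    text = [
        [255, 0, 255, 255, 255],
        [255, 0, 255, 0, 255],
        [255, 255, 255, 0, 255],
    ]
    for row in apply_underpainting(text, patch):
        print(row)
```

Because the same patch repeats across the whole background, any cropped region of the image still contains copies of it, which fits the abstract's claim that the defense holds for any screenshot range.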

research
09/03/2023

Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning

Scene text recognition has been studied for decades due to its broad app...
research
04/30/2022

SVTR: Scene Text Recognition with a Single Visual Model

Dominant scene text recognition models commonly contain two building blo...
research
10/17/2019

Convolutional Character Networks

Recent progress has been made on developing a unified framework for join...
research
11/10/2020

On-Device Language Identification of Text in Images using Diacritic Characters

Diacritic characters can be considered as a unique set of characters pro...
research
05/17/2013

Font Acknowledgment and Character Extraction of Digital and Scanned Images

The font recognition and character extraction is of immense importance a...
research
07/12/2022

Collaborative Neural Rendering using Anime Character Sheets

Drawing images of characters at desired poses is an essential but labori...
research
09/21/2020

PP-OCR: A Practical Ultra Lightweight OCR System

The Optical Character Recognition (OCR) systems have been widely used in...
