To show or not to show: Redacting sensitive text from videos of electronic displays

08/19/2022
by   Abhishek Mukhopadhyay, et al.
0

With the increasing prevalence of video recordings there is a growing need for tools that can maintain the privacy of those recorded. In this paper, we define an approach for redacting personally identifiable text from videos using a combination of optical character recognition (OCR) and natural language processing (NLP) techniques. We examine the relative performance of this approach when used with different OCR models, specifically Tesseract and the OCR system from Google Cloud Vision (GCV). For the proposed approach the performance of GCV, in both accuracy and speed, is significantly higher than Tesseract. Finally, we explore the advantages and disadvantages of both models in real-world applications.

READ FULL TEXT

page 1

page 4

research
07/09/2023

A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing

Optical Character Recognition (OCR) technology finds applications in dig...
research
05/03/2023

Training Natural Language Processing Models on Encrypted Text for Enhanced Privacy

With the increasing use of cloud-based services for training and deployi...
research
06/09/2022

Transformer based Urdu Handwritten Text Optical Character Reader

Extracting Handwritten text is one of the most important components of d...
research
05/16/2023

A Video Is Worth 4096 Tokens: Verbalize Story Videos To Understand Them In Zero Shot

Multimedia content, such as advertisements and story videos, exhibit a r...
research
07/13/2017

Lithium NLP: A System for Rich Information Extraction from Noisy User Generated Text on Social Media

In this paper, we describe the Lithium Natural Language Processing (NLP)...
research
06/07/2022

An Insight into The Intricacies of Lingual Paraphrasing Pragmatic Discourse on The Purpose of Synonyms

The term "paraphrasing" refers to the process of presenting the sense of...
research
01/17/2023

Command Line Interface Risk Modeling

Protecting sensitive data is an essential part of security in cloud comp...

Please sign up or login with your details

Forgot password? Click here to reset