CRYPTEXT: Database and Interactive Toolkit of Human-Written Text Perturbations in the Wild

01/16/2023
by Thai Le, et al.

User-generated text on the Internet is often noisy, erroneous, and grammatically incorrect. In fact, some online users choose to express their opinions through carefully perturbed texts, especially on controversial topics (e.g., politics, vaccine mandates) or in abusive contexts (e.g., cyberbullying, hate speech). However, to the best of our knowledge, no framework explores these online "human-written" perturbations (as opposed to algorithm-generated perturbations). We therefore introduce an interactive system called CRYPTEXT. CRYPTEXT is a data-intensive application that provides users with a database and several tools to extract and interact with human-written perturbations. Specifically, CRYPTEXT helps look up, perturb, and normalize (i.e., de-perturb) texts. CRYPTEXT also provides an interactive interface to monitor and analyze text perturbations online. A short demo video is available at: https://youtu.be/8WT3G8xjIoI
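To make the three operations named in the abstract (look up, perturb, normalize) concrete, here is a minimal sketch. It is not CRYPTEXT's implementation or API: the dictionary `PERTURBATIONS`, its entries, and the function names `look_up`, `perturb`, and `normalize` are all hypothetical, and a tiny hand-written table stands in for the real database.

```python
import re

# Hypothetical toy database: canonical word -> perturbed spellings.
# The entries are illustrative only, not taken from the CRYPTEXT database.
PERTURBATIONS = {
    "vaccine": ["vacc1ne", "vaxxine", "v@ccine"],
    "democrats": ["dem0crats", "demokrats"],
}

# Reverse index: perturbed spelling -> canonical word.
CANONICAL = {
    variant: word
    for word, variants in PERTURBATIONS.items()
    for variant in variants
}

# Whitespace-delimited tokens; tokens with attached punctuation will not match.
TOKEN = re.compile(r"\S+")


def look_up(word):
    """Return the known perturbations of a canonical word (empty list if none)."""
    return list(PERTURBATIONS.get(word.lower(), []))


def perturb(text):
    """Replace each known canonical word with its first listed perturbation."""
    def swap(match):
        variants = PERTURBATIONS.get(match.group(0).lower())
        return variants[0] if variants else match.group(0)
    return TOKEN.sub(swap, text)


def normalize(text):
    """De-perturb: map each known perturbed token back to its canonical word."""
    def swap(match):
        token = match.group(0)
        return CANONICAL.get(token.lower(), token)
    return TOKEN.sub(swap, text)
```

In this sketch `perturb` always picks the first listed variant so that the result is reproducible, and `normalize` is a plain reverse lookup. A real system would need to handle punctuation, casing, and spellings that are not in the table.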

research
03/19/2022

Perturbations in the Wild: Leveraging Human-Written Text Perturbations for Realistic Adversarial Attack and Defense

We propose a novel algorithm, ANTHRO, that inductively extracts over 60...
research
03/18/2023

NoisyHate: Benchmarking Content Moderation Machine Learning Models with Human-Written Perturbations Online

Online texts with toxic content are a threat in social media that might ...
research
08/17/2023

Contrasting Linguistic Patterns in Human and LLM-Generated Text

We conduct a quantitative analysis contrasting human-written English new...
research
05/22/2023

RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

The fixed-size context of Transformer makes GPT models incapable of gene...
research
03/14/2016

Interactive Tools and Tasks for the Hebrew Bible

This contribution to a special issue on "Computer-aided processing of in...
research
04/16/2022

Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion

Persuading people to change their opinions is a common practice in onlin...
research
09/29/2022

Concepts and Experiments on Psychoanalysis Driven Computing

This research investigates the effective incorporation of the human fact...
