Investigating the detection of Tortured Phrases in Scientific Literature

10/24/2022
by   Puthineath Lay, et al.
0

With the help of online tools, unscrupulous authors can today generate a pseudo-scientific article and attempt to publish it. Some of these tools work by replacing or paraphrasing existing texts to produce new content, but they have a tendency to generate nonsensical expressions. A recent study introduced the concept of 'tortured phrase', an unexpected odd phrase that appears instead of the fixed expression. E.g. counterfeit consciousness instead of artificial intelligence. The present study aims at investigating how tortured phrases, that are not yet listed, can be detected automatically. We conducted several experiments, including non-neural binary classification, neural binary classification and cosine similarity comparison of the phrase tokens, yielding noticeable results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2021

Tortured phrases: A dubious writing style emerging in science. Evidence of critical issues affecting established journals

Probabilistic text generators have been used to produce fake scientific ...
research
12/06/2021

Learning to Reason from General Concepts to Fine-grained Tokens for Discriminative Phrase Detection

Phrase detection requires methods to identify if a phrase is relevant to...
research
09/12/2017

Human Associations Help to Detect Conventionalized Multiword Expressions

In this paper we show that if we want to obtain human evidence about con...
research
08/01/2022

Patents Phrase to Phrase Semantic Matching Dataset

There are many general purpose benchmark datasets for Semantic Textual S...
research
10/01/2020

RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation

The task of video object segmentation with referring expressions (langua...
research
08/30/2022

Combining keyphrase extraction and lexical diversity to characterize ideas in publication titles

Beyond bibliometrics, there is interest in characterizing the evolution ...
research
03/02/2022

Theoretical Foundation of Colored Petri Net through an Analysis of their Markings as Multi-classification

Barwise and Seligman stated the first principle of information flow: "In...

Please sign up or login with your details

Forgot password? Click here to reset