Modeling Information Change in Science Communication with Semantically Matched Paraphrases

10/24/2022
by   Dustin Wright, et al.
0

Whether the media faithfully communicate scientific information has long been a core issue to the science community. Automatically identifying paraphrased scientific findings could enable large-scale tracking and analysis of information changes in the science communication process, but this requires systems to understand the similarity between scientific information across multiple domains. To this end, we present the SCIENTIFIC PARAPHRASE AND INFORMATION CHANGE DATASET (SPICED), the first paraphrase dataset of scientific findings annotated for degree of information change. SPICED contains 6,000 scientific finding pairs extracted from news stories, social media discussions, and full texts of original papers. We demonstrate that SPICED poses a challenging task and that models trained on SPICED improve downstream performance on evidence retrieval for fact checking of real-world scientific claims. Finally, we show that models trained on SPICED can reveal large-scale trends in the degrees to which people and organizations faithfully communicate new scientific findings. Data, code, and pre-trained models are available at http://www.copenlu.com/publication/2022_emnlp_wright/.

READ FULL TEXT

page 6

page 14

page 17

research
09/30/2021

Measuring Sentence-Level and Aspect-Level (Un)certainty in Science Communications

Certainty and uncertainty are fundamental to science communication. Hedg...
research
08/30/2021

Semi-Supervised Exaggeration Detection of Health Science Press Releases

Public trust in science depends on honest and factual communication of s...
research
10/25/2021

SciClops: Detecting and Contextualizing Scientific Claims for Assisting Manual Fact-Checking

This paper describes SciClops, a method to help combat online scientific...
research
05/30/2021

Determining the Credibility of Science Communication

Most work on scholarly document processing assumes that the information ...
research
05/04/2022

A Computational Inflection for Scientific Discovery

We stand at the foot of a significant inflection in the trajectory of sc...
research
05/17/2018

Neural language representations predict outcomes of scientific research

Many research fields codify their findings in standard formats, often by...
research
04/11/2023

The Many Publics of Science: Using Altmetrics to Identify Common Communication Channels by Scientific field

Altmetrics have led to new quantitative studies of science through socia...

Please sign up or login with your details

Forgot password? Click here to reset