Leveraging Large Language Models and Weak Supervision for Social Media data annotation: an evaluation using COVID-19 self-reported vaccination tweets

09/12/2023
by Ramya Tekumalla, et al.

The COVID-19 pandemic has presented significant challenges to the healthcare industry and society as a whole. With the rapid development of COVID-19 vaccines, social media platforms have become a popular medium for discussing vaccine-related topics. Identifying and analyzing vaccine-related tweets can provide valuable insights for public health researchers and policymakers. However, manually annotating a large number of tweets is time-consuming and expensive. In this study, we evaluate the use of Large Language Models, in this case GPT-4 (March 23 version), and weak supervision to identify COVID-19 vaccine-related tweets, with the purpose of comparing their performance against that of human annotators. We leveraged a manually curated gold-standard dataset and used GPT-4 to provide labels without any additional fine-tuning or instructing, in a single-shot mode (no additional prompting).
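As an illustration of what comparing automatically produced labels against a gold-standard set can involve, the following is a minimal sketch and not the authors' evaluation code. It assumes binary labels (1 for a vaccine-related tweet, 0 otherwise); the function name `binary_metrics` and the toy label lists are hypothetical, and the choice of precision, recall, F1, and Cohen's kappa is an assumption, since the abstract does not state which metrics were used.

```python
from typing import Dict, Sequence


def binary_metrics(gold: Sequence[int], predicted: Sequence[int]) -> Dict[str, float]:
    """Compare predicted binary labels against gold-standard labels.

    Hypothetical helper: 1 marks a vaccine-related tweet, 0 marks any other tweet.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("at least one labeled example is required")

    # Confusion-matrix counts.
    tp = sum(1 for g, p in zip(gold, predicted) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, predicted) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, predicted) if g == 1 and p == 0)
    tn = sum(1 for g, p in zip(gold, predicted) if g == 0 and p == 0)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # Cohen's kappa: observed agreement corrected for chance agreement.
    n = len(gold)
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (observed - expected) / (1 - expected) if expected != 1 else 1.0

    return {"precision": precision, "recall": recall, "f1": f1, "kappa": kappa}


# Toy data for illustration only; these are not results from the study.
gold_labels = [1, 1, 0, 0, 1, 0]
model_labels = [1, 0, 0, 0, 1, 1]
scores = binary_metrics(gold_labels, model_labels)
```

The same function can be applied to labels from a human annotator and to labels from a model, so that both are scored against the gold standard in the same way.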

