The Keyword Explorer Suite: A Toolkit for Understanding Online Populations

01/12/2023
by   Philip Feldman, et al.
0

We have developed a set of Python applications that use large language models to identify and analyze data from social media platforms relevant to a population of interest. Our pipeline begins with using OpenAI's GPT-3 to generate potential keywords for identifying relevant text content from the target population. The keywords are then validated, and the content downloaded and analyzed using GPT-3 embedding and manifold reduction. Corpora are then created to fine-tune GPT-2 models to explore latent information via prompt-based queries. These tools allow researchers and practitioners to gain valuable insights into population subgroups online. Source code at https://github.com/pgfeldman/KeywordExplorer

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2022

Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models

Text analysis of social media for sentiment, topic analysis, and other a...
research
11/13/2019

Finding Social Media Trolls: Dynamic Keyword Selection Methods for Rapidly-Evolving Online Debates

Online harassment is a significant social problem. Prevention of online ...
research
01/04/2023

InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval

Recently, InPars introduced a method to efficiently use large language m...
research
05/11/2019

Mining Hidden Populations through Attributed Search

Researchers often query online social platforms through their applicatio...
research
08/28/2023

Detecting Inactive Cyberwarriors from Online Forums

The proliferation of misinformation has emerged as a new form of warfare...
research
05/09/2023

A Review of Vision-Language Models and their Performance on the Hateful Memes Challenge

Moderation of social media content is currently a highly manual task, ye...
research
06/13/2023

ChatGPT vs. Lightweight Security: First Work Implementing the NIST Cryptographic Standard ASCON

This study, to the best of our knowledge, is the first to explore the in...

Please sign up or login with your details

Forgot password? Click here to reset