LLM-Assisted Content Analysis: Using Large Language Models to Support Deductive Coding

06/23/2023
by Robert Chew, et al.

Deductive coding is a widely used qualitative research method for determining the prevalence of themes across documents. While useful, deductive coding is often burdensome and time-consuming because it requires researchers to read, interpret, and reliably categorize a large body of unstructured text documents. Large language models (LLMs), like ChatGPT, are a class of quickly evolving AI tools that can perform a range of natural language processing and reasoning tasks. In this study, we explore the use of LLMs to reduce the time deductive coding takes while retaining the flexibility of a traditional content analysis. We outline the proposed approach, called LLM-assisted content analysis (LACA), and present an in-depth case study using GPT-3.5 for LACA on a publicly available deductive coding data set. Additionally, we conduct an empirical benchmark using LACA on four publicly available data sets to assess the broader question of how well GPT-3.5 performs across a range of deductive coding tasks. Overall, we find that GPT-3.5 can often perform deductive coding at levels of agreement comparable to human coders. We also demonstrate that LACA can help refine prompts for deductive coding, identify codes for which an LLM is randomly guessing, and help assess when to use LLMs vs. human coders for deductive coding. We conclude with several implications for the future practice of deductive coding and related research methods.
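To make the idea of "levels of agreement comparable to human coders" concrete, the following is a minimal sketch of a chance-corrected agreement check between a human coder and an LLM for a single code. The abstract does not say which agreement statistic the study uses, so Cohen's kappa is an assumption chosen for illustration, and the function name `cohen_kappa` and the label lists are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels.

    Assumption: kappa is used here only as an illustrative agreement
    statistic; the study itself may rely on a different measure.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and equal in length")

    n = len(labels_a)

    # Observed agreement: share of documents where both coders assign the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement under independence, from each coder's marginal label counts.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    categories = set(counts_a) | set(counts_b)
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both coders used one identical label throughout; kappa is undefined,
        # so this sketch reports perfect agreement by convention.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for one code (1 = code applies, 0 = it does not).
human_codes = [1, 0, 1, 1, 0, 0, 1, 0]
llm_codes = [1, 0, 1, 0, 0, 0, 1, 1]

kappa = cohen_kappa(human_codes, llm_codes)
print(f"kappa = {kappa:.2f}")
```

Under this sketch, a value near 0 would suggest the LLM agrees with the human coder no more often than chance, which is one way to flag a code where the model may be guessing; a value close to the kappa observed between two human coders would suggest comparable reliability.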


research
04/17/2023

Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

Qualitative analysis of textual contents unpacks rich and valuable infor...
research
05/17/2023

Scratch Copilot Evaluation: Assessing AI-Assisted Creative Coding for Families

How can AI enhance creative coding experiences for families? This study ...
research
11/15/2019

Assigning Medical Codes at the Encounter Level by Paying Attention to Documents

The vast majority of research in computer assisted medical coding focuse...
research
08/12/2022

What is it like to program with artificial intelligence?

Large language models, such as OpenAI's Codex and DeepMind's AlphaCode, ...
research
02/09/2023

Flexible, Model-Agnostic Method for Materials Data Extraction from Text Using General Purpose Language Models

Accurate and comprehensive material databases extracted from research pa...
research
09/11/2023

Textbooks Are All You Need II: phi-1.5 technical report

We continue the investigation into the power of smaller Transformer-base...
research
04/14/2023

CollabCoder: A GPT-Powered Workflow for Collaborative Qualitative Analysis

The Collaborative Qualitative Analysis (CQA) process can be time-consumi...
