Leveraging GPT-4 for Food Effect Summarization to Enhance Product-Specific Guidance Development via Iterative Prompting

06/28/2023
by   Yiwen Shi, et al.
0

Food effect summarization from New Drug Application (NDA) is an essential component of product-specific guidance (PSG) development and assessment. However, manual summarization of food effect from extensive drug application review documents is time-consuming, which arouses a need to develop automated methods. Recent advances in large language models (LLMs) such as ChatGPT and GPT-4, have demonstrated great potential in improving the effectiveness of automated text summarization, but its ability regarding the accuracy in summarizing food effect for PSG assessment remains unclear. In this study, we introduce a simple yet effective approach, iterative prompting, which allows one to interact with ChatGPT or GPT-4 more effectively and efficiently through multi-turn interaction. Specifically, we propose a three-turn iterative prompting approach to food effect summarization in which the keyword-focused and length-controlled prompts are respectively provided in consecutive turns to refine the quality of the generated summary. We conduct a series of extensive evaluations, ranging from automated metrics to FDA professionals and even evaluation by GPT-4, on 100 NDA review documents selected over the past five years. We observe that the summary quality is progressively improved throughout the process. Moreover, we find that GPT-4 performs better than ChatGPT, as evaluated by FDA professionals (43 Importantly, all the FDA professionals unanimously rated that 85 summaries generated by GPT-4 are factually consistent with the golden reference summary, a finding further supported by GPT-4 rating of 72 results strongly suggest a great potential for GPT-4 to draft food effect summaries that could be reviewed by FDA professionals, thereby improving the efficiency of PSG assessment cycle and promoting the generic drug product development.

READ FULL TEXT
research
05/24/2023

SummIt: Iterative Text Summarization via ChatGPT

Existing text summarization systems have made significant progress in re...
research
05/13/2020

End-to-end Semantics-based Summary Quality Assessment for Single-document Summarization

ROUGE is the de facto criterion for summarization research. However, its...
research
05/24/2023

Improving Factuality of Abstractive Summarization without Sacrificing Summary Quality

Improving factual consistency of abstractive summarization has been a wi...
research
06/05/2023

Interactive Editing for Text Summarization

Summarizing lengthy documents is a common and essential task in our dail...
research
02/08/2023

Leveraging Summary Guidance on Medical Report Summarization

This study presents three deidentified large medical text datasets, name...
research
11/26/2015

TGSum: Build Tweet Guided Multi-Document Summarization Dataset

The development of summarization research has been significantly hampere...

Please sign up or login with your details

Forgot password? Click here to reset