Large Language Models in Analyzing Crash Narratives – A Comparative Study of ChatGPT, BARD and GPT-4

08/25/2023
by   Maroa Mumtarin, et al.
0

In traffic safety research, extracting information from crash narratives using text analysis is a common practice. With recent advancements of large language models (LLM), it would be useful to know how the popular LLM interfaces perform in classifying or extracting information from crash narratives. To explore this, our study has used the three most popular publicly available LLM interfaces- ChatGPT, BARD and GPT4. This study investigated their usefulness and boundaries in extracting information and answering queries related to accidents from 100 crash narratives from Iowa and Kansas. During the investigation, their capabilities and limitations were assessed and their responses to the queries were compared. Five questions were asked related to the narratives: 1) Who is at-fault? 2) What is the manner of collision? 3) Has the crash occurred in a work-zone? 4) Did the crash involve pedestrians? and 5) What are the sequence of harmful events in the crash? For questions 1 through 4, the overall similarity among the LLMs were 70 respectively. The similarities were higher while answering direct questions requiring binary responses and significantly lower for complex questions. To compare the responses to question 5, network diagram and centrality measures were analyzed. The network diagram from the three LLMs were not always similar although they sometimes have the same influencing events with high in-degree, out-degree and betweenness centrality. This study suggests using multiple models to extract viable information from narratives. Also, caution must be practiced while using these interfaces to obtain crucial safety related information.

READ FULL TEXT
research
04/26/2023

Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery

Despite growing interest in using large language models (LLMs) in health...
research
05/18/2023

Are Large Language Models Fit For Guided Reading?

This paper looks at the ability of large language models to participate ...
research
01/18/2019

Identifying Unclear Questions in Community Question Answering Websites

Thousands of complex natural language questions are submitted to communi...
research
05/22/2023

Observations on LLMs for Telecom Domain: Capabilities and Limitations

The landscape for building conversational interfaces (chatbots) has witn...
research
09/10/2023

AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions

The development of large high-quality datasets and high-performing model...
research
03/29/2023

Evaluating GPT-3.5 and GPT-4 Models on Brazilian University Admission Exams

The present study aims to explore the capabilities of Language Models (L...
research
05/19/2023

Graphologue: Exploring Large Language Model Responses with Interactive Diagrams

Large language models (LLMs) have recently soared in popularity due to t...

Please sign up or login with your details

Forgot password? Click here to reset