ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about

04/06/2023
by Aman Rangapur, et al.

Large language models have gained considerable interest for their impressive performance on various tasks. Among these models, ChatGPT, developed by OpenAI, has become extremely popular among early adopters, some of whom regard it as a disruptive technology in fields such as customer service, education, healthcare, and finance. Understanding the opinions of these early users is essential, as they can provide valuable insight into the technology's potential strengths, weaknesses, and likelihood of success or failure in different areas. This research examines the responses generated by ChatGPT on different conversational QA corpora. The study used BERT similarity scores to compare these responses with the correct answers and to obtain Natural Language Inference (NLI) labels. Evaluation scores were also computed and compared to determine the overall performance of GPT-3 and GPT-4. Additionally, the study identified instances where ChatGPT answered questions incorrectly, providing insight into areas where the model may be prone to error.
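To make the comparison step concrete, here is a minimal sketch of scoring a generated response against a reference answer by cosine similarity. It is not the authors' pipeline: the abstract specifies BERT similarity scores, whereas the hypothetical `embed` function below is a bag-of-words stand-in chosen so the example runs without downloading a model. The names `embed`, `cosine_similarity`, and `score_response` are assumptions made for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    """Stand-in embedding (assumption): lowercase bag-of-words counts, NOT BERT."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    # Only tokens present in both vectors contribute to the dot product.
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_a * norm_b)


def score_response(response, reference):
    """Score a model response against the reference answer (0.0 to 1.0 here)."""
    return cosine_similarity(embed(response), embed(reference))


if __name__ == "__main__":
    reference = "Paris is the capital of France"
    for response in ("The capital of France is Paris", "Berlin"):
        print(response, "->", round(score_response(response, reference), 3))
```

Swapping `embed` for a function that returns dense sentence vectors from a BERT-style encoder would bring the sketch closer to the setup the abstract describes; the scoring logic would then need a dense dot product in place of the sparse one.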


research
09/14/2023

ExpertQA: Expert-Curated Questions and Attributed Answers

As language models are adapted by a more sophisticated and diverse set o...
research
12/12/2022

"I think this is the most disruptive technology": Exploring Sentiments of ChatGPT Early Adopters using Twitter Data

Large language models have recently attracted significant attention due ...
research
09/20/2023

Chain-of-Verification Reduces Hallucination in Large Language Models

Generation of plausible yet incorrect factual information, termed halluc...
research
06/19/2020

A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19

COVID-19 has resulted in an ongoing pandemic and as of 12 June 2020, has...
research
04/02/2023

LLMMaps – A Visual Metaphor for Stratified Evaluation of Large Language Models

Large Language Models (LLMs) have revolutionized natural language proces...
research
06/28/2023

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Large language models (LLMs) may not equitably represent diverse global ...
research
04/11/2023

chatIPCC: Grounding Conversational AI in Climate Science

Large Language Models (LLMs) have made significant progress in recent ye...
