"So You Think You're Funny?": Rating the Humour Quotient in Standup Comedy

10/25/2021
by   Anirudh Mittal, et al.
0

Computational Humour (CH) has attracted the interest of Natural Language Processing and Computational Linguistics communities. Creating datasets for automatic measurement of humour quotient is difficult due to multiple possible interpretations of the content. In this work, we create a multi-modal humour-annotated dataset (∼40 hours) using stand-up comedy clips. We devise a novel scoring mechanism to annotate the training data with a humour quotient score using the audience's laughter. The normalized duration (laughter duration divided by the clip duration) of laughter in each clip is used to compute this humour coefficient score on a five-point scale (0-4). This method of scoring is validated by comparing with manually annotated scores, wherein a quadratic weighted kappa of 0.6 is obtained. We use this dataset to train a model that provides a "funniness" score, on a five-point scale, given the audio and its corresponding text. We compare various neural language models for the task of humour-rating and achieve an accuracy of 0.813 in terms of Quadratic Weighted Kappa (QWK). Our "Open Mic" dataset is released for further research along with the code.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2023

Multi-modal Machine Learning for Vehicle Rating Predictions Using Image, Text, and Parametric Data

Accurate vehicle rating prediction can facilitate designing and configur...
research
05/29/2023

Short Answer Grading Using One-shot Prompting and Text Similarity Scoring Model

In this study, we developed an automated short answer grading (ASAG) mod...
research
10/26/2016

Automatic measurement of vowel duration via structured prediction

A key barrier to making phonetic studies scalable and replicable is the ...
research
03/01/2022

Improving Performance of Automated Essay Scoring by using back-translation essays and adjusted scores

Automated essay scoring plays an important role in judging students' lan...
research
02/13/2023

Large Scale Multi-Lingual Multi-Modal Summarization Dataset

Significant developments in techniques such as encoder-decoder models ha...
research
02/28/2023

Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners

Grapheme-to-phoneme (G2P) transduction is part of the standard text-to-s...
research
04/03/2018

Real-Time Prediction of the Duration of Distribution System Outages

This paper addresses the problem of predicting duration of unplanned pow...

Please sign up or login with your details

Forgot password? Click here to reset