Contextualized Topic Coherence Metrics

05/23/2023
by   Hamed Rahimi, et al.
0

The recent explosion in work on neural topic modeling has been criticized for optimizing automated topic evaluation metrics at the expense of actual meaningful topic identification. But human annotation remains expensive and time-consuming. We propose LLM-based methods inspired by standard human topic evaluations, in a family of metrics called Contextualized Topic Coherence (CTC). We evaluate both a fully automated version as well as a semi-automated CTC that allows human-centered evaluation of coherence while maintaining the efficiency of automated methods. We evaluate CTC relative to five other metrics on six topic models and find that it outperforms automated topic coherence methods, works well on short documents, and is not susceptible to meaningless but high-scoring topics.

READ FULL TEXT
research
07/05/2021

Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence

Topic model evaluation, like evaluation of other unsupervised methods, c...
research
06/30/2021

Evaluation of Thematic Coherence in Microblogs

Collecting together microblogs representing opinions about the same topi...
research
05/18/2019

Automatic Evaluation of Local Topic Quality

Topic models are typically evaluated with respect to the global topic di...
research
11/20/2019

A Coefficient of Determination for Probabilistic Topic Models

This research proposes a new (old) metric for evaluating goodness of fit...
research
05/25/2023

Diversity-Aware Coherence Loss for Improving Neural Topic Models

The standard approach for neural topic modeling uses a variational autoe...
research
10/28/2022

Are Neural Topic Models Broken?

Recently, the relationship between automated and human evaluation of top...
research
10/16/2017

Which is better? A Modularized Evaluation for Topic Popularity Prediction

Topic popularity prediction in social networks has drawn much attention ...

Please sign up or login with your details

Forgot password? Click here to reset