Constructing a Testbed for Psychometric Natural Language Processing

07/25/2020
by Ahmed Abbasi, et al.

Psychometric measures of ability, attitudes, perceptions, and beliefs are crucial for understanding user behaviors in various contexts, including health, security, e-commerce, and finance. Traditionally, psychometric dimensions have been measured and collected using survey-based methods. Inferring such constructs from user-generated text could afford opportunities for timely, unobtrusive collection and analysis. In this paper, we describe our efforts to construct a corpus for psychometric natural language processing (NLP). We discuss our multi-step process for aligning users' text with their survey-based response items and provide an overview of the resulting testbed, which encompasses survey-based psychometric measures and accompanying user-generated text from over 8,500 respondents. We report preliminary results on the use of the text to categorize/predict users' survey response labels. We also discuss the important implications of our work and the resulting testbed for future psychometric NLP research.

research
03/26/2023

Nature Language Reasoning, A Survey

This survey paper proposes a clearer view of natural language reasoning ...
research
04/30/2021

Summarization, Simplification, and Generation: The Case of Patents

We survey Natural Language Processing (NLP) approaches to summarizing, s...
research
08/31/2022

Efficient Methods for Natural Language Processing: A Survey

Getting the most out of limited resources allows advances in natural lan...
research
06/10/2021

Graph Neural Networks for Natural Language Processing: A Survey

Deep learning has become the dominant approach in coping with various ta...
research
04/19/2020

Evolution of Semantic Similarity – A Survey

Estimating the semantic similarity between text data is one of the chall...
research
11/02/2020

Automatic Detection of Machine Generated Text: A Critical Survey

Text generative models (TGMs) excel in producing text that matches the s...
research
06/05/2023

Easy-to-Read in Germany: A Survey on its Current State and Available Resources

Easy-to-Read Language (E2R) is a controlled language variant that makes ...
