Extending the Abstraction of Personality Types based on MBTI with Machine Learning and Natural Language Processing

by Carlos Basto, et al.

This work takes a data-centric approach, using Natural Language Processing (NLP), to predicting personality types based on the MBTI (an introspective self-assessment questionnaire that indicates different psychological preferences about how people perceive the world and make decisions). It systematically enriches the text representation with domain-informed features generated from three types of analysis: sentiment, grammatical, and aspect. The experiments used a robust baseline of stacked models, with early hyperparameter optimization through grid search and gradual feedback, for each of the four MBTI classifiers (one per dichotomy). The results showed that attention to the data iteration loop, focused on quality, explanatory power, and representativeness in abstracting the features most relevant to the studied phenomenon, improved the evaluation metrics more quickly and at lower cost than complex models such as LSTM or state-of-the-art ones such as BERT; comparisons made from various perspectives support the importance of these results. In addition, the study points to broad scope for evolving and deepening the task, and to possible approaches for further extending the abstraction of personality types.
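
To illustrate what "four classifiers (dichotomies)" means in practice, the sketch below splits a four-letter MBTI code into four binary targets, one per dichotomy, so that each can be learned by a separate classifier. It is a minimal illustration and not the authors' code: the function name `mbti_to_binary_targets`, the key names, and the choice of which letter maps to 1 are assumptions made here.

```python
# The four MBTI dichotomies, in the order their letters appear in a type code.
# Mapping the first letter of each pair to 1 is an arbitrary, assumed convention.
DICHOTOMIES = (("E", "I"), ("S", "N"), ("T", "F"), ("J", "P"))


def mbti_to_binary_targets(mbti_type):
    """Split a four-letter MBTI code into four binary labels (hypothetical helper)."""
    code = mbti_type.strip().upper()
    if len(code) != len(DICHOTOMIES):
        raise ValueError(f"expected a four-letter MBTI code, got {mbti_type!r}")

    targets = {}
    for letter, (first, second) in zip(code, DICHOTOMIES):
        if letter not in (first, second):
            raise ValueError(f"{letter!r} is not valid for dichotomy {first}/{second}")
        # 1 if the letter is the first pole of the pair, 0 if it is the second.
        targets[f"{first}/{second}"] = 1 if letter == first else 0
    return targets


if __name__ == "__main__":
    print(mbti_to_binary_targets("INTJ"))
```

Each of the four resulting label columns can then be paired with the same text features and fitted independently, which is how a single 16-class problem becomes four binary ones.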



Simple Natural Language Processing Tools for Danish

This technical note describes a set of baseline tools for automatic proc...

A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models

Word representation has always been an important research area in the hi...

Spoiler Alert: Using Natural Language Processing to Detect Spoilers in Book Reviews

This paper presents an NLP (Natural Language Processing) approach to det...

A Systematic Study and Analysis of Bengali Folklore with Natural Language Processing Systems

Folklore, a solid branch of folk literature, is the hallmark of any nati...

Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Attention is an increasingly popular mechanism used in a wide range of n...

Optimizing Neural Network Hyperparameters with Gaussian Processes for Dialog Act Classification

Systems based on artificial neural networks (ANNs) have achieved state-o...

Robustness Tests of NLP Machine Learning Models: Search and Semantically Replace

This paper proposes a strategy to assess the robustness of different mac...
