Random forests, sound symbolism and Pokemon evolution

This study constructs machine learning algorithms that are trained to classify samples using sound symbolism, and then it reports on an experiment designed to measure their understanding against human participants. Random forests are trained using the names of Pokemon, which are fictional video game characters, and their evolutionary status. Pokemon undergo evolution when certain in-game conditions are met. Evolution changes the appearance, abilities, and names of Pokemon. In the first experiment, we train three random forests using the sounds that make up the names of Japanese, Chinese, and Korean Pokemon to classify Pokemon into pre-evolution and post-evolution categories. We then train a fourth random forest using the results of an elicitation experiment whereby Japanese participants named previously unseen Pokemon. In Experiment 2, we reproduce those random forests with name length as a feature and compare the performance of the random forests against humans in a classification experiment whereby Japanese participants classified the names elicited in Experiment 1 into pre-and post-evolution categories. Experiment 2 reveals an issue pertaining to overfitting in Experiment 1 which we resolve using a novel cross-validation method. The results show that the random forests are efficient learners of systematic sound-meaning correspondence patterns and can classify samples with greater accuracy than the human participants.

READ FULL TEXT

page 1

page 6

page 9

page 10

page 11

page 13

page 17

page 18

research
04/06/2016

Comments on: "A Random Forest Guided Tour" by G. Biau and E. Scornet

This paper is a comment on the survey paper by Biau and Scornet (2016) a...
research
01/15/2023

What artificial intelligence might teach us about the origin of human language

This study explores an interesting pattern emerging from research that c...
research
03/15/2017

Cost-complexity pruning of random forests

Random forests perform bootstrap-aggregation by sampling the training sa...
research
05/17/2021

Cross-Cluster Weighted Forests

Adapting machine learning algorithms to better handle the presence of na...
research
04/11/2022

Random Similarity Forests

The wealth of data being gathered about humans and their surroundings dr...
research
09/13/2020

That looks interesting! Personalizing Communication and Segmentation with Random Forest Node Embeddings

Communicating effectively with customers is a challenge for many markete...
research
10/28/2021

Exoplanet atmosphere evolution: emulation with random forests

Atmospheric mass-loss is known to play a leading role in sculpting the d...

Please sign up or login with your details

Forgot password? Click here to reset