Initial Study into Application of Feature Density and Linguistically-backed Embedding to Improve Machine Learning-based Cyberbullying Detection

06/04/2022
by Juuso Eronen, et al.

In this research, we study how the performance of machine learning (ML) classifiers changes when various linguistic preprocessing methods are applied to a dataset, with a specific focus on linguistically-backed embeddings in Convolutional Neural Networks (CNN). We also study the concept of Feature Density and confirm its potential to comparatively predict the performance of ML classifiers, including CNN. The research was conducted on a Formspring dataset provided in a Kaggle competition on automatic cyberbullying detection. The dataset was re-annotated by objective experts (psychologists), as the importance of professional annotation in cyberbullying research has been indicated multiple times. The study confirmed the effectiveness of Neural Networks in cyberbullying detection and the correlation between classifier performance and Feature Density, while also proposing a new approach of training various linguistically-backed embeddings for Convolutional Neural Networks.
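
The abstract does not define Feature Density. As a rough illustration only, the sketch below assumes the definition commonly used in related work: the number of unique features divided by the total number of feature occurrences in the corpus. The function name `feature_density`, the whitespace tokenizer, and the sample sentences are hypothetical and are not taken from the paper.

```python
def feature_density(documents):
    """Assumed definition: unique features / all feature occurrences.

    Features here are lowercased whitespace tokens, which is a simplification;
    the paper compares several linguistic preprocessing methods instead.
    """
    tokens = [tok for doc in documents for tok in doc.lower().split()]
    if not tokens:
        return 0.0  # an empty corpus has no features to measure
    return len(set(tokens)) / len(tokens)


# Hypothetical two-sentence corpus, not drawn from the Formspring dataset.
sample = ["you are so dumb", "you are so so kind"]
print(feature_density(sample))
```

Under this assumed definition, a preprocessing method that merges surface forms (lemmatization, for example) lowers the number of unique features and therefore lowers the density, which is the kind of variation the study relates to classifier performance.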

research
11/02/2021

Improving Classifier Training Efficiency for Automatic Cyberbullying Detection with Feature Density

We study the effectiveness of Feature Density (FD) using different lingu...
research
05/20/2022

A Dynamic Weighted Tabular Method for Convolutional Neural Networks

Traditional Machine Learning (ML) models like Support Vector Machine, Ra...
research
06/04/2022

Exploring the Potential of Feature Density in Estimating Machine Learning Classifier Performance with Application to Cyberbullying Detection

In this research, we analyze the potential of Feature Density (FD) as a ...
research
01/17/2022

Using Machine Learning Based Models for Personality Recognition

Personality can be defined as the combination of behavior, emotion, moti...
research
02/11/2021

A proof of concept study for machine learning application to stenosis detection

This proof of concept (PoC) assesses the ability of machine learning (ML...
research
07/22/2019

Polyp Detection and Segmentation using Mask R-CNN: Does a Deeper Feature Extractor CNN Always Perform Better?

Automatic polyp detection and segmentation are highly desirable for colo...
research
03/19/2021

Empirical Analysis of Machine Learning Configurations for Prediction of Multiple Organ Failure in Trauma Patients

Multiple organ failure (MOF) is a life-threatening condition. Due to its...
