A Machine Learning Approach to Comment Toxicity Classification

02/27/2019
by   Navoneel Chakrabarty, et al.
0

Now-a-days, derogatory comments are often made by one another, not only in offline environment but also immensely in online environments like social networking websites and online communities. So, an Identification combined with Prevention System in all social networking websites and applications, including all the communities, existing in the digital world is a necessity. In such a system, the Identification Block should identify any negative online behaviour and should signal the Prevention Block to take action accordingly. This study aims to analyse any piece of text and detecting different types of toxicity like obscenity, threats, insults and identity-based hatred. The labelled Wikipedia Comment Dataset prepared by Jigsaw is used for the purpose. A 6-headed Machine Learning tf-idf Model has been made and trained separately, yielding a Mean Validation Accuracy of 98.08 of 91.61 online conversation

READ FULL TEXT
04/30/2021

Learning for Detecting Norm Violation in Online Communities

In this paper, we focus on normative systems for online communities. The...
09/20/2020

Phishing Detection Using Machine Learning Techniques

The Internet has become an indispensable part of our life, However, It a...
05/18/2020

Cognitive Analysis of Security Threats on Social Networking Services: Slovakia in need of stronger action

This short paper examines some of the ongoing research at the UMB Data a...
09/22/2016

Social Network Processes in the Isabelle and Coq Theorem Proving Communities

We identify the main actors in the Isabelle and Coq communities and desc...
06/21/2022

muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem

Text Classification is an integral part of many Natural Language Process...
08/16/2016

Learning Latent Local Conversation Modes for Predicting Community Endorsement in Online Discussions

Many social media platforms offer a mechanism for readers to react to co...
02/20/2006

Methods for scaling a large member base

The technical challenges of scaling websites with large and growing memb...