A Machine Learning Approach to Comment Toxicity Classification

by Navoneel Chakrabarty, et al.

Nowadays, derogatory comments are common not only offline but also, to a far greater extent, in online environments such as social networking websites and online communities. An Identification combined with Prevention System is therefore a necessity in all social networking websites, applications and communities in the digital world. In such a system, the Identification Block should identify any negative online behaviour and signal the Prevention Block to take action accordingly. This study aims to analyse any piece of text and detect different types of toxicity, such as obscenity, threats, insults and identity-based hatred. The labelled Wikipedia Comment Dataset prepared by Jigsaw is used for this purpose. A 6-headed Machine Learning tf-idf Model has been built and trained separately, yielding a Mean Validation Accuracy of 98.08 [...] of 91.61 [...] online conversation
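The 6-headed tf-idf setup described in the abstract can be illustrated with a minimal sketch: one shared tf-idf representation and six binary classifiers, each trained separately on its own label. Several details here are assumptions and not taken from the paper. The six label names follow the public Jigsaw release, the per-head classifier is a plain logistic regression trained by SGD (the abstract does not say which learner was used), and the toy comments, the hyperparameters and all function names are purely illustrative. The mean validation accuracy is computed as the average of the six per-head accuracies, which is one plausible reading of the metric.

```python
import math
from collections import Counter

# Label names follow the public Jigsaw release (assumption; the abstract lists only some).
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]

def tokenize(text):
    return text.lower().split()

def fit_idf(docs):
    """Smoothed inverse document frequency for every token in the training docs."""
    n = len(docs)
    df = Counter(tok for doc in docs for tok in set(tokenize(doc)))
    return {tok: math.log((1 + n) / (1 + count)) + 1.0 for tok, count in df.items()}

def tfidf(doc, idf):
    """L2-normalised sparse tf-idf vector; tokens unseen in training are dropped."""
    counts = Counter(tok for tok in tokenize(doc) if tok in idf)
    vec = {tok: c * idf[tok] for tok, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {tok: v / norm for tok, v in vec.items()}

def score(vec, weights, bias):
    return bias + sum(weights[tok] * v for tok, v in vec.items())

def train_head(vectors, targets, epochs=200, lr=0.5):
    """One binary logistic-regression head (assumed classifier), trained by SGD."""
    weights, bias = Counter(), 0.0
    for _ in range(epochs):
        for vec, y in zip(vectors, targets):
            z = max(-30.0, min(30.0, score(vec, weights, bias)))  # clamp to avoid overflow
            grad = 1.0 / (1.0 + math.exp(-z)) - y
            bias -= lr * grad
            for tok, v in vec.items():
                weights[tok] -= lr * grad * v
    return weights, bias

def train_heads(docs, label_rows):
    """Fit one shared tf-idf vocabulary, then train the six heads independently."""
    idf = fit_idf(docs)
    vectors = [tfidf(d, idf) for d in docs]
    heads = {name: train_head(vectors, [row[i] for row in label_rows])
             for i, name in enumerate(LABELS)}
    return idf, heads

def predict(doc, idf, heads):
    vec = tfidf(doc, idf)
    return {name: int(score(vec, w, b) > 0) for name, (w, b) in heads.items()}

def mean_accuracy(docs, label_rows, idf, heads):
    """Average of the per-head accuracies over the given comments."""
    preds = [predict(d, idf, heads) for d in docs]
    per_head = [sum(p[name] == row[i] for p, row in zip(preds, label_rows)) / len(docs)
                for i, name in enumerate(LABELS)]
    return sum(per_head) / len(per_head)

# Toy data (illustrative only); each row gives the six labels in LABELS order.
train_docs = ["thanks for the helpful edit", "great article well sourced",
              "you are a stupid idiot", "i will hurt you",
              "nice work on the references", "stupid idiot go away"]
train_rows = [(0, 0, 0, 0, 0, 0), (0, 0, 0, 0, 0, 0), (1, 0, 0, 0, 1, 0),
              (1, 0, 0, 1, 0, 0), (0, 0, 0, 0, 0, 0), (1, 0, 0, 0, 1, 0)]
val_docs = ["what a stupid edit", "thanks for the references"]
val_rows = [(1, 0, 0, 0, 1, 0), (0, 0, 0, 0, 0, 0)]

idf, heads = train_heads(train_docs, train_rows)
mean_acc = mean_accuracy(val_docs, val_rows, idf, heads)
print(f"mean validation accuracy over {len(heads)} heads: {mean_acc:.2f}")
```

Because the heads share only the tf-idf features and are otherwise independent, any one of them can be retrained or swapped for a different binary classifier without touching the other five.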

