Ensemble Maximum Entropy Classification and Linear Regression for Author Age Prediction

10/04/2016
by Joey Hong, et al.
The growth of the internet has produced an abundance of unstructured data on the web, a significant part of which is textual. Author profiling seeks to infer the demographics of people solely from the linguistic and content-based features of their text. The ability to describe traits of authors clearly has applications in fields such as security and forensics, as well as marketing. Rather than treating age purely as a classification problem, we also frame it as a regression problem, and use an ensemble chain method that combines classification and regression to learn the author's exact age.
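
To make the idea of chaining a classifier into a regressor concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not say how the two models are connected, so the sketch assumes one common arrangement, in which a maximum entropy (multinomial logistic regression) classifier predicts a coarse age group and its class probabilities are appended to the original features before fitting a linear regression on exact age. The feature values, ages, age-group boundaries, and all function names are hypothetical and chosen only for illustration.

```python
import math

def softmax(scores):
    # Subtract the max score for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def predict_proba(W, x):
    xb = x + [1.0]  # append a constant 1.0 for the bias term
    return softmax([sum(w * v for w, v in zip(row, xb)) for row in W])

def train_maxent(X, y, n_classes, lr=0.5, epochs=500):
    """Maximum entropy (softmax) classifier fit by batch gradient descent."""
    d = len(X[0])
    W = [[0.0] * (d + 1) for _ in range(n_classes)]
    for _ in range(epochs):
        grad = [[0.0] * (d + 1) for _ in range(n_classes)]
        for x, label in zip(X, y):
            xb = x + [1.0]
            p = predict_proba(W, x)
            for k in range(n_classes):
                err = p[k] - (1.0 if k == label else 0.0)
                for j in range(d + 1):
                    grad[k][j] += err * xb[j]
        for k in range(n_classes):
            for j in range(d + 1):
                W[k][j] -= lr * grad[k][j] / len(X)
    return W

def predict_linear(w, x):
    return sum(a * b for a, b in zip(w, x + [1.0]))

def train_linear(X, y, lr=0.1, epochs=2000):
    """Least-squares linear regression fit by batch gradient descent."""
    d = len(X[0])
    w = [0.0] * (d + 1)
    for _ in range(epochs):
        grad = [0.0] * (d + 1)
        for x, target in zip(X, y):
            xb = x + [1.0]
            err = predict_linear(w, x) - target
            for j in range(d + 1):
                grad[j] += err * xb[j]
        for j in range(d + 1):
            w[j] -= lr * grad[j] / len(X)
    return w

# Hypothetical toy data: two text-derived features per author, scaled to [0, 1].
X = [[0.10, 0.9], [0.20, 0.8], [0.15, 0.7],
     [0.40, 0.5], [0.50, 0.5], [0.45, 0.4],
     [0.80, 0.2], [0.90, 0.1], [0.85, 0.3]]
ages = [16, 19, 17, 28, 33, 30, 48, 55, 51]
# Assumed age groups: 0 = under 25, 1 = 25 to 40, 2 = over 40.
groups = [0 if a < 25 else 1 if a <= 40 else 2 for a in ages]

# Stage 1: the classifier predicts the coarse age group.
W = train_maxent(X, groups, n_classes=3)

def augment(x):
    # Chain step: append the classifier's group probabilities to the features.
    return x + predict_proba(W, x)

# Stage 2: the regressor predicts exact age from the augmented features.
w_reg = train_linear([augment(x) for x in X], ages)

def mse(preds, targets):
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(targets)

chained_mse = mse([predict_linear(w_reg, augment(x)) for x in X], ages)
mean_age = sum(ages) / len(ages)
baseline_mse = mse([mean_age] * len(ages), ages)
print("chained training MSE:", chained_mse)
print("predict-the-mean training MSE:", baseline_mse)
```

The errors printed here are measured on the training data only, so they show that the chain fits the toy set, not that it generalizes. In practice the classifier probabilities passed to the regressor would usually be produced on held-out folds to avoid leaking training labels into the second stage.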

research
09/02/2020

Too good to be true? Predicting author profiles from abusive language

The problem of online threats and abuse could potentially be mitigated w...
research
10/12/2020

Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language

Italian is a Romance language that has its roots in Vulgar Latin. The bi...
research
10/14/2016

A Language-independent and Compositional Model for Personality Trait Recognition from Short Texts

Many methods have been used to recognize author personality traits from ...
research
09/30/2022

PART: Pre-trained Authorship Representation Transformer

Authors writing documents imprint identifying information within their t...
research
03/11/2021

Integrated Age Estimation Mechanism

Machine-learning-based age estimation has received lots of attention. Tr...
