Determining crucial factors for the popularity of scientific articles

01/27/2020
by   Robert Jankowski, et al.
0

Using a set of over 70.000 records from PLOS One journal consisting of 37 lexical, sentiment and bibliographic variables we perform analysis backed with machine learning methods to predict the class of popularity of scientific papers defined by the number of times they have been viewed. Our study shows correlations among the features and recovers a threshold for the number of views that results in the best prediction results in terms of Matthew's correlation coefficient. Moreover, by creating a variable importance plot for random forest classifier, we are able to reduce the number of features while keeping similar predictability and determine crucial factors responsible for the popularity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2018

Understanding Book Popularity on Goodreads

Goodreads has launched the Readers Choice Awards since 2009 where users ...
research
11/15/2021

Automatic Analysis of Linguistic Features in Journal Articles of Different Academic Impacts with Feature Engineering Techniques

English research articles (RAs) are an essential genre in academia, so t...
research
10/08/2019

Random forest model identifies serve strength as a key predictor of tennis match outcome

Tennis is a popular sport worldwide, boasting millions of fans and numer...
research
12/03/2018

Music Popularity: Metrics, Characteristics, and Audio-Based Prediction

Understanding music popularity is important not only for the artists who...
research
08/12/2019

Assessing the Quality of Scientific Papers

A multitude of factors are responsible for the overall quality of scient...
research
08/05/2021

Spotify Danceability and Popularity Analysis using SAP

Our analysis reviews and visualizes the audio features and popularity of...

Please sign up or login with your details

Forgot password? Click here to reset