IBM Employee Attrition Analysis

by   Shenghuan Yang, et al.

In this paper, we analyzed the dataset IBM Employee Attrition to find the main reasons why employees choose to resign. Firstly, we utilized the correlation matrix to see some features that were not significantly correlated with other attributes and removed them from our dataset. Secondly, we selected important features by exploiting Random Forest, finding monthlyincome, age, and the number of companies worked significantly impacted employee attrition. Next, we also classified people into two clusters by using K-means Clustering. Finally, We performed binary logistic regression quantitative analysis: the attrition of people who traveled frequently was 2.4 times higher than that of people who rarely traveled. And we also found that employees who work in Human Resource have a higher tendency to leave.



page 5


Using Machine Learning to Evaluate Real Estate Prices Using Location Big Data

With everyone trying to enter the real estate market nowadays, knowing t...

Do elderly want to work? Modeling elderly's decision to fight aging Thailand

Thailand has entered into an aging society since the year 2000. Using th...

MARF: Multiscale Adaptive-switch Random Forest for Leg Detection with 2D Laser Scanners

For the 2D laser-based tasks, e.g., people detection and people tracking...

Improved Clustering with Augmented k-means

Identifying a set of homogeneous clusters in a heterogeneous dataset is ...

Auto-Detection of Safety Issues in Baby Products

Every year, thousands of people receive consumer product related injurie...

CryptoCredit: Securely Training Fair Models

When developing models for regulated decision making, sensitive features...

Analysis of the Pennsylvania Additive Classification Tool: Biases and Important Features

The Pennsylvania Additive Classification Tool (PACT) is a carceral algor...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.