Using Machine Learning to Evaluate Real Estate Prices Using Location Big Data

05/02/2022
by Walter Coleman, et al.

With so many people trying to enter the real estate market, accurate valuations of residential and commercial properties have become crucial. Past research has used static real estate data (e.g., number of beds, baths, square footage), or a combination of real estate and demographic information, to predict property prices. In this investigation, we attempt to improve upon that work by testing whether mobile location data can improve the predictive power of popular regression and tree-based models. To prepare the data, we attached the mobility data to individual properties from the real estate data by aggregating the users observed within 500 meters of each property for each day of the week. We removed people who lived within 500 meters of each property, so each property's aggregated mobility data contained only non-resident census features. In addition to these dynamic census features, we included static census features, including the number of people in the area, the average proportion of people commuting, and the number of residents in the area. Finally, we tested multiple models for predicting real estate prices. Our proposed model consists of two random forest modules stacked under a ridge regression that uses the random forest outputs as predictors. The first random forest uses static features only, and the second uses dynamic features only. Comparing our models with and without the dynamic mobile location features, we find that the model with dynamic mobile location features achieves 3/ than the same model without them.
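The 500-meter aggregation step described in the abstract could look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' pipeline: the record layout `(user_id, lat, lon, date)`, the `homes` lookup used to decide who counts as a resident, the function names, and the use of great-circle (haversine) distance are all hypothetical choices made here. The abstract does not say how home locations were inferred or which census features were aggregated, so the sketch only counts distinct non-resident users per weekday.

```python
import math
from collections import defaultdict
from datetime import date

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def weekday_visitor_counts(prop, pings, homes, radius_m=500.0):
    """Count distinct non-resident users seen within radius_m of a property.

    prop  : (lat, lon) of the property
    pings : iterable of (user_id, lat, lon, date) records (assumed layout)
    homes : dict user_id -> (lat, lon) home location (assumed to be available;
            users with no known home are treated as non-residents here)
    Returns a list of 7 counts, index 0 = Monday ... 6 = Sunday.
    """
    seen = defaultdict(set)  # weekday index -> set of user ids
    for user_id, lat, lon, day in pings:
        # Skip pings outside the radius around the property.
        if haversine_m(prop[0], prop[1], lat, lon) > radius_m:
            continue
        # Skip users whose home lies within the same radius (residents).
        home = homes.get(user_id)
        if home is not None and haversine_m(prop[0], prop[1], home[0], home[1]) <= radius_m:
            continue
        seen[day.weekday()].add(user_id)
    return [len(seen[d]) for d in range(7)]


# Made-up example data: one property, three users.
prop = (40.0, -75.0)
homes = {"a": (40.0, -75.0005), "b": (40.05, -75.0)}  # "a" lives next to the property
pings = [
    ("a", 40.001, -75.0, date(2022, 5, 2)),  # resident, excluded
    ("b", 40.002, -75.0, date(2022, 5, 2)),  # visitor on a Monday
    ("b", 40.002, -75.0, date(2022, 5, 2)),  # duplicate ping, counted once
    ("c", 40.003, -75.0, date(2022, 5, 3)),  # visitor on a Tuesday, home unknown
    ("c", 40.010, -75.0, date(2022, 5, 4)),  # more than 500 m away, excluded
]
counts = weekday_visitor_counts(prop, pings, homes)
print(counts)
```

The seven resulting counts per property would be one possible form of the "dynamic" features fed to the second random forest, alongside the static features used by the first.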


