A Comparison of Statistical and Machine Learning Algorithms for Predicting Rents in the San Francisco Bay Area

11/26/2020
by   Paul Waddell, et al.
0

Urban transportation and land use models have used theory and statistical modeling methods to develop model systems that are useful in planning applications. Machine learning methods have been considered too 'black box', lacking interpretability, and their use has been limited within the land use and transportation modeling literature. We present a use case in which predictive accuracy is of primary importance, and compare the use of random forest regression to multiple regression using ordinary least squares, to predict rents per square foot in the San Francisco Bay Area using a large volume of rental listings scraped from the Craigslist website. We find that we are able to obtain useful predictions from both models using almost exclusively local accessibility variables, though the predictive accuracy of the random forest model is substantially higher.

READ FULL TEXT

page 7

page 9

research
03/29/2023

Local Interpretability of Random Forests for Multi-Target Regression

Multi-target regression is useful in a plethora of applications. Althoug...
research
07/22/2021

Inter and Intra-Annual Spatio-Temporal Variability of Habitat Suitability for Asian Elephants in India: A Random Forest Model-based Analysis

We develop a Random Forest model to estimate the species distribution of...
research
12/09/2019

Prediction of Sewer Pipe Deterioration Using Random Forest Classification

Wastewater infrastructure systems deteriorate over time due to a combina...
research
01/28/2020

A random forest based approach for predicting spreads in the primary catastrophe bond market

We introduce a random forest approach to enable spreads' prediction in t...
research
01/07/2022

Applying Machine Learning and AI Explanations to Analyze Vaccine Hesitancy

The paper quantifies the impact of race, poverty, politics, and age on C...
research
02/23/2021

Bridging Breiman's Brook: From Algorithmic Modeling to Statistical Learning

In 2001, Leo Breiman wrote of a divide between "data modeling" and "algo...
research
05/24/2023

Applications of Machine Learning in Detecting Afghan Fake Banknotes

Fake currency, unauthorized imitation money lacking government approval,...

Please sign up or login with your details

Forgot password? Click here to reset