An Unsupervised Random Forest Clustering Technique for Automatic Traffic Scenario Categorization

04/05/2020
by Friedrich Kruber, et al.

This paper introduces a modification of the Random Forest algorithm for categorizing traffic situations. The procedure yields an unsupervised machine learning method. The algorithm generates a proximity matrix that contains a similarity measure. This matrix is then reordered with hierarchical clustering to obtain a graphically interpretable representation. It is shown how the resulting proximity matrix can be interpreted visually and how varying the method's metaparameter reveals different insights into the data. The proposed method can cluster data from any data source. To demonstrate the method's potential, multiple features derived from a traffic simulation are used in this paper. Knowledge of traffic scenario clusters is crucial for accelerating the validation process. The key point of the method is that scenario templates can be generated automatically from actual traffic situations. These templates can be employed in all stages of the development process. The results show that the procedure is well suited for the automatic categorization of traffic scenarios. Various other applications can benefit from this work.
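The two post-processing steps named in the abstract, building a proximity matrix and reordering it with hierarchical clustering, can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the paper: the forest is already trained and is represented only by a hypothetical `leaf_ids` table (one row per tree, one leaf label per sample), proximity is the common Random Forest definition (the fraction of trees in which two samples share a leaf), and the reordering uses plain average linkage on the distance 1 - proximity. The paper's modified forest and its exact similarity measure may differ.

```python
from itertools import combinations


def proximity_matrix(leaf_ids):
    """Fraction of trees in which two samples land in the same leaf.

    leaf_ids[t][i] is the leaf that tree t assigns to sample i
    (hypothetical input format, assumed for this sketch).
    """
    n_trees = len(leaf_ids)
    n = len(leaf_ids[0])
    counts = [[0] * n for _ in range(n)]
    for leaves in leaf_ids:
        for i in range(n):
            for j in range(n):
                if leaves[i] == leaves[j]:
                    counts[i][j] += 1
    return [[c / n_trees for c in row] for row in counts]


def hierarchical_order(prox):
    """Average-linkage agglomerative clustering on distance 1 - proximity.

    Returns the sample order obtained by concatenating merged clusters,
    so that similar samples end up next to each other.
    """
    clusters = [[i] for i in range(len(prox))]

    def dist(a, b):
        # Mean pairwise distance between the members of two clusters.
        return sum(1.0 - prox[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > 1:
        # Merge the closest pair of clusters.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: dist(clusters[p[0]], clusters[p[1]]),
        )
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return clusters[0]


def reorder(prox, order):
    """Apply the same permutation to the rows and columns of the matrix."""
    return [[prox[i][j] for j in order] for i in order]


# Toy example: 3 trees, 4 samples. Samples 0 and 2 always share a leaf,
# samples 1 and 3 share a leaf in two of the three trees.
leaf_ids = [
    [0, 1, 0, 1],
    [0, 1, 0, 1],
    [0, 1, 0, 2],
]
prox = proximity_matrix(leaf_ids)
order = hierarchical_order(prox)
reordered = reorder(prox, order)
print(order)  # [0, 2, 1, 3]
```

After reordering, similar samples sit next to each other, so clusters appear as blocks of high proximity along the diagonal when the matrix is plotted as a heat map. The pairwise search here is written for readability and does not scale to large data sets.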


