Characterizing Classes of Potential Outliers through Traffic Data Set Data Signature 2D nMDS Projection

02/24/2017
by   Erlo Robert F. Oquendo, et al.
0

This paper presents a formal method for characterizing the potential outliers from the data signature projection of traffic data set using Non-Metric Multidimensional Scaling (nMDS) visualization. Previous work had only relied on visual inspection and the subjective nature of this technique may derive false and invalid potential outliers. The identification of correct potential outliers had already been an open problem proposed in literature. This is due to the fact that they pinpoint areas and time frames where traffic incidents/accidents occur along the North Luzon Expressway (NLEX) in Luzon. In this paper, potential outliers are classified into (1) absolute potential outliers; (2) valid potential outliers; and (3) ambiguous potential outliers through the use of confidence bands and confidence ellipse. A method is also described to validate cluster membership of identified ambiguous potential outliers. Using the 2006 NLEX Balintawak Northbound (BLK-NB) data set, we were able to identify two absolute potential outliers, nine valid potential outliers, and five ambiguous potential outliers. In a literature where Vector Fusion was used, 10 potential outliers were identified. Given the results for the nMDS visualization using the confidence bands and confidence ellipses, all of these 10 potential outliers were also found and 8 new potential outliers were also found.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2022

Outliers, Dynamics, and the Independence Postulate

We show that outliers occur almost surely in computable dynamics over in...
research
04/05/2023

A system for exploring big data: an iterative k-means searchlight for outlier detection on open health data

The interactive exploration of large and evolving datasets is challengin...
research
07/20/2023

Edgewise outliers of network indexed signals

We consider models for network indexed multivariate data involving a dep...
research
04/12/2022

Analysing and visualising bike sharing demand with outliers

Bike-sharing is a popular component of sustainable urban mobility. It re...
research
01/25/2015

Robust Subjective Visual Property Prediction from Crowdsourced Pairwise Labels

The problem of estimating subjective visual properties from image and vi...
research
06/01/2016

Identifying Outliers using Influence Function of Multiple Kernel Canonical Correlation Analysis

Imaging genetic research has essentially focused on discovering unique a...
research
05/16/2019

How Entropic Regression Beats the Outliers Problem in Nonlinear System Identification

System identification (SID) is central in science and engineering applic...

Please sign up or login with your details

Forgot password? Click here to reset