Zooming in on NYC taxi data with Portal

09/18/2017
by   Julia Stoyanovich, et al.
0

In this paper we develop a methodology for analyzing transportation data at different levels of temporal and geographic granularity, and apply our methodology to the TLC Trip Record Dataset, made publicly available by the NYC Taxi & Limousine Commission. This data is naturally represented by a set of trajectories, annotated with time and with additional information such as passenger count and cost. We analyze TLC data to identify hotspots, which point to lack of convenient public transportation options, and popular routes, which motivate ride-sharing solutions or addition of a bus route. Our methodology is based on using a system called Portal, which implements efficient representations and principled analysis methods for evolving graphs. Portal is implemented on top of Apache Spark, a popular distributed data processing system, is inter-operable with other Spark libraries like SparkSQL, and supports sophisticated kinds of analysis of evolving graphs efficiently. Portal is currently under development in the Data, Responsibly Lab at Drexel. We plan to release Portal in the open source in Fall 2017.

READ FULL TEXT

page 2

page 4

page 7

research
02/13/2022

Democratizing Aviation Emissions Estimation: Development of an Open-Source, Data-Driven Methodology

Through an aviation emissions estimation tool that is both publicly-acce...
research
06/17/2022

Assessing transportation accessibility equity via open data

We propose a methodology to assess transportation accessibility inequity...
research
09/03/2019

Los Angeles Metro Bus Data Analysis Using GPS Trajectory and Schedule Data (Demo Paper)

With the widespread installation of location-enabled devices on public t...
research
03/12/2020

Learning distributed representations of graphs with Geo2DR

We present Geo2DR, a Python library for unsupervised learning on graph-s...
research
09/06/2016

OpenTripPlanner, OpenStreetMap, General Transit Feed Specification: Tools for Disaster Relief and Recovery

Open Trip Planner was identified as the most promising open source multi...
research
06/26/2023

Methodology for generating synthetic labeled datasets for visual container inspection

Nowadays, containerized freight transport is one of the most important t...
research
02/12/2019

ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero

The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are a rema...

Please sign up or login with your details

Forgot password? Click here to reset