Automating and Optimizing a Transportation Mode Classification Model for use on Smartphone Data
Abstract
As transportation engineering and planning evolve from “data poor” to “data rich” practices, methods to automate the collection and translation of data to information become increasingly important. Advances in wireless communications and technologies provide the opportunity to collect detailed data on travel trajectories using smartphones equipped with GPS and accelerometers. These types of smartphones are ubiquitous and, as such, present an opportunity to conveniently collect spatial and temporal data at regular time intervals. This can be useful to utilize as a method to document trip attributes of interest, namely origin, destination, departure time, route choice, trip purpose, and mode choice. Though some of these attributes can be relatively easily extracted from the smartphone data, inferring transportation mode(s) used by the trip maker remains a challenging problem.
This research presents a data-driven classification model to infer the transportation mode(s) used by trip makers on the basis of data collected with GPS equipped smart phones. Rather than making a priori assumptions, we instead employ an optimization method to objectively produce the following classifier components and methods: a ranked feature vector based on the power of differentiation between different modes; the classification technique between the range of candidate classifiers; the number of ranked attributes to include in the feature vector; data formatting; and optimal model parameters. The model is trained and tested using labelled trip data. The calibrated model is evaluated by testing its ability to classify travel mode correctly for GPS data at a different level of disaggregation than the one used in the model training step. The model provides an accuracy of approximately 86% at the disaggregated level (e.g., Walk, Bike, Transit, and Private Automobile) and approximately 94% at aggregated level (e.g., Non-Motorized and Motorized.)
The results obtained from the optimized model are supplemented with a GIS based model to improve the identification of transit trips. The method employed integrates GIS data such as the locations of transit stops and signalized intersections with observed travel patterns from the GPS embedded smartphone data. The combination of these two data sources generates new classification features that, when applied to the collected data, demonstrate that this technique vastly improves the accuracy of the classification model for identifying transit mode usage.
Collections
Cite this version of the work
Akram Nour
(2015).
Automating and Optimizing a Transportation Mode Classification Model for use on Smartphone Data. UWSpace.
http://hdl.handle.net/10012/9218
Other formats
Related items
Showing items related by title, author, creator and subject.
-
Highway Development Decision-Making Under Uncertainty: Analysis, Critique and Advancement
El-Khatib, Mayar (University of Waterloo, 2011-01-20)While decision-making under uncertainty is a major universal problem, its implications in the field of transportation systems are especially enormous; where the benefits of right decisions are tremendous, the consequences ... -
Exploring the Accessibility Gap: Quantifying Transport Disadvantage in the City of Toronto
Lee, Janelle (University of Waterloo, 2019-07-19)Researchers and policymakers have become increasingly interested in understanding the intersection between transportation and equity. Many scholars argue that it is important to understand transportation through an equity ... -
Improving the Fiscal Transparency and Sustainability of Public-Sector Transportation Infrastructure
Terry, Jacob Ross (University of Waterloo, 2017-08-25)Public-sector infrastructure funding is a complex and often unclear process. Cities operate most efficiently if there is an adequately-funded transportation network with which residents and visitors can easily travel from ...