Travel time prediction in transport and logistics: Towards more efficient vehicle GPS data management using tree ensemble methods

Xia Li (School of Computer Science, University of Nottingham, Ningbo, China)
Ruibin Bai (School of Computer Science, University of Nottingham, Ningbo, China)
Peer-Olaf Siebers (School of Computer Science, University of Nottingham, Nottingham, UK)
Christian Wagner (University of Nottingham, Nottingham, UK)

VINE Journal of Information and Knowledge Management Systems

ISSN: 2059-5891

Article publication date: 25 July 2019

Issue publication date: 12 September 2019

Abstract

Purpose

Many transport and logistics companies nowadays use raw vehicle GPS data for travel time prediction. However, they face difficult challenges in terms of the costs of information storage, as well as the quality of the prediction. This paper aims to systematically investigate various meta-data (features) that require significantly less storage space but provide sufficient information for high-quality travel time predictions.

Design/methodology/approach

The paper systematically studied the combinatorial effects of features and different model fitting strategies with two popular decision tree ensemble methods for travel time prediction, namely, random forests and gradient boosting regression trees. The investigation was first conducted on pseudo travel time data produced by a pseudo travel time sampling algorithm. This algorithm can generate travel times under different noise processes, so that prediction performance under different travel conditions and noise characteristics can be studied systematically. The results and findings were then compared and evaluated on a real-life case.
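The idea of sampling pseudo travel times under different noise processes can be illustrated with a minimal sketch. It is not the authors' algorithm: the daily profile in `base_travel_time`, the two noise options and every parameter value are assumptions chosen only for illustration.

```python
import math
import random


def base_travel_time(departure_hour: float) -> float:
    """Hypothetical noise-free travel time in minutes, with two daily peaks."""
    morning = 15.0 * math.exp(-((departure_hour - 8.0) ** 2) / 2.0)
    evening = 20.0 * math.exp(-((departure_hour - 18.0) ** 2) / 2.0)
    return 30.0 + morning + evening


def sample_travel_times(hours, noise="gaussian", scale=0.1, seed=0):
    """Draw one pseudo travel time per departure hour under a chosen noise process.

    noise="gaussian"  adds zero-mean noise proportional to the mean (assumption).
    noise="lognormal" multiplies the mean by a log-normal factor (assumption).
    """
    rng = random.Random(seed)  # seeded so that experiments are repeatable
    samples = []
    for hour in hours:
        mean = base_travel_time(hour)
        if noise == "gaussian":
            value = mean + rng.gauss(0.0, scale * mean)
        elif noise == "lognormal":
            value = mean * rng.lognormvariate(0.0, scale)
        else:
            raise ValueError(f"unknown noise process: {noise}")
        samples.append(max(value, 0.0))  # a travel time cannot be negative
    return samples


# One sample per 30-minute departure interval over a day.
hours = [h / 2 for h in range(48)]
gaussian_times = sample_travel_times(hours, noise="gaussian")
lognormal_times = sample_travel_times(hours, noise="lognormal")
```

Because the noise-free profile is known, a predictor trained on such samples can be scored against the underlying mean under each noise setting, which is harder to do with real GPS traces.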

Findings

The paper provides empirical insights and guidelines about how raw GPS data can be reduced into a small-sized feature vector for the purposes of vehicle travel time prediction. It suggests that adding travel time observations from the previous departure time intervals is beneficial to the prediction, particularly when no other types of real-time information (e.g. traffic flow, speed) are available. It was also found that modular model fitting does not improve the quality of the prediction in all experimental settings used in this paper.
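As a rough illustration of using travel time observations from previous departure time intervals as features, the sketch below turns a chronological series of per-interval travel times into small feature vectors. The function name `build_lag_features`, the feature layout and the sample values are hypothetical and are not taken from the paper.

```python
def build_lag_features(interval_times, n_lags=2):
    """Pair each interval's travel time with the times of the preceding intervals.

    interval_times: travel times (minutes), one per departure interval,
                    in chronological order.
    Returns a list of (features, target) tuples, where features is
    [interval_index, time at t-1, ..., time at t-n_lags].
    """
    rows = []
    for i in range(n_lags, len(interval_times)):
        lags = [interval_times[i - k] for k in range(1, n_lags + 1)]
        rows.append(([i] + lags, interval_times[i]))
    return rows


# Made-up travel times for five consecutive departure intervals.
times = [30.0, 32.0, 41.0, 45.0, 38.0]
rows = build_lag_features(times, n_lags=2)
for features, target in rows:
    print(features, "->", target)
```

Each row stores only an interval index and a few previous travel times rather than the raw GPS trace. Rows of this shape could then be passed to a tree ensemble regressor such as scikit-learn's `RandomForestRegressor` or `GradientBoostingRegressor`.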

Research limitations/implications

The findings are primarily based on empirical studies of a limited number of real-life data instances, and the results may lack generalisability. Therefore, researchers are encouraged to test them further on more real-life data instances.

Practical implications

The paper includes implications and guidelines for the development of efficient GPS data storage and high-quality travel time prediction under different types of travel conditions.

Originality/value

This paper systematically studies the combinatorial feature effects for tree-ensemble-based travel time prediction approaches.

Acknowledgements

The authors acknowledge the financial support from the National Natural Science Foundation of China (71471092), Zhejiang Natural Science Foundation (LR17G010001), the International Doctoral Innovation Centre, Ningbo Science and Technology Bureau (2014A35006, 2017D10034), China’s MoST and The University of Nottingham.

Citation

Li, X., Bai, R., Siebers, P.-O. and Wagner, C. (2019), "Travel time prediction in transport and logistics: Towards more efficient vehicle GPS data management using tree ensemble methods", VINE Journal of Information and Knowledge Management Systems, Vol. 49 No. 3, pp. 277-306. https://doi.org/10.1108/VJIKMS-11-2018-0102

Publisher

Emerald Publishing Limited

Copyright © 2019, Emerald Publishing Limited