Abstract
Vehicle flow forecasting is crucial for managing road traffic in complex urban networks, and it is a useful input for route planning algorithms. In general, traffic prediction models rely on data gathered by different types of sensors placed on roads, which occasionally produce faulty readings for several reasons, such as malfunctioning hardware or transmission errors. Filling in those gaps is relevant for constructing accurate forecasting models, a task addressed by diverse strategies, from simple null value imputation to complex spatio-temporal context imputation models. This work elaborates on two machine learning approaches to fill in missing data with no gap length restrictions: a spatial context sensing model based on the information provided by surrounding sensors, and an automated clustering analysis tool that seeks optimal pattern clusters in order to impute values. Their performance is assessed and compared with other common techniques under different missing data generation models, using real data captured in the city of Madrid (Spain). The newly presented methods are found to be fairly superior when portions of missing data are large or very abundant, as occurs in most practical cases.
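The abstract names two imputation ideas without giving their details. The Python sketch below only illustrates the general intuition behind each one and is not the models proposed in the paper: it assumes missing readings are marked with `None`, replaces the spatial context model with a plain average of neighbouring sensors, and assumes the reference patterns (for example, cluster centroids of daily flow profiles) have already been computed. The function names `impute_from_neighbours` and `impute_from_patterns` are hypothetical.

```python
from math import sqrt
from typing import List, Optional, Sequence

Reading = Optional[float]  # None marks a missing reading (assumption)


def impute_from_neighbours(target: Sequence[Reading],
                           neighbours: Sequence[Sequence[Reading]]) -> List[Reading]:
    """Fill each gap in `target` with the mean of the readings that
    neighbouring sensors reported at the same time step."""
    filled: List[Reading] = []
    for t, value in enumerate(target):
        if value is not None:
            filled.append(value)
            continue
        available = [n[t] for n in neighbours if n[t] is not None]
        # Leave the gap open if no neighbour reported at this time step.
        filled.append(sum(available) / len(available) if available else None)
    return filled


def impute_from_patterns(day: Sequence[Reading],
                         patterns: Sequence[Sequence[float]]) -> List[float]:
    """Fill gaps in `day` from the reference pattern that is closest
    (Euclidean distance) to the observed part of the day."""
    observed = [t for t, v in enumerate(day) if v is not None]
    if not observed:
        raise ValueError("at least one observed reading is needed to pick a pattern")

    def distance(pattern: Sequence[float]) -> float:
        # Compare only the time steps where the day has a reading.
        return sqrt(sum((day[t] - pattern[t]) ** 2 for t in observed))

    best = min(patterns, key=distance)
    return [best[t] if v is None else v for t, v in enumerate(day)]


if __name__ == "__main__":
    # Made-up toy readings, for illustration only.
    target = [10.0, None, 30.0]
    neighbours = [[12.0, 20.0, 28.0], [8.0, None, 32.0], [9.0, 24.0, 31.0]]
    print(impute_from_neighbours(target, neighbours))

    patterns = [[5.0, 5.0, 5.0, 5.0], [10.0, 40.0, 60.0, 20.0]]
    print(impute_from_patterns([11.0, None, None, 19.0], patterns))
```

Neither function depends on how long a gap is: the neighbour average works even when the target sensor is silent for the whole period, while the pattern lookup needs at least one observed reading to select a pattern.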
Original language | English |
---|---|
Pages (from-to) | 18-33 |
Number of pages | 16 |
Journal | Transportation Research Part C: Emerging Technologies |
Volume | 90 |
Publication status | Published - May 2018 |
Keywords
- Traffic forecasting
- Missing data
- Cluster analysis
- Data imputation
Project and Funding Information
- Project ID
- info:eu-repo/grantAgreement/EC/H2020/691735/EU/REnaissance of Places with Innovative Citizenship and TEchnolgy/REPLICATE
- Funding Info
- This work has been supported by the Basque Government through the ELKARTEK program (Ref. KK-2015/0000080 and the BID3ABI project), as well as by the H2020 programme of the European Commission (Grant No. 691735).