Prediction Hourly Bording Demand Of Bus Passenger Using Imbalance Record
Keywords:
Deep-GANAbstract
The tap-on smart-card data provides a
valuable source to learn passengers’ boarding
behaviour and predict future travel demand.
However, when examining the smart-card records
(or instances) by the time of day and by boarding
stops, the positive instances (i.e. boarding at a
specific bus stop at a specific time) are rare
compared to negative instances (not boarding at
that bus stop at that time). Imbalanced data has
been demonstrated to significantly reduce the
accuracy of machine learning models deployed for
predicting hourly boarding numbers from a
particular location. This paper addresses this data
imbalance issue in the smart-card data before
applying it to predict bus boarding demand. We
propose the deep generative adversarial nets (Deep-
GAN) to generate dummy travelling instances to
add to a synthetic training dataset with more
balanced travelling and non-travelling instances.
The synthetic dataset is then used to train a deep
neural network (DNN) for predicting the travelling
and non-travelling instances from a particular stop
in a given time window. The results show that
addressing the data imbalance issue can
significantly improve the predictive model’s
performance and better fit ridership’s actual profile.
Comparing the performance of the Deep-GAN with
other traditional resampling methods shows that the
proposed method can produce a synthetic training
dataset with a higher similarity and diversity and,
thus, a stronger prediction power. The paper
highlights the significance and provides practical
guidance in improving the data quality and model
performance on travel behaviour prediction and
individual travel behaviour analysis.
Downloads
References
[1] X. Guo, J. Wu, H. Sun, R. Liu, and Z. Gao,
“Timetable coordination of
first trains in urban railway network: A case study
of beijing,” Applied
Mathematical Modelling, vol. 40, no. 17, pp. 8048–
8066, 2016.
[2] W. Wu, P. Li, R. Liu, W. Jin, B. Yao, Y. Xie,
and C. Ma, “Predicting
peak load of bus routes with supply optimization
and scaled shepard
interpolation: A newsvendor model,”
Transportation Research Part E:
Logistics and Transportation Review, vol. 142, p.
102041, 2020.
[3] N. Beˇsinovi´c, L. De Donato, F. Flammini, R.
M. Goverde, Z. Lin, R. Liu,
S. Marrone, R. Nardone, T. Tang, and V. Vittorini,
“Artificial intelligence
in railway transport: Taxonomy, regulations and
applications,” IEEE
Transactions on Intelligent Transportation Systems,
2021.
[4] S. C. Kwan and J. H. Hashim, “A review on cobenefits
of mass public
transportation in climate change mitigation,”
Sustainable Cities and
Society, vol. 22, pp. 11–18, 2016.
[5] Y. Wang, W. Zhang, T. Tang, D. Wang, and Z.
Liu, “Bus od matrix reconstruction based on clustering wi-fi
probe data,”
Transportmetrica B: Transport Dynamics, pp. 1–16,
2021, doi:
10.1080/21680566.2021.1956388.
[6] S. J. Berrebi, K. E. Watkins, and J. A. Laval, “A
real-time bus
dispatching policy to minimize passenger wait on a
high frequency
route,” Transportation Research Part B:
Methodological, vol. 81, pp.
377–389, 2015.
[7] A. Fonzone, J.-D. Schm¨ocker, and R. Liu, “A
model of bus bunching
under reliability-based passenger arrival patterns,”
Transportation Research
Part C: Emerging Technologies, vol. 59, pp. 164–
182, 2015.
[8] J. D. Schm¨ocker, W. Sun, A. Fonzone, and R.
Liu, “Bus bunching
along a corridor served by two lines,”
Transportation Research Part
B: Methodological, vol. 93, pp. 300–317, 2016.
[9] D. Chen, Q. Shao, Z. Liu, W. Yu, and C. L. P.
Chen, “Ridesourcing
behavior analysis and prediction: A network
perspective,” IEEE Transactions
on Intelligent Transportation Systems, pp. 1–10,
2020.
[10] E. Nelson and N. Sadowsky, “Estimating the
impact of ride-hailing app
company entry on public transportation use in
major us urban areas,”
The B.E. Journal of Economic Analysis & Policy,
vol. 19, no. 1, p.
20180151, 2019.
[11] Z. Chen, K. Liu, J. Wang, and T. Yamamoto,
“H-convlstm-based bagging
learning approach for ride-hailing demand
prediction considering
imbalance problems and sparse uncertainty,”
Transportation Research
Part C: Emerging Technologies, vol. 140, p.
103709, 2022.
[12] R. Liu and S. Sinha, “Modelling urban bus
service and passenger
reliability,” 2007.
[13] J. A. Sorratini, R. Liu, and S. Sinha,
“Assessing bus transport teliability
using micro-simulation,” Transportation Planning
and Technology,
vol. 31, no. 3, pp. 303–324, 2008.
[14] Y. Wang, W. Zhang, T. Tang, D. Wang, and
Z. Liu, “Bus od matrix
reconstruction based on clustering wi-fi probe
data,” Transportmetrica
B: Transport Dynamics, pp. 1–16, 2021.
[15] Y. Hollander and R. Liu, “Estimation of the
distribution of travel times
by repeated simulation,” Transportation Research
Part C: Emerging
Technologies, vol. 16, no. 2, pp. 212–231, 2008.