Abstract
Predicting demand for bike share systems (BSSs) is critical for both the management of an existing BSS and the planning for a new BSS. While researchers have mainly focused on improving prediction accuracy and analysing demand-influencing factors, there are few studies examining the inherent randomness of stations’ observed demands and to what degree the demands at individual stations are predictable. Using Divvy bike-share one-year data from Chicago, USA, we measured demand entropy and quantified the station-level predictability. Additionally, to verify that these predictability measures could represent the performance of prediction models, we implemented two commonly used demand prediction models to compare the empirical prediction accuracy with the calculated entropy and predictability. Furthermore, we explored how city- and system-specific temporally-constant features would impact entropy and predictability to inform estimating these measures when historical demand data are unavailable. Our results show that entropy and predictability of demands across stations are polarized as some stations exhibit high uncertainty (a low predictability of 0.65) and others have almost no check-out demand uncertainty (a high predictability of around 1.0). We also validated that the entropy and predictability are a priori model-free indicators for prediction error, given a sequence of bike usage demands. Lastly, we identified that key factors contributing to station-level entropy and predictability include per capita income, spatial eccentricity, and the number of parking lots near the station. Findings from this study provide more fundamental understanding of BSS demand prediction, which can help decision makers and system operators anticipate diverse station-level prediction errors from their prediction models both for existing stations and for new ones.
Similar content being viewed by others
References
Bachand-Marleau J, Lee B, El-Geneidy A (2012). Better understanding of factors influencing likelihood of using shared bicycle systems and frequency of use. Transportation Research Record: Journal of the Transportation Research Board, 2314(1): 66–71
Bao J, Xu C, Liu P, Wang W (2017). Exploring bikesharing travel patterns and trip purposes using smart card data and online point of interests. Networks and Spatial Economics, 17(4): 1231–1253
Belgiu M, Drăguţ L (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114: 24–31
Benvenuto D, Giovanetti M, Vassallo L, Angeletti S, Ciccozzi M (2020). Application of the ARIMA model on the COVID-2019 epidemic dataset. Data in Brief, 29: 105340
Biau G, Scornet E (2016). A random forest guided tour. Test, 25(2): 197–227
Cazabet R, Borgnat P, Jensen P (2017). Using degree constrained gravity null-models to understand the structure of journeys’ networks in bicycle sharing systems. In: 25th European Symposium on Artificial Neural Networks, Computational Intelligence, and Machine Learning. Bruges: ENSANN, 25
Chai D, Wang L, Yang Q (2018). Bike flow prediction with multigraph convolutional networks. In: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. Seattle, WA: ACM, 397–400
Chen L, Zhang D, Wang L, Yang D, Ma X, Li S, Wu Z, Pan G, Nguyen T, Jakubowicz J (2016). Dynamic cluster-based over-demand prediction in bike sharing systems. In: Proceedings of the International Joint Conference on Pervasive and Ubiquitous Computing. Heidelberg: ACM, 841–852
Chen P, Yuan H, Shu X (2008). Forecasting crime using the ARIMA model. In: 5th International Conference on Fuzzy Systems and Knowledge Discovery. **an: IEEE, 627–630
Chen T, Guestrin C (2016). Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, CA: ACM, 785–794
El-Assi W, Salah Mahmoud M, Nurul Habib K (2017). Effects of built environment and weather on bike sharing demand: A station level analysis of commercial bike sharing in Toronto. Transportation, 44(3): 589–613
El Sibai R, Chabchoub Y, Fricker C (2018). Using spatial outliers detection to assess balancing mechanisms in bike sharing systems. In: 32nd International Conference on Advanced Information Networking and Applications (AINA). Krakow: IEEE, 988–995
Fano R, Hawkins D (1961). Transmission of information: A statistical theory of communication. American Journal of Physics, 29(11): 793–794
Fishman E (2016). Bikeshare: A review of recent literature. Transport Reviews, 36(1): 92–113
Fishman E, Allan V (2019). Chapter six: Bike share. In: Fishman E, ed. Advances in Transport Policy and Planning. Volume 4. The Sharing Economy and the Relevance for Transport. Cambridge: Elsevier Inc., 121–152
He S, Shin K (2020). Towards fine-grained flow forecasting: A graph attention approach for bike sharing systems. In: Proceedings of the Web Conference. Taipei: ACM, 88–98
Hulot P, Aloise D, Jena S (2018). Towards station-level demand prediction for effective rebalancing in bike-sharing systems. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. London: ACM, 378–386
Hyland M, Hong Z, Pinto H, Chen Y (2018). Hybrid cluster-regression approach to model bikeshare station usage. Transportation Research Part A: Policy and Practice, 115: 71–89
Hyndman R, Khandakar Y (2008). Automatic time series forecasting: The forecast package for R. Journal of Statistical Software, 27(3): 1–22
Jia W, Tan Y, Liu L, Li J, Zhang H, Zhao K (2019). Hierarchical prediction based on two-level Gaussian mixture model clustering for bike-sharing system. Knowledge-Based Systems, 178: 84–97
Kim D, Cho Y, Kim D, Park C, Choo J (2022). Residual correction in real-time traffic forecasting. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. Atlanta, GA: ACM, 962–971
Kocer U U (2013). Forecasting intermittent demand by Markov chain model. International Journal of Innovative Computing, Information & Control, 9(8): 3307–3318
Kontoyiannis I, Algoet P, Suhov Y, Wyner A (1998). Nonparametric entropy estimation for stationary processes and random fields, with applications to English text. IEEE Transactions on Information Theory, 44(3): 1319–1327
Kou Z, Cai H (2019). Understanding bike sharing travel patterns: An analysis of trip data from eight cities. Physica A, 515: 785–797
Kou Z, Cai H (2021a). Comparing the performance of different types of bike share systems. Transportation Research Part D: Transport and Environment, 94: 102823
Kou Z, Cai H (2021b). Incorporating spatial network information to improve demand prediction for bike share system expansion. In: Proceedings of the 10th International Workshop on Urban Computing. Bei**g
Kou Z, Wang X, Chiu S, Cai H (2020). Quantifying greenhouse gas emissions reduction from bike share systems: A model considering real-world trips and transportation mode choice patterns. Resources, Conservation and Recycling, 153: 104534
Li Y, Zheng Y (2020). Citywide bike usage prediction in a bike-sharing system. IEEE Transactions on Knowledge and Data Engineering, 32(6): 1079–1091
Li Y, Zheng Y, Zhang H, Chen L (2015). Traffic prediction in a bike-sharing system. In: Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. Seattle, WA: ACM, 1–10
Lin L, He Z, Peeta S (2018). Predicting station-level hourly demand in a large-scale bike-sharing network: A graph convolutional neural network approach. Transportation Research Part C: Emerging Technologies, 97: 258–276
Lin P, Weng J, Hu S, Alivanistos D, Li X, Yin B (2020). Revealing spatio-temporal patterns and influencing factors of dockless bike sharing demand. IEEE Access, 8: 66139–66149
Liu C, Gao X, Wang X (2022). Data adaptive functional outlier detection: Analysis of the Paris bike sharing system data. Information Sciences, 602: 13–42
Lu X, Wetter E, Bharti N, Tatem A, Bengtsson L (2013). Approaching the limit of predictability in human mobility. Scientific Reports, 3(1): 2923
Luo H, Zhao F, Chen W, Cai H (2020). Optimizing bike sharing systems from the life cycle greenhouse gas emissions perspective. Transportation Research Part C: Emerging Technologies, 117: 102705
Médard de Chardon C, Caruso G (2015). Estimating bike-share trips using station level data. Transportation Research Part B: Methodological, 78: 260–279
Niu Z, Zhou M, Wang L, Gao X, Hua G (2016). Ordinal regression with multiple output CNN for age estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV: IEEE, 4920–4928
Nochai R, Nochai T (2006). ARIMA model for forecasting oil palm price. In: Proceedings of the 2nd IMT-GT Regional Conference on Mathematics, Statistics and Applications. Penang: Academia, 13–15
Regue R, Recker W (2014). Proactive vehicle routing with inferred demand to solve the bikesharing rebalancing problem. Transportation Research Part E: Logistics and Transportation Review, 72: 192–209
Saboia J L M (1977). Autoregressive integrated moving average (ARIMA) models for birth forecasting. Journal of the American Statistical Association, 72(358): 264–270
Senter H (2008). In: Kutner M H, Nachtsheim C J, Neter J, Li W, eds. Applied Linear Statistical Models. 5th ed. Boston: McGraw-Hill
Shaheen S, Guzman S, Zhang H (2010). Bikesharing in Europe, the Americas, and Asia. Transportation Research Record: Journal of the Transportation Research Board, 2143(1): 159–167
Shannon C E (1948). A mathematical theory of communication. Bell System Technical Journal, 27(3): 379–423
Singhvi D, Singhvi S, Frazier P I, Henderson S G, Mahony E O, Shmoys D B, Woodard D B (2015). Predicting bike usage for New York City’s bike sharing system. In: AAAI Workshop: Computational Sustainability. Austin, TX: AAAI
Smith D A, Shen Y, Barros J, Zhong C, Batty M, Giannotti M (2020). A compact city for the wealthy? Employment accessibility inequalities between occupational classes in the London metropolitan region 2011. Journal of Transport Geography, 86: 102767
Song C, Qu Z, Blumm N, Barabási A L (2010). Limits of predictability in human mobility. Science, 327(5968): 1018–1021
Straits Research (2021). Bike sharing market new research analysis and forecast 2030. Online Report
Stromberg J (2015). Bike share users are mostly rich and white. Here’s why that’s hard to change
US Census Bureau (2012). 2009–2011 American Community Survey 3-year Public Use Microdata Samples
Yang Z, Chen J, Hu J, Shu Y, Cheng P (2019). Mobility modeling and data-driven closed-loop prediction in bike-sharing systems. IEEE Transactions on Intelligent Transportation Systems, 20(12): 4488–4499
Yang Z, Hu J, Shu Y, Cheng P, Chen J, Moscibroda T (2016). Mobility modeling and prediction in bike-sharing systems. In: Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services. Singapore: ACM, 165–178
Yu B, Yin H, Zhu Z (2018). Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence. Stockholm: ACM, 3634–3640
Zhang Y, Thomas T, Brussel M J G, van Maarseveen M F A M (2016). Expanding bicycle-sharing systems: Lessons learnt from an analysis of usage. PLoS One, 11(12): e0168604
Zheng V Z, Choi S, Sun L (2023). Enhancing deep traffic forecasting models with dynamic regression. ar**v preprint, ar**v:2301.06650
Zhou X (2015). Understanding spatiotemporal patterns of biking behavior by analyzing massive bike sharing data in Chicago. PLoS One, 10(10): e0137922
Zhou Y, Wang L, Zhong R, Tan Y (2018). A Markov chain based demand prediction model for stations in bike sharing systems. Mathematical Problems in Engineering, 8028714
Zhou Y, Yu Y, Wang Y, He B, Yang L (2023). Mode substitution and carbon emission impacts of electric bike sharing systems. Sustainable Cities and Society, 89: 104312
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing Interests The authors declare that they have no competing interests.
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Yin, Z., Hardaway, K., Feng, Y. et al. Understanding the demand predictability of bike share systems: A station-level analysis. Front. Eng. Manag. 10, 551–565 (2023). https://doi.org/10.1007/s42524-023-0279-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42524-023-0279-8