Mapping hourly dynamics of urban population using trajectories reconstructed from mobile phone records
Zhang Liu
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorCorresponding Author
Ting Ma
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Correspondence Ting Ma, State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, 11A, Datun Road, Chaoyang District, Beijing 100101, China. Email: [email protected]Search for more papers by this authorYunyan Du
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorTao Pei
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorJiawei Yi
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorHui Peng
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorZhang Liu
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorCorresponding Author
Ting Ma
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Correspondence Ting Ma, State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, 11A, Datun Road, Chaoyang District, Beijing 100101, China. Email: [email protected]Search for more papers by this authorYunyan Du
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorTao Pei
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorJiawei Yi
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorHui Peng
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorFunding information: National Natural Science Foundation of China, Grant/Award Nos. 4159840011 and 41771418; The Foundation for Innovative Research Groups of the National Natural Science Foundation of China, Grant/Award No. 41421001; The Innovation Program of the State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Grant/Award No. 088RA500PA; The Distinguished Young Scholar Program of the Institute of Geographic Sciences and Natural Resources Research, CAS, Grant/Award No. 2014RC102
Abstract
Understanding the spatiotemporal dynamics of urban population is crucial for addressing a wide range of urban planning and management issues. Aggregated geospatial big data have been widely used to quantitatively estimate population distribution at fine spatial scales over a given time period. However, it is still a challenge to estimate population density at a fine temporal resolution over a large geographical space, mainly due to the temporal asynchrony of population movement and the challenges to acquiring a complete individual movement record. In this article, we propose a method to estimate hourly population density by examining the time-series individual trajectories, which were reconstructed from call detail records using BP neural networks. We first used BP neural networks to predict the positions of mobile phone users at an hourly interval and then estimated the hourly population density using log-linear regression at the cell tower level. The estimated population density is linearly correlated with population census data at the sub-district level. Trajectory clustering results show five distinct diurnal dynamic patterns of population movement in the study area, revealing spatially explicit characteristics of the diurnal commuting flows, though the driving forces of the flows need further investigation.
CONFLICT OF INTEREST
We declare that we have no financial or personal relationships with other people or organizations that can inappropriately influence our work, there is no professional or other personal interest of any nature or kind in any product, service, and/or company that could be construed as influencing the position presented in, or the review of, this manuscript.
REFERENCES
-
Agard, B.,
Morency, C., &
Trépanier, M. (2006). Mining public transport user behaviour from smart card data. IFAC Proceedings, 39(3), 399–404.
10.3182/20060517-3-FR-2903.00211 Google Scholar
- Balk, D., Deichmann, U., Yetman, G., Pozzi, F., Hay, S., & Nelson, A. (2006). Determining global population distribution: Methods, applications and data. Advances in Parasitology, 62, 119–156.
- Balk, D., & Yetman, G. (2004). The global distribution of population: Evaluating the gains in resolution refinement. New York, NY: Center for International Earth Science Information Network, Columbia University.
- Barabási, A.-L. (2005). The origin of bursts and heavy tails in human dynamics. Nature, 435, 207–211.
- Becker, R. A., Caceres, R., Hanson, K., Loh, J. M., Urbanek, S., Varshavsky, A., & Volinsky, C. (2011). A tale of one city: Using cellular network data for urban planning. IEEE Pervasive Computing, 10(4), 18–26.
- Bengtsson, L., Lu, X., Thorson, A., Garfield, R., & Von Schreeb, J. (2011). Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: A post-earthquake geospatial study in Haiti. PLoS Med, 8(8), e1001083.
- Bhaduri, B., Bright, E., Coleman, P., & Urban, M. L. (2007). LandScan USA: A high-resolution geospatial and temporal modeling approach for population distribution and dynamics. GeoJournal, 69(1&2), 103–117.
- Chen, X., Chi, R., Wang, J., Pang, C., & Xiao, Q. (2011). Driver safe speed model based on BP neural network for rural curved roads. In D. Li & Y. Chen (Eds.), Computer and Computing Technologies in Agriculture V: 5th IFIP TC 5/SIG 5.1 Conference, CCTA 2011, Beijing, China, October 29–31, 2011, Proceedings, Part III (pp. 92–102). Berlin, Germany: Springer.
- Chen, Y., Liu, X., Li, X., Liu, X., Yao, Y., Hu, G., … Pei, F. (2017). Delineating urban functional areas with building-level social media data: A dynamic time warping (DTW) distance based k-medoids method. Landscape & Urban Planning, 160, 48–60.
- De Nadai, M., Staiano, J., Larcher, R., Sebe, N., Quercia, D., & Lepri, B. (2016). The death and life of great Italian cities: A mobile phone data perspective. In Proceedings of the 25th International Conference on World Wide Web. Montreal, Canada: ACM.
- Deichmann, U., Balk, D., & Yetman, G. (2001). Transforming population data for interdisciplinary usages: From census to grid. Washington, DC: Center for International Earth Science Information Network.
-
Demissie, M. G.,
Correia, G., &
Bento, C. (2015). Analysis of the pattern and intensity of urban activities through aggregate cellphone usage. Transportmetrica A: Transport Science, 11(6), 502–524.
10.1080/23249935.2015.1019591 Google Scholar
- Deville, P., Linard, C., Martin, S., Gilbert, M., Stevens, F. R., Gaughan, A. E., … Tatem, A. J. (2014). Dynamic population mapping using mobile phone data. Proceedings of the National Academy of Sciences of the United States of America, 111(45), 15888–15893.
- Ding, C., Wang, W., Wang, X., & Baumann, M. (2013). A neural network model for driver's lane-changing trajectory prediction in urban traffic flow. Mathematical Problems in Engineering, 2013, 967358.
- Domínguez, D. R., Redondo, R. P. D., Vilas, A. F., & Khalifa, M. B. (2017). Sensing the city with instagram: Clustering geolocated data for outlier detection. Expert Systems with Applications, 78, 319–333.
- Dong, Y., Pinelli, F., Gkoufas, Y., Nabi, Z., Calabrese, F., & Chawla, N. V. (2015). Inferring unusual crowd events from mobile phone call detail records. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Porto, Portugal: Springer.
- Douglass, R. W., Meyer, D. A., Ram, M., Rideout, D., & Song, D. (2015). High resolution population estimates from telecommunications data. EPJ Data Science, 4(1), 4.
-
Doxsey-Whitfield, E.,
MacManus, K.,
Adamo, S. B.,
Pistolesi, L.,
Squires, J.,
Borkovska, O., &
Baptista, S. R. (2015). Taking advantage of the improved availability of census data: A first look at the gridded population of the world, version 4. Papers in Applied Geography, 1(3), 226–234.
10.1080/23754931.2015.1014272 Google Scholar
- Eagle, N., Pentland, A. S., & Lazer, D. (2009). Inferring friendship network structure by using mobile phone data. Proceedings of the National Academy of Sciences of the United States of America, 106(36), 15274–15278.
-
Eicher, C. L., &
Brewer, C. A. (2001). Dasymetric mapping and areal interpolation: Implementation and evaluation. Cartography & Geographic Information Science, 28(2), 125–138.
10.1559/152304001782173727 Google Scholar
- Faria, N. R., Rambaut, A., Suchard, M. A., Baele, G., Bedford, T., Ward, M. J., … Pépin, J. (2014). The early spread and epidemic ignition of HIV-1 in human populations. Science, 346, 56–61.
- Furletti, B., Gabrielli, L., Giannotti, F., Milli, L., Nanni, M., Pedreschi, D., … Garofalo, G. (2014). Use of mobile phone data to estimate mobility flows: Measuring urban population and inter-city mobility using big data in an integrated approach. In Proceedings of the 47th Meeting of the Italian Statistical Society. Cagliari, Italy.
-
Hecht-Nielsen, R. (1988). Theory of the backpropagation neural network. Neural Networks, 1(Suppl. 1), 445–448.
10.1016/0893-6080(88)90469-8 Google Scholar
- Hoteit, S., Chen, G., Viana, A., & Fiore, M. (2016). Filling the gaps: On the completion of sparse call detail records for mobility analysis. ACM Chants. In Proceedings of the 11th ACM Workshop on Challenged Networks (pp. 45–50). New York, NY: ACM.
- Hoteit, S., Secci, S., Sobolevsky, S., Ratti, C., & Pujolle, G. (2014). Estimating human trajectories and hotspots through mobile phone data. Computer Networks, 64, 296–307.
- Hyvärinen, A., & Oja, E. (1997). One-unit learning rules for independent component analysis. In Proceedings of the 10th Conference on Advances in Neural Information Processing Systems. Denver, CO.
- Kang, C., Liu, Y., Ma, X., & Wu, L. (2012). Towards estimating urban population distributions from mobile call data. Journal of Urban Technology, 19(4), 3–21.
- Kang, C., Ma, X., Tong, D., & Liu, Y. (2012). Intra-urban human mobility patterns: An urban morphology perspective. Physica A: Statistical Mechanics & Its Applications, 391(4), 1702–1717.
- Khodabandelou, G., Gauthier, V., El-Yacoubi, M., & Fiore, M. (2016). Population estimation from mobile network traffic metadata. In Proceedings of the 17th International IEEE Symposium on the World of Wireless, Mobile and Multimedia Networks. Coimbra, Portugal: IEEE.
- Le Cun, Y. (1988). A theoretical framework for back-propagation. In D. Touresky, G. Hinton, & T. Sejnowski (Eds.), Proceedings of the 1988 Connectionist Models Summer School (pp. 21–28). San Mateo, CA: Morgan Kauffman Publishers.
- Levin, B. (1981). A representation for multinomial cumulative distribution functions. Annals of Statistics, 9(5), 1123–1126.
- Li, Q., Xu, B., Ma, Y., & Chung, T. (2015). Real-time monitoring and forecast of active population density using mobile phone data. In W. Chen, G. Yin, G. Zhao, Q. Han, W. Jing, G. Sun, & Z. Lu (Eds.), Big Data Technology and Applications: First National Conference, BDTA 2015, Harbin, China, December 25–26, 2015, Proceedings. Berlin, Germany: Springer.
- Liu, Y., Sui, Z., Kang, C., & Gao, Y. (2014). Uncovering patterns of inter-urban trip and spatial interaction from social media check-in data. PLoS One, 9(1), e86026.
- Lloyd, C. T., Sorichetta, A., & Tatem, A. J. (2017). High resolution global gridded data for use in population studies. Scientific Data, 4, 170001.
- Long, Y., Zhang, Y., & Cui, C. (2012). Identifying commuting pattern of Beijing using bus smart card data. Acta Geographica Sinica, 67(10), 1339–1352.
- Lopez, D., Gunasekaran, M., Murugan, B. S., Kaur, H., & Abbas, K. M. (2014). Spatial big data analytics of influenza epidemic in Vellore, India. In Proceedings of the 2014 IEEE International Conference on Big Data. Washington, DC: IEEE.
- Lwin, K. K., Sugiura, K., & Zettsu, K. (2016). Space-time multiple regression model for grid-based population estimation in urban areas. International Journal of Geographical Information Science, 30(8), 1579–1593.
- Mann, H. B., & Whitney, D. R. (1947). On a test of whether one of two random variables is stochastically larger than the other. Annals of Mathematical Statistics, 18(1), 50–60.
- Mennis, J. (2003). Generating surface models of population using dasymetric mapping. Professional Geographer, 55(1), 31–42.
- Min, G. Y., & Jeong, D. H. (2013). Research on assessment of impact of big data attributes to disaster response decision-making process. Journal of Society for e-Business Studies, 18(3), 17–43.
- Murtagh, F., & Legendre, P. (2014). Ward's hierarchical agglomerative clustering method: Which algorithms implement Ward's criterion? Journal of Classification, 31(3), 274–295.
- Oliveira, J. G., & Barabási, A.-L. (2005). Human dynamics: Darwin and Einstein correspondence patterns. Nature, 437, 1251.
- Olver, S., & Townsend, A. (2013). Fast inverse transform sampling in one and two dimensions. arXiv preprint, arxiv:1307.1223.
-
Partsinevelos, P.,
Agouris, P., &
Stefanidis, A. (2005). Reconstructing spatiotemporal trajectories from sparse data. ISPRS Journal of Photogrammetry & Remote Sensing, 60(1), 3–16.
10.1016/j.isprsjprs.2005.10.004 Google Scholar
- Patel, N. N., Stevens, F. R., Huang, Z., Gaughan, A. E., Elyazar, I., & Tatem, A. J. (2016). Improving large area population mapping using geotweet densities. Transactions in GIS, 21(2), 317–331.
- Pei, T., Sobolevsky, S., Ratti, C., Shaw, S.-L., Li, T., & Zhou, C. (2014). A new insight into land use classification based on aggregated mobile phone data. International Journal of Geographical Information Science, 28(9), 1988–2007.
-
Rokach, L., &
Maimon, O. (2005). Clustering methods. In O. Maimon & L. Rokach (Eds.), Data mining and knowledge discovery handbook (pp. 321–352). Berlin, Germany: Springer.
10.1007/0-387-25465-X_15 Google Scholar
- Sagl, G., Delmelle, E., & Delmelle, E. (2014). Mapping collective human activity in an urban environment based on mobile phone data. Cartography & Geographic Information Science, 41(3), 272–285.
- Shapiro, S. S., & Wilk, M. B. (1965). An analysis of variance test for normality (complete samples). Biometrika, 52(3&4), 591–611.
- Sterly, H., Hennig, B., & Dongo, K. (2013). “ Calling Abidjan”: Improving population estimations with mobile communication data. In V. Blondel, N. Cordes, A. Decuyper, P. Deville, J. Raguenez, & Z. Smoreda (Eds.), Mobile phone data for development: Analysis of mobile phone datasets for the development of Ivory Coast (pp. 108–114). Cambridge, MA: MIT Media Lab.
- Stevens, F. R., Gaughan, A. E., Linard, C., & Tatem, A. J. (2015). Disaggregating census data for population mapping using random forests with remotely-sensed and ancillary data. PLoS One, 10(2), e0107042.
- Tao, S., Corcoran, J., Mateo-Babiano, I., & Rohde, D. (2014). Exploring bus rapid transit passenger travel behaviour using big data. Applied Geography, 53, 90–104.
- Tatem, A. J., Noor, A. M., Von Hagen, C., Di Gregorio, A., & Hay, S. I. (2007). High resolution population maps for low income nations: Combining land cover and census in East Africa. PLoS One, 2(12), e1298.
- Traag, V. A., Browet, A., Calabrese, F., & Morlot, F. (2011). Social event detection in massive mobile phone data using probabilistic location inference. In Proceedings of the 3rd International Conference on Privacy, Security, Risk and Trust and the 3rd IEEE International Conference on Social Computing. Boston, MA: IEEE.
- Vespignani, A. (2009). Predicting the behavior of techno-social systems. Science, 325(5939), 425–428.
- Wilcoxon, F., Katti, S., & Wilcox, R. A. (1970). Critical values and probability levels for the Wilcoxon rank sum test and the Wilcoxon signed rank test. In H. L. Harter & D. B. Owen (Eds.), Selected tables in mathematical statistics (Vol. I, pp. 171–259). Providence, RI: American Mathematical Society.
- Wilson, R., zu Erbach-Schoenberg, E., Albert, M., Power, D., Tudge, S., Gonzalez, M., … Hughes, C. (2016). Rapid and near real-time assessments of population displacement using mobile phone data following disasters: The 2015 Nepal earthquake. PLoS Current Disasters. https://doi.org/10.1371/currents.dis.d073fbece328e4c39087bc086d694b5c.
- Xu, T., Li, X., & Claramunt, C. (2018). Trip-oriented travel time prediction (TOTTP) with historical vehicle trajectories. Frontiers of Earth Science, 15, in press.
- Xu, Y., Shaw, S.-L., Zhao, Z., Yin, L., Fang, Z., & Li, Q. (2015). Understanding aggregate human mobility patterns using passive mobile phone location data: A home-based approach. Transportation, 42(4), 625–646.
-
Yang, X.,
Zhao, Z., &
Lu, S. (2016). Exploring spatial–temporal patterns of urban human mobility hotspots. Sustainability, 8(7), 674.
10.3390/su8070674 Google Scholar
- Yao, Y., Liu, X., Li, X., Zhang, J., Liang, Z., Mai, K., & Zhang, Y. (2017). Mapping fine-scale population distributions at the building level by integrating multisource geospatial big data. International Journal of Geographical Information Science, 31(6), 1220–1244.
- Yuan, J., Zheng, Y., Zhang, C., Xie, W., Xie, X., Sun, G., & Huang, Y. (2010). T-drive: Driving directions based on taxi trajectories. In Proceedings of the 18th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. San Jose, CA: ACM.
- Yuan, N. J., Zheng, Y., Xie, X., Wang, Y., Zheng, K., & Xiong, H. (2015). Discovering urban functional zones using latent activity trajectories. IEEE Transactions on Knowledge & Data Engineering, 27(3), 712–725.
- Yuan, Y., & Raubal, M. (2012). Extracting dynamic urban mobility patterns from mobile phone data. In Proceedings of the 7th International Conference on Geographic Information Science. Columbus, OH.
- Yuan, Y., & Raubal, M. (2016). Analyzing the distribution of human activity space from mobile phone usage: An individual and urban-oriented study. International Journal of Geographical Information Science, 30(8), 1594–1621.
- Zhang, P., Zhou, J., & Zhang, T. (2017). Quantifying and visualizing jobs–housing balance with big data: A case study of Shanghai. Cities, 66, 10–22.
- Zhou, J., Pei, H., & Wu, H. (2016). Early warning of human crowds based on query data from Baidu map: Analysis based on Shanghai stampede. arXiv preprint, axXiv:1603.06780.
- Zipser, D., & Andersen, R. A. (1988). A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons. Nature, 331(6158), 679–684.