Coupling human mobility and social relationships to predict individual socioeconomic status: A graph neural network approach
Xiao Chen
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorCorresponding Author
Tao Pei
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, China
Correspondence
Tao Pei, State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, 11A, Datun Road, Chaoyang District, Beijing 100101, China.
Email: [email protected]
Search for more papers by this authorCi Song
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorHua Shu
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorSihui Guo
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorXi Wang
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorYaxi Liu
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorJie Chen
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorXiao Chen
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorCorresponding Author
Tao Pei
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, China
Correspondence
Tao Pei, State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, 11A, Datun Road, Chaoyang District, Beijing 100101, China.
Email: [email protected]
Search for more papers by this authorCi Song
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorHua Shu
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorSihui Guo
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorXi Wang
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorYaxi Liu
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
University of Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorJie Chen
State Key Laboratory of Resources and Environmental Information System, Institute of Geographical Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
Search for more papers by this authorAbstract
Understanding individual's socioeconomic status (SES) can provide supporting information for designing political and economic policies. Acquiring large-scale economic survey data is time-consuming and laborious. The widespread mobile phone data, which can reflect human mobility and social network characteristics, has become a low-cost data source for researchers to infer SES. However, previous studies often oversimplify human mobility features and social network features extracted from mobile phone data into general statistical features, resulting in discounting some important temporal and relational information. Therefore, we propose a comprehensive framework for individual SES prediction that effectively utilizes a combination of human mobility and social relationships. In this framework, Word2Vec module extracts human mobility features from mobile phone positioning data, and graph neural network (GNN) module GraphSAGE captures social network characteristics constructed from call detail records. We evaluated the effectiveness of our proposed approach by training the model with real-world data in Beijing. According to the experimental results, our proposed hybrid approach outperformed the other methods evidently, demonstrating that human mobility and social links are complementary in the characterization of SES. Coupling human mobility and social links can further deepen our understanding of cities' economic geography.
CONFLICT OF INTEREST STATEMENT
The authors declare no competing interests.
Open Research
DATA AVAILABILITY STATEMENT
Research data are not shared.
REFERENCES
- Abitbol, J. L., & Morales, A. J. (2021). Socioeconomic patterns of twitter user activity. Entropy, 23(6), 780. https://doi.org/10.3390/e23060780
- Agarwal, R. R., Lin, C.-C., Chen, K.-T., & Singh, V. K. (2018). Predicting financial trouble using call data—On social capital, phone logs, and financial trouble. PLoS ONE, 13(2), e0191863. https://doi.org/10.1371/journal.pone.0191863
- Ahas, R., Silm, S., Järv, O., Saluveer, E., & Tiru, M. (2010). Using mobile positioning data to model locations meaningful to users of mobile phones. Journal of Urban Technology, 17(1), 3–27. https://doi.org/10.1080/10630731003597306
- Akiba, T., Sano, S., Yanase, T., Ohta, T., & Koyama, M. (2019). Optuna: A next-generation hyperparameter optimization framework. 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA. https://doi.org/10.1145/3292500.3330701
10.1145/3292500.3330701 Google Scholar
- Al-Bishawi, M. A., & Ghadban, S. S. (2015). Physical space, social behaviour and socioeconomic changes in traditional neighbourhoods: A case study of the traditional city of Nablus. City, Culture and Society, 6(4), 125–133. https://doi.org/10.1016/j.ccs.2015.06.002
10.1016/j.ccs.2015.06.002 Google Scholar
- Aletras, N., & Chamberlain, B. P. (2018). Predicting twitter user socioeconomic attributes with network and language information. Proceedings of the 29th on hypertext and social media, Baltimore, MD, USA. https://doi.org/10.1145/3209542.3209577.
10.1145/3209542.3209577 Google Scholar
- Almaatouq, A., Prieto-Castrillo, F., & Pentland, A. (2016). Mobile communication signatures of unemployment. In Social informatics: 8th International conference, SocInfo 2016, Bellevue, WA, USA, November 11–14, 2016, Proceedings, Part I 8 2016 (pp. 407–418). Springer International Publishing. https://doi.org/10.1007/978-3-319-47880-7_25
10.1007/978?3?319?47880?7_25 Google Scholar
- Baker, E. H. (2014). Socioeconomic status, definition. In The Wiley Blackwell encyclopedia of health, illness, behavior, and society (pp. 2210–2214). John Wiley & Sons, Ltd. https://doi.org/10.1002/9781118410868.wbehibs395
10.1002/9781118410868.wbehibs395 Google Scholar
- Barbosa, H., Hazarie, S., Dickinson, B., Bassolas, A., Frank, A., Kautz, H., Sadilek, A., Ramasco, J. J., & Ghoshal, G. (2021). Uncovering the socioeconomic facets of human mobility. Scientific Reports, 11(1), 8616. https://doi.org/10.1038/s41598-021-87407-4
- Blumenstock, J. (2015, November). Calling for better measurement: Estimating an individual's wealth and well-being from mobile phone transaction records. 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, New York, NY.
- Blumenstock, J. (2018). Estimating economic characteristics with phone data. AEA Papers and Proceedings, 108, 72–76. https://doi.org/10.1257/pandp.20181033
10.1257/pandp.20181033 Google Scholar
- Blumenstock, J., Cadamuro, G., & On, R. (2015). Predicting poverty and wealth from mobile phone metadata. Science, 350(6264), 1073–1076. https://doi.org/10.1126/science.aac4420
- Brea, J., Burroni, J., & Sarraute, C. (2018). Inference of users demographic attributes based on homophily in communication networks. arXiv Preprint https://doi.org/10.48550/arxiv.1808.00527
10.48550/arxiv.1808.00527 Google Scholar
- Cai, H., Zheng, V. W., & Chang, K. C.-C. (2018). A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Transactions on Knowledge and Data Engineering, 30(9), 1616–1637. https://doi.org/10.1109/tkde.2018.2807452
- Carey, R. M., & Markus, H. R. (2017). Social class shapes the form and function of relationships and selves. Current Opinion in Psychology, 18, 123–130. https://doi.org/10.1016/j.copsyc.2017.08.031
- Carlsson-Kanyama, A., & Linden, A.-L. (1999). Travel patterns and environmental effects now and in the future: Implications of differences in energy consumption among socio-economic groups. Ecological Economics, 30(3), 405–417. https://doi.org/10.1016/S0921-8009(99)00006-3
- Chetty, R., Jackson, M. O., Kuchler, T., Stroebel, J., Hendren, N., Fluegge, R. B., Gong, S., Gonzalez, F., Grondin, A., Jacob, M., Johnston, D., Koenen, M., Laguna-Muggenburg, E., Mudekereza, F., Rutter, T., Thor, N., Townsend, W., Zhang, R., Bailey, M., … Wernerfelt, N. (2022). Social capital I: Measurement and associations with economic mobility. Nature, 608(7921), 108–121. https://doi.org/10.1038/s41586-022-04996-4
- Crivellari, A., & Beinat, E. (2019). From motion activity to geo-embeddings: Generating and exploring vector representations of locations, traces and visitors through large-scale mobility data. ISPRS International Journal of Geo-Information, 8(3), 134. https://doi.org/10.3390/ijgi8030134
- Davidoff, T. (2006). Labor income, housing prices, and homeownership. Journal of Urban Economics, 59(2), 209–235. https://doi.org/10.1016/j.jue.2005.04.001
- Ding, S., Huang, H., Zhao, T., & Fu, X. (2019, July). Estimating socioeconomic status via temporal-spatial mobility analysis—A case study of smart card data. 28th International Conference on Computer Communication and Networks (ICCCN), Valencia, VLC. https://doi.org/10.1109/icccn.2019.8847051
10.1109/ICCCN.2019.8847051 Google Scholar
- Eagle, N., Macy, M., & Claxton, R. (2010). Network diversity and economic development. Science, 328(5981), 1029–1031. https://doi.org/10.1126/science.1186605
- Fixman, M., Berenstein, A., Brea, J., Minnoni, M., Travizano, M., & Sarraute, C. (2016, August). A Bayesian approach to income inference in a communication network. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), San Francisco, CA. https://doi.org/10.1109/ASONAM.2016.7752294
10.1109/ASONAM.2016.7752294 Google Scholar
- Frias-Martinez, V., Soguero-Ruiz, C., Frias-Martinez, E., & Josephidou, M. (2013). Forecasting socioeconomic trends with cell phone records. 3rd ACM Symposium on Computing for Development, Bangalore, India. https://doi.org/10.1145/2442882.2442902
10.1145/2442882.2442902 Google Scholar
- Grover, A., & Leskovec, J. (2016). node2vec: Scalable feature learning for networks. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA.
10.1145/2939672.2939754 Google Scholar
- Hamilton, W., Ying, Z., & Leskovec, J. (2018). Inductive representation learning on large graphs. Advances in Neural Information Processing Systems, 30. https://arxiv.org/abs/1706.02216
- Hu, S., Gao, S., Wu, L., Xu, Y., Zhang, Z., Cui, H., & Gong, X. (2021). Urban function classification at road segment level using taxi trajectory data: A graph convolutional neural network approach. Computers, Environment and Urban Systems, 87, 101619. https://doi.org/10.1016/j.compenvurbsys.2021.101619
- Ilägcrstrand, T. (1970). What about people in regional science. Regional Science Association, 24(1), 7–21. https://doi.org/10.1111/j.1435-5597.1970.tb01464.x
10.1111/j.1435-5597.1970.tb01464.x Google Scholar
- Jackson, M. (2021). Inequality's economic and social roots: The role of social networks and homophily. SSRN Electronic Journal, 1–45. https://doi.org/10.2139/ssrn.3795626.
10.2139/ssrn.3795626 Google Scholar
- Jahani, E., Saint-Jacques, G., Sundsøy, P., Bjelland, J., Moro, E., & Pentland, A. S. (2017). Differential network effects on economic outcomes: A structural perspective (pp. 41–50). Springer International Publishing. https://doi.org/10.1007/978-3-319-67256-4_5
- Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with graph convolutional networks. arXiv preprint. https://doi.org/10.48550/arXiv.1609.02907
- Kulkarni, A., Chong, D., & Batarseh, F. A. (2020). Chapter 5: Foundations of data imbalance and solutions for a data democracy. In F. A. Batarseh & R. Yang (Eds.), Data democracy (pp. 83–106). Academic Press. https://doi.org/10.1016/B978-0-12-818366-3.00005-8
10.1016/B978-0-12-818366-3.00005-8 Google Scholar
- Lampos, V., Aletras, N., Geyti, J. K., Zou, B., & Cox, I. J. (2016). Inferring the socioeconomic status of social media users based on behaviour and language. European Conference on Information Retrieval, 689–695. https://doi.org/10.1007/978-3-319-30671-1_54
10.1007/978?3?319?30671?1_54 Google Scholar
- Leo, Y., Fleury, E., Alvarez-Hamelin, J. I., Sarraute, C., & Karsai, M. (2016). Socioeconomic correlations and stratification in social-communication networks. Journal of the Royal Society Interface, 13(125), 20160598. https://doi.org/10.1098/rsif.2016.0598
- Liu, K., Gao, S., Qiu, P., Liu, X., Yan, B., & Lu, F. (2017). Road2Vec: Measuring traffic interactions in urban road system from massive travel routes. ISPRS International Journal of Geo-Information, 6(11), 321. https://doi.org/10.3390/ijgi6110321
- Liu, P., & Biljecki, F. (2022). A review of spatially-explicit GeoAI applications in urban geography. International Journal of Applied Earth Observation and Geoinformation, 112, 102936. https://doi.org/10.1016/j.jag.2022.102936
- Lu, J., Zhou, S., Liu, L., & Li, Q. (2021). You are where you go: Inferring residents' income level through daily activity and geographic exposure. Cities, 111, Article 102984. https://doi.org/10.1016/j.cities.2020.102984
- Luo, S., Morone, F., Sarraute, C., Travizano, M., & Makse, H. A. (2017). Inferring personal economic status from social network location. Nature Communications, 8(1), 15227. https://doi.org/10.1038/ncomms15227
- Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv Preprint https://doi.org/10.48550/arXiv.1301.3781
10.48550/arXiv.1301.3781 Google Scholar
- National Bureau of Statistics. (2020). Income and consumption of residents in 2020. http://www.stats.gov.cn/tjsj/zxfb/202101/t20210118_1812425.html
- Niaz, M. U. (2022). Socio-economic development and sustainable development goals: A roadmap from vulnerability to sustainability through financial inclusion. Economic Research-Ekonomska Istraživanja, 35(1), 3243–3275. https://doi.org/10.1080/1331677X.2021.1989319
- Pappalardo, L., Pedreschi, D., Smoreda, Z., & Giannotti, F. (2015). Using big data to study the link between human mobility and socio-economic development, Santa Clara, CA, USA. IEEE International Conference on Big Data (Big Data), 871–878. https://doi.org/10.1109/BigData.2015.7363835
10.1109/BigData.2015.7363835 Google Scholar
- Perozzi, B., Al-Rfou, R., & Skiena, S. (2014). Deepwalk: Online learning of social representations, New York, NY, USA. 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. https://doi.org/10.1145/2623330.2623732
10.1145/2623330.2623732 Google Scholar
- Preoţiuc-Pietro, D., Volkova, S., Lampos, V., Bachrach, Y., & Aletras, N. (2015). Studying user income through language, behaviour and affect in social media. PLoS ONE, 10(9), e0138717. https://doi.org/10.1371/journal.pone.0138717
- Ren, Y., Mai, W., Li, Y., & Chen, X. (2019, August). Predicting socio-economic levels of individuals via APP usage records machine learning and intelligent communications. 4th International Conference, Nanjing, China. https://doi.org/10.1007/978-3-030-32388-2_17
- Solomon, A., Bar, A., Yanai, C., Shapira, B., & Rokach, L. (2018, July). Predict demographic information using Word2Vec on spatial trajectories. 26th Conference on User Modeling, Adaptation and Personalization, Singapore, Singapore. https://doi.org/10.1145/3209219.3209224
10.1145/3209219.3209224 Google Scholar
- Soto, V., Frias-Martinez, V., Virseda, J., & Frias-Martinez, E. (2011, July). Prediction of socioeconomic levels using cell phone records. User Modeling, Adaption and Personalization, Girona, Spain. https://doi.org/10.1007/978-3-642-22362-4_35
10.1007/978-3-642-22362-4_35 Google Scholar
- Sundsøy, P., Bjelland, J., Reme, B.-A., Iqbal, A. M., & Jahani, E. (2016, January). Deep learning applied to mobile phone data for individual income classification. International Conference on Artificial Intelligence: Technologies and Applications, Bangkok, Thailand. https://doi.org/10.2991/icaita-16.2016.24
10.2991/icaita?16.2016.24 Google Scholar
- Toole, J. L., Herrera-Yaqüe, C., Schneider, C. M., & González, M. C. (2015). Coupling human mobility and social ties. Journal of the Royal Society Interface, 12(105), 20141128. https://doi.org/10.1098/rsif.2014.1128
- Toole, J. L., Lin, Y.-R., Muehlegger, E., Shoag, D., González, M. C., & Lazer, D. (2015). Tracking employment shocks using mobile phone data. Journal of the Royal Society Interface, 12(107), 20150185. https://doi.org/10.1098/rsif.2015.0185
- Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., & Bengio, Y. (2017). Graph attention networks. arXiv Preprint https://doi.org/10.48550/arXiv.1710.10903
10.48550/arXiv.1710.10903 Google Scholar
- Wijesinghe, W. O. K. I. S., Kumarasinghe, C. U., Mannapperuma, J., & Liyanage, K. L. D. U. (2020). Socioeconomic status classification of geographic regions in Sri Lanka through anonymized call detail records (pp. 299–311). Springer International Publishing. https://doi.org/10.1007/978-3-030-19562-5_30
10.1007/978-3-030-19562-5_30 Google Scholar
- Wu, L., Yang, L., Huang, Z., Wang, Y., Chai, Y., Peng, X., & Liu, Y. (2019). Inferring demographics from human trajectories and geographical context. Computers, Environment and Urban Systems, 77, 101368. https://doi.org/10.1016/j.compenvurbsys.2019.101368
- Xiao, X., Zheng, Y., Luo, Q., & Xie, X. (2014). Inferring social ties between users with human location history. Journal of Ambient Intelligence and Humanized Computing, 5(1), 3–19. https://doi.org/10.1007/s12652-012-0117-z
- Xu, M. (2021). Understanding graph embedding methods and their applications. SIAM Review, 63(4), 825–853. https://doi.org/10.1137/20m1386062
- Xu, Y., Belyi, A., Bojic, I., & Ratti, C. (2017). How friends share urban space: An exploratory spatiotemporal analysis using mobile phone data. Transactions in GIS, 21(3), 468–487. https://doi.org/10.1111/tgis.12285
- Xu, Y., Belyi, A., Bojic, I., & Ratti, C. (2018). Human mobility and socioeconomic status: Analysis of Singapore and Boston. Computers, Environment and Urban Systems, 72, 51–67. https://doi.org/10.1016/j.compenvurbsys.2018.04.001
- Xu, Y., Santi, P., & Ratti, C. (2022). Beyond distance decay: Discover homophily in spatially embedded social networks. Annals of the American Association of Geographers, 112(2), 505–521. https://doi.org/10.1080/24694452.2021.1935208
- Yang, C., Sun, M., Zhao, W. X., Liu, Z., & Chang, E. Y. (2017). A neural network approach to joint modeling social networks and mobile trajectories. ACM Transactionson on Information Systems, 35, 1–28. https://arxiv.org/abs/1606.08154, https://doi.org/10.1145/3041658
- Yang, Z., & Shen, Y. (2008). The affordability of owner occupied housing in Beijing. Journal of Housing and the Built Environment, 23(4), 317–335. https://doi.org/10.1007/s10901-008-9120-2
- Yao, Y., Li, X., Liu, X., Liu, P., Liang, Z., Zhang, J., & Mai, K. (2017). Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model. International Journal of Geographical Information Science, 31(4), 825–848. https://doi.org/10.1080/13658816.2016.1244608
- Zhang, J., Li, X., Yao, Y., Hong, Y., He, J., Jiang, Z., & Sun, J. (2021). The Traj2Vec model to quantify residents' spatial trajectories and estimate the proportions of urban land-use types. International Journal of Geographical Information Science, 35(1), 193–211. https://doi.org/10.1080/13658816.2020.1726923
- Zhang, Y., Sari Aslam, N., Lai, J., & Cheng, T. (2020). You are how you travel: A multi-task learning framework for geodemographic inference using transit smart card data. Computers, Environment and Urban Systems, 83, 101517. https://doi.org/10.1016/j.compenvurbsys.2020.101517
- Zhao, T., Huang, H., Yao, X., Luo, J.-D., & Fu, X. (2020). Predicting individual socioeconomic status from mobile phone data: A semi-supervised hypergraph-based factor graph approach. International Journal of Data Science and Analytics, 9(3), 361–372. https://doi.org/10.1007/s41060-019-00195-z
- Zhao, T., Tang, L., Huang, J., & Fu, X. (2022). Coupled social media content representation for predicting individual socioeconomic status. Expert Systems with Applications, 198, 116744. https://doi.org/10.1016/j.eswa.2022.116744
- Zheng, Y., & Zhou, X. (2011). Computing with spatial trajectories. Springer Science & Business Media. https://doi.org/10.1007/978-1-4614-1629-6
- Zhu, Y., Chen, F., Li, M., & Wang, Z. (2018). Inferring the economic attributes of urban rail transit passengers based on individual mobility using multisource data. Sustainability, 10(11), 4178. https://doi.org/10.3390/su10114178