Using machine learning to unveil the predictors of intergenerational mobility
Corresponding Author
Luís Clemente-Casinhas
Instituto Universitário de Lisboa (ISCTE-IUL) and Business Research Unit (BRU-IUL)
Correspondence to: Luís Clemente-Casinhas, Avenida das Forças Armadas, 1649-026, Lisboa, Portugal ([email protected]).
Search for more papers by this authorAlexandra Ferreira-Lopes
Instituto Universitário de Lisboa (ISCTE-IUL) and Business Research Unit (BRU-IUL)
Search for more papers by this authorLuís Filipe Martins
Instituto Universitário de Lisboa (ISCTE-IUL), Business Research Unit (BRU-IUL), and CIMS-University of Surrey
Search for more papers by this authorCorresponding Author
Luís Clemente-Casinhas
Instituto Universitário de Lisboa (ISCTE-IUL) and Business Research Unit (BRU-IUL)
Correspondence to: Luís Clemente-Casinhas, Avenida das Forças Armadas, 1649-026, Lisboa, Portugal ([email protected]).
Search for more papers by this authorAlexandra Ferreira-Lopes
Instituto Universitário de Lisboa (ISCTE-IUL) and Business Research Unit (BRU-IUL)
Search for more papers by this authorLuís Filipe Martins
Instituto Universitário de Lisboa (ISCTE-IUL), Business Research Unit (BRU-IUL), and CIMS-University of Surrey
Search for more papers by this authorNote: We express gratitude to three referees and an editor for helpful contributions. We also thank Ambar Narayan, Daniel Mahler, and João Moura for their valuable help. We extend our appreciation to the participants of the 44th Meeting of the Association of Southern European Economic Theorists for their comments. We acknowledge financial support from FCT—Fundação para a Ciência e a Tecnologia (National Science and Technology Foundation) through grants 2020.04449.BD (DOI: 10.54499/2020.04449.BD) and UIDB/00315/2020 (DOI: 10.54499/UIDB/00315/2020).
Abstract
We assess the predictors of intergenerational mobility in income and education for a sample of 137 countries, between 1960 and 2018, using the World Bank's Global Database on Intergenerational Mobility (GDIM). The Rigorous LASSO and the Random Forest and Gradient Boosting algorithms are considered, to avoid the consequences of an ad-hoc model selection in our high dimensionality context. We obtain variable importance plots and analyze the relationships between mobility and its predictors through Shapley values. Results show that intergenerational income mobility is expected to be positively predicted by the parental average education, the share of married individuals and negatively predicted by the share of children that have completed less than primary education, the growth rate of population density, and inequality. Mobility in education is expected to have a positive relationship with the adult literacy, government expenditures on primary education, and the stock of migrants. The unemployment and poverty rates matter for income mobility, although the direction of their relationship is not clear. The same occurs for education mobility and the growth rate of real GDP per capita, the degree of urbanization, the share of female population, and income mobility. Income mobility is found to be greater for the 1960s cohort. Countries belonging to the Latin America and Caribbean region present lower mobility in income and education. We find a positive relationship between predicted income mobility and observed mobility in education.
Open Research
DATA AVAILABILITY STATEMENT
The data supporting the findings of this study are available upon request.
Supporting Information
Filename | Description |
---|---|
roiw12710-sup-0001-Supinfo.zipZip archive, 7 MB |
Data S1. Supplementary Information. |
roiw12710-sup-0002-Appendix.pdfPDF document, 291.1 KB |
Appendix A. Variables. Appendix B. Tables. Appendix C. Figures. |
Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
REFERENCES
- Acciari, P., Polo, A., & Violante, G. (2022). “And yet it moves”: Intergenerational mobility in Italy. American Economic Journal: Applied Economics, 14(3), 118–163. https://doi.org/10.1257/app.20210151
- Alesina, A., Hohmann, S., Michalopoulos, S., & Papaioannou, E. (2021). Intergenerational mobility in Africa. Econometrica, 89(1), 1–35. https://doi.org/10.3982/ECTA17018
- Alesina, A., Hohmann, S., Michalopoulos, S., & Papaioannou, E. (2023). Religion and educational mobility in Africa. Nature, 618, 134–143. https://doi.org/10.1038/s41586-023-06051-2
- Becker, G., Kominers, S., Murphy, K., & Spenkuch, J. (2018). A theory of intergenerational mobility. Journal of Political Economy, 126(S1), S7–S25.
- Becker, G., & Tomes, N. (1979). An equilibrium theory of the distribution of income and intergenerational mobility. Journal of Political Economy, 87(6), 1153–1189.
- Belloni, A., Chen, D., Chernozhukov, V., & Hansen, C. (2012). Sparse models and methods for optimal instruments with an application to eminent domain. Econometrica, 80(6), 2369–2429. https://doi.org/10.3982/ECTA9626
- Belloni, A., & Chernozhukov, V. (2013). Least squares after model selection in high-dimensional sparse models. Bernoulli, 19(2), 521–547. https://doi.org/10.3150/11-BEJ410
- Bloise, F., Brunori, P., & Piraino, P. (2021). Estimating intergenerational income mobility on sub-optimal data: A machine learning approach. The Journal of Economic Inequality, 19(4), 643–665.
- Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://doi.org/10.1023/A:1010933404324
- Brunori, P., Hufe, P., & Mahler, D. (2023). The roots of inequality: Estimating inequality of opportunity from regression trees and forests. The Scandinavian Journal of Economics (forthcoming), 125, 900–932. https://doi.org/10.1111/sjoe.12530
- Cervini-Plá, M. (2015). Intergenerational earnings and income mobility in Spain. Review of Income and Wealth, 61(4), 812–828. https://doi.org/10.1111/roiw.12130
- Chetty, R., Friedman, J., Hendren, N., Jones, M., & Porter, S. (2020). The Opportunity Atlas: Mapping the Childhood Roots of Social Mobility. National Bureau of Economic Research Working Papers No. 25147. https://doi.org/10.3386/w25147
10.3386/w25147 Google Scholar
- Chetty, R., Grusky, D., Hell, M., Hendren, N., Manduca, R., & Narang, J. (2017). The fading American dream: Trends in absolute income mobility since 1940. Science, 356(6336), 398–406. https://doi.org/10.1126/science.aal4617
- Chetty, R., & Hendren, N. (2018). The impacts of neighborhoods on intergenerational mobility II: County-level estimates. The Quarterly Journal of Economics, 133(3), 1163–1228. https://doi.org/10.1093/qje/qjy006
- Chetty, R., Hendren, N., Kline, P., & Saez, E. (2014a). Where is the land of opportunity? The geography of intergenerational mobility in the United States. The Quarterly Journal of Economics, 129(4), 1553–1623. https://doi.org/10.1093/qje/qju022
- Chetty, R., Hendren, N., Kline, P., Saez, E., & Turner, N. (2014b). Is the United States still a land of opportunity? Recent trends in intergenerational mobility. American Economic Review: Papers & Proceedings, 104(5), 141–147. https://doi.org/10.1257/aer.104.5.141
- Choi, I. (2001). Unit root tests for panel data. Journal of International Money and Finance, 20(2), 249–272. https://doi.org/10.1016/S0261-5606(00)00048-6
- Corak, M. (2019). The Canadian geography of intergenerational income mobility. The Economic Journal, 130(631), 2134–2174. https://doi.org/10.1093/ej/uez019
- Daruich, D., & Kozlowski, J. (2020). Explaining intergenerational mobility: The role of fertility and family transfers. Review of Economic Dynamics, 36, 220–245. https://doi.org/10.1016/j.red.2019.10.002
- Daude, C., & Robano, V. (2015). On intergenerational (Im)mobility in Latin America. Latin American Economic Review, 24(9), 1–29. https://doi.org/10.1007/s40503-015-0030-x
10.1007/s40503?015?0030?x Google Scholar
- Deutscher, N. (2020). Place, peers, and the teenage years: Long-run neighborhood effects in Australia. American Economic Journal: Applied Economics, 12(2), 220–249. https://doi.org/10.1257/app.20180329
- Deutscher, N., & Mazumder, B. (2020). Intergenerational mobility across Australia and the stability of regional estimates. Labor Economics, 66. https://doi.org/10.1016/j.labeco.2020.101861 101861.
- Deutscher, N., & Mazumder, B. (2023). Measuring intergenerational income mobility: A synthesis of approaches. Journal of Economic Literature, 61(3), 988–1036. https://doi.org/10.1257/jel.20211413
10.1257/jel.20211413 Google Scholar
- Eriksen, J., & Munk, M. (2020). The geography of intergenerational mobility – Danish evidence. Economic Letters, 189, 109024. https://doi.org/10.1016/j.econlet.2020.109024
- Ermisch, J., Francesconi, M., & Siedler, T. (2006). Intergenerational economic mobility and marital sorting. Economic Journal, 116(513), 659–679. https://doi.org/10.1111/j.1468-0297.2006.01105.x
- Feenstra, R., Inklaar, R., & Timmer, M. (2015). The next generation of the Penn world table. American Economic Review, 105(10), 3150–3182. https://doi.org/10.1257/aer.20130954
- Friedman, J. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. https://doi.org/10.1214/aos/1013203451
- Gallagher, R., Kaestner, R., & Persky, J. (2019). The geography of family differences and intergenerational mobility. Journal of Economic Geography, 19(3), 589–618. https://doi.org/10.1093/jeg/lby026
- GDIM. (2018). Global database on intergenerational mobility. Development research group, World Bank. World Bank Group.
- Global Change Data Lab. (2021). Our World in Data Project.
- Haider, S., & Solon, G. (2006). Life-cycle variation in the association between current and lifetime earnings. American Economic Review, 96(4), 1308–1320. https://doi.org/10.1257/aer.96.4.1308
- Helsø, A. (2020). Intergenerational income mobility in Denmark and the United States. The Scandinavian Journal of Economics, 1-24, 508–531. https://doi.org/10.1111/sjoe.12420
10.1111/sjoe.12420 Google Scholar
- Herrington, C. (2015). Public education financing, earnings inequality, and intergenerational mobility. Review of Economic Dynamics, 18(4), 822–842. https://doi.org/10.1016/j.red.2015.07.006
- Hilger, N. (2016). The Great Escape: Intergenerational Mobility in the United States Since 1940. National Bureau of Economic Research Working Papers No. 21217. https://doi.org/10.3386/w21217
10.3386/w21217 Google Scholar
- Hodrick, R., & Prescott, E. (1997). Postwar U.S. business cycles: An empirical investigation. Journal of Money, Credit and Banking, 29(1), 1–16.
- Inglehart, R., Haerpfer, C., Moreno, A., Welzel, C., Kizilova, K., Diez-Medrano, J., Lagos, M., Norris, P., Ponarin, E., & Puranen, B. (2018). World values survey: All rounds – Country-pooled Datafile. J.D. Systems Institute & WVSA Secretariat.
- International Monetary Fund. (2018). Global Debt Database. International Monetary Fund.
- Joseph, V. (2022). Optimal ratio for data splitting. Statistical Analysis and Data Mining, 15(4), 531–538. https://doi.org/10.1002/sam.11583
- Kourtellos, A. (2021). The great Gatsby curve in education with a kink. Economic Letters, 208. https://doi.org/10.1016/j.econlet.2021.110054 110054.
- Lam, K., & Liu, P. (2019). Intergenerational educational mobility in Hong Kong: Are immigrants more Mobile than natives? Pacific Economic Review, 24(1), 137–157. https://doi.org/10.1111/1468-0106.12215
- Lee, H., & Lee, J. (2020). Patterns and determinants of intergenerational educational mobility: Evidence across countries. Pacific Economic Review, 26(1), 70–90. https://doi.org/10.1111/1468-0106.12342
- Lochner, L., & Park, Y. (2022). Earnings dynamics and intergenerational transmission of skill. Journal of Econometrics (forthcoming), 105255. https://doi.org/10.1016/j.jeconom.2021.12.009
10.1016/j.jeconom.2021.12.009 Google Scholar
- Louppe, G. (2014). Understanding Random Forests: From Theory to Practice. PhD Thesis.
- Lundberg, S., & Lee, S. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30.
- Narayan, A., Weide, R., Cojocaru, A., Lakner, C., Redaelli, S., Mahler, D., Ramasubbaiah, R., & Thewissen, S. (2018). Fair Progress? Economic mobility across generations around the world. Equity and development. World Bank.
10.1596/978-1-4648-1210-1 Google Scholar
- Neidhöfer, G., & Stockhausen, M. (2018). Dynastic inequality compared: Multigenerational mobility in the United States, the United Kingdom, and Germany. Review of Income and Wealth, 65(2), 383–414. https://doi.org/10.1111/roiw.12364
10.1111/roiw.12364 Google Scholar
- Nybom, M., & Stuhler, J. (2016). Heterogeneous income profiles and lifecycle bias in intergenerational mobility estimation. Journal of Human Resources, 51(1), 239–268.
- OECD. (2018). A broken social elevator? In How to promote social mobility. OECD Publishing, Paris.
10.1787/9789264301085-en Google Scholar
- Olivetti, C., & Paserman, M. (2015). In the name of the son (and the daughter): Intergenerational mobility in the United States, 1850-1940. American Economic Review, 105(8), 2695–2724. https://doi.org/10.1257/aer.20130821
- Probst, P. (2019). Hyperparameters, Tuning and Meta-Learning for Random Forest and Other Machine Learning Algorithms. PhD Thesis.
- Schneebaum, A., Rumplmaier, B., & Altzinger, W. (2016). Gender and migration background in intergenerational educational mobility. Education Economics, 24(3), 239–260. https://doi.org/10.1080/09645292.2015.1006181
10.1080/09645292.2015.1006181 Google Scholar
- Shapley, L. (1953). A Value for N-person Games. In H. Kuhn & A. Tucker (Eds.), Contributions to the theory of games II (pp. 307–317). Princeton University Press.
10.1515/9781400881970-018 Google Scholar
- Shorrocks, A. (2013). Decomposition procedures for distributional analysis: A unified framework based on the Shapley value. The Journal of Economic Inequality, 11(1), 99–126. https://doi.org/10.1007/s10888-011-9214-z
- Solon, G. (2004). A model of intergenerational mobility variation over time and place. In M. Corak (Ed.), Generational income mobility in North America and Europe (pp. 38–47). Cambridge University Press.
10.1017/CBO9780511492549.003 Google Scholar
- Städler, N., & Bühlmann, P. (2014). Pattern alternating maximization algorithm for missing data in high-dimensional problems. Journal of Machine Learning Research, 15(55), 1903–1928.
- Stekhoven, D., & Bühlmann, P. (2018). MissForest – Non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118. https://doi.org/10.1093/bioinformatics/btr597
10.1093/bioinformatics/btr597 Google Scholar
- Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., & Altman, R. (2001). Missing value estimation methods for DNA microarrays. Bioinformatics, 17(6), 520–525. https://doi.org/10.1093/bioinformatics/17.6.520
- Urbina, D. (2018). Intergenerational educational mobility during expansion reform: Evidence from Mexico. Population Research and Policy Review, 37(3), 367–417. https://doi.org/10.1007/s11113-018-9466-4
- Van Buuren, S., & Oudshoorn, K. (1999). Flexible multivariate imputation by MICE. Technical Report. TNO Prevention Center.
- World Bank. (2018a). World development indicators. World Bank Group.
- World Bank. (2018b). World development report 2018: Learning to realize Education's promise. World Bank Group.
10.1596/978-1-4648-1096-1 Google Scholar