Volume 28, Issue 7 pp. 2328-2345
RESEARCH ARTICLE

Investigating the Potential of Nighttime Light Data to Estimate Travel Demand

Chao Sun

Corresponding Author

Chao Sun

Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing, China

Jiangsu Province Collaborative Innovation Center of Modern Urban Traffic Technologies, Southeast University, Nanjing, China

School of Transportation, Southeast University, Nanjing, China

Urban and Data Science Lab, Hiroshima University, Hiroshima, Japan

Correspondence:

Chao Sun ([email protected])

Search for more papers by this author
Jian Lu

Jian Lu

Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing, China

Jiangsu Province Collaborative Innovation Center of Modern Urban Traffic Technologies, Southeast University, Nanjing, China

School of Transportation, Southeast University, Nanjing, China

Search for more papers by this author
First published: 30 August 2024

Funding: This work was supported by the National Natural Science Foundation of China (Grant 52072071), the Postgraduate Research & Practice Innovation Program of Jiangsu Province (Grant KYCX22_0285), and the Program of China Scholarship Council (Grant 202306090136).

ABSTRACT

Taking the bike-sharing travel demand (BSTD) as an example, this study investigates the potential of Nighttime Light (NTL) data to optimize forecasting performance and replace the land use factors. Stepwise regression is trained with the travel demand in each unit as the dependent variable, and land use factors are introduced as the independent variable one by one, which finds the set of independent variables. Five machine learning algorithms driven by ensemble learning and decision trees including the GBDT, Random Forecast, Adaboost, Extratrees, and Catboost, are employed and evaluated to achieve comparative analysis of “before considering-after considering NTL data”. The methodological verification of Beijing city shows: (1) Adaboost and GBDT are superior to all other algorithms, since they generally have the highest R2, lowest RMSE, and lowest absolute MAPE. (2) All methods by employing NTL data obviously optimize the performance of BSTD forecast with decreased RMSE, decreased MAPE, etc. In particular, GBDT performs the best in reducing MSE, with a percentage of −99.99% in the training set and −86.985% in the test set, which AdaBoost, Extratrees, and Catboost follow. (3) Land use factors no longer make sense in predicting BSTD after employing NTL data, and NTL data has covered the roles of land use factors to ensure accuracy. The conclusions presented here enrich our understanding of the relative roles of land use factors and NTL data in travel demand and boost our optimization in traffic prediction in the future.

Conflicts of Interest

The authors declare no conflicts of interest.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.