Recognition of Structural Components and Surface Damage Using Regularization-Based Continual Learning
Yung-I Chang
Department of Civil Engineering , National Taiwan University , Taipei , Taiwan , ntu.edu.tw
Search for more papers by this authorCorresponding Author
Rih-Teng Wu
Department of Civil Engineering , National Taiwan University , Taipei , Taiwan , ntu.edu.tw
Search for more papers by this authorYung-I Chang
Department of Civil Engineering , National Taiwan University , Taipei , Taiwan , ntu.edu.tw
Search for more papers by this authorCorresponding Author
Rih-Teng Wu
Department of Civil Engineering , National Taiwan University , Taipei , Taiwan , ntu.edu.tw
Search for more papers by this authorAbstract
The identification of surface damage and structural components is critical for structural health monitoring (SHM) in order to evaluate building safety. Recently, deep neural networks (DNNs)–based approaches have emerged rapidly. However, the existing approaches often encounter catastrophic forgetting when the trained model is used to learn new classes of interest. Conventionally, joint training of the network on both the previous and new data is employed, which is time-consuming and demanding for computation and memory storage. To address this issue, we propose a new approach that integrates two continual learning (CL) algorithms, i.e., elastic weight consolidation (EWC) and learning without forgetting (LwF), denoted as EWCLwF. We also investigate two scenarios for a comprehensive discussion, incrementally learning the classes with similar versus dissimilar data characteristics. Results have demonstrated that EWCLwF requires significantly less training time and data storage compared to joint training, and the average accuracy is enhanced by 0.7%–4.5% compared against other baseline references in both scenarios. Furthermore, our findings reveal that all CL-based approaches benefit from similar data characteristics, while joint training not only fails to benefit but performs even worse, which indicates a scenario that can emphasize the advantage of our proposed approach. The outcome of this study will enhance the long-term monitoring of progressively increasing learning classes in SHM, leading to more efficient usage and management of computing resources.
Conflicts of Interest
The authors declare no conflicts of interest.
Open Research
Data Availability Statement
The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.
References
- 1 Cha Y.-J., Ali R., Lewis J., and Büyüköztürk O., Deep Learning-Based Structural Health Monitoring, Automation in Construction. (2024) 161, https://doi.org/10.1016/j.autcon.2024.105328.
- 2
Ali R.,
Chuah J. H.,
Talip M. S. A.,
Mokhtar N., and
Shoaib M. A., Structural Crack Detection Using Deep Convolutional Neural Networks, Automation in Construction. (2022) 133, https://doi.org/10.1016/j.autcon.2021.103989.
10.1016/j.autcon.2021.103989 Google Scholar
- 3 Cha Y.-J., Choi W., and Büyüköztürk O., Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks, Computer-Aided Civil and Infrastructure Engineering. (2017) 32, no. 5, 361–378, https://doi.org/10.1111/mice.12263, 2-s2.0-85017098035.
- 4
Zhang L.,
Yang F.,
Zhang Y. D., and
Zhu Y. J., Road Crack Detection Using Deep Convolutional Neural Network, 2016 IEEE International Conference on Image Processing (ICIP), 2016, IEEE, 3708–3712.
10.1109/ICIP.2016.7533052 Google Scholar
- 5 Xu Y., Bao Y., Chen J., Zuo W., and Li H., Surface Fatigue Crack Identification in Steel Box Girder of Bridges by a Deep Fusion Convolutional Neural Network Based on Consumer-Grade Camera Images, Structural Health Monitoring. (2019) 18, no. 3, 653–674, https://doi.org/10.1177/1475921718764873, 2-s2.0-85045036996.
- 6 Gao Y. and Mosalam K. M., Deep Transfer Learning for Image-Based Structural Damage Recognition, Computer-Aided Civil and Infrastructure Engineering. (2018) 33, no. 9, 748–768, https://doi.org/10.1111/mice.12363, 2-s2.0-85045740725.
- 7 Wang C., Wang N., Ho S.-C., Chen X., and Song G., Design of a New Vision-Based Method for the Bolts Looseness Detection in Flange Connections, IEEE Transactions on Industrial Electronics. (2020) 67, no. 2, 1366–1375, https://doi.org/10.1109/tie.2019.2899555.
- 8 Thai H.-T., Machine Learning for Structural Engineering: A State-Of-The-Art Review, Structures, 2022, 38, Elsevier, 448–491.
- 9 Dong C.-Z. and Catbas F. N., A Review of Computer Vision–Based Structural Health Monitoring at Local and Global Levels, Structural Health Monitoring. (2021) 20, no. 2, 692–743, https://doi.org/10.1177/1475921720935585.
- 10 Jia J. and Li Y., Deep Learning for Structural Health Monitoring: Data, Algorithms, Applications, Challenges, and Trends, Sensors. (2023) 23, no. 21, https://doi.org/10.3390/s23218824.
- 11 Gao Y., Zhai P., and Mosalam K. M., Balanced Semisupervised Generative Adversarial Network for Damage Assessment from Low-Data Imbalanced-Class Regime, Computer-Aided Civil and Infrastructure Engineering. (2021) 36, no. 9, 1094–1113, https://doi.org/10.1111/mice.12741.
- 12 Jia D., Dong W., Socher R., Li Li-J., Li K., and Fei-Fei Li, Imagenet: A Large-Scale Hierarchical Image Database, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, IEEE, 248–255.
- 13
Gao Y. and
Mosalam K. M., Peer Hub Imagenet: A Large-Scale Multiattribute Benchmark Data Set of Structural Images, Journal of Structural Engineering. (2020) 146, no. 10, https://doi.org/10.1061/(asce)st.1943-541x.0002745.
10.1061/(ASCE)ST.1943-541X.0002745 Google Scholar
- 14 Chiang M. and Zhang T., Fog and Iot: An Overview of Research Opportunities, IEEE Internet of Things Journal. (2016) 3, no. 6, 854–864, https://doi.org/10.1109/jiot.2016.2584538, 2-s2.0-85010031130.
- 15 Patel M., Naughton B., Chan C. et al., Mobile-Edge Computing Introductory Technical White Paper, White Paper, Mobile-Edge Computing (MEC) Industry Initiative. (2014) 29, 854–864.
- 16
Mao Y.,
You C.,
Zhang J.,
Huang K., and
Letaief K. B., A Survey on Mobile Edge Computing: The Communication Perspective, IEEE Communications Surveys Tutorials. (2017) 19, no. 4, 2322–2358, https://doi.org/10.1109/comst.2017.2745201, 2-s2.0-85028734877.
10.1109/COMST.2017.2745201 Google Scholar
- 17 McCloskey M. and Cohen N. J., Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem, Psychology of Learning and Motivation, 1989, 24, Elsevier, 109–165, https://doi.org/10.1016/s0079-7421(08)60536-8, 2-s2.0-77957064197.
- 18 Goodfellow I. J., Mirza M., Xiao Da, Courville A., and Bengio Y., An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks, 2013, https://arxiv.org/abs/1312.6211.
- 19 Chen Z., Golan I., Hoffer E., and Soudry D., Task Agnostic Continual Learning Using Online Variational Bayes, 2018, https://arxiv.org/abs/1803.10123.
- 20 Van de Ven G. M. and Tolias A. S., Generative Replay With Feedback Connections as a General Strategy for Continual Learning, 2018, https://arxiv.org/abs/1809.10635.
- 21 Van de Ven G. M. and Tolias A. S., Three Scenarios for Continual Learning, 2019, https://arxiv.org/abs/1904.07734.
- 22 Masana M., Liu X., Twardowski B., Menta M., Bagdanov A. D., and Van De Weijer J., Class-Incremental Learning: Survey and Performance Evaluation on Image Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence. (2023) 45, no. 5, 5513–5533, https://doi.org/10.1109/tpami.2022.3213473.
- 23 Masse N. Y., Grant G. D., and Freedman D. J., Alleviating Catastrophic Forgetting Using Context-Dependent Gating and Synaptic Stabilization, Proceedings of the National Academy of Sciences of the United States of America. (2018) 115, no. 44, E10467–E10475, https://doi.org/10.1073/pnas.1803839115, 2-s2.0-85055659881.
- 24 Shu J., Ding W., Zhang J., Lin F., and Duan Y., Continual-Learning-Based Framework for Structural Damage Recognition, Structural Control and Health Monitoring. (2022) 29, no. 11, https://doi.org/10.1002/stc.3093.
- 25
Rebuffi S.-A.,
Kolesnikov A.,
Sperl G., and
Lampert C. H., Icarl: Incremental Classifier and Representation Learning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, 2001–2010.
10.1109/CVPR.2017.587 Google Scholar
- 26
Wu Y.,
Chen Y.,
Wang L. et al., Large Scale Incremental Learning, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, 374–382.
10.1109/CVPR.2019.00046 Google Scholar
- 27
Castro F. M.,
áMarín-Jiménez M. J.,
Guil N.,
Schmid C., and
Alahari K., End-to-End Incremental Learning, Proceedings of the European Conference on Computer Vision, 2018, ECCV, 233–248.
10.1007/978-3-030-01258-8_15 Google Scholar
- 28 Prabhu A., Torr P. H. S., and Dokania P. K., Gdumb: A Simple Approach that Questions Our Progress in Continual Learning, Computer Vision–ECCV 2020: 16th European Conference, August 2020, Glasgow, UK, Springer, 524–540.
- 29
Chaudhry A.,
Dokania P. K.,
Ajanthan T., and
Torr P. H. S., Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence, Proceedings of the European Conference on Computer Vision, 2018, ECCV, 532–547.
10.1007/978-3-030-01252-6_33 Google Scholar
- 30 Zhang M., Wang T., Lim J. H., Kreiman G., and Feng J., Variational Prototype Replays for Continual Learning, 2019, https://arxiv.org/abs/1905.09447.
- 31 Shin H., Lee J. K., Kim J., and Kim J., Continual Learning with Deep Generative Replay, Advances in Neural Information Processing Systems. (2017) 30.
- 32
Ostapenko O.,
Puscas M.,
Klein T.,
Jahnichen P., and
Nabi M., Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, 11321–11329.
10.1109/CVPR.2019.01158 Google Scholar
- 33
Xiang Ye,
Fu Y.,
Ji P., and
Huang H., Incremental Learning Using Conditional Adversarial Networks, Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, 6619–6628.
10.1109/ICCV.2019.00672 Google Scholar
- 34 Kemker R. and Kanan C., Fearnet: Brain-Inspired Model for Incremental Learning, 2017, https://arxiv.org/abs/1711.10563.
- 35
Aljundi R.,
Babiloni F.,
Elhoseiny M.,
Rohrbach M., and
Tuytelaars T., Memory Aware Synapses: Learning what (Not) to Forget, Proceedings of the European Conference on Computer Vision (ECCV), 2018, 139–154.
10.1007/978-3-030-01219-9_9 Google Scholar
- 36 Kirkpatrick J., Pascanu R., Rabinowitz N. et al., Overcoming Catastrophic Forgetting in Neural Networks, Proceedings of the National Academy of Sciences. (2017) 114, no. 13, 3521–3526, https://doi.org/10.1073/pnas.1611835114, 2-s2.0-85016395012.
- 37 Zenke F., Poole B., and Ganguli S., Continual Learning through Synaptic Intelligence, International Conference on Machine Learning, 2017, PMLR, 3987–3995.
- 38
Buciluǎ C.,
Caruana R., and
Niculescu-Mizil A., Model Compression, Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006, 535–541.
10.1145/1150402.1150464 Google Scholar
- 39 Li Z. and Hoiem D., Learning Without Forgetting, IEEE Transactions on Pattern Analysis and Machine Intelligence. (2018) 40, no. 12, 2935–2947, https://doi.org/10.1109/tpami.2017.2773081, 2-s2.0-85035137409.
- 40 Jung H., Ju J., Jung M., and Kim J., Less-forgetting Learning in Deep Neural Networks, 2016, https://arxiv.org/abs/1607.00122.
- 41
Dhar P.,
Singh R. V.,
Peng K.-C.,
Wu Z., and
Chellappa R., Learning without Memorizing, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, 5138–5146.
10.1109/CVPR.2019.00528 Google Scholar
- 42
Zhang J.,
Zhang J.,
Ghosh S. et al., Class-Incremental Learning via Deep Model Consolidation, Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2020, 1131–1140.
10.1109/WACV45572.2020.9093365 Google Scholar
- 43
Hou S.,
Pan X.,
Loy C. C.,
Wang Z., and
Lin D., Learning a Unified Classifier Incrementally via Rebalancing, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, 831–839.
10.1109/CVPR.2019.00092 Google Scholar
- 44
Belouadah E. and
Popescu A., Il2m: Class Incremental Learning with Dual Memory, Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, 583–592.
10.1109/ICCV.2019.00067 Google Scholar
- 45
Kann B.,
Castellanos-Paez S., and
Lalanda P., Evaluation of Regularization-Based Continual Learning Approaches: Application to Har, 2023 IEEE International Conference on Pervasive Computing and Communications Workshops and Other Affiliated Events (PerCom Workshops), 2023, IEEE, 460–465.
10.1109/PerComWorkshops56833.2023.10150281 Google Scholar
- 46 Kim H.-E., Kim S., and Lee J., Keep and Learn: Continual Learning by Constraining the Latent Space for Knowledge Preservation in Neural Networks, Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, September 2018, Granada, Spain, Springer, 520–528.
- 47 Wu R.-T., Singla A., Jahanshahi M. R., Bertino E., Ko B. J., and Verma D., Pruning Deep Convolutional Neural Networks for Efficient Edge Computing in Condition Assessment of Infrastructures, Computer-Aided Civil and Infrastructure Engineering. (2019) 34, no. 9, 774–789, https://doi.org/10.1111/mice.12449, 2-s2.0-85066137718.
- 48 Dorafshan S., Thomas R. J., and Maguire M., Comparison of Deep Convolutional Neural Networks and Edge Detectors for Image-Based Crack Detection in Concrete, Construction and Building Materials. (2018) 186, 1031–1045, https://doi.org/10.1016/j.conbuildmat.2018.08.011, 2-s2.0-85051379739.
- 49 He H. and Garcia E. A., Learning from Imbalanced Data, IEEE Transactions on Knowledge and Data Engineering. (2009) 21, no. 9, 1263–1284, https://doi.org/10.1109/tkde.2008.239, 2-s2.0-68549133155.
- 50 Ma Z., Xie X., and Geng Z., Structural Learning of Chain Graphs via Decomposition, Journal of Machine Learning Research: JMLR. (2008) 9, no. 11, 2847–2880.
- 51 Hotelling H., Analysis of a Complex of Statistical Variables into Principal Components, Journal of Educational Psychology. (1933) 24, no. 7, 498–520, https://doi.org/10.1037/h0070888, 2-s2.0-0001710505.
- 52 Torgerson W. S., Multidimensional Scaling: I. Theory and Method, Psychometrika. (1952) 17, no. 4, 401–419, https://doi.org/10.1007/bf02288916, 2-s2.0-84950351930.
- 53 Hinton G. E. and Roweis S., Stochastic Neighbor Embedding, Advances in Neural Information Processing Systems. (2002) 15.
- 54 MacKay D. J. C., A Practical Bayesian Framework for Backpropagation Networks, Neural Computation. (1992) 4, no. 3, 448–472, https://doi.org/10.1162/neco.1992.4.3.448.
- 55 Hinton G., Vinyals O., and Dean J., Distilling the Knowledge in a Neural Network, 2015.
- 56
He K.,
Zhang X.,
Ren S., and
Sun J., Deep Residual Learning for Image Recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, 770–778.
10.1109/CVPR.2016.90 Google Scholar
- 57 Lopez-Paz D. and Ranzato M. A., Gradient Episodic Memory for Continual Learning, Advances in Neural Information Processing Systems. (2017) 30.