Graph isomorphism network with weighted multi-aggregators for building shape classification
Ya Zhang
Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, China
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorCorresponding Author
Jiping Liu
Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, China
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Correspondence
Jiping Liu, Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China.
Email: [email protected]
Search for more papers by this authorYong Wang
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorYungang Cao
Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, China
Search for more papers by this authorShenghua Xu
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorAn Luo
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorYa Zhang
Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, China
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorCorresponding Author
Jiping Liu
Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, China
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Correspondence
Jiping Liu, Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China.
Email: [email protected]
Search for more papers by this authorYong Wang
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorYungang Cao
Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, China
Search for more papers by this authorShenghua Xu
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorAn Luo
Research Center of Geospatial Big Data Application, Chinese Academy of Surveying and Mapping, Beijing, China
Search for more papers by this authorAbstract
Building shape cognition is essential for tasks, such as map generalization, urban modeling, and building semantics and distribution pattern recognition. Traditional geometric and statistical methods rely on human-defined shape indicators, and spectral-based graph neural networks (GNNs) require Laplacian eigendecomposition, resulting in high algorithmic complexity. Therefore, we proposed a low-complexity and simple-to-use spatial-domain GNN for differentiating building shapes. To examine the influence of the building vertices on their shape, we treated each building as a graph and proposed a graph isomorphic network with weighted multi-aggregators (GIN-WMA) by analyzing the node connectivity of a building graph. The GIN-WMA utilizes a novel aggregator that combines the sum and max aggregators, enhancing its recognition and differentiation capabilities. This approach can effectively differentiate nodes that have identical features after aggregation by the sum aggregator. We extracted features considering both local node and global shape features, drawing inspiration from Gestalt cognitive psychology and GNN's “node–graph” differentiation strategy. In addition, we compared the performance of GIN-WMA with existing methods, studying the effect of various node features and their combinations on classification accuracy. The results demonstrated that GIN-WMA outperforms other methods in discriminating building shapes, demonstrating superior capabilities in shape classification and enabling end-to-end extraction and classification of building shapes.
CONFLICT OF INTEREST STATEMENT
The authors declare that they have no conflict of interest.
Open Research
DATA AVAILABILITY STATEMENT
The data that support this study are available at: https://figshare.com/articles/dataset/building_data/22258768.
REFERENCES
- Ai, T., Cheng, X., Liu, P., & Yang, M. (2013). A shape analysis and template matching of building features by the Fourier transform method. Computers, Environment and Urban Systems, 41(9), 219–233. https://doi.org/10.1016/j.compenvurbsys.2013.07.002
10.1016/j.compenvurbsys.2013.07.002 Google Scholar
- Ai, T., Shuai, Y., & Li, J. (2008). The shape cognition and query supported by Fourier transform, advances in spatial data handling. The 13rd SDH International Conference, Montpellier, France. Springer-Verlag.
- Ai, T., Shuai, Y., & Li, J. (2009). A spatial query based on shape similarity cognition. Acta Geodaetica et Cartographica Sinica, 38(4), 356–362. https://doi.org/10.3321/j.issn:1001-1595.2009.04.012
10.3321/j.issn:1001?1595.2009.04.012 Google Scholar
- Bai, X., & Latecki, L. J. (2008). Path similarity skeleton graph matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(7), 1282–1292. https://doi.org/10.1109/TPAMI.2007.70769
- Basaraner, M., & Cetinkaya, S. (2017). Performance of shape indices and classification schemes for characterising perceptual shape complexity of building footprints in GIS. International Journal of Geographical Information Science, 31(10), 1–26. https://doi.org/10.1080/13658816.2017.1346257
10.1080/13658816.2017.1346257 Google Scholar
- Belongie, S. J., Malik, J. M., & Puzicha, J. (2002). Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(4), 509–522. https://doi.org/10.1109/34.993558
- Bronstein, M. M., Bruna, J., Lecun, Y., Szlam, A., & Vandergheynst, P. (2017). Geometric deep learning: Going beyond euclidean data. IEEE Signal Processing Magazine, 34(4), 18–42. https://doi.org/10.1109/MSP.2017.2693418
- Guo, Q., Wei, Z., Wang, Y., & Wang, L. (2017). The method of extracting spatial distribution characteristics of buildings combined with the feature classification and proximity graph. Acta Geodaetica et Cartographica Sinica, 46(5), 631–638. https://doi.org/10.11947/j.AGCS.2017.20160374
10.11947/j.AGCS.2017.20160374 Google Scholar
- Jiang, Z., Li, M., & Zhang, J. (2020). Graph convolution of fusion meta-path based heterogeneous network representation learning. Computer Science, 47(7), 231–235. https://doi.org/10.11896/jsjkx.190600085
10.11896/jsjkx.190600085 Google Scholar
- Jiao, Y., Liu, P., Liu, A., & Liu, L. (2022). Map building shape classification method based on Alexnet. Journal of Geo-Information Science, 24(12), 2333–2341. https://doi.org/10.12082/dqxxkx.2022.210396
10.12082/dqxxkx.2022.210396 Google Scholar
- Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84–90. https://doi.org/10.1145/3065386
10.1145/3065386 Google Scholar
- Li, W., Goodchild, M. F., & Church, R. (2013). An efficient measure of compactness for two-dimensional shapes and its application in regionalization problems. International Journal of Geographical Information Science, 27(6), 1227–1250. https://doi.org/10.1080/13658816.2012.752093
- Li, Z., Yan, H., Ai, T., & Chen, J. (2004). Automated building generalization based on urban morphology and gestalt theory. International Journal of Geographical Information Science, 18(5), 513–534. https://doi.org/10.1080/13658810410001702021
- Liu, C., Zhai, R., Qian, H., Gong, X., Wang, A., & Wu, F. (2023). Identification of drainage patterns using a graph convolutional neural network. Transactions in GIS, 27, 752–776. https://doi.org/10.1111/tgis.13041
10.1111/tgis.13041 Google Scholar
- Mao, L., Ren, F., Yang, D., & Zhang, R. (2023). CNN based transformer for panoptic segmentation. Journal of Software, 34(7), 3408–3421. (in Chinese). https://doi.org/10.13328/j.cnki.jos.006530
10.13328/j.cnki.jos.006530 Google Scholar
- Niu, N., Liu, X., Jin, H., Ye, X., Liu, Y., Li, X., Chen, Y., & Li, S. (2017). Integrating multi-source big data to infer building functions. International Journal of Geographical Information Science, 31(9), 1871–1890. https://doi.org/10.1080/13658816.2017.1325489
- Osserman, R. (1978). Isoperimetric inequality. Bulletin of the American Mathematical Society, 84(6), 1182–1238. https://doi.org/10.1090/S0002-9904-1978-14553-4
- Palmer, S. E. (1999). Vision science: Photons to phenomenology. MIT Press.
- Shuman, D. I., Narang, S. K., Frossard, P., Ortega, A., & Vandergheynst, P. (2013). The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Processing Magazine, 30(3), 83–98. https://doi.org/10.1109/MSP.2012.2235192
- Steiniger, S., Lange, T., Burghardt, D., & Weibel, R. (2008). An approach for the classification of urban building structures based on discriminant analysis techniques. Transactions in GIS, 12(1), 31–59. http://doi.org/10.1111/j.1467-9671.2008.01085.x
10.1111/j.1467-9671.2008.01085.x Google Scholar
- Wang, M., Shi, D., Guan, N., Zhang, T., Wang, L., & Li, R. (2019). Unsupervised pedestrian trajectory prediction with graph neural networks. IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), Portland, OR, 832–839. https://doi.org/10.1109/ICTAI.2019.00119
10.1109/ICTAI.2019.00119 Google Scholar
- Wei, Z., Guo, Q., Cheng, L., Liu, Y., & Tong, Y. (2021). Shape similarity measurement based on DNA alignment for buildings with multiple orthogonal features. Acta Geodaetica et Cartographica Sinica, 50(12), 1683–1693. https://doi.org/10.11947/j.AGCS.2021.20200227
10.11947/j.AGCS.2021.20200227 Google Scholar
- Weisfeiler, B. Y., & Lehman, A. A. (1968). A reduction of a graph to a canonical form and an algebra arising during this reduction. Nauchno-Technicheskaya Informatsia, 2(9), 12–16.
- Wertheimer, M. (1938). Laws of organization in perceptual forms. In W. D. Ellis (Ed.), A source book of gestalt psychology (pp. 71–88). Routledge & Kegan Paul. https://doi.org/10.1037/11496-005
10.1037/11496-005 Google Scholar
- Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., & Philip, S. Y. (2021). A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 32(1), 4–24. https://doi.org/10.1109/TNNLS.2020.2978386
- Xu, K., Hu, W., Leskovec, J., & Jegelka, S. (2019). How powerful are graph neural networks? Preprint, arXiv:1810.00826v3.
- Xu, X., Liu, P., & Guo, M. (2023). Drainage pattern recognition of river network based on graph convolutional neural network. ISPRS International Journal of Geo-Information, 12(7), 253. https://doi.org/10.3390/ijgi12070253
10.3390/ijgi12070253 Google Scholar
- Yan, H., Ying, S., & Li, L. (2008). An approach for automated building grouping and generalization considering multiple parameters. Geomatics and Information Science of Wuhan University, 33(1), 51–54.
- Yan, X., Ai, T., & Yang, M. (2016). A simplification of residential feature by the shape cognition and template matching method. Acta Geodaetica et Cartographica Sinica, 45(7), 874–882. https://doi.org/10.11947/j.AGCS.2016.20150162
10.11947/j.AGCS.2016.20150162 Google Scholar
- Yan, X., Ai, T., Yang, M., & Tong, X. (2020). Graph convolutional autoencoder model for the shape coding and cognition of buildings in maps. International Journal of Geographical Information Science, 35(3), 490–512. https://doi.org/10.1080/13658816.2020.176826
10.1080/13658816.2020.1768260 Google Scholar
- Yan, X., Ai, T., Yang, M., & Yin, H. (2019). A graph convolutional neural network for classification of building patterns using spatial vector data. ISPRS Journal of Photogrammetry and Remote Sensing, 150, 259–273. https://doi.org/10.1016/j.isprsjprs.2019.02.010
- Yan, X., Yuan, T., Yang, M., Kong, B., & Liu, P. (2022). An adaptive building simplification approach based on shape analysis and representation. Acta Geodaetica et Cartographica Sinica, 51(2), 269–278. https://doi.org/10.11947/j.AGCS.2022.20210302
10.11947/j.AGCS.2022.20210302 Google Scholar
- Yang, C., Wei, H., & Yu, Q. (2018). A novel method for 2D nonrigid partial shape matching. Neurocomputing, 275(31), 1160–1176. https://doi.org/10.1016/j.neucom.2017.09.067
10.1016/j.neucom.2017.09.067 Google Scholar
- Yu, H., Ai, T., Yang, M., Huang, L., & Yuan, J. (2022). A recognition method for drainage patterns using a graph convolutional network. International Journal of Applied Earth Observation and Geoinformation, 107, 102696. https://doi.org/10.1016/j.jag.2022.102696
- Yu, Y., He, K., Wu, F., & Xu, J. (2021). Graph convolution neural network method for shape classification of areal settlements. Acta Geodaetica et Cartographica Sinica, 51(11), 2390–2402. https://doi.org/10.11947/j.AGCS.2022.20210134
10.11947/j.AGCS.2022.20210134 Google Scholar
- Zahn, C. T., & Roskies, R. Z. (2009). Fourier descriptors for plane closed curves. IEEE Transactions on Computers, 100(3), 269–281. https://doi.org/10.1109/TC.1972.5008949
10.1109/TC.1972.5008949 Google Scholar
- Zhang, D., & Lu, G. (2003). A comparative study of curvature scale space and Fourier descriptors for shape-based image retrieval. Journal of Visual Communication and Image Representation, 14(1), 39–57. https://doi.org/10.1016/S1047-3203(03)00003-8
- Zhang, X., Stoter, J., Ai, T., Kraak, M. J., & Molenaar, M. (2013). Automated evaluation of building alignments in generalized maps. International Journal of Geographical Information Science, 27(8), 1550–1571. https://doi.org/10.1080/13658816.2012.758264
- Zhao, L., Song, Y., Zhang, C., Liu, Y., Wang, P., Lin, T., Deng, M., & Li, H. (2020). T-GCN: A temporal graph convolutional network for traffic prediction. IEEE Transactions on Intelligent Transportation Systems, 21(9), 3848–3858. https://doi.org/10.1109/TITS.2019.2935152
- Zhao, R., Ai, T., Yu, W., He, Y., & Shen, Y. (2020). Recognition of building group patterns using graph convolutional network. Cartography and Geographic Information Science, 47(5), 400–417. https://doi.org/10.1080/15230406.2020.1757512