In this study, we introduce a Generative learning-based approach to accelerate Finite Element simulations. The aim is to explore the ability of a Generative learning-based approach to predict the output of Finite Element simulations. Often, the drawback of classical regression models is that they are bound to the distribution of training data and cannot extrapolate outside of the training domain. In the current study, to overcome this, we propose a Generative Adversarial Network (GAN) based approach for learning to infer structural mechanics simulations. GANs have the ability to learn the underlying distribution of data and can efficiently extrapolate outside the training range. We train the proposed network on data obtained from classical Finite Element simulations based on linear elastic and Johnson-Cook plasticity models. The goal here is to understand the ability of a GAN to replace FEM by incorporating the underlying mechanics knowledge into the network to enhance the generalizing ability of the network for various materials on various loading and boundary conditions. We propose an encoder-decoder-like U-Net generator architecture with activated skip connections and a PatchGAN-based discriminator that discriminates based on local patches of data leading to a more robust critique for the generator. The advantage of the proposed framework is that it takes inputs such as the nodal information, their corresponding edges, nodal coordinates and the boundary conditions for each particular node from a Finite Element pre-processor and computes the von Mises stress at each node as output that can be read by a Finite Element post-processor. Also, the novelty of the proposed GAN is that, unlike the existing literature, it takes nodal values such as applied Boundary conditions and coordinates as inputs instead of images and performs nonlinear regression with Finite Element data and gives von-Mises stresses at each node as output.

1 INTRODUCTION

Finite Element simulations have become a crucial part of manufacturing and are extensively used in research, academics and industry. These numerical methods can, depending on the complexity of the boundary value problem (BVP) and geometry being solved, lead to computationally extensive and complex simulations. Often their high computational cost is controlled by using reduced order models or surrogates, but at the expense of accuracy [1-6]. Vehicle crash analysis, for instance, is one of the more complicated problems solved using FE methods that have high computational costs and consume a lot of time [7] due to their geometric, material and contact nonlinearity.

Machine learning-based algorithms are gaining a lot of momentum for accelerating the computation time of structural simulations. Deep learning methods have demonstrated excellent results when utilized to speed up the physical simulations and learn the underlying physics without prior knowledge of the mechanical model [8-11]. In [12, 13], CNN-based models have been used to accelerate FE simulations without penalizing accuracy. Also, CNN-based models have been extensively used in solid mechanical simulations for inhomogeneous non-linear materials [14] and in continuum mechanical simulations [15].

The success in deep learning has emerged from finding the right structural inductive bias to apply to the correct network architecture. The success of CNN models in modelling FE simulation results from their translation invariance, weight sharing and locality which are desirable inductive biases for these FE or continuum-based models. That said, CNNs have a huge drawback concerning the generalization of the model. They can only interpolate inside the training range but cannot extrapolate to unseen combinations of data. This is where Generative Adversarial Networks (GANs) come in. The generator is indirectly trained based on the loss of the discriminator. This helps the generator understand the training set's underlying data distribution and generate samples from a similar kind of distribution rather than mapping inputs to outputs. This quality of the GANs is very desirable in modelling networks that do not overfit and can generalize on new data. In [16], a hybrid FEM and GAN-based method was used to classify faults in rotor-bearing systems. TopologyGAN was proposed in [17] to perform topology optimization of structures with isotropic solid material behaviour. Simple body deformations with a simply supported structure loaded at the centre are modelled in [18]. The novelty of the present study is that, unlike existing literature where GANs are used to generate synthetic data or used for data augmentation, we implement the generator as an FE surrogate that solves the BVP predicting required outputs. Hence rather than generating synthetic data through GANs, we use them to accept input from a FE Pre-processor and generate an output of a FE solver which can be further read by an FE Post-processor.

In the literature, images have been predominantly used as input and output to the GANs and even in mechanics-based applications to model the physical problem. In this study, we take the discretized BVP from an FE pre-processor with inputs such as the nodal information, their corresponding edges, nodal coordinates and the boundary conditions for each particular node from a Finite Element pre-processor and compute the von-Mises stress at each node along with their edge connections as output that can be read by a Finite Element post-processor.

2 GENERATION OF DATA FROM FE SIMULATIONS

We chose a rectangular plate with a hole, a well-studied DFG benchmark problem, for our study. The dimensions of the plate are varied between 0.2 to 0.4 m and also the position and diameter of the hole. The plate has Dirichlet boundary conditions applied on its left and right edges with the left edge completely fixed and the right edge loaded with tensile and compressive displacements. The displacement is varied from 0.01 to 0.1 m with an increment set of [0.02,0.05,0.1] m depending on the sidelength of the geometry so as to prevent excessive deformation or unrealistically high load for the geometry. This resulted in 6 different cases which constitute 5 plates for each case and different loads resulting in 3200 samples. Figure 1 shows the 6 cases of plates with holes.

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

Pre-processor configuration of the six cases considered for data generation.

The plate is simulated using aluminium with elastoplastic material behaviour using the isotropic Johnson-Cook model with the following material properties [19] as shown in Table 1.

TABLE 1. Material parameters.

Young's modulus (GPa)	Poisson's ratio (-)	A (MPa)	B (MPa)	n (-)
70	0.33	520	477	0.52

The isotropic elasto-plastic Jonhson-Cook model is given by

$urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0001$ (1a)

$urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0002$ (1b)

Where $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0003$ is the yield stress at nonzero strain rate, $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0004$ is the nondimensional temperature, $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0005$ is the equivalent plastic strain rate, $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0006$ , n and C are material parameters measured from experiments, $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0007$ is the static yield stress and $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0008$ is the ratio of the yield stress at nonzero strain rate to the static yield stress.

The Finite Element solver ABAQUS is used to solve the BVP and generate data required for the GANs and the whole process is automated using Python scripting. Since the finite element solver does not output the results in an open-source format, an ODB2VTK interface is used to convert the ODB files into VTK format that is open-source and is imported into Python for training the GAN. From the discussed input combinations, a total of 3200 samples are generated.

3 GENERATIVE ADVERASARIAL NETWORKS

GANs are a class of deep learning algorithms that can generate synthetic data from latent space or from a given condition set. They possess a special ability to generate a new multidimensional tensor space that can aptly represent the underlying data distribution of the problem domain forming a compressed representation of the data distribution. The GAN framework typically consists of two deep networks namely, a generator and a discriminator. Some GANs have more than one generator or discriminator depending on the requirement. These two neural networks compete with each other in a min-max game or a zero-sum game, where one network's gain is the other network's loss. The loss function of the GAN is expressed by

$urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0009$ (2)

Here, G represents the generator, D represents the discriminator, D(x) represents the output of real input, $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0010$ is the target or actual output over all the real data instances, $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0011$ is the output generated by the generator with given noise z. The output of the discriminator for a generated input for a sample z is denoted by D(G(z)) and $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0012$ is the expected value over all random inputs to the generator. A lot of generator and discriminator architectures have been proposed since their inception. GANs have been widely used in large language models [20, 21], image segmentation, deepfakes and upscaling [22-24] and disease diagnosis [25, 26].

In this study, we propose a residual U-net-based generator architecture with activated skip connections. The architecture comprises the encoder, decoder and symmetric skip connections between the encoder and decoder blocks. The encoder block consists of densely connected convolution blocks that help propagate the input features without any losses. The skip connections from the encoder to the decoder play a key role in reconstructing the fine details of the final prediction. Each block in the encoder consists of a convolutional layer, a batch normalization layer, and a swish activation.

The discriminator is based on a PatchGAN which differentiates data based on local patches rather than single probability output. This is rather desirable in this study because the local distribution of stresses plays a vital role in the global distribution. We employ binary cross entropy as a loss function for the discriminator with a gradient penalty and gradient clipping. For the generator, a combination of mean squared error and cross-entropy losses have been used for training. The architecture of the proposed generator is shown in Figure 2.

The generator receives the nodal information from the FE preprocessor and boundary conditions as input. The generator then tries to reconstruct and predict the von-Mises stress distribution in the output. The discriminator then takes the generated data and discriminates between the real and synthesized data and this process iteratively trains both the networks.

4 NETWORK TRAINING AND HYPERPARAMETER TUNING

Out of 3200 training data samples, 3000 samples are used for training the proposed network. The network was trained for 220k epochs. The network hyperparameters such as the number of filters of convolution layers, the dimension of the latent vector for the generator and the number of residual blocks in the generator and layers of the discriminator are critical to have good model performance. To achieve this, the Hyperband search algorithm [27] is employed. The hyperparameter sampling domain and the selected hyperparameters are shown in Table 2.

TABLE 2. Hyperparameter tuning.

Hyperparameters	Paremeter set	Selected Hyperparameters
Residual blocks	[1,2,…,8]	4
Latent vector dimension	[4,8,16,…,128]	16
Discriminator layers	[1,2,…,8]	5
Generator lrate	[1e-5 to 1e-3]	5e-4
Discriminator lrate	[1e-5 to 1e-3]	1e-3

The network is optimized and trained using the Adam optimizer and mean squared error is used as the loss metric. A learning rate of 3E-3 is used with an exponential decay over epochs starting from the 200th epoch. This is done to eliminate the local minima that were being observed after 200 epochs into the training.

5 RESULTS AND DISCUSSION

The network computed results are then embedded into the VTK file with a key named “Computed Stresses” to facilitate the comparison of the results in a FE post-processing environment. Here a comparison is drawn between the von-Mises stress distribution from the FE simulation with the generator-computed von-Mises stress distribution. A computational gain of 9.8% has been observed by using the GAN, also considering the computational effort for training the network. The results section is subdivided to discuss the performance of the GANs on extrapolation results.

5.1 From the trained limit

The proposed GAN is evaluated on the test set and an MSE error of $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0013$ is achieved for the whole test batch, which roughly translated to an error of around $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0014$ and $urn:x-wiley:16177061:media:pamm202300201:pamm202300201-math-0015$ GPa for different input geometry and loading cases. The results from the generator show an excellent correlation to the FE simulations with a very small error in the magnitude. This shows that the GANs can efficiently replicate FEM simulations within the training range.

To determine the efficiency of purely data-driven models, in modelling FEM simulations, deformed displacements are also predicted using the same architecture and show good agreement with the FE results as shown in Figure 3. The prediction has a good overall structure but has some artefacts at the boundary which are discussed in the next section.

Figure 4 shows the epoch versus loss plot of the trained GAN. One can observe that the losses of the generator and discriminator follow opposite trends that is, when the generator loss increases, the discriminator loss decreases and vice versa. It is because the networks engage in a Zero-sum game and this is the reason for GANs to be notoriously hard to train.

5.2 From the extrapolation

The extrapolation ability of the proposed network is tested on a test sample as shown in Figure 5. The geometry and the loading condition are taken out of the training set for that particular case and the samples are evaluated using the trained generator. In Figure 5, although it looks identical, some artefacts can be observed in the stress distribution over the entire domain. This example is taken with a rectangular plate of side lengths 0.3 and 0.5 m (where the maximum side length of the training set is 0.4 m) respectively and a displacement of 0.125 m applied at the right end of the plate (maximum displacement applied in training is 0.1 m). The network although is able to predict the stress distribution and magnitude very well, it is not as accurate as that of the test samples with parameters in the bounds of the training range.

Although GANs have the ability to extrapolate and generate realistic samples, it is limited, in this case by the lack of physical knowledge in the network. Usually, GANs are applied to probabilistic data and trained on the likelihood of the generated data being close to realistic data. But in this case, the nature of the problem is nonlinear regression and it is a very unique scenario for GANs to be used for curve fitting. In this scenario, to improve the generalizing ability, physical information is also required to be introduced into the loss function by using a physics-informed GAN. Also, the convolution layers used in the generator are not efficient in dealing with temporal data. Hence the current architecture cannot be used to get the evolution of stresses over time. This limitation can be overcome by using recurrent-based layers inside the generator.

6 CONCLUSION

In the current study, we deploy a GAN-based Finite Element model, to compute the final deformations and von-Mises stress distribution that can be read by an FE postprocessor from a discretized FE Preprocessor input. We propose a novel approach where instead of generating synthetic data, we employ the generator to synthesize required FE output such as von-Mises stress at each node, in this case. The proposed architecture is based on a residual U-net generator architecture and PatchGAN-based discriminator architecture. A data-driven approach is employed to train the model and exploit the relational inductive bias of the GANs. A plate with a hole with various dimensions, positions and boundary conditions is simulated using an FE-Solver which is used for training the model. The trained generator shows a strong correlation with FE simulations and has the ability to extrapolate the data outside of training bounds to an extent. The model's extrapolation ability deteriorates when the data exceeds 20%–25% over the training bounds and cannot be used for any meaningful predictions. That said, the model can interpolate between the training set efficiently. The next steps will be to overcome the limitations of the current architecture by proposing a hybrid, data+physics-driven model, with an LSTM-based generator. The proposed architecture has a very low error relative to that of FEM (2.78%) and can generate results that can be read by an FE Postprocessor. We explore nonlinear regression for GANs using Finite Element methods in which the novelty of the study lies.

ACKNOWLEDGMENTS

The authors gratefully acknowledge the financial support provided by Deutsche Forschungsgemeinschaft Priority Programme: SPP 2353 (DFG Grant No. STO 469/16-1).

Open access funding enabled and organized by Projekt DEAL.

REFERENCES

1Rudnyi, E. B., & Korvink, J. G. (2006). Model order reduction for large scale engineering models developed in ansys. In J. Dongarra, K. Madsen, & J. Waśniewski (Eds.), Applied Parallel Computing. State of the Art in Scientific Computing (pp 349–356). Springer Berlin Heidelberg.
10.1007/11558958_41
Google Scholar
2Wang, S., Wang, Y., Zi, Y., & He, Z. (2015). A 3D finite element-based model order reduction method for parametric resonance and whirling analysis of anisotropic rotor-bearing systems. Journal of Sound and Vibration, 359, 116–135.
10.1016/j.jsv.2015.08.027
Web of Science® Google Scholar
3Wulfinghoff, S., Cavaliere, F., & Reese, S. (2018). Model order reduction of nonlinear homogenization problems using a Hashin-Shtrikman type finite element method. Computer Methods in Applied Mechanics and Engineering, 330, 149–179.
10.1016/j.cma.2017.10.019
Web of Science® Google Scholar
4van Ophem, S., Atak, O., Deckers, E., & Desmet, W. (2017). Stable model order reduction for time-domain exterior vibro-acoustic finite element simulations. Computer Methods in Applied Mechanics and Engineering, 325, 240–264.
10.1016/j.cma.2017.06.022
Web of Science® Google Scholar
5Kůdela, J., & Matousek, R. (2022). Recent advances and applications of surrogate models for finite element method computations: A review. Soft Computing, 26(24), 13709–13733.
10.1007/s00500-022-07362-8
Web of Science® Google Scholar
6Hoffer, J. G., Geiger, B. C., Ofner, P., & Kern, R. (2021). Mesh-free surrogate models for structural mechanic FEM simulation: A comparative study of approaches. Applied Sciences, 11(20), 9411.
10.3390/app11209411
Google Scholar
7Duddeck, F. (2008). Multidisciplinary optimization of car bodies. Structural and Multidisciplinary Optimization, 35, 375–389.
10.1007/s00158-007-0130-6
Web of Science® Google Scholar
8Stoffel, M., Gulakala, R., Bamer, F., & Markert, B. (2020). Artificial neural networks in structural dynamics: A new modular radial basis function approach vs. convolutional and feedforward topologies. Computer Methods in Applied Mechanics and Engineering, 364, 112989.
10.1016/j.cma.2020.112989
Web of Science® Google Scholar
9Stoffel, M., Bamer, F., & Markert, B. (2019). Neural network based constitutive modeling of nonlinear viscoplastic structural response. Mechanics Research Communications, 95, 85–88.
10.1016/j.mechrescom.2019.01.004
Web of Science® Google Scholar
10Tandale, S. B., Markert, B., & Stoffel, M. (2022). Physics-based self-learning recurrent neural network enhanced time integration scheme for computing viscoplastic structural finite element response. Computer Methods in Applied Mechanics and Engineering, 401PB, 115668.
10.1016/j.cma.2022.115668
Google Scholar
11Gulakala, R., Markert, B., & Stoffel, M. (2023). Graph neural network enhanced finite element modelling. PAMM, 22(1), e202200306.
10.1002/pamm.202200306
Google Scholar
12Koeppe, A., Bamer, F., & Markert, B. (2020). An intelligent nonlinear meta element for elastoplastic continua: deep learning using a new time-distributed residual U-net architecture. Computer Methods in Applied Mechanics and Engineering, 366, 113088.
10.1016/j.cma.2020.113088
Web of Science® Google Scholar
13Tan, Q., Pan, Z., Gao, L., & Manocha, D. (2020). Realtime simulation of thin-shell deformable materials using cnn-based mesh embedding. IEEE Robotics and Automation Letters, 5(2), 2325–2332.
10.1109/LRA.2020.2970624
Web of Science® Google Scholar
14Mianroodi, J., Siboni, N., & Raabe, D. (2021). Teaching solid mechanics to artificial intelligence: A fast solver for heterogeneous materials. npj Computational Materials, 7, 99.
10.1038/s41524-021-00571-z
CAS Web of Science® Google Scholar
15Ye, S., Zhang, Z., Song, X., Wang, Y., Chen, Y., & Huang, C. (2020). A flow feature detection method for modeling pressure distribution around a cylinder in non-uniform flows by using a convolutional neural network. Scientific Reports, 10, 4459.
10.1038/s41598-020-61450-z
CAS PubMed Web of Science® Google Scholar
16Gao, Y., Liu, X., Huang, H., & Xiang, J. (2021). A hybrid of FEM simulations and generative adversarial networks to classify faults in rotor-bearing systems. ISA Transactions, 108, 356–366.
10.1016/j.isatra.2020.08.012
PubMed Web of Science® Google Scholar
17Nie, Z., Lin, T., Jiang, H., & Kara, L. B. (2021). TopologyGAN: Topology optimization using generative adversarial networks based on physical fields over the initial domain. Journal of Mechanical Design, 143(3), 031715.
10.1115/1.4049533
Web of Science® Google Scholar
18Wang, Z., Rosa, S., Xie, L., Yang, B., Wang, S., Trigoni, N., & Markham, A. (2018). Defo-net: Learning body deformation using generative adversarial networks. IEEE International Conference on Robotics and Automation (IRCA), (pp. 2440–2447).
Google Scholar
19Flores-Johnson, E., Shen, L., Guiamatsia, I., & Nguyen, G. (2014). Numerical investigation of the impact behaviour of bioinspired nacre-like aluminium composite plates. Composites Science and Technology, 96, 13–22.
10.1016/j.compscitech.2014.03.001
CAS Web of Science® Google Scholar
20Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D., Wu, J., Winter, C., … Amodei, D. (2020). Language models are few-shot learners. In H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, & H. Lin (Eds.), Advances in Neural Information Processing Systems (pp. 1877–1901). Curran Associates, Inc.
Google Scholar
21Touvron, H., Lavril, T., Izacard, G., Martinet, X., Lachaux, M. A., Lacroix, T., Roziére, B., Goyal, N., Hambro, E., Azhar, F., Rodriguez, A., Joulin, A., Grave, E., & Lample, G. (2023). Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971.
Google Scholar
22Park, T., Liu, M. Y., Wang, T. C., & Zhu, J. Y. (2019). Semantic image synthesis with spatially-adaptive normalization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2337–2346).
10.1109/CVPR.2019.00244
Google Scholar
23Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 4401–4410).
10.1109/CVPR.2019.00453
Google Scholar
24Watson, A. (2020). Deep learning techniques for super-resolution in video games.
Google Scholar
25Gulakala, R., Markert, B., & Stoffel, M. (2023). Rapid diagnosis of covid-19 infections by a progressively growing GAN and CNN optimisation. Computer Methods and Programs in Biomedicine, 229, 107262.
10.1016/j.cmpb.2022.107262
PubMed Web of Science® Google Scholar
26Gulakala, R., Markert, B., & Stoffel, M. (2022). Generative adversarial network based data augmentation for CNN based detection of Covid-19. Scientific Reports, 12(1), 19186.
10.1038/s41598-022-23692-x
CAS PubMed Web of Science® Google Scholar
27Li, L., Jamieson, K., DeSalvo, G., Rostamizadeh, A., & Talwalkar, A. (2017). Hyperband: A novel bandit-based approach to hyperparameter optimization. The Journal of Machine Learning Research, 18(1), 6765–6816.
Google Scholar

Volume23, Issue4

Special Issue:93rd Annual Meeting of the International Association of Applied Mathematics and Mechanics (GAMM)

December 2023

e202300201

Generative learning-based model for the prediction of 2D stress distribution

Abstract

1 INTRODUCTION

2 GENERATION OF DATA FROM FE SIMULATIONS

3 GENERATIVE ADVERASARIAL NETWORKS

4 NETWORK TRAINING AND HYPERPARAMETER TUNING

5 RESULTS AND DISCUSSION

5.1 From the trained limit

5.2 From the extrapolation

6 CONCLUSION

ACKNOWLEDGMENTS

REFERENCES

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Generative learning-based model for the prediction of 2D stress distribution

Abstract

1 INTRODUCTION

2 GENERATION OF DATA FROM FE SIMULATIONS

3 GENERATIVE ADVERASARIAL NETWORKS

4 NETWORK TRAINING AND HYPERPARAMETER TUNING

5 RESULTS AND DISCUSSION

5.1 From the trained limit

5.2 From the extrapolation

6 CONCLUSION

ACKNOWLEDGMENTS

REFERENCES

Figures

References

Related

Information