Volume 23, Issue 4 e202300205

RESEARCH ARTICLE

Open Access

Double U-Net: Improved multiscale modeling via fully convolutional neural networks

Julian Lißner,

Corresponding Author

Julian Lißner

[email protected]

orcid.org/0000-0002-2286-5211

Data Analytics in Engineering, Institute of Applied Mechanics, University of Stuttgart, Stuttgart, Germany

Correspondence

Julian Lißner, Data Analytics in Engineering, Institute of Applied Mechanics, University of Stuttgart, 70569 Stuttgart, Germany.

Email: [email protected]

Search for more papers by this author

Felix Fritzen,

Felix Fritzen

orcid.org/0000-0003-4926-0068

Data Analytics in Engineering, Institute of Applied Mechanics, University of Stuttgart, Stuttgart, Germany

Search for more papers by this author

Julian Lißner,

Corresponding Author

Julian Lißner

[email protected]

orcid.org/0000-0002-2286-5211

Data Analytics in Engineering, Institute of Applied Mechanics, University of Stuttgart, Stuttgart, Germany

Correspondence

Julian Lißner, Data Analytics in Engineering, Institute of Applied Mechanics, University of Stuttgart, 70569 Stuttgart, Germany.

Email: [email protected]

Search for more papers by this author

Felix Fritzen,

Felix Fritzen

orcid.org/0000-0003-4926-0068

Data Analytics in Engineering, Institute of Applied Mechanics, University of Stuttgart, Stuttgart, Germany

Search for more papers by this author

First published: 22 September 2023

https://doi.org/10.1002/pamm.202300205

Share a link

Email
Wechat
Bluesky

Abstract

In multiscale modeling, the response of the macroscopic material is computed by considering the behavior of the microscale at each material point. To keep the computational overhead low when simulating such high performance materials, an efficient, but also very accurate prediction of the microscopic behavior is of utmost importance. Artificial neural networks are well known for their fast and efficient evaluation. We deploy fully convolutional neural networks, with one advantage being that, compared to neural networks directly predicting the homogenized response, any quantity of interest can be recovered from the solution, for example, peak stresses relevant for material failure. We propose a novel model layout, which outperforms state-of-the-art models with fewer model parameters. This is achieved through a staggered optimization scheme ensuring an accurate low-frequency prediction. The prediction is further improved by superimposing an efficient to evaluate U-net, which captures the remaining high-level features.

1 INTRODUCTION

When considering the size effect [1] for larger structures, a safety factor of up to 2 has to be adopted in order to ensure the material of a component will not break. This safety factor arises due to effects occurring on the microscale of the material, that is, due to accumulated damage on the microstructured material. In multiscale modeling, the material response on the macroscopic scale is obtained by considering the effects occurring on the microscale. Thus, enabling the engineering of high-performance materials suitable to effectively and efficiently fulfill the requirements of the deployed structure. In a multiscale simulation, at every material point on the macroscale, the microscopic behavior of the material is considered and modeled [2, 3], leading to prohibitively high computational cost. To circumvent the costly and repetitive simulation of the microscale, machine learning models are deployed to obtain an efficient, yet accurate prediction of the microscopic material behavior [4]. Artificial neural networks are a popular choice to predict the quantity of interest [5, 6], often being the homogenized response of the microstructured material, either obtained via a feature transform with a subsequent machine learning model [4, 5], or via the direct prediction using convolutional neural networks (Conv Net) [6]. The microstructured material is often represented as image data, for example, obtained via a CT-scan, making it suitable for Conv Nets to directly operate on the image input and efficiently yield an accurate prediction.

A subtype in the field of Conv Nets, that is, fully convolutional neural networks, yield another image as their prediction, which can be used to predict the full field solution in the context of microstructure modeling [7, 8]. An additional advantage of the predicted full field solution is that any quantity of interest can be extracted a posteriori, such that one is not constrained to the homogenized response, but one could also recover, for example, the peak stresses, which are relevant for material failure. The proposition of the U-net by Ronneberger et al. [9] surged the interest in the image to image prediction, which has an encoder–decoder structure, and first compresses the spatial resolution of the input image, before increasing it again to recover the prediction at the original resolution. Since the fully Conv Net operates on multiple resolutions, various approaches have emerged to utilize an approximation of the solution in a lower resolution, which contributes to the loss/model optimization and can be trivially obtained through coarse graining during training [10, 11]. Another approach simulates the material at the microscale in coarse resolution and uses the solution as input to the model while upsampling [7]. A different layout has also been considered, which incorporates information of the image on multiple spatial resolution levels [11, 12], starting at a very coarse resolution, on which the convolutional layers can consider large features within the image, and increasingly refining the prediction on each level when recovering the original resolution for its prediction.

This article diverts its attention to fully Conv Nets. We utilize the U-net structure and coarse grained solutions during the training for optimization. The model is designed to have comparatively few parameters to optimize, while being significantly more accurate than recent state-of-the-art models. This is achieved through a model layout, which ensures very accurate predictions on lower resolutions. To further refine the prediction of a model, a second efficient to evaluate U-Net is superimposed to capture high-level features. Due to the nature of the multiple interdependent prediction contributions, a staggered training scheme is proposed. The code of the model's implementation is made freely available in Lissner [13].

2 MULTISCALE MODELING

In multiscale modeling, the behavior of each point at the macroscopic scale $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0001$ is governed by its underlying microscale with the domain Ω. For arbitrarily different length scales, that is, $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0002$ , a separation of length scale can be safely assumed. Further, it is assumed that the micro- and the macroscale follow the same material laws, which is described in more detail in Leuschner [14]. Thus, we will focus on the microscale and consider a microstructural unit cell Ω to motivate the materials behavior. For the sake of brevity, consider the equilibrium condition for the steady state heat equation

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0003$ (1)

with the heat flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0004$ . The constitutive equation relating the heat flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0005$ to the temperature gradient $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0006$ is given as

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0007$ (2)

when following Fourier's law. Here the heterogeneous thermal conductivity tensor $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0008$ is introduced. The investigated microstructured material is subsided to periodic and antiperiodic boundary conditions with the fluctuating terms $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0009$ , being expressed as

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0010$ (3)

where each quantity $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0011$ belongs to the periodic point set of the boundary $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0012$ , respectively. The loading on the microscale is prescribed by its macroscopic counterpart. Once the materials response on the microscale is calculated, the homogenized quantity of interest $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0013$ is recovered by the homogenization operation

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0014$ (4)

The Hill–Mandel condition is expressed as

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0015$ (5)

which completes the scale transition from micro- to macroscale. Considering the material at the microscale for each material point of the macroscopic body leads to a prohibitive computational cost. Machine learning surrogates are deployed to efficiently predict the microstructural behavior. There are numerous studies, which directly predict the homogenized quantity via machine learning [4-6], and others which predict the full field solution [11, 15], and recover the quantities of interest thereof, for example, via Equation (4). This study falls into the latter subcategory, where the full-field solution of the heat flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0016$ is predicted.

3 CONVOLUTIONAL NEURAL NETWORKS

3.1 Introduction

Convolutional neural networks (Conv Nets) are originally inspired by dense feed forward neural networks and developed to handle high-dimensional input data. This is realized by replacing the dense connection between two layers by a convolution operation, such that the forward propagation reads as

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0017$ (6)

where generally square kernels k of size 3 × 3 are deployed to globally operate on the previous layer's output $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0018$ . After the addition of the bias term b, an activation function f is deployed to introduce nonlinearity between layers. To improve information processing between two layers, channels are introduced (represented as multiple rectangles in Figure 1), which are comparable to the number of hidden neurons in dense neural networks. Here, a brief motivation for convolutional neural networks is given to familiarize the reader with the relevant notion for the effective design of Conv Nets. Basic algorithmic operations are nicely illustrated in Dumoulin and Visin [16].

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

A schematic illustration of the U-Net is shown. The operations between the connections are typically modularized on the down/up path, such that each connecting arrow reflects the same operations. Each blue rectangle represents one channel at current spatial resolution.

The forward pass via a convolution operation implies that each data point in the input image is processed by the same machine learned weights k. The underlying feature transform between layers is a global convolution operation, affected by the spatially local neighborhood. Therefore, we introduce the notion of the receptive field, that is, the number of adjacent pixels, which are at most considered by a single convolution operation. Since each 3 × 3 convolution increases the receptive field for all subsequent layers by two pixels, a prohibitive number of convolution operations would be required to achieve large receptive fields. When reducing the spatial resolution, which is implemented by evaluating the kernel each stride increment, the receptive field is virtually increased. After applying an operation with a stride of two, every subsequent operation has virtually doubled the receptive field with respect to the original spatial resolution of the input image. After resolution reduction, the number of feature channels can be increased while keeping memory usage constant. In terms of fully convolutional neural networks, the spatial resolution of the Conv Nets output has to match the resolution of the input image. To increase the resolution, either upsampling, or transpose convolutions[16] are used. In this end to end setting, Ronneberger et al. [9] proposed the U-net (Figure 1), which reuses the feature channels from the left while decreasing the resolution, and concatenates them to later layers on the right at the respective spatial resolution. This improves information processing in the model and provides low-level information to latter layers of the model.

Recent research has progressed in the direction of improved information processing [17-19], where shortcuts are used between two modules/layers. For the design of Conv Nets, modular blocks are generally used, which replace a single convolution operation between two layers by multiple operations. In Szegedy et al. [17], multiple parallel convolutions of differently sized kernels are deployed as inception modules. The ResNet [18] deploys blocks of two 3 × 3 convolutions, augmented by an identity mapping, which adds the previous features to the module's output, improving the gradient flow [19] and enabling deeper models to reliably converge.

In the field of fully convolutional neural networks, additional improvements have been found by infusing some knowledge of the approximate solution on lower resolutions into the model while the U-net structure increases the resolution. A popular option to infuse the knowledge is to assist the neural network during training by fitting the solution on multiple resolutions [11] to the coarse grained (average pooled) solution. Another interesting approach has been proposed by Zhou et al. [7], evaluating the simulation on a very coarse resolution, serving as a feature input on the respective spatial resolution.

3.2 Model proposition

The general structure of the U-net [9] is adopted, and the data usage of the model is optimized to reduce the computational overhead of the model while retaining/increasing accuracy. A main idea of the model is to have two prediction contributions, where the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0019$ , the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0020$ -net, aims to predict a good low-frequency field, while the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0021$ , the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0022$ -net, is superimposed to capture the remainder of the high-level/frequency features.

The structure and information flow of the proposed layout is illustrated in Figure 2 (left), where each arrow indicates one module. The multiple feature types, which contribute to the prediction in the proposed U-net structure, are denoted as

I—the input image, also coarse grained to each spatial resolution.
II—features obtained after decreasing the spatial resolution in the down path.
III—features while increasing the spatial resolution in the up path (III* + the upsampled prediction of the previous level).
IV—the prediction at current spatial resolution.

The model reuses modular blocks of convolutions on each operation in the same path, shown in Table 1, where the more intricate predictor on each spatial resolution of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0023$ -net is shown in Figure 2 (right). Note that the coarse grained prediction (IV) is not directly added to the next levels prediction, but combined with the feature channels used for upsampling and followed by a few convolution operations. This improved the models prediction quality significantly, since a Conv Net struggles to remove the introduced upsampling artifacts with a single convolution, or by adding a correction term after each upsampling operation. Such upsampling artifacts can be spotted by a critical eye in Marcato et al. [11].

The final prediction of the model is given as

(7)

Since the two contributions do interact with each other, a multistage training scheme is proposed. First, the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0025$ -net is optimized, and frozen thereafter. As a next step, the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0026$ -net is activated and trained to refine the prediction. In a final step, both predictors of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0027$ -net are optimized in a combined manner.

When optimizing the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0028$ -net, the prediction $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0029$ on each level $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0030$ contributes to the total loss $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0031$ , that is,

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0032$ (8)

where multiple contributions of the loss Φ are weighted with predefined constants $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0033$ . The solution y is coarse-grained with the average pooling operation to match the current levels spatial resolution. In general, the smaller the spatial resolution, the more channels are used. Using the base number of channels of $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0034$ , the number of channels per level are scaled as

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0035$ (9)

In addition to the training split of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0036$ -net and $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0037$ -net, the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0038$ -net is optimized in a staggered manner, while increasing the levels and the number of active modules, during a pretraining loop. One pretraining loop consists of a first optimization loop where the constant $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0039$ is zero for all but the current levels, that is,

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0040$ (10)

optimized in an incremental manner for each level i in $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0041$ . Since the unconstrained optimization only ensures that the current level is a good approximation of the solution, in a second step, the approximation on each level is ensured with

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0042$ (11)

such that the current level is weighted as the most important. After one complete pretraining loop, that is, both incremental optimization schemes (10), (11), the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0043$ -net is frozen except for the final predicting level, and the remainder of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0044$ -net is optimized as described above.

4 RESULTS

The proposed model is applied for the full-field prediction of the microstructured material characterized by representative volume elements (RVE), which is subsided to periodic boundary conditions (3). Thus, a single frame, that is, the input image to the fully convolutional neural network (Conv Net), suffices to fully characterize the material at the microscale. We consider the stationary heat equation outlined in Section 2 in a linear setting under constant loading conditions. Through superpositioning, the effective heat conduction tensor $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0045$ from Equation (2) can be recovered and generalized to arbitrary loading conditions.

The proposed model is compared to the U-ResNet used in Santos et al. [15]. It has also been compared to the more recent models [11, 12], which have significantly underperformed the presented models. We deploy a data augmentation scheme during runtime outlined in Lißner and Fritzen [20], which utilizes the periodic boundary conditions to artificially generate new samples. In total, 1500 samples have been used for training, where the validation set contained 300 samples thereof. For optimization, all models have been trained using the mean squared error (MSE) as loss and the adam optimizer, with constant initial learning rate of 10⁻³ and early stopping. Each model is set out to predict all four components of the heat flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0046$ , that is, x and y response under x and y loading, respectively. The RVE image is the input to the model of fixed resolution of 128 × 128 voxels, where different input features following [15] did not improve the prediction. All error measures below refer to a test set consisting of 500 samples.

The effectiveness of the proposed optimization scheme of Section 3.2 has been investigated in an ablation study, where the results are summarized in Table 2. The table outlines statistical moments of the resulting heat flux distribution $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0047$ , as well as global MSE metrics. There it can be observed that the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0048$ -net significantly outperforms the UResNet when utilizing the optimization scheme. Adding the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0049$ -net on top of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0050$ -net significantly improves the prediction, especially the errors on the peak values of the heat flux, which supports the design purpose of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0051$ -net, delivering a good field prediction.

TABLE 1. Each repeated module in the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0052$ -net is shown. The “conv” keyword before each layer has been omitted for readability. Each operation is read as “(conv) kernel size/stride.” “Max” refers to Max-pooling. Operations displayed in parallel are evaluated in parallel as an inception module.

$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0053$ -net			$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0054$ -net
Down path: II^VE	Up path III^VE	Predictor^VE	Down path II^V	Up path III^V	Predictor^V
3/2–5/2–Max 3/2	2/2^T–4/2^T	2· 3/1–1/1 $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0055$	3/2–5/2–Max/2	Add ← II^V	3· 3/1
Concat	Concat	Add	Concat	2/2^T- 4/2^T	1/1
1/1	1/1	1/1 $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0056$	1/1	Concat
	Concat $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0057$		3· 3/1	1/1
				3· 3/1

TABLE 2. Different global error measures are given of the heat flux distribution for the differently trained models. Every error measure but the mean squared error (MSE) is given as a relative error measure as percentages [%]. Error measures in the left box refer to one component of the heat flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0058$ , whereas the error metrics in the right box are global over all components. The best performing model is marked with boldface in each metric.

	$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0059$ [%]	std $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0060$ [%]	skew $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0061$ [%]	peak flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0062$ [%]	MSE [ $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0063$ ]	rel $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0064$ [%](field)	$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0065$ [%] (homogenized)	≈params
U-ResNet	0.35	1.95	19.80	3.31	9.35	2.74	1.11	780 000
$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0066$ direct	1.41	2.22	84.96	32.36	60.60	6.97	1.73	310 000
$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0067$ pretrained	0.21	0.61	14.14	7.79	6.71	2.32	0.51	310 000
$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0068$ direct	0.33	0.71	15.96	5.06	7.51	2.45	0.64	505 000
$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0069$ pretrained	0.19	0.61	10.13	2.33	3.11	1.58	0.46	505 000
$urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0070$ multi-pretrain	0.23	0.57	9.14	2.14	2.96	1.54	0.41	505 000

In the subsequent figures, the U-ResNet, which has been reimplemented from Santos et al. [15], is compared to the previously best performing model, that is, the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0071$ -net pretrained multiple times. In Figure 3, it is quantified how well the machine learned model recovers the actual distribution of the heat flux. As can be seen, the proposed $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0072$ -net miss-classifies only ≈6% of the pixels, whereas the comparable U-ResNet has more than ≈12% of pixels wrongly predicted over the entire test set. The $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0073$ -net captures the underlying physics of the problem, delivers a smooth field prediction as well as accurate peak value predictions.

**FIGURE 3**
Open in figure viewer PowerPoint

The predicted distribution of the heat flux $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0074$ is compared using the models best predicting $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0075$ -net and the U-ResNet for one random sample of the test set (left). The red bars denote the overestimation of the model in the current bin, and the orange bars denote the under estimation, respectively. On the right, the relative cumulative bin errors are shown, for the plotted sample (on the left) in dashed lines, and averaged over the entire test set in full lines, where the colors associate to each model shown on the left.

An increasingly strict error measure is considered in Figure 4, since Figure 3 disregards the position of the pixels, for example, a random permutation of the predicted pixels would not be reflected in the error metric. The pixel error compares each pixel with the solution at its position. Once again, a distribution over the entire test set and all components of $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0076$ is presented. Every prediction error larger than $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0077$ is capped and summarized to 0.25, which hides the long tail of the U-ResNet's prediction errors and ignores the single pixel maximum prediction error. In general, the left skew of the error distribution of the $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0078$ -net implies that most of the pixels have a very low prediction error, which is also reflected in the cumulative pixel error distribution, where more than 80% of pixels have a lower absolute error than $urn:x-wiley:16177061:media:pamm202300205:pamm202300205-math-0079$ .

5 CONCLUSION

We developed a new efficient model layout of fully convolutional neural networks, which significantly outperforms recent state-of-the-art models while having fewer model parameters. There is multiple contributions improving the model, one being a staggered optimization scheme to ensure an excellent low-frequency prediction. The other improvement is found by superimposing a U-Net to refine the low-level prediction, reducing the errors on the peak values by 70%. The code of the model implementation is made freely available in Lissner [13]. The model has been applied in microstructure modeling under thermal boundary conditions, achieving a homogenization error of 0.2%. The model could be easily adopted to different physical boundary conditions once the data are readily available.

ACKNOWLEDGMENTS

Funded by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy—EXC 2075-390740016. Contributions by Felix Fritzen are funded by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) within the Heisenberg program DFG-FR2702/8-406068690 and DFG-FR2702/10-517847245. We acknowledge the support by the Stuttgart Center for Simulation Science (SimTech).

Open access funding enabled and organized by Projekt DEAL.

REFERENCES

1Bažant, Z. P. (1999). Size effect on structural strength: A review. Archive of applied Mechanics, 69, 703–725.
10.1007/s004190050252
Web of Science® Google Scholar
2Miehe, C. (2002). Strain-driven homogenization of inelastic microstructures and composites based on an incremental variational formulation. International Journal for Numerical Methods in Engineering, 55, 1285–1322.
10.1002/nme.515
Web of Science® Google Scholar
3Keshav, S., Fritzen, F., & Kabel, M. (2023). FFT-based homogenization at finite strains using composite boxels (ComBo). Computational Mechanics, 71(1), 191–212.
10.1007/s00466-022-02232-4
Web of Science® Google Scholar
4Li, H., Kafka, O. L., Gao, J., Yu, C., Nie, Y., Zhang, L., Tajdari, M., Tang, S., Guo, X., Li, G., Tang, S., Cheng, G., Liu, & W. K. (2019). Clustering discretization methods for generation of material performance databases in machine learning and design optimization. Computational Mechanics, 64, 281–305.
10.1007/s00466-019-01716-0
Web of Science® Google Scholar
5Lißner, J., & Fritzen, F. (2019). Data-driven microstructure property relations. Mathematical and Computational Applications, 24(2), 57.
10.3390/mca24020057
Web of Science® Google Scholar
6Eidel, B. (2023). Deep CNNs as universal predictors of elasticity tensors in homogenization. Computer Methods in Applied Mechanics and Engineering, 403, 115741.
10.1016/j.cma.2022.115741
Web of Science® Google Scholar
7Zhou, X. H., McClure, J. E., Chen, C., & Xiao, H. (2022). Neural network-based pore flow field prediction in porous media using super resolution. Physical Review Fluids, 7(7), 074302.
10.1103/PhysRevFluids.7.074302
Web of Science® Google Scholar
8Santos, J. E., Yin, Y., Jo, H., Pan, W., Kang, Q., Viswanathan, H. S., Prodanović, M., Pyrcz, M. J., & Lubbers, N. (2021). Computationally efficient multiscale neural networks applied to fluid flow in complex 3D porous media. Transport in Porous Media, 140(1), 241–272.
10.1007/s11242-021-01617-y
CAS Web of Science® Google Scholar
9Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention (pp. 234–241).
Google Scholar
10Wang, F., Eljarrat, A., Müller, J., Henninen, T. R., Erni, R., & Koch, C. T. (2020). Multi-resolution convolutional neural networks for inverse problems. Scientific Reports, 10(1), 1–11.
10.1038/s41598-020-78799-w
CAS PubMed Web of Science® Google Scholar
11Marcato, A., Santos, J. E., Boccardo, G., Viswanathan, H., Marchisio, D., & Prodanović, M. (2023). Prediction of local concentration fields in porous media with chemical reaction using a multi scale convolutional neural network. Chemical Engineering Journal, 455, 140367.
10.1016/j.cej.2022.140367
CAS Web of Science® Google Scholar
12Shaham, T. R., Dekel, T., & Michaeli, T., (2019). SinGAN: Learning a generative model from a single natural image. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 4570–4580).
Google Scholar
13Lissner, J. (2023, June). TensorFlow models, plotting templates and python utilities. Computer Software,https://github.com/J-lissner/python_scripts
Google Scholar
14Leuschner, M. (2018). Numerically efficient computational homogenization: Fourier-accelerated nodal solvers and reduced basis model order reduction. Institute of Applied Mechanics.
Google Scholar
15Santos, J. E., Xu, D., Jo, H., Landry, C. J., Prodanović, M., & Pyrcz, M. J. (2020). PoreFlow-Net: A 3D convolutional neural network to predict fluid flow through porous media. Advances in Water Resources, 138, 103539.
10.1016/j.advwatres.2020.103539
Web of Science® Google Scholar
16Dumoulin, V., & Visin, F. (2016). A guide to convolution arithmetic for deep learning. arXiv preprint arXiv:1603.07285.
Google Scholar
17Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., & Rabinovich, A. (2015). Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1–9).
Google Scholar
18He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770–778).
Google Scholar
19Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4700–4708).
Google Scholar
20Lißner, J., & Fritzen, F. (2023). Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks. arXiv preprint arXiv:2302.12545.
Google Scholar

Volume23, Issue4

Special Issue:93rd Annual Meeting of the International Association of Applied Mathematics and Mechanics (GAMM)

December 2023

e202300205

Double U-Net: Improved multiscale modeling via fully convolutional neural networks

Abstract

1 INTRODUCTION

2 MULTISCALE MODELING

3 CONVOLUTIONAL NEURAL NETWORKS

3.1 Introduction

3.2 Model proposition

4 RESULTS

5 CONCLUSION

ACKNOWLEDGMENTS

REFERENCES

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Double U-Net: Improved multiscale modeling via fully convolutional neural networks

Abstract

1 INTRODUCTION

2 MULTISCALE MODELING

3 CONVOLUTIONAL NEURAL NETWORKS

3.1 Introduction

3.2 Model proposition

4 RESULTS

5 CONCLUSION

ACKNOWLEDGMENTS

REFERENCES

Figures

References

Related

Information