Volume 73, Issue 6 e70043
ORIGINAL ARTICLE
Open Access

Stochastic Joint Inversion of Seismic and Controlled-Source Electromagnetic Data

Pankaj K Mishra (corresponding author)
Geological Survey of Finland (GTK), Espoo, Finland

Adrien Arnulf
Amazon, San Diego, California, USA
Institute for Geophysics, The University of Texas at Austin, Austin, USA

Mrinal K Sen
Amazon, San Diego, California, USA

Zeyu Zhao
Amazon, San Diego, California, USA
School of Earth and Space Science, Peking University, Beijing, China

Piyoosh Jaysaval
Pacific Northwest National Laboratory, Richland, USA
First published: 08 July 2025
Funding: The study was supported by TOTAL E&P, Houston, Texas, USA, and the Research Council of Finland (359261).

ABSTRACT

Stochastic inversion approaches provide a valuable framework for geophysical applications due to their ability to explore multiple plausible models rather than offering a single deterministic solution. In this paper, we introduce a probabilistic joint inversion framework combining the very fast simulated annealing optimization technique with generalized fuzzy c-means clustering for coupling the model parameters. Since very fast simulated annealing requires extensive computational resources to converge when dealing with a large number of inversion parameters, we employ sparse parameterization, where models are sampled at sparse nodes and interpolated back to the modelling grid for forward computations. By executing multiple independent inversion chains with varying initial models, our method effectively samples the model space, thereby providing insights into model variability. We demonstrate our joint inversion methodology through numerical experiments using synthetic seismic traveltime and controlled-source electromagnetic datasets derived from the SEAM Phase I model. The results illustrate that the presented approach offers a practical compromise between computational efficiency and the ability to approximate model uncertainties, making it a suitable alternative for realistic, larger-scale joint inversion problems.

1 Introduction

Geophysical inverse problems are inherently non-unique, meaning that many plausible models can fit the data. The idea behind an integrated inversion is to reduce the number of possible models by using different but complementary geophysical, geological and petrophysical data in a unified geophysical inversion framework. The term ‘joint inversion’ refers to one of the many integrated (coupled) inversion approaches, in which the cost functions of different methods are combined into a joint objective function that is minimized while adjusting all the model parameters concurrently. Since all the involved methods contribute to the model update, inversion artefacts are likely to be reduced in any subspace of the model that is sensitive to more than one method (Moorkamp et al. 2016). There are some specific challenges in the development of an efficient joint inversion algorithm:
  1. Although a joint inversion is likely to narrow down the number of possible solutions, the inversion problem remains non-unique and the constructed model may still not be a true representation of the subsurface.
  2. In realistic models, the petrophysical relationship(s) among different model parameters can be complicated, which requires an efficient coupling strategy in the joint inversion algorithm.

The first challenge becomes particularly important when a deterministic method is used for joint inversion, as it typically produces a single ‘best-fit’ model without conveying any information about variability in the solution space. In contrast, probabilistic global optimization methods such as very fast simulated annealing (VFSA) used in this paper, although not designed to sample the full posterior distribution like Markov chain Monte Carlo (MCMC), operate within a stochastic framework that enables limited exploration of the model space. By running multiple inversion chains from different initial conditions, VFSA can yield an ensemble of plausible models that fit the data. While this ensemble does not constitute a statistically rigorous posterior sample, it can still reveal meaningful variability in model parameters and provide qualitative insights into solution stability. In this article, we use the term uncertainty in a qualified sense to refer to this ensemble-based variability, acknowledging that it offers only an approximate view of the inherent non-uniqueness of the geophysical inverse problem.

Some previous works in the context of probabilistic joint inversion have used Monte Carlo (MC) method (Bosch and McGaughey 2001; Chen et al. 2004; Bosch et al. 2006; Jardani and Revil 2009; Shen et al. 2013), co-kriging method (Shamsipour et al. 2012), MCMC method (Rosas-Carbajal et al. 2014; Wéber 2018), trans-dimensional MCMC (Blatter et al. 2019), and VFSA method (Kaikkonen and Sharma 1998; Yang et al. 2002; Hertrich and Yaramanci 2002; Santos et al. 2006).

Among the approaches listed, MC methods are considered the most statistically rigorous, as they aim to produce independent samples drawn directly from the target posterior distribution. Model proposals are generated randomly across the parameter space and accepted or rejected based on the Metropolis–Hastings criterion (Metropolis et al. 1953). While MC sampling provides a high-fidelity estimate of the posterior probability distribution (PPD), it is computationally intensive and often impractical for high-dimensional or computationally expensive forward models (Sen and Stoffa 2013).

MCMC methods improve computational feasibility by constructing a Markov chain whose stationary distribution is the posterior. Each new model is generated conditionally based on the current model, enabling the chain to concentrate sampling in regions of high posterior probability. Although this results in dependent samples, the method is more efficient than standard MC for large-scale inverse problems. Sen and Stoffa (1996) review a range of such sampling-based approaches and show that with suitable modifications, optimization algorithms like VFSA can approximate features of the PPD at substantially lower computational cost. VFSA, originally developed as a global optimizer, uses a cooling schedule combined with the Metropolis–Hastings acceptance rule. When applied with multiple runs or at a fixed temperature, VFSA can yield ensembles that approximate posterior-like distributions (Roy, Sen, Blankenship, et al. 2005; Roy, Sen, McIntosh, et al. 2005), offering a practical alternative for exploring model variability in joint inversion frameworks.

There are two main differences between MCMC and VFSA. First, VFSA uses a temperature-dependent Cauchy distribution to draw the proposal model, which concentrates proposals ever closer to the previous state as the temperature decreases. Second, the probability of accepting a ‘bad’ model also decreases over the iterations and becomes sufficiently low near the global minimum. The PPD derived from a single chain of VFSA is therefore inherently biased towards the global minimum, and multiple chains of VFSA are needed to obtain many plausible models for uncertainty quantification. Although the PPD estimated through rigorous sampling methods is more accurate, the one obtained through VFSA does provide a ‘sweet spot’ between affordability and accuracy.

The second challenge, that is, effective coupling of the model parameters, has mostly been discussed in the context of deterministic joint inversion and can be categorized as (1) structure-based coupling (Haber and Oldenburg 1997; Gallardo and Meju 2004) and (2) petrophysical coupling (Koketsu and Nakagawa 2002; Jegen et al. 2009). A detailed review of different approaches to parameter coupling can be found in Colombo and Rovetta (2018). In this paper, we use the guided fuzzy c-means (FCM) clustering developed by Sun and Li (2012), which is a generalized version of the method proposed by Lelièvre et al. (2012) and has been used effectively in deterministic joint inversion in geoscience (Sun and Li 2016a, 2016b; Darijani et al. 2020).

In this paper, we introduce a probabilistic joint inversion approach for multi-physics data integration, utilizing the VFSA algorithm combined with a generalized FCM clustering method (Dunn 1973; Bezdek 1981). Given that a large number of inversion parameters would typically necessitate an impractically high number of VFSA iterations for convergence, we employ a sparse parameterization strategy to randomly distribute inversion points across the model space. We provide a detailed discussion of the VFSA and FCM algorithms, explaining our rationale for their selection in the joint inversion framework. To validate the proposed algorithm, we present numerical experiments on the joint inversion of first-arrival seismic traveltime and controlled-source electromagnetic data for a 2D slice of the SEAM Phase I model, focusing on computing mean models and associated uncertainties.

2 Methods

2.1 Sparse Parameterization

To ensure that very fast simulated annealing (VFSA) can reliably converge within realistic computational budgets, we reduce the number of free parameters in the inverse problem using a sparse parameterization strategy, illustrated in Figure 1 with four panels highlighting each step of the workflow. Panel (a) shows the model at iteration $n$ on the full computational grid, including a blue water layer at the top and a red anomaly in the subsurface. Rather than assigning an inversion parameter to every cell, a set of 150 control points is defined in panel (b) to represent the model with far fewer parameters. These points are placed more densely in regions of anticipated complexity, such as the anomalous zone, showing that control points can be distributed adaptively where a more complex model is expected; each point carries a single value that is later interpolated across the entire domain. This approach substantially reduces the dimensionality of the inverse problem while maintaining the flexibility needed to capture critical structural variations. Panel (c) demonstrates how the control-point values are updated during a VFSA iteration, and panel (d) shows the updated velocity model after re-interpolation onto the full simulation mesh, reflecting the new parameter values below the water layer. Subsequent tests in this paper adopt a simpler random distribution of control points and use linear radial basis functions for interpolation.

Illustration of the sparse parameterization for VFSA. Panel (a) shows the model at iteration $n$ on the full simulation mesh, with a fixed water layer (blue) and an embedded anomaly (red). Panel (b) samples the same model at a set of control points (circles), providing a significantly reduced set of inversion parameters. Panel (c) displays the updated control-point values proposed at iteration $n+1$ of VFSA. Panel (d) depicts the updated velocity model once these new values are interpolated back onto the modelling mesh. This approach preserves the water layer as a known zone (blue) while enabling flexible updates in the deeper section.

One important concern in sparse parameterization is how to choose the number of control points. This choice is somewhat ad hoc: too many control points can hinder the convergence of VFSA within a realistic number of iterations, while too few may lead to under-parameterization, making it difficult to resolve smaller-scale features in the model. In the Supporting Information, we provide a MATLAB script (SparseParameterization.m) that generates Figure 1, allowing readers to experiment with how varying the number and spatial distribution of control points influences the model representation.
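As a rough illustration of this workflow (in Python rather than the MATLAB of the supplementary script, and with made-up grid dimensions, control-point count and model values), one can scatter control points over the domain and interpolate their values back onto the full grid with a linear radial basis function:

```python
import numpy as np
from scipy.interpolate import RBFInterpolator

rng = np.random.default_rng(42)

# Full modelling grid (nz x nx cells), e.g. a velocity model.
nx, nz = 175, 40
xg, zg = np.meshgrid(np.arange(nx), np.arange(nz))
grid = np.column_stack([xg.ravel(), zg.ravel()])

# Hypothetical set of 150 randomly placed control points carrying
# the actual inversion parameters.
n_ctrl = 150
ctrl_xy = np.column_stack([rng.uniform(0, nx, n_ctrl),
                           rng.uniform(0, nz, n_ctrl)])
ctrl_val = 1500.0 + 25.0 * ctrl_xy[:, 1]   # e.g. velocity increasing with depth

# Linear radial-basis interpolation back onto the full grid,
# which is then used for the forward computations.
model = RBFInterpolator(ctrl_xy, ctrl_val, kernel='linear')(grid).reshape(nz, nx)
```

Varying `n_ctrl` in such a sketch makes the trade-off discussed above tangible: fewer points smooth away small-scale structure, while more points enlarge the VFSA search space.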

2.2 Objective Function and Parameter Coupling

We formulate the objective function under a joint inversion framework for two distinct data types, seismic and controlled-source electromagnetic (CSEM), while also introducing a generalized fuzzy clustering scheme to impose statistical constraints on the model parameters. This integrated approach ensures that each method's individual physical principles are respected, yet both contribute to a single shared model space. The joint objective function is expressed as
$$\begin{equation} \Phi = \alpha \Phi ^{\rm CSEM} + \beta \Phi ^{\rm SE} + \gamma \Phi ^{\rm FCM} + \lambda \sum _{i=1}^{c} \Vert \mathbf {v}_i - \mathbf {g}_i\Vert _2, \end{equation}$$ (1)
where $\Phi^{\rm CSEM}$, $\Phi^{\rm SE}$ and $\Phi^{\rm FCM}$ are the misfit terms for electromagnetic (EM) data, seismic data and a fuzzy clustering constraint, respectively, and $\alpha$, $\beta$, $\gamma$ control each term’s contribution. Although a single, standardized error measure might be conceptually appealing, we retain the original cost functions from the legacy EM and seismic inversion codes used for the tests to preserve consistency with domain-specific practices. In the following paragraphs, we discuss the cost functions of the two geophysical methods used in the numerical test in this paper. We emphasize that this is one of many possible ways of constructing the cost function and not a general recommendation.
$$\begin{equation} \Phi ^{\rm CSEM}(\sigma) = \sum _{\mathbf {r}_s,\mathbf {r}_r,F,i,f} 0.5\,W_{i}^{F}\!\bigl (\mathbf {r}_r \mid \mathbf {r}_s; \mathbf {J},f\bigr) \,\bigl |\Delta F_i\bigl (\mathbf {r}_s,\mathbf {r}_r,F,i,f,\sigma \bigr)\bigr |^2, \end{equation}$$ (2)
where $F_i$ denotes the electric or magnetic field component ($i \in \lbrace x,y\rbrace$) and $\Delta F_i$ is the difference between the observed and calculated fields at receiver $\mathbf{r}_r$ due to the source $\mathbf{J}$ located at $\mathbf{r}_s$. Each residual is multiplied by the weight
$$\begin{equation} W_{i}^{F} = \frac{1}{|F_{i}^{\mathrm{obs}}|^2 + \eta ^2}, \end{equation}$$ (3)
ensuring that extremely small signal amplitudes do not disproportionately affect the cost. For the numerical experiments with CSEM, we added a fixed noise floor of $10^{-17}$ at each frequency when creating the observed data.
The misfit $\Phi^{\rm SE}$ is defined as
$$\begin{equation} \Phi ^{\rm SE} = \sum _{i=1}^{N} W_i {\left(T_i^{\rm pick} - T_i^{\rm cal} \right)}^2, \end{equation}$$ (4)
where $T_i^{\rm pick}$ and $T_i^{\rm cal}$ are the picked and calculated traveltimes, respectively, $\sigma_i^{\rm err}$ is the uncertainty in picking, $N$ is the total number of picks and the weight $W_i$ is defined as
$$\begin{equation} W_i = \frac{1}{{\left(\sigma _i^{\rm err} \right)}^2}. \end{equation}$$ (5)
For the numerical experiments here, we add small Gaussian noise of around 2 ms to the observed traveltimes.
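In code, Equations (4) and (5) reduce to a weighted sum of squared traveltime residuals. A minimal sketch with hypothetical picks and a uniform 2 ms picking uncertainty:

```python
import numpy as np

t_pick = np.array([1.203, 1.451, 1.688])   # picked traveltimes (s), hypothetical
t_cal  = np.array([1.200, 1.455, 1.690])   # calculated traveltimes (s)
sigma_err = 0.002                          # 2 ms picking uncertainty

w = 1.0 / sigma_err**2                     # weight, Equation (5)
phi_se = np.sum(w * (t_pick - t_cal)**2)   # misfit, Equation (4)
```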

For forward modelling of seismic data (raytracing of seismic first-arrival paths), we use the shortest path method, which is an efficient and flexible approach to compute the raypaths and traveltimes of first arrivals to all points in the earth simultaneously. More details of the forward modelling can be found in Moser (1991) and Arnulf et al. (2011, 2014, 2018).

For CSEM forward modelling, we solve the quasi-static form of Maxwell's equations using a staggered-grid finite-difference scheme (Yee 1966; Newman and Alumbaugh 1995), following the implementation described in Jaysaval et al. (2014) and Streich (2009).

The third term, $\Phi^{\rm FCM}$, integrates fuzzy clustering into the inversion, thereby imposing statistical correlations across model parameters. Clustering identifies groups of data (here, cell-wise or control-point-based parameter vectors) such that data within the same cluster are more similar than data in different clusters. In the fuzzy clustering approach, each data point can exhibit partial membership in multiple clusters, which is advantageous when petrophysical properties do not conform to strict one-to-one relationships.

The cost function for a traditional fuzzy c-means (FCM) algorithm is given by
$$\begin{equation} \Phi ^{\rm FCM} = \sum _{k=1}^{N} \sum _{i=1}^{c} (\mu _{i,k})^m (\mathbf {x}_k - \mathbf {v}_i)^T A (\mathbf {x}_k - \mathbf {v}_i), \end{equation}$$ (6)
where $\mathbf{x}_k$ is the $k$th data vector (e.g., a pair of model parameters from the seismic and CSEM domains), $\mathbf{v}_i$ is the centre of the $i$th cluster, and $\mu_{i,k}$ represents the membership of $\mathbf{x}_k$ in the $i$th cluster. The exponent $m$ governs the “fuzziness” of the membership values, and $A$ is the distance-norm matrix that defines the geometric shape of each cluster. The classical Euclidean norm corresponds to $A=I$, producing spherical clusters, while more general choices for $A$ allow the detection of ellipsoidal or otherwise more complex cluster shapes. A generalized FCM approach (Gustafson and Kessel 1979) can adopt an adaptive (Mahalanobis) distance for each cluster,
$$\begin{equation} A_i = \bigl [\rho _i\,\det (F_i)\bigr]^{\frac{1}{n}}\,F_i^{-1}, \end{equation}$$ (7)
where $\rho_i$ is a fixed cluster-volume parameter, equal to the determinant of $A_i$ (typically set to one), and $F_i$ is the fuzzy covariance matrix given by
$$\begin{equation} F_i = \frac{\sum _{k=1}^{N} (\mu _{i,k})^m (\mathbf {x}_k - \mathbf {v}_i)^T (\mathbf {x}_k - \mathbf {v}_i)}{\sum _{k=1}^{N} (\mu _{i,k})^m}. \end{equation}$$ (8)
This formulation permits clusters to adapt their shapes to the local data distribution. To incorporate prior information about cluster centres within this fuzzy clustering, we follow Sun and Li (2016a, 2016b) and include the additional term $\lambda \sum_{i=1}^{c} \Vert \mathbf{v}_i - \mathbf{g}_i\Vert_2$ in the overall objective (Equation 1). The update formula for the cluster centres thus becomes
$$\begin{equation} \mathbf {v}_i = \frac{\sum _{k=1}^{N}(\mu _{i,k})^m \,\mathbf {x}_k + \lambda \,\mathbf {g}_i}{\sum _{k=1}^{N}(\mu _{i,k})^m + \lambda }, \end{equation}$$ (9)
where $\mathbf{g}_i$ is the prior centre for the $i$th cluster.

The last term in the joint objective function incorporates prior information about known geology by penalizing the deviation of the inverted cluster centres $\mathbf{v}_i$ from user-specified centres $\mathbf{g}_i$; $\lambda$ quantifies our confidence in those priors. A value of $\lambda = 0$ implies no prior information about the cluster centres, leaving the final centres entirely determined by the inversion. To balance the contributions of each term, we first normalize the individual cost functions by their target misfits, determined from separate single-method inversions. This ensures that all data types contribute comparably toward the global minimum, avoiding the need for additional weighting factors.
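The clustering constraint can be sketched in a few lines of Python. The toy example below uses synthetic two-cluster data in a hypothetical (log-resistivity, velocity) space and the Euclidean special case $A=I$ rather than the adaptive norm of Equation (7); the centre update implements Equation (9), with $\lambda$ pulling the centres towards the priors $\mathbf{g}_i$:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: N points in (log-resistivity, velocity) space drawn
# around two hypothetical facies (all values made up).
N, c, m = 200, 2, 2.0
x = np.vstack([rng.normal([0.0, 2000.0], [0.1, 50.0], (N // 2, 2)),
               rng.normal([1.5, 4500.0], [0.1, 50.0], (N // 2, 2))])
g = np.array([[0.0, 2000.0], [1.5, 4500.0]])  # prior cluster centres g_i
lam = 1000.0                                  # prior weight lambda

v = x[rng.choice(N, size=c, replace=False)].copy()  # initial centres
for _ in range(20):
    # Squared Euclidean distances (A = I), shape (N, c).
    d2 = ((x[:, None, :] - v[None, :, :]) ** 2).sum(axis=-1) + 1e-12
    # Standard FCM membership update for fuzziness exponent m.
    mu = d2 ** (-1.0 / (m - 1.0))
    mu /= mu.sum(axis=1, keepdims=True)
    w = mu ** m
    # Equation (9): membership-weighted means pulled towards the priors.
    v = (w.T @ x + lam * g) / (w.sum(axis=0)[:, None] + lam)
```

With a large `lam` the recovered centres stay close to the priors; with `lam = 0` they are determined by the data alone.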

By merging the fuzzy clustering cost function Φ FCM $\Phi ^{\rm FCM}$ with the CSEM and seismic data misfits, we obtain a holistic joint inversion framework capable of accommodating both multi-physics observations and prior geological information. The membership functions enable a flexible parameter coupling scheme, particularly for complex relationships between model parameters, while the additional penalization term maintains consistency with any known geological constraints.

2.3 Very-Fast Simulated Annealing

Simulated annealing (SA) is an optimization algorithm inspired by the thermodynamic annealing process (Kirkpatrick et al. 1983; see Sen and Stoffa 2013 for geophysical applications). Annealing involves heating a metal until it melts and then cooling it slowly in a controlled manner to achieve the lowest energy state. During this process, thermal equilibrium at each temperature level yields a probability distribution for the molecular configurations described by the Boltzmann distribution:
$$\begin{equation} p_i = \frac{\text{e}^\frac{-E_i}{kT}}{\sum _{j\in \mathcal {S}}\text{e}^\frac{-E_j}{kT}}, \end{equation}$$ (10)
where $E_i$ is the energy state of configuration $i$, $\mathcal{S}$ is the set of all possible configurations, $k$ is the Boltzmann constant, and $T$ is the temperature.
In the SA algorithm, optimization begins with an initial model and a high initial temperature. The temperature is gradually lowered according to a predefined cooling schedule, and new model proposals are drawn from a flat distribution. The objective function to be minimized corresponds to the energy difference between the current and proposed models. A proposed model with lower energy is always accepted, while a higher energy model can still be accepted according to the temperature-dependent Metropolis–Hastings criterion:
$$\begin{equation} P_n = \text{e}^{{\left(-\frac{E(\mathbf {m}_n) - E(\mathbf {m}_{n-1})}{t_n}\right)}}, \end{equation}$$ (11)
where $t_n$ is the temperature at the current iteration. The acceptance probability $P_n$ decreases with temperature, becoming more selective as the algorithm converges towards the global minimum. The cooling schedule significantly affects convergence: faster cooling reduces computational cost but may fail to find the global optimum, while slower cooling increases the probability of finding a global solution but at a higher computational cost.
To enhance computational efficiency, Ingber (1989) proposed VFSA, a variant that achieves rapid convergence with minimal compromise on global optimization capability. VFSA differs from standard SA primarily in the method for generating proposal models. Instead of drawing from a flat distribution, VFSA uses a temperature-dependent Cauchy-like distribution around the previously accepted model:
$$\begin{equation} m^{n}_{i} = m^{n-1}_i + y_i(m^{ub}_i - m^{lb}_i), \end{equation}$$ (12)
where $m^{ub}$ and $m^{lb}$ represent the upper and lower bounds for the model parameters, respectively, and
$$\begin{equation} y_i = \mathrm{sgn}(u_i - 0.5)\,t_{n} {\left[(1 + 1/t_{n})^{|2u_i - 1|} - 1\right]}, \end{equation}$$ (13)

with $u_i$ a uniformly distributed random number between 0 and 1. This approach enables VFSA to initially explore a wide region of the parameter space and subsequently focus narrowly as it approaches the global optimum. Additionally, VFSA allows distinct cooling schedules and parameter bounds for each model parameter, making it especially suited for multi-parameter optimization scenarios. Compared to other global optimization methods such as genetic algorithms or particle swarm optimization, VFSA offers greater flexibility in controlling exploration versus exploitation via its temperature schedule and has been shown to be effective in problems where a rough global search is needed early, followed by fine-tuned local refinement (Sen and Stoffa 2013).
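A compact sketch of the proposal and acceptance steps in Python (hypothetical bounds and model values; re-drawing out-of-bounds components is one common convention, not necessarily the authors' exact implementation):

```python
import numpy as np

rng = np.random.default_rng(1)

def vfsa_proposal(m_prev, lb, ub, t):
    """Draw a proposal from the temperature-dependent Cauchy-like
    distribution of Equations (12)-(13); components falling outside
    their bounds are re-drawn."""
    m_new = np.empty_like(m_prev)
    for i in range(m_prev.size):
        while True:
            u = rng.uniform()
            y = np.sign(u - 0.5) * t * ((1.0 + 1.0 / t) ** abs(2.0 * u - 1.0) - 1.0)
            cand = m_prev[i] + y * (ub[i] - lb[i])
            if lb[i] <= cand <= ub[i]:
                m_new[i] = cand
                break
    return m_new

def accept(e_new, e_old, t):
    """Metropolis rule of Equation (11): downhill moves are always
    accepted; uphill moves are accepted with probability exp(-dE/t)."""
    return e_new <= e_old or rng.uniform() < np.exp(-(e_new - e_old) / t)

lb, ub = np.array([1500.0, 1500.0]), np.array([5000.0, 5000.0])
m = np.array([3000.0, 3200.0])
wide   = vfsa_proposal(m, lb, ub, t=1.0)    # high temperature: wide exploration
narrow = vfsa_proposal(m, lb, ub, t=1e-4)   # low temperature: mostly small steps
```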

In this study, we apply VFSA for joint inversion of seismic and CSEM data, explicitly interpreting the objective function as the analogue to the thermodynamic energy. Each VFSA iteration perturbs sparse control-point parameters, interpolates the resulting model onto the full computational grid, computes the forward solution and evaluates the joint objective function. Sparse parameterization greatly enhances computational efficiency while preserving the essential complexity of geological models.

Given VFSA's stochastic nature, a single inversion run might converge to a different local minimum depending on the randomly initialized conditions. Individual VFSA runs may also become trapped in local minima due to an insufficient cooling schedule or a finite number of iterations. Therefore, we conduct multiple independent VFSA inversion runs to adequately explore the model space and obtain an ensemble of plausible subsurface models. Although individual runs provide insights into local solutions and parameter exploration, we rely primarily on statistical averaging across multiple runs, computing the mean, median and variance, to quantify model uncertainties and evaluate parameter stability. This ensemble approach identifies regions of the model space that are consistently well-constrained versus those that exhibit high uncertainty, providing a pragmatic balance between computational cost and solution reliability, particularly suited for large-scale joint inversion problems where fully rigorous posterior sampling methods (e.g., Markov chain Monte Carlo or full MC methods) remain computationally prohibitive.
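The ensemble statistics amount to element-wise reductions over a stack of final models from the chains; a sketch with stand-in random models in place of actual chain results:

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical stack of final models from independent VFSA chains:
# a (n_chains, nz, nx) array of, e.g., velocities.
models = 3000.0 + 200.0 * rng.standard_normal((15, 40, 175))

mean_model   = models.mean(axis=0)
median_model = np.median(models, axis=0)
std_model    = models.std(axis=0)
# Percentage coefficient of variation, cell by cell.
cov_percent  = 100.0 * std_model / mean_model
```

Cells where `std_model` (or `cov_percent`) is large flag the poorly constrained regions of the model space.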

In the following algorithm, we explain one inner-loop iteration of the probabilistic joint inversion workflow for CSEM and seismic data inside the main VFSA loop. The model parameters $\mathbf{m}_{res}$ and $\mathbf{m}_{vel}$ represent the vertical resistivity ($R$) and P-wave velocity ($V_p$), respectively.

[Algorithm: one inner-loop iteration of the probabilistic joint inversion workflow.]

We normalize the individual CSEM and seismic cost functions by their respective target misfits and do not use additional relative weights.
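For concreteness, one inner-loop iteration might look like the following sketch, where every callable in `cfg` (proposal, interpolation, forward solvers, FCM evaluation) is a hypothetical stand-in for the actual components described above:

```python
import numpy as np

rng = np.random.default_rng()

def vfsa_inner_step(m_res, m_vel, t, state, cfg):
    """One inner-loop step of the joint inversion (sketch only; the
    entries of cfg are placeholders for the real implementations).

    m_res, m_vel : control-point values of vertical resistivity and Vp
    t            : current VFSA temperature
    state        : dict holding the current accepted model and its energy
    cfg          : dict of proposal, interpolation, forward and FCM operators
    """
    # 1. Propose new control-point values from the Cauchy-like distribution.
    m_res_new = cfg["propose"](m_res, cfg["res_bounds"], t)
    m_vel_new = cfg["propose"](m_vel, cfg["vel_bounds"], t)

    # 2. Interpolate sparse parameters back onto the regular modelling grids.
    R  = cfg["interp"](m_res_new)
    Vp = cfg["interp"](m_vel_new)

    # 3. Forward-model both data types and evaluate the misfits,
    #    each normalized by its target misfit (no extra relative weights).
    phi_csem = cfg["csem_misfit"](R) / cfg["csem_target"]
    phi_se   = cfg["se_misfit"](Vp) / cfg["se_target"]

    # 4. Fuzzy-clustering coupling term plus the prior-centre penalty
    #    of Equation (1).
    phi_fcm, prior_pen = cfg["fcm"](R, Vp, cfg["prior_centres"], cfg["lam"])

    energy = phi_csem + phi_se + cfg["gamma"] * phi_fcm + prior_pen

    # 5. Metropolis accept/reject at temperature t.
    d_e = energy - state["energy"]
    if d_e <= 0 or rng.uniform() < np.exp(-d_e / t):
        state.update(m_res=m_res_new, m_vel=m_vel_new, energy=energy)
    return state
```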

3 Test Case

We apply the proposed joint inversion workflow to controlled-source electromagnetic (CSEM) and seismic traveltime data generated on a subset of the SEAM Phase I model (Pangman 2007). The SEAM Phase I model is a widely used benchmark in geophysical exploration research that simulates realistic subsurface conditions, including salt structures, for validating inversion methods. The subsurface model built on the SEAM Phase I dataset mimics the realistic geology of a salt-containing region in the Gulf of Mexico (Fehler and Keliher 2011): a massive salt body with steep flanks embedded in a layered sediment environment. The velocity model has complex geometrical structures and strong velocity variations that make seismic imaging below the salt challenging. The top boundary of the salt is rugose and has a thin layer of muddy salt with a velocity slightly lower than that of the main salt body. Since we use only first-arrival traveltime data for this numerical test, we restrict our area of interest to $4\ \text{km}$ depth. The SEAM model for this test is a subset of a 2D slice of the original 3D model (at north $= 23{,}900\ \text{m}$) with dimensions of $35\ \text{km} \times 4\ \text{km}$. The model has a seawater layer of $0.3125\ \Omega\text{m}$ vertical resistivity and $1490\ \text{m/s}$ P-wave velocity, whose thickness (the water depth) varies from $0.7269\ \text{km}$ to $1.606\ \text{km}$. The true synthetic models for this experiment are shown in Figure 2. The sediments on either side of the salt body have some interesting formations, which are not visible in the velocity model but are prominent in the vertical resistivity model. A preliminary cluster analysis of the cross-plot between the true model parameters shows that the geology of the model can reasonably be described with five clusters. We treat these cluster centres as prior geological information about the facies in the model. The goal of this numerical test is to perform the joint inversion of seismic and CSEM data over this SEAM model using the given petrophysical and geological constraints and to quantify the uncertainty in the estimated models.

A 2D slice of the SEAM Phase I model: (a) vertical resistivity ($R$) on a log scale and (b) P-wave velocity ($V_p$). The model consists of a salt diapir in a sedimentary basin below the sea floor (white line). There are thin reservoirs ($R_1$, $R_2$, $R_3$ and $R_4$) on both sides of the salt diapir, which are visible only in the vertical resistivity model.

For the seismic data, we assumed a typical ocean-bottom seismometer profile with 34 receivers uniformly distributed every $1\ \text{km}$, with the seismic wavefield downward extrapolated to the seafloor (Arnulf et al. 2011, 2014). For the seismic modelling, we took advantage of source–receiver reciprocity: we model 34 shots uniformly distributed on the ocean bottom from $x = 1\ \text{km}$ to $x = 34\ \text{km}$, with receivers at a $50\ \text{m}$ interval. For CSEM modelling, we used 17 sources between $x = 3\ \text{km}$ and $31\ \text{km}$ and receivers every $500\ \text{m}$. The CSEM source is an $x$-oriented horizontal electric dipole towed $30\ \text{m}$ above the seabed, with receivers at seabed depth. We use two frequencies, $0.1$ and $0.25\ \text{Hz}$, and set their corresponding maximum offsets to $10$ and $8\ \text{km}$, respectively. Forward modelling for both methods is done on regular grids ($200\ \text{m} \times 100\ \text{m}$); for the inversion, however, we use the sparse parameterization approach. That is, we interpolate the models onto 400 randomly generated points for the very fast simulated annealing (VFSA) inversion. Once a model is accepted, we transform it back to an orthogonal grid for the forward computations. The interpolation of the model from the sparse grid uses linear radial basis functions. The choice of the number of sparse parameterization points is a trade-off between how well the model features are captured and how long the VFSA algorithm takes to converge.

Since the water layer is known a priori, we perturb the models only below it. For the sparse parameterization, we fix 400 inversion points (the same for both models) per chain and use scattered data interpolation to transform the perturbations onto the regular modelling grids.

For each VFSA chain, the sparse parameterization was randomly generated (see the Supporting Information). As such, each starting model sampled a different spatial location of the model space. For this experiment, we have run 15 different chains (the initial models and inversion points are shown in the Supporting Information). For fuzzy c-means (FCM) parameters, we assume four clusters (not including the water layer) in the model and provide prior centres g i $\mathbf {g}_i$ (with prior weight λ = 1000 $\lambda = 1000$ ) as deduced from the true models. For a real dataset, these centres would be inferred using the prior knowledge about the subsurface. Figure 3 shows the resistivity and velocity model recovered in one-chain of the joint inversion. The probabilistic nature of the joint inversion workflow allows us to generate a number of models, which can be used to compute uncertainty in the model via statistical analysis. We compute mean, median and uncertainty in the joint inversion for 15 independent chains of VFSA for 3000 iterations. Since a single chain of VFSA provides one model (unlike sampling methods), the mean is calculated from the final models of each chain. Figure 4a,b shows the mean and median resistivity models. The top of the main salt diaper and the flanks are recovered. The reservoir R 1 $R_1$ is also clearly visible; however, R 2 $R_2$ and the left flank of the salt are not clearly resolved. Similarly, the reservoirs on the right side of the salt R 3 $R_3$ and R 4 $R_4$ are recovered together and not clearly distinguished. The background sediments are well-recovered. Due to the lack of EM signal in the bottom corners as well as inside the salt body, we see higher uncertainties in those areas as shown in Figure 4c,d. We notice that the reservoirs in the inverted models are slightly deeper than their location in the true resistivity model. As far as the velocity model is concerned, the top boundary of the salt is well resolved. 
The salt boundary is clearly visible in the mean and median models shown in Figure 4e,f, respectively. The background sediments are well recovered except for the bottom corners and the lower part of the salt, owing to the lack of rays passing through these areas. Given that we started with random initial models, the estimated models from the joint inversion show excellent agreement with the true synthetic models.

Figure 3: Inversion results for one chain: P-wave velocity ($V_p$) and vertical resistivity ($R$) models.
Figure 4: Mean (a), median (b), standard deviation (c) and percentage coefficient of variation (d) for the estimated resistivity models, and mean (e), median (f), standard deviation (g) and percentage coefficient of variation (h) for the estimated velocity models from the joint inversion. We notice that the reservoirs in the inverted models are slightly deeper than their locations in the true resistivity model.

The joint inversion framework allows us to manually control the weight of the prior cluster centres by adjusting the parameter $\lambda$. A smaller value of $\lambda$ imposes weaker prior constraints, and the final cluster centres are mostly recovered through the inversion. A higher value of $\lambda$, on the other hand, enforces a strong prior constraint that does not let the centres in the proposal model move far from the prior centres. For example, Figure 5a shows the cross-plot between velocity and resistivity of the true synthetic model clustered by FCM with five centres. Using these centres as priors, Figure 5b,c shows the petrophysics recovered from the joint inversion with $\lambda = 10$ and $\lambda = 1000$, respectively.
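A minimal sketch of prior-constrained clustering in this spirit is given below. It adds a quadratic penalty pulling each centre towards its prior $\mathbf{g}_i$, which is a simplification of the generalized FCM used in this work, and all names and values are illustrative:

```python
import numpy as np

def fcm_with_prior(X, g, lam, m=2.0, n_iter=50, seed=0):
    """Fuzzy c-means with a quadratic penalty lam * ||c_i - g_i||^2
    pulling cluster centres towards prior centres g (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    centres = g + 0.1 * rng.standard_normal(g.shape)  # start near priors
    for _ in range(n_iter):
        # Squared distances of every point to every centre: shape (n, c)
        d2 = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(-1) + 1e-12
        # Standard FCM membership update for fuzziness exponent m
        u = 1.0 / (d2 ** (1.0 / (m - 1.0)))
        u /= u.sum(axis=1, keepdims=True)
        um = u ** m
        # Centre update: weighted data mean blended with the prior centre;
        # a larger lam keeps the centres closer to g
        centres = (um.T @ X + lam * g) / (um.sum(0)[:, None] + lam)
    return centres, u
```

With a very large `lam`, the recovered centres essentially coincide with the priors; with a small `lam`, they follow the data, mirroring the behaviour seen in Figure 5b,c.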

Figure 5: True (a) and recovered petrophysics with $\lambda = 10$ (b) and $\lambda = 1000$ (c). Each point is coloured by a kernel-smoothed probability density estimate over its neighbouring points. Red dots represent the prior cluster centres included in the joint inversion as constraints. For a smaller value of $\lambda$, the centres (red dots) can move, and the distance between the centres obtained from the joint inversion and the prior centres is minimized over the iterations. For a larger value of $\lambda$, however, the inverted centres are forced near the prior centres even in the early stages of the joint inversion. This figure represents the joint inversion evolution for one chain.

Figure 6 shows the posterior probability density along five vertical profiles in both the estimated resistivity (top row) and the estimated P-wave velocity models (bottom row). Assuming that the estimated values at each location (not the estimated models themselves) across all chains follow a Gaussian distribution, the posterior probability distribution (PPD) has been computed using histograms. In the resistivity models, the profile at $x = 12$ km passes through the reservoir $R_1$ between 2.7 and 3.0 km depth. The uncertainty at the top of $R_1$ is lower than at its bottom, which means that the upper part of $R_1$ is better resolved than the lower part. The vertical profile at $x = 14$ km passes through part of the reservoir $R_2$ between 3.0 and 3.2 km depth. Since $R_2$ is close to the salt diapir, it is not as well resolved as $R_1$. The vertical profile at $x = 18$ km passes through the salt diapir between 2.0 and 4.0 km depth. This profile shows that the uncertainties are lower near the boundary of the salt and increase towards its centre. The region between $R_3$ (2.4--2.6 km depth) and $R_4$ (2.7--2.9 km depth) in the vertical profile at $x = 24$ km has lower uncertainty bounds; however, the uncertainties inside the reservoirs are relatively higher.

Figure 6: Posterior probability density (PPD) along vertical profiles at $x = (12, 14, 18, 24, 32)$ km for the resistivity (top row) and velocity models (bottom row). The red line shows the mean of the models obtained from 15 different chains, and the green lines show the upper and lower bounds of the PPD ($\pm 2$ standard deviations). The uncertainty at the top of $R_1$ is lower than at its bottom, which means that the upper part of $R_1$ is better resolved than the lower part. Since $R_2$ is close to the salt diapir, it is not as well resolved as $R_1$. The uncertainties are lower near the boundary of the salt and increase towards its centre. The region between $R_3$ and $R_4$ has lower uncertainty bounds; however, the uncertainties inside the reservoirs are relatively higher.

Figure 7 shows the convergence of the individual (CSEM and seismic) as well as the total (joint) cost functions for 3000 iterations of 15 different VFSA chains. The individual CSEM and seismic costs are normalized by their target errors, that is, 1 and 0.01, respectively. The convergence plots show that the joint inversion converges within 3000 VFSA iterations with approximately equal weighting of the individual cost functions. This makes VFSA a more affordable alternative to Monte Carlo or Markov chain Monte Carlo methods, which require many thousands of iterations to reach convergence for posterior analysis.
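The target-error normalization that balances the two misfits can be sketched as follows; the function name and misfit values are illustrative, not taken from the actual implementation:

```python
# Target errors used to normalize the individual misfits (from the text)
TARGET_CSEM, TARGET_SEISMIC = 1.0, 0.01

def joint_cost(csem_misfit, seismic_misfit):
    """Normalize each data misfit by its target error so that both terms
    contribute with approximately equal weight, then sum them."""
    e_csem = csem_misfit / TARGET_CSEM
    e_seis = seismic_misfit / TARGET_SEISMIC
    return e_csem + e_seis, e_csem, e_seis

# Example: misfits at twice their targets give equal normalized terms,
# so neither dataset dominates the joint cost
total, e1, e2 = joint_cost(2.0, 0.02)
```

Both normalized terms approach 1 as each dataset reaches its target error, which is why the curves in Figure 7a,b flatten at comparable levels.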

Figure 7: Convergence of the (a) CSEM, (b) seismic and (c) total cost functions in the joint inversion for 15 VFSA chains. The individual CSEM and seismic costs are normalized by their target errors, that is, 1 and 0.01, respectively.

4 Conclusions

We have proposed a probabilistic workflow for joint inversion and uncertainty estimation, incorporating petrophysical and geological constraints. We applied this workflow to the joint inversion of synthetic controlled-source electromagnetic and seismic data from the SEAM Phase I model. The workflow efficiently integrates petrophysical constraints and prior geological knowledge of the model. With better priors, such as facies interpreted from existing well logs, one can assign a significantly higher prior weight, causing the joint inversion to honour the geological information more rigorously.

We have demonstrated that VFSA with sparse parameterization converges faster and enables the affordable computation of multiple chains, which in turn provides uncertainty estimates in the model. The generalized fuzzy c-means approach can accommodate different distance measures, which are necessary for efficient clustering based on the statistical relationships between model parameters. VFSA achieves a balance between the efficiency of deterministic methods and the robustness of sampling-based approaches. However, its performance depends on the chosen cooling schedule and, for certain configurations, it can become trapped in local minima. It should also be noted that the required number of iterations, although significantly lower than that needed for Markov chain Monte Carlo methods, is still substantially higher than that typically used in deterministic approaches. For three-dimensional inversion, given the high computational cost of the forward solvers, a more practical application of this stochastic joint inversion approach would be to estimate starting models for deterministic inversion methods.
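For concreteness, a standard VFSA cooling schedule and temperature-dependent model perturbation (following Ingber's commonly used formulation, not necessarily the exact variant implemented here) can be sketched as:

```python
import numpy as np

def vfsa_temperature(k, T0=1.0, c=1.0, ndim=1):
    """VFSA cooling schedule: T(k) = T0 * exp(-c * k**(1/ndim)).
    The decay rate c and dimensionality ndim control convergence speed."""
    return T0 * np.exp(-c * k ** (1.0 / ndim))

def vfsa_perturb(x, T, lo, hi, rng):
    """Draw a new parameter value from VFSA's temperature-dependent,
    Cauchy-like distribution; proposed moves shrink as T decreases."""
    u = rng.uniform()
    y = np.sign(u - 0.5) * T * ((1.0 + 1.0 / T) ** abs(2.0 * u - 1.0) - 1.0)
    return np.clip(x + y * (hi - lo), lo, hi)
```

Early in a chain the high temperature allows large jumps (global exploration); as `T` decays, proposals concentrate near the current model, which is the mechanism behind both the fast convergence and the local-minimum risk noted above.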

Acknowledgements

The research was funded by TOTAL E and P, Houston, USA. The first author was partially supported by the Research Council of Finland (359261).

Open access publishing facilitated by Geologian tutkimuskeskus, as part of the Wiley - FinELib agreement.

Data Availability Statement

The resistivity and velocity models used in the test case can be openly accessed from Fehler and Keliher (2011). A MATLAB function for fuzzy c-means clustering used in this paper is freely available from Balasko et al. (2005).
