Volume 2025, Issue 1 6613167

Research Article

Open Access

Comparison of Machine Learning Methods and Ordinary Kriging for Gravimetric Mapping: Application to Yagoua Area (Northern Cameroon)

Mfenjou Martin Luther,

Corresponding Author

Mfenjou Martin Luther

[email protected]

orcid.org/0000-0002-6296-3414

Department of Applied Mathematics and Computer Science , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

LaRI Lab , University of Maroua , Maroua , Cameroon , uni-maroua.citi.cm

Search for more papers by this author

Boroh Andre William,

Boroh Andre William

orcid.org/0000-0001-7105-7301

Department Mining Geology , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

Search for more papers by this author

Kasi Njeudjang,

Kasi Njeudjang

orcid.org/0000-0003-2739-6693

Department of Quality Industrial Safety and Environment , University of Maroua , Kaele , Cameroon , uni-maroua.citi.cm

Search for more papers by this author

Kabe Moukete Eric Bruno,

Kabe Moukete Eric Bruno

orcid.org/0000-0003-4556-9287

Department Mining Geology , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

Search for more papers by this author

Amaya Adama,

Amaya Adama

orcid.org/0009-0008-1597-8761

Department of Geological Mapping and Geomatic , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

Search for more papers by this author

Mfenjou Martin Luther,

Corresponding Author

Mfenjou Martin Luther

[email protected]

orcid.org/0000-0002-6296-3414

Department of Applied Mathematics and Computer Science , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

LaRI Lab , University of Maroua , Maroua , Cameroon , uni-maroua.citi.cm

Search for more papers by this author

Boroh Andre William,

Boroh Andre William

orcid.org/0000-0001-7105-7301

Department Mining Geology , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

Search for more papers by this author

Kasi Njeudjang,

Kasi Njeudjang

orcid.org/0000-0003-2739-6693

Department of Quality Industrial Safety and Environment , University of Maroua , Kaele , Cameroon , uni-maroua.citi.cm

Search for more papers by this author

Kabe Moukete Eric Bruno,

Kabe Moukete Eric Bruno

orcid.org/0000-0003-4556-9287

Department Mining Geology , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

Search for more papers by this author

Amaya Adama,

Amaya Adama

orcid.org/0009-0008-1597-8761

Department of Geological Mapping and Geomatic , University of Ngaoundere , Meiganga , Cameroon , univ-ndere.cm

Search for more papers by this author

First published: 05 March 2025

https://doi.org/10.1155/je/6613167

Academic Editor: Andras Szekrenyes

Share a link

Email
Wechat
Bluesky

Abstract

This work focuses on the comparison of a number of machine learning methods (random forest, support vector machine (SVM), and artificial neural networks (ANN)) and ordinary kriging (OK). It is based on OK. Indeed, OK, which is a stochastic spatial interpolation method, predicts the value of a natural phenomenon at unsampled sites, and it is an unbiased linear combination with a minimal variance that yields observations on the model at neighbouring sites. So, knowing the various improvements made by machine learning–based methods, we used them. The analysis of the different methods provides a basis for comparison according to the defined indicators. A better gravimetric mapping requires the sampling of a certain number of points whose densities will make it possible to carry out geostatistical analyses and interpretations and thus be able to estimate the deposit. Thus, concerning the prediction of the parameters used in the detection of gravity anomalies, OK is better with R² = 0.99. Regarding the prediction of gravity anomalies, OK is able to reproduce a good variability of the anomalies, but when the spatial variability interval of the ANNs is close, it is then better indicated than OK. However, an increase in the data size would allow us to see the best performance of machine learning–based methods in gravity mapping.

1. Introduction

During the last two decades, machine learning has been increasingly applied in the geosciences, notably in the fields of geochemistry [1, 2], geomatics and geological mapping [3, 4], structural geology [5, 6], and also geostatistics [7, 8]. In geophysics, its application started in fault detection and prevention [9, 10], then extended to geological and mining fields through geoelectrics [11, 12], magnetics [13], and seismics [14, 15]. In gravimetry, the contribution of machine learning has been mainly for data inversion. For example, Chen et al. [16] used deep neural networks on Bouguer anomalies for the determination of the spatial structure of salt and concluded that the machine learning–based method from gravity data complements the processing and interpretation of seismic data for subsurface exploration. Next, a 3D inversion tool for gravity data was developed by Zhang et al. [17] for density determination. Instead of learning the density information of each 2D grid point as is often used, this network learns the boundary position, vertical center, thickness, and density distribution and reconstructs the 3D model using these predicted parameters [17]. Finally, in a study at Eastern Goldfields in Australia, random forest (RF) was combined with teledetection for lithological mapping, and it was concluded that the method can be an effective additional tool available to geoscientists in a pristine gold-bearing environment when faced with limited data [18]. The Northern Cameroon region has been the subject of previous research in the geophysical framework [19–24]. This paper builds on the work of Nouck et al. [25], which focused on the geostatistical reinterpretation of gravity surveys in the Yagoua area. The contribution of this paper is to use machine learning–based methods to improve gravimetric mapping. The main idea is to leave the so-called classical methods, such as ordinary kriging (OK), to go towards RF, support vector machine (SVM), and artificial neural network (ANN). A comparison is then made according to certain defined indicators.

The rest of this article is organized as follows: Section 2 presents a review of gravity mapping. Section 3 shows the geological setting of the study area. Section 4 presents the conceptual and methodological framework of this work. Section 5 presents the results obtained and discusses them. Finally, Section 6 presents a conclusion and perspectives related to this work.

2. Survey of Gravimetric Mapping

Gravimetry is a geophysical method that measures variations in the Earth’s gravitational potential field [26]. It is a prospecting method that offers the determination of density anomalies in the subsurface. Depending on the objectives and the means available, there are several gravity surveys: land, sea, airborne, and satellite gravity surveys. In the context of this work, the focus is on terrestrial gravity surveys. Better gravity mapping requires the sampling of a certain number of points whose densities will allow geostatistical analyses and interpretations to be carried out and thus be able to estimate the deposit. There are few works that abound on topics related to gravimetric mapping. Figure 1 shows the steps required for good gravimetric mapping.

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Step of realization of gravimetric mapping.

In this work, we focus on the third step: the use of linear estimation methods. Indeed, this is a set of methods used for the estimation of regionalized stationary variables [27]. We build on the work of Nouck et al. [25]. The method used in their work was based on variographic analysis and OK.

2.1. OK

Spatial prediction is the prediction of unobserved values from observed data [28]. To do this, it takes into account knowledge of spatial variation as represented in the variogram or covariance function. OK does not require any information other than the measurements and their geographical coordinates. It is by far the most popular type of kriging, and for good reason; it serves well in most situations with its easily satisfied assumptions [29]. OK is based on the assumption that variation is random and spatially dependent and that the underlying random process is inherently stationary with a constant mean and variance that depends only on the distance and direction between locations and not on absolute position. A kriging prediction is a linear sum of data, which can be in one, two, or three dimensions. Predictions can be made for points (i.e., having the same support as the measurements) or blocks. For this purpose, suppose that the values of a random variable Z have been recorded at the sample points x₁, x₂, ⋯, x_N to give N data, z(x_i), i = 1, 2, ⋯, N. For point kriging, the prediction in x₀ is made by

()

For the estimator to be unbiased, the usual constraint must be imposed:

()

However, the emergence of new technologies in data engineering and artificial intelligence is challenging the results obtained in many fields of activity. Even geostatistical methods are not to be spared. Several studies have shown the limitations of kriging [30]. Indeed, kriging uses simple criteria that do not allow confidence intervals to be characterized. Kriging-based methods apply smoothing to the variability of the data. This could lead to estimation errors. Machine learning is then an alternative for the improvement of geostatistical estimates, including their application to gravity mapping. Machine learning is an application of artificial intelligence that aims to have systems learn and improve from experience without being programmed [9]. The objective of this paper is to make a comparative study of the estimates made with OK and some machine learning methods (RF and ANN) for gravimetric mapping. Before doing so, it is therefore important to present the study area.

3. Geological Framework of the Study Area

The study area used for this work is a vast sandy plain located between longitudes 15°148 ^′ and 15°354 ^′ E and latitudes 10°12 ^′ and 10°395 ^′ N (Figure 2). It covers the locality of Yagoua and its surroundings. Located about 211 km from Maroua, this area is bounded by the Logone River and therefore borders Chad. The whole area is easily accessible by land. It is mostly made up of sedimentary cover of tertiary and quaternary age. The sediments are mainly sandstone, clay, and shale and are overlain by sandy-alluvial and dune formations [25]. These sedimentary deposits are the result of a succession of regressions and transgressions caused by alternating periods of no rainfall and rainy periods.

4. Methodology

4.1. Proposal Design

Figure 3 shows the approach used in this work to compare the methods we are using in order to produce a better predictive map.

For this work, it is chosen, on the one hand, to use OK, as realized by Nouck et al. [25], and, on the other hand, three methods based on machine learning, namely, RF, ANN, and SVM. Subsequently, model validation methods are used to assess the most suitable method, and finally, the gravimetric map is generated using the best method.

4.2. Machine Learning Methods Used

4.2.1. RF

RF method is part of the ensemble learning (EL) methods [31]. It is a method that was developed by Breiman [32] in order to alleviate the overfitting problem encountered in decision tree (DT) algorithms. In this section, we present the RF method in its entirety. RFs are a combination of DTs such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalisation error for forests converges to a limit when the number of trees in the forest becomes large. The generalisation error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node gives error rates that compare favorably with Adaboost but are more robust to noise. Internal estimates monitor error, strength, and correlation, and these are used to show the response to increasing the number of features used in the split. Internal estimates are also used to measure the importance of variables [33]. The method involves using randomly selected inputs or combinations of inputs at each node to grow each tree. The resulting forests give an accuracy that compares favorably with Adaboost. Zhang [34] explains the implementation principle of RF. As the name implies, an RF is a tree-like set, with each tree depending on a set of random variables. More formally, for a p-dimensional random vector

representing the real-valued input or predictor variables and a random variable Y representing the real-valued response, we assume an unknown joint distribution P_XY (X, Y). The goal is to find a prediction function f (X) to predict Y. The prediction function is determined by a loss function L(Y, f (X)) and defined to minimize the expected value of the loss.

()

where the indices denote the expectation with respect to the joint distribution of X and Y. Intuitively, L (Y, f (X)) is a measure of the proximity of f (X) to Y; it penalises values of f (X) that are far from Y. Figure 4 shows the algorithm of the RF algorithm as follows [35]:

4.2.2. ANN

ANNs are designed on the basis of human brain simulation in order to determine the relationship between the outputs and inputs of a system. An ANN is trained with the experimental data available throughout the learning process and is used to estimate the unknown data [36]. Neural networks consist of simple synchronous processing components called nodes or neurons located in layers. Usually, an ANN has three layers: an output layer, a hidden layer, and an input layer. The most common type of ANN in petroleum engineering applications is the multilayer perceptron (MLP), which is trained for the purpose of a backpropagation (BP) approach [37]. MLP networks rank as conventional methods in terms of their very short improvement time and ability to implement the associated information. Starting from the left of Figure 5, the input values are denoted X₁ and X₂. The hidden scores are denoted Σ₁ and Σ₂, and the output is denoted Σ₃. The input value of each masked node of

•
the sum of all input variables
•
the weighting factor feeding these nodes

In addition, there may be a bias term that behaves similarly to the intersection in a typical linear regression. The following framework summarises the functioning of an ANN [38]:

4.2.3. SVM

This is a computer algorithm that learns by example to assign labels to objects [39]. It represents a great technique for general (nonlinear) classification and outlier detection by using an intuitive model representation. SVM approaches are summarised as follows [40]: nonlinearity, class separation, overlapping, one-class classification, and multiclassification. Several problems today have their solutions improved thanks to SVM. The literature abounds with works that exploit the SVM method. In this work, we are interested in the use of SVM in the process of estimating reserves in the mining and petroleum domain. In estimating ore reserves from sparse and imprecise data, Dutta et al. [8] use several automatic learning methods, including the SVM method. This method allows for predicting the drilling parameters in a minimal way and obtaining a good estimate of the reserve. Wong et al. [41] use SVM methods for reservoir characterization in an intelligent way using the petrophysical properties of reservoirs. Figure 6 shows the structure of the SVM algorithm [41].

4.3. Model of Cross-Validation

Validation models are those used to evaluate the performance of this type of mathematical model. Generally, in machine learning, it is to separate the database into three groups. The first part is used for training, the second is for testing, and the third is for model validation. Then, an estimation error is calculated by cross-validation according to certain parameters [42, 43]. Three validation methods are used in the validation of geostatistical and machine learning prediction models: the coefficient of determination (R²), the mean absolute error (MAE), and the root mean square error (RMSE).

4.3.1. Coefficient of Determination

This coefficient is defined as the difference in one variable that can be expressed from the difference in another variable. It goes from 0 to 1; the higher its value, the better the model is. Its mathematical formula is

()

4.3.2. Mean Absolute

This parameter measures the average of the differences between the true values and the predicted values. It is all the better for a dataset when its value is close to 0.

()

4.3.3. RMSE

This metric is a quadratic measure of the error for cross-validation. His mathematical expression is given as being the square root of the difference from the error squared.

()

5. Results

5.1. Exploratory Data Analysis

A basic statistical analysis is carried out on the data in order to visualize their overall trend. A total of 104 points will be used for this study, and the results of which are presented in Table 1. This analysis has made it possible to summarise and synthesize the information contained in the statistical series and to highlight its properties. As can be seen in Table 1, the different statistical parameters calculated illustrate the degree to which the sampled data tend to deviate with a standard deviation of 1295 and a variance of 1.676∗10⁶. These anomalies have a mean of −4471 mgal and range from a minimum value of −6643 mgal to a maximum value of −1090 mgal.

Table 1. Statistical analysis of field data.

Min.	Max.	Mean	Std dev.	Total
−6643.00	−1090.00	−4471	1295	104

The histogram in Figure 7 divides the statistical series of anomalies into different classes. These different classes of anomalies are represented on the abscissa, and the values on the ordinate indicate the frequency of anomaly data belonging to this class. Thus, in the case of our dataset, we have 12 classes represented. The mode of this series is the class (−5000 mgal; −4600 mgal). This implies that the majority of our anomalies have variations within this range.

The distribution is positively skewed because the frequencies decrease much more to the right. So the anomalies do not follow a Gaussian distribution. In order to better examine the spatial continuity of the anomalies, it would be important to study the variogram and the variogram map.

5.2. Model Formulation

5.2.1. Variogram Analysis

This map (Figure 8) allows one to examine the spatial correlation of the anomalies in the four directions of space. A cell in this variogram map represents a family of point pairs with equal directions and distances between points. The value of the cell represents the value of the variogram for that family of pairs, and these values vary between 1512.50 and 1307943.61. This map shows the presence of a strong major anisotropy along the N20 direction, and we also note a minor N290 direction. These directions will be considered for the theoretical fitting of the experimental variogram.

Figure 9 shows the representation of the experimental and theoretical variograms. Two experimental variograms have been plotted along the N20 direction (red dotted line) and the N290 direction (green dotted line). The red line represents the fit of the two experimental variograms. It results from the superposition of the two variograms with different ranges; it is a jiggly structure.

5.2.2. Hyperparameter Tuning

Hyperparameter optimisation is critical for maximising the predictive performance of machine learning models. This study explored the optimal configuration of key hyperparameters for SVMs, ANNs, and RFs using grid search cross-validation. Table 2 summarises the hyperparameter search ranges, their descriptions, and the best values identified, along with the corresponding model performance metrics.

Table 2. Hyperparameter tuning.

Model	Parameter	Search range	Description	Best value
SVM	Kernel	[“linear”, “rbf”]	Specifies the kernel type for transforming data.	“rbf”
	C	[0.1, 1, 10]	Regularisation parameter controlling the trade-off between margin maximisation and classification error.	10
	Gamma	[0.01, 0.1, 1]	Kernel coefficient defining the influence of individual data points in nonlinear models.	0.1

ANN	Hidden layer sizes	[(50, 50), (100,), (100, 100)]	Number of neurons in each hidden layer.	(100, 100)
	Activation	[“relu”, “tanh”]	Nonlinear transformation function applied within layers.	“relu”
	Alpha	[0.0001, 0.001, 0.01]	L2 regularisation parameter to prevent overfitting.	0.0001

Random forest	Number of trees (n_estimators)	[100, 200, 500]	Number of decision trees in the ensemble.	500
Random forest	Maximum depth	[None, 10, 20]	Maximum depth of each decision tree to control overfitting.	20

5.3. Model Validation

In order to properly predict anomalies by using four models, it is important to validate the different models. The validation of each approach is done according to the method used.

5.3.1. Validation of OK

Figure 10 shows the correlogram between the true and estimated values on the left and the error histogram of the estimate on the right. The red bisector of the correlogram separates the true values from the estimated values. It can be seen that the correlation coefficient is 0.99 and the correlation cloud is tighter. This is one of the criteria to be taken into account to authenticate a good estimate because the closer the correlation is to 1 with a tighter cloud, the better the quality of the model. We can also visualize the histogram of the errors of the OK, where the average values of these errors have a major frequency of around 0 and the standard error is 0.66, showing that the estimation made is good.

5.3.2. RF Validation

The three graphs in Figure 11 show the relevance of the linear character that exists between the true values and the estimated values. These will allow you to validate the model by their behaviour. Our dataset allowed us to train and test the RF algorithm by the k-fold method in order to validate it. This graphical representation shows a good correlation with a value of R² of 0.91 between the anomalies predicted by this model and the real values.

Figure 12 represents the histograms of the errors of the data that have been trained, tested, and validated. On these different histograms of the trained, tested, and validated data, the modes of the errors are very close to 0, reflecting that the mean of the errors is also close to 0, and therefore, we have a good fit for the model.

5.3.3. Validation ANN

We can notice here a correlation R² of 0.96, translating to a good quality of the prediction made by the ANN algorithm approach. Some of the predicted data are linearly close to the real anomalies, showing the robustness of this model. Figure 13 shows the linear link between the predicted and the measured data using ANN validation.

When looking at the error histograms shown in Figure 14, it can be seen that there is a fluctuation in the mean error in each case of the histogram, reflecting an instability in the mean error but not far from 0.

5.3.4. SVM Validation

The correlation is 0.88, illustrating a fairly strong correlation between the predicted anomalies and the actual anomalies, so we can speak of a fairly good fit for the SVM model.

The histograms in Figure 15 show a very low average error for each of the data, which is very close to 0, illustrating the quality of the prediction of these anomalies, as the lower the error, the better the model.

Figure 16 shows the validation model using SVM. In order to compare the four different approaches used, several parameters were determined, including R², RMSE, MAE, and σ_e. These will allow us to identify the best performing model among the four. Table 3 gives the different values of the parameters calculated for each method used, including the variogram, RF, ANN, and SVM. We note R² of 0.99, which translates to the good quality of the OK because it is stronger compared to the other models; RMSE of 221.86; and σ_e of 0.66, which are lower, showing the best performance. In view of these different parameters, the OK method is the best for the prediction of gravity anomalies.

Table 3. Validation of prediction models.

	R²	RMSE	MAE	σ_e
Variogram model	0.99	221.86		0.66
RF	0.91	397.51	271.6	396.08
ANN	0.96	369.3023	291.5606	359.6318
SVM	0.88	564.8996	399.8502	559.1238

5.4. Predictive Maps

Once the prediction was made, each model was plotted on a map to better visualize the gravity anomalies. Figure 17 represents the prediction map obtained by the RF algorithm, and the estimated anomalies vary, respectively, between −6721.17 and −1121.87 mgal from blue to red. The strong anomalies are located to the east, and the weak ones are located to the west in relation to the north of this study area.

The gravity anomalies estimated by OK fluctuate between −6721.17 and −1121.87 mgal in contrast to those predicted by RF. A quick look at the results obtained by the RF allows one to observe that the tools have guessed wrong about the proper spatial variability of the body given the contrast of the measured gravimetric data. In the upper map, the data seems to be lumped per block or per area, which means that with this method applied to gravimetric data, we can at least guess the limitation of the bodies in the investigated field. The OK has the strong asset that it tends to minimize the variance of the predicting error, which means that all predictions are likely to be close to their expectation if we were working with a continuously complete series of gravimetric measurements. That way, one would better appreciate the spatial variability of the gravimetric information and then would be able to interpret that properly, depending on the geology of the investigated area.

The prediction by the ANN algorithm used as a model for anomaly mapping is shown in Figure 18 as a map. The predicted values vary from −7112.54 to −1666.83 mgal, totally different from the minimum and maximum of OK and RF. Indeed, because the OK tends to bring all the predictions to a mean, it is normal that the prediction from it will be tight in range compared to a brute method, such as ANN, which in this case, unlike the latter, runs a forecasting that uses a complex system of correlation between the measurements to provide the more realistic possible value for one prediction. Solely, the lack of information may be cruel for the ANN, which usually demands a lot of data to train the neural system. Despite the scarce amount of information, the ANN can still perform good prediction faster than the OK, and one may even observe that the spatial distribution of the gravimetric data seems to be quite close for both methods. It is also observed that the anomalies between −4389.68 and −3845.11 mgal are in the center, but the strong ones are on the extreme right, and the weak ones are on the extreme left.

The map obtained by SVM is shown in Figure 19 with the estimated values varying between −7058.71 and −1028.55 mgal. It can be seen that these predicted values are closer to the ANN model than to the OK and RF models.

Figure 20 shows the prediction map obtained by OK. Finally, it is obvious that because of the spatial correlation embedded in the OK, the prediction can offer an appropriate observational product with a good spatial resolution to afford good interpretation from geophysicists as well as from geologists. However, if looking for quick observational products, or when the database starts growing fast, the use of the OK becomes less appropriate to fulfill the demand.

6. Conclusion and Future Work

The objective of this work was to provide information on machine learning methods for gravity mapping using RF, SVM, and ANN methods in the Yagoua region (Northern Cameroon). The mathematical foundations of the different methods were examined and compared in order to provide support for a reliable variographic analysis for interpolation issues. The choice of these methods allowed us to verify that the existing data was the same when modelling. Finally, it is clear that due to the integrated spatial correlation, OK seems to be good at prediction. Despite the limited information, ANNs can make good predictions faster than OK, and it can even be observed that the spatial distribution of the gravity data seems to be quite close for both methods. Prediction can provide a suitable observational product with good spatial resolution to allow good interpretation by geophysicists and geologists on future exploration projects. In addition, geostatistics gives an overview of the estimation from different interpolation techniques for locating, estimating, and simulating reserves.

Conflicts of Interest

The authors declare conflicts of interest.

Funding

This work has not received any funding.

Open Research

Data Availability Statement

The data used to write this article is confidential. It may be made available if necessary.

References

1 Kirkwood C., Cave M., Beamish D., Grebby S., and Ferreira A., A machine learning approach to geochemical mapping, Journal of Geochemical Exploration. (2016) 167, 49–61, https://doi.org/10.1016/j.gexplo.2016.05.003, 2-s2.0-84969988943.
10.1016/j.gexplo.2016.05.003
CAS Web of Science® Google Scholar
2 Zuo R., Machine learning of mineralization-related geochemical anomalies: a review of potential methods, Natural Resources Research. (2017) 26, no. 4, 457–464, https://doi.org/10.1007/s11053-017-9345-4, 2-s2.0-85019550384.
10.1007/s11053-017-9345-4
CAS Web of Science® Google Scholar
3 Cracknell M. J., Machine learning for geological mapping: algorithms and applications, 2014, (PhD Thesis), University of Tasmania.
Google Scholar
4 Harvey A. S. and Fotopoulos G., Geological mapping using machine learning algorithms, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. (2016) 41, 423–430.
10.5194/isprs-archives-XLI-B8-423-2016
Google Scholar
5 De la Varga M. and Wellmann F., Probabilistic machine learning in structural geology, 2020, https://doi.org/10.5194/egusphere-egu2020-10785.
10.5194/egusphere-egu2020-10785
Google Scholar
6 Marjanovic M., Bajat B., and Kovacevic M., Landslide susceptibility assessment with machine learning algorithms, 2009 International Conference on Intelligent Networking and Collaborative Systems, November 2009, Barcelona, Spain, 273–278, https://doi.org/10.1109/INCOS.2009.25, 2-s2.0-77649277889.
10.1109/INCOS.2009.25
Google Scholar
7 Dumakor-Dupey N. K. and Arya S., Machine learning- a review of applications in mineral resource estimation, Energies. (2021) 14, no. 14, https://doi.org/10.3390/en14144079.
10.3390/en14144079
Web of Science® Google Scholar
8 Dutta S., Bandopadhyay S., Ganguli R., and Misra D., Machine learning algorithms and their application to ore reserve estimation of sparse and imprecise data, Journal of Intelligent Learning Systems and Applications. (2010) 2, no. 2, 86–96, https://doi.org/10.4236/jilsa.2010.22012.
10.4236/jilsa.2010.22012
Google Scholar
9 Abdelgayed T. S., Morsi W. G., and Sidhu T. S., Fault detection and classification based on co-training of semisupervised machine learning, IEEE Transactions on Industrial Electronics. (2018) 65, no. 2, 1595–1605, https://doi.org/10.1109/TIE.2017.2726961, 2-s2.0-85028828056.
10.1109/TIE.2017.2726961
Web of Science® Google Scholar
10 Amruthnath N. and Gupta T., A research study on unsupervised machine learning algorithms for early fault detection in predictive maintenance, 2018 5th International Conference on Industrial Engineering and Applications (ICIEA), April 2018, Singapore, 355–361, https://doi.org/10.1109/IEA.2018.8387124, 2-s2.0-85048370084.
10.1109/IEA.2018.8387124
Google Scholar
11 Danilovskiy K., Loginov G., and Nechaev O., Automatic geoelectric boundaries detection on the resistivity images based on 3D numerical simulation and convolutional neural network, Saint Petersburg 2020, 2020, European Association of Geoscientists & Engineers, Saint Petersburg, Russia, 1–5, https://doi.org/10.3997/2214-4609.202053015.
10.3997/2214-4609.202053015
Google Scholar
12 Wang K., Huang Q., and Wu S., Application of long short-term memory neural network in geoelectric field data processing, Chinese Journal of Geophysics. (2020) 63, no. 8, 3015–3024, https://doi.org/10.6038/cjg2020O0119.
10.6038/cjg2020O0119
Web of Science® Google Scholar
13 Reading A. M., Cracknell M. J., Bombardieri D. J., and Chalke T., Combining machine learning and geophysical inversion for applied geophysics, ASEG Extended Abstracts. (2015) 2015, no. 1, 1–5, https://doi.org/10.1071/ASEG2015ab070.
10.1071/ASEG2015ab070
Google Scholar
14 Adler A., Araya-Polo M., and Poggio T., Deep learning for seismic inverse problems: toward the acceleration of geophysical analysis workflows, IEEE Signal Processing Magazine. (2021) 38, no. 2, 89–119, https://doi.org/10.1109/MSP.2020.3037429.
10.1109/MSP.2020.3037429
Web of Science® Google Scholar
15 Huang L., Dong X., and Clee T. E., A scalable deep learning platform for identifying geologic features from seismic attributes, The Leading Edge. (2017) 36, no. 3, 249–256, https://doi.org/10.1190/tle36030249.1, 2-s2.0-85017184709.
10.1190/tle36030249.1
Google Scholar
16 Chen J., Schiek-Stewart C., Lu L., Witte S., Eres Guardia K., Menapace F., Devarakota P., and Sidahmed M., Machine learning method to determine salt structures from gravity data, SPE Annual Technical Conference and Exhibition, October 2020, https://doi.org/10.2118/201424-MS.
10.2118/201424-MS
Google Scholar
17 Zhang S., Yin C., Cao X., Sun S., Liu Y., and Ren X., DecNet: decomposition network for 3D gravity inversion, Geophysics. (2022) 87, no. 5, G103–G114, https://doi.org/10.1190/geo2021-0744.1.
10.1190/geo2021-0744.1
Web of Science® Google Scholar
18 Kuhn S., Cracknell M. J., and Reading A. M., Lithologic mapping using random forests applied to geophysical and remote-sensing data: a demonstration study from the Eastern Goldfields of Australia, Geophysics. (2018) 83, no. 4, B183–B193, https://doi.org/10.1190/geo2017-0590.1, 2-s2.0-85047934277.
10.1190/geo2017-0590.1
Web of Science® Google Scholar
19 Bouba A., Njeudjang K., Yap L., Saidou B., Kamguia J., and Tabod T. C., Interpretation of locally high gravity anomalies using terrestrial gravity data in Bagodo, North Cameroon, Earth and Planetary Physics. (2022) 6, no. 4, 378–384, https://doi.org/10.26464/epp2022033.
10.26464/epp2022033
Web of Science® Google Scholar
20 Eyike A. and Ebbing J., Lithospheric structure of the West and Central African rift system from regional three-dimensional gravity modelling, South African Journal of Geology. (2015) 118, no. 3, 285–298, https://doi.org/10.2113/gssajg.118.3.285, 2-s2.0-84959255669.
10.2113/gssajg.118.3.285
Web of Science® Google Scholar
21 Mkoumbe E., Estelle Eric F. T. M., Albert E. Y., Philippe N. N., and Tabod T. C., Depositional and structural styles in the Logone Birni Basin (LBB), northern Cameroon, from 3D potential field modeling: preliminary results, Open Journal of Geology. (2019) 9, no. 4, 226–244, https://doi.org/10.4236/ojg.2019.94016.
10.4236/ojg.2019.94016
CAS Google Scholar
22 Mouzong Pemi M., Kamguia J., Nguiya S., and Manguelle-Dicoum E., Depth and lineament maps derived from North Cameroon gravity data computed by artificial neural network, International Journal of Geophysics. (2018) 2018, 13, https://doi.org/10.1155/2018/1298087, 2-s2.0-85050186857, 1298087.
10.1155/2018/1298087
Google Scholar
23 Njeudjang K., Abate Essi J. M., Kana J. D., Teikeu W. A., Njandjock Nouck P., Djongyang N., and Tchinda R., Gravity investigation of the Cameroon Volcanic Line in Adamawa region: geothermal features and structural control, Journal of African Earth Sciences. (2020) 165, 103809, https://doi.org/10.1016/j.jafrearsci.2020.103809.
10.1016/j.jafrearsci.2020.103809
Web of Science® Google Scholar
24 Valentin O., Philippe N. N., Dieudonné B., Diab D. A., and Eliézer M.-D., A geostatistical approach to map gravity data over Logone-Birni sementary basin (Chad-Cameroon), European Journal of Scientific Research. (2012) 93, no. 2, 183–189.
Google Scholar
25 Nouck P. N., Kenfack C., Diab A. D., Njeudjang K., Meli L. J., and Kamseu R., A geostatistical re-interpretation of gravity surveys in the Yagoua, Cameroon region, Geofísica Internacional. (2013) 52, no. 4, 365–373, https://doi.org/10.1016/S0016-7169(13)71483-1, 2-s2.0-84884948368.
10.1016/S0016-7169(13)71483-1
Web of Science® Google Scholar
26 Kopejkin S. M., Relativistic manifestations of gravitational fields in gravimetry and geodesy, Manuscripta Geodaetica. (1991) 16, no. 5, 301–312, https://doi.org/10.1007/BF03655420.
10.1007/BF03655420
Google Scholar
27 Nelson W. and Hahn G. J., Linear estimation of a regression relationship from censored data part i—simple methods and their application, Technometrics. (1972) 14, no. 2, 247–269, https://doi.org/10.1080/00401706.1972.10488912, 2-s2.0-0015340868.
10.1080/00401706.1972.10488912
Web of Science® Google Scholar
28 Emery X., Géostatistique linéaire, 2001, Éc. Natl. Supér. Mines Paris Cent. Géostatistique.
Google Scholar
29 Chiles J.-P. and Delfiner P., Geostatistics: Modeling Spatial Uncertainty, 2009, John Wiley & Sons.
Google Scholar
30 Joseph V. R., Limit kriging, Technometrics. (2006) 48, no. 4, 458–466, https://doi.org/10.1198/004017006000000011, 2-s2.0-33845246400.
10.1198/004017006000000011
Web of Science® Google Scholar
31 Cutler A., Cutler D. R., and Stevens J. R., C. Zhang and Y. Ma, Random forests, Ensemble Machine Learning, 2012, Springer US, Boston, MA, 157–175, https://doi.org/10.1007/978-1-4419-9326-7_5.
10.1007/978-1-4419-9326-7_5
Google Scholar
32 Breiman L., Random forests, Machine Learning. (2001) 45, no. 1, 5–32, https://doi.org/10.1023/A:1010933404324, 2-s2.0-0035478854.
10.1023/A:1010933404324
Web of Science® Google Scholar
33 Freund Y. and Schapire R. E., A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences. (1997) 55, no. 1, 119–139, https://doi.org/10.1006/jcss.1997.1504, 2-s2.0-0031211090.
10.1006/jcss.1997.1504
Web of Science® Google Scholar
34 Zhang H., Inconsistent estimation and asymptotically equal interpolations in model-based geostatistics, Journal of the American Statistical Association. (2004) 99, no. 465, 250–261, https://doi.org/10.1198/016214504000000241, 2-s2.0-2142734871.
10.1198/016214504000000241
Web of Science® Google Scholar
35 Wild Ali A. B., Prediction of employee turn over using random forest classifier with intensive optimized PCA algorithm, Wireless Personal Communications. (2021) 119, no. 4, 3365–3382, https://doi.org/10.1007/s11277-021-08408-0.
10.1007/s11277-021-08408-0
Web of Science® Google Scholar
36 Kapageridis I. K., Application of artificial neural network systems to grade estimation from exploration data, 1999, (PhD Thesis), University of Nottingham.
Google Scholar
37 Monjezi M., Ahmadi Z., Varjani A. Y., and Khandelwal M., Backbreak prediction in the Chadormalu iron mine using artificial neural network, Neural Computing and Applications. (2013) 23, no. 3-4, 1101–1107, https://doi.org/10.1007/s00521-012-1038-7, 2-s2.0-84884590729.
10.1007/s00521-012-1038-7
Web of Science® Google Scholar
38 Srivastava T., How does artificial neural network (ANN) algorithm work? Simplified!, 2014, Analytics Vidhya.
Google Scholar
39 Boser B. E., Guyon I. M., and Vapnik V. N., A training algorithm for optimal margin classifiers, COLT ‘92: Proceedings of the fifth annual workshop on Computational learning theory, July 1992, 144–152, https://doi.org/10.1145/130385.130401.
10.1145/130385.130401
Google Scholar
40 Cortes C. and Vapnik V., Support-vector networks, Machine Learning. (1995) 20, no. 3, 273–297.
10.1007/BF00994018
Web of Science® Google Scholar
41 Wong K. W., Fung C. C., Ong Y. S., and Gedeon T. D., Reservoir characterization using support vector machines, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’06), November 2005, Vienna, 354–359, https://doi.org/10.1109/CIMCA.2005.1631494.
10.1109/CIMCA.2005.1631494
Google Scholar
42 Vabalas A., Gowen E., Poliakoff E., and Casson A. J., Machine learning algorithm validation with a limited sample size, PLoS One. (2019) 14, no. 11, e0224365, https://doi.org/10.1371/journal.pone.0224365, 31697686.
10.1371/journal.pone.0224365
CAS PubMed Web of Science® Google Scholar
43 Boroh A. W., Lawou S. K., Mfenjou M. L., and Ngounouno I., Comparison of geostatistical and machine learning models for predicting geochemical concentration of iron: case of the Nkout iron deposit (south Cameroon), Journal of African Earth Sciences. (2022) 195, article 104662, https://doi.org/10.1016/j.jafrearsci.2022.104662.
10.1016/j.jafrearsci.2022.104662
Google Scholar

All articles

Comparison of Machine Learning Methods and Ordinary Kriging for Gravimetric Mapping: Application to Yagoua Area (Northern Cameroon)

Abstract

1. Introduction