Volume 3, Issue 4 e1139
RESEARCH ARTICLE

Proposition of a new index for projection pursuit in the multiple factor analysis

Paulo César Ossani, Mariana Figueira Ramos, Marcelo Ângelo Cirillo

Department of Statistics (DES), Universidade Federal de Lavras (UFLA), Lavras, Brazil

Correspondence: Paulo César Ossani, Department of Statistics (DES), Universidade Federal de Lavras (UFLA), Campus Universitário, Lavras-MG 37200-000, Brazil. Email: [email protected]
First published: 02 December 2020

Abstract

This study proposes a new projection pursuit index for reducing the dimensions of groups of variables in multiple factor analysis. Its main advantage over other indexes is that the methodological procedure preserves the variance and covariance structures when performing the singular value decomposition, when the index is used to compare groups of variables. Among other contributions, the study presents a modification of the grand tour algorithm with simulated annealing, adapting it to deal with groups of variables. The proposed index was assessed through Monte Carlo simulations in several scenarios, with configurations of the following factors: degree of correlation between the variables, number of groups, and degree of heterogeneity among the groups of variables. The proposed index was compared with thirteen indexes known in the literature. The proposed index proved efficient in reducing data for use in multiple factor analysis. It is recommended for situations in which the groups exhibit low or high heterogeneity and a strong degree of correlation between the variables ($\rho = 0.9$). In general terms, the indexes are affected by an increase in the number of groups, depending on the scenario assessed.

1 INTRODUCTION

In high-dimensional spaces, samples become sparse and dissimilar to one another. This is a problem for dimensionality reduction methods based on singular value decomposition, such as principal component analysis, because the information carried by the first components is diluted across the remaining ones. It is therefore advisable to reduce the dimensions without losing the information contained in the original dimension. In this context, the projection pursuit technique emerged, suggested by Kruskal1 and implemented by Friedman and Tukey.2

In summary, projection pursuit searches for low-dimensional linear projections of high-dimensional data structures. To that end, a projection index $I(u, v)$ is an objective function that quantifies the degree of "interest" of a projection onto the plane spanned by the orthogonal vectors u and v. A numerical optimization procedure is used to find the plane that maximizes this index. The problem consists in choosing the index that best represents the degree of "interest" of the projection.

It is noteworthy that projection pursuit has been implemented in several applications, namely: supervised exploratory data classification,3 robust principal component analysis,4 and independent component analysis.5 Regarding multiple factor analysis (MFA), there are no reports in the literature on the feasibility of applying this technique, or on propositions of new indexes that require less computational effort while yielding promising results.

When the multiple factor analysis technique is applied to groups of quantitative variables in high dimension, the singular value decomposition used to find the eigenvalues and eigenvectors typically loses explanatory power in the leading components, since the explanation is diluted across the large number of components found.

This study seeks to overcome this deficiency by reducing the data dimensions while preserving the variance and covariance structure used in the decomposition, thereby ensuring greater explanation in the first components and greater reliability when applying multiple factor analysis to high-dimensional quantitative data.

The present study proposes a new index to be used in projection pursuit applied to MFA, considering groups of quantitative variables. The article is organized as follows: Section 2—Projection pursuit and projection indexes; Section 3—Methods; Section 4—Results and discussions; Section 5—Example of application; Section 6—Conclusions; and Appendices.

2 PROJECTION PURSUIT AND PROJECTION INDEXES

The understanding of projection pursuit begins with a set of multivariate observations given by $X = (X_1, X_2, \ldots, X_p)$, each component being defined by $X_i = (x_{1i}, x_{2i}, \ldots, x_{ni})$, for i = 1, ..., p. Mathematically, the projection pursuit method seeks a linear transformation $T : \mathbb{R}^p \to \mathbb{R}^d$, with d < p and T = XA, where A is $p \times d$. This way, T is the linear projection of $X_{n \times p}$, and A is the projection matrix, whose columns form the bases of the projection space.

We look for linear projections whose bases are orthonormal, that is, $A^T A = I_d$, with $I_d$ being the d × d identity matrix. This constraint ensures that each dimension of the projection space captures different aspects of the data.6-8

When considering PI (projection index), a function that measures the degree of "interest" of the projection T, the projection pursuit method becomes the optimization problem (1) of finding a matrix A that maximizes PI:7, 8

$$\tilde{A} = \arg\max_{A} \{PI(XA)\}, \quad \text{with } A^T A = I_d. \quad (1)$$

According to Cook et al.,9 it is common to sphere the data before starting the projection pursuit; the sphered data are obtained using Equation (2), removing the influence of location and scale from the search for structured projections. This procedure is necessary for indexes that measure the departure of the projected data density from a standard normal density, since differences in location and scale may dominate the other structural differences.

Using the spectral decomposition of $\Sigma$, the covariance matrix of the elements of X, the spherical data are obtained using the following equation:

$$Z_i = \Lambda^{-1/2} P^T (X_i - \bar{X}_i), \quad i = 1, \ldots, p, \quad (2)$$

where $\Lambda$ is the diagonal matrix of eigenvalues and P is the matrix of eigenvectors of $\Sigma$. It should be noted that, for the calculation of the ith spherical vector, the data are centered on the means of the respective variables.10
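As an illustration, the sphering in Equation (2) can be sketched in a few lines of R, assuming $\Sigma$ is estimated by the sample covariance matrix (the function name sphere is ours):

  sphere <- function(X) {
    Xc <- scale(X, center = TRUE, scale = FALSE)    # center on the column means
    e  <- eigen(cov(X), symmetric = TRUE)           # spectral decomposition of Sigma
    Xc %*% e$vectors %*% diag(1 / sqrt(e$values))   # rows: Lambda^(-1/2) P^T (x_i - mean)
  }
  # cov(sphere(X)) is approximately the identity matrix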

2.1 Notation

  • $z_i$ is the ith observation of the sphered data of the data matrix $X_{n \times p}$.
  • $\alpha$ and $\beta$ are p-dimensional orthonormal vectors ($\alpha^T \alpha = 1 = \beta^T \beta$ and $\alpha^T \beta = 0$) of the projection plane.
  • $(z_\alpha, z_\beta)$ are the sphered observations projected onto the vectors $\alpha$ and $\beta$.

Most low-dimensional projections are approximately normal.6, 11 The main indexes appropriate for reduction to dimension d ≥ 1 are described in Tables A1 and A2 (Appendix A).

The optimization of Equation (1) is done by numerical methods, traditionally based on the gradient6, 7 or on the Newton-Raphson method;2, 12 however, these are not appropriate for scenarios above three dimensions.8

Other global optimizers have been proposed, for example: TRIVES,13 random scanning algorithm,14 simulated annealing,3 genetic algorithm,15 and particle swarm optimization.16

3 METHODS

In accordance with the proposed objectives of the present study, the methodology used consisted of the following steps: (i) Generation of the samples for the application of MFA (Section 3.1); and (ii) proposed index for use with MFA (Section 3.2).

3.1 Generation of samples for the application of multiple factor analysis

In order to assess the performance of the index proposed in Section 3.2, we considered several scenarios that exhibited different degrees of correlation between the variables belonging to each group, and heterogeneity between the groups. To that end, we performed the procedure proposed by Cirillo et al.,17 in which the samples were generated from normal dependent and heterogeneous populations.

The multivariate observation was specified by the vector $\breve{X}$, in which each component is given by $\breve{X}_j = (X_{j1}, \ldots, X_{jp})^T$ for j = 1, ..., k, with k the total number of groups and p the number of variables, considering the autoregressive correlation structure of order 1, AR(1), defined in (3). Each block delimited by dashed lines corresponds to the correlation structure of the samples generated by a multivariate normal distribution with global correlation structure $R_b$:

$$R_b = \left[\rho^{|i - i'|}\right]_{i, i' = 1, \ldots, pk}, \quad (3)$$

partitioned into k × k blocks of dimension p × p.
The global covariance matrix was obtained through (4):

$$\Sigma = D^{1/2} R_b D^{1/2}, \quad (4)$$

in which $D^{1/2}$ is a diagonal matrix with the standard deviations of the variables. With these specifications, the multivariate samples from dependent normal populations were generated by $\breve{X} \sim N_{pk}(\breve{\mu}_{pk}, \Sigma)$, taking the parametric values defined in (5) and (6):

$$\breve{\mu}_{pk \times 1} = (0, \ldots, 0)^T, \quad (5)$$

$$\Sigma = \begin{bmatrix} \Sigma_{11} & \cdots & \Sigma_{1k} \\ \vdots & \ddots & \vdots \\ \Sigma_{k1} & \cdots & \Sigma_{kk} \end{bmatrix}. \quad (6)$$

The covariances represented in (6) are non-zero, and each block on the diagonal, indexed by j = 1, ..., k, is the covariance matrix of the jth group of variables.

The heterogeneity between the covariance matrices in the simulation process was determined through a degree of heterogeneity δ, specified in (7), following the algorithm proposed by Cirillo et al.17 as follows:

  1. A sample of the multivariate normal distribution $N_{pk}(\breve{\mu}_{pk}, \Sigma)$ is simulated and a matrix $Y_{n \times pk}$ is obtained (Table 1). Each block of p columns (variables) corresponds to the jth group; in this way, the multivariate sampling units are arranged in n lines.
  2. The observations of group 1 (j = 1) are not altered. The p variables of the jth group (j > 1) are multiplied by $d_j$, defined by

$$d_j = \left[1 + \frac{(j-1)(\delta-1)}{k-1}\right]^{1/p}, \quad (7)$$

    δ being the specified degree of heterogeneity among the covariance matrices. After completing this procedure, $\Sigma_n$ was adopted as the parameter of the global covariance matrix, defined by

$$\Sigma_n = \frac{1}{n} \sum_{j=1}^{n} (Y_j - \bar{Y})(Y_j - \bar{Y})^T = \frac{1}{n}\, Y^T Q Y, \quad (8)$$

    with Q being a centering projection matrix.
TABLE 1. Layout of the matrix $Y_{n \times pk}$ used in the determination of the covariance matrix parameter under heterogeneity ($\delta > 1$)

Obs.   Group 1            Group 2            ...   Group j   ...   Group k
1      y111 ... y1p1      y112 ... y1p2      ...             ...   y11k ... y1pk
2      y211 ... y2p1      y212 ... y2p2      ...             ...   y21k ... y2pk
...
n      yn11 ... ynp1      yn12 ... ynp2      ...             ...   yn1k ... ynpk
After defining the covariance matrix parameters $\Sigma$ and $\Sigma_n$ through the heterogeneity values among the covariance matrices, specified as δ = 2 and 8, we performed Monte Carlo simulations in which the multivariate sampling observations were generated. The matrix of data samples used in MFA was composed of the n generated vectors, as in (9):

$$X = \begin{bmatrix} \breve{X}_1^T \\ \vdots \\ \breve{X}_n^T \end{bmatrix}; \quad (9)$$

then, fixing p = 10 and n = 150, the scenarios considered in the simulation procedure were defined in Table 2.
TABLE 2. Scenarios for the generation of the multivariate normal samples to be used in multiple factor analysis

Heterogeneity among covariance matrices (δ)   Number of groups (k)   Degree of correlation between variables (ρ)
2                                             7                      0.2, 0.5, 0.9
2                                             10                     0.2, 0.5, 0.9
8                                             7                      0.2, 0.5, 0.9
8                                             10                     0.2, 0.5, 0.9

3.2 Proposed index for use with MFA

The multivariate sample $X_{n \times m} = [X_{n \times p}^1 | \cdots | X_{n \times p}^j | \cdots | X_{n \times p}^k]$ was composed of k groups of p variables, with n observations in each group and m = pk. This way, the observation $x_{ipj}$, with i = 1, ..., n and j = 1, ..., k, is identified as the ith observation of the pth variable in the jth group, as suggested by the layout in Figure 1.

FIGURE 1. Layout of the data structure of multiple factor analysis
The application of projection pursuit to MFA consisted of reducing the dimension of each group $X_{n \times p}^j$ to a dimension d < p, with the possibility of d differing between groups. Following these specifications, the reduced multivariate sample was represented by $\tilde{X}_{n \times s} = [\tilde{X}_{n \times d}^1 | \cdots | \tilde{X}_{n \times d}^j | \cdots | \tilde{X}_{n \times d}^k]$, with $\tilde{x}_{idj}$ being the ith observation in the dth column of the jth group, and s = dk. The purpose was that, when applying MFA to the matrices $X_{n \times m}$ and $\tilde{X}_{n \times s}$, the results would be analogous with respect to the similarities between the groups, as suggested by Equation (10):

$$W_{jr} \cong \tilde{W}_{j\tilde{r}} \iff \lambda_r \times \sum_{p'=1}^{p} v_{p'rj}^2 \cong \tilde{\lambda}_{\tilde{r}} \times \sum_{d'=1}^{d} \tilde{v}_{d'\tilde{r}j}^2, \quad (10)$$

where $W_{jr}$ is the similarity index of the jth group in the rth component of the data with non-reduced dimensions, and $\tilde{W}_{j\tilde{r}}$ is the similarity index of the jth group in the $\tilde{r}$th component of the groups with reduced dimensions, with $r > \tilde{r}$, r and $\tilde{r}$ being the ranks of the matrices X and $\tilde{X}$, respectively. Given that $X = U \Lambda V^T$, it should be noted that the similarity indexes described in Equation (10) were initially defined using the squares of the elements of the eigenvector matrix V and the squares of the singular values, $\Lambda^2 = \mathrm{diag}(\lambda_r)$. This way, $v_{prj}^2$ represents the squared loading of the pth variable on the rth component of the jth group in V, and similarly $\tilde{v}_{d\tilde{r}j}^2$ in $\tilde{V}$.

It is worth noting that the approximate equality in (10) is due to the comparison of results with different dimensions. For example, considering the matrices $X_{n \times m}$ and $\tilde{X}_{n \times s}$, applying the MFA technique yields the respective eigenvector matrices V and $\tilde{V}$. The similarities between the groups are expressed in Tables 3 and 4, respectively.

TABLE 3. Similarity of the groups of variables in $X_{n \times m}$ by MFA

Comp.   Group 1                                           ...   Group j                                           ...   Group k
1       $\lambda_1 \times (v_{111}^2 + \cdots + v_{p11}^2)$   ...   $\lambda_1 \times (v_{11j}^2 + \cdots + v_{p1j}^2)$   ...
2       $\lambda_2 \times (v_{121}^2 + \cdots + v_{p21}^2)$   ...   $\lambda_2 \times (v_{12j}^2 + \cdots + v_{p2j}^2)$   ...
...
r       $\lambda_r \times (v_{1r1}^2 + \cdots + v_{pr1}^2)$   ...   $\lambda_r \times (v_{1rj}^2 + \cdots + v_{prj}^2)$   ...
TABLE 4. Similarity of the groups of variables in $\tilde{X}_{n \times s}$ by MFA

Comp.   Group 1                                                                   ...   Group j                                                                   ...   Group k
1       $\tilde{\lambda}_1 \times (\tilde{v}_{111}^2 + \cdots + \tilde{v}_{d11}^2)$   ...   $\tilde{\lambda}_1 \times (\tilde{v}_{11j}^2 + \cdots + \tilde{v}_{d1j}^2)$   ...
2       $\tilde{\lambda}_2 \times (\tilde{v}_{121}^2 + \cdots + \tilde{v}_{d21}^2)$   ...   $\tilde{\lambda}_2 \times (\tilde{v}_{12j}^2 + \cdots + \tilde{v}_{d2j}^2)$   ...
...
$\tilde{r}$   $\tilde{\lambda}_{\tilde{r}} \times (\tilde{v}_{1\tilde{r}1}^2 + \cdots + \tilde{v}_{d\tilde{r}1}^2)$   ...   $\tilde{\lambda}_{\tilde{r}} \times (\tilde{v}_{1\tilde{r}j}^2 + \cdots + \tilde{v}_{d\tilde{r}j}^2)$   ...
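As an illustration, the entries of Table 3 can be computed from the SVD $X = U \Lambda V^T$ in a few lines of R (a sketch; the argument groups, a list of the column indices of each group, is our device):

  group_similarity <- function(X, groups) {
    s <- svd(X)
    lambda <- s$d^2   # eigenvalues: the squared singular values
    # W[r, j] = lambda_r * sum of the squared loadings of group j on component r
    sapply(groups, function(idx) lambda * colSums(s$v[idx, , drop = FALSE]^2))
  }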
Following these specifications, we confirmed results 1 and 2 below, which can be generalized to any matrix; in this context, they justify the formalization of the proposed index.
  1. Considering $X_{n \times m}$ a matrix of rank r with singular value decomposition $X = U \Lambda V^T$, and $[x_{ij}^2]$ representing the square of each element of X, we have:

$$\sum_{j=1}^{m}\sum_{i=1}^{n} x_{ij}^2 = \sum_{j=1}^{m}\sum_{q=1}^{r}\sum_{i=1}^{n} u_{iq}^2 \times \lambda_{qq} \times v_{qj}^2 = \sum_{q=1}^{r}\sum_{i=1}^{n} u_{iq}^2 \times \lambda_{qq} = \sum_{j=1}^{m}\sum_{q=1}^{r} \lambda_{qq} \times v_{qj}^2 = \sum_{q=1}^{r} \lambda_q, \quad (11)$$

    with $[u_{iq}^2]$, $[\lambda_{qq}]$, and $[v_{qj}^2]$ being the squares of the elements of U, $\Lambda$, and $V^T$, respectively (Appendix B).
  2. If the matrix $X_{n \times m}$ is composed of k groups of variables, each group consisting of s > 1 variables, then $X_{n \times m} = [X_{n \times s}^1 | \cdots | X_{n \times s}^j | \cdots | X_{n \times s}^k]$, with $x_{nsj}$ representing the nth observation in the sth column of the jth group, and m = s × k. With $[x_{il}^2]$ representing the square of each element of X, then, for each group j in X, we have:

$$\sum_{l=1}^{s}\sum_{i=1}^{n} x_{ilj}^2 = \sum_{l=1}^{s}\sum_{q=1}^{r} \lambda_{qq} \times v_{qlj}^2, \quad (12)$$

    with $[\lambda_{qq}]$ being the squares of the elements of $\Lambda$, and $[v_{qlj}^2]$ the squares of the elements of the projections of the jth group of X in $V^T$.
In order to standardize variables with the same units of measure, it was necessary to perform a linear transformation, as is usually done in MFA, according to the procedure suggested by Bécue-Bertaut and Pagès,18 considering the data centered on the means, described in matrix form by (13):

$$C_{n \times p} = X_{n \times p} - J_{n \times p} \times \mathrm{diag}(\bar{X}_{1 \times p}), \quad (13)$$

with $J_{n \times p}$ being the all-ones matrix and $\bar{X}_{1 \times p}$ the row vector of the column means of $X_{n \times p}$. $C_{n \times p}$ is then normalized by dividing each element of a column by the square root of the sum of squares of that column, according to (14):

$$N_{n \times p} = C_{n \times p} \times \mathrm{diag}\left(1 \Big/ \sqrt{\mathbf{1}_{1 \times n} (C_{n \times p} \odot C_{n \times p})}\right), \quad (14)$$

with $\mathbf{1}_{1 \times n}$ being the all-ones row vector and $C_{n \times p} \odot C_{n \times p}$ the Hadamard product. Then, given the first eigenvalue $\lambda_1$ obtained from $N_{n \times p}$, the transformed data are:

$$S_{n \times p} = \frac{1}{\sqrt{\lambda_1}} \times N_{n \times p}; \quad (15)$$

therefore, the proposed index, called multiple factorial (MF), is defined by expression (16):

$$PI_{MF}(A) = \frac{1}{n} \sum_{i=1}^{n} \sum_{j=1}^{p} s_{ij}^2, \quad (16)$$

with $s_{ij}$ being the element in the ith line and jth column of $S = (S_1, S_2, \ldots, S_p)$, and p the number of random variables (p-dimensional), each component being represented by $S_p = (s_{1p}, s_{2p}, \ldots, s_{np})$.
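A minimal R sketch of the transformations (13) to (16) is given below; it assumes, as in the usual MFA weighting, that $\lambda_1$ in (15) is the first eigenvalue of N, so that the division uses its square root, the first singular value:

  pi_mf <- function(Tp) {                          # Tp: a projected group, n x d
    C <- scale(Tp, center = TRUE, scale = FALSE)   # Equation (13): centering
    N <- sweep(C, 2, sqrt(colSums(C^2)), "/")      # Equation (14): column normalization
    S <- N / svd(N)$d[1]                           # Equation (15): divide by sqrt(lambda_1)
    sum(S^2) / nrow(S)                             # Equation (16): PI_MF
  }

In the optimization of Section 3.3, pi_mf() would be evaluated on each projected group $T = X^j \times A$.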

The numerical optimization to find a plane that maximized the index (16) was obtained by a modification in the grand tour algorithm (Section 3.3) using the simulated annealing method proposed by Cook et al.,19 so that groups of variables could be considered in line with the MFA.

The validation of the proposed index (16) was performed by comparing the degree of agreement in validating the projection pursuit reduction to two dimensions for each group of variables. For this purpose, we performed 1,000 Monte Carlo simulations. Scripts were developed in the R software,20 using the grand tour alternated with a simulated annealing optimization adapted for MFA (Section 3.3), with the specifications of 5 × 10³ iterations, cooling = 0.95, eps = 1e−4, and half = 30, to compute the indexes mentioned in Section 2, according to the scenarios described in Table 2, using non-spherical data in the simulations, with reduction to two dimensions of the data of each group.

FIGURE 2. Results of the simulations with degree of heterogeneity δ = 2 between the k = 7 groups, and degrees of correlation ρ = 0.2, 0.5, and 0.9 between the variables within the groups

FIGURE 3. Results of the simulations with degree of heterogeneity δ = 8 between the k = 7 groups, and degrees of correlation ρ = 0.2, 0.5, and 0.9 between the variables within the groups

3.3 Projection pursuit algorithm for the MFA technique

Input: $X_{n \times pk} = [X_{n \times p}^1 | \cdots | X_{n \times p}^j | \cdots | X_{n \times p}^k]$, matrix with k groups of variables.

p: number of variables in group j.

n: number of observations.

Maxiter: maximum number of iterations.

Cooling: initial value of the cooling parameter, in the range (0, 1).

Half: number of steps without changing Cooling.

Eps: approximation accuracy for Cooling.

  for j = 1 : k do
    Generate a random initial projection base $A_a$ of dimension p × d, with d < p.
    Perform the linear transformation $T_0 = X^j \times A_a$ for the jth group.
    Calculate the search index of the initial projection, $PI_{MF_0}(T_0)$.
    Set h = 0 and i = 1 (in the simulations, Cooling = 0.95, Half = 30, and Eps = 1e−4).
    while (i < Maxiter and Cooling > Eps) do
      Generate a new random projection $A_i$.
      Form the perturbed projection $A_i = A_a + \text{Cooling} \times A_i$.
      Perform $A_z = \text{interpolation}(A_a, A_i)$, the interpolation from the projection $A_a$ to the projection $A_i$.
      Perform the linear transformation $T_i = X^j \times A_z$ for the jth group.
      Calculate $PI_{MF_i}(T_i)$.
      if $PI_{MF_i} > PI_{MF_0}$ then
        Set $A_a = A_z$ and $PI_{MF_0} = PI_{MF_i}$.
      else
        Set h = h + 1.
      end if
      if h = Half then
        Set Cooling = Cooling × 0.9 and h = 0.
      end if
      Set i = i + 1.
    end while
    Set $\tilde{X}_{n \times d}^j = T_i$, the matrix with the reduced dimension of the jth group.
  end for

  The new matrix with the k groups of variables with reduced dimensions is represented by $\tilde{X}_{n \times dk} = [\tilde{X}_{n \times d}^1 | \cdots | \tilde{X}_{n \times d}^j | \cdots | \tilde{X}_{n \times d}^k]$.

Output: The MFA technique is applied to the groups formed in $\tilde{X}_{n \times dk}$.
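For concreteness, a compact R sketch of this loop for a single group follows. It reuses the pi_mf() sketch from Section 3.2 and replaces the grand tour interpolation step by a simple re-orthonormalization via qr.Q(), which is our simplification, not the authors' interpolation:

  pp_mfa_group <- function(Xj, d = 2, maxiter = 5e3,
                           cooling = 0.95, half = 30, eps = 1e-4) {
    p   <- ncol(Xj)
    Aa  <- qr.Q(qr(matrix(rnorm(p * d), p, d)))     # random orthonormal start
    pi0 <- pi_mf(Xj %*% Aa)
    h <- 0; i <- 1
    while (i < maxiter && cooling > eps) {
      Ai  <- qr.Q(qr(matrix(rnorm(p * d), p, d)))   # new random projection
      Az  <- qr.Q(qr(Aa + cooling * Ai))            # perturbed, re-orthonormalized basis
      pii <- pi_mf(Xj %*% Az)
      if (pii > pi0) {                              # accept when the index improves
        Aa <- Az; pi0 <- pii
      } else {
        h <- h + 1
      }
      if (h == half) { cooling <- cooling * 0.9; h <- 0 }
      i <- i + 1
    }
    Xj %*% Aa   # reduced-dimension data for the group
  }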

4 RESULTS AND DISCUSSIONS

In accordance with the proposed objectives of the present study, the results shown in Figures 2-5 correspond to the reduced dimensions relating to the first principal component, which always carries the greatest explanation in MFA. The results are presented as follows: (i) evaluation of the proposed index with respect to the heterogeneity between the groups (Section 4.1); and (ii) evaluation of the proposed index with respect to the increase in the number of groups (Section 4.2).

FIGURE 4. Results of the simulations with degree of heterogeneity δ = 2 between the k = 10 groups, and degrees of correlation ρ = 0.2, 0.5, and 0.9 between the variables within the groups

FIGURE 5. Results of the simulations with degree of heterogeneity δ = 8 between the k = 10 groups, and degrees of correlation ρ = 0.2, 0.5, and 0.9 between the variables within the groups

4.1 Evaluation of the proposed index with respect to the heterogeneity between the groups

The results illustrated in Figure 2 indicate that the proposed index initially exhibited a high disagreement rate in comparison to the other indexes when a weak degree of correlation between the variables (ρ = 0.2) was considered. However, as the degree of correlation increased, this rate decreased; with a strong degree of correlation between the variables (ρ = 0.9), the proposed index showed low disagreement and promising agreement with the others. It is worth noting that the simulated data were non-spherical, a condition that usually undermines the applicability of projection indexes.

In view of the above, in order to validate the index under the same scenarios but with an increased degree of heterogeneity between the groups (δ = 8), the results illustrated in Figure 3 were compared with the low-heterogeneity condition (δ = 2) illustrated in Figure 2.

The results illustrated in Figure 3 essentially confirmed the same behavior of the indexes with respect to the degree of correlation between the variables and the degrees of agreement and disagreement between the indexes; virtually any difference results from the error oscillation of the Monte Carlo method. A result that should be highlighted, however, concerns the strong degree of correlation (ρ = 0.9). In this case, the proposed index exhibited a high degree of agreement when the samples were simulated with a low degree of heterogeneity (δ = 2), with a percentage close to 75%; when the heterogeneity was increased to δ = 8, the index showed considerable improvement, with a percentage of agreement close to 81%.

4.2 Evaluation of the proposed index with respect to the increase in the number of groups

Comparing with the results illustrated in Figure 2, in which low heterogeneity between the groups and k = 7 groups were considered, Figure 4 shows that, with the same configurations, increasing the number of groups (k = 10) left the degrees of agreement and disagreement of the proposed index approximately equal when ρ = 0.9. Therefore, there is statistical evidence that the index yields promising results when the variables are strongly correlated, regardless of the number of groups.

Considering high heterogeneity between the groups (δ = 8), the increase in the number of groups (k = 10), in general terms, did not affect the performance of the indexes. Figure 5 shows the same behavior of the indexes regarding the degree of correlation and the agreement rate in comparison with the competing indexes.

It should be emphasized that, in all simulated scenarios with weak (ρ = 0.2) and moderate (ρ = 0.5) correlation, the proposed index exhibited a reduction in the disagreement rate with respect to its competitors. In this context, there is evidence that, on average, all indexes are affected by the increase in the number of groups. Even so, these results do not prevent the application of the indexes evaluated in the present study, that is, of projection pursuit applied to MFA.

5 EXAMPLE OF APPLICATION

For didactic purposes, this section contains an example of the use of MFA, and an example of the use of MFA with dimension reduction by projection pursuit with the proposed index.

5.1 Use of the MFA technique

We simulated three groups of variables (Table 5). The data of each group were generated according to the methodology described in Section 3.1, using the parameters ρ = 0 . 9 and δ = 1 .

TABLE 5. Matrix $X_{6 \times 10}$ of simulated data of the groups

        Group 1 (V1-V4)              Group 2 (V5-V7)        Group 3 (V8-V10)
Obs.   V1     V2     V3     V4      V5      V6     V7      V8      V9     V10
1      1.39   2.44   3.06   4.35    1.75    2.13   2.88    1.90    3.08   3.58
2      0.98   1.85   3.02   3.76    -0.42   0.19   1.38    1.58    2.05   3.76
3      2.38   3.56   4.37   5.40    0.03    1.27   1.76    -0.06   1.21   2.21
4      1.51   2.41   3.68   5.31    0.33    1.02   2.36    0.62    2.40   3.58
5      0.53   1.69   1.88   4.24    1.01    2.41   2.41    1.73    3.10   3.66
6      1.40   2.02   3.40   3.58    1.03    2.56   3.16    1.24    1.89   2.77

The explanations of the principal components described in Table 6 were obtained by applying the MFA technique,18 with scripts elaborated in the R software using the MVar package.21

TABLE 6. Explanation of eigenvalues with respect to principal components
Components Eigenvalues % of variance % accumulated variance
1 2.0010 57.79 57.79
2 0.9087 26.24 84.03
3 0.3335 9.63 93.66
4 0.1294 3.74 97.39
5 0.0902 2.61 100.00
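For readers without the MVar package, the eigenvalues in Table 6 can be approximated with a base-R sketch of the group weighting described in Section 3.2 (the function name is ours, and the group partition V1-V4, V5-V7, V8-V10 follows Tables 5 and 8):

  mfa_eigen <- function(X, groups) {
    blocks <- lapply(groups, function(idx) {
      C <- scale(X[, idx], center = TRUE, scale = FALSE)   # center the group
      N <- sweep(C, 2, sqrt(colSums(C^2)), "/")            # normalize its columns
      N / svd(N)$d[1]                                      # weight by the first singular value
    })
    svd(do.call(cbind, blocks))$d^2                        # eigenvalues of the global analysis
  }
  # ev <- mfa_eigen(X, list(1:4, 5:7, 8:10))
  # round(100 * ev / sum(ev), 2)   # percentages of variance, as in Table 6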

The inertias of the groups of variables are described in Table 7.

TABLE 7. Inertia of groups in each principal component
Group Comp. 1 Comp. 2 Comp. 3 Comp. 4 Comp. 5
1 0.8240 0.0915 0.1803 0.0626 0.0298
2 0.3052 0.6991 0.0191 0.0407 0.0340
3 0.8718 0.1181 0.1341 0.0260 0.0264
$\lambda_r$   2.0010   0.9087   0.3335   0.1294   0.0902

With respect to the first principal component, Table 7 shows that groups 1 and 3 exhibited strong similarity (0.8240 and 0.8718, respectively) and that group 2 differed from the others. Regarding the second component, groups 1 and 3 showed weak similarity, while group 2 preserved its originality. Analyses could have been performed with the other components; however, since the first component already provides the greatest explanation (57.79%), no explanation beyond that given by the first principal component was necessary.

From the inertias obtained for the groups (Table 7), and aiming at a better interpretation, a chart of inertias (Figure 6) was elaborated, showing a strong relationship between groups 1 and 3 with respect to the first component, thus confirming the originality of group 2.

FIGURE 6. Chart of the inertias of the groups

5.2 Use of the MFA technique with dimension reduction performing projection pursuit with the proposed index

We used the matrix $X_{6 \times 10}$ illustrated in Table 5 of Section 5.1. We obtained the projection matrices for each group, with their projection indexes (Table 8), by applying projection pursuit with the proposed index of Equation (16) and the algorithm of Section 3.3.

TABLE 8. Projection matrix of each group $X^j$ in $X_{6 \times 10}$

                    Group 1                 Group 2                 Group 3
Projection          Vector 1   Vector 2     Vector 1   Vector 2     Vector 1   Vector 2
1                   -0.2438    0.9151       -0.7770    -0.0341      0.5207     0.3659
2                   0.6822     0.0018       -0.5748    -0.3685      -0.0633    0.9175
3                   0.6545     0.2296       0.2565     -0.9289      0.8513     -0.1555
4                   0.2160     0.3312
Projection index    0.16686                 0.17255                 0.17860

Figure 7 shows the convergence of the new index for each group of variables in the numerical optimizations.

FIGURE 7. Chart of the convergences of the proposed index for each group

When we applied the projection matrices to the groups in $X_{6 \times 10}$, we obtained the data shown in Table 9, which represent the data reduced to two dimensions for each group $X^j$, so that the matrix $X_{6 \times 10}$ is represented by the new matrix $\tilde{X}_{6 \times 6}$.

TABLE 9. Matrix $\tilde{X}_{6 \times 6}$ of the data with the reduced dimensions of $X_{6 \times 10}$

        Group 1                        Group 2                        Group 3
Obs.   Projection 1   Projection 2   Projection 1   Projection 2   Projection 1   Projection 2
1      4.2683         3.4201         -1.8451        -3.5201        3.8420         2.9645
2      3.8121         2.8392         0.5712         -1.3376        3.8939         1.8744
3      5.8753         4.9768         -0.3017        -2.1040        1.7735         0.7445
4      4.8318         3.9902         -0.2371        -2.5795        3.2185         1.8722
5      3.1702         2.3242         -1.5517        -3.1615        3.8203         2.9082
6      4.0356         3.2515         -1.4610        -3.9142        2.8841         1.7571
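The reduction step can be verified directly: multiplying each group's block of Table 5 by its projection matrix from Table 8 reproduces Table 9 up to the rounding of the printed coefficients. A self-contained R check:

  X <- matrix(c(1.39, 2.44, 3.06, 4.35,  1.75, 2.13, 2.88,  1.90, 3.08, 3.58,
                0.98, 1.85, 3.02, 3.76, -0.42, 0.19, 1.38,  1.58, 2.05, 3.76,
                2.38, 3.56, 4.37, 5.40,  0.03, 1.27, 1.76, -0.06, 1.21, 2.21,
                1.51, 2.41, 3.68, 5.31,  0.33, 1.02, 2.36,  0.62, 2.40, 3.58,
                0.53, 1.69, 1.88, 4.24,  1.01, 2.41, 2.41,  1.73, 3.10, 3.66,
                1.40, 2.02, 3.40, 3.58,  1.03, 2.56, 3.16,  1.24, 1.89, 2.77),
              nrow = 6, byrow = TRUE)                      # Table 5
  A1 <- matrix(c(-0.2438, 0.6822, 0.6545, 0.2160,          # Table 8, group 1
                  0.9151, 0.0018, 0.2296, 0.3312), ncol = 2)
  A2 <- matrix(c(-0.7770, -0.5748,  0.2565,                # Table 8, group 2
                 -0.0341, -0.3685, -0.9289), ncol = 2)
  A3 <- matrix(c( 0.5207, -0.0633,  0.8513,                # Table 8, group 3
                  0.3659,  0.9175, -0.1555), ncol = 2)
  X_tilde <- cbind(X[, 1:4] %*% A1, X[, 5:7] %*% A2, X[, 8:10] %*% A3)
  round(X_tilde, 4)   # matches Table 9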

We applied the MFA technique to the new groups of variables (Table 9). The explanations of the principal components are given in Table 10.

TABLE 10. Explanation of eigenvalues with respect to the principal components
Components Eigenvalues % of variance % accumulated variance
1 2.0306 65.33 65.33
2 0.8681 27.93 93.26
3 0.1678 5.40 98.66
4 0.0382 1.23 99.89
5 0.0034 0.11 100.00

The inertias of groups of variables with reduced dimensions are described in Table 11.

TABLE 11. Inertia of groups in each principal component
Group Comp. 1 Comp. 2 Comp. 3 Comp. 4 Comp. 5
1 0.8278 0.0956 0.0768 0.0009 0.0001
2 0.3300 0.6705 0.0065 0.0273 0.0010
3 0.8727 0.1021 0.0846 0.0100 0.0023
$\lambda_r$   2.0306   0.8681   0.1678   0.0382   0.0034

The same observations made for Table 7, which deals with the similarities of the groups of variables with original dimensions, apply to Table 11: regarding the first principal component, groups 1 and 3 exhibited strong similarity, and group 2 differed from the others.

Aiming at a better interpretation, the chart of inertias illustrated in Figure 8 was elaborated from the inertias of the groups shown in Table 11. There was a strong relationship between groups 1 and 3, again confirming the originality of group 2.

FIGURE 8. Chart of the inertias of the groups with reduced dimensions

As can be observed, the similarities in the data with reduced dimensions produced the same results as the data with original dimensions. This fact demonstrates the efficiency of the proposed index when applied in MFA with high-dimensional data.

6 CONCLUSIONS

The proposed index proved to be efficient in the reduction of data for application of the MFA technique. This index is recommended for situations in which the groups exhibit low or high heterogeneity and a strong degree of correlation between the variables (ρ = 0.9). In general terms, the indexes are affected by the increase in the number of groups, according to the scenarios evaluated.

APPENDIX A: TABLES

See Tables A1 and A2.

TABLE A1. Indexes used in projection pursuit for reduction in a space d ≥ 1
Index (PI) Characteristics
Holes22 Obtained by means of the normal density function, being sensitive to projections with few points in the center.
Central mass22 Obtained by means of the normal density function, being sensitive to projections with many points in the center.
LDA8 Obtained through linear discriminant analysis in order to find linear projections with the greatest separation between classes and the lowest intra-class dispersion.
PDA23 Penalized discriminant analysis, it is based on penalized LDA, being applied in situations with many highly correlated predictors when classification is required.
Lr-norm3 It is based on the supervised exploratory classification, being used in the detection of outliers.
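As a concrete instance of one of these indexes, the Holes index for a projected sample Z (n × d) can be sketched in R as below; the closed form follows our recollection of the version used by Cook and Swayne's work, so treat the exact constants as an assumption:

  holes_index <- function(Z) {
    d   <- ncol(Z)
    num <- 1 - mean(exp(-0.5 * rowSums(Z^2)))   # mass away from the center
    den <- 1 - exp(-d / 2)                      # normalizing constant
    num / den                                   # large when few points lie near the center
  }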
TABLE A2. Indexes used in projection pursuit for reduction in a space d = 2
Index (PI) Characteristics
Moments10 It is based on the third and fourth bivariate moments, being used mainly in large datasets.
Chi-square24 Based on the chi-square distance, considering certain divisions in the projection plane, and the radial symmetry of the normal bivariate distribution.
Friedman-Tukey2 It is based on interpoint distances in the search for optimum projection. The directions chosen are those that maximize the coefficient, providing the greatest separation for the different clusters. By means of a recursive process, the index is applied again to each cluster in order to find new projections that reveal more clusters.
Entropy7 It is an extension of the Friedman-Tukey index constructed using the negative entropy of a density core estimate.
Legendre25 Based on the L2 distance between the density of the projected data and the standard normal bivariate density. It is constructed by inversion of the density through the normal cumulative distribution function, with the transformations $y_\alpha = 2\Phi(z_\alpha) - 1$ and $y_\beta = 2\Phi(z_\beta) - 1$, where $\Phi$ is the standard normal cumulative distribution function, and using J terms of a Legendre polynomial expansion.
Laguerre-Fourier25 Based on the L2 distance between the projected data density and the standard normal bivariate density in polar coordinates, with $\rho = \sqrt{z_\alpha^2 + z_\beta^2}$ and $\theta = \arctan(z_\beta / z_\alpha)$, using K terms of a Fourier series and, for the radial part, L terms of Laguerre polynomials.
Hermite10 Based on the L2 distance between the density of the projected data and the standard normal bivariate density, expanding the function $f_{\alpha\beta}$, the marginal density of Z in the plane $P(\alpha, \beta)$, in H terms of Hermite polynomials orthogonal to $\phi_1$ (the standard normal density).
Natural Hermite10 Based on the L2 distance between the density of the projected data and the standard normal bivariate density, expanding the function $f_{\alpha\beta}$ in N terms of the Natural Hermite expansion orthogonal to $\phi_2$ (the standard normal bivariate density).

APPENDIX B: R CODE EXEMPLIFICATION OF SOME MATRIX RESULTS

  n <- 10
  p <- 3
  X <- matrix(rnorm(n * p), nrow = n)
  # Append two near-duplicate columns, so that X (10 x 5) has numerical rank 3
  X <- round(cbind(X, X[, 1:2] + matrix(rnorm(20) * 0.000001, nrow = n)), 3)
  svd.x <- svd(X)
  U <- svd.x$u[, 1:3]   # left singular vectors
  L <- svd.x$d[1:3]     # singular values
  V <- svd.x$v[, 1:3]   # right singular vectors
  U %*% diag(L) %*% t(V)              # reconstructs X (up to rounding)
  sum(X^2)                            # total sum of squares of X
  sum(U^2 %*% diag(L^2) %*% t(V^2))   # approximately equal, as in Equation (11)
  sum(L^2)                            # the sum of the eigenvalues, also approximately equal

Biographies

  • Paulo César Ossani holds a PhD in Statistics and Agricultural Experimentation (Universidade Federal de Lavras/UFLA, 2019) and completed a postdoctorate at Universidade Estadual de Maringá (UEM, 2020). He works with multivariate statistics, computational statistics, machine learning, and software development to aid in the solution of mathematical and statistical problems.

  • Mariana Figueira Ramos holds a degree in Statistics (Universidade Federal do Espírito Santo, 2010) and a Master's degree in Statistics and Agricultural Experimentation (Universidade Federal de Lavras, 2013). She works with digital business and is seasoned in the field of probability and statistics with a focus on classification, working especially with the following topics: multivariate analysis, discriminant analysis, logistic regression, and correspondence analysis.

  • Marcelo Ângelo Cirillo is an Associate Professor at Universidade Federal de Lavras, Department of Exact Sciences, advising graduate research in the following fields: multivariate analysis, computational statistics, generalized models, and response surface methodology. All the statistical methodologies are applied in the agrarian and food sciences.
