Volume 2013, Issue 1 206265
Research Article
Open Access

Interpolation and Best Approximation for Spherical Radial Basis Function Networks

Shaobo Lin

Shaobo Lin

Institute for Information and System Sciences, School of Mathematics and statistics, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, China xjtu.edu.cn

Search for more papers by this author
Jinshan Zeng

Jinshan Zeng

Institute for Information and System Sciences, School of Mathematics and statistics, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, China xjtu.edu.cn

Search for more papers by this author
Zongben Xu

Corresponding Author

Zongben Xu

Institute for Information and System Sciences, School of Mathematics and statistics, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, China xjtu.edu.cn

Search for more papers by this author
First published: 30 October 2013
Citations: 1
Academic Editor: Mieczysław Mastyło

Abstract

Within the conventional framework of a native space structure, a smooth kernel generates a small native space, and radial basis functions stemming from the smooth kernel are intended to approximate only functions from this small native space. In this paper, we embed the smooth radial basis functions in a larger native space generated by a less smooth kernel and use them to interpolate the samples. Our result shows that there exists a linear combination of spherical radial basis functions that can both exactly interpolate samples generated by functions in the larger native space and near best approximate the target function.

1. Introduction

Many scientific questions boil down to synthesizing an unknown but definite function from finitely many samples . The purpose is to find a functional model that can effectively represent or approximate the underlying relation between the input xi and the output yi. If the unknown functions are defined on spherical domains, then data collected by satellites or ground stations are usually not restricted on any regular region and are scattered. Thus, any numerical method relying on the structure of a “grid” is doomed to fail.

The success of the radial basis function networks methodology in Euclidean space derives from its ability to generate estimators from data with essentially unstructured geometry. Therefore, it is natural to borrow this ideal to deal with spherical scattered data. This method, called the spherical radial basis function networks (SRBFNs), has been extensively used in gravitational phenomenon [1, 2], image processing [3, 4], and learning theory [5].

Mathematically, the SRBFN can be represented as
()
where ciR, ξiSd, and ϕ : [−1,1] → R are called the connection weight, center, and activation function in the terminology of neural networks, respectively. Here and hereafter, Sd denotes the unit sphere embedded in the d + 1-dimensional Euclidean space, Rd+1. Both the connection weights ci and the centers ξi are adjustable in the process of training. We denote by Φn the collection of functions formed as (1).
A basic and classical approach for training SRBFN is to construct exact interpolant based on the given samples (xi, yi) m, that is, to find a function Sn in Φn such that
()
If the activation function ϕ is chosen to be positive definite [2], then the matrix is nonsingular. Thus, the system of (2) can be easily solved by taking the scattered data as the centers of SRBFN. This method has been already named the spherical basis function (SBF) method. Under this circumstance, the connection weights are determined via
()
where c = (c1, c2, …, cm) T, y = (y1, …, ym) T, UT denotes the transpose of the matrix (or vector) U, and A−1 denotes the inverse matrix of A.
For the SBF method, if the target function f belongs to the native space Nϕ associated with the activation function ϕ, then the best SBF approximant of f is characterized by the exact interpolation:
()
where are determined by (3). This property makes the SBF interpolation strategy popular in spherical scattered data fitting [614]. However, there are also two disadvantages for the SBF method. On one hand, since the centers of SRBFN interpolants are chosen as the scattered data, the interpolation capability depends heavily on their geometric distributions. This implies that we cannot obtain a satisfactory interpolation error estimate if the data are “badly” located on the sphere. On the other hand, the well known “native space barrier" [11, 12, 15] shows that (4) only holds for a small class of smooth functions if ϕ is smooth. Therefore, for functions outside Nϕ, the SBF interpolants are not guaranteed to be the best approximants.
Along the flavor of the previous papers [7, 11, 12], we use SRBFNs to interpolate functions in a large native space Nψ associated with the kernel ψ which is less smooth than ϕ and study its interpolation capability. Different from the previous work, the centers are chosen in advance to be quasi-uniform located on spheres, which makes the interpolation error depend on the number rather than the geometric distribution of centers. Our purpose is not to give the detailed error estimate of the SRBFN interpolation. Instead, we focus on investigating the relation between interpolation and approximation for SRBFN. Indeed, we find that there exists an SRBFN interpolant which is also the near best approximant of functions in Nψ, when the number of centers and geometric distribution of the scattered data satisfy a certain assumption. That is, for a suitable choice of n, there exists a function gn ∈ Φn such that
  • (1)

    g exactly interpolates the samples ;

  • (2)

    ,

where C is a constant depending only on d, α, and β.

2. Positive Radial Basis Function on the Sphere

It is easy to deduce that the volume of Sd, Ωd, satisfies
()
where dω denotes the surface area element on Sd. For integer k ≥ 0, the restriction to Sd of a homogeneous harmonic polynomial of degree k on the unit sphere is called a spherical harmonic of degree k. The span of all spherical harmonics of degree k is denoted by , and the class of all spherical harmonics (or spherical polynomials) of degree ks is denoted by . It is obvious that . The dimension of is given by
()
and that of is .
Denote by an orthonormal basis of ; then the following addition formula [16, 17] holds
()
where is the Legendre polynomial with degree k and dimension d + 1. The Legendre polynomial can be normalized such that and satisfies the orthogonality relation
()
where δk,j is the usual Kronecker symbol.
Positive definite radial basis functions on spheres were introduced and characterized by Schoenberg [18]. Namely, a radial basis function φ is positive definite if and only if its expansion
()
has all Fourier-Legendre coefficients ek ≥ 0 and . We define the native space Nφ as
()
with its inner product
()
where .

3. Interpolation and Near Best Approximation

Let be a set of points and d(x, y) = arccos  x · y be the spherical distance between x and y. We denote by , qΛ : = (1/2)min jkd(ξj, ηk), and τΛ : = hΛ/qΛ the mesh norm, separation radius, and mesh ratio of Λ, respectively. It is easy to check that these three quantities describe the geometric distribution of points in Λ. The τ-uniform set Fτ : = Fτ(Sd) is defined by the family of all centers sets Ξ with τΛτ.

Let ϕ and ψ satisfy
()
()
with
()

The Sobolev embedding theorem [12] implies that if α, β > d/2, then Nϕ and Nψ are continuously embedded in C(Sd), and so there are reproducing kernel Hilbert spaces, with reproducing kernels being ϕ and ψ, respectively.

The aim of this section is to study the relation between the exact interpolation and best approximation for Φn with its centers set and activation function ϕ satisfying (12) and (14). It is obvious that such a Φn is a linear space. The following Theorem 1 shows that there exists an SRBFN interpolant which can near best approximate fNψ in the metric of Nψ, where ψ satisfies (13) and (14).

Theorem 1. Let be the set of scattered data with separation radius qX, and α > β > d/2. If ΞnFτ and , where Cτ > 1 is a constant depending only on τ and d, then, for every fNψ, there exists an SRBFN interpolant Sn ∈ Φn such that

  • (i)

    Sn exactly interpolates the samples ,

  • (ii)

    .

Remark 2. Similar results have been considered for spherical polynomials both in C(Sd) and Nψ. Narcowich et al. [11, 12] proved that there exists a spherical polynomial interpolant of degree at most LCqX which can also best approximate the target both in C(Sd) and Nψ.

To prove Theorem 1, we need the following three lemmas, which can be found in [12, Proposition 5.2], [12, Theorem 5.5], and [19, Example 2.10], respectively.

Lemma 3. Let 𝒴 be a (possibly complex) Banach space, 𝒱 a subspace of 𝒴, and Z* a finite-dimensional subspace of 𝒴*, the dual of 𝒴. If for every z*Z* and some γ > 1, γ independent of z*,

()
then for y𝒴 there exists v𝒱 such that v interpolates y on Z*; that is, z*(y) = z*(v) for all z*Z*. In addition, v approximates y in the sense that ∥yv𝒴 ≤ (1 + 2γ) dist 𝒴(y, 𝒱).

Lemma 4. Let β > d/2. If fNϕ, then there is a u ∈ Φn such that

()

Lemma 5. Let ψ be defined in (13) and (14) and β > d/2. Then for arbitrary set of real numbers , we have

()
where C is a constant depending only on d and β.

Now we provide the proof of Theorem 1.

Proof of Theorem 1. We apply Lemma 3 to the case in which the underlying space is the native space 𝒴 = Nψ. Let and 𝒱 = Φn. So in order to prove Theorem 1, it suffices to prove that for arbitrary m real numbers such that

()
Since Nψ is a reproducing kernel Hilbert space with ψ its reproducing kernel, we have
()
Similarly, we obtain
()
where s is the orthogonal projection of to Φn in the metric of Nψ. Then the Pythagorean theorem yields that
()
Thus, Lemma 3 with γ = 2 yields that in order to prove (18), it suffices to prove
()
Let L, PΠL be the best polynomial approximation of in the metric of Nψ; then for arbitrary s ∈ Φn, we have
()
It follows from [12, Page 382] that there exists a constant, κ, depending only on β such that for arbitrary , there holds
()

Let s be the best Φn approximation of P in the metric of Nψ. Then it follows from Lemma 4 and the well-known Bernstein inequality [17] that

()
Since P is the best polynomial approximation of in the metric of Nψ, a simple computation yields
()
Furthermore, it follows from Lemma 5 that
()
Then, ΞnFτ together with yields that
()
where C is a constant depending only on β, d, and τ. Thus, there exists a constant Cτ depending only on α, β, d, and τ such that for arbitrary , there holds
()
Inserting (24) and (29) into (23), we finish the proof of (22) and then complete the proof of Theorem 1.

Acknowledgments

An anonymous referee has carefully read the paper and has provided to us numerous constructive suggestions. As a result, the overall quality of the paper has been noticeably enhanced, to which we feel much indebted and are grateful. The research was supported by the National 973 Programming (2013CB329404), the Key Program of National Natural Science Foundation of China (Grant no. 11131006), and the National Natural Science Foundations of China (Grant no. 61075054).

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.