1. Introduction
Many methods to solve inverse problems (e.g., the Landweber iteration, conjugate gradient method) use the Fréchet derivatives of the cost functionals of these problems [1]. The explicit formula for the Fréchet derivative in terms of the variation of the unknowns of the inverse problem contains the solution of an adjoint problem.
The derivation of the explicit formula for such a Fréchet derivative includes testing the direct problem with the solution of the adjoint problem and vice versa: testing the adjoint problem with the solution of the direct problem. In the case of the parabolic weak problem, such a procedure is cumbersome, because of the asymmetry of the properties of the solution and the test function. In the classical formulation of the parabolic weak problem (see, e.g., [2] and also (19) below), the test function must have higher time regularity than the weak solution. This means that in case of nonsmooth coefficients neither the solution of the direct problem nor the solution of the adjoint problem can be used as a test function. Another formulation of the weak parabolic problem consists in reducing the problem to an abstract Cauchy problem over the time variable (see, e.g., [3]). In such a case, a partial integration over the time has to be implemented within singular distributions in the derivation procedure.
In this paper, we present a new method that enables the deduction of the formulas for the Fréchet derivatives for cost functionals of inverse problems related to weak solutions of parabolic problems. The method is based on an integrated convolutional form of the weak direct problem. The requirements to the test function are weaker than in the classical case and coincide with the properties of the solution of the direct problem. All computations in the deduction procedure can be implemented within usual regular Sobolev spaces.
More precisely, we will consider inverse problems related to a parabolic integrodifferential equation that occur in heat flow with memory [4–6]. This equation contains a time convolution. Therefore, the convolutional form of the weak problem is especially suitable. Supposedly, the proposed method can be generalised to parabolic systems, as well.
2. Formal Direct Problem: Notation
Let Ω be an
n-dimensional domain, where
n ≥ 1, and Γ be the boundary of Ω. Let Γ = Γ
1 ∪ Γ
2 where either Γ
1 or Γ
2 is allowed to be an empty set. In case
n ≥ 2, we assume that Γ is sufficiently smooth, meas Γ
1 ∩ Γ
2 = 0, and for any
j ∈ {1; 2} it holds either Γ
j =
∅ or meas
Γ
j > 0. Denote
()
for
t ≥ 0. Consider the problem (direct problem) to find
u(
x,
t) : Ω
T →
ℝ such that
()
()
()
()
where
T > 0 is a fixed number,
()
aij,
a,
u0 : Ω →
ℝ,
f : Ω
T →
ℝ,
ϕ : Ω
T →
ℝn,
g : Γ
1,T →
ℝ,
ϑ : Γ
2 →
ℝ,
h : Γ
2,T →
ℝ,
m : (0,
T) →
ℝ are given functions, the subscripts
t,
xj,
xi denote the partial derivatives and
()
denotes the time convolution. In case Γ
1 =
∅ (Γ
2 =
∅), the boundary condition (
4) and (
5) is dropped.
The problem (2)–(5) describes the heat flow in the body Ω with the thermal memory. Concerning the physical background, we refer the reader to [4, 6, 7]. The solution u is the temperature of the body and m is the heat flux relaxation (or memory) kernel. The boundary condition (5) is of the third kind where the term −νA · ∇u + m*νA · ∇u equals the heat flux in the direction of the conormal vector.
Let us introduce some additional notations. Let
t > 0. We use the Sobolev spaces
()
Here,
l = 0,1, 2, …,
α = (
α1, …,
αn) is the multiindex, |
α| =
α1 + ⋯+
αn and
. Further, let
X be a Banach space. We denote by
C([0,
t];
X) the space of abstract continuous functions from [0,
t] to
X endowed with the usual maximum norm ∥
v∥
C([0,t];X) : = max
s∈[0,t]∥
v(
s)∥. Moreover, let
()
By means of these spaces, we define the following important functional spaces:
()
Convention. In case n = 1, the integrals , j = 1,2 are equal to , where xk ∈ Γj and K is the number of points in Γj, and Lp(Γj) is simply ℝK.
3. Weak Direct Problem and Its Convolutional Form
Let us return to the direct problem (
2)–(
5). Throughout the paper we assume the following basic regularity conditions on the coefficients, the kernel, and the initial and boundary functions:
()
()
()
()
()
()
and the ellipticity condition
()
(for the sake of simplicity we introduced an assumption for the extension of
g onto Ω
T).
The first aim is to reformulate the problem (
2)–(
5) in a weak form. Let us suppose that
, (
∂/
∂xi)
ϕi ∈
L2(Ω
T),
i = 1, …,
n and (
2)–(
5) has a classical solution
. Then, we multiply (
2) with a test function
η from the space
()
and integrate by parts with respect to time and space variables. We obtain the following relation:
()
This relation makes sense also in a more general case when
aij,
ϕ satisfies only (
11) and (
15) and
u does not have regular first-order time and second-order spatial derivatives.
We call a weak solution of the problem (2)–(5) a function from the space 𝒰(ΩT) that satisfies the relation (19) for any η ∈ 𝒯(ΩT) and in case Γ1 ≠ ∅ fulfills the boundary condition (4).
Lemma 1. The following assertions are valid:
- (i)
where q3 = ∞ if n = 1, q3 ∈ (q1q2/(q1 − q2), ∞) if n = 2 and q3 = 2n/(n − 2) if n > 2, where q1 and q2 are given in (12) and (14), respectively;
- (ii)
for any it holds and , where C is a constant.
Proof. Since , the assertion (i) follows from the continuous embedding of in . The assertion (ii) can be proved by means of Hölder′s inequality.
Theorem 2. The problem (2)–(5) has a unique weak solution. This solution satisfies the estimate
()
where
θ1 = 0 in case Γ
1 =
∅,
θ2 = 0 in case Γ
2 =
∅ and
C0 is a constant depending on Ω, Γ
j,
aij,
a,
ϑ and
m.
Proof. The assertion of the theorem in case m = 0 is well known from the theory of parabolic equations (see, e.g., [2]). Let 𝒵 be the operator that assigns to the data vector d : = (u0, f, ϕ, g, h) the weak solution of the problem (2)–(5) in case m = 0. Then it holds
()
where RHS is the right-hand side of (
20).
Further, let us formulate the problem for the difference v = u − 𝒵d. Introducing the linear operator 𝒜 by the formula
()
the weak problem (
2)–(
5) for the solution
u ∈
𝒰(Ω
T) equivalent to the following operator equation for the quantity
v:
()
We have to estimate
𝒜. For this purpose, we firstly prove the following auxiliary inequality:
()
for any
p ≥ 1 and
y ∈
L2((0,
t);
Lp(Ω)).
Denoting , , making use of the following property of the Bochner integral: for functions w ∈ L1((0, T); Lp(Ω)) and the Cauchy′s inequality, the relation (24) can be deduced by means of the following computations:
()
Next, let t ∈ [0, T] and introduce the operator
()
Due to the causality we have
𝒵(0,
Ptf,
Ptϕ, 0,0)(
x,
t) =
𝒵(0,
f,
ϕ, 0,0)(
x,
t) for any (
x,
t) ∈ Ω
t. Using these relations, the continuity of the linear operator
𝒵, the inequality (
24), and the boundedness of
aij, we compute the following:
()
with some constants
and
depending on Ω, Γ
j,
aij,
a,
ϑ. Using Lemma
1, we obtain
()
Using this relation in (
27), we arrive at the following basic estimate for
𝒜:
()
where
C2 is a constant depending on Ω, Γ
j,
aij,
a,
ϑ. Let us define the weighted norms in
𝒰(Ω
T):
where
σ ≥ 0. The estimate (
29) implies the further estimate
()
Since
as
σ →
∞, there exists
σ0, depending on
C2 and
m, such that
. Thus,
. The operator
𝒜 is a contraction in
𝒰(Ω
T). This implies the existence and uniqueness assertions of the theorem.
To prove the estimate (20), we firstly deduce from (23) the inequality . This implies . Using the equivalence relations , where and (21), we reach (20).
We note the upper integration bound
T in (
19) can be released to be any number
t from the interval [0,
T]. Indeed, (
19) is equivalent to the following problem:
()
for any
η ∈
𝒯(Ω
T). This assertion can be proved using the standard technique defining the test function as follows:
()
and letting the parameter
ϵ to approach 0.
Next we transform the weak direct problem (31) to a form that does not contain a time derivative of the test function η. This form enables the extension of the test space. This is useful for treatment of problems for adjoint states of quasisolutions of inverse problems in next sections.
Theorem 3. The function u ∈ 𝒰(ΩT) satisfies the relation (19) for any η ∈ 𝒯(ΩT) if and only if it satisfies the following relation:
()
for any
η ∈
𝒰0(Ω
T).
Here, according to the definition of the time convolution in the previous section, .
Proof. It is sufficient to prove that u ∈ 𝒰(ΩT) satisfies (31) for any η ∈ 𝒯(ΩT) if and only if it satisfies (33) for any η ∈ 𝒰0(ΩT). Suppose that u ∈ 𝒰(ΩT) satisfies (31) and choose an arbitrary η ∈ 𝒯(ΩT). Let t1 be an arbitrary number on the interval [0, T] and choose some function such that the relation
()
is valid. For instance, it is possible to define
as a periodic function with respect to
t, that is,
for
t ∈ [0,
t1],
for
t ∈ [
t1, 2
t1],
for
t ∈ [2
t1, 3
t1] and so on. Using the relation (
31) with
η replaced by
and setting there
t =
t1 we obtain the equality
()
where
()
()
Note that the time derivative of
η can be removed from
K1 by integration. Indeed, let
t2 ∈ [0,
T]. Then
()
Changing the order of the integrals over
τ and
t1 in the last term, we easily obtain
()
Integrating now the whole equality (
35) over
t1 from 0 to
t2, observing (
37) and (
39), and finally redenoting
t2 by
t, we reach the desired relation (
33). Summing up, we have proved that (
33) holds for any
η ∈
𝒯(Ω
T). But all terms in the right-hand side of (
33) are well defined for
η ∈
𝒰0(Ω
T), too. Since
𝒯(Ω
T) is densely embedded in
𝒰0(Ω
T), we conclude that (
33) holds for any
η ∈
𝒰0(Ω
T).
It remains to show that (33) implies (31). Suppose that u ∈ 𝒰(ΩT) satisfies (33) and choose an arbitrary η ∈ 𝒯(ΩT) and t1 ∈ [0, T]. Again, let be a function from 𝒯(ΩT) such (34) is valid. Inserting instead of η into (33), differentiating with respect to t and setting t = t1 we come to the relation (31). Theorem is proved.
Corollary 4. A function u ∈ 𝒰(ΩT) is a weak solution of (2)–(5) if and only if it satisfies the relation (33) for any η ∈ 𝒰0(ΩT) and in case Γ1 ≠ ∅ fulfills the boundary condition (4).
4. Inverse Problems and Quasisolutions
In the sequel, let us pose some inverse problems for the weak solution of (2)–(5). These problems are selected in order to demonstrate the wide possibilities of the method that we will introduce in Section 5.
Firstly, we suppose that (
2)–(
5) has the following specific form:
()
where
ω = (
ω1, …,
ωN) is unknown. The coefficients and other given functions
f0,
ϕ,
u0,
g,
h are assumed to satisfy (
11)–(
17). Moreover,
is prescribed.
IP1. Find the vector
such that the weak solution of (
40) satisfies the following instant additional conditions:
()
where 0 <
T1 <
T2 < ⋯<
TN ≤
T and
,
i = 1, …,
N are given functions (observations of
u).
Since for , the weak solution u of (40) exists in 𝒰(ΩT); hence, it has traces u(·, Ti) ∈ L2(Ω), i = 1, …, N. In practice, the term may represent an approximation of a more general function F(x, t) ∈ L2(ΩT), where γj, j = 1,2, … form a basis in L2(0, T).
Further, let u0 also be unknown.
IP2. Find the vector
and
u0 ∈
L2(Ω) such that the weak solution of (
40) satisfies the following integral additional conditions:
()
where
vi ∈
L2(Ω),
i = 1, …,
N + 1 are given observation functions and
κi,
i = 1, …,
N are given weights that satisfy the following condition:
()
Note that the integral
in (
42) belongs to
L2(Ω) for any
and
u0 ∈
L2(Ω). Indeed, for such
ω and
u0 it holds
u ∈
𝒰(Ω
T) ⊂
L2(Ω
T), which implies
()
In practice, the weights
κi are usually concentrated in neighborhoods of some fixed values of time
t =
Ti.
Finally, let us pose a nonlinear inverse problem for the coefficient a and the kernel m. Assume that n ∈ {1; 2; 3}. Then any coefficient a that belongs to L2(Ω) satisfies (12). Moreover, let us set q1 = 2 if n = 2 and Γ2≢∅. The other coefficients and the given functions u0, f, ϕ, g, h are assumed to satisfy (11)–(17).
IP3. Find
a ∈
L2(Ω) and
m ∈
L1(0,
T) such that the weak solution of (
2)–(
5) satisfies the following integral additional conditions:
()
where
uT ∈
L2(Ω),
v ∈
L2(0,
T) are given observation functions and
κ is a given weight function such that
κ ∈
L∞((0,
T);
L2(Γ
2)).
As in IP1, we can show that the trace u(·, T) belongs to L2(Ω). Moreover, using the property u ∈ 𝒰(ΩT), the embedding of in L2(Γ2) and Hölder’s inequality, one can immediately check that the term in (45) belongs to L2(0, T).
Available existence, uniqueness, and stability results for IP1–IP3 require stronger smoothness of the data than imposed in the present paper. Let us cite some of these results.
In case N = 1, the well posedness of IP1 was proved in [8]. Partial results were deduced earlier in [9]. A more general problem involving both IP1 and IP2 without the unknown u0 in case N = 1 was studied in [10] by means of different techniques. IP1 and IP2 in case m = 0 and N = 1 were treated in many papers, for example, [11–14]. The case N > 1 is open even if m = 0. Inverse problems to determine m with given a were studied in a number of papers, for example, [7, 15–23]. The problem for a with given m was treated in [8].
In the present paper, we will deal with quasisolutions of IP1–IP3 and related cost functionals. Denote
𝒵1 = (
L2(Ω))
N. Let
M⊆
𝒵1. The quasi-solution of IP1 in the set
M is an element
ω* ∈ arg min
ω∈MJ1(
ω), where
J1 is the following cost functional
()
and
u(
x,
t;
ω) is the solution of (
40) that corresponds to a fixed element
ω.
Similarly, let
. The quasi-solution of IP2 in the set
M is
z* ∈ arg min
z∈MJ2(
z), where
J2 is the cost functional
()
and
u(
x,
t;
z) is the weak solution of (
40) that corresponds to a given vector
z = (
ω,
u0).
Finally, defining
M⊆
𝒵3 : =
L2(Ω) ×
L2(0,
T), the quasi-solution of IP3 in
M is an element
z* ∈ arg min
z∈MJ3(
z), where
J3 is the cost functional
()
and
u(
x,
t;
z) is the weak solution of the direct problem (
2)–(
5) corresponding to given
z = (
a,
m). Here, we restricted the space for the unknown
m to
L2(0,
T), because we will seek for the Fréchet derivative of
J3 in a Hilbert space. Moreover, the kernel of the second addend corresponding to
m in the representation formula of
(
90) is an element of
L2(0,
T) and in general does not belong to the adjoint space
L∞(0,
T).
According to the above-mentioned arguments, the functionals Jk, k = 1,2, 3, are well-defined in 𝒵1, 𝒵2, and 𝒵3, respectively.
5. The Fréchet Derivatives of Cost Functionals of Inverse Problems
5.1. General Procedure
Suppose that the solution u of the direct problem depends on a vector of parameters p that has to be determined in an inverse problem making use of certain measurements of u. Let the quasi-solution of the inverse problem be sought by a method involving the Fréchet derivative of the cost functional (i.e., some gradient-type method). Usually in practice, a solution of a proper adjoint problem is used to represent the Fréchet derivative.
We introduce a general procedure to deduce such adjoint problems. Assume that Δ
u is the difference of solutions of the direct problem corresponding to a difference of the vector of the parameters Δ
p. More precisely, we suppose that Δ
u is a solution of the following problem:
()
()
()
()
with some data
f†,
ϕ†, Δ
u0,
h† depending on Δ
p. We restrict ourselves to the case when the Dirichlet boundary condition of the state
u is independent of
p. Therefore, the condition (
51) for Δ
u is homogeneous.
In practice, the adjoint parabolic problems are usually formulated as backward problems. In our context, it is better to pose adjoint problems in the forward form. The involved memory term with m is defined via a forward convolution and from the practical viewpoint, it is preferable to have the direct and adjoint problems in a similar form (e.g., to simplify parallelisation of computations).
More precisely, let the adjoint state
ψ be a solution of the following problem:
()
where
f∘,
ϕ∘,
u∘, and
h∘ are some data depending on Δ
u and the cost functional under consideration.
Assume that the quadruplets f†, ϕ†, Δu0, h†, and f∘, ϕ∘, u∘, h∘ satisfy the conditions (14)–(16). Then, due to Theorem 2, the problems (49)–(52) and (53) have unique weak solutions in the space 𝒰(ΩT). Actually, we have Δu, ψ ∈ 𝒰0(ΩT) because of the homogeneous boundary conditions on Γ1,T.
Let us write the relation (
33) for Δ
u and use the test function
η =
ψ. Then we obtain for any
t ∈ [0,
T]
()
Secondly, let us write this relation for
ψ and use the test function
η = Δ
u. Then we have for any
t ∈ [0,
T]
()
Subtracting (
54) from (
55), using the commutativity of the convolution, the symmetricity relations
aij =
aji and differentiating with respect to
t, we arrive at the following basic equality that can be used in various inverse problems:
()
5.2. Derivative of J1
Theorem 5. The functional J1 is the Fréchet differentiable in and
()
where
,
i = 1, …,
N, are the unique
ω-dependent weak solutions of the following problems:
()
i = 1, …,
N.
Proof. Let us fix some . One can immediately check that it holds
()
where Δ
u(
x,
t;
ω) =
u(
x,
t;
ω + Δ
ω) −
u(
x,
t;
ω) ∈
𝒰0(Ω
T) is the weak solution of the following problem:
()
Applying the estimate (
20) to the solution of this problem we deduce the following estimate for the second term in the right-hand side of (
59):
()
with some constant
C4. This implies that
J1 is the Fréchet differentiable and the first term in the right-hand side of (
59) represents the Fréchet derivative, that is,
()
Further, let us use the method presented in Section 5.1 to deduce the proper adjoint problems. Comparing (60) with (49)–(52) we see that , ϕ† = Δu0 = h† = 0. Therefore, the relation (56) has the form
()
In order to deduce a formula for the component
σi in the quantity
, we set
,
h∘ =
f∘ =
ϕ∘ = 0 and
t =
Ti in (
63). Then we immediately have
()
where according to (
53) and the definition of
, the function
ψi is the weak solution of the problem (
58) in the domain Ω
T instead of
. Due to Theorem
2, this problem has a unique solution. From (
62) and (
64) we obtain (
57). The latter formula contains the values of
ψi in
. Therefore, we can restrict the problem (
58) from Ω
T to
.
To use the formula (57) one has to solve N weak problems for the functions ψi in domains . In the following theorem, we will show that computational work related to the evaluation of the Fréchet derivative can be considerably reduced. Actually, it is sufficient to solve N weak problems in the smaller domains , i = 1, …, N. Here, T0 = 0.
Theorem 6. The Fréchet derivative of the functional J1 can also be written in the form
()
where
are the unique
ω-dependent weak solutions of the following sequence of recursive problems in the domains
:
()
where
l =
N,
N − 1, …, 2,1. Here,
()
and the function
fl and the vector Φ
l are defined via
βN,
βN−1, …,
βl+1 as follows:
()
,
and Θ
N = 0, Θ
l = 1 for
l <
N.
Proof. Firstly, let us check that (66) indeed have unique weak solutions βl in . To this end we can use Theorem 2. For the problem βN this is immediate, because the initial condition of the problem for βN belongs to L2(Ω) and other equations in this problem are homogeneous. Further, we use the induction. Choose some l in the range N > l ≥ 1 and suppose that for all k such that N − 1 ≥ k ≥ l. The aim is to us to show that then the problem for βl has a unique weak solution in . Let us represent the kth addend in (68) in the form
()
For any
k in the range
N − 1 ≥
k ≥
l we have
()
where
for
t ∈ [0,
Tk+1 −
Tk],
zk,α(
t) = 0 for
t ∉ [0,
Tk+1 −
Tk] and
mk(
t) = |
m(
Tk −
Tl +
t)|. Since
m ∈
L1(0,
T) and
, we have
mk ∈
L1(0,
Tl −
Tl−1 +
Tk+1 −
Tk) and
zk,α ∈
L2(0,
Tl −
Tl−1 +
Tk+1 −
Tk). Due to the Young’s theorem for convolutions, we get
mk*
zk,α ∈
L2(0,
Tl −
Tl−1 +
Tk+1 −
Tk). Therefore,
. This implies that
fl belongs to
. From the latter relation and
aij ∈
L∞(Ω) we immediately have
. Using the embedding theorem and Lemma
1 we see that
afl satisfies the property (
14). Finally, the initial condition
belongs to
L2(Ω), because
,
βl+1 ∈
C([0,
Tl+1 −
Tl],
L2(Ω)). All assumptions of Theorem
2 are satisfied for the problem for
βl. Consequently, it possesses a unique weak solution in
.
Secondly, let us define the functions
()
where
l = 1 … ,
N and
ψi are the solutions of (
58). We are going to show that
,
l = 1, …,
N. From the definition of
using the value of
ψl(
x, 0) and simple computations, we immediately get
()
Let us fix
l = 1, …,
N and choose some
. We continue
η by the formulae
η(
x,
t) =
η(
x,
Tl −
Tl−1) for
t >
Tl −
Tl−1 and
η(
x,
t) =
η(
x, 0) for
t < 0. Further, let us define
ηi(
x,
t) =
η(
x,
Tl −
Ti +
t) where
i =
l, …,
N. By the definition, it holds
.
Let us write down the weak form (31) for the problem for ψi (58) with the test function ηi. We fix some t ∈ [0, Tl − Tl−1] and compute the difference of this weak problem with t replaced by Ti − Tl + t and t replaced by Ti − Tl and take the sum over i = l, …, N. This results in the following expression:
()
where
()
Using the definitions of
η and
and the formula (
72), we have
()
Similarly, using the definitions of
η and
and changing the variable of integration in
Z2, we deduce
()
By the change of variable, the quantity
Z3 is transformed to
()
Let us consider the term
in the latter formula. We compute
()
Thus, (
77) reads
()
Using similar computations, we obtain
()
Plugging (
75), (
76), (
79), and (
80) into (
73), we arrive at a certain weak problem for
that coincides with the weak problem for
βl. Moreover, since
, from (
71) we see that
. But we have shown the uniqueness of the weak solutions of the problems for
βl in
. This implies
.
Finally, from (57), we have
()
Changing here the order of sums over
i and
l and observing (
71) with
replaced by
βl, we obtain (
65). The proof is complete.
5.3. Derivative of J2
Theorem 7. The functional J2 is the Fréchet differentiable in and
()
where
ψ ∈
𝒰(Ω
T) is the unique
z-dependent weak solution of the following problem:
()
Proof. Let us fix some . It holds
()
where Δ
u(
x,
t;
z) =
u(
x,
t;
z + Δ
z) −
u(
x,
t;
z) ∈
𝒰0(Ω
T) is the weak solution of the following problem:
()
Using (
43), the Cauchy inequality and estimate (
20) from Theorem
2 for the problem of Δ
u(
x,
t;
z), we come to the estimate
()
with some constants
C5 and
C6. Therefore,
J2 is the Fréchet differentiable and the first term in the right-hand side of (
84) represents the Fréchet derivative, that is,
()
Comparing (
85) with (
49)–(
52), we see that
,
ϕ† =
h† = 0. Consequently, the relation (
56) has the form
()
To deduce a formula for
, we define
()
u∘ =
h∘ =
ϕ∘ = 0 and
t =
T in (
88). Then from (
87) and (
88), we obtain (
82), where due to (
53),
ψi is the weak solution of the problem (
83). In view of Theorem
2, this problem has a unique solution in
𝒰(Ω
T).
5.4. Derivative of J3
Theorem 8. The functional J3 is the Fréchet differentiable in L2(Ω) × L2(0, T) and
()
where
ψ ∈
𝒰0(Ω
T) is the unique
z-dependent weak solution of the problem
()
Proof. Due to u(x, t; z) ∈ 𝒰(ΩT), κ ∈ L∞((0, T); L2(Γ2)), v ∈ L2(0, T), and uT ∈ L2(Ω), the problem (91) satisfies the assumptions of Theorem 2. Therefore, it has a unique weak solution in 𝒰0(ΩT).
Let Δz = (Δa, Δm) ∈ L2(Ω) × L2(0, T) and define . We split as follows: , where Δu is the weak solution of the following problem:
()
In view of Lemma
1(i),
u ∈
𝒰(Ω
T),
m ∈
L1(0,
T), and the Young’s theorem, it holds
. Therefore, Lemma
1(ii) implies
()
where
C8 and
C9 are some constants depending on
u,
m. Moreover, since
, by Young’s inequality we have also
()
with some constants
C10 and
C11 depending on
u. The obtained estimates show that assumptions of Theorem
2 are satisfied for the problem (
92) and it indeed has a unique weak solution Δ
u ∈
𝒰(Ω
T). Moreover, applying the relation (
20) from Theorem
2, we get
()
where
C12(
m,
u) is a constant depending on
m,
u.
Further, writing the problem for and subtracting the problem for Δu, we obtain the following problem for :
()
where
()
Using again Lemma 1 and the Young’s inequality, we deduce the estimates
()
with some constants
C13 ⋯
C17. Therefore, applying the relation (
20) to the solution of the problem (
96) we obtain
()
with some constant
C18. In case ∥Δ
z∥ is small enough, that is,
()
we have
()
In view of (
95), this implies
()
with a constant
C19.
Similarly, for the solution of the problem (92), we deduce the estimate
()
with a constant
C20.
Now, we write the difference of J3 in the following form:
()
where
()
Using (
102), (
103), and the property
κ ∈
L∞((0,
T);
L2(Γ
2)), we obtain the estimate
in case (
100). This shows that
J3 is the Fréchet differentiable and
()
Finally, let us prove (90) and (91). Comparing (92) with (49)–(52), we see that f† = Δa[u − m*u] − Δm*au, and Δu0 = h† = 0. Thus, (56) reads
()
In order to obtain a formula for the right-hand side in (
106), we set
u∘ = 2[
u(
x,
T;
z) −
uT(
x)],
()
f∘ =
ϕ∘ = 0 and
t =
T. Then, we obtain (
90), where in view of (
53) the function
ψ is the weak solution of (
91).
6. Further Aspects of Minimisation
6.1. Existence of Quasisolutions
For the convenience, we will use also the symbol z to denote the argument ω of J1.
Theorem 9. (i) Let k ∈ {1; 2} and M ⊂ 𝒵k be bounded, closed, and convex. Then, IPk has a quasi-solution in M. The set of quasisolutions is closed and convex.
(ii) Let k ∈ {1; 2; 3} and M ⊂ 𝒵3 be compact. Then IPk has a quasi-solution in M.
Proof. Let us prove (i). The existence assertion follows from Weierstrass existence theorem (see [24, Section 2.5, Theorem 2D]) once we have proved that Jk is weakly sequentially lower semicontinuous in ℱ, that is,
()
But (
109) follows from the continuity and convexity of
Jk [
24]. The convexity of
Jk can be immediately deduced making use of the linearity of the ingredient
u(
x,
t;
z) with respect to
z inside the quadratic functional
Jk (for similar computations see [
25, Theorem 2]). The closedness of the set of quasisolutions is again a direct consequence of the continuity of
Jk. The convexity of the set of solutions follows from the convexity of
Jk.
Next, we prove (ii). Let m = infz∈MJk(z) and zl ∈ M be the minimising sequence, that is, . By the compactness, there exists a subsequence such that . Due to the continuity of Jk we have . Thus, Jk(z*) = m. This proves (ii).
In practice, the compact set M may be a bounded and closed finite-dimensional subset of 𝒵k. The proof of weak lower semicontinuity of J3 may be harder because this functional is not convex.
6.2. Discretisation and Minimisation
Let us consider the penalised discrete problems
()
where
k ∈ {1; 2; 3},
𝒵k,L is an
L-dimensional subspace of
Zk (
L ∈ {1,2, …}) and
ΠL is a penalty function related to the set
ML =
PLM with
PL being the orthogonal projection onto
𝒵k,L. The general assumptions for
ΠL are
()
Theorem 10. The problem (110) has a solution.
Proof. The proof repeats the proof of the statement (ii) of Theorem 9, because in view of the accretivity of Φk,L, a minimizing sequence is bounded and in a finite-dimensional space any bounded sequence is compact.
The Fréchet derivative of Φ
k,L, that is,
can be identified by a certain element in
𝒵k,L, that is,
()
where
is the inner product of
𝒵k. In particular, the addend
is identical to the element
PLwk(
z) where
wk(
z) is the kernel of the functional
. Thus, by virtue of (
57), (
65), (
82), and (
90), it holds
()
In
w1, the functions
ψi and
βl are the
z- (or, equivalently,
ω-) dependent weak solutions of the problems (
58) and (
66), respectively. In
w2 the function
ψ is the weak solution of (
83) and in
w3 the functions
u and
ψ are the
z-dependent weak solutions of (
2)–(
5) and (
91), respectively.
Example 11. Consider the case k = 1. Let M = {z ∈ 𝒵1 : ∥z∥ ≤ ρ}, where ρ > 0. Further, let ξj, j = 1,2, …, be an orthonormal basis in L2(Ω) and 𝒵1,L = (span(ξ1, …, ξL)) N. Then is in 𝒵1,L identical to the element
()
Moreover, it holds
ML = {
z ∈
𝒵1,L : ∥
z∥ ≤
ρ}. Define a convex penalty function
ΠL ∈
C∞[0,
∞) such that
ΠL(
z) = 0 for ∥
z∥ ≤
ρ and
ΠL(
z) =
d(∥
z∥
2 −
ρ2) for ∥
z∥ ≥
ρ +
ε with some
d,
ε > 0. Then
ΠL satisfies (
111).
Let
k ∈ {1; 2; 3}. Choose some initial guess
z0 ∈
𝒵k,L. Compute the approximate solutions by the gradient method
()
where
s = 0,1, 2, … and
cs > 0.
Theorem 12. Let k ∈ {1; 2} and cs be chosen by the rule
()
where
δs ≥ 0,
. Then it holds dist(
zs,
S) → 0 as
s →
∞, where
S is the set of solutions of (
110).
Proof. The assertion follows from Theorem 5.1.2 of [26] once we have proved that is uniformly Lipschitz continuous, Φk,L is convex, and the set M(z0) = {z ∈ 𝒵k,L : Φk,L(z) ≤ Φk,L(z0) + δ} is bounded. The convexity of Φk,L follows from the convexity of its addends ΠL and Jk. The boundedness of M(z0) is a direct consequence of the accretivity of Φk,L following from the accretivity of the addend ΠL.
It remains to show the uniform Lipschitz continuity of in 𝒵k,L (such a property for is assumed in (111)). Let k = 1. Then by (113) and for any , we have
()
where
C22 is a constant independent of
z and
. Further, observing (
58) and (
40), the estimate (
20) of Theorem
2 and
z =
ω, we deduce
()
where
C23,
C24 are independent of
z and
. This proves the uniform Lipschitz continuity of
. Such a property of
can be proved in a similar manner.
The convergence of zs in case k = 3 is an open issue. This case is more complex because IP3 is nonlinear and the Fréchet derivative of J3 is not uniformly Lipschitz continuous.
The quasisolutions of IP1–IP3 are not expected to be stable with respect to the noise of the data, that is, the problems under consideration may be ill posed. Nevertheless, from the intuitive viewpoint, a discretisation should regularise an ill-posed problem. Such a property of the discretisation has been proved in many cases [27, 28]. Alternatively, the index s of the gradient method could be used as a regularization parameter (see [29, 30]). Moreover, the addend ΠL can be defined to be the stabilizing term of the Tikhonov′s method instead of the penalty function, that is, ΠL = α∥z∥2, where α > 0 is the regularisation parameter. Such a ΠL satisfies (111).
Acknowledgments
The paper was supported by the Estonian Science Foundation (Grant 7728), Estonian Ministry of Education and Science target financed theme SF0140011s09, and the Estonian state programme Smart Composites-Design and Manufacturing.