Volume 31, Issue 18 pp. 8703-8724
SPECIAL ISSUE ARTICLE
Open Access

Robust adaptive model predictive control: Performance and parameter estimation

Xiaonan Lu

Xiaonan Lu

Department of Engineering Science, University of Oxford, Oxford, UK

Search for more papers by this author
Mark Cannon

Corresponding Author

Mark Cannon

Department of Engineering Science, University of Oxford, Oxford, UK

Correspondence Mark Cannon, Department of Engineering Science, University of Oxford, Oxford, UK

Email: [email protected]

Search for more papers by this author
Denis Koksal-Rivet

Denis Koksal-Rivet

Department of Mathematics, University of Chicago, Chicago, Illinois, USA

Search for more papers by this author
First published: 20 August 2020
Citations: 14

Summary

For systems with uncertain linear models, bounded additive disturbances and state and control constraints, a robust model predictive control (MPC) algorithm incorporating online model adaptation is proposed. Sets of model parameters are identified online and employed in a robust tube MPC strategy with a nominal cost. The algorithm is shown to be recursively feasible and input-to-state stable. Computational tractability is ensured by using polytopic sets of fixed complexity to bound parameter sets and predicted states. Convex conditions for persistence of excitation are derived and are related to probabilistic rates of convergence and asymptotic bounds on parameter set estimates. We discuss how to balance conflicting requirements on control signals for achieving good tracking performance and parameter set estimate accuracy. Conditions for convergence of the estimated parameter set are discussed for the case of fixed complexity parameter set estimates, inexact disturbance bounds, and noisy measurements.

1 INTRODUCTION

Model predictive control (MPC) repeatedly solves a finite-horizon optimal control problem subject to input and state constraints. At each sampling instant a model of the plant is used to optimize predicted behavior and the first element of the optimal predicted control sequence is applied to the plant.1 Any mismatch between model and plant causes degradation of controller performance.2 As a result, the amount of model uncertainty strongly affects the bounds of the achievable performance of a robust MPC algorithm.3

To avoid the disruption caused by intrusive plant tests,2 adaptive MPC attempts to improve model accuracy online while satisfying operating constraints and providing stability guarantees. Although the literature on adaptive control has long acknowledged the need for persistently exciting (PE) inputs for system identification,4 few articles have explored how to incorporate PE conditions with feasibility guarantees within adaptive MPC.5 In addition, adaptive MPC algorithms must balance conflicting requirements for system identification accuracy and computational complexity.5, 6

Various methods for estimating system parameters and meeting operating constraints are described in the adaptive MPC literature. Depending on the assumptions on model parameters, parameter identification methods such as recursive least squares (RLS)7, comparison sets,8 set membership identification,9, 10 and neural networks11, 12 have been proposed. Heirung et al13 propose an algorithm where the unknown parameters are estimated using RLS and system outputs are predicted using the resulting parameter estimates. The use of RLS introduces nonlinear equality constraints into the optimization. On the other hand, the comparison model approach described in Aswani et al8 addresses the trade-off between probing for information and output regulation by decoupling these two tasks; a nominal model is used to impose operating constraints whereas performance is evaluated via a model learned online using statistical identification tools. However, the use of a nominal model implies that the comparison model approach cannot guarantee robust constraint satisfaction.

Tanaskovic et al9 consider a linear finite impulse response (FIR) model with measurement noise and constraints. This approach updates a model parameter set using online set membership identification; constraints are enforced for the entire parameter set and performance is optimized for a nominal prediction model. The article proves recursive feasibility but does not show convergence of the identified parameter set to the true parameters. To avoid the restriction to FIR models, Lorenzen et al10 consider a linear state space model with additive disturbance. An online-identified set of possible model parameters is used to robustly stabilize the system. However, the approach suffers from a lack of flexibility in its robust MPC formulation, which is based on homothetic tubes,14 allowing only the centers and scalings of tube cross-sections to be optimized online, and it does not provide convex and recursively feasible conditions to ensure PE control inputs.

In this article, we also consider linear systems with parameter uncertainty, additive disturbances, and constraints on system states and control inputs. Compared with Reference 10, the proposed algorithm reduces the conservativeness in approximating predicted state tubes by adopting more flexible cross-section representations. Building on Reference 15, we take advantage of fixed complexity polytopic tube representations and use hyperplane and vertex representations interchangeably to further simplify computation. We use, similarly to Reference 13, a nominal performance objective, but we impose constraints robustly on all possible models within the identified model set. We prove that the closed-loop system is input-to-state stable (ISS). In comparison with the min-max approach of Reference 15, the resulting performance bound takes the form of an asymptotic bound on the 2-norm of the sequence of closed-loop states in terms of the 2-norms of the additive disturbance and parameter estimate error sequences. In addition, we convexify the persistence of excitation (PE) condition around a reference trajectory and include a penalty term in the cost function to promote convergence of the parameter set. The convexification method is somewhat analogous to that proposed in References 16, 17, where the uncertainty information of the parameter set is approximated using a nominal gain. Here, however, the convexification is obtained by direct linearization of PE constraints on predicted trajectories. The cost function modification proposed here allows the relative importance of the two objectives, namely, controller performance and convergence of model parameters, to be specified.

Bai et al18 consider a particular set membership identification algorithm and show that the parameter set estimate converges with probability 1 to the actual parameter vector (assumed constant) if: (a) a tight bound on disturbances is known; (b) the input sequence is persistent exciting; and (c) the minimal parameter set estimate is employed. However, the minimal set estimate can be arbitrarily complex, and to provide computational tractability various nonminimal parameter set approximations have been proposed, such as n-dimensional balls19 and bounded complexity polytopes.9 The current article allows the use of parameter set estimates with fixed complexity and proves that, despite their approximate nature, such parameter sets converge with probability 1 to the true parameter values. We also derive lower bounds on convergence rates for the case of inexact knowledge of the disturbance bounding set and for the case that model states are estimated in the presence of measurement noise.

This article has five main parts. Section 2 defines the problem and basic assumptions. Section 3 gives details of the parameter estimation, robust constraint satisfaction, nominal cost function, convexified PE conditions, and the MPC algorithm. Section 4 proves recursive feasibility and ISS of the proposed algorithm. Section 5 proves the convergence of the parameter set in various conditions and Section 6 illustrates the approach with numerical examples.

Notation: urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0001 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0002 denote the sets of integers and reals, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0003, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0004. The ith row of a matrix A and ith element of a vector a are denoted [A]i and [a]i. Vectors and matrices of 1s are denoted 1, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0005 is the identity matrix. For a vector a, ‖a‖ is the Euclidean norm and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0006; the largest element of a is urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0007 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0008. The absolute value of a scalar s is urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0009 and the floor value is ⌊s⌋. urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0010 is the number of elements in a set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0011. urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0012 is Minkowski addition for sets urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0013 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0014, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0015. The matrix inequality A ≽ 0 (or A ≻ 0) indicates that A is positive semidefinite (positive definite) matrix. The k steps ahead predicted value of a variable x is denoted xk, and the more complete notation xk|t indicates the k steps ahead prediction at time t. A continuous function urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0016 is a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0017-function if it is strictly increasing with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0018, and is a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0019-function if in addition urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0020 as s → . A continuous function urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0021 is a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0022-function if, for all t ≥ 0, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0023 is a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0024-function, and, for all s ≥ 0, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0025 is decreasing with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0026 as t → . For functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0027 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0028 we denote urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0029, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0030 with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0031.

2 PROBLEM FORMULATION AND PRELIMINARIES

This article considers a linear system with linear state and input constraints and unknown additive disturbance:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0032(1)
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0033 is the system state, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0034 is the control input, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0035 is an unknown disturbance input, and t is the discrete time index. The system matrices urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0036 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0037 depend on an unknown but constant parameter urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0038. The disturbance sequence {w0,w1, …} is stochastic and (wi,wj) is independent for all i ≠ j. States and control inputs are subject to linear constraints, defined for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0039, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0040 by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0041(2)

Assumption 1. (Additive disturbance)The disturbance wt lies in a convex and compact polytope urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0042, where

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0043(3)
with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0044, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0045 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0046.

Assumption 2. (Parameter uncertainty)The system matrices A and B are affine functions of the parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0047:

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0048(4)
for known matrices Aj, Bj, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0049, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0050 lies in a known, bounded, convex polytope urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0051 given by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0052

Assumption 3. (State and control constraints)The set

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0053
is compact and contains the origin in its interior.

To obtain finite numbers of decision variables and constraints in the MPC optimization problem, the predicted control sequence at time t is assumed to be expressed in terms of optimization variables v0|t, … , vN − 1|t as
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0054(5)
where N is the prediction horizon. The gain K is designed offline and is assumed to robustly stabilize the uncertain system urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0055, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0056 in the absence of constraints. This assumption can be stated as follows.

Assumption 4. (Feedback gain and contractive set)There exists a polytopic set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0057 and feedback gain K such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0058 is urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0059-contractive for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0060, that is

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0061(6)
for all x ∈ {x : Tx ≤ 1} and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0062. The representation urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0063 is assumed to be minimal in the sense that it contains no redundant inequalities.

3 ADAPTIVE ROBUST MPC

In this section, a parameter estimation scheme based on References 20, 21 is introduced. We then discuss the construction of tubes to bound predicted model states and associated constraints.

3.1 Set-based parameter estimation

At time t we use observations of the system state xt to determine a set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0064 of unfalsified model parameters. The set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0065 is then combined with the parameter set estimate urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0066 to construct a new parameter set estimate urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0067.

Unfalsified parameter set: Define Dt and dt as the matrix and vector
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0068(7)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0069(8)
Then, given xt, xt − 1, ut − 1 and the disturbance set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0070 in (3), the unfalsified parameter set at time t is given by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0071(9)
with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0072 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0073.
Parameter set update: Let urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0074 be an a priori chosen matrix. The estimated parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0075 is defined by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0076(10)
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0077 is updated online at times urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0078. The complexity of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0079 is controlled by fixing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0080, which fixes the directions of the half-spaces defining the parameter set. We assume that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0081 is chosen so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0082 is compact for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0083 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0084. Using a block recursive polytopic update method,20 urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0085 is defined as the smallest set (10) containing the intersection of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0086 and unfalsified sets urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0087 over a window of length Nu:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0088(11)
(where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0089 for all j ≤ 0). We refer to Nu as the PE window. Note that Nu is independent of the MPC prediction horizon N. Using linear conditions for polyhedral set inclusion22 urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0090 in (11) can be obtained by solving a linear program for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0091:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0092

Lemma 1.If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0093 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0094 is defined by (10), (11), then urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0095 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0096 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0097.

3.2 Polytopic tubes for robust constraint satisfaction

This section considers predicted state and control trajectories. To simplify notation, we omit the subscript t indicating the time at which state and control predictions are made, whenever t indicates current time; thus the k steps ahead predictions xk|t, vk|t are denoted xk, vk. To ensure that the predicted state and control sequences satisfy the operating constraints (2) robustly for the given uncertainty bounds, we construct a tube (a sequence of sets) urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0098 satisfying, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0099, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0100, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0101,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0102(12)
Hyperplane form: For given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0103 satisfying Assumption 4 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0104, let urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0105 denote the k steps ahead cross-section of the predicted state tube:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0106(13)
The MPC algorithm described in Section 3.5 optimizes the shape of the predicted state tube online by allowing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0107 to be an optimization variable. If, for a given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0108, the constraint urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0109 is redundant for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0110 in the hyperplane description (13) (ie, if the set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0111 is unchanged by removing this constraint), we define (without loss of generality) urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0112. Thus, for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0113, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0114 necessarily holds for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0115. Then (12) is equivalent to, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0116 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0117,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0118
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0119 is the vector with ith element urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0120 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0121. Substituting D(x,u) and d(x,u) from (7), (8), this implies linear conditions on urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0122, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0123, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0124:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0125
and, for a given initial state x, the constraint urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0126 requires
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0127(14)
Vertex formurn:x-wiley:rnc:media:rnc5175:rnc5175-math-0128 has an equivalent representation in terms of its vertices, which we denote as urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0129, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0130:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0131(15)
Note that m, the number of vertices of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0132, is fixed, and for given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0133 in (13), we may have urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0134 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0135 in (15) (ie, the vertex description may contain repeated vertices). The presence of repeated vertices does not affect the formulation. For each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0136, define an index set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0137 (with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0138), such that the ith row of T and ith element of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0139 satisfy
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0140
Since T is constant, the index set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0141 associated with active inequalities at the vertex x(j) is independent of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0142 and can be computed offline. Therefore, for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0143, we have
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0144(16)
where the matrix urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0145 can be computed offline given knowledge of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0146 using the property that
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0147(17)
Using the vertex representation (16), the condition that (2) is satisfied for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0148 is equivalent to, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0149,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0150(18)
Substituting D(x,u) and d(x,u) from (7), (8), condition (12) can be expressed equivalently as
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0151
for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0152, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0153 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0154. This is equivalent (see e.g. Reference 22, Proposition 3.31) to the requirement that there exist matrices urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0155 satisfying, for each prediction time step urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0156 and each vertex urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0157, the conditions
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0158(19a)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0159(19b)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0160(19c)
Given the dual mode predicted control law (5), we introduce the terminal conditions that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0161 and (F + GK)x ≤ 1 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0162, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0163, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0164. Then (2) and (12) are satisfied if (18), (19) hold for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0165 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0166, and there exist matrices urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0167 satisfying the conditions, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0168
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0169(20a)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0170(20b)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0171(20c)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0172(20d)

3.3 Objective function

Consider the nominal cost defined for Q, R ≻ 0 by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0173(21)
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0174 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0175 are elements of predicted state and control sequences generated by a nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0176:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0177(22a)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0178(22b)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0179(22c)
for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0180 and where v = {v0, … , vN − 1}. Define urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0181 as the solution of the Lyapunov matrix equation
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0182(23)
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0183. Note that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0184 is well-defined for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0185 due to Assumption 4. Then (21) is equivalent to
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0186(24)
We assume knowledge of an initial nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0187, which could be estimated using physical modeling or offline system identification, alternatively urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0188 could be defined as the Chebyshev center of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0189. For t > 0, we assume that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0190 is updated by projecting urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0191 onto the parameter set estimate urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0192, that is
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0193(25)

Remark 1.For the ISS analysis in Section 4 it is essential that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0194. However, subject to this constraint, alternative update laws for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0195 are possible; for example, a least mean squares estimate projected onto urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0196.10

3.4 Augmented objective function and persistent excitation

The regressor Dt in (7) is PE if
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0197(26)
for some PE window urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0198, some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0199, and all times t0.4 Although the upper bound in (26) implies convex constraints on xt and ut, the lower bound is nonconvex. The bounds on convergence of the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0200 derived in Section 5 suggest faster convergence as urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0201 in the PE condition (26) increases.

Previously proposed MPC strategies that incorporate PE constraints consider the PE condition to be defined on an interval such as {t − Nu + 1, … , t}, where t is current time, which means that the PE constraint depends on only the first element of the predicted control sequence. Marafioti et al23 simplify the PE condition by expressing it as a nonconvex quadratic inequality in ut. Likewise, Lorenzen et al10 show that the PE condition is equivalent to a nonconvex constraint on the current control input. Lu and Cannon15 linearize the PE condition about a reference trajectory and thus obtain a sufficient condition for persistent excitation.

In this article, on the hand, we define the PE condition over predicted trajectories (from k = 0 to k = Nu − 1 steps ahead), and we therefore require, at time t and for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0202,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0203(27)

The inclusion of predicted future states and control inputs in this PE condition allows for greater flexibility in meeting the constraint. To avoid nonconvex constraints, we derive a convex relaxation that provides a sufficient condition for (27).

Assume reference state and control predicted sequences, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0204 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0205, approximating the optimal predicted state and control sequences, are available. To derive sufficient conditions for (27), we consider the difference between the reference and optimized sequences, denoted urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0206 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0207 (ie, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0208 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0209). Since the prediction tube implies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0210, we therefore have urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0211 where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0212. Denote the vertices of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0213 as urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0214 and for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0215, we have
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0216
Moreover, Dk depends linearly on uk and xk, and hence for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0217,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0218
Here, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0219 is a positive semidefinite matrix, and by omitting this term we obtain sufficient conditions for (27) as a set of LMIs in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0220 and vk. The following convex conditions are thus sufficient to ensure (27) whenever urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0221
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0222(28a)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0223(28b)

where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0224 are intermediate variables.

Another innovation of this article is the inclusion of PE coefficient in the cost function. Previous approaches10, 15 face the difficulty of choosing a suitable urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0225 value for the PE constraint in the implementation. A larger value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0226 is generally desirable, but a large urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0227 might make the optimization problem with the PE condition infeasible. In this article, we incentivize a large value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0228 by modifying the MPC objective function as follows
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0229(29)
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0230 is a weight that controls the relative priority given to satisfaction of the PE condition (27) and tracking performance. This modification does not affect the feasibility of the optimization.

Although incorporating a condition such as (27) or the convex relaxation (28) and (29) into a MPC strategy does not ensure that the closed-loop system satisfies a corresponding PE condition, its effect on the convergence rate of the estimated parameter set is significant, as shown in the numerical example in Section 6. In addition, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0231 is a meaningful and straightforward coefficient to tune, and the PE constraint can be easily switched off by setting urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0232.

3.5 Proposed algorithm

Offline:
  • 1. Choose suitable T defining the predicted state tube and compute the corresponding Uj in (17).
  • 2. Obtain a nominal urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0233.
  • 3. Minimize the contracitivity factor urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0234 satisfying (6) and obtain a feedback gain K.
Online: For t = 0,1,2,…
  • 1. Obtain the current state xt and set x = xt.
  • 2. Update urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0235 using (11) and the nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0236 using (25), and solve (23) for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0237.
  • 3.

    At t = 0 compute the initial reference state and control sequences urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0238 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0239, for example, by solving the nominal problem described in Remark 4.

    At t > 0 compute the reference state and control sequences urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0240 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0241, using the solution urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0242 at t − 1, and

    urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0243(30)

  • 4. Compute urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0244, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0245, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0246, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0247, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0248 , urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0249 the solution of the semidefinite program (SDP):
    urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0250
  • 5. Implement the current control input urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0251.

Remark 2.In offline step 1, T can be chosen so that the polytope urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0252 in Assumption 4 approximates a robust control invariant set of the form urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0253, where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0254 satisfies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0255 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0256. Using the vertex representation of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0257, the matrix urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0258 can be computed by solving a SDP (see e.g. Reference 24, Chapter 5). This approach allows the number of rows in T to be specified by the designer. However, T can alternatively be chosen so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0259 approximates the minimal robust positive invariant (RPI) set or the maximal RPI set for the system (1) and (2) under a specified stabilizing feedback law.22

Remark 3.In offline step 3, the computation of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0260 subject to (6) for given T can be performed by solving a LP using the vertex representation of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0261 [ 22, chap. 7]. The objective of minimizing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0262 is chosen to make the constraints of problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0263 easier to satisfy. In particular, choosing K so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0264 in (6) ensures that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0265 exists satisfying the terminal constraints (20b-d).

Remark 4.At t = 0, the reference sequences urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0266 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0267 may be computed by solving

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0268(31)

Remark 5.The online computation of the proposed algorithm may be reduced by updating urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0269 only once every Nu > 1 time steps. For example, in Step 2, set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0270 for t ∈ {Nu,2Nu, …} and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0271 at all times t ∉ {Nu,2Nu, …}.

In Section 4, we use the property that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0272 to show that the solution, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0273, of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0274 at time t − 1 forms part of a feasible solution of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0275 at time t. As a result, the reference sequences urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0276, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0277 in Step 3 are feasible for problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0278 at all times t > 0.

4 RECURSIVE FEASIBILITY AND STABILITY

4.1 Recursive feasibility

At time t ≥ 1, let a suboptimal set of decision variables, denoted urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0279 be defined in terms of the optimal solution urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0280 of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0281 at time t − 1 by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0282

Proposition 1. (Recursive Feasibility)The online MPC optimization urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0283 is feasible at all times urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0284 if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0285 is feasible at t = 0 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0286 for all time t.

Proof.If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0287 is feasible at t − 1, then at time t, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0288 is: feasible for (14) because urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0289; feasible for (18) and (19) for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0290 because urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0291 is feasible for (18) and (19) for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0292 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0293; and feasible for (18) and (19) for k = N − 1 and feasible for (20) because urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0294 is feasible for (20) and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0295. Finally, we note that (22) is necessarily feasible and (28) necessarily holds for some scalar urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0296 if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0297.

4.2 Input-to-state stability

Throughout this section we set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0298 in problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0299. Therefore, the objective of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0300 is urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0301 where J is the nominal cost (24). As a result of parameter adaption, the change of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0302 online might increase the cost, but this is absorbed in the ISS terms. To simplify notation we define a stage cost L(x,v) and terminal cost urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0303 as urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0304 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0305 so that (24) is equivalent to
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0306
Denoting the actual state at next time step as x+, we define the function urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0307 as
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0308
so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0309.

Lemma 2. (ISS-Lyapunov function [25])The system

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0310(32)
with control law urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0311 is ISS with region of attraction urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0312 if the following conditions are satisfied.

(i).  urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0313 contains the origin in its interior, is compact, and is a robust positively invariant set for (32), that is, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0314 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0315, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0316, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0317, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0318.

(ii). There exist urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0319- functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0320, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0321-functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0322, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0323 and a function urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0324 such that for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0325, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0326 is continuous, and for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0327,

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0328(33)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0329(34)

In the following, we define urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0330 as the set of states x such that problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0331 is feasible and assume that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0332 is nonempty. In addition, for a given state x, nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0333, and parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0334, we denote urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0335 as the optimal solution of problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0336, and let urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0337 be the corresponding optimal value of the cost in (24), so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0338.

Theorem 1.Assume that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0339 and the nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0340 is not updated, that is, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0341 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0342. Then for all initial conditions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0343, the system (1) with control law urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0344, where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0345 is the first element of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0346, robustly satisfies the constraint (2) and is ISS with region of attraction urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0347.

Proof.We first show that condition (i) of Lemma 2 is satisfied with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0348. If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0349 is feasible at t = 0, then (20) implies that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0350 exists such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0351 satisfies

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0352(35)

Therefore urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0353 is feasible for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0354 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0355, and hence urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0356. In addition, the robust invariance of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0357 implied by (35) ensures that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0358, and since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0359 due to Assumption 1, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0360 must contain the origin in its interior. Furthermore, Proposition 1 shows that if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0361 is initially feasible, then it is feasible for all t ≥ 0, so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0362 is positively invariant for (32). Finally, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0363 is necessarily compact by Assumption 3, and it follows that condition (i) of Lemma 2 is satisfied if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0364.

We next consider the bounds (33) in condition (ii) of Lemma 2. For a given state x, nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0365 and parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0366, problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0367 with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0368 and Q, R ≻ 0 is a convex quadratic program. Therefore, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0369 is a continuous positive definite, piecewise quadratic function of x26 for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0370 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0371. Furthermore, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0372 is compact due to Assumption 2 and it follows that there exist urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0373-functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0374, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0375 such that (33) holds with

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0376
To show that the bound (34) in condition (ii) of Lemma 2 holds, let urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0377 denote the set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0378. Then, given the linear dependence of the system (1), the model parameterization (4), and the predicted control law (5) on the state x, disturbance w, and parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0379, and since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0380, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0381, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0382, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0383 are compact sets by Assumptions 1,2, and 3, there exist urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0384 functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0385, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0386, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0387, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0388, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0389 such that, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0390, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0391, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0392, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0393,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0394

Following the proof of Theorem 5 in Limon et al25 and using the weak triangle inequality for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0395-functions,27 we obtain

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0396
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0397 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0398, and both urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0399 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0400 are urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0401-functions. Since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0402 is a feasible but suboptimal solution of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0403 at x+, and since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0404 by assumption, the optimal cost function satisfies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0405 and hence
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0406
Thus, all conditions of Lemma 2 are satisfied.

Corollary 1.Assume that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0407 and the nominal parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0408 is updated at each time urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0409 using (25). Then for all initial conditions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0410, the system (1) with control law urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0411, where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0412 is the first element of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0413, robustly satisfies the constraint (2) and is ISS with region of attraction urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0414.

Proof.It can be shown that condition (i) of Lemma 2 and the bounds (33) in condition (ii) of Lemma 2 are satisfied with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0415 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0416 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0417-functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0418, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0419 using the same argument as the proof of Theorem 1. To show that (34) is also satisfied and hence complete the proof we use an argument similar to the proof of Theorem 1. In particular, as before we define urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0420 using the optimal solution of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0421, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0422, and

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0423
However, here we define z = {z0, … , zN} as the sequence
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0424
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0425 has urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0426 for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0427 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0428. Then urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0429 implies
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0430(36)

The update law (25) ensures that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0431 since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0432. Hence, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0433, we have

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0434
and it follows that, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0435,
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0436
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0437. In addition we have, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0438
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0439
and, since Q ≻ 0, there exists a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0440 function urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0441 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0442, while (23) implies
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0443
Furthermore (23) is linear in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0444 and the solution urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0445 is unique for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0446 (see, eg, Reference 28) since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0447 is by assumption stable. Therefore, by the implicit function theorem, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0448 is Lipschitz continuous and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0449 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0450, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0451, for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0452. Hence
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0453
Collecting the bounds derived above on individual terms in the expression for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0454 in (36), we obtain
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0455
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0456, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0457 are urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0458-functions. But by optimality we have urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0459. Therefore (34) holds and hence all of the conditions of Lemma 2 are satisfied.

Remark 6.The ISS property implies that there exists a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0460-function urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0461 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0462-functions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0463 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0464 such that for all feasible initial conditions urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0465, the closed-loop system trajectories satisfy, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0466,

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0467

Remark 7.Theorem 1 and Corollary 1 do not apply to the case that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0468. However, if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0469 is replaced by a time-varying weight urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0470 in the objective of problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0471, then ISS can be guaranteed by setting urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0472 for all t ≥ t0, for some finite horizon t0.

5 CONVERGENCE OF THE ESTIMATED PARAMETER SET

In terms of D and d defined in (7) and (8), the system model urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0473 can be rewritten as
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0474(37)
where xt + 1, D(xt,ut), and d(xt,ut) are known at time t + 1. Thus, the system is linear with regressor Dt, uncertain parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0475, and additive disturbance urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0476.

Bai et al18 show that, for such a system, the diameter of the parameter set constructed using a set-membership identification method converges to zero with probability 1 if the uncertainty bound urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0477 is tight and the regressor Dt is PE. We extend this result and prove convergence of the estimated parameter set in more general cases. Specifically, in this article we avoid the problem of computational intractability arising from a minimum volume update law of the form urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0478. Instead, we derive stochastic convergence results for parameter sets with fixed complexity and update laws of the form urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0479.

In this section, we first discuss relevant results for an update law that gives a minimal parameter set estimate for a given sequence of states (but whose representation has potentially unbounded complexity), before considering convergence of the fixed-complexity parameter set update law of Section 3.1. We then compute bounds on the parameter set diameter if the bounding set for the additive disturbances is overestimated. Finally, we demonstrate that similar results can be achieved when errors are present in the observed state, as would be encountered, for example, if the system state were estimated from noisy measurements. In each case we relate the PE condition to the rate of parameter set convergence. We also prove that the parameter set converges to a point (or minimal uncertainty set) with probability one.

In common with Bai et al,18, 29 we do not assume a specific distribution for the disturbance input. However, the set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0480 bounding the model disturbance is assumed to be tight in the sense that there is nonzero probability of a realization wt lying arbitrarily close to any given point on the boundary, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0481, of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0482.

Assumption 5. (Tight disturbance bounds)For all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0483 and any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0484 the disturbance sequence {w0,w1, …} satisfies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0485, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0486, where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0487 whenever urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0488.

Assumption 6. (Persistent Excitation)There exist positive scalars urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0489, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0490 and an integer Nu ≥ ⌈p/nx⌉ such that, for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0491 we have urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0492 and

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0493

We further assume throughout this section that the rows of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0494 are normalized so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0495 for all i.

5.1 Minimal parameter set

The unfalsified parameter set at time t defined in (9) can be expressed as
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0496(38)
where wt is the disturbance realization at time t and Dt = D(xt,ut). Let w0 be an arbitrary point on the boundary urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0497, then the normal cone urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0498 to urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0499 at w0 is defined
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0500(39)

Proposition 2.For all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0501, all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0502, and for any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0503 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0504, under Assumptions 1, 5, and 6 we have

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0505
for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0506.

Proof.Assumption 1 implies that there exists urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0507 so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0508 for any given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0509 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0510 Therefore, if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0511 satisfies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0512, then the definition (39) of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0513 implies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0514, and hence urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0515 from (38). But

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0516
Therefore, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0517 whenever urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0518. Furthermore, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0519 Assumption 6 implies
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0520
Hence, if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0521, then there must exist some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0522 such that
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0523
If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0524, then it follows that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0525 and thus urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0526. Assumption 5 implies the probability of this event is at least urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0527.

Theorem 2.If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0528 and Assumptions 1, 5, and 6 hold, then for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0529 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0530, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0531 and any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0532, we have

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0533

Proof.For the nontrivial case of t ≥ Nu we have urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0534 since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0535. Also urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0536 and Proposition 2 implies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0537 if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0538. Therefore,

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0539
and the result follows by applying this inequality ⌊t/Nu⌋ times.

Corollary 2.Under Assumptions 1, 5, and 6, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0540 converges to urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0541 with probability 1.

Proof.For any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0542 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0543 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0544, Theorem 2 implies that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0545 is necessarily finite, and since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0546 requires that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0547, the Borel-Cantelli lemma therefore implies that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0548. It follows that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0549 as t →  with probability 1 since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0550 is arbitrary.

5.2 Fixed complexity parameter set

In order to reduce computational load and ensure numerical tractability, we assume that the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0551 is defined by a fixed complexity polytope, as in (10) and (11). This section shows that, although a degree of conservativeness is introduced by fixing the complexity of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0552, asymptotic convergence of this set to the true parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0553 still holds with probability 1.

Theorem 3.If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0554 is updated according to (10), (11) and Remark 5, and Assumptions 1, 5, and 6 hold, then for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0555 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0556 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0557, we have, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0558 and any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0559,

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0560

Proof.For the nontrivial case of t ≥ Nu we have urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0561 since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0562 by Lemma 1. Consider therefore the probability that any given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0563 satisfying urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0564 lies in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0565. Define vectors gj for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0566 by

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0567(40)

Assumption 1 implies that, for any given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0568, there exists a urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0569 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0570. Accordingly, choose urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0571 so that gj in (40) satisfies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0572 for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0573. Then

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0574(41)
is a necessary condition for urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0575 due to (38) and (39). But (40) and Assumption 6 imply
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0576
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0577 by assumption, and it follows from (41) that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0578 if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0579 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0580. From Assumption 5 and the independence of the sequence {w0,w1, …} we therefore conclude that
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0581
and the result follows by applying this inequality ⌊t/Nu⌋ times.

Corollary 3.Under Assumptions 1, 5, and 6, the fixed complexity parameter set estimate urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0582 converges to urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0583 with probability 1.

Proof.By applying the Borel-Cantelli lemma to Theorem 3 it can be shown (analogously to the proof of Corollary 2) that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0584 if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0585 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0586 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0587. Since urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0588 is assumed to be chosen so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0589 is compact for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0590 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0591 is nonempty, it follows that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0592 as t →  with probability 1.

5.3 Inexact disturbance bounds

We next consider the case in which the set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0593 bounding wt in Assumption 1 does not satisfy Assumption 5. Instead, we assume that a compact set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0594 providing a tight bound on wt exists but is either unknown or nonpolytopic or nonconvex. We define the unit ball urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0595 and use a scalar urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0596 to characterize the accuracy to which urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0597 approximates urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0598.

Assumption 7. (Inexact disturbance bounds)urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0599 is a compact set such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0600 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0601, and, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0602 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0603, the disturbance sequence {w0,w1, …} satisfies, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0604, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0605 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0606, where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0607 whenever urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0608.

Remark 8.Assumption 7 implies that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0609. As a result, every point in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0610 can be a distance no greater than urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0611 from a point in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0612, that is urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0613.

Theorem 4.If urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0614 and Assumptions 1, 6, and 7 hold, then for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0615 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0616, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0617 and any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0618, we have

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0619

Corollary 4.Under Assumptions 1, 6, and 7, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0620 converges to urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0621 with probability 1.

Theorem 5.Let Assumptions 1, 6, and 7 hold and let urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0622 be the fixed complexity parameter set defined by (10), (11) with Remark 5. Then, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0623 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0624 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0625 and any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0626, we have, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0627,

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0628

Proof.A bound on the probability that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0629 satisfying urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0630 lies in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0631 can be found using the same argument as the proof of Theorem 3. Thus, choose urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0632, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0633 so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0634, where

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0635
and pick urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0636 so that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0637 for each urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0638. Then, from urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0639 and Assumptions 6 and 7 we have
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0640

Therefore, if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0641 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0642, then urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0643 which implies urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0644. From Assumption 7 and the independence of the sequence {w0,w1, …} we therefore conclude that

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0645
and the result follows by applying this inequality ⌊t/Nu⌋ times.

Corollary 5.Under Assumptions 1, 6, and 7, the fixed complexity parameter set defined by (10), (11) and Remark 5 converges with probability 1 to a subset of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0646.

5.4 System with measurement noise

Consider the system model with an unknown parameter vector urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0647 and measurement noise st:
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0648(42a)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0649(42b)
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0650 is a measurement (or state estimate) and the noise sequence {s0,s1, …} has independent elements satisfying urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0651 for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0652.

Assumption 8. (Measurement noise bounds)urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0653 is a compact convex polytope with vertex representation urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0654.

Due to the measurement noise, the unfalsified parameter set must be constructed at each time urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0655 using the available measurements yt, yt − 1, the known control input ut − 1, and sets urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0656 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0657 bounding the disturbance and the measurement noise. To be consistent with (42), urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0658 must clearly lie in the set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0659, and the smallest unfalsified parameter set based on this information is given by
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0660(43a)
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0661(43b)

Thus, Assumption 8 implies that the unfalsified set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0662 is a convex polytope and the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0663 can be estimated using, for example, the update law (10), (11) if urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0664 is known.

Assumption 9. (Tight measurement noise and disturbance bounds)For all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0665, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0666, and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0667 we have

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0668
where urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0669 whenever urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0670.

Given Assumptions 8 and 9, the results of Sections 5.1 and 5.2 apply with minor modifications. Define urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0671, then Assumption 9 implies
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0672
for any given urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0673 with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0674 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0675. This implies the following straightforward extensions of Theorems 2 and 3 and Corollaries 2 and 3.

Corollary 6.Let Assumptions 1, 6, 8, and 9 hold and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0676, with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0677 given by (43). Then for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0678 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0679, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0680 and all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0681, we have

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0682

Corollary 7.Let Assumptions 1, 6, 8, and 9 hold and let urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0683 be the fixed complexity parameter set defined by (10), (11) with Remark 5 and (43). Then for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0684 such that urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0685 for some urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0686 and any urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0687, we have, for all urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0688,

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0689

Corollary 8.Under Assumptions 1, 6, 8, 9 the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0690 defined in Corollary 6 or 7 converges to urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0691 with probability 1.

Remark 9.If measurement noise is present, modifications to the proposed algorithm in Section 3.5 are needed. If no additional output constraints are present, then, given the noisy measurement yt, the constraint (14) should be replaced by

urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0692
in order to ensure robust satisfaction of input and state constraints. In addition, the unfalsified parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0693 is in this case given by (43).

6 NUMERICAL EXAMPLES

This section presents simulations to illustrate the operation of the proposed adaptive robust MPC scheme. The section consists of two parts. The first part investigates the effect of additional weight urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0694 in optimization problem urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0695 by using the example of a second-order system from Reference 30. The second part demonstrates the relationship between the speed of parameter convergence and minimal eigenvalue urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0696 from the PE condition.

6.1 Objective function with weighted PE condition

Consider the second-order discrete-time uncertain linear system from Reference 30, with model parameters
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0697
and with true system parameter urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0698. The initial parameter set estimate is urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0699, and for all t ≥ 0, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0700 is a hyperrectangle, with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0701 . The elements of the disturbance sequence {w0,w1, …} are independent and identically (uniformly) distributed on urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0702. The state and input constraints are [xt]2 ≥ −0.3 and ut ≤ 1. The MPC prediction horizon and PE window length are set to be N = 10 and Nu = 2, respectively. The matrix T is chosen according to Remark 2 and has 9 rows.

All simulations were performed in Matlab on a 3.4 GHz Intel Core i7 processor, and the online MPC optimization urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0703 was solved using Mosek.31 For purposes of comparison, the same parameter set update law and nominal parameter update law were used in all cases. Robust satisfaction of input and state constraints and recursive feasibility were observed in all simulations, in agreement with Proposition 1. To illustrate satisfaction of the state constraint [x]2 ≥ −0.3, Figure 1 shows the cross-sections of the robust state tube predicted at t = 0, with initial condition x0 = (3,6), together with the closed-loop state trajectories for 10 different initial conditions.

Details are in the caption following the image
Closed-loop trajectories (solid lines) for different initial conditions, and predicted state tube cross-sections urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0704 at t = 0 for initial condition x0 = (3,2), with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0705 shown in red and enclosed by dashed line

Table 1 compares the computational time and parameter sizes of the proposed algorithm and existing algorithms when the same initial conditions and disturbance sequences {w0,w1, … } are used. Algorithm (A) refers to the robust adaptive MPC in Section 3.4 of Lorenzen et al.10 Algorithm (B) is a modification of algorithm (A) that incorporates the PE constraint urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0706, which is implemented as described in Lu and Cannon15 with a fixed urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0707 value: urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0708. Algorithms (C) and (D) are the algorithm proposed in Section 3.5, with and without the PE condition, respectively.

TABLE 1. Performance comparison of robust MPC algorithms, with and without PE conditions
(A) (B) (C) (D)
Algorithm Homothetic tube (no PE)10a Homothetic tube with PE10 Proposed algorithm (no PE)a Proposed Algorithm with PE (urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0709)
Yalmip time (s) 0.1825 0.7407 0.2053 0.2608
Solver time (s) 0.1354 0.2065 0.1016 0.0766
Computational time (s) (Yalmip + solver time) 0.3179 0.9472 0.3069 0.3374
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0710 set size (%) 18.26 18.51 18.26 16.56
  • aFor algorithms without PE constraint, a QP solver, Gurobi,32 is used instead of Mosek.
  • Abbreviations: MPC, model predictive control; PE, persistently exciting.

Consider first algorithms (A) and (C) in Table 1. Although (C) uses a more flexible tube representation, there is negligible difference in overall computational time relative to (A). This is due to the use of a more efficient method of enforcing constraints on predicted tubes in (A) than (C), which introduces additional optimization variables to enforce these constraints. The more flexible tube representation employed in (C) provides a larger terminal set, as shown in Figure 2. Moreover, the formulation of (C) incorporates information on urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0711 in the constraints, and as a result, the terminal set increases in size over time as the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0712 shrinks. On the other hand, the homothetic tube MPC employed in (A) employs a terminal set that is computed offline and is not updated online.

Details are in the caption following the image
Terminal sets for Algorithm (C) at times t = 0,1,100,1000 and the terminal set for (A) (which is computed offline and not updated online). The terminal sets for (C) are shown in green with solid boundaries, and are nested and increasing over time. The terminal set for (A) is shown in blue with dashed line boundary

Comparing algorithms (B) and (D) in Table 1, it can be seen that implementing the PE condition using an augmented cost function and linearized constraint (as in (D)) results in lower computation and faster parameter convergence than using a PE constraint with a fixed value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0713 (as in (B)). Tuning the value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0714 in algorithm (B) is challenging, since a value that is too small results in slow convergence whereas choosing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0715 too large frequently causes the PE constraint to be infeasible. Moreover, whenever the PE constraint is infeasible in algorithm (B), the MPC optimization is solved a second time without the PE constraint, thus increasing computation.

Figure 3 shows the effect of the weighting coefficient urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0716 in the objective function (29) on the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0717 when the same initial conditions (x0 = [3, 4], urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0718, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0719) and disturbance sequences {w0,w1, …} are used. Larger values of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0720 place greater weighting on urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0721 in the MPC cost (29), and thus on satisfaction of the PE condition (27). Therefore, increasing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0722 results in a faster convergence rate in the parameter set volume. When the same weighting coefficient urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0723 is used, performing the parameter set update periodically (as discussed in Remark 5) slows down the convergence rate of the parameter set, as shown by the green line. For this simulation, the parameter set update (online step 2 in Section 3.5) takes only 2% of the total computational time.

Details are in the caption following the image
Volume of parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0724 over time for a range of weights urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0725 in the MPC objective function (29)

The relationship between weighting coefficient urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0726 and volume of parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0727 is shown in Figure 4. For values of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0728 between 10−3 and 103, closed-loop simulations were performed with the same initial conditions, disturbance sequences, and initial nominal model and parameter set. The parameter set volume after 20 time-steps is shown. Figure 4 also shows that increasing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0729 results in a faster parameter set convergence rate, in agreement with Figure 3.

Details are in the caption following the image
Volume of parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0730 against weighting urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0731 in the MPC objective function (29)

For the same set of simulations, Figure 5 shows the optimal value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0732 in (28) and (27) against urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0733. From (29), it is expected that a larger urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0734 value will increase the influence of the term urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0735, thus pushing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0736 to be more positive. The left-hand figure shows the value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0737 in the convexified constraint (28). As expected, the increase in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0738 leads to a smooth increase in urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0739 initially, but after a certain point, any further increase in the weighting factor urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0740 does not affect the calculated urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0741 value. The right-hand figure shows the value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0742 in the PE condition (27). The difference between urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0743 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0744 illustrates the conservativeness of the convexification proposed in Section 3.4. Note that this can be reduced by repeating steps (3) and (4) in the online part of the proposed algorithm, thus iteratively recomputing the reference sequences urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0745, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0746 and reducing the conservativeness of linearization to any desired level. It is interesting to note that, although the optimal value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0747 in (28) levels off at urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0748, the value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0749 in the PE condition (27) increases monotonically between urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0750 and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0751. The smaller urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0752 values observed with (27) also explain the lower rates of parameter convergence for small values of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0753 in Figure 4. In practice, it can be used as a guideline for the tuning of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0754.

Details are in the caption following the image
Degree of satisfaction of conditions for persistency of excitation as a function of the weighting urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0755 in the MPC objective function (29). Left: optimal value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0756 in the constraint (28). Right: computed value of the PE coefficient urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0757 in (27)

Table 2 illustrates the convergence of the estimated parameter set over a large number of time steps for the initial condition x0 = [2, 3] and a randomly generated disturbance sequence {w0,w1, … }. Here, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0758 was chosen as 103 to speed up the convergence process. In agreement with Theorem 3, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0759 has shrunk to a small region around the true parameter value at t = 5000.

TABLE 2. Asymptotic convergence of the estimated parameter set
Time step (t) 0 1 5 50 100 500 1000 5000
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0760 set size (%) 100 30.21 18.50 14.46 12.75 11.11 1.51 0.25

6.2 Relationship between PE coefficient and convergence rate

We next consider third-order discrete-time linear systems given by (1) with urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0761, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0762, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0763 and
urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0764
The system matrices urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0765 satisfy (4) with randomly generated Ai, Bi, urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0766 parameters and initial parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0767. In each case the estimated parameter sets urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0768 have fixed complexity, with face normals aligned with the coordinate axes in parameter space. A linear feedback law is applied, ut = Kxt, where K is a stabilizing gain. We use these systems to investigate the relationship between the coefficient urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0769 in the PE condition (27) and rate of convergence of the estimated parameter set.

Taking the window length in (27) to be Nu = 10, closed-loop trajectories were computed for 10 time steps and the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0770 was updated according to (11). Simulations were performed for 500 different initial conditions, and the average value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0771 was computed for each initial condition using 100 random disturbance sequences {w0,w1, …}. Figure 6 illustrates the relationship between the average size of the identified parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0772 and the average value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0773 in the PE condition (27). Clearly, increasing urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0774 results in a smaller parameter set on average, and hence a faster rate of convergence of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0775, which is consistent with the analysis of Section 5.2. The inner and outer radii shown in the figure on the left are the radii of the smallest and largest spheres, respectively, that contain and are contained within the parameter set estimate after 10 time steps. A similar trend can also be seen between the average volume of the parameter set urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0776 and the ensemble average value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0777.

Details are in the caption following the image
Average size of parameter set after 10 time steps against average value of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0778 in the PE condition (27) with Nu = 10. The parameter set size and urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0779 were computed for 500 different initial conditions. Left: mean side length and inner and outer radii of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0780. Right: volume of urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0781

7 CONCLUSIONS

In this article, we propose an adaptive robust MPC algorithm that combines robust tube MPC and set membership identification. The MPC formulation employs a nominal performance index and guarantees robust constraint satisfaction, recursive feasibility and ISS. A convexified persistent excitation condition is included in the MPC objective via a weighting coefficient, and the relationship between this weight and the convergence rate of the estimated parameter set is investigated. For computational tractability, a fixed complexity polytope is used to approximate the estimated parameter set. The article proves that the parameter set will converge to the vector of system parameters with probability 1 despite this approximation. Conditions for convergence of the estimated parameter set are derived for the case of inexact disturbance bounds and noisy measurements. Future work will consider systems with stochastic model parameters and probabilistic constraints. Quantitative relationships between the convergence rate of the estimated parameter set and conditions for PE will be investigated further and methods of enforcing PE of the closed-loop system will be considered.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.