For systems with uncertain linear models, bounded additive disturbances and state and control constraints, a robust model predictive control (MPC) algorithm incorporating online model adaptation is proposed. Sets of model parameters are identified online and employed in a robust tube MPC strategy with a nominal cost. The algorithm is shown to be recursively feasible and input-to-state stable. Computational tractability is ensured by using polytopic sets of fixed complexity to bound parameter sets and predicted states. Convex conditions for persistence of excitation are derived and are related to probabilistic rates of convergence and asymptotic bounds on parameter set estimates. We discuss how to balance conflicting requirements on control signals for achieving good tracking performance and parameter set estimate accuracy. Conditions for convergence of the estimated parameter set are discussed for the case of fixed complexity parameter set estimates, inexact disturbance bounds, and noisy measurements.

1 INTRODUCTION

Model predictive control (MPC) repeatedly solves a finite-horizon optimal control problem subject to input and state constraints. At each sampling instant a model of the plant is used to optimize predicted behavior and the first element of the optimal predicted control sequence is applied to the plant.¹ Any mismatch between model and plant causes degradation of controller performance.² As a result, the amount of model uncertainty strongly affects the bounds of the achievable performance of a robust MPC algorithm.³

To avoid the disruption caused by intrusive plant tests,² adaptive MPC attempts to improve model accuracy online while satisfying operating constraints and providing stability guarantees. Although the literature on adaptive control has long acknowledged the need for persistently exciting (PE) inputs for system identification,⁴ few articles have explored how to incorporate PE conditions with feasibility guarantees within adaptive MPC.⁵ In addition, adaptive MPC algorithms must balance conflicting requirements for system identification accuracy and computational complexity.^{5, 6}

Various methods for estimating system parameters and meeting operating constraints are described in the adaptive MPC literature. Depending on the assumptions on model parameters, parameter identification methods such as recursive least squares (RLS)⁷, comparison sets,⁸ set membership identification,^{9, 10} and neural networks^{11, 12} have been proposed. Heirung et al¹³ propose an algorithm where the unknown parameters are estimated using RLS and system outputs are predicted using the resulting parameter estimates. The use of RLS introduces nonlinear equality constraints into the optimization. On the other hand, the comparison model approach described in Aswani et al⁸ addresses the trade-off between probing for information and output regulation by decoupling these two tasks; a nominal model is used to impose operating constraints whereas performance is evaluated via a model learned online using statistical identification tools. However, the use of a nominal model implies that the comparison model approach cannot guarantee robust constraint satisfaction.

Tanaskovic et al⁹ consider a linear finite impulse response (FIR) model with measurement noise and constraints. This approach updates a model parameter set using online set membership identification; constraints are enforced for the entire parameter set and performance is optimized for a nominal prediction model. The article proves recursive feasibility but does not show convergence of the identified parameter set to the true parameters. To avoid the restriction to FIR models, Lorenzen et al¹⁰ consider a linear state space model with additive disturbance. An online-identified set of possible model parameters is used to robustly stabilize the system. However, the approach suffers from a lack of flexibility in its robust MPC formulation, which is based on homothetic tubes,¹⁴ allowing only the centers and scalings of tube cross-sections to be optimized online, and it does not provide convex and recursively feasible conditions to ensure PE control inputs.

In this article, we also consider linear systems with parameter uncertainty, additive disturbances, and constraints on system states and control inputs. Compared with Reference 10, the proposed algorithm reduces the conservativeness in approximating predicted state tubes by adopting more flexible cross-section representations. Building on Reference 15, we take advantage of fixed complexity polytopic tube representations and use hyperplane and vertex representations interchangeably to further simplify computation. We use, similarly to Reference 13, a nominal performance objective, but we impose constraints robustly on all possible models within the identified model set. We prove that the closed-loop system is input-to-state stable (ISS). In comparison with the min-max approach of Reference 15, the resulting performance bound takes the form of an asymptotic bound on the 2-norm of the sequence of closed-loop states in terms of the 2-norms of the additive disturbance and parameter estimate error sequences. In addition, we convexify the persistence of excitation (PE) condition around a reference trajectory and include a penalty term in the cost function to promote convergence of the parameter set. The convexification method is somewhat analogous to that proposed in References 16, 17, where the uncertainty information of the parameter set is approximated using a nominal gain. Here, however, the convexification is obtained by direct linearization of PE constraints on predicted trajectories. The cost function modification proposed here allows the relative importance of the two objectives, namely, controller performance and convergence of model parameters, to be specified.

Bai et al¹⁸ consider a particular set membership identification algorithm and show that the parameter set estimate converges with probability 1 to the actual parameter vector (assumed constant) if: (a) a tight bound on disturbances is known; (b) the input sequence is persistent exciting; and (c) the minimal parameter set estimate is employed. However, the minimal set estimate can be arbitrarily complex, and to provide computational tractability various nonminimal parameter set approximations have been proposed, such as n-dimensional balls¹⁹ and bounded complexity polytopes.⁹ The current article allows the use of parameter set estimates with fixed complexity and proves that, despite their approximate nature, such parameter sets converge with probability 1 to the true parameter values. We also derive lower bounds on convergence rates for the case of inexact knowledge of the disturbance bounding set and for the case that model states are estimated in the presence of measurement noise.

This article has five main parts. Section 2 defines the problem and basic assumptions. Section 3 gives details of the parameter estimation, robust constraint satisfaction, nominal cost function, convexified PE conditions, and the MPC algorithm. Section 4 proves recursive feasibility and ISS of the proposed algorithm. Section 5 proves the convergence of the parameter set in various conditions and Section 6 illustrates the approach with numerical examples.

Notation: $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0001$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0002$ denote the sets of integers and reals, and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0003$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0004$ . The ith row of a matrix A and ith element of a vector a are denoted [A]_i and [a]_i. Vectors and matrices of 1s are denoted 1, and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0005$ is the identity matrix. For a vector a, ‖a‖ is the Euclidean norm and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0006$ ; the largest element of a is $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0007$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0008$ . The absolute value of a scalar s is $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0009$ and the floor value is ⌊s⌋. $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0010$ is the number of elements in a set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0011$ . $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0012$ is Minkowski addition for sets $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0013$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0014$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0015$ . The matrix inequality A ≽ 0 (or A ≻ 0) indicates that A is positive semidefinite (positive definite) matrix. The k steps ahead predicted value of a variable x is denoted x_k, and the more complete notation x_k|t indicates the k steps ahead prediction at time t. A continuous function $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0016$ is a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0017$ -function if it is strictly increasing with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0018$ , and is a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0019$ -function if in addition $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0020$ as s → ∞. A continuous function $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0021$ is a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0022$ -function if, for all t ≥ 0, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0023$ is a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0024$ -function, and, for all s ≥ 0, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0025$ is decreasing with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0026$ as t → ∞. For functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0027$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0028$ we denote $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0029$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0030$ with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0031$ .

2 PROBLEM FORMULATION AND PRELIMINARIES

This article considers a linear system with linear state and input constraints and unknown additive disturbance:

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0032$ (1)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0033$ is the system state, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0034$ is the control input, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0035$ is an unknown disturbance input, and t is the discrete time index. The system matrices $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0036$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0037$ depend on an unknown but constant parameter $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0038$ . The disturbance sequence {w₀,w₁, …} is stochastic and (w_i,w_j) is independent for all i ≠ j. States and control inputs are subject to linear constraints, defined for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0039$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0040$ by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0041$ (2)

Assumption 1. (Additive disturbance)The disturbance w_t lies in a convex and compact polytope $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0042$ , where

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0043$ (3)

with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0044$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0045$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0046$ .

Assumption 2. (Parameter uncertainty)The system matrices A and B are affine functions of the parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0047$ :

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0048$ (4)

for known matrices A_j, B_j, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0049$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0050$ lies in a known, bounded, convex polytope $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0051$ given by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0052$

Assumption 3. (State and control constraints)The set

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0053$

is compact and contains the origin in its interior.

To obtain finite numbers of decision variables and constraints in the MPC optimization problem, the predicted control sequence at time t is assumed to be expressed in terms of optimization variables v_0|t, … , v_{N − 1|t} as

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0054$ (5)

where N is the prediction horizon. The gain K is designed offline and is assumed to robustly stabilize the uncertain system $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0055$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0056$ in the absence of constraints. This assumption can be stated as follows.

Assumption 4. (Feedback gain and contractive set)There exists a polytopic set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0057$ and feedback gain K such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0058$ is $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0059$ -contractive for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0060$ , that is

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0061$ (6)

for all x ∈ {x : Tx ≤ 1} and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0062$ . The representation $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0063$ is assumed to be minimal in the sense that it contains no redundant inequalities.

3 ADAPTIVE ROBUST MPC

In this section, a parameter estimation scheme based on References 20, 21 is introduced. We then discuss the construction of tubes to bound predicted model states and associated constraints.

3.1 Set-based parameter estimation

At time t we use observations of the system state x_t to determine a set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0064$ of unfalsified model parameters. The set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0065$ is then combined with the parameter set estimate $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0066$ to construct a new parameter set estimate $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0067$ .

Unfalsified parameter set: Define D_t and d_t as the matrix and vector

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0068$ (7)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0069$ (8)

Then, given x_t, x_t − 1, u_t − 1 and the disturbance set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0070$ in (3), the unfalsified parameter set at time t is given by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0071$ (9)

with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0072$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0073$ .

Parameter set update: Let $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0074$ be an a priori chosen matrix. The estimated parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0075$ is defined by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0076$ (10)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0077$ is updated online at times $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0078$ . The complexity of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0079$ is controlled by fixing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0080$ , which fixes the directions of the half-spaces defining the parameter set. We assume that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0081$ is chosen so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0082$ is compact for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0083$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0084$ . Using a block recursive polytopic update method,²⁰ $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0085$ is defined as the smallest set (10) containing the intersection of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0086$ and unfalsified sets $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0087$ over a window of length N_u:

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0088$ (11)

(where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0089$ for all j ≤ 0). We refer to N_u as the PE window. Note that N_u is independent of the MPC prediction horizon N. Using linear conditions for polyhedral set inclusion²² $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0090$ in (11) can be obtained by solving a linear program for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0091$ :

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0092$

Lemma 1.If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0093$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0094$ is defined by (10), (11), then $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0095$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0096$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0097$ .

3.2 Polytopic tubes for robust constraint satisfaction

This section considers predicted state and control trajectories. To simplify notation, we omit the subscript t indicating the time at which state and control predictions are made, whenever t indicates current time; thus the k steps ahead predictions x_k|t, v_k|t are denoted x_k, v_k. To ensure that the predicted state and control sequences satisfy the operating constraints (2) robustly for the given uncertainty bounds, we construct a tube (a sequence of sets) $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0098$ satisfying, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0099$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0100$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0101$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0102$ (12)

Hyperplane form: For given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0103$ satisfying Assumption 4 and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0104$ , let $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0105$ denote the k steps ahead cross-section of the predicted state tube:

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0106$ (13)

The MPC algorithm described in Section 3.5 optimizes the shape of the predicted state tube online by allowing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0107$ to be an optimization variable. If, for a given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0108$ , the constraint $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0109$ is redundant for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0110$ in the hyperplane description (13) (ie, if the set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0111$ is unchanged by removing this constraint), we define (without loss of generality) $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0112$ . Thus, for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0113$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0114$ necessarily holds for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0115$ . Then (12) is equivalent to, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0116$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0117$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0118$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0119$ is the vector with ith element $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0120$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0121$ . Substituting D(x,u) and d(x,u) from (7), (8), this implies linear conditions on $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0122$ , for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0123$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0124$ :

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0125$

and, for a given initial state x, the constraint $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0126$ requires

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0127$ (14)

Vertex form: $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0128$ has an equivalent representation in terms of its vertices, which we denote as $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0129$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0130$ :

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0131$ (15)

Note that m, the number of vertices of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0132$ , is fixed, and for given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0133$ in (13), we may have $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0134$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0135$ in (15) (ie, the vertex description may contain repeated vertices). The presence of repeated vertices does not affect the formulation. For each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0136$ , define an index set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0137$ (with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0138$ ), such that the ith row of T and ith element of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0139$ satisfy

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0140$

Since T is constant, the index set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0141$ associated with active inequalities at the vertex x^(j) is independent of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0142$ and can be computed offline. Therefore, for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0143$ , we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0144$ (16)

where the matrix $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0145$ can be computed offline given knowledge of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0146$ using the property that

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0147$ (17)

Using the vertex representation (16), the condition that (2) is satisfied for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0148$ is equivalent to, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0149$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0150$ (18)

Substituting D(x,u) and d(x,u) from (7), (8), condition (12) can be expressed equivalently as

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0151$

for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0152$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0153$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0154$ . This is equivalent (see e.g. Reference 22, Proposition 3.31) to the requirement that there exist matrices $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0155$ satisfying, for each prediction time step $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0156$ and each vertex $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0157$ , the conditions

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0158$ (19a)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0159$ (19b)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0160$ (19c)

Given the dual mode predicted control law (5), we introduce the terminal conditions that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0161$ and (F + GK)x ≤ 1 for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0162$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0163$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0164$ . Then (2) and (12) are satisfied if (18), (19) hold for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0165$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0166$ , and there exist matrices $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0167$ satisfying the conditions, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0168$

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0169$ (20a)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0170$ (20b)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0171$ (20c)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0172$ (20d)

3.3 Objective function

Consider the nominal cost defined for Q, R ≻ 0 by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0173$ (21)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0174$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0175$ are elements of predicted state and control sequences generated by a nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0176$ :

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0177$ (22a)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0178$ (22b)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0179$ (22c)

for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0180$ and where v = {v₀, … , v_N − 1}. Define $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0181$ as the solution of the Lyapunov matrix equation

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0182$ (23)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0183$ . Note that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0184$ is well-defined for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0185$ due to Assumption 4. Then (21) is equivalent to

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0186$ (24)

We assume knowledge of an initial nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0187$ , which could be estimated using physical modeling or offline system identification, alternatively $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0188$ could be defined as the Chebyshev center of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0189$ . For t > 0, we assume that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0190$ is updated by projecting $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0191$ onto the parameter set estimate $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0192$ , that is

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0193$ (25)

Remark 1.For the ISS analysis in Section 4 it is essential that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0194$ . However, subject to this constraint, alternative update laws for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0195$ are possible; for example, a least mean squares estimate projected onto $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0196$ .¹⁰

3.4 Augmented objective function and persistent excitation

The regressor D_t in (7) is PE if

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0197$ (26)

for some PE window $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0198$ , some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0199$ , and all times t₀.⁴ Although the upper bound in (26) implies convex constraints on x_t and u_t, the lower bound is nonconvex. The bounds on convergence of the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0200$ derived in Section 5 suggest faster convergence as $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0201$ in the PE condition (26) increases.

Previously proposed MPC strategies that incorporate PE constraints consider the PE condition to be defined on an interval such as {t − N_u + 1, … , t}, where t is current time, which means that the PE constraint depends on only the first element of the predicted control sequence. Marafioti et al²³ simplify the PE condition by expressing it as a nonconvex quadratic inequality in u_t. Likewise, Lorenzen et al¹⁰ show that the PE condition is equivalent to a nonconvex constraint on the current control input. Lu and Cannon¹⁵ linearize the PE condition about a reference trajectory and thus obtain a sufficient condition for persistent excitation.

In this article, on the hand, we define the PE condition over predicted trajectories (from k = 0 to k = N_u − 1 steps ahead), and we therefore require, at time t and for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0202$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0203$ (27)

The inclusion of predicted future states and control inputs in this PE condition allows for greater flexibility in meeting the constraint. To avoid nonconvex constraints, we derive a convex relaxation that provides a sufficient condition for (27).

Assume reference state and control predicted sequences, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0204$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0205$ , approximating the optimal predicted state and control sequences, are available. To derive sufficient conditions for (27), we consider the difference between the reference and optimized sequences, denoted $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0206$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0207$ (ie, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0208$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0209$ ). Since the prediction tube implies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0210$ , we therefore have $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0211$ where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0212$ . Denote the vertices of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0213$ as $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0214$ and for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0215$ , we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0216$

Moreover, D_k depends linearly on u_k and x_k, and hence for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0217$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0218$

Here, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0219$ is a positive semidefinite matrix, and by omitting this term we obtain sufficient conditions for (27) as a set of LMIs in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0220$ and v_k. The following convex conditions are thus sufficient to ensure (27) whenever $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0221$

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0222$ (28a)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0223$ (28b)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0224$ are intermediate variables.

Another innovation of this article is the inclusion of PE coefficient in the cost function. Previous approaches^{10, 15} face the difficulty of choosing a suitable $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0225$ value for the PE constraint in the implementation. A larger value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0226$ is generally desirable, but a large $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0227$ might make the optimization problem with the PE condition infeasible. In this article, we incentivize a large value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0228$ by modifying the MPC objective function as follows

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0229$ (29)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0230$ is a weight that controls the relative priority given to satisfaction of the PE condition (27) and tracking performance. This modification does not affect the feasibility of the optimization.

Although incorporating a condition such as (27) or the convex relaxation (28) and (29) into a MPC strategy does not ensure that the closed-loop system satisfies a corresponding PE condition, its effect on the convergence rate of the estimated parameter set is significant, as shown in the numerical example in Section 6. In addition, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0231$ is a meaningful and straightforward coefficient to tune, and the PE constraint can be easily switched off by setting $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0232$ .

3.5 Proposed algorithm

Offline:

1. Choose suitable T defining the predicted state tube and compute the corresponding U_j in (17).
2. Obtain a nominal $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0233$ .
3. Minimize the contracitivity factor $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0234$ satisfying (6) and obtain a feedback gain K.

Online: For t = 0,1,2,…

1. Obtain the current state x_t and set x = x_t.
2. Update $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0235$ using (11) and the nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0236$ using (25), and solve (23) for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0237$ .
3.
At t = 0 compute the initial reference state and control sequences $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0238$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0239$ , for example, by solving the nominal problem described in Remark 4.

At t > 0 compute the reference state and control sequences $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0240$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0241$ , using the solution $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0242$ at t − 1, and
$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0243$ (30)
4. Compute $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0244$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0245$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0246$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0247$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0248$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0249$ the solution of the semidefinite program (SDP):
$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0250$
5. Implement the current control input $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0251$ .

Remark 2.In offline step 1, T can be chosen so that the polytope $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0252$ in Assumption 4 approximates a robust control invariant set of the form $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0253$ , where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0254$ satisfies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0255$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0256$ . Using the vertex representation of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0257$ , the matrix $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0258$ can be computed by solving a SDP (see e.g. Reference 24, Chapter 5). This approach allows the number of rows in T to be specified by the designer. However, T can alternatively be chosen so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0259$ approximates the minimal robust positive invariant (RPI) set or the maximal RPI set for the system (1) and (2) under a specified stabilizing feedback law.²²

Remark 3.In offline step 3, the computation of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0260$ subject to (6) for given T can be performed by solving a LP using the vertex representation of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0261$ [ 22, chap. 7]. The objective of minimizing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0262$ is chosen to make the constraints of problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0263$ easier to satisfy. In particular, choosing K so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0264$ in (6) ensures that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0265$ exists satisfying the terminal constraints (20b-d).

Remark 4.At t = 0, the reference sequences $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0266$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0267$ may be computed by solving

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0268$ (31)

Remark 5.The online computation of the proposed algorithm may be reduced by updating $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0269$ only once every N_u > 1 time steps. For example, in Step 2, set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0270$ for t ∈ {N_u,2N_u, …} and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0271$ at all times t ∉ {N_u,2N_u, …}.

In Section 4, we use the property that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0272$ to show that the solution, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0273$ , of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0274$ at time t − 1 forms part of a feasible solution of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0275$ at time t. As a result, the reference sequences $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0276$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0277$ in Step 3 are feasible for problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0278$ at all times t > 0.

4 RECURSIVE FEASIBILITY AND STABILITY

4.1 Recursive feasibility

At time t ≥ 1, let a suboptimal set of decision variables, denoted $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0279$ be defined in terms of the optimal solution $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0280$ of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0281$ at time t − 1 by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0282$

Proposition 1. (Recursive Feasibility)The online MPC optimization $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0283$ is feasible at all times $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0284$ if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0285$ is feasible at t = 0 and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0286$ for all time t.

Proof.If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0287$ is feasible at t − 1, then at time t, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0288$ is: feasible for (14) because $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0289$ ; feasible for (18) and (19) for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0290$ because $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0291$ is feasible for (18) and (19) for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0292$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0293$ ; and feasible for (18) and (19) for k = N − 1 and feasible for (20) because $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0294$ is feasible for (20) and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0295$ . Finally, we note that (22) is necessarily feasible and (28) necessarily holds for some scalar $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0296$ if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0297$ .

4.2 Input-to-state stability

Throughout this section we set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0298$ in problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0299$ . Therefore, the objective of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0300$ is $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0301$ where J is the nominal cost (24). As a result of parameter adaption, the change of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0302$ online might increase the cost, but this is absorbed in the ISS terms. To simplify notation we define a stage cost L(x,v) and terminal cost $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0303$ as $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0304$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0305$ so that (24) is equivalent to

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0306$

Denoting the actual state at next time step as x⁺, we define the function $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0307$ as

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0308$

so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0309$ .

Lemma 2. (ISS-Lyapunov function [25])The system

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0310$ (32)

with control law $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0311$ is ISS with region of attraction $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0312$ if the following conditions are satisfied.

(i). $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0313$ contains the origin in its interior, is compact, and is a robust positively invariant set for (32), that is, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0314$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0315$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0316$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0317$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0318$ .

(ii). There exist $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0319$ - functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0320$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0321$ -functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0322$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0323$ and a function $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0324$ such that for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0325$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0326$ is continuous, and for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0327$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0328$ (33)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0329$ (34)

In the following, we define $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0330$ as the set of states x such that problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0331$ is feasible and assume that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0332$ is nonempty. In addition, for a given state x, nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0333$ , and parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0334$ , we denote $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0335$ as the optimal solution of problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0336$ , and let $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0337$ be the corresponding optimal value of the cost in (24), so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0338$ .

Theorem 1.Assume that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0339$ and the nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0340$ is not updated, that is, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0341$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0342$ . Then for all initial conditions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0343$ , the system (1) with control law $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0344$ , where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0345$ is the first element of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0346$ , robustly satisfies the constraint (2) and is ISS with region of attraction $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0347$ .

Proof.We first show that condition (i) of Lemma 2 is satisfied with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0348$ . If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0349$ is feasible at t = 0, then (20) implies that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0350$ exists such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0351$ satisfies

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0352$ (35)

Therefore $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0353$ is feasible for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0354$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0355$ , and hence $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0356$ . In addition, the robust invariance of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0357$ implied by (35) ensures that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0358$ , and since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0359$ due to Assumption 1, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0360$ must contain the origin in its interior. Furthermore, Proposition 1 shows that if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0361$ is initially feasible, then it is feasible for all t ≥ 0, so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0362$ is positively invariant for (32). Finally, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0363$ is necessarily compact by Assumption 3, and it follows that condition (i) of Lemma 2 is satisfied if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0364$ .

We next consider the bounds (33) in condition (ii) of Lemma 2. For a given state x, nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0365$ and parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0366$ , problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0367$ with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0368$ and Q, R ≻ 0 is a convex quadratic program. Therefore, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0369$ is a continuous positive definite, piecewise quadratic function of x²⁶ for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0370$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0371$ . Furthermore, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0372$ is compact due to Assumption 2 and it follows that there exist $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0373$ -functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0374$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0375$ such that (33) holds with

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0376$

To show that the bound (34) in condition (ii) of Lemma 2 holds, let $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0377$ denote the set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0378$ . Then, given the linear dependence of the system (1), the model parameterization (4), and the predicted control law (5) on the state x, disturbance w, and parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0379$ , and since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0380$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0381$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0382$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0383$ are compact sets by Assumptions 1,2, and 3, there exist $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0384$ functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0385$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0386$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0387$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0388$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0389$ such that, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0390$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0391$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0392$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0393$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0394$

Following the proof of Theorem 5 in Limon et al²⁵ and using the weak triangle inequality for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0395$ -functions,²⁷ we obtain

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0396$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0397$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0398$ , and both $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0399$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0400$ are $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0401$ -functions. Since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0402$ is a feasible but suboptimal solution of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0403$ at x⁺, and since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0404$ by assumption, the optimal cost function satisfies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0405$ and hence

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0406$

Thus, all conditions of Lemma 2 are satisfied.

Corollary 1.Assume that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0407$ and the nominal parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0408$ is updated at each time $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0409$ using (25). Then for all initial conditions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0410$ , the system (1) with control law $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0411$ , where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0412$ is the first element of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0413$ , robustly satisfies the constraint (2) and is ISS with region of attraction $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0414$ .

Proof.It can be shown that condition (i) of Lemma 2 and the bounds (33) in condition (ii) of Lemma 2 are satisfied with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0415$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0416$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0417$ -functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0418$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0419$ using the same argument as the proof of Theorem 1. To show that (34) is also satisfied and hence complete the proof we use an argument similar to the proof of Theorem 1. In particular, as before we define $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0420$ using the optimal solution of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0421$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0422$ , and

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0423$

However, here we define z = {z₀, … , z_N} as the sequence

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0424$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0425$ has $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0426$ for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0427$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0428$ . Then $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0429$ implies

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0430$ (36)

The update law (25) ensures that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0431$ since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0432$ . Hence, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0433$ , we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0434$

and it follows that, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0435$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0436$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0437$ . In addition we have, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0438$

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0439$

and, since Q ≻ 0, there exists a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0440$ function $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0441$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0442$ , while (23) implies

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0443$

Furthermore (23) is linear in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0444$ and the solution $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0445$ is unique for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0446$ (see, eg, Reference 28) since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0447$ is by assumption stable. Therefore, by the implicit function theorem, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0448$ is Lipschitz continuous and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0449$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0450$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0451$ , for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0452$ . Hence

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0453$

Collecting the bounds derived above on individual terms in the expression for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0454$ in (36), we obtain

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0455$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0456$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0457$ are $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0458$ -functions. But by optimality we have $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0459$ . Therefore (34) holds and hence all of the conditions of Lemma 2 are satisfied.

Remark 6.The ISS property implies that there exists a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0460$ -function $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0461$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0462$ -functions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0463$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0464$ such that for all feasible initial conditions $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0465$ , the closed-loop system trajectories satisfy, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0466$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0467$

Remark 7.Theorem 1 and Corollary 1 do not apply to the case that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0468$ . However, if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0469$ is replaced by a time-varying weight $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0470$ in the objective of problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0471$ , then ISS can be guaranteed by setting $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0472$ for all t ≥ t₀, for some finite horizon t₀.

5 CONVERGENCE OF THE ESTIMATED PARAMETER SET

In terms of D and d defined in (7) and (8), the system model $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0473$ can be rewritten as

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0474$ (37)

where x_t + 1, D(x_t,u_t), and d(x_t,u_t) are known at time t + 1. Thus, the system is linear with regressor D_t, uncertain parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0475$ , and additive disturbance $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0476$ .

Bai et al¹⁸ show that, for such a system, the diameter of the parameter set constructed using a set-membership identification method converges to zero with probability 1 if the uncertainty bound $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0477$ is tight and the regressor D_t is PE. We extend this result and prove convergence of the estimated parameter set in more general cases. Specifically, in this article we avoid the problem of computational intractability arising from a minimum volume update law of the form $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0478$ . Instead, we derive stochastic convergence results for parameter sets with fixed complexity and update laws of the form $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0479$ .

In this section, we first discuss relevant results for an update law that gives a minimal parameter set estimate for a given sequence of states (but whose representation has potentially unbounded complexity), before considering convergence of the fixed-complexity parameter set update law of Section 3.1. We then compute bounds on the parameter set diameter if the bounding set for the additive disturbances is overestimated. Finally, we demonstrate that similar results can be achieved when errors are present in the observed state, as would be encountered, for example, if the system state were estimated from noisy measurements. In each case we relate the PE condition to the rate of parameter set convergence. We also prove that the parameter set converges to a point (or minimal uncertainty set) with probability one.

In common with Bai et al,^{18, 29} we do not assume a specific distribution for the disturbance input. However, the set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0480$ bounding the model disturbance is assumed to be tight in the sense that there is nonzero probability of a realization w_t lying arbitrarily close to any given point on the boundary, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0481$ , of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0482$ .

Assumption 5. (Tight disturbance bounds)For all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0483$ and any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0484$ the disturbance sequence {w₀,w₁, …} satisfies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0485$ , for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0486$ , where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0487$ whenever $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0488$ .

Assumption 6. (Persistent Excitation)There exist positive scalars $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0489$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0490$ and an integer N_u ≥ ⌈p/n_x⌉ such that, for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0491$ we have $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0492$ and

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0493$

We further assume throughout this section that the rows of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0494$ are normalized so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0495$ for all i.

5.1 Minimal parameter set

The unfalsified parameter set at time t defined in (9) can be expressed as

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0496$ (38)

where w_t is the disturbance realization at time t and D_t = D(x_t,u_t). Let w⁰ be an arbitrary point on the boundary $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0497$ , then the normal cone $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0498$ to $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0499$ at w⁰ is defined

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0500$ (39)

Proposition 2.For all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0501$ , all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0502$ , and for any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0503$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0504$ , under Assumptions 1, 5, and 6 we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0505$

for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0506$ .

Proof.Assumption 1 implies that there exists $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0507$ so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0508$ for any given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0509$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0510$ Therefore, if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0511$ satisfies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0512$ , then the definition (39) of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0513$ implies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0514$ , and hence $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0515$ from (38). But

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0516$

Therefore, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0517$ whenever $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0518$ . Furthermore, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0519$ Assumption 6 implies

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0520$

Hence, if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0521$ , then there must exist some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0522$ such that

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0523$

If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0524$ , then it follows that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0525$ and thus $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0526$ . Assumption 5 implies the probability of this event is at least $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0527$ .

Theorem 2.If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0528$ and Assumptions 1, 5, and 6 hold, then for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0529$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0530$ , for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0531$ and any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0532$ , we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0533$

Proof.For the nontrivial case of t ≥ N_u we have $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0534$ since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0535$ . Also $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0536$ and Proposition 2 implies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0537$ if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0538$ . Therefore,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0539$

and the result follows by applying this inequality ⌊t/N_u⌋ times.

Corollary 2.Under Assumptions 1, 5, and 6, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0540$ converges to $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0541$ with probability 1.

Proof.For any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0542$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0543$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0544$ , Theorem 2 implies that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0545$ is necessarily finite, and since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0546$ requires that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0547$ , the Borel-Cantelli lemma therefore implies that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0548$ . It follows that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0549$ as t → ∞ with probability 1 since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0550$ is arbitrary.

5.2 Fixed complexity parameter set

In order to reduce computational load and ensure numerical tractability, we assume that the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0551$ is defined by a fixed complexity polytope, as in (10) and (11). This section shows that, although a degree of conservativeness is introduced by fixing the complexity of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0552$ , asymptotic convergence of this set to the true parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0553$ still holds with probability 1.

Theorem 3.If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0554$ is updated according to (10), (11) and Remark 5, and Assumptions 1, 5, and 6 hold, then for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0555$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0556$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0557$ , we have, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0558$ and any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0559$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0560$

Proof.For the nontrivial case of t ≥ N_u we have $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0561$ since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0562$ by Lemma 1. Consider therefore the probability that any given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0563$ satisfying $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0564$ lies in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0565$ . Define vectors g_j for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0566$ by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0567$ (40)

Assumption 1 implies that, for any given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0568$ , there exists a $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0569$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0570$ . Accordingly, choose $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0571$ so that g_j in (40) satisfies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0572$ for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0573$ . Then

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0574$ (41)

is a necessary condition for $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0575$ due to (38) and (39). But (40) and Assumption 6 imply

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0576$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0577$ by assumption, and it follows from (41) that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0578$ if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0579$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0580$ . From Assumption 5 and the independence of the sequence {w₀,w₁, …} we therefore conclude that

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0581$

and the result follows by applying this inequality ⌊t/N_u⌋ times.

Corollary 3.Under Assumptions 1, 5, and 6, the fixed complexity parameter set estimate $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0582$ converges to $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0583$ with probability 1.

Proof.By applying the Borel-Cantelli lemma to Theorem 3 it can be shown (analogously to the proof of Corollary 2) that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0584$ if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0585$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0586$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0587$ . Since $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0588$ is assumed to be chosen so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0589$ is compact for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0590$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0591$ is nonempty, it follows that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0592$ as t → ∞ with probability 1.

5.3 Inexact disturbance bounds

We next consider the case in which the set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0593$ bounding w_t in Assumption 1 does not satisfy Assumption 5. Instead, we assume that a compact set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0594$ providing a tight bound on w_t exists but is either unknown or nonpolytopic or nonconvex. We define the unit ball $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0595$ and use a scalar $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0596$ to characterize the accuracy to which $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0597$ approximates $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0598$ .

Assumption 7. (Inexact disturbance bounds) $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0599$ is a compact set such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0600$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0601$ , and, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0602$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0603$ , the disturbance sequence {w₀,w₁, …} satisfies, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0604$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0605$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0606$ , where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0607$ whenever $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0608$ .

Remark 8.Assumption 7 implies that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0609$ . As a result, every point in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0610$ can be a distance no greater than $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0611$ from a point in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0612$ , that is $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0613$ .

Theorem 4.If $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0614$ and Assumptions 1, 6, and 7 hold, then for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0615$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0616$ , for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0617$ and any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0618$ , we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0619$

Corollary 4.Under Assumptions 1, 6, and 7, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0620$ converges to $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0621$ with probability 1.

Theorem 5.Let Assumptions 1, 6, and 7 hold and let $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0622$ be the fixed complexity parameter set defined by (10), (11) with Remark 5. Then, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0623$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0624$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0625$ and any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0626$ , we have, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0627$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0628$

Proof.A bound on the probability that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0629$ satisfying $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0630$ lies in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0631$ can be found using the same argument as the proof of Theorem 3. Thus, choose $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0632$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0633$ so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0634$ , where

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0635$

and pick $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0636$ so that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0637$ for each $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0638$ . Then, from $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0639$ and Assumptions 6 and 7 we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0640$

Therefore, if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0641$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0642$ , then $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0643$ which implies $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0644$ . From Assumption 7 and the independence of the sequence {w₀,w₁, …} we therefore conclude that

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0645$

and the result follows by applying this inequality ⌊t/N_u⌋ times.

Corollary 5.Under Assumptions 1, 6, and 7, the fixed complexity parameter set defined by (10), (11) and Remark 5 converges with probability 1 to a subset of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0646$ .

5.4 System with measurement noise

Consider the system model with an unknown parameter vector $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0647$ and measurement noise s_t:

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0648$ (42a)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0649$ (42b)

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0650$ is a measurement (or state estimate) and the noise sequence {s₀,s₁, …} has independent elements satisfying $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0651$ for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0652$ .

Assumption 8. (Measurement noise bounds) $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0653$ is a compact convex polytope with vertex representation $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0654$ .

Due to the measurement noise, the unfalsified parameter set must be constructed at each time $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0655$ using the available measurements y_t, y_t − 1, the known control input u_t − 1, and sets $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0656$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0657$ bounding the disturbance and the measurement noise. To be consistent with (42), $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0658$ must clearly lie in the set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0659$ , and the smallest unfalsified parameter set based on this information is given by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0660$ (43a)

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0661$ (43b)

Thus, Assumption 8 implies that the unfalsified set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0662$ is a convex polytope and the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0663$ can be estimated using, for example, the update law (10), (11) if $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0664$ is known.

Assumption 9. (Tight measurement noise and disturbance bounds)For all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0665$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0666$ , and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0667$ we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0668$

where $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0669$ whenever $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0670$ .

Given Assumptions 8 and 9, the results of Sections 5.1 and 5.2 apply with minor modifications. Define $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0671$ , then Assumption 9 implies

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0672$

for any given $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0673$ with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0674$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0675$ . This implies the following straightforward extensions of Theorems 2 and 3 and Corollaries 2 and 3.

Corollary 6.Let Assumptions 1, 6, 8, and 9 hold and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0676$ , with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0677$ given by (43). Then for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0678$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0679$ , for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0680$ and all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0681$ , we have

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0682$

Corollary 7.Let Assumptions 1, 6, 8, and 9 hold and let $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0683$ be the fixed complexity parameter set defined by (10), (11) with Remark 5 and (43). Then for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0684$ such that $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0685$ for some $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0686$ and any $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0687$ , we have, for all $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0688$ ,

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0689$

Corollary 8.Under Assumptions 1, 6, 8, 9 the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0690$ defined in Corollary 6 or 7 converges to $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0691$ with probability 1.

Remark 9.If measurement noise is present, modifications to the proposed algorithm in Section 3.5 are needed. If no additional output constraints are present, then, given the noisy measurement y_t, the constraint (14) should be replaced by

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0692$

in order to ensure robust satisfaction of input and state constraints. In addition, the unfalsified parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0693$ is in this case given by (43).

6 NUMERICAL EXAMPLES

This section presents simulations to illustrate the operation of the proposed adaptive robust MPC scheme. The section consists of two parts. The first part investigates the effect of additional weight $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0694$ in optimization problem $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0695$ by using the example of a second-order system from Reference 30. The second part demonstrates the relationship between the speed of parameter convergence and minimal eigenvalue $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0696$ from the PE condition.

6.1 Objective function with weighted PE condition

Consider the second-order discrete-time uncertain linear system from Reference 30, with model parameters

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0697$

and with true system parameter $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0698$ . The initial parameter set estimate is $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0699$ , and for all t ≥ 0, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0700$ is a hyperrectangle, with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0701$ . The elements of the disturbance sequence {w₀,w₁, …} are independent and identically (uniformly) distributed on $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0702$ . The state and input constraints are [x_t]₂ ≥ −0.3 and u_t ≤ 1. The MPC prediction horizon and PE window length are set to be N = 10 and N_u = 2, respectively. The matrix T is chosen according to Remark 2 and has 9 rows.

All simulations were performed in Matlab on a 3.4 GHz Intel Core i7 processor, and the online MPC optimization $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0703$ was solved using Mosek.³¹ For purposes of comparison, the same parameter set update law and nominal parameter update law were used in all cases. Robust satisfaction of input and state constraints and recursive feasibility were observed in all simulations, in agreement with Proposition 1. To illustrate satisfaction of the state constraint [x]₂ ≥ −0.3, Figure 1 shows the cross-sections of the robust state tube predicted at t = 0, with initial condition x₀ = (3,6), together with the closed-loop state trajectories for 10 different initial conditions.

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

Closed-loop trajectories (solid lines) for different initial conditions, and predicted state tube cross-sections $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0704$ at t = 0 for initial condition x₀ = (3,2), with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0705$ shown in red and enclosed by dashed line

**FIGURE 1**
Open in figure viewer PowerPoint

Closed-loop trajectories (solid lines) for different initial conditions, and predicted state tube cross-sections $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0704$ at t = 0 for initial condition x₀ = (3,2), with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0705$ shown in red and enclosed by dashed line

Table 1 compares the computational time and parameter sizes of the proposed algorithm and existing algorithms when the same initial conditions and disturbance sequences {w₀,w₁, … } are used. Algorithm (A) refers to the robust adaptive MPC in Section 3.4 of Lorenzen et al.¹⁰ Algorithm (B) is a modification of algorithm (A) that incorporates the PE constraint $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0706$ , which is implemented as described in Lu and Cannon¹⁵ with a fixed $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0707$ value: $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0708$ . Algorithms (C) and (D) are the algorithm proposed in Section 3.5, with and without the PE condition, respectively.

TABLE 1. Performance comparison of robust MPC algorithms, with and without PE conditions

Algorithm	Homothetic tube (no PE)¹⁰^a	Homothetic tube with PE¹⁰	Proposed algorithm (no PE)^a	Proposed Algorithm with PE ( $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0709$ )
	(A)	(B)	(C)	(D)
Yalmip time (s)	0.1825	0.7407	0.2053	0.2608
Solver time (s)	0.1354	0.2065	0.1016	0.0766
Computational time (s) (Yalmip + solver time)	0.3179	0.9472	0.3069	0.3374
$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0710$ set size (%)	18.26	18.51	18.26	16.56

^aFor algorithms without PE constraint, a QP solver, Gurobi,³² is used instead of Mosek.
Abbreviations: MPC, model predictive control; PE, persistently exciting.

Consider first algorithms (A) and (C) in Table 1. Although (C) uses a more flexible tube representation, there is negligible difference in overall computational time relative to (A). This is due to the use of a more efficient method of enforcing constraints on predicted tubes in (A) than (C), which introduces additional optimization variables to enforce these constraints. The more flexible tube representation employed in (C) provides a larger terminal set, as shown in Figure 2. Moreover, the formulation of (C) incorporates information on $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0711$ in the constraints, and as a result, the terminal set increases in size over time as the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0712$ shrinks. On the other hand, the homothetic tube MPC employed in (A) employs a terminal set that is computed offline and is not updated online.

Comparing algorithms (B) and (D) in Table 1, it can be seen that implementing the PE condition using an augmented cost function and linearized constraint (as in (D)) results in lower computation and faster parameter convergence than using a PE constraint with a fixed value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0713$ (as in (B)). Tuning the value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0714$ in algorithm (B) is challenging, since a value that is too small results in slow convergence whereas choosing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0715$ too large frequently causes the PE constraint to be infeasible. Moreover, whenever the PE constraint is infeasible in algorithm (B), the MPC optimization is solved a second time without the PE constraint, thus increasing computation.

Figure 3 shows the effect of the weighting coefficient $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0716$ in the objective function (29) on the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0717$ when the same initial conditions (x₀ = [3, 4]^⊤, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0718$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0719$ ) and disturbance sequences {w₀,w₁, …} are used. Larger values of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0720$ place greater weighting on $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0721$ in the MPC cost (29), and thus on satisfaction of the PE condition (27). Therefore, increasing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0722$ results in a faster convergence rate in the parameter set volume. When the same weighting coefficient $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0723$ is used, performing the parameter set update periodically (as discussed in Remark 5) slows down the convergence rate of the parameter set, as shown by the green line. For this simulation, the parameter set update (online step 2 in Section 3.5) takes only 2% of the total computational time.

**FIGURE 3**
Open in figure viewer PowerPoint

Volume of parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0724$ over time for a range of weights $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0725$ in the MPC objective function (29)

The relationship between weighting coefficient $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0726$ and volume of parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0727$ is shown in Figure 4. For values of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0728$ between 10⁻³ and 10³, closed-loop simulations were performed with the same initial conditions, disturbance sequences, and initial nominal model and parameter set. The parameter set volume after 20 time-steps is shown. Figure 4 also shows that increasing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0729$ results in a faster parameter set convergence rate, in agreement with Figure 3.

**FIGURE 4**
Open in figure viewer PowerPoint

Volume of parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0730$ against weighting $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0731$ in the MPC objective function (29)

For the same set of simulations, Figure 5 shows the optimal value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0732$ in (28) and (27) against $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0733$ . From (29), it is expected that a larger $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0734$ value will increase the influence of the term $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0735$ , thus pushing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0736$ to be more positive. The left-hand figure shows the value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0737$ in the convexified constraint (28). As expected, the increase in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0738$ leads to a smooth increase in $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0739$ initially, but after a certain point, any further increase in the weighting factor $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0740$ does not affect the calculated $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0741$ value. The right-hand figure shows the value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0742$ in the PE condition (27). The difference between $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0743$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0744$ illustrates the conservativeness of the convexification proposed in Section 3.4. Note that this can be reduced by repeating steps (3) and (4) in the online part of the proposed algorithm, thus iteratively recomputing the reference sequences $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0745$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0746$ and reducing the conservativeness of linearization to any desired level. It is interesting to note that, although the optimal value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0747$ in (28) levels off at $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0748$ , the value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0749$ in the PE condition (27) increases monotonically between $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0750$ and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0751$ . The smaller $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0752$ values observed with (27) also explain the lower rates of parameter convergence for small values of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0753$ in Figure 4. In practice, it can be used as a guideline for the tuning of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0754$ .

**FIGURE 5**
Open in figure viewer PowerPoint

Degree of satisfaction of conditions for persistency of excitation as a function of the weighting $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0755$ in the MPC objective function (29). Left: optimal value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0756$ in the constraint (28). Right: computed value of the PE coefficient $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0757$ in (27)

Table 2 illustrates the convergence of the estimated parameter set over a large number of time steps for the initial condition x₀ = [2, 3]^⊤ and a randomly generated disturbance sequence {w₀,w₁, … }. Here, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0758$ was chosen as 10³ to speed up the convergence process. In agreement with Theorem 3, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0759$ has shrunk to a small region around the true parameter value at t = 5000.

TABLE 2. Asymptotic convergence of the estimated parameter set

Time step (t)	0	1	5	50	100	500	1000	5000
$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0760$ set size (%)	100	30.21	18.50	14.46	12.75	11.11	1.51	0.25

6.2 Relationship between PE coefficient and convergence rate

We next consider third-order discrete-time linear systems given by (1) with $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0761$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0762$ , $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0763$ and

$urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0764$

The system matrices $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0765$ satisfy (4) with randomly generated A_i, B_i, $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0766$ parameters and initial parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0767$ . In each case the estimated parameter sets $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0768$ have fixed complexity, with face normals aligned with the coordinate axes in parameter space. A linear feedback law is applied, u_t = Kx_t, where K is a stabilizing gain. We use these systems to investigate the relationship between the coefficient $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0769$ in the PE condition (27) and rate of convergence of the estimated parameter set.

Taking the window length in (27) to be N_u = 10, closed-loop trajectories were computed for 10 time steps and the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0770$ was updated according to (11). Simulations were performed for 500 different initial conditions, and the average value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0771$ was computed for each initial condition using 100 random disturbance sequences {w₀,w₁, …}. Figure 6 illustrates the relationship between the average size of the identified parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0772$ and the average value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0773$ in the PE condition (27). Clearly, increasing $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0774$ results in a smaller parameter set on average, and hence a faster rate of convergence of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0775$ , which is consistent with the analysis of Section 5.2. The inner and outer radii shown in the figure on the left are the radii of the smallest and largest spheres, respectively, that contain and are contained within the parameter set estimate after 10 time steps. A similar trend can also be seen between the average volume of the parameter set $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0776$ and the ensemble average value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0777$ .

**FIGURE 6**
Open in figure viewer PowerPoint

Average size of parameter set after 10 time steps against average value of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0778$ in the PE condition (27) with N_u = 10. The parameter set size and $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0779$ were computed for 500 different initial conditions. Left: mean side length and inner and outer radii of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0780$ . Right: volume of $urn:x-wiley:rnc:media:rnc5175:rnc5175-math-0781$

7 CONCLUSIONS

In this article, we propose an adaptive robust MPC algorithm that combines robust tube MPC and set membership identification. The MPC formulation employs a nominal performance index and guarantees robust constraint satisfaction, recursive feasibility and ISS. A convexified persistent excitation condition is included in the MPC objective via a weighting coefficient, and the relationship between this weight and the convergence rate of the estimated parameter set is investigated. For computational tractability, a fixed complexity polytope is used to approximate the estimated parameter set. The article proves that the parameter set will converge to the vector of system parameters with probability 1 despite this approximation. Conditions for convergence of the estimated parameter set are derived for the case of inexact disturbance bounds and noisy measurements. Future work will consider systems with stochastic model parameters and probabilistic constraints. Quantitative relationships between the convergence rate of the estimated parameter set and conditions for PE will be investigated further and methods of enforcing PE of the closed-loop system will be considered.

REFERENCES

Citing Literature

Volume31, Issue18

Special Issue:Adaptive and Learning‐based Model Predictive Control

December 2021

Pages 8703-8724

Robust adaptive model predictive control: Performance and parameter estimation

Summary

1 INTRODUCTION

2 PROBLEM FORMULATION AND PRELIMINARIES

3 ADAPTIVE ROBUST MPC

3.1 Set-based parameter estimation

3.2 Polytopic tubes for robust constraint satisfaction

3.3 Objective function

3.4 Augmented objective function and persistent excitation

3.5 Proposed algorithm

4 RECURSIVE FEASIBILITY AND STABILITY

4.1 Recursive feasibility

4.2 Input-to-state stability

5 CONVERGENCE OF THE ESTIMATED PARAMETER SET

5.1 Minimal parameter set

5.2 Fixed complexity parameter set

5.3 Inexact disturbance bounds

5.4 System with measurement noise

6 NUMERICAL EXAMPLES

6.1 Objective function with weighted PE condition

6.2 Relationship between PE coefficient and convergence rate

7 CONCLUSIONS

REFERENCES

Citing Literature

Figures

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Robust adaptive model predictive control: Performance and parameter estimation

Summary

1 INTRODUCTION

2 PROBLEM FORMULATION AND PRELIMINARIES

3 ADAPTIVE ROBUST MPC

3.1 Set-based parameter estimation

3.2 Polytopic tubes for robust constraint satisfaction

3.3 Objective function

3.4 Augmented objective function and persistent excitation

3.5 Proposed algorithm

4 RECURSIVE FEASIBILITY AND STABILITY

4.1 Recursive feasibility

4.2 Input-to-state stability

5 CONVERGENCE OF THE ESTIMATED PARAMETER SET

5.1 Minimal parameter set

5.2 Fixed complexity parameter set

5.3 Inexact disturbance bounds

5.4 System with measurement noise

6 NUMERICAL EXAMPLES

6.1 Objective function with weighted PE condition

6.2 Relationship between PE coefficient and convergence rate

7 CONCLUSIONS

REFERENCES

Citing Literature

Figures

References

Related

Information