Volume 2012, Issue 1 305415

Research Article

Open Access

A Graph Approach to Observability in Physical Sparse Linear Systems

Santiago Vazquez-Rodriguez,

Corresponding Author

Santiago Vazquez-Rodriguez

[email protected]

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Jesús Á. Gomollón,

Jesús Á. Gomollón

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Richard J. Duro,

Richard J. Duro

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Fernando López Peña,

Fernando López Peña

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Santiago Vazquez-Rodriguez,

Corresponding Author

Santiago Vazquez-Rodriguez

[email protected]

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Jesús Á. Gomollón,

Jesús Á. Gomollón

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Richard J. Duro,

Richard J. Duro

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

Fernando López Peña,

Fernando López Peña

Grupo Integrado de Ingeniería, Universidade da Coruña, Mendizábal S/N, 15403 Ferrol, Spain udc.es

Search for more papers by this author

First published: 16 July 2012

https://doi.org/10.1155/2012/305415

Citations: 1

Academic Editor: Massimiliano Ferronato

Share a link

Email
Wechat
Bluesky

Abstract

A sparse linear system constitutes a valid model for a broad range of physical systems, such as electric power networks, industrial processes, control systems or traffic models. The physical magnitudes in those systems may be directly measured by means of sensor networks that, in conjunction with data obtained from contextual and boundary constraints, allow the estimation of the state of the systems. The term observability refers to the capability of estimating the state variables of a system based on the available information. In the case of linear systems, diffierent graphical approaches were developed to address this issue. In this paper a new unified graph based technique is proposed in order to determine the observability of a sparse linear physical system or, at least, a system that can be linearized after a first order derivative, using a given sensor set. A network associated to a linear equation system is introduced, which allows addressing and solving three related problems: the characterization of those cases for which algebraic and topological observability analysis return contradictory results; the characterization of a necessary and sufficient condition for topological observability; the determination of the maximum observable subsystem in case of unobservability. Two examples illustrate the developed techniques.

1. Introduction

The state variables that characterize a physical system are estimated by means of the data available at any given time. This data can be generated from a sensor network spread out over an area or from contextual and boundary constraints. In general, the known system variables are said to be sensed or measured variables whether they are sensed with a real device or their magnitudes are obtained in a sort of virtual sensors. The remaining variables are considered nonsensed or unmeasured variables. In such a context, the observability issue arises when we would like to know if the sensing system is enough to be able to determine the state of the system, that is, the system state variables.

This paper deals with a scenario where a well-known model describes the behavior of a physical system in terms of relationships between system variables and parameters. The system must be linear or linearized after a first-order derivative. In this context, a given sensing network is considered, and the system observability analysis is addressed.

The term observability was introduced in the realm of linear dynamical control systems [1]. It stems from the capability of estimating the state of a system based on the information available. Although observability is essentially a numerical and algebraic problem, some techniques based on topology and graph theory have been developed to provide solutions in this area.

Due to the fact that observability and the problems related to it were studied in different engineering disciplines, the technical terminology is not totally uniform. As a result, some terms are more widely used in some areas and not in others and, in a few cases, different terms describe the same thing in different fields.

Five examples are described below pursuing the following aims: on one hand, illustrating how observability and other related problems constitute research topics in different physical, engineering, and industrial areas, where a sensor network is designed in order to analyze a given system; on the other, showing the multiple points of view from which these issues can be addressed and, in particular, how topological and graph-based approaches were developed in some cases.

The term sensor network comprises a broad spectrum of engineering and physical systems and, in particular, the topic of wireless sensor networks has led to issues that, in one way or another, are related to observability. This is the case of coverage, optimal node placement, and the minimum number of nodes required to achieve connectivity. In [2], it is shown that a graph model can be used to describe those systems, and some graph approaches have been developed in order to provide an answer to the challenges posed.

Whithin the sphere of linear control systems, the controllability problem was addressed from a graph-theoretic approach. A graph associated to a system was defined in [3] and conclusions related to several system properties are derived from the analysis of such a graph. A survey of the techniques proposed in the literature for structured linear systems can be found in [4]. More recently, a graph approach to observability analysis is proposed in [5, 6].

High-voltage electric power networks constitute another field, where observability has been an important issue in system analysis for decades [7, 8]. It is worth mentioning that the approach to the problem in [9] where the authors characterize what they call topological observability through the existence of certain graphs that, defined in the electrical network, obey constraints derived from the sensing network. However, these graph techniques do not allow the inclusion of measurements that are currently being considered, such as current and phasor measures.

Observability has also been a motivation for research in traffic models in topics related to the origin/destination trip matrix estimation challenge. This is the case of [10], where the authors adapt topological techniques developed for electric power networks to this new context. Although this issue is more complex than the description made by the authors in their paper, it has been taken as an example to illustrate the techniques proposed in the present work as will be shown in a later section.

Material and energy balances that must take place in industrial processes are analyzed in [11]. There, its authors distinguish up to four categories of balancing equations, depending on whether they consider or not materials, chemical reactions, energy, and entropy. They study the solvability of the resulting equations, for which a set of sensed variables is taken into account. The observability and redundancy of measurements as well as the errors in the measured values are included in the dissertation. Statistical techniques are used to estimate the state of the system by reconciliation. In the case of linear systems, a parallelism is established between system and sensing observability conditions and the existence of certain graphs defined from the process balancing flowsheet.

The common topic of the aforementioned scenarios, with regards to graph theory, is that certain graph techniques were developed in all the cases because of the existence of graphs or networks that characterized the systems with a given sensor set. Furthermore, the equations that describe the networks are linear or linearized. In this paper, a new graph technique is presented in order to characterize the observability of any linear physical system. The implementation of such a technique imposes constraints on the problem, summarized by the fact that the systems must be sparse and of large dimension. For any sparse and large dimensional physical system, an associated network will be defined based, exclusively, on structural considerations, that is, the topology of the equation system in its matrix form that relates the sensed variables with the state variables. It will be demonstrated that the system can be said to be topologically observable if there exists a certain graph within the associated network.

Krumpholz et al. developed in [9] a topological approach for the observability issue in the scope of electric power systems. Nevertheless, the problem related to the characterization of those cases for which algebraic and structural techniques return contradictory results is not studied. In this paper, the latter problem is solved, which has allowed carrying out a more general demonstration of the necessary and sufficient condition for topological observability than the one proposed by Krumpholz. Numerous techniques have been developed and widely and successfully tested for decades [12–15] in the scope of topological observability analysis in electric power systems. In this paper, a new graph approach is presented, which allows addressing the observability of any linear physical system or, at least, a system linearized after a first-order derivative, and not exclusively electric power systems. Boukhobza et al. had already developed a graph-theoretic technique in order to determine the state and input observability in structured linear systems [5]. Unlike that proposal, the approach presented in this paper makes it possible to exploit techniques like those mentioned above [12–15] to characterize concepts like parametric unobservability and to easily determine the maximum observable subsystem.

The rest of the paper is organized as follows. Starting from a mathematical model, some terms will be introduced concerning observability and sparse physical systems in the next section. Section 3 is devoted to the bases of graph theory and the concepts used throughout the paper. Once the theoretical assumptions have been described, an analogy between linear equation systems and graph theory is established by means of a network associated to the physical system and a given sensor set. Section 5 introduces the concept of topological observability, which is characterized through the existence of a constrained graph in the associated network. The following section is devoted to the cases where the system is not observable and how the search for the maximum observable subsystem is addressed by means of the same graph techniques. Section 7 includes two examples in order to illustrate the techniques proposed in this paper, and how they can be implemented in absolutely different real engineering scenarios. Finally, some conclusions are presented in Section 8.

2. Mathematical Model

In order to determine the state of a system, S, consider a set of m variables ω that are sensed. These variables can be expressed in terms of the n system state variables, φ:

()

where ɛ represents a vector of errors due to the measurement acquisition process. In what follows, this error vector will be ignored because of its irrelevance regarding observability issues. Two different cases might be considered at this point, depending on the linearity of the above equations. On one hand, assume those equations are linear. Then, S is a linear system, and a matrix formulation can be proposed instead of (2.1):

()

where H is a m × n characterization matrix of the system. On the other hand, consider that S is a nonlinear system that can be linearized around a certain state φ₀ and let J(φ₀) be the m × n jacobian matrix, thus:

()

where Δω = ω − h(φ₀) and Δφ = φ − φ₀. Summarizing, both cases resemble an equation system of the form:

()

where z is a constant term vector that results from the m magnitudes sensed throughout the system, x is the unknown vector that is directly related to the n state variables, and M is a coefficient matrix. In what follows, and in order to simplify the explanation, we will refer to z and x as the measurement and state variable column vectors, respectively. Also, z_k will denote a generic measured variable, and x_i will be a generic state variable. The observability issue arises when we would like to know if the m variables considered in the sensor set are enough to determine the state of the system. It depends not only on how large the number of measurements is but also on their nature, and how they are spread out over the system. From an algebraic point of view, a system S is said to be observable if the system given by (2.4) is solvable, that is, the equation system is consistent, and there exist at least n linear independent equations. As Krumpholz et al. define in [9], the system is said to be algebraically observable if and only if the rank of M is equal to n. A well-known problem comes up when the system is ill-conditioned [16] and (2.4) must be solved or matrix M is manipulated. For such cases, different numerical algorithms are proposed in the literature [17, 18]. In order to avoid this problem, other authors [19] take advantage of symbolic methods for sparse matrices [20]. What this paper is related to are the cases, where the observability of a system such as the one defined above can be addressed in terms of structural considerations, what is called topological observability [9]. In order to introduce this topic, let us define some concepts and hypotheses.

Let S be an n-dimensional physical system that is going to be the object of our study, and let a sensed variables set z be defined, where m magnitudes are measured over S. Furthermore, let M be the m × n matrix associated to S, as defined in (2.4). We will say that S is a sparse system if the behavior of S at any point can be justified exclusively by means of the knowledge of the variables in an area based on a certain neighborhood relationship. This is the case of a traffic model system where flow fluctuations in a certain region are strongly dependent on what happens in that area, whereas the events that take place in other parts show a weak dependence or absolute independence from them. One of the features that characterize a sparse system is that matrix M is a sparse matrix. Then, some conclusions can be established in terms of structural considerations of M, when the matrix dimensions and the degree of sparsity are large enough. For this purpose, Bunch and Rose [21] define a graph associated to a matrix M, where a nonzero element m_ij ≠ 0 of M represents an edge that joins vertices i and j. Based on this, some properties can be studied in terms of graph theory because of the duality between sparse linear systems and graphs.

The obvious solution of calculating the rank of matrix M may present problems and may not be even possible in the case of ill-conditioned systems, as mentioned above. In these cases, a topological-based approach becomes a good choice that presents a series of additional advantages derived from the capability of graphs to answer questions related to observability analysis, including the identification of the maximum observable subsystem and optimal additional sensor placement. In short, in this paper we will introduce new topological analysis techniques by means of certain graphs associated with sparse systems in order to determine the topological observability of such systems.

3. Graph Theory

A graph is defined as a collection of nodes or vertices that are joined through the so-called edges or branches. For the sake of homogeneity here we will use the term branches both for general graphs and for the case of trees, which are basically graphs without loops. In the scope of this work we are interested in defining graphs within a given network, which is also a collection of nodes and branches. In other words, a network must be interpreted as the context where any given graph is declared, in such a way that nodes and branches belonging to a graph are also present in the network for which the graph is defined. Nevertheless, not all the nodes and branches of the network are always present in a graph.

Definition 3.1. Let X = {X⁰, X¹} be a network, where X⁰ and X¹ are the sets of nodes and branches, respectively; a graph G of X is defined as a set of nodes, G⁰⊆X⁰, and a set of branches, G¹⊆X¹.

Thus, as in the case of networks, a graph can be denoted by a couple, as follows:

()

In what follows, it is assumed that X is a connected network, that is, a network where there exists a path in X¹ between every pair of nodes of X⁰. In the same way, a connected or unconnected graph G of X can be defined. If a graph G of X is not connected, each connected subgraph that makes it up is known as a connected component. When a connected graph contains no loops, it is called a tree of X.

Definition 3.2. A graph T of X is said to be a spanning tree if T contains no loops, and T⁰ = X⁰.

A directed graph results from the assignment of a direction to each branch in such a way that a node is known as the source, while another node is the target of a directed link.

The matrix representation of any graph G is the node-to-branch incidence matrix, A(G) = (a_kj). This is a matrix with as many rows as nodes are in the graph, and where the number of columns is equal to the number of branches in the graph. The elements of A(G), in the case of directed graphs, are defined as follows:

()

The rank of a graph G of X is defined as:

()

where size {G⁰} denotes the number of nodes in G, while c indicates the number of connected components of G.

Definition 3.3. Let X be a connected network, a graph G of X is said to be of full rank if its rank equals the maximum possible value, rank {G} = size{X⁰} − 1.

The rank of a graph G of X is, by definition, equal to the rank of its associated incidence matrix A(G). If G is of full rank, the rank of A(G) equals the number of rows minus one. In other words, one row of A(G) is linearly dependent on the others. That is the reason why a reduced node-to-branch incidence matrix A_r(G) is defined, resulting from the elimination of a row from A(G). The following expression summarizes all of the above:

()

The selection of one node among others for which the associated row is erased is arbitrary. In what follows, this node is going to be known as the reference node.

Definition 3.4. The closure [22] of a connected graph G in X is defined as a graph , where and is composed by all the branches in X¹ that join pairs of nodes in G⁰.

4. Network Flow Analogy

Consider a set of linear independent variables, {x₁, …, x_n}, that determine the state of a system S. Let z be a system variable whose magnitude may be expressed as a linear relationship between the state variables, as follows (notation: in what follows, subscripts i and j, 1 ≤ i, j ≤ n are used to refer to generic state variables and network nodes; subscript s, 1 ≤ s ≤ n refers to a node that is known as source node; subscript k, 1 ≤ k ≤ m denotes measurements and equations; subscript a, 1 ≤ a ≤ r refers to generic network branches; subscript b under arrays or vectors denotes that those structures contain exclusively branch parameters or variables):

()

where there is at least one value of i for which α_i ≠ 0. Consider that α_s, where 1 ≤ s ≤ n, is the first nonzero coefficient in the above expression. In other words, α_i = 0 for all 1 ≤ i < s. Then, the expression can be rewritten as:

()

which is consistent with the analogy to a flow network as shown in Figure 1. In it, a current z is injected into the network through node s and flows to the remaining nodes, s + 1 to n, according to the admittance values and potential differences of the branches connecting them. Therefore, the following equality must hold:

()

where, for a generic node i, x_i represents the potential level of the node with respect to a zero potential reference node, GND in the figure; λ_si = −α_i represents the admittance that characterizes a branch connecting node s to node i, so that ϕ_si = (x_s − x_i)λ_si is the current that flows from node s to node i due to the potential difference (x_s − x_i) observed from node s to node i; similarly,

is the admittance between node s and GND; hence, a current ϕ_ss = x_sλ_ss flows from node s to the reference node. The network in Figure 1 is defined as the elementary network associated to the linear (4.3), which is known as the network nodal equation at node s. The elementary network is a tree, and s is defined as the source node of that tree, while the remaining nodes are considered target nodes.

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Elementary network.

Note that, on one hand, the elementary network in Figure 1 is characterized by the nodal (4.3), where the flow injected in node s equals the z magnitude; on the other hand, a solution {x_s, …, x_n} to the elementary network in Figure 1 is consistent with (4.1).

Let S be a system, where {z₁, …, z_m} is a set of m variables whose magnitudes may be expressed as linear relationships of the form:

()

A network associated to a linear equation system such as the one shown above is defined as the result of the superposition of the elementary networks associated to each z_k for all 1 ≤ k ≤ m. Then, the solvability of the linear equation system (4.4) is equivalent to that of its associated network, since a particular solution {x₁, …, x_n} to the equation system is consistent with the associated network. Figure 2 shows an example of an associated network X as a result of considering all the elementary networks in their entirety, denoted by E₁, E₂, …, E_m. Let us take a look at a generic node in the figure, such as node number 4. It is easy to see how the incident branches to node 4 are due to elementary networks associated to variables for which 4 is the source node, such as E_m, and those elementary networks including 4 as a flow target node, such as E₁ and E₂.

Let E_k be the elementary network associated to a generic variable z_k defined in S as shown in (4.4), where s is the source node. A branch admittance matrix of E_k is defined as a diagonal matrix as follows:

()

In what follows, it is assumed that all coefficients α_ki considered in the construction of a matrix Y_b(E_k), such as the one defined above, are nonzero. In other words, null coefficients, α_ki, are removed from (4.4). Note that this constraint does not guarantee that all diagonal elements in Y_b(E_k) are nonzero because there might exist a case in which, for a certain k, the sum

equals zero. Those cases are related to the concept of parametric unobservability, and it will be introduced later.

Taking into account the contribution of all the variables {z₁, …, z_m} in S to the whole associated network X, a branch admittance matrix of X is defined as a block diagonal matrix:

()

Then, the following equality is satisfied:

()

where A_r(X) is the reduced node to branch incidence matrix of X; x = (x_i) is the n × 1 nodal potential vector, that is, the system state variable column vector; if r is the number of branches in X, and they are numbered from 1 to r, Φ_b(X) = (ϕ_a) is the r × 1 branch flow vector, that is, a column vector of magnitudes that flow through branches in X; Y_A(X) is a r × n matrix that relates potentials x_i at nodes in X with branch flows ϕ_a.

Equations (4.4) can be expressed in matrix form as follows:

()

where M = (α_ki) is defined as a m × n coefficient matrix, and where z = (z_k) and x = (x_i) are column vectors. Note that each row k of M, that is, each variable z_k considered in the system, will result in an elementary network of X that is a tree because of the lack of loops. Therefore, as any branch in X arises from the existence of a nonzero element in M, a m × r equation to branch incidence matrix, B(X) = (b_ka), associated to X can also be defined as follows:

()

The following equality holds:

()

Equation (4.7) characterizes network X as well as (4.8) characterizes system S from a set of variables {z₁, …, z_m} and, therefore, from equalities (4.7) and (4.10) it can be concluded that the study of the determinism of S is equivalent to the observability of X under constraints related to the variables z_k taken into account.

5. Topological Observability

Krumpholz et al. introduced in [9] the term parametric unobservability as a vague notion needed to justify the concept of topological observability in electric power networks under certain assumptions. In this section, we present a formal description that allows defining and characterizing parametric unobservability and demonstrates how topological observability can be addressed by means of the existence of certain graphs under constraints.

Let S be a large n-dimensional sparse physical system, where a sensing system z is defined by means of m measured variables, m ≥ n. Let M be the coefficient matrix, as defined in (4.8), associated to S and the sensing system, and let X be the associated network. It is important to note that M characterizes only those parts of the system related to measurements, but not the whole physical system. In particular, it shows the relationship between the sensor set considered and the state variables. Therefore, M might be a diagonal or block diagonal matrix, without implying either the existence or nonexistence of decoupled subsystems in S. Obviously, the observability analysis of decoupled subsystems, if they exist, can be carried out independently.

The necessary and sufficient condition for algebraic observability of a system S and a sensing configuration z, as proposed above, is

()

Let us consider an algebraically observable system S with respect to a sensor set z. As M is an m × n matrix and m ≥ n, from (5.1), it follows that a collection of n linearly independent rows of M can be found. Let zⁿ be the subset of z corresponding to those linearly independent rows of M. Therefore, an equation subsystem might be defined in S with respect to zⁿ, that should be characterized using an n × n coefficient matrix Mⁿ and its associated network Xⁿ in such a way that:

()

where zⁿ⊆z, Xⁿ⊆X, and the determinant |Mⁿ | ≠ 0. zⁿ is known as a critical sensing configuration in the sense that the loss of any measurement in zⁿ should derive in the loss of the observability condition with respect to zⁿ. For the same reason, system S is said to be critically observable with respect to zⁿ. The determinant |Mⁿ| is calculated as a sum of products, each coming from n elements in Mⁿ, and no two coming from the same row or column. Since Mⁿ is a nonsingular matrix, at least one of these products must be nonzero. Thus, without loss of generality, in what follows let a permutation of rows be considered such that all the factors of the aforementioned nonnull product lie on the principal diagonal of Mⁿ. Note that any row permutation in Mⁿ does not alter the associated network Xⁿ.

It is clear that the first entry in Mⁿ in the first row is nonnull and, therefore, there exists in Xⁿ a branch joining node 1 and the reference node. In the second row, there are two possible cases: on one hand, if the diagonal element is the first nonzero element in that row, there exists a branch in Xⁿ joining node 2 and the reference node and, indirectly, the first node too; on the other hand, if the diagonal element is not the first nonzero one, there is a link in Xⁿ between nodes 2 and 1. This argument can be repeated for the next row and up to the last one. Eventually, a spanning tree of full-rank T of Xⁿ is completed because of the lack of loops and the inclusion of the totality of the nodes in the network. Furthermore, the previous analysis leads exclusively to one branch in T from each row in Mⁿ. In other words, the n branches in T are derived from n different measurements in z. Since Xⁿ results from the superposition of n elementary networks, one for each sensed value, each branch in T belongs to a different elementary network.

In order to demonstrate that the existence of such a spanning tree is sufficient, under certain conditions, for the observability of a system with respect to a sensing configuration, a reverse path is considered in which branches are added recursively to a starting spanning tree until the entire network is encompassed.

Consider a spanning tree T of Xⁿ, where each one of the n branches of T belongs to a different elementary network out of the n that form Xⁿ. That is, each elementary network in Xⁿ has a branch and only one that belongs to T. From (4.10), it follows that a matrix M(T) can be defined as:

()

where B(T) is a selection of columns from B(Xⁿ), while Y_A(T) is a selection of rows from Y_A(Xⁿ) corresponding to the n branches of T. Thus, B(T) is the n × n identity matrix because the n branches of T belong to n different elementary networks and it follows that:

()

Note that as Y_A(T) has the same sparse pattern as

, and T is a spanning tree of Xⁿ of full rank, rank {Y_A(T)} = n. In other words, |M(T)| ≠ 0 because M(T) is nonsingular. Let k be a generic row of M(T). The first nonzero entry in row k is in the same column, generically represented by s, as the first nonzero element in row k of Mⁿ. At this point, two cases might take place: one in which column s is the only nonzero entry in row k, and another for which there exists a second nonzero element in column l of row k in M(T). Since the determinant of a square matrix can be calculated, according to Laplace’s formula, as a weighed sum of cofactors or adjuncts along a row or a column, it follows that:

()

where m_ks and m_kl are the elements of M(T) in row k and columns s and l, respectively, and C_ks and C_kl are their cofactors. Taking into account the same notation as used in (4.4), if m_ks is the only nonzero entry in row k,

; otherwise, m_ks = −m_kl = α_kl. In both cases, the determinant must be different from zero.

Let T⁺¹ be a graph of Xⁿ that results from the union of T and one-branch a of Xⁿ not in T. Consider that the additional branch belongs to an elementary network E_k that corresponds to row k of M(T) and whose source node is denoted by s. If matrix

is defined from T⁺¹ in the same way as matrix M(T) was from T, then, two different cases may follow

(1)
the additional branch a joins node s and the reference one. Therefore, as the admittance of this branch is equal to , the only entry that makes matrices M(T) and M(T⁺¹) different is:
()
and, from (5.5), the determinant:
()
that is equal to zero when:
()
(2)
the additional branch a joins node s and a node j, where s < j ≤ n. In this case, the branch admittance equals λ_a = −α_kj and both matrices M(T), and M(T⁺¹) are equal but for two entries in row k:
()
and, again, the determinant:
()
that vanishes when:
()

Consider T^+r to be a graph of Xⁿ that results from the addition to T of a number r of branches of Xⁿ not in T, and let the matrix M(T^+r) be defined such that |M(T^+r)| ≠ 0. Let T^+r+1 be a graph of Xⁿ formed after the inclusion in T^+r of a branch a of Xⁿ not in T^+r, and consider that the additional branch belongs to an elementary network E_k that corresponds to row k of M(T^+r) for which the source node is denoted by s. One of the next two cases will follow:

(1)
the additional branch a joins nodes s and the reference one. The admittance of branch a is equal to and the determinant of M(T^+r+1) is estimated by:
()
where is the cofactor of . The above determinant becomes null when:
()
(2)
the additional branch a joins node s and a node j, where s < j ≤ n. Then, the branch admittance is equal to λ_a = −α_kj, and the determinant of M(T^+r+1) is given by:
()
where and are the cofactors of and , respectively. The determinant will be null if:
()

Note that (5.13) and (5.15) allow identifying a set of values of coefficients α_ki for which the determinant |M(T^+r+1)| might be canceled.

New branches can be added to the given graph, one by one, until the entire network Xⁿ is completed, after the inclusion of all the branches in Xⁿ. Therefore, it is concluded by induction that the determinant of matrix M(Xⁿ) is nonzero if its entries α_ki, 1 ≤ k, i ≤ n, do not meet any equality such as (5.8) and (5.13) for branches that join the reference node and (5.11) and (5.15) otherwise.

Consider an example in which a collection of four sensed magnitudes

are acquired from a four-dimensional physical system. As a result, an equal number of linear equations that relate z and the state variables

are established, and a matrix of coefficients M is given by:

()

Figure 3(a) shows the resulting associated network, where the branch admittance values are indicated as well as the sensed variables to which each branch is associated. Figure 3(b) shows a spanning tree T of full rank, in which it can be noted that the four branches that conform the tree are associated to four different measured variables. Then, it follows that:

()

where |M(T)| ≠ 0. The graphs in Figure 4 show how the entire network X can be reached from T by the addition of each branch of X not belonging to T, and how, at each step r + 1, a new M(T^+r+1) is defined from the previous one M(T^+r) after modifying one or two matrix entries, depending on the case. It can be seen that it always follows that |M(T^+r+1)| ≠ 0 except for the exception cases defined in (5.8), (5.11), (5.13), and (5.15).

Note that this result was reached from the consideration of, on one hand, the network topology and the number, nature, and location of sensors in the network and, on the other, the network parameters. To deal with these two approaches, the concept of parametric unobservability is introduced.

Definition 5.1. A large dimensional and sparse physical system S, for which a sensing system z is defined, is said to be parametrically unobservable with respect to z if, in spite of the fact that the ranks of matrices B(X) and Y_A(X) are equal to n, the rank of M is less than n due to the value of one or more coefficients α_ki of M.

The relevance of this concept lies in the fact that, in large dimensional sparse physical systems where the parameters are roughly estimated from empirical data or are subject to environmental distortion, it is unlikely for parametric unobservability to occur [9]. In other fields, such as structured linear systems, it is often necessary to work under the assumption of a lack of knowledge of system parameters [4]. In these scenarios, the parametrically unobservability should be associated with a particular set of parameter values. Thus, the observability of a system is said to be true when it is so for almost all parameter values, that is, for all of them except for a set of particular cases in the parameter space. Even though not all physical systems may meet this requirements, there exist evidences that are true for real cases. For example, electric power network analysis involves hundreds or even thousands of state variables that are usually related to the voltage at network nodes. The system state [7] can be estimated by means of the measurement of the power that flows into and through the electric network and which is influenced only by neighboring node states. Thus, the resulting system is clearly sparse, and circuit parameters are affected by environmental conditions such as temperature and humidity as well as by the unreliability of parameter estimation. Another example is the case of traffic model analysis [10]. As explained later in the example in this paper, vehicles usually move along a geographical area according to a set of established origin/destination pairs. Traffic flows are sensed at routes in the network in order to estimate the state of the system, that is, origin/destination pair traffic flows. As the network grows, the sparsity becomes more plausible. Additionally, system coefficients are estimated, among other factors, from probabilistic considerations related to the ability of people to opt for one route or another. In brief, parametric unobservability is, in these two cases, highly improbable despite being mathematically possible.

On the basis of the large dimension, sparsity and parameterization uncertainty of such systems, in order to address the observability issue a new strategy is proposed involving exclusively structural and not numerical considerations. For this, a new observability definition should be provided.

Definition 5.2. Let S be a large dimensional sparse physical system, where a sensor network z is considered; S with z is said to be topologically observable if S is algebraically observable or parametrically unobservable with respect to z.

Summarizing, it has been demonstrated that the existence of a spanning tree of full-rank T of X where the n branches of T belong to n different elementary networks of X constitutes a necessary and sufficient condition for topological observability. In what follows, any graph G of X with a number r_G of branches that belong to r_G different elementary networks, that is, are associated to r_G different measurements z_k of z, is known as a measured graph.

Theorem 5.3. Let S be a linear and large n-dimensional sparse physical system, where a sensing system z is defined by means of a number of m measured variables, m ≥ n; S is said to be topologically observable with respect to z if and only if there exists a measured spanning tree T of X.

The analysis of the observability of a large dimensional sparse physical system S with respect to a sensing system z from a topological point of view involves searching for a measured spanning tree T of full rank among all possible graphs G of X constructed in such a way that each elementary network that forms X contributes with and only with one branch to G. If the number of sensed values m considered is larger than the dimension n of system S, T will be included, if it exists, as part of a measured spanning graph G of X. In what follows, it is assumed that any graph G of X is a measured graph.

There could be different ways to construct a spanning tree, and any one of them would be valid [12–15]. However what is important here is the fact that the existence of a measured spanning tree is a sufficient condition for the topological observability of a linear system.

Summarizing, taking as a basis the experience in observability analysis in electric power systems, a generalization of the topological approach was developed to address this issue in the scope of other linear, or linearized after a first order derivative, real engineering physical systems. A necessary and sufficient condition for topological observability was established by means of a graph theoretic approach. Finally, thanks to this approach, the characterization of the cases where algebraic strategies do not lead to the same results as those derived from structural analysis was carried out.

6. Maximum Observable Subsystem and Observability Islands

If the observability system test fails for a sensing configuration, it is said that the system is not observable or unobservable. In such cases, the knowledge that might be acquired about one or more parts of the system by all the measures considered should not be underestimated. If a system is not observable, it may be possible to identify a subsystem for which the state can be estimated, it is said that the subsystem is observable. A nondivisible observable subsystem is known as an observability island. The number of observability islands may vary and depends on the associated network topology, the sensors considered and their location in the network.

Consider an n-dimensional sparse physical system S and a sensing configuration z for which an associated network X is defined. Let O be an observable island of S and z, and let X^O⊆X be its associated subnetwork; X^O is known as an observable subnetwork.

A node belonging to X^O is said to be an observable node, and a branch belonging to X^O is an observable branch. A measured spanning graph G of X^O is known as an observable graph. Nodes and branches that do not belong to any observable subnetwork are said to be unobservable.

Let Y be a measured graph of X; a measure z_k associated to a branch of Y is said to be wholly contained [22] in Y if the elementary network associated to z_k is contained in the closure of Y in X. By extension, a measure is said to be wholly contained in a subnetwork X^O if its associated elementary network is included in the closure of X^O in X.

Any measurement z_k considered in O is wholly contained in X^O. Hence, the state of an observability island may be estimated by means of a wholly contained sensor set.

The union of all the observability islands in a system S derives in a maximum observable subsystem while the union of all their associated observable subnetworks in X results in the maximum observable subnetwork. This subnetwork is maximum because it comprises the largest possible number of observable nodes and, if it exists, it is unique [22].

Consider a system S that is not observable for a sensor set z. Then, no measured spanning tree will be found, as derived from Theorem 5.3. Instead, consider a measured graph G of X as one of the largest connected graphs that can be formed according to the sensing system and the constraints described earlier. Then, G is known as a maximum measured graph of X but not spanning. The next theorem was extracted from [22], where relevant properties concerning maximum measured graphs and observable subsystems are described.

Theorem 6.1. Let S be a system and X the associated network from a given sensor set. If G is any maximum measured graph of X, the maximum observable subnetwork is contained in the closure of G in X.

Therefore, based on one of the maximum measured graphs, an iterative process can take place by which the not wholly contained measurements, and their elementary networks are removed from the system until the maximum observable subnetwork is obtained. Additionally, other strategies concerning the search for the maximum observable subsystem can be found in [13, 14] in the scope of electric power networks.

7. Examples

Two examples are presented in this section in order to illustrate the techniques developed in this paper, focusing the attention on the fact that these techniques are valid for different real engineering problems, where a collection of linear equations or equations linearized after a first order derivative describe the behavior of the system from a measurement acquisition system viewpoint.

7.1. Traffic Model Example

One of the fundamental problems in traffic models concerns the estimation of the origin/destination (OD) trip matrix. Traffic flows are measured by means of sensors spread out at different locations in a study area. These data, in conjunction with other available information, are used to estimate the target matrix, that is, the traffic derived from any OD movement. For each OD pair there exist, in general, more than one alternative to complete the trip that are usually expressed in terms of percentages or probabilities based on contextual factors. In addition, the flow magnitudes at a link in a traffic network can be broken down into percentages of vehicles moving along different OD trips. Thus, linear relationships can be established between OD-pair and link flows. Let t = (t_od) and v = (v_b) be OD-pair and link flow vectors, respectively; their linear relationships can be described by a matrix F as follows:

()

Figure 5 shows a benchmark case, known as the Nguyen-Dupuis network [23] in the literature, consisting of 13 plausible origin/destination places that are interconnected by 19 bidirectional links. In that scenario, vehicles can move from one place to another through suitable routes. Figure 5 shows indices assigned to links along with their directions. Therefore, for an OD-pair, any possible path is defined as a series of oriented link indices; for example, the sequence {2,36,20} denotes an alternative for a displacement from 1 (origin) to 2 (destination).

In what follows, it is assumed that matrix F and the OD-pairs are known. Below are all the OD pairs considered in this example and their potential paths as well as matrix F. They are the same as those tested in [10]:

()

Note that F characterizes the physics of the whole traffic network because it relates the defined OD-pair flows with all the 38 possible oriented traffic link flows:

()

A question arises when we want to know if a given sensor network allows to estimate the state of the traffic system or where sensors should be placed in order to complete an observable sensed system. Two cases are going to be taken into account concerning these issues.

7.1.1. Case 1: Observable Configuration

Consider a sensor network consisting of 8 traffic flow meters that result in a measured variable vector z whose magnitudes might be estimated by means of a submatrix of F and the system state variables x, that is, OD-pair traffic flows, as follows:

()

The question arises as to whether OD-pair traffic flows x can be estimated from this sensor set z among the aforementioned oriented link flows.

In Figure 6, the elementary networks derived from the coefficient matrix M of (7.4) are shown. Note how OD-pairs play the role of network nodes, while OD-pair traffic flows are the network node potential levels. In the figures, branch admittance values are indicated; indices were assigned to the branches and are shown in the figures by smaller numbers next to the arrows.

Figure 7 shows the entire associated network and how a measured spanning tree, highlighted using thick line, was found among other possibilities. Note that each elementary network is related to one and only one branch in the resulting measured spanning tree. This tree is not unique but the existence of, at least one, guarantees the topological observability of the system for the sensor set defined in (7.4).

7.1.2. Case 2: Not Observable Configuration

In a second case, a total of 6 traffic flow meters are considered. The question arises as to whether system observability can be achieved by incorporating additional sensors. And if it is not possible, which is the maximum observable subsystem?

Let {v₂, v₄, v₅, v₇, v₁₂, v₃₈} be the initial sensor set. This is a subset of the observable configuration discussed earlier. Therefore, the linear equations that characterize this case are given by:

()

Figure 8 shows the resulting associated network, X, and one of the possible maximum measured graphs, G (thick lines). Note that OD-pairs 2-4 and 4-3 are clearly not observable, that is, their traffic flows cannot be estimated by means of the available measurements. A more detailed analysis leads to the conclusion that measurements v₄, v₇, and v₁₂ are not wholly contained in G and, therefore, their associated elementary networks E₂, E₄, and E₅, respectively, should be removed from the network in order to search for the maximum observable subnetwork. This argument should be repeated until the resulting subnetwork is made up exclusively of elementary networks associated to wholly contained measurements. That is the case after removing E₁, the elementary network associated to measure v₂. From there, the maximum observable traffic subsystem is immediate and is given by OD-pairs 3-1 and 3-4 and oriented traffic link flow sensed values {v₅, v₃₈}.

To achieve a totally observable system, it is necessary to add two new traffic flow meters that allow to join the maximum measured graph in Figure 8 and the isolated nodes given by OD-pairs 2-4 and 4-3. Each row in matrix F of (7.2) corresponds to an oriented traffic link flow and, in particular, those rows with nonzero coefficients in columns related to isolated OD-pairs are plausible candidates to improve the system observability. Thus, the inclusion of one of the sensed values from:

()

that allow joining the OD-pair 2-4 node, in conjunction with one of the following:

()

that allow joining OD-pair 4-3 node, would permit observing the whole traffic system.

7.2. Electric Power System Example

As it was mentioned in the introduction, observability analysis in electric power systems has been an important research topic for decades. In particular, this issue can be addressed by means of topological methods, when the set of measured variables are made up exclusively of bus voltages and active and reactive powers that are injected into or flow through the system [9]. In those cases the system can be considered as a decoupled system [7], that is, a pair of two independent subsystems: one of which can be analyzed by means of active power measurements, and is known as P-δ subsystem; the other one, the Q-U subsystem, can be studied exclusively from bus voltages and reactive powers measured in the system. Only the Q-U subsystem is going to be analyzed in this example. Such a subsystem is observable [7] when a sufficient number of well-placed reactive powers are measured, and, at least, one node voltage is known at any node.

An electric power system is commonly represented as a mesh where the edges denotes the lines in charge of transporting the electric energy, and where the nodes are the places where the lines are incident, that is, the places where electricity is generated, consumed, or transformed. Figure 9(a) shows the topology of an example of an electric power system with 8 nodes. The places where reactive powers are acquired in the system are shown in Figure 9(b), where two kinds of measurements may be distinguished:

(i)
node measurements, numbered as 1, 2, and 3 in Figure 9(b), corresponding to reactive powers injected into the system through a node. These derive in equations of the form:
()
where Q_i denotes the i-th node reactive power, U_i represents the voltage at node i, α_ki is a coefficient related to measurement k and node i, and c_k denotes a constant term related to measurement k;
(ii)
branch measurements, numbered as 4, 5, 6, and 7 in Figure 9(b), corresponding to reactive powers that flow through the lines. These derive in equations of the form:
()
where Q_ij denotes a branch reactive power that is acquired in a line that joins nodes i and j.

Finally, a voltage measure at node 1 is also considered, resulting in an equation as follows:

()

Summarizing, the linear equations that characterize the Q-U subsystem and the given measurement configuration are as follows:

()

Figure 9(c) shows the associated network derived from (7.11), where branch admittances were suppressed in order to clarify the drawing. The numbers close to the oriented edges of the graph denote the order of the measurement from which the edge is derived, that is, the order of the elementary network in which it is defined. Note that the only branch associated to measurement z₈ and, in general, to any node voltage measure, is the one that joins the node where the voltage is acquired and the reference node. As a result, the reference node is implicitly connected to the rest of the nodes due to the inclusion of just one node voltage measurement, and a simplified associated network may be taken into account as shown in Figure 9(d), where thicker lines represent the edges that are present in the entire individuals of the search space of measured graphs. Note that those edges are the ones due to the node voltage and branch reactive power measurements.

Finally, one of the possible measured spanning trees is shown in Figure 9(e), after the assignment of each of the eight measurements considered to one of the edges in the associated network. The existence of such a tree permits concluding that the electric power Q-U subsystem is topologically observable for the given sensing system.

8. Conclusions

In this paper, a new topological approach to the determination of the observability of a physical system where a sensor network is defined has been presented. The techniques developed in this paper were inspired by the contributions of researchers in the scope of electric power systems and generalized to other physical sparse linear systems. The terms parametric unobservability and topological observability have been introduced and justified in a formal way, which allows characterizing those parameter dependent cases where an algebraic approach to the observability issue led to different results than the topological one. A sensing system has been considered for any linear physical system or, at least, linearized after a first order derivative. From there, an associated network has been defined, and it has been demonstrated that the existence of certain constrained graphs, known as measured graphs, in the scope of the associated network permits characterizing the topological observability of the system. From this graph approach, the determination of the maximum observable subsystem can be carried out in case of unobservability. The technique has been illustrated with the help of two examples in the scope of traffic sensing structures and electric power systems.

Acknowledgments

This work was partially funded by the Xunta de Galicia, the MICINN of Spain and European Regional Development Funds through projects 09DPI012166PR, 10DPI005CT, and TIN2011-28753-C02-01.

References

1 Kalman R. E., Mathematical description of linear dynamical systems, SIAM Journal on Control and Optimization. (1963) 1, 152–192, 0152167, ZBL0145.34301.
Google Scholar
2 Li J., Andrew L. L. H., Foh C. H., Zukerman M., and Chen H. H., Connectivity, coverage and placement in wireless sensor networks, Sensors. (2009) 9, no. 10, 7664–7693, 2-s2.0-70350393597, https://doi.org/10.3390/s91007664.
10.3390/s91007664
PubMed Web of Science® Google Scholar
3 Lin C. T., Structural controllability, IEEE Transactions on Automatic Control. (1974) 19, 201–208, 0452870, ZBL0343.93009.
10.1109/TAC.1974.1100557
Web of Science® Google Scholar
4 Dion J.-M., Commault C., and van der Woude J., Generic properties and control of linear structured systems: a survey, Automatica. (2003) 39, no. 7, 1125–1144, https://doi.org/10.1016/S0005%2D1098(03)00104%2D3, 2140844, ZBL1023.93002.
10.1016/S0005-1098(03)00104-3
Web of Science® Google Scholar
5 Boukhobza T., Hamelin F., and Martinez-Martinez S., State and input observability for structured linear systems: a graph-theoretic approach, Automatica. (2007) 43, no. 7, 1204–1210, https://doi.org/10.1016/j.automatica.2006.12.004, 2306767, ZBL1123.93024.
10.1016/j.automatica.2006.12.004
Web of Science® Google Scholar
6 Boukhobza T. and Hamelin F., Observability analysis for structured bilinear systems: a graph-theoretic approach, Automatica. (2007) 43, no. 11, 1968–1974, https://doi.org/10.1016/j.automatica.2007.03.010, 2572923, ZBL1129.93331.
10.1016/j.automatica.2007.03.010
Web of Science® Google Scholar
7 Schweppe F. C., Power system static-state estimation, parts I,II and III, IEEE Transactions on Power Apparatus and Systems. (1970) 89, no. 1, 120–135.
10.1109/TPAS.1970.292678
Web of Science® Google Scholar
8 Monticelli A. and Wu F. F., Network observability: theory, IEEE Transactions on Power Apparatus and Systems. (1985) 104, no. 5, 1042–1048, 2-s2.0-0022059192.
10.1109/TPAS.1985.323454
Web of Science® Google Scholar
9 Krumpholz G. R., Clements K. A., and David P. W., Power system observability: a practical algorithm using network topology, IEEE Transactions on Power Apparatus and Systems. (1980) 99, no. 4, 1534–1542, 2-s2.0-0019042727.
10.1109/TPAS.1980.319578
Web of Science® Google Scholar
10 Castillo E., Jiménez P., Menéndez J. M., and Conejo A. J., The observability problem in traffic models: Algebraic and topological methods, IEEE Transactions on Intelligent Transportation Systems. (2008) 9, no. 2, 275–287, 2-s2.0-45249094079, https://doi.org/10.1109/TITS.2008.922929.
10.1109/TITS.2008.922929
Web of Science® Google Scholar
11 Veverka V. and Madron F., Material and Energy Balancing in the Process Industries: from Microscopic Balances to Large Plants, 1997, Elsevier Science, Computeraided Chemical Engineering.
Google Scholar
12 Quintana V. H., Simoes-Costa A., and Mandel A., Power system topological observability using a direct graph-theoretic approach, IEEE Transactions on Power Apparatus and Systems. (1982) 101, no. 3, 617–626, 2-s2.0-0019535748.
10.1109/TPAS.1982.317275
Web of Science® Google Scholar
13 Nucera R. R. and Gilles M. L., Observability analysis: a new topological algorithm, IEEE Transactions on Power Systems. (1991) 6, no. 2, 466–475, 2-s2.0-0026155558, https://doi.org/10.1109/59.76688.
10.1109/59.76688
Web of Science® Google Scholar
14 Korres G. N. and Katsikas P. J., A hybrid method for observability analysis using a reduced network graph theory, IEEE Transactions on Power Systems. (2003) 18, no. 1, 295–304, 2-s2.0-0037291892, https://doi.org/10.1109/TPWRS.2002.807072.
10.1109/TPWRS.2002.807072
Web of Science® Google Scholar
15 Vazquez-Rodriguez S., Faina A., and Neira-Duenas B., An evolutionary technique with fast convergence for power system topological observability analysis, Proceedings of the IEEE Congress on Evolutionary Computation (CEC ′06), 2006, 3086–3090.
Google Scholar
16 Rice J. R., Matrix Computations and Mathematical Software, 1983, McGraw-Hill, New York, NY, USA, McGraw-Hill Computer Science Series, 634029.
Google Scholar
17 Kim H. J., A new algorithm for solving ill-conditioned linear systems, IEEE Transactions on Magnetics. (1996) 32, no. 3, 1373–1376, 2-s2.0-0030141909.
10.1109/20.497502
Web of Science® Google Scholar
18 Kurtzl S. K., A direct algorithm for solving ill-conditioned linear algebraic systems, Advances. (2000) 42, 629–633.
Google Scholar
19 Cosnard M. and Grigori L., Using postordering and static symbolic factorization for parallel sparse lu, Proceedings of the 14th International Parallel and Distributed Processing Symposium (IPDPS ′00), 2000, 807–812.
Google Scholar
20 Tinney W. F., Brandwajn V., and Chan S. M., Sparse vector methods, IEEE Transactions on Power Apparatus and Systems. (1985) 104, no. 2, 295–301, 2-s2.0-0022012702.
10.1109/TPAS.1985.319043
Web of Science® Google Scholar
21 Bunch J. R. and Rose D. J., Partitioning, tearing and modification of sparse linear systems, Journal of Mathematical Analysis and Applications. (1974) 48, 574–593, 0353641, https://doi.org/10.1016/0022%2D247X(74)90179%2D6, ZBL0294.15003.
10.1016/0022-247X(74)90179-6
Web of Science® Google Scholar
22 Clements K. A., Krumpholz G. R., and Davis P. W., Power system state estimation with measurement deficiency: an algorithm that determines the maximal observable subnetwork, IEEE Transactions on Power Apparatus and Systems. (1982) 101, no. 9, 3044–3052, 2-s2.0-0020178080.
10.1109/TPAS.1982.317548
Web of Science® Google Scholar
23 Nguyen S. and Dupuis C., An efficient method for computing traffic equilibria in networks with asymmetric transportation costs, Transportation Science. (1984) 18, no. 2, 185–202, 2-s2.0-0021427308.
10.1287/trsc.18.2.185
Web of Science® Google Scholar

Citing Literature

All articles

A Graph Approach to Observability in Physical Sparse Linear Systems

Abstract

1. Introduction

2. Mathematical Model

3. Graph Theory

4. Network Flow Analogy

5. Topological Observability

6. Maximum Observable Subsystem and Observability Islands