A Behavioral Framework for Data-Driven Modeling of Nonlinear Systems in Vector-Valued Reproducing Kernel Hilbert Spaces
Abstract
We generalize Jan Willems’ behavioral approach to a class of discrete-time nonlinear systems in a vector-valued reproducing kernel Hilbert space (RKHS). Apart from linear time-invariant systems, this class covers nonlinear systems modeled by Volterra series and their autoregressive variants, as well as systems admitting Hammerstein-type state-space realizations. We apply the proposed framework to the problem of data-driven modeling of such systems, i.e., when simulation or control objectives for an unknown system are carried out without an explicit system identification step. To that end, we link the behavioral approach to two data-driven modeling methods in a vector-valued RKHS: (1) minimum-norm interpolation and (2) subspace identification.
I Introduction
The mathematical study of dynamical systems seeks to describe the evolution of a given system over time under its governing laws, initial conditions, and inputs (if any). This evolution can be studied internally through state-space methods or externally via the system’s behavior. Pioneered by Jan Willems in a series of papers starting with [25] (see [26] for an overview), the behavioral approach identifies dynamical systems with sets of their trajectories. For linear time-invariant systems, the cornerstone of this framework is the so-called fundamental lemma [24], which asserts that the set of all trajectories of a controllable linear time-invariant system over a finite time horizon can be reconstructed from a single trajectory driven by a persistently exciting input. This result has become central to recent advances of data-driven control; see, e.g., [14, 13, 6, 8, 23] and references therein.
The philosophy underlying these data-driven approaches, as clearly articulated in the paper by Markovsky and Rapisarda [13], is that simulation or control should be carried out without an explicit system identification step. In the context of simulation of linear systems, for example, this principle is supported by the fact that, given an initial condition and a subsequent input of interest, one can combine it with previous measurements of the system behavior to predict the resulting output without identifying the system transfer function, impulse response, or state-space model. Rather, simulation can be viewed as a “missing data” problem, to be solved directly using the available trajectories.
While this viewpoint has led to a rich and mature literature for linear systems, extending it to nonlinear systems is substantially more challenging, as the trajectory sets no longer form linear subspaces. Several works have explored nonlinear analogues of the fundamental lemma for structured nonlinear systems. Examples include bilinear systems [12], second-order Volterra systems [18], flat nonlinear systems [1], Hammerstein and Wiener systems [3], and a Koopman-type linear embedding posited in [20]. A promising direction is to use a reproducing kernel Hilbert space (RKHS) to lift the trajectories of nonlinear systems into a (possibly infinite-dimensional) feature space. In this vein, Huang, Lygeros, and Dörfler [10] investigated kernelized data-enabled predictive control, where the resulting models are regression-based approximations rather than behavioral characterizations. More recently, Molodchyk and Faulwasser [16] established the link between kernel regression and Willems’ fundamental lemma, and showed that several existing nonlinear extensions of the fundamental lemma, such as those for Hammerstein and flat systems, can be interpreted as instances of kernel regression with specific choices of kernel when the induced RKHS is finite-dimensional.
In this paper, we extend the behavioral framework beyond the linear settings for characterizing the behavior of nonlinear systems in RKHS via minimum-norm interpolation and subspace identification repurposed for nonlinear systems. We reinterpret both minimum-norm interpolation and subspace identification through the behavioral lens, and clarify the role of various structural assumptions on the system, the offline and online data, and the space of functions used to represent the nonlinear aspects of the system. A unifying theme across these two methods is that, like in the modern data-driven schemes and in the spirit of the fundamental lemma, predictors and the desired “online” trajectories can be expressed in terms of the kernelized “offline” examples. When working in an RKHS, as noted by Molodchyk and Faulwasser [16], this structure further mirrors the representer theorem [19]: the solution can be written as a finite linear combination of kernel evaluations along the observed trajectories.
The remainder of this paper is organized as follows. In Section II, we recall the fundamental lemma of Willems for linear time-invariant systems and introduce basic concepts regarding scalar-valued and vector-valued RKHSs. In Section III, we introduce a class of nonlinear systems whose nonlinearity is encoded by an element in the vector-valued RKHS of functions on the set of finite-length sequences of inputs. In Section IV, we establish a Behavior Representer Theorem for nonlinear systems using minimum-norm interpolation and contrast it with the fundamental lemma. Compared with the work of Molodchyk and Faulwasser [16], we consider minimum-norm interpolation instead of (regularized) least-squares regression in a vector-valued RKHS. We further establish an error estimate that holds with equality as in Lemma 4, rather than the inequaliity in [16, Lemma 2]. In Section V, we present a subspace identification framework for nonlinear systems in an RKHS and provide a fundamental lemma characterization of finite-length trajectories with kernelized input vectors and outputs.
II Preliminaries
II-A Notation and definitions
We will make use of the following notation and definitions throughout the paper. We will use to denote the set of nonnegative integers. The Moore–Penrose pseudoinverse of a matrix will be denoted by . Given the matrices , , and , the oblique projection of the rowspace of on the rowspace of along the rowspace of is defined as
(see [21, Section 1.4.2]). Given a finite sequence of vectors in , the Hankel matrix of depth is the matrix given by
We say that is persistently exciting (PE) of order if the Hankel matrix has full row rank. Given a signal (discrete-time vector-valued sequence) , is the backward shift operator defined by . Additional notation will be introduced as needed.
II-B A review of the fundamental lemma
In the behavioral framework, a linear time-invariant system with variables is identified with a linear subspace of the space of all one-sided sequences of -dimensional real vectors denoted as which is shift-invariant, i.e., . One can work with various equivalent representations of , such as autoregressive, input/output, or input/state/output representations [25]. Notions like controllability or observability can be defined solely in terms of set-theoretic properties of , without referring to a particular representation [26].
One of these structural properties pertains to the partition of the variables of into inputs and outputs. Without getting too much into technical details, the idea is that, up to a permutation of coordinates, each trajectory can be partitioned into an input trajectory and an output trajectory as , such that the input is free in the sense that
and the output is determined by the input, subject to the system laws and the initial condition. It can be shown that the number of inputs , the number of outputs , and the dimension of any minimal state space realization of are system invariants that depend only on and not on the particular representation of [25].
Let be a controllable linear time-invariant system with inputs, outputs, and minimum state dimension . The main result of [24], now commonly referred to as the fundamental lemma, is as follows:
Lemma 1.
For each , let denote the restriction of to times . Let a trajectory be given, such that the input part of is persistently exciting of order . Then is equal to the column space of the Hankel matrix .
The main message of Lemma 1 is that the length- behavior can be reconstructed, exactly and in a representation-independent manner, from a single input/output trajectory of length , provided the input is persistently exciting and the system is controllable. This makes the fundamental lemma a key ingredient in data-driven approaches to system simulation and control.
II-C Reproducing Kernel Hilbert Spaces
In this paper, we make extensive use of vector-valued reproducing kernel Hilbert spaces. We start by describing the more familiar definition of a scalar-valued RKHS; we refer interested readers to [4] for details. Let be a set. A Hilbert space111Unless indicated otherwise, all Hilbert spaces in this paper are assumed to be defined over the reals. with inner product is an RKHS on if its elements are functions from to and if for each there exists a positive constant , such that for all . In other words, is an RKHS on if the evaluation functional is bounded (hence continuous) for all . To each RKHS, we can associate a reproducing kernel, i.e., a mapping that satisfies the positivity condition
for all , , and , such that is an element of for each and the following property (called the reproducing kernel property holds:
Moreover, the set is total in (i.e., its linear span is dense in ). The map defined by is called the canonical feature map. By the Moore–Aronszajn theorem [2], the reproducing kernel specifies uniquely up to linear isomorphism.
We now describe the vector-valued generalization of the above definition [5]. Let be a Hilbert space, and let denote the Hilbert space of bounded linear operators on with the Hilbert–Schmidt norm. Then we say that is a -valued RKHS on if its elements are functions from to and if the evaluation map from to is bounded (hence continuous) for all , i.e., there exists a positive constant such that . The vector-valued analogue of the reproducing kernel is an operator-valued map of the positive type, i.e., such that the inequality
holds for all , , , and . The reproducing kernel property then takes the form
where, for each , is an element of the linear space of bounded operators from to . The map from into is the (operator-valued) canonical feature map.
A basic example of a vector-valued RKHS which will be used frequently in the sequel is , where and are finite-dimensional inner-product spaces and where we equip with the Hilbert–Schmidt inner product . For the sake of completeness, we present the straightforward proof of the following lemma in Appendix A.
Lemma 2.
is a -valued reproducing kernel Hilbert space on with the operator-valued reproducing kernel , where is the identity operator on .
III Nonlinear Systems in a Vector-Valued RKHS
We now introduce a class of discrete-time nonlinear systems, where the nonlinearity is represented by an element of a given vector-valued RKHS of functions on the set of finite-length sequences of inputs. We will use the following notation throughout:
-
•
is the space of inputs;
-
•
is the space of outputs;
-
•
is the space of length- input sequences, where is a fixed lag;
-
•
is the space of length- output sequences;
-
•
.
We introduce the system model in Section III-A and the associated behavioral constructs in Section III-B, and close by discussing several examples in Section III-C.
III-A System model
Let be a vector-valued RKHS of functions from into , with the operator-valued kernel of positive type.222Here and elsewhere, all finite-dimensional vector spaces are automatically treated as Hilbert spaces with the usual Euclidean inner product. We consider systems parametrized by -tuples , where and . The input/output relation corresponding to is given by
| (1) |
where is the restriction of the input sequence to the set . By the reproducing kernel property, we can also write
where is the adjoint of the canonical feature map .
It is also convenient to cast (1) in an alternative nonlinear regression form. For each , let denote the output at time and define the regression vectors
| (2) |
which is in analogy to the corresponding construct for linear systems [17, 9]. As we now show, we can introduce a -valued RKHS of functions from into , such that the autoregressive nonlinear model of (1) can be represented as
| (3) |
for some that depends on .
To that end, let us first express (1) as
| (4) |
where is the linear operator defined by
By Lemma 2, is a -valued RKHS on with the reproducing kernel . Consider the direct-sum Hilbert space , whose elements are pairs where , , and for each , . The inner product in is
for , . Since , are -valued RKHSs with operator-valued kernels and , is also an RKHS with operator-valued kernel defined by
This can be proved as follows: for any and , we have
where the second line follows from the reproducing kernel property. In particular, (3) holds with the function defined in (4).
III-B Behaviors
In this section, we introduce several behavioral constructs related to the system model of Section III-A.
Given , define the operator (with ), where is the shift operator acting on output sequences. Then (1) is equivalent to
| (5) |
and we can define the behavior of as the following subset of the space of all input/output sequences:
| (6) |
For , the restriction of the behavior to the finite time interval is defined by
When , we will use as shorthand for .
Next, we turn to the nonlinear regression form in (3). Recall that the identity
holds by the reproducing kernel property. Define the operator by
where
| (7) |
is the regression vector at time [cf. Eq. (2)] and is the output at time . We say that is an input-output (i/o) pair of the system (3) if . This leads naturally to the following behavioral description:
When and are related via (3), it is easily verified that
| (8) |
where and are defined in (7).
III-C Examples
III-C1 LTI Systems
Let consist of all linear operators from into , i.e., . By Lemma 2, this is a -valued RKHS on . Any can be represented as for some linear operators . Then Eq. (1) describes a linear autoregressive model of order :
The operator-valued kernel map is given by
where
is the inner product on viewed as a direct sum of Hilbert spaces with the Euclidean inner product .
III-C2 Volterra series
When for all , the nonlinear autoregressive model in (1) reduces to
| (9) |
This system has finite memory of length , since the output at each time is determined by the inputs in the finite time window of length . Systems of this type are often represented using Volterra series. De Figueiredo and Dwyer [7] used the formalism of weighted Fock spaces to connect Volterra series representation to the RKHS framework.
Let us assume for simplicity that (i.e., the outputs are scalar). Let be a sequence of positive reals, such that the infinite series
converges for all , and let denote the RKHS of functions from into with the reproducing kernel
Let and be a collection of real coefficients satisfying the condition
where and
for . Then any function of the form
is an element of [7].
Removing the restriction yields a broader class of nonlinear systems that subsumes the models studied in [7].
III-C3 State-space models
Consider a Hammerstein state-space model of the form
| (10) |
where are two (nonlinear) functions. We assume that the state takes values in , so , , , and . We can obtain an input-output representation (1) from the state-space representation (10) as follows.
Introduce the Markov parameters for . Define the -step controllablity matrix , the -step observability matrix , the reversed -step controllability matrix , and the modified block Toeplitz operator as
| (11) |
Let us assume that is such that . Then from (10), for , we have
| (12) |
Define the vectors
Then, stacking the equations in (12) for to , we have
Since , we can solve for :
| (13) |
On the other hand, when , we have
Plugging in the expression for from (13),
The matrix can be written in block form as with for . Setting for and moving them to the LHS of the above equation, we have
With , we can write the RHS as a function of on as
where , , and defined above is a mapping from into . Assume now that are elements of some -valued RKHS on . Thus, by definition, the evaluation maps and are bounded for each . Consider the direct-sum Hilbert space with inner product for . The function defined above is an element of the linear space of functions of the form
where ranges over and where are linear operators from into . For each , the evaluation map is continuous on . Thus, belongs to some -valued RKHS on .
IV Behavior Representer Theorem Via Minimum Norm Interpolation
Let be a finite input-output trajectory generated by an unknown system of the form (1). In other words, there exists an unknown -tuple , such that
The problem of data-driven behavioral modeling is to reconstruct the unknown length- behavior segment from the data without explicitly estimating .
Let be the regression representation of as in (3). Using the definitions of in (2) and , we can represent the data equivalently by , such that holds for all . In other words, for each , is a valid i/o pair for (3). In view of the equivalence (8), the question we would like to answer is whether we can reconstruct the set of all valid i/o pairs of the unknown from the given data.
The following result is easy to establish:
Theorem 3.
For any collection of real coefficients , is a valid i/o pair for the model (3).
Theorem 3 shows that any linear combination of is a valid i/o pair of (3). We now analyze the reverse direction by relating it to minimum-norm interpolation in a vector-valued RKHS [15]. Let finite input-output data be given and consider the following problem:
| (14) |
Define the sampling operator by . By [15, Theorem 3], if , then the minimum norm interpolation problem in (14) has a unique solution given by
| (15) |
where solves the system of equations
| (16) |
We can express this more succinctly as follows.
Since , for each we can view as a mapping from into , i.e., for each , . Define the mapping as
where . In particular, , where is the solution of (16). Next, define the block kernel matrix whose blocks are given by for , as well as the block row vector with blocks for . Using the reproducing property and the above definitions, we can write
| (17) |
where . Finally, for each define as
| (18) |
We now present two key lemmas. The first one is a structural characterization of :
Lemma 4.
For each , the operator is symmetric and positive semi-definite, and iff .
The second one is an extension of a result of Liang and Recht [11, Lemma 1] to the minimum-norm interpolation problem in the vector-valued RKHS :
Lemma 5.
Suppose that the problem (14) admits unique solutions and given the respective data and . Then the following holds:
-
1.
If , then .
-
2.
If , we have
(19)
Lemma 5 characterizes the error incurred when we use as a predictor of , where is the solution to the minimum norm interpolation problem on . In particular, the prediction is exact when . If is nonzero but still positive definite, the identity (19) expresses the squared norm of the weighted prediction error in terms of the norms of and .
In a scalar-valued RKHS, is equal to
| (20) | ||||
which is the squared distance from to the linear span of kernel functions centered at the observed samples [11]. While Lemma 5 applies to any vector-valued RKHS of -valued functions on , a natural choice of the operator-valued kernel could be , where is a scalar-valued kernel function. Let be the Gram matrix defined by the scalar-valued kernel on the points , let be the row vector with , , and . Finally, define
where the distance is computed in the scalar-valued RKHS induced by , cf. (20). Then, using the properties of the Kronecker product , we can compute as follows:
Hence, the equality in Lemma 5 reduces to
This indicates that, given a new , if , we have . Given Lemmas 4 and 5, the following is immediate:
Theorem 6 (Behavior Representer Theorem).
Let be a length- trajectory of regression and output vectors for an unknown , i.e.,
Let be an element of , and let be computed according to (7). Then the following holds:
-
1.
If , then , where and are computed from the data according to (17) and .
-
2.
If , then
(21) where (respectively, is the minimum-norm interpolator of (respectively, of .
The condition describes the setting when exact reconstruction is possible. By Lemma 4, this will be the case for all pairs for which . If does not satisfy this condition but , we can quantify the reconstruction error using (21). For LTI systems, a sufficient condition for the existence of unique minimum-norm interpolating solutions is
which is the classical persistence of excitation condition [17, 9]. Exact reconstruction is possible when . In the nonlinear setting, the condition plays an analogous role via Lemma 4.
It is useful to compare Theorem 6 with the fundamental lemma for LTI systems (cf. Section II). The latter characterizes behaviors through the image of the Hankel matrix built from observed input-output trajectories. Theorem 6 of this section is conceptually analogous, but is phrased in terms of a different data representation based on regression vectors, which are formed from inputs and past outputs.
V Subspace Identification in the Vector-Valued RKHS Setting
We now revisit systems that arise from state-space representations, as in Section III-C3. We will focus on a particular class of such systems, namely ones that can be represented as
| (22) |
where is a mapping from the input space into .
Given , we construct the -step controllability matrix , the -step observability matrix , the reversed -step controllability matrix , and the -step modified Toeplitz matrix exactly as in (11). The -step Toeplitz matrix is given by
Let denote input/output data of length collected from measurements of (22) starting from some initial condition . In the LTI case (i.e., when and is the identity map), subspace identification methods [21] allow one to reconstruct, under certain regularity conditions, the state trajectory and the observability matrix directly from the input/output data without knowledge of the system matrices .
In this section, we show that these methods can be extended to the set-up of (22) when is an element of a suitable vector-valued RKHS on . Moreover, we obtain a result in the spirit of the fundamental lemma for LTI systems, namely that the set of all valid length- input/output trajectories of (22) can be reconstructed directly from the data without an intermediate system identification step.
V-A The construction of a vector-valued RKHS
There are various ways of instantiating the state-space model (22) in the vector-valued RKHS framework presented in Section III-C, i.e., choosing a suitable -valued RKHS on the input space with reproducing kernel so that . Perhaps the simplest one is to define the operator-valued map by
which is readily seen to be of the positive type since, for any , any , any , and any ,
By [5, Prop. 2.3], there is a unique -valued RKHS of functions on with reproducing kernel . The kernel has the convenient property
| (23) |
V-B Subspace identification using RKHS methods
We now have all the ingredients in place for putting together a subspace identification framework analogous to the one for LTI systems [21].
Let the input/output data be given. Introduce the Hankel-type block matrices
| (24) |
and
| (25) |
where and designates a partition into “past” and “future” data. Let and denote, respectively, the concatenations of and and of and :
Similar to the linear case, let denote the oblique projection of the rowspace of onto the rowspace of along the rowspace of :
which satisfies . Let the SVD of be given by
Following [21], let , where is the state trajectory of (22) determined by the initial state and the inputs . In the LTI case, can be recovered, up to a similarity transformation, via the formula (see, e.g., [21, Theorem 2, Chapter 2]). In fact, the same result holds in the present nonlinear case (the idea is to view as an input to an LTI system given by ). Given the input/output data, let
| (26) |
and form the input Gram matrix of depth as with entries . The following theorem is a straightforward adaptation of subspace identification results for LTI systems:
Theorem 7.
Let be a sequence of length generated by the system (22), where is controllable and is observable. Suppose that the input sequence is such that . Then, and .
The process of computing the oblique projection and constructing the state vector can be carried out using Gram matrices computed from pairwise kernel evaluations , without explicitly using . Specifically, define the Gram matrices , , and , whose entries are
respectively, where (a) and (b) follow from (23).
The oblique projection can be computed as
| (27) |
where the second-to-last line follows from the identity .
To construct the state vector, instead of directly performing SVD on , we construct , , and such that via eigendecomposition of and . Denote and and notice that we have
Then, we can obtain as the eigenvalues and right eigenvectors of . Likewise, we have
Denote . Then, we have
That is, has an eigenvector associated with eigenvalue iff is an eigenvector of corresponding to the same eigenvalue. With , , and , we can define the extended observability matrix and states (up to a similarity transform) as
The following corollary is immediate from the above process.
Corollary 8 (Construction of state vectors).
Suppose that is controllable and is observable, i.e., . Let input-output data of length be given. Suppose the input sequence is such that . Let be the eigenpair of , and be the eigenpair of . Define . Then the extended observability matrix is , and the state sequence is (both up to a similarity transformation).
The next result explicitly connects subspace identification to an RKHS version of the fundamental lemma for nonlinear systems admitting state-space realization (22).
Theorem 9.
Consider the state-space model (22), where is controllable and is observable. Let input-output data of length be given, and suppose the input Gram matrix is such that and that is chosen so that has full column rank. Then a length- sequence is a valid input/output trajectory of the system (22) if and only if there exists such that
where, for , the vectors and have entries given by , ; and , are the Euclidean inner products of outputs.
In the context of the above theorem, we can view as kernel matrices of the “offline” training data generated by (22). By splitting the kernel vector of the “online” testing sequence into two parts (past and future), we test whether one can interpolate the kernel matrices of past and future using the same coefficients, where the initial length- segment of the testing sequence specifies the initial condition for the subsequent length- segment. A similar observation for LTI systems on the specification of initial condition has been given in [13, Proposition 1]. In the proof of Theorem 9, we further show that the trajectory passes through a state at time that is a linear combination of states from the training data, i.e., for some .
In light of Theorem 9, we can compute the predictor via
where is the solution of the following equation,
The interpolation-based method in Section IV characterizes the future output using kernelized offline data to make predictions regarding the online data. On the other hand, the realization-based method aims to capture the output as linear functions of states and kernelized input vectors . The preference between the two approaches depends on the memory length and the state dimension . E.g., in the regime , the states serve as an efficient representation of the system’s history.
VI Conclusion
In this paper, we have put forward a behavioral framework for modeling a class of nonlinear systems in a vector-valued RKHS. This formulation is rich enough to cover LTI systems as well as nonlinear systems modeled by Volterra series, autoregressive models based on Volterra series, and Hammerstein-type state-space models. Using this framework, we have analyzed two methods for data-driven modeling of such systems, minimum-norm interpolation and subspace identification. We have clarified the role of various structural assumptions on the system, the data (both offline and online), and the vector-valued RKHS that represents the nonlinear aspects of the system. More broadly, this work expands the scope of behavioral systems theory to nonlinear systems. In doing so, it reinforces the conceptual shift underlying data-driven control: rather than aiming to recover the state evolution, one can directly actualize the desired behaviors using observed trajectories.
Appendix A Proof of Lemma 2
Let and be the Hilbert-space norms on and induced by their respective inner products. Consider the evaluation functional given by . It is bounded since
where the last step follows from the relation
Hence, is an RKHS. We next show that is the reproducing kernel. It is obvious that is of the positive type. To show it satisfies the reproducing property, notice that for any and we have
where is the rank-one linear operator .
On the other hand, for , we have
Hence, we have
That is, satisfies the reproducing property.
Appendix B Proofs For Results in Section IV
B-A Proof of Theorem 3
Let be given. For each , the relation
holds for all . Multiplying both sides by and summing over , we have
for all , where the last line follows from the reproducing property. Hence, the pair satisfies Eq. (3) as claimed.
B-B Proof of Lemma 4
Define the following subspace of :
| (28) |
Let denote the orthogonal projection onto and notice that . Using this in the definition of in (18), we get
Hence, for any ,
Hence, is positive semi-definite. In particular, iff for all , that is, if .
B-C Proof of Lemma 5
Since interpolates , we have for all and ,
On the other hand, for and all ,
where and is the orthogonal projection onto . Since is arbitrary, for , i.e., both and interpolate . Since is the unique minimum-norm solution in , it must be the case that . Hence,
That is, there exists some such that
| (29) |
We now determine . By the reproducing property, we have
| (30) |
For the last line, applying the reproducing property again, we can write the first term as . For the second term, by orthogonality we have
where we have used the fact that, since is an orthogonal projection,
for all . Since is an orthogonal projection onto , we have
and
where and are the solutions of and . Therefore,
where the last line holds since .
Putting everything together and using the definition of , we arrive at
Plugging the above relation back into (30) and using the fact that is arbitrary and that is self-adjoint, we see that is determined by the relation
As interpolates , i.e., , we can further write
| (31) |
Hence, when , we will have .
Appendix C Proofs For Results in Section V
C-A Proof of Theorem 9
In our model (22), we can view as the -dimensional input to the LTI system parametrized by . Consequently, the argument in the proof in [22, Theorem 1] extends directly to our setting and guarantees that the matrix
has full row rank.
: Suppose that is a valid input-output trajectory of (22). Then there exists an initial state such that
Since has full row rank, for any , there exists some such that
Thus, we have
Moreover, from the dynamics (22), we have
| (32) |
Hence, at time ,
Thus, starting from with input sequence , we have
Since , and is constrained by (22), we have
Hence,
Since , we conclude that is the output from with input . As is a valid input-output trajectory, we can use (22) and defined in (32) to obtain the state vector at time as
| (33) |
For the second segment, using the assumptions and the matrix input-output relation for , , we have
Thus, we have
where the last equality follows from (33). Therefore, we conclude that is the output from with input .
Putting everything together, we have
Multiplying both sides of the first equation by and the second equation by proves the claim.
References
- [1] (2021) Data-based system analysis and control of flat nonlinear systems. In 2021 60th IEEE Conference on Decision and Control (CDC), pp. 1484–1489. Cited by: §I.
- [2] (1950) Theory of reproducing kernels. Transactions of the American mathematical society 68 (3), pp. 337–404. Cited by: §II-C.
- [3] (2020) A trajectory-based framework for data-driven system analysis and control. In 2020 European Control Conference (ECC), pp. 1365–1370. Cited by: §I.
- [4] (2011) Reproducing kernel hilbert spaces in probability and statistics. Springer Science & Business Media. Cited by: §II-C.
- [5] (2006-10) Vector valued reproducing kernel Hilbert spaces of integrable functions and Mercer theorem. Analysis and Applications 04 (04), pp. 377–408. Cited by: §II-C, §V-A.
- [6] (2019) Data-enabled predictive control: in the shallows of the deepc. In 2019 18th European control conference (ECC), pp. 307–312. Cited by: §I.
- [7] (1980) A best approximation framework and implementation for simulation of large-scale nonlinear systems. IEEE Transactions on circuits and systems 27 (11), pp. 1005–1014. Cited by: §III-C2, §III-C2, §III-C2.
- [8] (2022) Bridging direct and indirect data-driven control formulations via regularizations and relaxations. IEEE Transactions on Automatic Control 68 (2), pp. 883–897. Cited by: §I.
- [9] (1986) Persistence of excitation in linear systems. Systems & Control Letters 7 (5), pp. 351–360. Cited by: §III-A, §IV.
- [10] (2023) Robust and kernelized data-enabled predictive control for nonlinear systems. IEEE Transactions on Control Systems Technology 32 (2), pp. 611–624. Cited by: §I.
- [11] (2023) Interpolating classifiers make few mistakes. Journal of Machine Learning Research 24 (20), pp. 1–27. Cited by: §IV, §IV.
- [12] (2022) Data-driven dynamic interpolation and approximation. Automatica 135, pp. 110008. Cited by: §I.
- [13] (2008) Data-driven simulation and control. International Journal of Control 81 (12), pp. 1946–1959. Cited by: §I, §I, §V-B.
- [14] (2006) Exact and approximate modeling of linear systems: a behavioral approach. SIAM. Cited by: §I.
- [15] (2005) On learning vector-valued functions. Neural computation 17 (1), pp. 177–204. Cited by: §IV, §IV.
- [16] (2024) Exploring the links between the fundamental lemma and kernel regression. IEEE Control Systems Letters 8, pp. 2045–2050. Cited by: §I, §I, §I.
- [17] (1983) Persistence of excitation in extended least squares. IEEE Transactions on Automatic Control 28 (1), pp. 60–68. Cited by: §III-A, §IV.
- [18] (2020) Data-driven internal model control of second-order discrete volterra systems. In 2020 59th IEEE Conference on Decision and Control (CDC), pp. 4572–4579. Cited by: §I.
- [19] (2001) A generalized representer theorem. In International conference on computational learning theory, pp. 416–426. Cited by: §I.
- [20] (2024) Willems’ fundamental lemma for nonlinear systems with koopman linear embedding. IEEE Control Systems Letters. Cited by: §I.
- [21] (1996) Subspace identification for linear systems: theory—implementation—applications. Kluwer. Cited by: §II-A, §V-B, §V-B, §V.
- [22] (2020) Willems’ fundamental lemma for state-space systems and its extension to multiple datasets. IEEE Control Systems Letters 4 (3), pp. 602–607. Cited by: §C-A.
- [23] (2023) The informativity approach to data-driven analysis and control. IEEE Control Systems Magazine 43 (6), pp. 32–66. Cited by: §I.
- [24] (2005) A note on persistency of excitation. Systems & Control Letters 54 (4), pp. 325–329. Cited by: §I, §II-B.
- [25] (1986) From time series to linear system—part i. finite dimensional linear time invariant systems. Automatica 22 (5), pp. 561–580. Cited by: §I, §II-B, §II-B.
- [26] (1991) Paradigms and puzzles in the theory of dynamical systems. IEEE Transactions on Automatic Control 36 (3), pp. 259–294. Cited by: §I, §II-B.
Comments
· 0