Reduction of data amount in data-driven design of linear quadratic regulators

Izumi, Shinsaku; **n, **n

doi:10.1007/s11768-024-00220-y

Reduction of data amount in data-driven design of linear quadratic regulators

Research article
Published: 28 May 2024

(2024)
Cite this article

Control Theory and Technology Aims and scope Submit manuscript

Shinsaku Izumi¹ &
**n **n²

62 Accesses
Explore all metrics

Abstract

This paper discusses the data-driven design of linear quadratic regulators, i.e., to obtain the regulators directly from experimental data without using the models of plants. In particular, we aim to improve an existing design method by reducing the amount of the required experimental data. Reducing the data amount leads to the cost reduction of experiments and computation for the data-driven design. We present a simplified version of the existing method, where parameters yielding the gain of the regulator are estimated from only part of the data required in the existing method. We then show that the data amount required in the presented method is less than half of that in the existing method under certain conditions. In addition, assuming the presence of measurement noise, we analyze the relations between the expectations and variances of the estimated parameters and the noise. As a result, it is shown that using a larger amount of the experimental data might mitigate the effects of the noise on the estimated parameters. These results are verified by numerical examples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

Data-driven estimation of the algebraic Riccati equation for the discrete-time inverse linear quadratic regulator problem

Article Open access 20 May 2024

Linear Quadratic Regulator Controller Design

A Nonparametric Approach to Design Fixed-order Controllers for Systems with Constrained Input

Article 30 October 2018

Data Availability

Our data are available in the sense that the same (or similar) data can be generated by simulations based on the mathematical contents of this article and references.

Notes

Although two design methods were proposed in [9], we introduce one of them called Algorithm 1.

References

Campi, M. C., Lecchini, A., & Savaresi, S. M. (2002). Virtual reference feedback tuning: A direct method for the design of feedback controllers. Automatica, 38(8), 1337–1346.
Article MathSciNet Google Scholar
Soma, S., Kaneko, O., & Fujii, T. (2004). A new method of controller parameter tuning based on input-output data—Fictitious Reference Iterative Tuning (FRIT). IFAC Proceedings Volumes, 37(12), 789–794.
Yan, P., Liu, D., Wang, D., & Ma, H. (2016). Data-driven controller design for general MIMO nonlinear systems via virtual reference feedback tuning and neural networks. Neurocomputing, 171, 815–825.
Article Google Scholar
Nicoletti, A., Martino, M., & Karimi, A. (2019). A robust data-driven controller design methodology with applications to particle accelerator power converters. IEEE Transactions on Control Systems Technology, 27(2), 814–821.
Article Google Scholar
Baggio, G., Katewa, V., & Pasqualetti, F. (2019). Data-driven minimum-energy controls for linear systems. IEEE Control Systems Letters, 3(3), 589–594.
Article MathSciNet Google Scholar
Berberich, J., Koch, A., Scherer, C.W., & Allgöwer, F. (2020). Robust data-driven state-feedback design. In: Proceedings of the 2020 American Control Conference, pp. 1532–1538.
Bisoffi, A., De Persis, C., & Tesi, P. (2022). Data-driven control via Petersen’s lemma. Automatica, 145, 110537.
Article MathSciNet Google Scholar
Lewis, F. L., Vrabie, D. L., & Syrmos, V. L. (2012). Optimal Control. Hoboken, NJ: Wiley.
Book Google Scholar
Gonçalves da Silva, G. R., Bazanella, A. S., Lorenzini, C., & Campestrini, L. (2019). Data-driven LQR control design. IEEE Control Systems Letters, 3(1), 180–185.
van Waarde, H. J., & Mesbahi, M. (2020). Data-driven parameterizations of suboptimal LQR and $H_2$ controllers. IFAC-PapersOnLine, 53(2), 4234–4239.
Rotulo, M., De Persis, C., & Tesi, P. (2020). Data-driven linear quadratic regulation via semidefinite programming. IFAC-PapersOnLine, 53(2), 3995–4000.
Article Google Scholar
De Persis, C., & Tesi, P. (2021). Low-complexity learning of linear quadratic regulators from noisy data. Automatica, 128, 109548.
Article MathSciNet Google Scholar
Aangenent, W., Kostić, D., de Jager, B., Molengraft, R., & Steinbuch, M. (2005). Data-based optimal control. In: Proceedings of the 2005 American Control Conference, pp. 1460–1465.
Lim, R.K., Phan, M.Q., & Longman, R.W. (1998). State estimation with ARMarkov models. Princeton University Department of Mechanical and Aerospace Engineering Technical Report (3046).
Sturm, J. F. (1999). Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones. Optimization Methods and Software, 11(1–4), 625–653.
Article MathSciNet Google Scholar
Guo, T., Al Makdah, A. A., Krishnan, V., & Pasqualetti, F. (2023). Imitation and transfer learning for LQG control. IEEE Control Systems Letters, 7, 2149–2154.
Article MathSciNet Google Scholar

Download references

Acknowledgements

The first author would like to thank Mr. A. Takeuchi for discussing the proposed design method.

Author information

Authors and Affiliations

School of Systems Engineering, Kochi University of Technology, Kami, Kochi, 782-8502, Japan
Shinsaku Izumi
School of Automation, Southeast University, and Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Nan**g, 210096, Jiangsu, China
**n **n

Authors

Shinsaku Izumi
View author publications
You can also search for this author in PubMed Google Scholar
**n **n
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shinsaku Izumi.

Ethics declarations

Conflicts of interest

The authors have no conflicts of interest to declare that are relevant to the contents of this paper.

Appendices

Proof of Theorem 1

By using (24) and the linearity of expectation, $\textsf{E}[\hat{\varTheta }]$ is calculated as

$$\begin{aligned} \textsf{E}[\hat{\varTheta }]&=\textsf{E}\left[ (Y_0^*+W_{y0})(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\textsf{E}\left[ Y_0^*(\bar{Z}^*+W_x)^\dagger +W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\textsf{E}\left[ Y_0^*(\bar{Z}^*+W_x)^\dagger \right] +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\textsf{E}\left[ \varTheta \bar{Z}^*(\bar{Z}^*\!+\!W_x)^\dagger \right] +\textsf{E}\left[ W_{y0}(\bar{Z}^*\!+\!W_x)^\dagger \right] \nonumber \\&=\varTheta \bar{Z}^*\textsf{E}\left[ (\bar{Z}^*\!+\!W_x)^\dagger \right] +\textsf{E}\left[ W_{y0}(\bar{Z}^*\!+\!W_x)^\dagger \right] , \end{aligned}$$

(A1)

where the fourth equality is derived from $Y_0^*=\varTheta \bar{Z}^*$ due to (21). Therefore, (26) holds.

A direct calculation using (1), (2), (24), (26), and the linearity of expectation yields

$$\begin{aligned}&\textsf{V}[\hat{\varTheta }]=\textsf{E}[\hat{\varTheta }\hat{\varTheta }^\textrm{T}]-\textsf{E}[\hat{\varTheta }]\textsf{E}[\hat{\varTheta }]^\textrm{T} \nonumber \\&\quad =\textsf{E}\left[ (Y_0^*+W_{y0})(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T} \right. \nonumber \\&\qquad \left. \times (Y_0^*+W_{y0})^\textrm{T}\right] -\left( \varTheta \textsf{E}\left[ \bar{Z}^*(\bar{Z}^*+W_x)^\dagger \right] \right. \nonumber \\&\qquad \left. +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \right) \left( \varTheta \textsf{E}\left[ \bar{Z}^*(\bar{Z}^*+W_x)^\dagger \right] \right. \nonumber \\&\qquad \left. +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \right) ^\textrm{T} \nonumber \\&\quad =\textsf{E}\Bigl [Y_0^*(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{Y_0^*}^\textrm{T} \nonumber \\&\qquad \quad +Y_0^*(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{W_{y0}}^\textrm{T} \nonumber \\&\qquad \quad +W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{Y_0^*}^\textrm{T} \nonumber \\&\qquad \quad +W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{W_{y0}}^\textrm{T}\Bigr ] \nonumber \\&\qquad -\left( \varTheta \bar{Z}^*\textsf{E}\left[ (\bar{Z}^*\!+\!W_x)^\dagger \right] \!+\!\textsf{E}\left[ W_{y0}(\bar{Z}^*\!+\!W_x)^\dagger \right] \right) \nonumber \\&\qquad \times \left( \varTheta \bar{Z}^*\textsf{E}\left[ (\bar{Z}^*\!+\!W_x)^\dagger \right] \!+\!\textsf{E}\left[ W_{y0}(\bar{Z}^*\!+\!W_x)^\dagger \right] \right) ^\textrm{T} \nonumber \\&\quad =\textsf{E}\left[ Y_0^*(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{Y_0^*}^\textrm{T} \right] \nonumber \\&\qquad +\textsf{E}\left[ Y_0^*(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{W_{y0}}^\textrm{T}\right] \nonumber \\&\qquad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{Y_0^*}^\textrm{T}\right] \nonumber \\&\qquad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{W_{y0}}^\textrm{T}\right] \nonumber \\&\qquad -\left( Y_0^*\textsf{E}\left[ (\bar{Z}^*\!+\!W_x)^\dagger \right] \!+\!\textsf{E}\left[ W_{y0}(\bar{Z}^*\!+\!W_x)^\dagger \right] \right) \nonumber \\&\qquad \times \left( Y_0^*\textsf{E}\left[ (\bar{Z}^*\!+\!W_x)^\dagger \right] \!+\!\textsf{E}\left[ W_{y0}(\bar{Z}^*\!+\!W_x)^\dagger \right] \right) ^\textrm{T} \nonumber \\&\quad =Y_0^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T} \right] {Y_0^*}^\textrm{T}\nonumber \\&\qquad +Y_0^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{W_{y0}}^\textrm{T}\right] \nonumber \\&\qquad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}\right] {Y_0^*}^\textrm{T} \nonumber \\&\qquad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}{W_{y0}}^\textrm{T}\right] \nonumber \\&\qquad -Y_0^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] \left( Y_0^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] \right) ^\textrm{T} \nonumber \\&\qquad -Y_0^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] \textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] ^\textrm{T} \nonumber \\&\qquad -\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \left( Y_0^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] \right) ^\textrm{T} \nonumber \\&\qquad -\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] ^\textrm{T} \nonumber \\&\quad =Y_0^*\Bigl (\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T} \right] \nonumber \\&\qquad \quad -\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] \textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] ^\textrm{T} \Bigr ) {Y_0^*}^\textrm{T} \nonumber \\&\qquad +Y_0^*\Bigl ( \textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger (W_{y0}(\bar{Z}^*+W_x)^\dagger )^\textrm{T}\right] \nonumber \\&\qquad \quad -\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] \textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] ^\textrm{T} \Bigr ) \nonumber \\&\qquad +\Bigl ( \textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger ((\bar{Z}^*+W_x)^\dagger )^\textrm{T}\right] \nonumber \\&\qquad \quad -\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] ^\textrm{T} \Bigr ) {Y_0^*}^\textrm{T} \nonumber \\&\qquad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger (W_{y0}(\bar{Z}^*+W_x)^\dagger )^\textrm{T}\right] \nonumber \\&\qquad -\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] ^\textrm{T} \nonumber \\&\quad =Y_0^*\textsf{V}\left[ (\bar{Z}^*+W_x)^\dagger \right] {Y_0^*}^\textrm{T} +\textsf{V}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&\qquad +Y_0^*\textsf{C}\left[ (\bar{Z}^*+W_x)^\dagger , W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&\qquad +\textsf{C}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger ,(\bar{Z}^*+W_x)^\dagger \right] {Y_0^*}^\textrm{T}, \end{aligned}$$

(A2)

where the fourth equality is given by $Y_0^*=\varTheta \bar{Z}^*$. Hence, we obtain (27) due to (3). This completes the proof.

Proof of Theorem 2

We first prove (i). By using the linearity of expectation and $(\bar{Z}^*+W_x)(\bar{Z}^*+W_x)^\dagger = I_{n+mN}$, (26) can be rewritten as

$$\begin{aligned} \textsf{E}[\hat{\varTheta }]&=\varTheta \bar{Z}^*\textsf{E}\left[ (\bar{Z}^*+W_x)^\dagger \right] +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\varTheta \textsf{E}\left[ \bar{Z}^*(\bar{Z}^*+W_x)^\dagger \right] +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\varTheta \textsf{E}\left[ (\bar{Z}^*+W_x-W_x)(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&\quad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\varTheta \textsf{E}\left[ I_{n+mN}-W_x(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&\quad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&=\varTheta -\varTheta \textsf{E}\left[ W_x(\bar{Z}^*+W_x)^\dagger \right] \nonumber \\&\quad +\textsf{E}\left[ W_{y0}(\bar{Z}^*+W_x)^\dagger \right] . \end{aligned}$$

(B3)

Because the plant (4) is stable from the assumption and the input u(t) and the noises $w_x(t)$ and $w_y(t)$ satisfy $u(t)\in \mathbb {R}^m$, $w_x(t)\in \mathbb {R}^n$, and $w_y(t)\in \mathbb {R}^\ell $, respectively, the magnitudes of the elements of the matrices $\varTheta $, $W_x$, $W_{y0}$, and $Y_0^*$ in (B3) and (27) are finite. Hence, applying $(\bar{Z}^*+W_x)^\dagger \rightarrow 0_{L\times (n+mN)}$ to (B3) and (27) proves (i).

Next, we prove (ii). Under $L\ge n+mN$ shown in Table 1, the singular value decomposition of $\bar{Z}^*+W_x$ gives

$$\begin{aligned} (\bar{Z}^*+W_x)^\dagger =T\varSigma V^\textrm{T}, \end{aligned}$$

(B4)

where $T\in \mathbb {R}^{L\times L}$ and $V\in \mathbb {R}^{(n+mN)\times (n+mN)}$ are orthogonal matrices and

$$\begin{aligned} \varSigma := \begin{bmatrix} \textrm{diag}(1/\sigma _1,1/\sigma _2,\ldots ,1/\sigma _{n+mN}) \\ 0_{(L-n-mN)\times (n+mN)} \end{bmatrix} \end{aligned}$$

(B5)

for the singular values $\sigma _i$ $(i=1,2,\ldots ,n+mN)$ of $\bar{Z}^*+W_x$. Note in (B5) that $\sigma _i\ne 0$ holds for every $i\in \{1,2,\ldots ,n+mN\}$ because $\bar{Z}^*+W_x$ is assumed to be full row rank. From (B4) and the orthogonality of T and V, if $(\bar{Z}^*+W_x)^\dagger \rightarrow 0_{L\times (n+mN)}$ holds, $\sigma _i\rightarrow \infty $ holds for every $i\in \{1,2,\ldots ,n+mN\}$. This implies that the Frobenius norm $\Vert \bar{Z}^*+W_x\Vert _F$ of $\bar{Z}^*+W_x$ approaches infinity because its square is equal to the sum of $\sigma _i^2$ $(i=1,2,\ldots ,n+mN)$. By the definition of the Frobenius norm, if $\Vert \bar{Z}^*+W_x\Vert _F$ approaches infinity, the number of the elements of $\bar{Z}^*+W_x$, i.e., $L(n+mN)$, approaches infinity because the magnitudes of the elements are finite by a discussion similar to the above. This, together with the assumption that the parameter N is fixed, proves (ii), which completes the proof.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Izumi, S., **n, X. Reduction of data amount in data-driven design of linear quadratic regulators. Control Theory Technol. (2024). https://doi.org/10.1007/s11768-024-00220-y

Download citation

Received: 11 August 2023
Revised: 07 December 2023
Accepted: 08 January 2024
Published: 28 May 2024
DOI: https://doi.org/10.1007/s11768-024-00220-y

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

Reduction of data amount in data-driven design of linear quadratic regulators

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Data-driven estimation of the algebraic Riccati equation for the discrete-time inverse linear quadratic regulator problem

Linear Quadratic Regulator Controller Design

A Nonparametric Approach to Design Fixed-order Controllers for Systems with Constrained Input

Data Availability

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Appendices

Proof of Theorem 1

Proof of Theorem 2

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Reduction of data amount in data-driven design of linear quadratic regulators

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Data-driven estimation of the algebraic Riccati equation for the discrete-time inverse linear quadratic regulator problem

Linear Quadratic Regulator Controller Design

A Nonparametric Approach to Design Fixed-order Controllers for Systems with Constrained Input

Data Availability

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Appendices

Proof of Theorem 1

Proof of Theorem 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation