Research on over-approximations with previews for correct-by-construction control

This blog post outlines my contributions to correct-by-construction control through the use of over-approximations with preview information. It will play a central role in my research project. This work is currently under review and is also available as a preprint.

We are interested in known nonlinear and deterministic dynamical systems under state and input constraints. Since the dynamics $f$ is nonlinear, such a system is hard to control. For that reason, it is approximated by a simpler (e.g., linear or piecewise linear) dynamics $\hat{f}$ for which synthesizing a safe policy is easier. However, such an approximation introduces an error. The idea of over-approximations is to find a set $W\subset\mathbb{R}^{n_x}$ that contains all the possible errors over the domain, i.e.,

\[f(x,u)-\hat{f}(x,u)\in W,\ \text{ for all }\ (x,u)\in\mathcal{X}\times\mathcal{U}.\]

Instead of controlling the nonlinear system, the idea of over-approximations is to synthesize a policy for the following nondeterministic system:

\[x_{t+1}=\hat{f}(x_t,u_t)+w_t,\]

where

\[w_t=f(x_t,u_t)-\hat{f}(x_t,u_t)\in W\]

is treated as an unknown disturbance.

An over-approximation replaces a deterministic nonlinear system with a nondeterministic linear system such that the trajectory of the nonlinear system is one of the possible trajectories of the nondeterministic over-approximation. If a control policy guarantees safety for all trajectories of the over-approximation, then the same policy is guaranteed to ensure safety for the original system.

In previous works on over-approximations, the disturbance $w_t$ is treated as unknown (see e.g., [1],[2]), and a state-feedback policy $\pi:\mathcal{X}\rightarrow\mathcal{U}$ is designed to complete a control task for any realization of the disturbances. In contrast, in this work we show that the disturbance can be considered as known at runtime, and how it can be exploited by the policy. We make the key observation that at runtime, the state $x_t$ is known and the input $u_t$ is a decision variable, consequently, in the light of

\[w_t=f(x_t,u_t)-\hat{f}(x_t,u_t),\]

the disturbance $w_t$ can be considered as known as an input-dependent preview information. Consequently, rather than synthesizing a feedback policy that depends only on the state, one can design a policy that depends jointly on the state and the disturbance, i.e.,

\[u_t=\pi(x_t,w_t).\]

Using such a preview-based policy has the potential to reduce conservatism compared with a state-feedback policy, because the control input is chosen by taking strictly more information into account. However, it comes with a few challenges.

Control with preview information refers to control strategies that exploit knowledge on future realizations of the system disturbances. A typical example arises in wind turbine control, where LiDAR measurements are used to anticipate upcoming wind speed fluctuations before they affect the turbine dynamics [3].

In contrast, in our work the preview signal is endogenously generated by the over-approximation error and depends explicitly on the chosen control input. This input-dependent structure induces a coupling between the preview information and the control policy (see Figure).

This ensures that both equations are satisfied. This allowed us to prove that the resulting trajectory of the nonlinear system is guaranteed to be safe.

Description
Illustration of the interplay between the control input $u_t$ and the disturbance \( w_t \) due to the use of a preview-based policy.
\[u\mapsto \pi(x_t,f(x_t,u)-\hat{f}(x_t,u)).\]

We then rely on fixed-point theory to study this problem. Existence of solutions follows from the Brouwer fixed-point theorem, while efficient computation is enabled through closed-form, linear, or convex programs for input-affine systems, and through an iterative method based on the Banach fixed-point theorem for nonlinear systems.

Numerical experiments confirm that a preview-based policy leads to a substantial reduction in conservatism compared with a policy that depends on the state only. See the figure below for a comparison.

Description
Finite-horizon control of a nonlinear dynamical system under state and input constraints. The control objective is to steer the terminal state as far to the right as possible while remaining within the domain. A preview-based control policy exploits preview information to guarantee a terminal position further to the right than that achieved by a pure state-feedback policy, illustrating the resulting reduction in conservatism (see terminal sets).

References

[1] A. Girard and S. Martin, “Synthesis for constrained nonlinear systems using hybridization and robust controllers on simplices,” IEEE Transactions on Automatic Control, vol. 57, no. 4, pp. 1046–1051, 2011.

[2] M. Althoff, O. Stursberg, and M. Buss, “Reachability analysis of nonlinear systems with uncertain parameters using conservative linearization,” in 2008 47th IEEE Conference on Decision and Control, IEEE, 2008, pp. 4042–4048.

[3] A. A. Ozdemir, P. Seiler, and G. J. Balas, “Design tradeoffs of wind turbine preview control,” IEEE Transactions on Control Systems Technology, vol. 21, no. 4, pp. 1143–1154, 2013.