Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems
Borna Sayedana – Université McGill, Canada
Séminaire hybride à l'Université McGill ou Zoom.
In this talk, we consider the learning and control problem for unknown Markov jump linear systems (MJLS) with perfect state observations. We propose a certainty equivalence-based learning algorithm and show that this algorithm achieves a regret of O(\sqrt{T} log(T)) relative to a certain subset of the sample space. As part of our analysis, we propose a switched least squares method for the identification of MJLS, show that this method is strongly consistent, and derive data-dependent and data-independent rates of convergence. These results show that certainty equivalence control along with the switched least squares method for MJLS has the same rate of convergence as the certainty equivalence control method for linear systems.
Lieu
CIM
Pavillon McConnell
Université McGill
Montréal QC H3A 0E9
Canada