Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems
Borna Sayedana – McGill University, Canada
** Hybrid seminar at McGill University or Zoom.**
In this talk, we consider the learning and control problem for unknown Markov jump linear systems (MJLS) with perfect state observations. We propose a certainty equivalence-based learning algorithm and show that this algorithm achieves a regret of O(\sqrt{T} log(T)) relative to a certain subset of the sample space. As part of our analysis, we propose a switched least squares method for the identification of MJLS, show that this method is strongly consistent, and derive data-dependent and data-independent rates of convergence. These results show that certainty equivalence control along with the switched least squares method for MJLS has the same rate of convergence as the certainty equivalence control method for linear systems.
Location
CIM
McConnell Building
McGill University
Montréal QC H3A 0E9
Canada