Machine Learning Faculty Publications

Learning to Control under Time-Varying Environment

Yuzhen Han, Department of Mechanical & Industrial Engineering, University of Toronto, Toronto, ON, Canada
Ruben Solozabal, Mohamed bin Zayed University of Artificial Intelligence
Jing Dong, The Chinese University of Hong Kong, Shenzhen, China
Xingyu Zhou, Wayne State University, Detroit, United States
Martin Takac, Mohamed bin Zayed University of Artificial IntelligenceFollow
Bin Gu, Mohamed bin Zayed University of Artificial IntelligenceFollow

Document Type

Article

Publication Title

arXiv

Abstract

This paper investigates the problem of regret minimization in linear time-varying (LTV) dynamical systems. Due to the simultaneous presence of uncertainty and non-stationarity, designing online control algorithms for unknown LTV systems remains a challenging task. At a cost of NP-hard offline planning, prior works have introduced online convex optimization algorithms, although they suffer from nonparametric rate of regret. In this paper, we propose the first computationally tractable online algorithm with regret guarantees that avoids offline planning over the state linear feedback policies. Our algorithm is based on the optimism in the face of uncertainty (OFU) principle in which we optimistically select the best model in a high confidence region. Our algorithm is then more explorative when compared to previous approaches. To overcome non-stationarity, we propose either a restarting strategy (R-OFU) or a sliding window (SW-OFU) strategy. With proper configuration, our algorithm is attains sublinear regret O(T2/3). These algorithms utilize data from the current phase for tracking variations on the system dynamics. We corroborate our theoretical findings with numerical experiments, which highlight the effectiveness of our methods. To the best of our knowledge, our study establishes the first model-based online algorithm with regret guarantees under LTV dynamical systems. © 2022, CC BY.

DOI

10.48550/arXiv.2206.02507

Publication Date

6-6-2022

Keywords

Machine Learning (cs.LG), Systems and Control (cs.SY), Systems and Control (eess.SY)

Comments

Open access version thanks to arXiv

License: CC BY 4.0

Uploaded July 05, 2022

Recommended Citation

Y. Han et al., "Learning to Control under Time-Varying Environment", 2022, arXiv:2206.02507

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Machine Learning Faculty Publications

Learning to Control under Time-Varying Environment

Document Type

Publication Title

Abstract

DOI

Publication Date

Keywords

Comments

Recommended Citation

Included in

Browse

Contribute

Links

Machine Learning Faculty Publications

Learning to Control under Time-Varying Environment

Authors

Document Type

Publication Title

Abstract

DOI

Publication Date

Keywords

Comments

Recommended Citation

Included in

Share

Browse

Contribute

Links