Machine Learning Faculty Publications

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Yuriy Dorn, MSU Institute for Artificial Intelligence
Nikita Kornilov, Moscow Institute of Physics and Technology
Nikolay Kutuzov, Moscow Institute of Physics and Technology
Alexander Nazin, V. A. Trapeznikov Institute of Control Sciences, Russian Academy of Sciences
Eduard Gorbunov, Mohamed Bin Zayed University of Artificial IntelligenceFollow
Alexander Gasnikov, Moscow Institute of Physics and Technology

Document Type

Article

Publication Title

Computational Management Science

Abstract

The Implicitly Normalized Forecaster (INF) algorithm is considered to be an optimal solution for adversarial multi-armed bandit (MAB) problems. However, most of the existing complexity results for INF rely on restrictive assumptions, such as bounded rewards. Recently, a related algorithm was proposed that works for both adversarial and stochastic heavy-tailed MAB settings. However, this algorithm fails to fully exploit the available data. In this paper, we propose a new version of INF called the Implicitly Normalized Forecaster with clipping (INF-clip) for MAB problems with heavy-tailed reward distributions. We establish convergence results under mild assumptions on the rewards distribution and demonstrate that INF-clip is optimal for linear heavy-tailed stochastic MAB problems and works well for non-linear ones. Furthermore, we show that INF-clip outperforms the best-of-both-worlds algorithm in cases where it is difficult to distinguish between different arms.

DOI

10.1007/s10287-023-00500-z

Publication Date

6-1-2024

Keywords

Clipping, Multi-armed bandits, Online mirror descent

Recommended Citation

Y. Dorn et al., "Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits," Computational Management Science, vol. 21, no. 1, Jun 2024.

The definitive version is available at https://doi.org/10.1007/s10287-023-00500-z

This document is currently not available here.

COinS

Machine Learning Faculty Publications

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Document Type

Publication Title

Abstract

DOI

Publication Date

Keywords

Recommended Citation

Browse

Contribute

Links

Machine Learning Faculty Publications

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Authors

Document Type

Publication Title

Abstract

DOI

Publication Date

Keywords

Recommended Citation

Share

Browse

Contribute

Links