How to gamble with non-stationary X-armed bandits and have no regrets

Avanesov, Valeriy

doi:https://doi.org/10.34657/8374

Files

wias_preprints_2686.pdf(299.76 KB)

Date

2020

Authors

Avanesov, Valeriy

Volume

2686

Series Title

WIAS Preprints

Publisher

Berlin : Weierstraß-Institut für Angewandte Analysis und Stochastik

Link to publishers version

https://doi.org/10.20347/WIAS.PREPRINT.2686

Abstract

In the X-armed bandit problem, an agent sequentially interacts with an environment that yields a reward based on the vector input the agent provides. The agent's goal is to maximise the sum of these rewards over some number of time steps. The problem and its variations have been the subject of numerous studies, which have proposed strategies with sub-linear and sometimes optimal regret. This paper introduces a new variation of the problem. We consider an environment that can abruptly change its behaviour an unknown number of times. For this setting, we propose a novel strategy and prove that it attains sub-linear cumulative regret. Moreover, the obtained regret bound matches the best known bound for GP-UCB in the stationary case and approaches the minimax lower bound when the relation between an action and the corresponding reward is highly smooth. The theoretical result is supported by an experimental study.
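
The setting described in the abstract can be illustrated with a toy simulation. The sketch below is not the strategy proposed in the preprint. It only shows the interaction loop and one common definition of cumulative regret (the gap to the best action at each step), which may differ from the preprint's exact definition. The environment, the names `make_env`, `cumulative_regret` and `fixed_strategy`, and all numeric values are hypothetical, chosen purely for illustration.

```python
import random


def make_env(change_point, noise=0.1, seed=0):
    """Hypothetical 1-D environment on [0, 1]; its reward peak jumps once."""
    rng = random.Random(seed)

    def mean_reward(t, x):
        # Abrupt change: the best action moves from 0.2 to 0.8 at change_point.
        peak = 0.2 if t < change_point else 0.8
        return 1.0 - (x - peak) ** 2

    def pull(t, x):
        # The agent only ever observes a noisy reward.
        return mean_reward(t, x) + rng.gauss(0.0, noise)

    return mean_reward, pull


def cumulative_regret(strategy, horizon, change_point):
    """Sum over time of (best achievable mean reward - mean reward obtained)."""
    mean_reward, pull = make_env(change_point)
    history = []
    regret = 0.0
    for t in range(horizon):
        x = strategy(t, history)   # the agent picks an action
        y = pull(t, x)             # the environment returns a noisy reward
        history.append((x, y))
        best = 1.0                 # mean reward at the current peak
        regret += best - mean_reward(t, x)
    return regret


def fixed_strategy(t, history):
    """Always plays the action that is optimal before the change."""
    return 0.2


if __name__ == "__main__":
    print(cumulative_regret(fixed_strategy, horizon=100, change_point=50))
```

In this toy setup, a strategy that ignores the change incurs no regret before the change point and a constant regret at every step after it, so its cumulative regret grows linearly. A strategy with sub-linear cumulative regret must therefore detect or adapt to such changes.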

URI

https://oa.tib.eu/renate/handle/123456789/9336
https://doi.org/10.34657/8374

Collections

Mathematik
WIAS Preprints

License

This document may be downloaded, read, stored and printed for your own use within the limits of § 53 UrhG but it may not be distributed via the internet or passed on to external parties.
