Abstract
For the prediction with expert advice setting, we consider methods to construct algorithms that have low adaptive regret. The adaptive regret of an algorithm on a time interval [t1, t2] is the loss of the algorithm minus the loss of the best expert over that interval. Adaptive regret measures how well the algorithm approximates the best expert locally, and so is different from, although closely related to, both the classical regret, measured over an initial time interval [1, t], and the tracking regret, where the algorithm is compared to a good sequence of experts over [1, t]. We investigate two existing intuitive methods for deriving algorithms with low adaptive regret, one based on specialist experts and the other based on restarts. Quite surprisingly, we show that both methods lead to the same algorithm, namely Fixed Share, which is known for its tracking regret. We provide a thorough analysis of the adaptive regret of Fixed Share. We obtain the exact worst-case adaptive regret for Fixed Share, from which the classical tracking bounds follow. We prove that Fixed Share is optimal for adaptive regret: the worst-case adaptive regret of any algorithm is at least that of an instance of Fixed Share.
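The Fixed Share update itself is simple to state. Below is a minimal Python (NumPy) sketch of one common form of the algorithm; the learning rate `eta`, switching rate `alpha`, and the loss-averaging prediction rule are illustrative assumptions, not the exact setup or tuning from the paper.

```python
import numpy as np

def fixed_share(expert_losses, eta=0.5, alpha=0.05):
    """Run a Fixed Share forecaster over a matrix of expert losses.

    expert_losses: array of shape (T, N) giving the loss of each of the
    N experts at each of the T rounds. eta (learning rate) and alpha
    (switching rate) are illustrative defaults, not values prescribed
    by the paper. Returns the per-round loss of the algorithm, taken
    here as the weighted average of expert losses (a standard choice
    for convex losses).
    """
    T, N = expert_losses.shape
    weights = np.full(N, 1.0 / N)          # start from the uniform prior
    algo_losses = np.empty(T)
    for t in range(T):
        # Predict by averaging the experts' losses under the current weights.
        algo_losses[t] = weights @ expert_losses[t]
        # Exponential-weights loss update.
        v = weights * np.exp(-eta * expert_losses[t])
        v /= v.sum()
        # Fixed Share step: redistribute a fraction alpha of the mass
        # uniformly, so no expert's weight ever collapses to zero.
        weights = (1.0 - alpha) * v + alpha / N
    return algo_losses
```

The uniform sharing step is what allows the algorithm to "pick up" an expert that starts performing well partway through, which is the mechanism behind both the tracking and the adaptive regret guarantees discussed in the abstract.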
| Original language | English |
| --- | --- |
| Pages (from-to) | 1-21 |
| Number of pages | 21 |
| Journal | Journal of Machine Learning Research |
| Volume | 17 |
| Issue number | 23 |
| Publication status | Published - Apr 2016 |