Online combinatorial optimization with stochastic decision sets and adversarial losses
Indexed inarxivdatacite
Abstract
Most work on sequential learning assumes a fixed set of actions that are available all the time. However, in practice, actions can consist of picking subsets of readings from sensors that may break from time to time, road segments that can be blocked or goods that are out of stock. In this paper we study learning algorithms that are able to deal with stochastic availability of such unreliable composite actions. We propose and analyze algorithms based on the Follow-The-Perturbed-Leader prediction method for several learning settings differing in the feedback provided to the learner. Our algorithms rely on a novel loss estimation technique that we call Counting Asleep Times. We deliver regret bounds for our…
Citation impact
19
total citations
- FWCI
- —
- Percentile
- —
- References
- 17
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Regret
- Computer science
- Adversarial system
- Set (abstract data type)
- Mathematical optimization
- Point (geometry)
- Artificial intelligence
- Machine learning
UN Sustainable Development Goals
- Peace, Justice and strong institutions
No related works found for this paper.