Pure exploration and regret minimization in matching bandits
Sentenac, F., Yi, J., Calauzènes, print., Perchet, V. & Vojnovic, M.
(2021).
Pure exploration and regret minimization in matching bandits.
In
Proceedings of the 38th International Conference on Machine Learning
(pp. 9434-9442).
Journal of Machine Learning Research.
Finding an optimal matching in a weighted graph is a standard combinatorial problem. We consider its semi-bandit version where either a pair or a full matching is sampled sequentially. We prove that it is possible to leverage a rank-1 assumption on the adjacency matrix to reduce the sample complexity and the regret of off-the-shelf algorithms up to reaching a linear dependency in the number of vertices (up to poly log terms).
| Item Type | Chapter |
|---|---|
| Copyright holders | © 2022 The Author(s). |
| Departments | LSE > Academic Departments > Statistics |
| Date Deposited | 30 Sep 2022 |
| Acceptance Date | 16 May 2021 |
| URI | https://researchonline.lse.ac.uk/id/eprint/116713 |
Explore Further
- https://www.lse.ac.uk/Statistics/People/Professor-Milan-Vojnovic (Author)
- https://www.scopus.com/pages/publications/85136075476 (Scopus publication)
- https://proceedings.mlr.press/v139/sentenac21a.htm... (Official URL)
ORCID: https://orcid.org/0000-0003-1382-022X