Sam O�Neill
An Online Learning Approach to a Multi-player N-armed Functional Bandit
O�Neill, Sam; Bagdasar, Ovidiu; Liotta, Antonio
Authors
Ovidiu Bagdasar
Antonio Liotta
Abstract
Congestion games possess the property of emitting at least one pure Nash equilibrium and have a rich history of practical use in transport modelling. In this paper we approach the problem of modelling equilibrium within congestion games using a decentralised multi-player probabilistic approach via stochastic bandit feedback. Restricting the strategies available to players under the assumption of bounded rationality, we explore an online multiplayer exponential weights algorithm for unweighted atomic routing games and compare this with a ϵ-greedy algorithm.
Citation
O’Neill, S., Bagdasar, O., & Liotta, A. (2020). An Online Learning Approach to a Multi-player N-armed Functional Bandit. In Numerical Computations: Theory and Algorithms (438-445). https://doi.org/10.1007/978-3-030-40616-5_41
Presentation Conference Type | Conference Paper (Published) |
---|---|
Conference Name | NUMTA: International Conference on Numerical Computations: Theory and Algorithms |
Start Date | Jun 15, 2019 |
End Date | Jun 21, 2019 |
Online Publication Date | Feb 14, 2020 |
Publication Date | 2020 |
Deposit Date | Apr 6, 2020 |
Publisher | Springer |
Pages | 438-445 |
Series Title | Lecture Notes in Computer Science |
Series Number | 11974 |
Series ISSN | 0302-9743 |
Book Title | Numerical Computations: Theory and Algorithms |
ISBN | 9783030406158 |
DOI | https://doi.org/10.1007/978-3-030-40616-5_41 |
Keywords | Congestion games, Online learning, Multi-armed bandit |
Public URL | http://researchrepository.napier.ac.uk/Output/2560144 |
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search