Abstract
Bounding the price of anarchy, which quantifies the damage to social welfare due to selfish behavior of the participants, has been an important area of research in algorithmic game theory. Classical work on such bounds in repeated games makes the strong assumption that the subsequent rounds of the repeated games are independent beyond any influence on play from past history. This work studies such bounds in environments that themselves change due to the actions of the agents. Concretely, we consider this problem in discrete-time queuing systems, where competitive queues try to get their packets served. In this model, a queue gets to send a packet each step to one of the servers, which will attempt to serve the oldest arriving packet, and unprocessed packets are returned to each queue. We model this as a repeated game where queues compete for the capacity of the servers, but where the state of the game evolves as the length of each queue varies. We analyze this queuing system from multiple perspectives: as a baseline measure, we first establish precise conditions on the queuing arrival rates and service capacities that ensure all packets clear efficiently under centralized coordination. We then show that if queues strategically choose servers according to independent and stationary distributions, the system remains stable provided it would be stable under coordination with arrival rates scaled up by a factor of just \(\frac{e}{e-1} \). Finally, we extend these results to no-regret learning dynamics: if queues use learning algorithms satisfying the no-regret property to choose servers, then the requisite factor increases to two, and both of these bounds are tight. Both of these results require new probabilistic techniques compared to the classical price of anarchy literature, and show that in such settings, no-regret learning can exhibit efficiency loss due to myopia.
- Elliot Anshelevich, Anirban Dasgupta, Jon M. Kleinberg, Éva Tardos, Tom Wexler, and Tim Roughgarden. 2008. The Price of Stability for Network Design with Fair Cost Allocation. SIAM J. Comput. 38, 4 (2008), 1602–1623. https://doi.org/10.1137/070680096Google ScholarDigital Library
- Elliot Anshelevich, Anirban Dasgupta, Éva Tardos, and Tom Wexler. 2008. Near-Optimal Network Design with Selfish Agents. Theory Comput. 4, 1 (2008), 77–109. https://doi.org/10.4086/toc.2008.v004a004Google ScholarCross Ref
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. 2002. The Nonstochastic Multiarmed Bandit Problem. SIAM J. Comput. 32, 1 (2002), 48–77. https://doi.org/10.1137/S0097539701398375Google ScholarDigital Library
- Maria-Florina Balcan, Avrim Blum, and Yishay Mansour. 2013. Circumventing the Price of Anarchy: Leading Dynamics to Good Behavior. SIAM J. Comput. 42, 1 (2013), 230–264. https://doi.org/10.1137/110821317Google ScholarDigital Library
- Lucas Baudin, Marco Scarsini, and Xavier Venel. 2023. Strategic Behavior and No-Regret Learning in Queueing Systems. CoRR abs/2302.03614(2023). https://doi.org/10.48550/arXiv.2302.03614 arXiv:2302.03614Google Scholar
- P. Billingsley. 2008. Probability and Measure, 3rd Edition. Wiley.Google Scholar
- Avrim Blum, MohammadTaghi Hajiaghayi, Katrina Ligett, and Aaron Roth. 2008. Regret minimization and the price of total anarchy. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing. ACM, 373–382. https://doi.org/10.1145/1374376.1374430Google ScholarDigital Library
- Christian Borgs, Jennifer T. Chayes, Nicole Immorlica, Kamal Jain, Omid Etesami, and Mohammad Mahdian. 2007. Dynamics of bid optimization in online advertisement auctions. In Proceedings of the 16th International Conference on World Wide Web, WWW 2007, Banff, Alberta, Canada, May 8-12, 2007, Carey L. Williamson, Mary Ellen Zurko, Peter F. Patel-Schneider, and Prashant J. Shenoy (Eds.). ACM, 531–540. https://doi.org/10.1145/1242572.1242644Google ScholarDigital Library
- Allan Borodin, Jon M. Kleinberg, Prabhakar Raghavan, Madhu Sudan, and David P. Williamson. 2001. Adversarial queuing theory. J. ACM 48, 1 (2001), 13–38. https://doi.org/10.1145/363647.363659Google ScholarDigital Library
- G. W. Brown. 1951. Iterative Solutions of Games by Fictitious Play. In Activity Analysis of Production and Allocation. Wiley, 374–376.Google Scholar
- Vincent Conitzer, Christian Kroer, Debmalya Panigrahi, Okke Schrijvers, Eric Sodomka, Nicolás E. Stier Moses, and Chris Wilkens. 2019. Pacing Equilibrium in First-Price Auction Markets. In Proceedings of the 2019 ACM Conference on Economics and Computation, EC 2019, Phoenix, AZ, USA, June 24-28, 2019, Anna Karlin, Nicole Immorlica, and Ramesh Johari (Eds.). ACM, 587. https://doi.org/10.1145/3328526.3329600Google ScholarDigital Library
- Vincent Conitzer, Christian Kroer, Eric Sodomka, and Nicolás E. Stier Moses. 2018. Multiplicative Pacing Equilibria in Auction Markets. In Web and Internet Economics - 14th International Conference, WINE 2018, Oxford, UK, December 15-17, 2018, Proceedings(Lecture Notes in Computer Science, Vol. 11316), George Christodoulou and Tobias Harks (Eds.). Springer, 443. https://link.springer.com/content/pdf/bbm%3A978-3-030-04612-5%2F1.pdfGoogle Scholar
- Constantinos Daskalakis, Paul W. Goldberg, and Christos H. Papadimitriou. 2009. The Complexity of Computing a Nash Equilibrium. SIAM J. Comput. 39, 1 (2009), 195–259. https://doi.org/10.1137/070699652Google ScholarDigital Library
- Ofer Dekel, Ambuj Tewari, and Raman Arora. 2012. Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret. In Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26 - July 1, 2012. icml.cc / Omnipress. http://icml.cc/2012/papers/749.pdfGoogle Scholar
- Rick Durrett. 2019. Probability: Theory and Examples. Vol. 49. Cambridge University Press.Google ScholarCross Ref
- Jerzy Filar and Koos Vrieze. 2012. Competitive Markov Decision Processes. Springer Science & Business Media.Google Scholar
- Daniel Freund, Thodoris Lykouris, and Wentao Weng. 2022. Efficient decentralized multi-agent learning in asymmetric queuing systems. In Conference on Learning Theory, 2-5 July 2022, London, UK(Proceedings of Machine Learning Research, Vol. 178), Po-Ling Loh and Maxim Raginsky (Eds.). PMLR, 4080–4084. https://proceedings.mlr.press/v178/freund22a.htmlGoogle Scholar
- Hu Fu, Qun Hu, and Jia’nan Lin. 2022. Stability of Decentralized Queueing Networks Beyond Complete Bipartite Cases. In Web and Internet Economics - 18th International Conference, WINE 2022, Troy, NY, USA, December 12-15, 2022, Proceedings(Lecture Notes in Computer Science, Vol. 13778), Kristoffer Arnsfelt Hansen, Tracy Xiao Liu, and Azarakhsh Malekian (Eds.). Springer, 96–114. https://doi.org/10.1007/978-3-031-22832-2_6Google ScholarDigital Library
- Drew Fudenberg and David Levine. 1998. The theory of learning in games. MIT Press.Google Scholar
- S. Hart and A. M. Colell. 2000. A Simple Adaptive Procedure Leading to Correlated Equilibrium. Econometrica 68, 5 (2000), 1127–1150.Google ScholarCross Ref
- Refael Hassin. 2020. Rational Queueing. CRC Press. https://books.google.com/books?id=M3G1zQEACAAJGoogle Scholar
- Refael Hassin and Moshe Haviv. 2003. To Queue or Not to Queue: Equilibrium Behavior in Queueing Systems. Springer US. 02034096 https://books.google.com/books?id=K5KhJSZkhHQCGoogle Scholar
- Ramesh Johari and John N. Tsitsiklis. 2004. Efficiency Loss in a Network Resource Allocation Game. Math. Oper. Res. 29, 3 (2004), 407–435. https://doi.org/10.1287/moor.1040.0091Google ScholarDigital Library
- Elias Koutsoupias and Christos H. Papadimitriou. 1999. Worst-case Equilibria. In STACS 99, 16th Annual Symposium on Theoretical Aspects of Computer Science. Springer, 404–413. https://doi.org/10.1007/3-540-49116-3_38Google Scholar
- Subhashini Krishnasamy, Rajat Sen, Ramesh Johari, and Sanjay Shakkottai. 2016. Regret of Queueing Bandits. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain. 1669–1677. http://papers.nips.cc/paper/6370-regret-of-queueing-banditsGoogle Scholar
- Thodoris Lykouris, Vasilis Syrgkanis, and Éva Tardos. 2016. Learning and Efficiency in Games with Dynamic Population. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016. SIAM, 120–129. https://doi.org/10.1137/1.9781611974331.ch9Google ScholarCross Ref
- Albert W Marshall, Ingram Olkin, and Barry C Arnold. 1979. Inequalities: Theory of Majorization and Its Applications. Vol. 143. Springer.Google Scholar
- Abraham Neyman and Sylvain Sorin. 2003. Stochastic Games and Applications. Vol. 570. Springer Science & Business Media.Google Scholar
- Robin Pemantle and Jeffrey S Rosenthal. 1999. Moment conditions for a sequence with negative drift to be uniformly bounded in Lr. Stochastic Processes and their Applications 82, 1 (1999), 143–155.Google Scholar
- Julia Robinson. 1851. An Iterative Method of Solving a Game. Annals of Mathematical Statistics 54 (1851), 296–301.Google ScholarCross Ref
- Tim Roughgarden. 2003. The price of anarchy is independent of the network topology. J. Comput. Syst. Sci. 67, 2 (2003), 341–364. https://doi.org/10.1016/S0022-0000(03)00044-8Google ScholarDigital Library
- Tim Roughgarden. 2015. Intrinsic Robustness of the Price of Anarchy. J. ACM 62, 5 (2015), 32:1–32:42. https://doi.org/10.1145/2806883Google ScholarDigital Library
- Tim Roughgarden. 2016. Twenty Lectures on Algorithmic Game Theory(1st ed.). Cambridge University Press, USA.Google Scholar
- Tim Roughgarden and Éva Tardos. 2002. How bad is selfish routing?J. ACM 49, 2 (2002), 236–259. https://doi.org/10.1145/506147.506153Google ScholarDigital Library
- Aviad Rubinstein. 2018. Inapproximability of Nash Equilibrium. SIAM J. Comput. 47, 3 (2018), 917–959. https://doi.org/10.1137/15M1039274Google ScholarCross Ref
- Alexander Schrijver. 2003. Combinatorial Optimization: Polyhedra and Efficiency. Vol. 24. Springer Science & Business Media.Google Scholar
- Flore Sentenac, Etienne Boursier, and Vianney Perchet. 2021. Decentralized Learning in Online Queuing Systems. In Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, Marc’Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, and Jennifer Wortman Vaughan (Eds.). 18501–18512. https://proceedings.neurips.cc/paper/2021/hash/99ef04eb612baf0e86671a5109e22154-Abstract.htmlGoogle Scholar
- Shai Shalev-Shwartz and Shai Ben-David. 2014. Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press.Google ScholarDigital Library
- John F Shortle, James M Thompson, Donald Gross, and Carl M Harris. 2018. Fundamentals of Queueing Theory. Wiley.Google Scholar
- Vasilis Syrgkanis and Éva Tardos. 2013. Composable and efficient mechanisms. In Symposium on Theory of Computing Conference, STOC’13, Palo Alto, CA, USA, June 1-4, 2013, Dan Boneh, Tim Roughgarden, and Joan Feigenbaum (Eds.). ACM, 211–220. https://doi.org/10.1145/2488608.2488635Google ScholarDigital Library
- Neil Walton and Kuang Xu. [n. d.]. Learning and Information in Stochastic Networks and Queues. Chapter 6, 161–198. https://doi.org/10.1287/educ.2021.0235 arXiv:https://pubsonline.informs.org/doi/pdf/10.1287/educ.2021.0235Google Scholar
- Carsten Witt. 2014. Fitness levels with tail bounds for the analysis of randomized search heuristics. Inf. Process. Lett. 114, 1-2 (2014), 38–41. https://doi.org/10.1016/j.ipl.2013.09.013Google ScholarCross Ref
Index Terms
The Price of Anarchy of Strategic Queuing Systems
Comments