research-article
Free Access

The Price of Anarchy of Strategic Queuing Systems

Just Accepted

Published:17 March 2023Publication History
Skip Abstract Section

Abstract

Bounding the price of anarchy, which quantifies the damage to social welfare due to selfish behavior of the participants, has been an important area of research in algorithmic game theory. Classical work on such bounds in repeated games makes the strong assumption that the subsequent rounds of the repeated games are independent beyond any influence on play from past history. This work studies such bounds in environments that themselves change due to the actions of the agents. Concretely, we consider this problem in discrete-time queuing systems, where competitive queues try to get their packets served. In this model, a queue gets to send a packet each step to one of the servers, which will attempt to serve the oldest arriving packet, and unprocessed packets are returned to each queue. We model this as a repeated game where queues compete for the capacity of the servers, but where the state of the game evolves as the length of each queue varies. We analyze this queuing system from multiple perspectives: as a baseline measure, we first establish precise conditions on the queuing arrival rates and service capacities that ensure all packets clear efficiently under centralized coordination. We then show that if queues strategically choose servers according to independent and stationary distributions, the system remains stable provided it would be stable under coordination with arrival rates scaled up by a factor of just \(\frac{e}{e-1} \). Finally, we extend these results to no-regret learning dynamics: if queues use learning algorithms satisfying the no-regret property to choose servers, then the requisite factor increases to two, and both of these bounds are tight. Both of these results require new probabilistic techniques compared to the classical price of anarchy literature, and show that in such settings, no-regret learning can exhibit efficiency loss due to myopia.

References

  1. Elliot Anshelevich, Anirban Dasgupta, Jon M. Kleinberg, Éva Tardos, Tom Wexler, and Tim Roughgarden. 2008. The Price of Stability for Network Design with Fair Cost Allocation. SIAM J. Comput. 38, 4 (2008), 1602–1623. https://doi.org/10.1137/070680096Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Elliot Anshelevich, Anirban Dasgupta, Éva Tardos, and Tom Wexler. 2008. Near-Optimal Network Design with Selfish Agents. Theory Comput. 4, 1 (2008), 77–109. https://doi.org/10.4086/toc.2008.v004a004Google ScholarGoogle ScholarCross RefCross Ref
  3. Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. 2002. The Nonstochastic Multiarmed Bandit Problem. SIAM J. Comput. 32, 1 (2002), 48–77. https://doi.org/10.1137/S0097539701398375Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Maria-Florina Balcan, Avrim Blum, and Yishay Mansour. 2013. Circumventing the Price of Anarchy: Leading Dynamics to Good Behavior. SIAM J. Comput. 42, 1 (2013), 230–264. https://doi.org/10.1137/110821317Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Lucas Baudin, Marco Scarsini, and Xavier Venel. 2023. Strategic Behavior and No-Regret Learning in Queueing Systems. CoRR abs/2302.03614(2023). https://doi.org/10.48550/arXiv.2302.03614 arXiv:2302.03614Google ScholarGoogle Scholar
  6. P. Billingsley. 2008. Probability and Measure, 3rd Edition. Wiley.Google ScholarGoogle Scholar
  7. Avrim Blum, MohammadTaghi Hajiaghayi, Katrina Ligett, and Aaron Roth. 2008. Regret minimization and the price of total anarchy. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing. ACM, 373–382. https://doi.org/10.1145/1374376.1374430Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Christian Borgs, Jennifer T. Chayes, Nicole Immorlica, Kamal Jain, Omid Etesami, and Mohammad Mahdian. 2007. Dynamics of bid optimization in online advertisement auctions. In Proceedings of the 16th International Conference on World Wide Web, WWW 2007, Banff, Alberta, Canada, May 8-12, 2007, Carey L. Williamson, Mary Ellen Zurko, Peter F. Patel-Schneider, and Prashant J. Shenoy (Eds.). ACM, 531–540. https://doi.org/10.1145/1242572.1242644Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Allan Borodin, Jon M. Kleinberg, Prabhakar Raghavan, Madhu Sudan, and David P. Williamson. 2001. Adversarial queuing theory. J. ACM 48, 1 (2001), 13–38. https://doi.org/10.1145/363647.363659Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. W. Brown. 1951. Iterative Solutions of Games by Fictitious Play. In Activity Analysis of Production and Allocation. Wiley, 374–376.Google ScholarGoogle Scholar
  11. Vincent Conitzer, Christian Kroer, Debmalya Panigrahi, Okke Schrijvers, Eric Sodomka, Nicolás E. Stier Moses, and Chris Wilkens. 2019. Pacing Equilibrium in First-Price Auction Markets. In Proceedings of the 2019 ACM Conference on Economics and Computation, EC 2019, Phoenix, AZ, USA, June 24-28, 2019, Anna Karlin, Nicole Immorlica, and Ramesh Johari (Eds.). ACM, 587. https://doi.org/10.1145/3328526.3329600Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Vincent Conitzer, Christian Kroer, Eric Sodomka, and Nicolás E. Stier Moses. 2018. Multiplicative Pacing Equilibria in Auction Markets. In Web and Internet Economics - 14th International Conference, WINE 2018, Oxford, UK, December 15-17, 2018, Proceedings(Lecture Notes in Computer Science, Vol.  11316), George Christodoulou and Tobias Harks (Eds.). Springer, 443. https://link.springer.com/content/pdf/bbm%3A978-3-030-04612-5%2F1.pdfGoogle ScholarGoogle Scholar
  13. Constantinos Daskalakis, Paul W. Goldberg, and Christos H. Papadimitriou. 2009. The Complexity of Computing a Nash Equilibrium. SIAM J. Comput. 39, 1 (2009), 195–259. https://doi.org/10.1137/070699652Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ofer Dekel, Ambuj Tewari, and Raman Arora. 2012. Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret. In Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26 - July 1, 2012. icml.cc / Omnipress. http://icml.cc/2012/papers/749.pdfGoogle ScholarGoogle Scholar
  15. Rick Durrett. 2019. Probability: Theory and Examples. Vol.  49. Cambridge University Press.Google ScholarGoogle ScholarCross RefCross Ref
  16. Jerzy Filar and Koos Vrieze. 2012. Competitive Markov Decision Processes. Springer Science & Business Media.Google ScholarGoogle Scholar
  17. Daniel Freund, Thodoris Lykouris, and Wentao Weng. 2022. Efficient decentralized multi-agent learning in asymmetric queuing systems. In Conference on Learning Theory, 2-5 July 2022, London, UK(Proceedings of Machine Learning Research, Vol.  178), Po-Ling Loh and Maxim Raginsky (Eds.). PMLR, 4080–4084. https://proceedings.mlr.press/v178/freund22a.htmlGoogle ScholarGoogle Scholar
  18. Hu Fu, Qun Hu, and Jia’nan Lin. 2022. Stability of Decentralized Queueing Networks Beyond Complete Bipartite Cases. In Web and Internet Economics - 18th International Conference, WINE 2022, Troy, NY, USA, December 12-15, 2022, Proceedings(Lecture Notes in Computer Science, Vol.  13778), Kristoffer Arnsfelt Hansen, Tracy Xiao Liu, and Azarakhsh Malekian (Eds.). Springer, 96–114. https://doi.org/10.1007/978-3-031-22832-2_6Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Drew Fudenberg and David Levine. 1998. The theory of learning in games. MIT Press.Google ScholarGoogle Scholar
  20. S. Hart and A. M. Colell. 2000. A Simple Adaptive Procedure Leading to Correlated Equilibrium. Econometrica 68, 5 (2000), 1127–1150.Google ScholarGoogle ScholarCross RefCross Ref
  21. Refael Hassin. 2020. Rational Queueing. CRC Press. https://books.google.com/books?id=M3G1zQEACAAJGoogle ScholarGoogle Scholar
  22. Refael Hassin and Moshe Haviv. 2003. To Queue or Not to Queue: Equilibrium Behavior in Queueing Systems. Springer US. 02034096 https://books.google.com/books?id=K5KhJSZkhHQCGoogle ScholarGoogle Scholar
  23. Ramesh Johari and John N. Tsitsiklis. 2004. Efficiency Loss in a Network Resource Allocation Game. Math. Oper. Res. 29, 3 (2004), 407–435. https://doi.org/10.1287/moor.1040.0091Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Elias Koutsoupias and Christos H. Papadimitriou. 1999. Worst-case Equilibria. In STACS 99, 16th Annual Symposium on Theoretical Aspects of Computer Science. Springer, 404–413. https://doi.org/10.1007/3-540-49116-3_38Google ScholarGoogle Scholar
  25. Subhashini Krishnasamy, Rajat Sen, Ramesh Johari, and Sanjay Shakkottai. 2016. Regret of Queueing Bandits. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain. 1669–1677. http://papers.nips.cc/paper/6370-regret-of-queueing-banditsGoogle ScholarGoogle Scholar
  26. Thodoris Lykouris, Vasilis Syrgkanis, and Éva Tardos. 2016. Learning and Efficiency in Games with Dynamic Population. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016. SIAM, 120–129. https://doi.org/10.1137/1.9781611974331.ch9Google ScholarGoogle ScholarCross RefCross Ref
  27. Albert W Marshall, Ingram Olkin, and Barry C Arnold. 1979. Inequalities: Theory of Majorization and Its Applications. Vol.  143. Springer.Google ScholarGoogle Scholar
  28. Abraham Neyman and Sylvain Sorin. 2003. Stochastic Games and Applications. Vol.  570. Springer Science & Business Media.Google ScholarGoogle Scholar
  29. Robin Pemantle and Jeffrey S Rosenthal. 1999. Moment conditions for a sequence with negative drift to be uniformly bounded in Lr. Stochastic Processes and their Applications 82, 1 (1999), 143–155.Google ScholarGoogle Scholar
  30. Julia Robinson. 1851. An Iterative Method of Solving a Game. Annals of Mathematical Statistics 54 (1851), 296–301.Google ScholarGoogle ScholarCross RefCross Ref
  31. Tim Roughgarden. 2003. The price of anarchy is independent of the network topology. J. Comput. Syst. Sci. 67, 2 (2003), 341–364. https://doi.org/10.1016/S0022-0000(03)00044-8Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Tim Roughgarden. 2015. Intrinsic Robustness of the Price of Anarchy. J. ACM 62, 5 (2015), 32:1–32:42. https://doi.org/10.1145/2806883Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Tim Roughgarden. 2016. Twenty Lectures on Algorithmic Game Theory(1st ed.). Cambridge University Press, USA.Google ScholarGoogle Scholar
  34. Tim Roughgarden and Éva Tardos. 2002. How bad is selfish routing?J. ACM 49, 2 (2002), 236–259. https://doi.org/10.1145/506147.506153Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Aviad Rubinstein. 2018. Inapproximability of Nash Equilibrium. SIAM J. Comput. 47, 3 (2018), 917–959. https://doi.org/10.1137/15M1039274Google ScholarGoogle ScholarCross RefCross Ref
  36. Alexander Schrijver. 2003. Combinatorial Optimization: Polyhedra and Efficiency. Vol.  24. Springer Science & Business Media.Google ScholarGoogle Scholar
  37. Flore Sentenac, Etienne Boursier, and Vianney Perchet. 2021. Decentralized Learning in Online Queuing Systems. In Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, Marc’Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, and Jennifer Wortman Vaughan (Eds.). 18501–18512. https://proceedings.neurips.cc/paper/2021/hash/99ef04eb612baf0e86671a5109e22154-Abstract.htmlGoogle ScholarGoogle Scholar
  38. Shai Shalev-Shwartz and Shai Ben-David. 2014. Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. John F Shortle, James M Thompson, Donald Gross, and Carl M Harris. 2018. Fundamentals of Queueing Theory. Wiley.Google ScholarGoogle Scholar
  40. Vasilis Syrgkanis and Éva Tardos. 2013. Composable and efficient mechanisms. In Symposium on Theory of Computing Conference, STOC’13, Palo Alto, CA, USA, June 1-4, 2013, Dan Boneh, Tim Roughgarden, and Joan Feigenbaum (Eds.). ACM, 211–220. https://doi.org/10.1145/2488608.2488635Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Neil Walton and Kuang Xu. [n. d.]. Learning and Information in Stochastic Networks and Queues. Chapter 6, 161–198. https://doi.org/10.1287/educ.2021.0235 arXiv:https://pubsonline.informs.org/doi/pdf/10.1287/educ.2021.0235Google ScholarGoogle Scholar
  42. Carsten Witt. 2014. Fitness levels with tail bounds for the analysis of randomized search heuristics. Inf. Process. Lett. 114, 1-2 (2014), 38–41. https://doi.org/10.1016/j.ipl.2013.09.013Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. The Price of Anarchy of Strategic Queuing Systems

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Journal of the ACM
        Journal of the ACM Just Accepted
        ISSN:0004-5411
        EISSN:1557-735X
        Table of Contents

        Copyright © 2023 Copyright held by the owner/author(s). Publication rights licensed to ACM.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 March 2023
        • Accepted: 12 January 2023
        • Revised: 11 October 2022
        • Received: 15 December 2021

        Check for updates

        Qualifiers

        • research-article
      • Article Metrics

        • Downloads (Last 12 months)76
        • Downloads (Last 6 weeks)76

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!