10.1145/3578245.3585427acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
invited-talk

Using Reinforcement Learning to Control Auto-Scaling of Distributed Applications

Published:15 April 2023Publication History

ABSTRACT

Modern distributed systems can benefit from the availability of large-scale and heterogeneous computing infrastructures. However, the complexity and dynamic nature of these environments also call for self-adaptation abilities, as guaranteeing efficient resource usage and acceptable service levels through static configurations is very difficult.

In this talk, we discuss a hierarchical auto-scaling approach for distributed applications, where application-level managers steer the overall process by supervising component-level adaptation managers. Following a bottom-up approach, we first discuss how to exploit model-free and model-based reinforcement learning to compute auto-scaling policies for each component. Then, we show how Bayesian optimization can be used to automatically configure the lower-level auto-scalers based on application-level objectives. As a case study, we consider distributed data stream processing applications, which process high-volume data flows in near real-time and cope with varying and unpredictable workloads.

References

  1. Yahya Al-Dhuraibi, Fawaz Paraiso, Nabil Djarallah, and Philippe Merle. 2018. Elasticity in Cloud Computing: State of the Art and Research Challenges., Vol. 11, 2 (2018), 430--447. https://doi.org/10.1109/TSC.2017.2711009Google ScholarGoogle Scholar
  2. Valeria Cardellini, Francesco Lo Presti, Matteo Nardelli, and Gabriele Russo Russo. 2022. Run-Time Adaptation of Data Stream Processing Systems: The State of the Art. ACM Comput. Surv., Vol. 54 (2022), 36 pages.Issue 11s. https://doi.org/10.1145/3514496Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Marios Fragkoulis, Paris Carbone, Vasiliki Kalavri, and Asterios Katsifodimos. 2020. A Survey on the Evolution of Stream Processing Systems. CoRR, Vol. abs/2008.00842 (2020). arxiv: 2008.00842 https://arxiv.org/abs/2008.00842Google ScholarGoogle Scholar
  4. Peter I. Frazier. 2018. A Tutorial on Bayesian Optimization. Vol. abs/1807.02811. arxiv: 1807.02811 http://arxiv.org/abs/1807.02811Google ScholarGoogle Scholar
  5. Omid Gheibi, Danny Weyns, and Federico Quin. 2021. Applying Machine Learning in Self-Adaptive Systems: A Systematic Literature Review. ACM Transactions on Autonomous and Adaptive Systems, Vol. 15, 3, Article 9 (2021), 37 pages. https://doi.org/10.1145/3469440Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Thomas Heinze, Leonardo Aniello, Leonardo Querzoni, and Zbigniew Jerzak. 2014. Cloud-based Data Stream Processing. In Proc. of 8th ACM Int'l Conf. on Distributed Event-Based Systems, DEBS '14. 238--245. https://doi.org/10.1145/2611286.2611309Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, et al. 2015. Human-Level Control Through Deep Reinforcement Learning. Nat., Vol. 518, 7540 (2015), 529--533. https://doi.org/10.1038/nature14236Google ScholarGoogle ScholarCross RefCross Ref
  8. Henriette Rö ger and Ruben Mayer. 2019. A Comprehensive Survey on Parallelization and Elasticity in Stream Processing. ACM Comput. Surv., Vol. 52, 2 (2019), 36:1--36:37. https://doi.org/10.1145/3303849Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Gabriele Russo Russo, Valeria Cardellini, and Francesco Lo Presti. 2019. Reinforcement Learning Based Policies for Elastic Stream Processing on Heterogeneous Resources. In Proc. of 13th ACM Int'l Conf. on Distributed and Event-based Systems, DEBS '19. 31--42. https://doi.org/10.1145/3328905.3329506Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction 2 ed.). MIT Press, Cambridge, MA, USA.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Using Reinforcement Learning to Control Auto-Scaling of Distributed Applications

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0

        Other Metrics

      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!